Scsi Errors.
Justin Bennett
justin.bennett at dynabrade.com
Wed Sep 24 10:03:18 EDT 2003
Its a new box, It's less than one week old, Its got 6 drives, 3 logical
(mirrored) 2 scsi raid (adaptec 2120s) controllers. I put this in last
wednesnight, so the drives are all but new.. I did have a defective spot
found on one of the drives since we put it live. Adaptec has a raid
manager command line program and it gives you drive details.
AAC0> disk list
Executing: disk list
C:ID:L Device Type Blocks Bytes/Block Usage Shared Rate
------ -------------- --------- ----------- ---------------- ------ ----
0:00:0 Disk 71687372 512 Initialized NO 320
0:01:0 Disk 71687372 512 Initialized NO 320
0:02:0 Disk 143374744 512 Initialized NO 320
0:03:0 Disk 143374744 512 Initialized NO 320
AAC0> disk show defects (0:3:0)
Executing: disk show defects (BUS=0,ID=3,LUN=0)
Number of PRIMARY defects on drive: 921
Number of GROWN defects on drive: 1
Defect 1 at cylinder 928, head 1, sector 855
So at some point (possibly lat night) it found a defect on that drive,
all the others the grown defects are 0.
Mark Musone wrote:
>This is more likely than not a slowly failing drive...i'd get all the
>data off that drive and get a new one..
>
>It could also be a failing controller, but since it seemed to be
>pointing to one scsi id, it's hard to say..plus you said it was raid but
>I don’t mknow any more detailed info about the scsi setup
>
>-Mark
>
>
>-----Original Message-----
>From: owner-nflug at nflug.org [mailto:owner-nflug at nflug.org] On Behalf Of
>Justin Bennett
>Sent: Wednesday, September 24, 2003 8:23 AM
>To: nflug at nflug.org
>Subject: Scsi Errors.
>
>I'm using an adaptec 2120 Raid controller. Last night about 4:00 AM I
>got these errors: As you can see this is basically a NAS box sharing
>files via NFS. you can see the failed mount attempts at the bottom.
>After a reboot and FSCK the machine came backup and all the data appears
>
>ok. Ideas?
>
>
>Sep 24 04:02:47 nas kernel: aacraid:ID(0:03:0) Abort Time-out. Resetting
>
>bus.
>Sep 24 04:05:12 nas kernel: ies exhausted)
>Sep 24 04:05:12 nas kernel: aacraid: Host adapter reset request. SCSI
>hang ?
>Sep 24 04:05:12 nas kernel: aacraid:ID(0:03:0) Abort Time-out. Resetting
>
>bus.
>Sep 24 04:05:12 nas kernel: aacraid:SCSI bus reset issued on channel 0
>Sep 24 04:05:12 nas kernel: aacraid: Host adapter reset request. SCSI
>hang ?
>Sep 24 04:05:12 nas kernel: aacraid:ID(0:03:0) Abort Time-out. Resetting
>
>bus.
>Sep 24 04:05:12 nas kernel: aacraid:SCSI bus reset issued on channel 0
>Sep 24 04:05:12 nas kernel: scsi: device set offline - command error
>recover fai
>led: host 0 channel 0 id 1 lun 0
>Sep 24 04:05:12 nas kernel: SCSI disk error : host 0 channel 0 id 1 lun
>0 return
> code = 6000000
>Sep 24 04:05:12 nas kernel: I/O error: dev 08:13, sector 524288
>Sep 24 04:05:12 nas kernel: SCSI disk error : host 0 channel 0 id 1 lun
>0 return
> code = 6000000
>Sep 24 04:05:12 nas kernel: I/O error: dev 08:13, sector 0
>Sep 24 04:05:12 nas rpc.mountd: authenticated mount request from
>cheetah.dynabra
>de.com:686 for /export/home/username (/export/home)
>Sep 24 04:05:12 nas kernel: I/O error: dev 08:13, sector 8
>Sep 24 04:05:12 nas rpc.mountd: authenticated mount request from
>cheetah.dynabra
>de.com:686 for /export/home/username (/export/home)
>Sep 24 04:05:12 nas kernel: SCSI disk error : host 0 channel 0 id 1 lun
>0 return
> code = 6000000
>
>
>
--
Justin Bennett
Network Administrator
RHCE (Redhat Certified Linux Engineer)
Dynabrade, Inc.
8989 Sheridan Dr.
Clarence, NY 14031
More information about the nflug
mailing list