Scsi Errors.

Justin Bennett justin.bennett at dynabrade.com
Wed Sep 24 10:03:18 EDT 2003


It's a new box, less than one week old. It's got 6 drives, 3 logical 
(mirrored) drives, and 2 SCSI RAID controllers (Adaptec 2120S). I put this 
in last Wednesday night, so the drives are all but new. One defective spot 
has been found on one of the drives since we put it live. Adaptec has a 
RAID manager command-line program that gives you drive details.

AAC0> disk list
Executing: disk list

C:ID:L  Device Type     Blocks    Bytes/Block Usage            Shared Rate
------  --------------  --------- ----------- ---------------- ------ ----
0:00:0   Disk            71687372  512         Initialized      NO     320
0:01:0   Disk            71687372  512         Initialized      NO     320
0:02:0   Disk            143374744 512         Initialized      NO     320
0:03:0   Disk            143374744 512         Initialized      NO     320

AAC0> disk show defects (0:3:0)
Executing: disk show defects (BUS=0,ID=3,LUN=0)

Number of PRIMARY defects on drive: 921

Number of GROWN defects on drive: 1
Defect 1 at cylinder 928, head 1, sector 855



So at some point (possibly last night) it found a defect on that drive; 
on all the others the grown defect count is 0.
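
For anyone who wants to keep an eye on this over time, here is a rough Python sketch that pulls the drive sizes and defect counts out of the CLI text shown above. It only parses text pasted in as strings; how the output gets captured from the RAID manager is left open, and the function names (disk_sizes_gb, defect_counts) are just made up for illustration.

```python
import re

# Sample text copied from the CLI session above. In practice this would be
# captured from the RAID manager's output (capture method is not shown here).
DISK_LIST = """\
0:00:0   Disk            71687372  512         Initialized      NO     320
0:01:0   Disk            71687372  512         Initialized      NO     320
0:02:0   Disk            143374744 512         Initialized      NO     320
0:03:0   Disk            143374744 512         Initialized      NO     320
"""

DEFECTS = """\
Number of PRIMARY defects on drive: 921

Number of GROWN defects on drive: 1
Defect 1 at cylinder 928, head 1, sector 855
"""


def disk_sizes_gb(text):
    """Return {device: size in decimal GB} from 'disk list' rows."""
    sizes = {}
    for line in text.splitlines():
        # Columns used: C:ID:L, device type, block count, bytes per block.
        m = re.match(r"\s*(\d+:\d+:\d+)\s+Disk\s+(\d+)\s+(\d+)", line)
        if m:
            blocks = int(m.group(2))
            bytes_per_block = int(m.group(3))
            sizes[m.group(1)] = blocks * bytes_per_block / 1e9
    return sizes


def defect_counts(text):
    """Return {'PRIMARY': n, 'GROWN': n} from 'disk show defects' output."""
    pattern = r"Number of (PRIMARY|GROWN) defects on drive:\s*(\d+)"
    return {kind: int(count) for kind, count in re.findall(pattern, text)}


if __name__ == "__main__":
    for dev, gb in disk_sizes_gb(DISK_LIST).items():
        print(f"{dev}: {gb:.1f} GB")
    counts = defect_counts(DEFECTS)
    if counts.get("GROWN", 0) > 0:
        print("grown defects present:", counts["GROWN"])
```

The primary count is the factory defect list, so the number worth watching is the grown one; a drive whose grown count keeps climbing between checks would be the one to replace.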




Mark Musone wrote:

>This is more likely than not a slowly failing drive. I'd get all the
>data off that drive and get a new one.
>
>It could also be a failing controller, but since it seemed to be
>pointing to one SCSI ID, it's hard to say. Plus, you said it was RAID, but
>I don't know any more detailed info about the SCSI setup.
>
>-Mark
>
>
>-----Original Message-----
>From: owner-nflug at nflug.org [mailto:owner-nflug at nflug.org] On Behalf Of
>Justin Bennett
>Sent: Wednesday, September 24, 2003 8:23 AM
>To: nflug at nflug.org
>Subject: Scsi Errors.
>
>I'm using an Adaptec 2120 RAID controller. Last night about 4:00 AM I 
>got the errors below. As you can see, this is basically a NAS box sharing 
>files via NFS; you can see the failed mount attempts at the bottom. 
>After a reboot and fsck the machine came back up and all the data appears
>OK. Ideas?
>
>
>Sep 24 04:02:47 nas kernel: aacraid:ID(0:03:0) Abort Time-out. Resetting bus.
>Sep 24 04:05:12 nas kernel: ies exhausted)
>Sep 24 04:05:12 nas kernel: aacraid: Host adapter reset request. SCSI hang ?
>Sep 24 04:05:12 nas kernel: aacraid:ID(0:03:0) Abort Time-out. Resetting bus.
>Sep 24 04:05:12 nas kernel: aacraid:SCSI bus reset issued on channel 0
>Sep 24 04:05:12 nas kernel: aacraid: Host adapter reset request. SCSI hang ?
>Sep 24 04:05:12 nas kernel: aacraid:ID(0:03:0) Abort Time-out. Resetting bus.
>Sep 24 04:05:12 nas kernel: aacraid:SCSI bus reset issued on channel 0
>Sep 24 04:05:12 nas kernel: scsi: device set offline - command error recover failed: host 0 channel 0 id 1 lun 0
>Sep 24 04:05:12 nas kernel: SCSI disk error : host 0 channel 0 id 1 lun 0 return code = 6000000
>Sep 24 04:05:12 nas kernel:  I/O error: dev 08:13, sector 524288
>Sep 24 04:05:12 nas kernel: SCSI disk error : host 0 channel 0 id 1 lun 0 return code = 6000000
>Sep 24 04:05:12 nas kernel:  I/O error: dev 08:13, sector 0
>Sep 24 04:05:12 nas rpc.mountd: authenticated mount request from cheetah.dynabrade.com:686 for /export/home/username (/export/home)
>Sep 24 04:05:12 nas kernel:  I/O error: dev 08:13, sector 8
>Sep 24 04:05:12 nas rpc.mountd: authenticated mount request from cheetah.dynabrade.com:686 for /export/home/username (/export/home)
>Sep 24 04:05:12 nas kernel: SCSI disk error : host 0 channel 0 id 1 lun 0 return code = 6000000
>
>  
>
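
A note on reading the quoted log: the "return code = 6000000" and "dev 08:13" fields can be picked apart with a few lines of Python. This is only a sketch under assumptions that I have not checked against this exact kernel: that the return code is printed in hex and follows the Linux SCSI mid-layer layout (status, message, host, and driver bytes from low to high), that the driver-byte names match the 2.4-era scsi.h, and that the dev field is major:minor in hex with the usual 16 minors per sd disk. The function names are made up for illustration.

```python
# Driver-byte names as defined in 2.4-era Linux scsi.h (assumed to apply here).
DRIVER_BYTE = {
    0x00: "DRIVER_OK", 0x01: "DRIVER_BUSY", 0x02: "DRIVER_SOFT",
    0x03: "DRIVER_MEDIA", 0x04: "DRIVER_ERROR", 0x05: "DRIVER_INVALID",
    0x06: "DRIVER_TIMEOUT", 0x07: "DRIVER_HARD", 0x08: "DRIVER_SENSE",
}


def decode_return_code(hex_text):
    """Split a hex SCSI result word into its four byte fields."""
    result = int(hex_text, 16)
    return {
        "status": result & 0xFF,
        "msg": (result >> 8) & 0xFF,
        "host": (result >> 16) & 0xFF,
        "driver": (result >> 24) & 0xFF,
    }


def decode_dev(dev_text):
    """'08:13' -> (major, minor), assuming both fields are hex."""
    major, minor = (int(part, 16) for part in dev_text.split(":"))
    return major, minor


def sd_name(major, minor):
    """Map SCSI disk major 8 to the conventional sdXN name (16 minors per disk)."""
    if major != 8:
        return None  # only the first sd major is handled in this sketch
    disk, part = divmod(minor, 16)
    name = "sd" + chr(ord("a") + disk)
    return name + (str(part) if part else "")


if __name__ == "__main__":
    fields = decode_return_code("6000000")
    print(fields, DRIVER_BYTE.get(fields["driver"], "unknown"))
    print(sd_name(*decode_dev("08:13")))
```

If those assumptions hold, the driver byte is 0x06 (a command timeout rather than a media error reported by the disk), and dev 08:13 is the third partition on the second SCSI disk the kernel sees.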

-- 
Justin Bennett
Network Administrator
RHCE (Redhat Certified Linux Engineer)
Dynabrade, Inc.
8989 Sheridan Dr.
Clarence, NY 14031
 




