I messed up my array and I don't know what to do

Status
Not open for further replies.

Tired_

Dabbler
Joined
Feb 17, 2013
Messages
29
Hi. I have a 6 drive RAID-Z2 array, which has been working fine for several months. A few days ago, I logged in and saw that the array was degraded, and that one particular drive had had errors. The drive was only 4 months old, so I figured it must be broken, so I shut down the FreeNAS computer, removed that drive, put it in another computer and ran SeaTools on it to diagnose it. After quite a few hours, SeaTools said everything was cool, so I decided to plug the drive back in to FreeNAS and fire it up. I booted to the USB stick, like always, but it hung up on 'mountd', and wouldn't continue to boot. I left it overnight, and it stayed stuck on that item. At this point I started to panic, and downloaded a copy of the latest stable FreeNAS, wrote it to another USB drive, and tried to boot off of that. It booted, but when I tried to Auto-Import the RAID-Z2, it sits on Please Wait for about 20 minutes then says "An error has occured". After this, I can (sort of) access the array from the command line..."zpool status" lists it, which it didn't before, but it lists it with two unavailable drives. Below are the results of some commands I've noticed people ask for in similar situations to mine, all taken after the Auto-Import errored out:

Code:
[root@freenas] ~# zpool status -v
  pool: MAIN
 state: DEGRADED
status: One or more devices are faulted in response to IO failures.
action: Make sure the affected devices are connected, then run 'zpool clear'.
   see: http://www.sun.com/msg/ZFS-8000-HC
  scan: scrub repaired 144M in 6h32m with 0 errors on Fri Feb 15 20:02:12 2013
config:

        NAME                                            STATE     READ WRITE CKS                     UM
        MAIN                                            DEGRADED     0     0                          1
          raidz2-0                                      DEGRADED     0     0                          6
            gptid/3e54ea70-1e5d-11e2-b155-00261889a36e  ONLINE       0     0                          0
            952549345672949843                          UNAVAIL      0     0                          0  was /dev/gptid/3f04f9e9-1e5d-11e2-b155-00261889a36e
            13589355565508211589                        UNAVAIL      0     0                          0  was /dev/gptid/85232dd5-77b3-11e2-9414-00261889a36e
            gptid/feb27aa0-3f54-11e2-8fb8-00261889a36e  ONLINE       0     0                          0
            gptid/40ca2b7b-1e5d-11e2-b155-00261889a36e  ONLINE       0     0                          0
            gptid/0c4307e2-21f7-11e2-95fd-00261889a36e  ONLINE       0     0                          0

errors: Permanent errors have been detected in the following files:

        <metadata>:<0x1d>


Code:
[root@freenas] ~# gpart show
=>     63  3935169  da1  MBR  (1.9G)
       63  1930257    1  freebsd  [active]  (942M)
  1930320       63       - free -  (31k)
  1930383  1930257    2  freebsd  (942M)
  3860640     3024    3  freebsd  (1.5M)
  3863664    41328    4  freebsd  (20M)
  3904992    30240       - free -  (14M)

=>        34  5860533101  ada0  GPT  (2.7T)
          34          94        - free -  (47k)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  5856338696     2  freebsd-zfs  (2.7T)
  5860533128           7        - free -  (3.5k)

Segmentation fault


Code:
[root@freenas] ~# glabel status
                                      Name  Status  Components
                             ufs/FreeNASs3     N/A  da1s3
                             ufs/FreeNASs4     N/A  da1s4
gptid/3e54ea70-1e5d-11e2-b155-00261889a36e     N/A  ada0p2
gptid/feb27aa0-3f54-11e2-8fb8-00261889a36e     N/A  ada3p2
gptid/40ca2b7b-1e5d-11e2-b155-00261889a36e     N/A  ada4p2
gptid/0c4307e2-21f7-11e2-95fd-00261889a36e     N/A  ada5p2
                            ufs/FreeNASs1a     N/A  da1s1a


Code:
[root@freenas] ~# camcontrol devlist
<ST3000DM001-9YN166 CC4H>          at scbus1 target 0 lun 0 (ada0,pass0)
<ST2000DM001-9YN164 CC4B>          at scbus2 target 0 lun 0 (pass1,ada1)
<ST2000DM001-9YN164 CC4B>          at scbus3 target 0 lun 0 (ada2,pass2)
<ST3000DM001-1CH166 CC24>          at scbus4 target 0 lun 0 (ada3,pass3)
<WDC WD15EADS-00P8B0 01.00A01>     at scbus5 target 0 lun 0 (ada4,pass4)
<WDC WD30EZRX-00MMMB0 80.00A80>    at scbus6 target 0 lun 0 (ada5,pass5)
<ASMT 2105 0>                      at scbus7 target 0 lun 0 (pass6,da0)
<Mass Storage Device \001\000\000?>  at scbus8 target 0 lun 0 (pass7,da1)


With two drives unavailable, I'm terrified that I will do something else wrong and lose my data. Where can I go from here?
 

ProtoSD

MVP
Joined
Jul 1, 2011
Messages
3,348
My first question is, what the hell happened during the "gpart show" that caused a segmentation fault?

The next thing that seems strange that may not be important, the output from camcontrol. The last column pass1,ada1 is swapped around, I don't know if that is significant.

Do you know which drive you pulled out, which one is it in the camcontrol list?

I'm just thinking out loud. Stay calm and don't do anything without confirming your idea with someone here.

If anyone else wants to jump in, I don't have any clear ideas at the moment.
 

Tired_

Dabbler
Joined
Feb 17, 2013
Messages
29
I have no idea what caused the segmentation fault, but it happens every time I type that command (gpart show).

The drive that I pulled was
Code:
<ST2000DM001-9YN164 CC4B>          at scbus3 target 0 lun 0 (ada2,pass2)
 

ProtoSD

MVP
Joined
Jul 1, 2011
Messages
3,348
I have no idea what caused the segmentation fault, but it happens every time I type that command (gpart show).

That's a little concerning. I would like to see if the disk you removed is causing that. Can you power down, disconnect the cables for that drive, power back up and do another gpart show?
 

ProtoSD

MVP
Joined
Jul 1, 2011
Messages
3,348
Actually, if you can boot back up with the spare flash drive you created (If you didn't restore your config to it), it would probably be a little better if it wasn't trying to mount your pool when it reboots until we can get some answers.
 

paleoN

Wizard
Joined
Apr 22, 2012
Messages
1,403
My first question is, what the hell happened during the "gpart show" that cause a segmentation fault?
First time I've seen that.

What are the hardware specs and how are the drives connected. It looks like the ada1, controller or both are misbehaving.

This will probably work except the last one:
Code:
gpart show ada2
gpart show ada3
gpart show ada4
gpart show ada5

gpart show ada1


See if SMART can see anything:
Code:
smartctl -q noserial -a /dev/ada1


Are you sure you pulled the correct drive originally? Maybe you're just lucky.

After the above I would shutdown and carefully re-seat the drives, possibly change the cable for ada1.
 

paleoN

Wizard
Joined
Apr 22, 2012
Messages
1,403
That's a little concerning. I would like to see if the disk you removed is causing that.
His glabel status failed to return anything for ada1 as well.

Actually, if you can boot back up with the spare flash drive you created (If you didn't restore your config to it), it would probably be a little better if it wasn't trying to mount your pool when it reboots until we can get some answers.
+1
 

Tired_

Dabbler
Joined
Feb 17, 2013
Messages
29
The segmentation fault happened after Auto-Import (from the GUI) had failed. I rebooted (to the spare flash drive I created, without unplugging the ada1 drive), then tried gpart show immediately, without trying to Auto-Import first...it gave me this:

Code:
[root@freenas] ~# gpart show
=>     63  3935169  da1  MBR  (1.9G)
       63  1930257    1  freebsd  [active]  (942M)
  1930320       63       - free -  (31k)
  1930383  1930257    2  freebsd  (942M)
  3860640     3024    3  freebsd  (1.5M)
  3863664    41328    4  freebsd  (20M)
  3904992    30240       - free -  (14M)

=>        34  5860533101  ada0  GPT  (2.7T)
          34          94        - free -  (47k)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  5856338696     2  freebsd-zfs  (2.7T)
  5860533128           7        - free -  (3.5k)

=>        34  3907029101  ada1  GPT  (1.8T)
          34          94        - free -  (47k)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  3902834696     2  freebsd-zfs  (1.8T)
  3907029128           7        - free -  (3.5k)

=>        34  5860533101  ada3  GPT  (2.7T)
          34          94        - free -  (47k)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  5856338696     2  freebsd-zfs  (2.7T)
  5860533128           7        - free -  (3.5k)

=>        34  2930277101  ada4  GPT  (1.4T)
          34          94        - free -  (47k)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  2926082703     2  freebsd-zfs  (1.4T)

=>      0  1930257  da1s1  BSD  (942M)
        0       16         - free -  (8.0k)
       16  1930241      1  !0  (942M)



No segmentation fault...I haven't tried it with the drive pulled, I will do that now.
 

Tired_

Dabbler
Joined
Feb 17, 2013
Messages
29
The system has a Pentium D processor, an Asus P5Q SE PLUS motherboard, and 4GB of DDR2 RAM. The drives are connected to the motherboard via SATA, using the six ports that come on the motherboard.
 

ProtoSD

MVP
Joined
Jul 1, 2011
Messages
3,348
Interesting, gpart show is only showing 4 of your 6 disks. I'd check the cables like PaleoN said. You could have bumped/loosened them without realizing it.
 

paleoN

Wizard
Joined
Apr 22, 2012
Messages
1,403
Interesting, gpart show is only showing 4 of your 6 disks. I'd check the cables like PaleoN said. You could have bumped/loosened them without realizing it.
Good catch. Didn't notice ada5 missing there.

If it's not the cables then you would appear to have a more significant hardware problem.
 

Tired_

Dabbler
Joined
Feb 17, 2013
Messages
29
I swapped the cable with the drive that I had pulled (luckily, I have like 30 brand-new SATA cables!), and restarted, now gpart show says:

Code:
[root@freenas] ~# gpart show
=>     63  3935169  da1  MBR  (1.9G)
       63  1930257    1  freebsd  [active]  (942M)
  1930320       63       - free -  (31k)
  1930383  1930257    2  freebsd  (942M)
  3860640     3024    3  freebsd  (1.5M)
  3863664    41328    4  freebsd  (20M)
  3904992    30240       - free -  (14M)

=>        34  5860533101  ada0  GPT  (2.7T)
          34          94        - free -  (47k)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  5856338696     2  freebsd-zfs  (2.7T)
  5860533128           7        - free -  (3.5k)

=>        34  3907029101  ada1  GPT  (1.8T)
          34          94        - free -  (47k)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  3902834696     2  freebsd-zfs  (1.8T)
  3907029128           7        - free -  (3.5k)

=>        34  5860533101  ada3  GPT  (2.7T)
          34          94        - free -  (47k)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  5856338696     2  freebsd-zfs  (2.7T)
  5860533128           7        - free -  (3.5k)

=>        34  2930277101  ada4  GPT  (1.4T)
          34          94        - free -  (47k)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  2926082703     2  freebsd-zfs  (1.4T)

=>        34  5860533101  ada5  GPT  (2.7T)
          34          94        - free -  (47k)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  5856338696     2  freebsd-zfs  (2.7T)
  5860533128           7        - free -  (3.5k)

=>      0  1930257  da1s1  BSD  (942M)
        0       16         - free -  (8.0k)
       16  1930241      1  !0  (942M)


That's five drives now, not six (but not four!). Should I try to auto-import again?
 

Tired_

Dabbler
Joined
Feb 17, 2013
Messages
29
Here's the SMART information.

Code:
[root@freenas] ~# smartctl -q noserial -a /dev/ada1
smartctl 5.43 2012-06-30 r3573 [FreeBSD 8.3-RELEASE-p5 amd64] (local build)
Copyright (C) 2002-12 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda (SATA 3Gb/s, 4K Sectors)
Device Model:     ST2000DM001-9YN164
Firmware Version: CC4B
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Sun Feb 17 18:14:44 2013 PST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
See vendor-specific Attribute list for marginal Attributes.

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (  609) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 259) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x3085) SCT Status supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   117   099   006    Pre-fail  Always       -       149414216
  3 Spin_Up_Time            0x0003   097   092   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       358
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   068   060   030    Pre-fail  Always       -       6612133
  9 Power_On_Hours          0x0032   097   097   000    Old_age   Always       -       2752
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       44
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   058   044   045    Old_age   Always   In_the_past 42 (0 102 42 41 0)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       46
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       1277
194 Temperature_Celsius     0x0022   042   056   000    Old_age   Always       -       42 (0 19 0 0 0)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       30107720747288
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       41868940385406
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       91007179873939

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


I also tried gpart show for each individual disk. They all showed up fine except for ada2, which said "gpart: No such geom: ada2."

Trying to anticipate, I ran smartctl on /dev/ada2 (which isn't showing up anywhere else). It says this:

Code:
[root@freenas] ~# smartctl -q noserial -a /dev/ada2
smartctl 5.43 2012-06-30 r3573 [FreeBSD 8.3-RELEASE-p5 amd64] (local build)
Copyright (C) 2002-12 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda (SATA 3Gb/s, 4K Sectors)
Device Model:     ST2000DM001-9YN164
Firmware Version: CC4B
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Sun Feb 17 18:18:09 2013 PST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
See vendor-specific Attribute list for marginal Attributes.

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (  584) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 223) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x3085) SCT Status supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   111   099   006    Pre-fail  Always       -       37801032
  3 Spin_Up_Time            0x0003   095   094   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       199
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   068   060   030    Pre-fail  Always       -       7058748
  9 Power_On_Hours          0x0032   098   098   000    Old_age   Always       -       2459
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       58
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   065   042   045    Old_age   Always   In_the_past 35 (0 181 35 34 0)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       39
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       623
194 Temperature_Celsius     0x0022   035   058   000    Old_age   Always       -       35 (0 21 0 0 0)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       233637630970069
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       42117724833803
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       137480934656173

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%      2454         -
# 2  Short offline       Completed without error       00%      2450         -
# 3  Short offline       Completed without error       00%      2450         -
# 4  Short offline       Completed without error       00%      2445         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
 

Tired_

Dabbler
Joined
Feb 17, 2013
Messages
29
Code:
[root@freenas] ~# camcontrol devlist
<ST3000DM001-9YN166 CC4H>          at scbus1 target 0 lun 0 (ada0,pass0)
<ST2000DM001-9YN164 CC4B>          at scbus2 target 0 lun 0 (ada1,pass1)
<ST2000DM001-9YN164 CC4B>          at scbus3 target 0 lun 0 (ada2,pass2)
<ST3000DM001-1CH166 CC24>          at scbus4 target 0 lun 0 (ada3,pass3)
<WDC WD15EADS-00P8B0 01.00A01>     at scbus5 target 0 lun 0 (ada4,pass4)
<WDC WD30EZRX-00MMMB0 80.00A80>    at scbus6 target 0 lun 0 (ada5,pass5)
<ASMT 2105 0>                      at scbus7 target 0 lun 0 (pass6,da0)
<Mass Storage Device \001\000\000?>  at scbus8 target 0 lun 0 (pass7,da1)
 

paleoN

Wizard
Joined
Apr 22, 2012
Messages
1,403
That's five drives now, not six (but not four!). Should I try to auto-import again?
I'm assuming you wiped ada2 during SeaTools. If not you still have a problem.

STOP, and fix your temperature problem. At least two, most likely more, drives exceeded their maximum operating temperatures in the past. This can damage the drives.
 

ProtoSD

MVP
Joined
Jul 1, 2011
Messages
3,348
I'm thinking the Seatools scan was destructive and wiped the partition info. I think that disk is going to need to be resilvered. I'm not sure the best way at the moment, what do you think PaleoN?
 

Tired_

Dabbler
Joined
Feb 17, 2013
Messages
29
I have the side of the case taken off. Two of the drives (ada 1 and 2) were adjacent to one another in the case, they aren't any longer; every drive has at least one empty space between them. That will have to do for temperature remediation till I can go get more case fans or something. Does the STOP mean my array is gone for good?
 

paleoN

Wizard
Joined
Apr 22, 2012
Messages
1,403
Does the STOP mean my array is gone for good?
No, it means if this was my array I would:
Code:
shutdown -p now
Which powers off the machine. I would leave it off until I had airflow directly over the drives.

If ada1 is OK, which it appears to be from the SMART output, then the array is likely in better shape than it first appeared.


I think that disk is going to need to be resilvered. I'm not sure the best way at the moment, what do you think PaleoN?
A resilver causes a lot of seeks and nothing generates heat like a lot of seeking.
 
Status
Not open for further replies.
Top