Disco rovinato?

ciccio

Dabbler
Joined
Jan 15, 2019
Messages
27
facendo uno zpool status -v mi trovo questo sul disco di avvio:



pool: freenas-boot
state: DEGRADED
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: http://illumos.org/msg/ZFS-8000-8A
scan: scrub repaired 13K in 0 days 00:00:04 with 60 errors on Wed Apr 10 03:45:04 2019
config:

NAME STATE READ WRITE CKSUM
freenas-boot DEGRADED 0 0 0
ada3p2 DEGRADED 0 0 0 too many errors

errors: Permanent errors have been detected in the following files:

<metadata>:<0x45>
<metadata>:<0x47>
<metadata>:<0x4e>
<metadata>:<0x50>
<metadata>:<0x52>
<metadata>:<0x56>
<metadata>:<0x57>
<metadata>:<0x59>
<metadata>:<0x65>
<metadata>:<0x77>
<metadata>:<0x78>
<metadata>:<0x82>
<metadata>:<0x85>
<metadata>:<0x8b>
<metadata>:<0x8d>
freenas-boot/ROOT/default:<0x0>
//usr/local/share/smartmontools
freenas-boot/ROOT/default@2019-04-02-12:48:05:<0x0>
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/contenttypes/locale/it
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/contenttypes/locale/km
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/python3.6/site-packages/django/contrib/admindocs/locale/io/LC_MESSAGES
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/contenttypes/locale/kn/LC_MESSAGES
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/contenttypes/locale/ko
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/gis/admin
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/contenttypes/locale/ko/LC_MESSAGES
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/python3.6/site-packages/django/contrib/gis/geos/__pycache__/factory.cpython-36.opt-1.pyc
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/auth/locale/te/LC_MESSAGES/django.mo
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/share/sssd/locale/hu
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/share/sssd/locale/id/LC_MESSAGES
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/python3.6/site-packages/asyncssh/__pycache__/saslprep.cpython-36.opt-1.pyc
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/share/sssd/locale/nb/LC_MESSAGES
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/www/dojo/dojo/cldr/nls/el
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/gis/tests/inspectapp
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/gis/tests/layermap/__pycache__
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/gis/db/backends/spatialite
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/gis/db/backends/spatialite/__pycache__
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/share/syslog-ng/include/scl/loggly
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/www/dojo/dojo/cldr/nls/da/currency.js.map
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/share/syslog-ng/include/scl/rewrite
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/www/dojo/dojo/cldr/nls/da/hebrew.js
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/python3.6/site-packages/aiohttp/__pycache__/frozenlist.cpython-36.opt-1.pyc
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/python3.6/site-packages/aiohttp/__pycache__/http_exceptions.cpython-36.opt-1.pyc
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/contenttypes/locale/he
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/share/locale/zh_CN.GB18030/LC_COLLATE
freenas-boot/ROOT/default@2019-04-02-12:48:05:/usr/local/lib/migrate93/django/contrib/contenttypes/locale/id
freenas-boot/ROOT/Initial-Install:<0x0>



Premetto che neanche due settimane fa si era bloccato e ho dovutor reinstallare (e riconfigurare tutto).

Come faccio a testare il disco di avvio per capire se è danneggiato o è solo una questione di files?
 

glauco

Guru
Joined
Jan 30, 2017
Messages
526
Ma cos'è, una chiavetta USB?
 

glauco

Guru
Joined
Jan 30, 2017
Messages
526
smartctl -x /dev/ada3 trova degli errori?
 

ciccio

Dabbler
Joined
Jan 15, 2019
Messages
27
smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.2-STABLE amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model: SanDisk SSD PLUS 240GB
Serial Number: 1837E2806890
LU WWN Device Id: 5 001b44 8b98681d2
Firmware Version: UF4500RL
User Capacity: 240,065,183,744 bytes [240 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: Solid State Device
Form Factor: 2.5 inches
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: ACS-2 T13/2015-D revision 3
SATA Version is: SATA 3.2, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is: Thu Apr 11 00:47:56 2019 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is: Unavailable
APM feature is: Disabled
Rd look-ahead is: Enabled
Write cache is: Enabled
DSN feature is: Unavailable
ATA Security is: Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Unavailable

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: Disabled.
Self-test execution status: ( 32) The self-test routine was interrupted
by the host with a hard or soft reset.
Total time to complete Offline
data collection: ( 120) seconds.
Offline data collection
capabilities: (0x15) SMART execute Offline immediate.
No Auto Offline data collection support.
Abort Offline collection upon new
command.
No Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
No Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 42) minutes.

SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE
5 Reallocated_Sector_Ct -O--CK 100 100 000 - 0
9 Power_On_Hours -O--CK 100 100 000 - 1220
12 Power_Cycle_Count -O--CK 100 100 000 - 48
165 Unknown_Attribute -O--CK 100 100 000 - 8
166 Unknown_Attribute -O--CK 100 100 --- - 0
167 Unknown_Attribute -O--CK 100 100 --- - 0
168 Unknown_Attribute -O--CK 100 100 --- - 2
169 Unknown_Attribute -O--CK 100 100 --- - 193
170 Unknown_Attribute -O--CK 100 100 --- - 0
171 Unknown_Attribute -O--CK 100 100 000 - 0
172 Unknown_Attribute -O--CK 100 100 000 - 0
173 Unknown_Attribute -O--CK 100 100 000 - 0
174 Unknown_Attribute -O--CK 100 100 000 - 8
184 End-to-End_Error -O--CK 100 100 --- - 0
187 Reported_Uncorrect -O--CK 100 100 000 - 0
188 Command_Timeout -O--CK 100 100 --- - 0
194 Temperature_Celsius -O---K 067 045 000 - 33 (Min/Max 14/45)
199 UDMA_CRC_Error_Count -O--CK 100 100 --- - 0
230 Unknown_SSD_Attribute -O--CK 100 100 000 - 8589934594
232 Available_Reservd_Space PO--CK 100 100 005 - 100
233 Media_Wearout_Indicator -O--CK 100 100 --- - 15
234 Unknown_Attribute -O--CK 100 100 000 - 58
241 Total_LBAs_Written ----CK 100 100 000 - 22
242 Total_LBAs_Read ----CK 100 100 000 - 44
244 Unknown_Attribute -O--CK 000 100 --- - 0
||||||_ K auto-keep
|||||__ C event count
||||___ R error rate
|||____ S speed/performance
||_____ O updated online
|______ P prefailure warning

General Purpose Log Directory Version 1
SMART Log Directory Version 1 [multi-sector log support]
Address Access R/W Size Description
0x00 GPL,SL R/O 1 Log Directory
0x01 SL R/O 1 Summary SMART error log
0x02 SL R/O 1 Comprehensive SMART error log
0x03 GPL R/O 16 Ext. Comprehensive SMART error log
0x04 GPL,SL R/O 8 Device Statistics log
0x06 SL R/O 1 SMART self-test log
0x07 GPL R/O 1 Extended self-test log
0x10 GPL R/O 1 NCQ Command Error log
0x11 GPL R/O 1 SATA Phy Event Counters log
0x30 GPL,SL R/O 9 IDENTIFY DEVICE data log
0x80-0x9f GPL,SL R/W 16 Host vendor specific log
0xa1 GPL,SL VS 1 Device vendor specific log
0xa2 GPL,SL VS 2 Device vendor specific log
0xa3 GPL,SL VS 1 Device vendor specific log
0xa7 GPL,SL VS 1 Device vendor specific log
0xa9 GPL,SL VS 3 Device vendor specific log

Warning! SMART Extended Comprehensive Error Log Structure error: invalid SMART checksum.
SMART Extended Comprehensive Error Log Version: 1 (16 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
No self-tests have been logged. [To run self-tests, use: smartctl -t]

Selective Self-tests/Logging not supported

SCT Commands not supported

Device Statistics (GP Log 0x04)
Page Offset Size Value Flags Description
0x01 ===== = = === == General Statistics (rev 1) ==
0x01 0x008 4 48 --- Lifetime Power-On Resets
0x01 0x010 4 1220 --- Power-on Hours
0x01 0x018 6 47903177 --- Logical Sectors Written
0x01 0x028 6 93858713 --- Logical Sectors Read
0x01 0x038 6 1220 --- Date and Time TimeStamp
0x05 ===== = = === == Temperature Statistics (rev 1) ==
0x05 0x008 1 33 --- Current Temperature
0x05 0x010 1 - --- Average Short Term Temperature
0x05 0x018 1 - --- Average Long Term Temperature
0x05 0x020 1 45 --- Highest Temperature
0x05 0x028 1 26 --- Lowest Temperature
0x05 0x030 1 35 --- Highest Average Short Term Temperature
0x05 0x038 1 35 --- Lowest Average Short Term Temperature
0x05 0x040 1 - --- Highest Average Long Term Temperature
0x05 0x048 1 - --- Lowest Average Long Term Temperature
0x05 0x050 4 0 --- Time in Over-Temperature
0x05 0x058 1 95 --- Specified Maximum Operating Temperature
0x05 0x060 4 0 --- Time in Under-Temperature
0x05 0x068 1 0 --- Specified Minimum Operating Temperature
0x07 ===== = = === == Solid State Device Statistics (rev 1) ==
0x07 0x008 1 0 N-- Percentage Used Endurance Indicator
|||_ C monitored condition met
||__ D supports DSN
|___ N normalized value

Pending Defects log (GP Log 0x0c) not supported

SATA Phy Event Counters (GP Log 0x11)
ID Size Value Description
0x0003 2 0 R_ERR response for device-to-host data FIS
0x0004 2 0 R_ERR response for host-to-device data FIS
0x0006 2 0 R_ERR response for device-to-host non-data FIS
0x0007 2 0 R_ERR response for host-to-device non-data FIS
0x0009 2 5 Transition from drive PhyRdy to drive PhyNRdy
0x000a 2 6 Device-to-host register FISes sent due to a COMRESET
0x000f 2 0 R_ERR response for host-to-device data FIS, CRC
0x0012 2 0 R_ERR response for host-to-device non-data FIS, CRC
0x0001 2 0 Command failed due to ICRC error
 

glauco

Guru
Joined
Jan 30, 2017
Messages
526
Dimenticavo... prima esegui un test breve:
smartctl /dev/ada3 -t short
Dopo circa 3 minuti, vedi se ha rilevato errori con
smartctl /dev/ada3 -l selftest
Se ha rilevato errori, se l'SSD è ancora in garanzia, vai sul sito del produttore e fai un RMA.
Se non ha rilevato errori, fai un test lungo:
smartctl /dev/ada3 -t long
e dopo svariate ore controlla con
smartctl /dev/ada3 -l selftest
 

glauco

Guru
Joined
Jan 30, 2017
Messages
526
Anche il long è completato senza errori?
 

ciccio

Dabbler
Joined
Jan 15, 2019
Messages
27
SI. ci ha messo 42' e ha dato tre righe con "completed without errors"
 

glauco

Guru
Joined
Jan 30, 2017
Messages
526
Non so che dirti...
Hai provato a dare zpool scrub freenas-boot e dopo qualche minuto zpool status freenas-boot?
 
Top