Smart: Difference between revisions
From DWIKI
Line 32: | Line 32: | ||
smartctl -a /dev/sda | smartctl -a /dev/sda | ||
== check smart status of disk in raid array == | |||
Something like | |||
smartctl -a /dev/bus/0 -d sat+megaraid,11 | |||
== check error log == | == check error log == |
Revision as of 08:57, 17 August 2022
Disk monitoring
Links
- Smartmontools wiki
- Wikipedia article
- Understanding SMART reports
- https://www.thomas-krenn.com/en/wiki/SMART_tests_with_smartctl
- Smartmontools with megaraid
- https://wiki.archlinux.org/index.php/S.M.A.R.T.
- smartmontools FAQ
- Smart hard drive stats
- Smart ASC error codes
Tools
- smartctl
- gsmartcontrol
Useful commands
enable smart
smartctl -i /dev/sda
check smart status
smartctl -a /dev/sda
check smart status of disk in raid array
Something like
smartctl -a /dev/bus/0 -d sat+megaraid,11
check error log
smartctl -l error /dev/sdb
Device: /dev/bus/0 [megaraid_disk_09] [SAT]
Try
smartctl --scan
Some codes and messages
LBA_of_first_error
Device is: Not in smartctl database [for details use: -P showall]
Try
/usr/sbin/update-smart-drivedb
otherwise check out https://www.smartmontools.org/wiki/FAQ#MyATASATAdriveisnotinthesmartctlsmartddatabase
Uncorrectable Sector Count
Check https://medium.com/@satyeshukumar/how-to-fix-uncorrectable-sector-count-warning-5a38c56d3faf
198 Offline Uncorrectable
bad sign, on ssd mostly/only when number gets high
SSD specific
- [https://unix.stackexchange.com/questions/106678/how-to-check-the-life-left-in-ssd-or-the-mediums-wear-level
- Life span of SSD
231 SSD_Life_Left
Percentage of life left
233 Media Wearout Indicator
241 Lifetime_Writes_GiB
Amount of data written so far
smartd: Failed SMART usage Attribute
Might be yelling about a disk that's already been replaced, try restarting smartd
FAQ
List disks
smartctl --scan
A mandatory SMART command failed: exiting. To continue, add one or more '-T permissive' options.
meh
Device: /dev/bus/0 [megaraid_disk_09] [SAT], failed to read SMART Attribute Data
Controller probably doesn't allow smart,
FAILURE PREDICTION THRESHOLD EXCEEDED [asc=5d, ascq=0]
time to replace disk?