-
安装smartmontools:
- Ubuntu/Debian:
sudo apt-get install smartmontools - CentOS/RHEL:
sudo yum install smartmontools
- Ubuntu/Debian:
-
查看设备列表:
lsblk或sudo fdisk -l确定硬盘路径(如 /dev/sda) -
基础健康检查:
sudo smartctl -H /dev/sda
关键输出:"SMART overall-health self-assessment test result: PASSED" -
完整SMART信息:
sudo smartctl -a /dev/sda
重点关注:- Temperature(温度)
- Reallocated_Sector_Ct(重映射扇区)
- Current_Pending_Sector(待映射扇区)
- Power_On_Hours(通电时长)
-
启用SMART监控(若未激活):
sudo smartctl -s on /dev/sda -
执行离线测试:
- 短测试:
sudo smartctl -t short /dev/sda - 长测试:
sudo smartctl -t long /dev/sda - 查看进度:
sudo smartctl -l selftest /dev/sda
- 短测试:
-
监控建议:
- 配置cron定时检查(示例每日扫描):
0 0 * * * /usr/sbin/smartctl -H /dev/sda | grep PASSED || echo "ALERT: Disk Failure" - 集成到Zabbix/Prometheus通过smartctl_exporter
- 配置cron定时检查(示例每日扫描):
-
NVMe硬盘专用命令:
sudo smartctl -a /dev/nvme0n1
注意:数值异常(如Reallocated_Sector_Ct > 0)需立即备份并更换硬盘