Debian: Munin - S.M.A.R.T.

S.M.A.R.T. monitoring must be enabled in the BIOS!

smartmontools

Installing smartmontools

apt-get install smartmontools

Setting up smartctl

SMART must now be enabled for every disk that is to be monitored. This is done with the -s on switch:

smartctl -s on /dev/sda
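
With several disks, the same call can be wrapped in a small loop. The following is only a sketch: the function name enable_smart and the SMARTCTL variable are made up for this example, and the device names in the usage line are placeholders for your own disks.

```shell
# SMARTCTL may be overridden, e.g. SMARTCTL="echo smartctl" for a dry run
# that only prints the commands instead of executing them.
SMARTCTL="${SMARTCTL:-smartctl}"

# Enable SMART on every device given as an argument.
# (hypothetical helper, not part of smartmontools)
enable_smart() {
    for disk in "$@"; do
        # $SMARTCTL is left unquoted on purpose so the dry-run value splits into words
        $SMARTCTL -s on "$disk" || echo "could not enable SMART on $disk" >&2
    done
}

# Usage (as root):
#   enable_smart /dev/sda /dev/sdb
```

The dry run is useful for checking the device list before touching the disks.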

A first test with…

smartctl -a /dev/sda

… should produce output like the following:

smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     SAMSUNG HD502IJ
Serial Number:    S13TJ1KQ303836
Firmware Version: 1AA01109
User Capacity:    500.107.862.016 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 3b
Local Time is:    Tue Mar  3 13:05:05 2009 CET

==> WARNING: May need -F samsung or -F samsung2 enabled; see manual for details.

SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)	Offline data collection activity
					was never started.
					Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		 (7608) seconds.
Offline data collection
capabilities: 			 (0x7b) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					command.
					Offline surface scan supported.
					Self-test supported.
					Conveyance Self-test supported.
					Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 ( 128) minutes.
Conveyance self-test routine
recommended polling time: 	 (  14) minutes.
SCT capabilities: 	       (0x003f)	SCT Status supported.
					SCT Feature Control supported.
					SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   100   100   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0007   086   086   011    Pre-fail  Always       -       5130
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       99
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   253   253   051    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0025   100   100   015    Pre-fail  Offline      -       9866
  9 Power_On_Hours          0x0032   099   099   000    Old_age   Always       -       3777
 10 Spin_Retry_Count        0x0033   100   100   051    Pre-fail  Always       -       0
 11 Calibration_Retry_Count 0x0012   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       99
 13 Read_Soft_Error_Rate    0x000e   100   100   000    Old_age   Always       -       0
183 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
184 Unknown_Attribute       0x0033   100   100   099    Pre-fail  Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   074   068   000    Old_age   Always       -       26 (Lifetime Min/Max 26/27)
194 Temperature_Celsius     0x0022   073   068   000    Old_age   Always       -       27 (Lifetime Min/Max 26/27)
195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       19952
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   100   100   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x000a   100   100   000    Old_age   Always       -       0
201 Soft_Read_Error_Rate    0x000a   253   253   000    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 0
Warning: ATA Specification requires self-test log structure revision number = 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%      3586         -
# 2  Short offline       Aborted by host               00%      3586         -
# 3  Extended offline    Aborted by host               90%      3586         -

SMART Selective Self-Test Log Data Structure Revision Number (0) should be 1
SMART Selective self-test log data structure revision number 0
Warning: ATA Specification requires selective self-test log data structure revision number = 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
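
For a quick check it is often enough to pull single values out of this output. A minimal sketch, assuming the column layout shown above (RAW_VALUE is the tenth field of an attribute line); the function names smart_health and smart_raw_value are invented for this example. smartctl -H prints only the health assessment and smartctl -A only the attribute table.

```shell
# Print the overall health verdict (e.g. PASSED) from smartctl output on stdin.
smart_health() {
    awk -F': ' '/overall-health self-assessment/ { print $2 }'
}

# Print the RAW_VALUE column of the attribute with the given ID.
# NF >= 10 skips shorter lines that merely start with the same number,
# such as the rows of the selective self-test table.
smart_raw_value() {
    awk -v id="$1" '$1 == id && NF >= 10 { print $10 }'
}

# Usage (as root):
#   smartctl -H /dev/sda | smart_health
#   smartctl -A /dev/sda | smart_raw_value 194   # temperature
#   smartctl -A /dev/sda | smart_raw_value 5     # reallocated sectors
```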

smartmontools includes a daemon (smartd) that can check the disks at configurable intervals. To enable it, adjust /etc/default/smartmontools.

Starting smartmontools
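
As a rough guide, the relevant lines in that file could look like the excerpt below. The variable names start_smartd and smartd_opts are how I remember the Debian package of that time; treat them as an assumption and compare them with the comments in your own file. --interval is a smartd option that sets the check interval in seconds.

```shell
# /etc/default/smartmontools (excerpt, assumed variable names)

# start the smartd daemon when the init script runs
start_smartd=yes

# extra options for smartd; here: check the disks every 1800 seconds
smartd_opts="--interval=1800"
```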

/etc/init.d/smartmontools start

Munin

Enabling the plugins

The smart_ plugin is a so-called wildcard plugin: the disk to monitor is appended to the plugin name after the underscore, so the symlink smart_sda monitors /dev/sda.

ln -s /usr/share/munin/plugins/smart_ /etc/munin/plugins/smart_sda
ln -s /usr/share/munin/plugins/hddtemp_smartctl /etc/munin/plugins/hddtemp_smartctl
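
Because the disk name is just the suffix of the symlink, several disks can be linked in one go. A small sketch; link_smart_plugins is a hypothetical helper, and sda and sdb in the usage line are example device names.

```shell
# Create one smart_<disk> symlink per disk name.
#   $1 = directory containing the smart_ plugin
#   $2 = directory munin-node reads its active plugins from
#   remaining arguments = disk names without /dev/
link_smart_plugins() {
    src_dir=$1
    dest_dir=$2
    shift 2
    for disk in "$@"; do
        # -f replaces a link of the same name left over from an earlier run
        ln -sf "$src_dir/smart_" "$dest_dir/smart_$disk"
    done
}

# Usage (as root):
#   link_smart_plugins /usr/share/munin/plugins /etc/munin/plugins sda sdb
```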

The plugin configuration in /etc/munin/plugin-conf.d still needs to be adjusted:

[hddtemp_smartctl]
user root
group disk
env.drives sda

[smart_*]
user root
group disk
env.smartpath /usr/sbin/smartctl

Restarting munin-node

/etc/init.d/munin-node restart

Done.

wiki/anleitungen/debian_munin-smart.txt · Last modified: 2009/10/06 12:19 (external edit)