I´m try to recover my rrds, because when i execute the rrd_corrupt_files.pl the result is:
71887 total RRDs
63224 good RRDs
8663 errored RRDs
0 RRDs removed
and this information is critical.
How can i recover this information?
I am glad that the Opmantek Support team was able to assist you with your issue, remember that it is always best to run the rrd_resize.pl tool with backups enabled, so you have a backup of your RRD before modification.
Well, when i do a query to the rrds the health.rrd, mib2ip.rrd and reach.rrd are not update since Jul 28:
-rw-rw----. 1 nmis nmis 5879712 Jul 31 12:42 bgpPeer-184.108.40.206.rrd
-rw-rw----. 1 nmis nmis 5879712 Jul 31 12:42 bgpPeer-220.127.116.11.rrd
-rw-rw----. 1 nmis nmis 981352 Jul 31 12:42 fanstatus-1.rrd
-rw-rw----. 1 nmis nmis 981352 Jul 31 12:42 fanstatus-2.rrd
-rw-rw----. 1 nmis nmis 981352 Jul 31 12:42 fanstatus-3.rrd
-rw-rw----. 1 nmis nmis 9230080 Jul 28 05:54 health.rrd
-rw-rw----. 1 nmis nmis 13069840 Jul 28 05:54 mib2ip.rrd
-rw-rw----. 1 nmis nmis 981352 Jul 31 12:42 psustatus-1.rrd
-rw-rw----. 1 nmis nmis 981352 Jul 31 12:42 psustatus-2.rrd
-rw-rw----. 1 nmis nmis 13886600 Jul 28 05:54 reach.rrd
-rw-rw----. 1 nmis nmis 1961024 Jul 31 12:42 tempstatus-1.rrd
And when I execute this command:
[root@qcnln228v health]# rrdtool lastupdate health.rrd
ERROR: 'health.rrd' is not an RRD file
Really this is the problem!!
Thanks a lot for you attention.
If you are looking to repair your bad RRD files running rrd_corrupt_files.pl may not be the way to go. This script takes two arguments, run=true and remove=true. When both are set to true rrd_corrupt_files.pl will find RRD files with errors in them and unlink them, essentially removing them from the RRD collection.
My next question would be are you sure your RRDs are corrupt? What symptoms are you experiencing? Usually when there is a large number of corrupt files it's an issue with the storage system and these problems wont be isolated to one set of file or folders. With such a large number of errored RRDs your best bet might be to restore from a backup (after testing your storage system for reliability).