RAID Data Recovery Emergency Guide

What to do if your RAID system crashes

If you have problem with RAID server, the likely situation is you are not able to access the RAID volume any more. It may not be a trivial task to recover the data on your own as even the most experienced system engineers may be familiar with standard configuration without the necessary skill sets to rescue a corrupted or inaccessible RAID volume. However, by following some of the outlined process here, it may help you to minimize further loss of data or at least it will increase your chance of successful recovery with the right expert.


There are a number of possible situations here.

Situation 1
One disk failure, and the RAID server is running at degraded mode. Only some critical data is required.

This is the most likely situation that the user could recover the data fully and without the need to send the RAID system for recovery work. As the RAID is still accessible but without any more fault tolerance redundancy, any failure of next disk will result in complete RAID server failure.

Critical data should be copied out as soon as possible before any rebuild attempt is performed. At this point, the remaining disks making up the RAID volume could be near imminent failure. RAID rebuild process is generally IO intensive and could stress the disks to complete failure. One stands a greater chance to copy out the required data before total failure occurs. Once the critical data is copied out, standard rebuild process could then be carried out.

 

Situation 2
One disk failure, and the RAID server is running at degraded mode but in this case, it is important to save everything including the boot-up operating system and application.

Some RAID server is used as application and data server. It may not be sufficient to recover just the data as re-configuration of application may not be possible or too cumbersome. The only feasible option is to rebuild the degraded RAID volume. However, it is advisable that one should backup the disk image from all the working disks before a rebuild is perform.

See how to backup disk image.

In case the rebuild function fails or any more disks fail subsequently, one still has the backup disk image to work on further.

Back to top

Situation 3
Loss of RAID disk volume due to multiple disks failure, system crashes, power surge, lost of RAID configuration settings or other unknown reasons.

In such a case, one may need to seek data recovery assistance. Before proceeding, you may want to read some of the information below :

  • If your data is really important, before you part with your system, you may want to backup the disk image of all the working disks. It is better to play safe so you will always have a backup set to rework in case the original raid server suffered further corruption of any kinds

  • Carefully take note some of the following info if you have them:
    • Strip block size (normally a multiple factor of 8K) and order of disk elements in which the RAID volume is formed. Such info could normally be found in the RAID BIOS or RAID configuration Manager.
    • Description of problems
    • Description of user’s attempt
    • List of critical data and folders and any special requirements
    • Label each disk before taking them out and carefully note the corresponding position. This is extremely important if the number of disk elements is approaching 14.
    • Carefully pack your disks for delivery to recovery expert. Both working and damage disks are needed.

If the RAID volume is not longer accessible, do not attempt any rebuild as such act is meaningless and will only complicate the situation. Rebuild only makes sense if you have a degraded and accessible RAID volume.

Back to top