Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Swarm volumes can be replaced after either an admin-initiated Retire (see Managing Chassis and DisksDrives) or a Swarm-initiated failure resulting from I/O errors (see Retiring Volumes). After a volume is A volume can be replaced after being marked either Retired or Unavailable, it can be replaced.

Administrators can insert a drive into a running node without restarting the server, provided that the server hardware supports this function and disk.volumes = all all is configured. This feature (called hot plugging or hot swapping) lets you add allows adding storage capacity to a node at any time.

See See Hot Swapping and Plugging DrivesDisks.

Identifying the Drive

When a volume is marked unavailable or retired, its The physical drive light turns on and stays on for one hour . When you need help identifying a failed or failing drive, use when a volume is marked unavailable or retired. Use the drive light features of the UI when identifying a failed or failing drive:

  • Swarm UI: Click through Cluster > Hardware to view the chassis, and enable the drive light.

    To

    Click the disk light toggle in the summary row to flash the drive light for a specific drive

    , click the disk light toggle in its summary row. When you enable drive lights manually, they will

    . Drive lights remain lit until

    you turn them off

    turned off when enabling manually

    See 

    See Managing Chassis and Drives.

  • Legacy Admin Console: The Identify feature

    lets you identify

    allows identification of a Retired volume that

    needs

    need to be replaced.

    However,

    Use process of elimination if the volume was marked Unavailable

    , use process of elimination

    : identify each of the working volumes in the chassis to determine which one does not flash and therefore needs to be replaced. 

Once you have identified the correct drive, you can simply remove Remove the drive and verify its the serial number with the message in the UI . When you insert a new drive, Swarm will recognize that once the correct drive has been identified. Swarm recognizes a new volume is available and will then format formats it for use when a new drive is inserted.

See Drive Identification Plugin.

Suspending Volume Recovery

While Suspend volume recovery while replacing a failed hard drive, be sure to suspend volume recovery:

  • Swarm UI: In the Swarm UI, administrators  Administrators can suspend an in-process volume recovery using the Suspend Recovery option under the settings (gear) icon in the Swarm UI. After the drive is replaced, resume the Resume the recovery using either the Enable Disk Recovery button in the banner message or the Enable Recovery under the settings gear icon after the drive is replaced.

Info

...

Tip

For drive-related events requiring user action (such as drive removal), Swarm helps

...

locate the hardware by including the SCSI locator (bus ID) and volume serial number in the log message

...

displayed in the UI. (v9.2)

  • Legacy Admin Console:

    In the Settings menu, select
    1. Select Volume: Suspend Recovery in the Settings menu.

    2. Remove the defective drive and install the replacement drive.

    Ensure that
    1. Verify the new drive appears in the Swarm Admin Console and has a non-zero stream count after several minutes of cluster activity.

    In the Settings menu, turn
    1. Turn off Volume: Suspend Recovery in the Settings menu.