TIP : remplacer un disque cassé sur Isilon

Voici une petite procédure rapide, qui est certes décrite dans la documentation officielle Isilon (voir les liens en fin d’article), mais que je propose de vous détailler en français dans le texte. Et comme c’est Vendredi, cela m’évitera de vous ennuyer avec des grands discours stratégiques 🙂 . L’opération consiste donc à réaliser un remplacement de disque HS sur un noeud Isilon.

Une fois le nouveau disque reçu du support EMC, extrayez le disque en panne et remplacez-le par son remplaçant, physiquement. Ensuite, logguez vous via SSH sur la console d’un des Noeuds du cluster concerné. Vous pouvez repérer le disque en question via la command isi devices -a status -d X où X représente le numéro du noeud. Vous pouvez les faire dans l’ordre ou récupérer son numéro directement depuis l’interface graphique d’administration.

IridiumTU-1#  isi devices -a status -d 2
Node 2, [ATTN]
  Bay 1        Lnum N/A     [REPLACE]      SN:N/A                 N/A
  Bay 2        Lnum 34      [HEALTHY]      SN:PN1234P9G2PDUP      /dev/da2
  Bay 3        Lnum 17      [HEALTHY]      SN:PN2234P9G373TU      /dev/da19
  Bay 4        Lnum 16      [HEALTHY]      SN:PN1234P9G2982P      /dev/da20
  Bay 5        Lnum 33      [HEALTHY]      SN:PN1234P9G2PHZP      /dev/da3
  Bay 6        Lnum 15      [HEALTHY]      SN:PN1234P9G2LPLP      /dev/da21
  Bay 7        Lnum 14      [HEALTHY]      SN:PN2234P9G3B7ZU      /dev/da22
  Bay 8        Lnum 13      [HEALTHY]      SN:PN1234P9G294PP      /dev/da23
  Bay 9        Lnum 32      [HEALTHY]      SN:PN1234P9G2PV2P      /dev/da4
  Bay 10       Lnum 12      [HEALTHY]      SN:PN1234P9G2AL3P      /dev/da24
  Bay 11       Lnum 11      [HEALTHY]      SN:PN2234P9G371UU      /dev/da25
  Bay 12       Lnum 10      [HEALTHY]      SN:PN1234P9G298XP      /dev/da26
  Bay 13       Lnum 31      [HEALTHY]      SN:PN2234P9G0SH4W      /dev/da5
  Bay 14       Lnum 30      [HEALTHY]      SN:PN1234P9G2SS8P      /dev/da6
  Bay 15       Lnum 29      [HEALTHY]      SN:PN2234P9G2V9BU      /dev/da7
  Bay 16       Lnum 9       [HEALTHY]      SN:PN1234P9G2HZ0P      /dev/da27
  Bay 17       Lnum 28      [HEALTHY]      SN:PN1234P9G2NBLP      /dev/da8
  Bay 18       Lnum 27      [HEALTHY]      SN:PN1234P9G1X2TP      /dev/da9
  Bay 19       Lnum 26      [HEALTHY]      SN:PN2234P9G374GU      /dev/da10
  Bay 20       Lnum 8       [HEALTHY]      SN:PN1234P9G383HP      /dev/da28
  Bay 21       Lnum 25      [HEALTHY]      SN:PN1234P9G2A25P      /dev/da11
  Bay 22       Lnum 24      [HEALTHY]      SN:PN1234P9G3M2EP      /dev/da12
  Bay 23       Lnum 7       [HEALTHY]      SN:PN2234P9G3GD6U      /dev/da29
  Bay 24       Lnum 6       [HEALTHY]      SN:PN1234P9G34KVP      /dev/da30
  Bay 25       Lnum 5       [HEALTHY]      SN:PN1234P9G27GZP      /dev/da31
  Bay 26       Lnum 4       [HEALTHY]      SN:PN2234P9G3BVMU      /dev/da32
  Bay 27       Lnum 3       [HEALTHY]      SN:PN1234P9G2NGKP      /dev/da33
  Bay 28       Lnum 23      [HEALTHY]      SN:PN1234P9G2PJ2P      /dev/da13
  Bay 29       Lnum 2       [HEALTHY]      SN:PN1234P9G27E7P      /dev/da34
  Bay 30       Lnum 22      [HEALTHY]      SN:PN1234P9G2PHNP      /dev/da14
  Bay 31       Lnum 1       [HEALTHY]      SN:PN2234P9G3D6MU      /dev/da35
  Bay 32       Lnum 21      [HEALTHY]      SN:PN2234P8KKMR6Y      /dev/da15
  Bay 33       Lnum 0       [HEALTHY]      SN:PN1234P9G3NZ2P      /dev/da36
  Bay 34       Lnum 20      [HEALTHY]      SN:PN2234P9G3JDZU      /dev/da16
  Bay 35       Lnum 19      [HEALTHY]      SN:PN2234P9G374AU      /dev/da17
  Bay 36       Lnum 18      [HEALTHY]      SN:PN2234P9G3A1EU      /dev/da18

IridiumTU-1# isi devices -a status -d 2

Node 2, [ATTN]

Bay 1 Lnum N/A [REPLACE] SN:N/A N/A

Bay 2 Lnum 34 [HEALTHY] SN:PN1234P9G2PDUP /dev/da2

Bay 3 Lnum 17 [HEALTHY] SN:PN2234P9G373TU /dev/da19

Bay 4 Lnum 16 [HEALTHY] SN:PN1234P9G2982P /dev/da20

Bay 5 Lnum 33 [HEALTHY] SN:PN1234P9G2PHZP /dev/da3

Bay 6 Lnum 15 [HEALTHY] SN:PN1234P9G2LPLP /dev/da21

Bay 7 Lnum 14 [HEALTHY] SN:PN2234P9G3B7ZU /dev/da22

Bay 8 Lnum 13 [HEALTHY] SN:PN1234P9G294PP /dev/da23

Bay 9 Lnum 32 [HEALTHY] SN:PN1234P9G2PV2P /dev/da4

Bay 10 Lnum 12 [HEALTHY] SN:PN1234P9G2AL3P /dev/da24

Bay 11 Lnum 11 [HEALTHY] SN:PN2234P9G371UU /dev/da25

Bay 12 Lnum 10 [HEALTHY] SN:PN1234P9G298XP /dev/da26

Bay 13 Lnum 31 [HEALTHY] SN:PN2234P9G0SH4W /dev/da5

Bay 14 Lnum 30 [HEALTHY] SN:PN1234P9G2SS8P /dev/da6

Bay 15 Lnum 29 [HEALTHY] SN:PN2234P9G2V9BU /dev/da7

Bay 16 Lnum 9 [HEALTHY] SN:PN1234P9G2HZ0P /dev/da27

Bay 17 Lnum 28 [HEALTHY] SN:PN1234P9G2NBLP /dev/da8

Bay 18 Lnum 27 [HEALTHY] SN:PN1234P9G1X2TP /dev/da9

Bay 19 Lnum 26 [HEALTHY] SN:PN2234P9G374GU /dev/da10

Bay 20 Lnum 8 [HEALTHY] SN:PN1234P9G383HP /dev/da28

Bay 21 Lnum 25 [HEALTHY] SN:PN1234P9G2A25P /dev/da11

Bay 22 Lnum 24 [HEALTHY] SN:PN1234P9G3M2EP /dev/da12

Bay 23 Lnum 7 [HEALTHY] SN:PN2234P9G3GD6U /dev/da29

Bay 24 Lnum 6 [HEALTHY] SN:PN1234P9G34KVP /dev/da30

Bay 25 Lnum 5 [HEALTHY] SN:PN1234P9G27GZP /dev/da31

Bay 26 Lnum 4 [HEALTHY] SN:PN2234P9G3BVMU /dev/da32

Bay 27 Lnum 3 [HEALTHY] SN:PN1234P9G2NGKP /dev/da33

Bay 28 Lnum 23 [HEALTHY] SN:PN1234P9G2PJ2P /dev/da13

Bay 29 Lnum 2 [HEALTHY] SN:PN1234P9G27E7P /dev/da34

Bay 30 Lnum 22 [HEALTHY] SN:PN1234P9G2PHNP /dev/da14

Bay 31 Lnum 1 [HEALTHY] SN:PN2234P9G3D6MU /dev/da35

Bay 32 Lnum 21 [HEALTHY] SN:PN2234P8KKMR6Y /dev/da15

Bay 33 Lnum 0 [HEALTHY] SN:PN1234P9G3NZ2P /dev/da36

Bay 34 Lnum 20 [HEALTHY] SN:PN2234P9G3JDZU /dev/da16

Bay 35 Lnum 19 [HEALTHY] SN:PN2234P9G374AU /dev/da17

Bay 36 Lnum 18 [HEALTHY] SN:PN2234P9G3A1EU /dev/da18

Ici, le disque cassé se trouve dans le noeud 2, bay 1. Tapez enfin la commande isi devices -a add -d X:Y où X représente le numéro du noeud et Y le numéro de la baie :

IridiumTU-1#  isi devices -a add -d 2:1
Initiating add on bay 1
. . . . . . . . . . .
!! The add operation is in-progress. A OneFS-formatted drive was found
!! in bay 1 and is being added to the file system. Wait a few minutes
!! and then list all drives to verify that the add operation completed
!! successfully.

IridiumTU-1# isi devices -a add -d 2:1

Initiating add on bay 1

. . . . . . . . . . .

!! The add operation is in-progress. A OneFS-formatted drive was found

!! in bay 1 and is being added to the file system. Wait a few minutes

!! and then list all drives to verify that the add operation completed

!! successfully.

C’est terminé, le disque va être checké, initialisé et reconstruit automatiquement. Vous pouvez suivre ces étapes via la commande isi devices -a status -d X:Y :

IridiumTU-1#  isi devices -a status -d 2:1
Node 2, [ATTN]
Details for Drive in Bay 1, Node 2
Lnum: N/A
Status: [PREPARING]

(...)

IridiumTU-1#  isi devices -a status -d 2:1
Node 2, [ATTN]
Details for Drive in Bay 1, Node 2
Lnum: 37
Status: [HEALTHY]
Bay  1/da1    is HGST HUS724030ALA640 FW:MF8OAAC0 SN:PN1234P9GJM25X, 5860533168 blks

SMART data (Bay  1/da1   ) --
       Reallocation Count: 64/64 vs 05 (raw 0000) [flags 0033]
        Probational Count: 64/64 vs 00 (raw 0000) [flags 0022]
      Raw Read Error Rate: 64/64 vs 10 (raw 0000) [flags 000b]
          HDD Temperature: e6/e6 vs 00 (raw 001a) [flags 0002]
             Spin Retries: 64/64 vs 3c (raw 0000) [flags 0013]
               CRC Errors: c8/c8 vs 00 (raw 0000) [flags 000a]
   Throughput Performance: 64/64 vs 36 (raw 0000) [flags 0005]
              Spinup Time: 8b/8b vs 18 (raw 01ef) [flags 0007]
             Spinup Count: 64/64 vs 00 (raw 000c) [flags 0012]
          Seek Error Rate: 64/64 vs 43 (raw 0000) [flags 000b]
         Seek Performance: 64/64 vs 14 (raw 0000) [flags 0005]
            Power-on Time: 64/64 vs 00 (raw 0007) [flags 0012]
             Power Cycles: 64/64 vs 00 (raw 000c) [flags 0032]
 Emergency Retract Cycles: 64/64 vs 00 (raw 000c) [flags 0032]
       Load/Unload Cycles: 64/64 vs 00 (raw 000c) [flags 0012]
   Offline Realloc Events: 64/64 vs 00 (raw 0000) [flags 0032]
  Scan Uncorrectable Errs: 64/64 vs 00 (raw 0000) [flags 0008]
SMART status is threshold NOT exceeded (Bay  1/da1   )

SMART errlog (Bay  1/da1   ) --
  error index 00   historic error count 0000
  reserved:
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
    00 00 00 00 00 00 00 00 00
  version 01  cksum ff(ok)

IridiumTU-1# isi devices -a status -d 2:1

Node 2, [ATTN]

Details for Drive in Bay 1, Node 2

Lnum: N/A

Status: [PREPARING]

(...)

IridiumTU-1# isi devices -a status -d 2:1

Node 2, [ATTN]

Details for Drive in Bay 1, Node 2

Lnum: 37

Status: [HEALTHY]

Bay 1/da1 is HGST HUS724030ALA640 FW:MF8OAAC0 SN:PN1234P9GJM25X, 5860533168 blks

SMART data (Bay 1/da1 ) --

Reallocation Count: 64/64 vs 05 (raw 0000) [flags 0033]

Probational Count: 64/64 vs 00 (raw 0000) [flags 0022]

Raw Read Error Rate: 64/64 vs 10 (raw 0000) [flags 000b]

HDD Temperature: e6/e6 vs 00 (raw 001a) [flags 0002]

Spin Retries: 64/64 vs 3c (raw 0000) [flags 0013]

CRC Errors: c8/c8 vs 00 (raw 0000) [flags 000a]

Throughput Performance: 64/64 vs 36 (raw 0000) [flags 0005]

Spinup Time: 8b/8b vs 18 (raw 01ef) [flags 0007]

Spinup Count: 64/64 vs 00 (raw 000c) [flags 0012]

Seek Error Rate: 64/64 vs 43 (raw 0000) [flags 000b]

Seek Performance: 64/64 vs 14 (raw 0000) [flags 0005]

Power-on Time: 64/64 vs 00 (raw 0007) [flags 0012]

Power Cycles: 64/64 vs 00 (raw 000c) [flags 0032]

Emergency Retract Cycles: 64/64 vs 00 (raw 000c) [flags 0032]

Load/Unload Cycles: 64/64 vs 00 (raw 000c) [flags 0012]

Offline Realloc Events: 64/64 vs 00 (raw 0000) [flags 0032]

Scan Uncorrectable Errs: 64/64 vs 00 (raw 0000) [flags 0008]

SMART status is threshold NOT exceeded (Bay 1/da1 )

SMART errlog (Bay 1/da1 ) --

error index 00 historic error count 0000

reserved:

00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

00 00 00 00 00 00 00 00 00

version 01 cksum ff(ok)

Bon week-end !

Référence, doc EMC X-Series and NL-Series Field Replacement Manual

TIP : remplacer un disque cassé sur Isilon

Laisser un commentaire Annuler la réponse