| Oracle® StorageTek SL150 Modular Tape Library System User Guide E24606-04 |
|
![]() Previous |
![]() Next |
Monitoring library operations for component failures and operational problems is one of the few regular tasks that you need to perform when operating an StorageTek SL150 Modular Tape Library System. The library is designed to operate automatically, under the control of host software. So, under normal circumstances, little or no operator intervention is needed. You merely need to monitor the system for abnormal conditions. Careful monitoring and prompt corrective actions minimize library downtime and maximize the availability of your data.
This section starts by outlining the main approaches to monitoring the health and operations of the StorageTek SL150 Modular Tape Library System and the interfaces that support the,:
Then it describes the processes for handling any problems that you detect:
The SL150 user interfaces provide a comprehensive set of remote and local monitoring tools. This section explains how to use them.
The remote, web-based user interface is the main tool for monitoring the SL150 Modular Tape Library System. It provides straightforward monitoring and troubleshooting tools and a comprehensive set of management interface. It is accessible from any workstation that has a web browser installed and network access to the library. Routine system monitoring is as simple as Checking the System Dashboard and Checking Health Details.
To monitor the SL150 tape library, you need only glance at the Library Health and host connection state indicators found at the top of every screen in the remote user interface. These two indicators summarize the overall operating condition of the library and the status of its connection with its host system.

The library Health indicator represents the aggregate state of every component in the system. When the library is Operational, all library components are themselves Operational and the library as a whole is fully functional and available for use. When the library is Degraded, the library is working and available for use, but one or more components have either failed or been degraded by subcomponent failures. When the library has Failed, it is unusable. Either a critical, non-redundant component has failed or a faulty bridged drive has broken the control-data path between the library and the host.
The library host-connection status indicator shows whether the Library is Online and available to host applications or unavailable due to maintenance or system faults. The library may be unavailable for any of the following reasons: the Library is Offline, a Magazine is Open, the Library is Initializing, the Library is Powering Down, the library is undergoing Maintenance, library Start-up Failed, or the Library is Inoperative and needs to be restarted.
If the library Health indicator shows that the library is Degraded or Failed, click on the indicator to view details. The remote user interface displays a fault table with a Date and Time, a Fault Code and Fault Name a list of Suspect Components, and a Recommended Action for correcting each problem.

Faults can have side effects that make other, healthy components appear faulty. So the Suspect Components field of each fault record lists possible causes in order of probability. For example, if the expansion controller on module 3 has lost its connection to the library controller and the motherboard, chassis, and power supply of the same module appear to have failed, the Suspect Components field will list the module interconnect cable that links module 3 to the library controller first, followed by the other, less-likely components.
For more information on interpreting library health information, see "Troubleshooting".
The SL150 local operator panel lets you check on library status and configuration when you are working directly with the hardware and cannot easily use the browser-based user interface. The local operator panel can be especially valuable when you are Checking Library and Component LED Indicators.
In the operator panel, the Health and host connection state indicators are again located at the top of the every screen. If the library Health indicator shows that the library is Degraded or Failed, click on the indicator to view details.

The local operator panel displays the same health details as the remote user interface, but in abbreviated form: a Fault Code that includes the fault name, a list of Suspect Components, and a corresponding Recommended Action:

For more information on interpreting library health information, see "Checking Health Details" and "Troubleshooting".
When you have physical access to the library, you can monitor its overall status and the status of individual Customer Replaceable Unit (CRU) components by checking the indicators on the library base module. A pair of light-emitting diode (LED) indicators shows the health of the library and the health of each of its customer-replaceable components:
![]() |
If the OK LED is green and the alert (!) LED is off, the library or component is fully Operational. |
![]() |
If the OK LED is green and the alert (!) LED is amber, the library or component is Degraded. It is still operating but with reduced functionality. |
![]() |
If the OK LED is off and the alert (!) LED is amber, the library or component has Failed and cannot perform its function. |
Library health indicators are located on the front panel of the library base unit at upper left, alongside the power button. Health indicators for the customer-replaceable local operator panel are found alongside the upper left corner of the touch-screen display.

The rear panel of the base unit also holds library health indicators health indicators, this time located at the top just left of center. Additional indicators show the status of customer replaceable units (CRUs), such as the robot/library controller, the power supplies, and tape drives.

The module controller in the upper right corner of the rear panel of an expansion module holds an additional set of LED health indicators.

Network management applications can monitor the SL150 Modular Tape Library using the Simple Network Management Protocol (SNMP). You can configure the library's SNMP agent to automatically send traps that alert network management stations to faults and configuration changes.
The SL150 library supports both version 2c and version 3 of the SNMP standard, so you can select the version that best meets your needs. Version 2c is backward compatible with earlier versions of the standard but is insecure (authentication credentials—community strings—and management data are exchanged in clear text). SNMP version 3 is not backwards compatible but supports a more secure authentication method and encryption of the management data.
The SNMP tab of the Settings screen lets you carry out the following SNMP configuration tasks:
For detailed information on SNMP and Oracle's StorageTek Modular Libraries, see the SNMP Reference Guide in the SL150 Modular Tape Library System Customer Documentation Library.
SNMP users are allowed to access the library's SNMP agent. To add a user, proceed as follows.
Log in under the Administrator or Service role.
Select Settings from the menu on the left side of the interface. A set of tabbed property sheets appears.
Select the SNMP tab.
Go to the SNMP Users table.
Click on Add SNMP User in the control bar above the table.
When the Add SNMP User dialog appears, select an SMNP version from the Version list. Choose v3 unless you have no security concerns and need compatibility with older SNMP versions, in which case choose v2c.
If you chose v2c, enter a community string (a password or phrase) in the Community Name text field. Then go to step 12.
If you chose v3, enter a name for the new SNMP user in the User Name field.
Select one of the user-authentication methods listed in the SNMP v3 Authentication Protocol list. Choose SHA for best security or MD5 for compatibility with systems that use MD5.
Enter a strong authentication password in the SNMP v3 Authentication Passphrase text field.
To secure management data during network transmission, choose an encryption standard from the SNMP v3 Privacy Protocol list. Select AES (Advanced Encryption Standard) for best security or DES (Data Encryption Standard) for compatibility with systems that use DES.
Enter a strong data-protection password in the SNMP v3 Privacy Passphrase text field.
To change the access parameters of an SNMP user, proceed as follows:
Log in under the Administrator or Service role.
Select Settings from the menu on the left side of the interface.
A set of tabbed property sheets appears.
Select the SNMP tab.
Go to the SNMP Users table.
Click on Edit SNMP User in the control bar above the table. The Edit SNMP User dialog appears.
If you need to change the SMNP version used, click the Version list. Choose v3 unless have no security concerns and need compatibility with older SNMP versions, in which case choose v2c.
If you changed to SNMP v2c in the preceding step, enter a community string (a password or phrase) in the Community Name text field. Then go to step 12.
If you just need to change an SNMP v2c community string, make your change in the Community Name text field. Then go to the last step.
If you changed to SNMP v3 in step 5, enter a name for the SNMP user in the User Name field.
If you just need to change an SNMP v3 user name, make your changes in the User Name field.
If you need to change the SNMP v3 authentication method, select one of the methods listed in the Authentication Protocol list. Choose SHA for best security or MD5 for compatibility with systems that use MD5.
If you need to change the SNMP v3 authentication password, make your changes in the Authentication Passphrase text field.
If you need to change the SNMP v3 data-encryption method, chose a standard from the Privacy Protocol list. Select AES (Advanced Encryption Standard) for best security or DES (Data Encryption Standard) for compatibility with systems that use DES.
If you need to change the SNMP v3 data-protection password, make your changes in the Privacy Passphrase text field.
To deny an existing SNMP user access to the SNMP agent, proceed as follows:
Log in under the Administrator or Service role.
Select Settings from the menu on the left side of the interface. A set of tabbed property sheets appears.
Select the SNMP tab.
Go to the SNMP Users table.
Click on Delete SNMP User in the control bar above the table. The Delete SNMP User dialog appears.
An SNMP trap recipient is a network management station that you designate to receive notifications sent by the SNMP agent on the library. To add a trap recipient, proceed as follows.
Log in under the Administrator or Service role.
Select Settings from the menu on the left side of the interface. A set of tabbed property sheets appears.
Select the SNMP tab.
Go to the SNMP Trap Recipients table.
Click on Add Trap Recipient in the control bar above the table. The Add Trap Recipient dialog appears.
Enter the IP address of the management host that should receive the trap in the Host text field.
Enter the severity of the notifications that should be sent to this management station in the Trap Level field.
See Appendix C, "SNMP Traps," for a list of valid entries.
Select an SMNP version from the Version list.
If you chose v2c, enter a community string (a password or phrase) in the Community Name text field. Then go to the last step.
If you chose v3, enter the name of the SNMP user that will have access to the traps in the Trap User Name field.
To authenticate the user, select one of the methods listed in the SNMP v3 Authentication Protocol list. Choose SHA for best security or MD5 for compatibility with systems that use MD5.
Enter a strong authentication password in the SNMP v3 Authentication Passphrase text field.
To secure management data during network transmission, choose an encryption standard from the SNMP v3 Privacy Protocol list. Select AES (Advanced Encryption Standard) for best security or DES (Data Encryption Standard) for compatibility with systems that use DES.
Enter a strong data-protection password in the SNMP v3 Privacy Passphrase text field.
In most cases, accept the default, library-generated value in the Engine ID field. If you must override this value, enter a hexadecimal value that starts with 0x and does not contain either all zeroes (0) or all sixteens (F).
An SNMP trap recipient is a network management station that you designate to receive notifications sent by the SNMP agent on the library. To modify trap recipient information, proceed as follows.
Log in under the Administrator or Service role.
Select Settings from the menu on the left side of the interface. A set of tabbed property sheets appears.
Select the SNMP tab.
Go to the SNMP Trap Recipients table.
Click on Edit Trap Recipient in the control bar above the table. The Edit Trap Recipient dialog appears.
To change the IP address of the management host that should receive the trap, enter the change in the Host text field.
To change the severity of the notifications that are sent to this management station, enter the change in the Trap Level field.
See SNMP Traps, for a list of valid entries.
If you need to change the SMNP version used, select either v3 or v2c from the Version list.
If you changed to v2c, enter a community string (a password or phrase) in the Community Name text field. Then go to the last step.
If you just need to change an SNMP v2c community string, make your change in the Community Name text field. Then go to the last step.
If you changed to v3, enter the name of the SNMP user that will have access to the traps in the Trap User Name field.
If you just need to change the name of the SNMP user that will have access to the traps, enter the change in the Trap User Name field
To change the SNMP v3 authentication method, select one of the methods listed in the Authentication Protocol list. Choose SHA for best security or MD5 for compatibility with systems that use MD5.
To change the SNMP v3 authentication password, make your changes in the Authentication Passphrase text field.
To change the SNMP v3 data-encryption method used, choose a standard from the Privacy Protocol list. Select AES (Advanced Encryption Standard) for best security or DES (Data Encryption Standard) for compatibility with systems that use DES.
To change the SNMP v3 data-protection password, enter the changes into the Privacy Passphrase text field.
If you must change the SNMP v3 agent identifier, enter the changes in the Engine ID field. Your entry must be a hexadecimal value that starts with 0x and does not contain either all zeroes (0) or all sixteens (F).
To remove an SNMP management station from the trap recipients list, proceed as follows:
Log in under the Administrator or Service role.
Select Settings from the menu on the left side of the interface. A set of tabbed property sheets appears.
Select the SNMP tab.
Go to the SNMP Trap Recipients table.
Click on Delete Trap Recipient in the control bar above the table. The Delete Trap Recipient dialog appears.
To test your SNMP configuration, proceed as follows:
Log in under the Administrator or Service role.
Select Settings from the menu on the left side of the interface. A set of tabbed property sheets appears.
Select the SNMP tab.
Go to the SNMP Trap Recipients table.
Click on Send a Test Trap in the control bar above the table. The Send a Test Trap dialog appears.
Press OK to send the test trap to all configured recipients or Cancel to abort.
You can download the management information base (MIB) using the procedure below:
Log in under the Administrator or Service role.
Select Settings from the menu on the left side of the interface. A set of tabbed property sheets appears.
Select the SNMP tab.
Click the Download Mib icon in the upper left corner of the SNMP tab.

When your browser's file-download dialog opens, save the STREAMLINE-TAPE-LIBRARY-MIB.txt file to disk.
The library Health indicator at the top of every SL150 browser-based user interface screen is your main diagnostic tool. If library health is Degraded or Failed there is a problem that you need to investigate.
To diagnose library health problems, Take the Library Off Line, and Use the Health Table.
If the Health indicator shows that the library is Degraded or Failed, proceed as follows:
Click on the Degraded or Failed indicator.
The user interface displays a Library Health table with a row for each fault detected on the system. Each row contains fields for the Date and Time, Fault Code, and Fault Name associated with the problem, a prioritized list of Suspect Components that might be causing the problem, and the Recommended Action for correcting the problem.
For each fault record in the table, Step Through the Suspect Components Listed in the Fault Record.
If you close all open fault records and still cannot get the library into Operational condition, go to "Getting Parts and Technical Support", Check for Relevant Knowledge Articles, and, if you cannot find a resolution, Create a Service Request. Include fault codes and details of the actions that you have taken.
Otherwise, if desired, you can make sure that the library is functioning normally by Performing a Library Self Test.
Note the Fault Code for the record.
Note the first component listed in the Suspect Components field of the record (the field may identify up to five components).
Note the corresponding Recommended Action.
If the Fault Code is 9108 and the Suspect Components field lists one or more magazines, carry out the procedure "Locate and Remove a Cartridge that is Stuck in a Magazine Slot".
If the Fault Code is 9108 and the Suspect Components field lists one or more drives, carry out the procedure "Free a Cartridge that is Stuck in a Drive".
If the Fault Code falls in the range 9102-9107 or is code 9109, an obstruction may be interfering with the robot. Carry out the procedure "Locate and Clear Obstructions, Such as Loose or Protruding Cartridges".
Otherwise, carry out the recommended action shown. Consult the volumes in the SL150 Modular Tape Library System Customer Documentation Library for instructions. For updates and additional information, see "Getting Parts and Technical Support", and Check for Relevant Knowledge Articles.
If the connection status indicator shows that the Library is Inoperative, you need to restart it. See "Restart the Library".
Once you have carried out the recommended action and, if necessary, restarted the library, see if the corresponding fault record has been closed. See "Check the Library Health Log".
If the Type field of the Health Log record shows that the record is still Open, you have not yet isolated the problem. Note the next item listed in the Suspect Components field of the record, and repeat steps 3-8.
If the Type field shows that the record is Closed and the library Health status is now Operational, stop here.
If the Type field shows that the record is Closed but the library Health status is still Degraded, check for remaining faults. See "Use the Health Table".
If the Library Health table lists an event with fault code 9108 and includes one or more magazines in the Suspect Components field for the event, a cartridge is probably stuck in a storage cell.
To locate and correct the problem, proceed as follows:
Select the first magazine listed in the Suspect Components field of the fault record.
Take the magazine out of the enclosure. See "Manually Unlatch and Remove the Magazine".
Working methodically from one end to the other, grasp each cartridge by the top and bottom gripping surfaces, and see if it moves freely.
If a cartridge is stuck, gently try to free it, and check the source storage cell for defects.
If you find any damage to the magazine, request a replacement. Go to "Getting Parts and Technical Support", and Create a Service Request.
If you cannot free a stuck cartridge, do not force it. Go to "Getting Parts and Technical Support", and Check for Relevant Knowledge Articles. If you cannot find a resolution, Create a Service Request.
Otherwise, reseat the cartridge in its slot and make sure that it is moving freely.
Reinstall the magazine in its bay.
The connection status indicator now shows that the Library is Inoperative, so you need to restart. See "Restart the Library".
Once the library restarts, see if the corresponding fault record has been closed. Go to "Check the Library Health Log".
If the Type field of the Health Log record shows that the record is still Open, you have not yet isolated the problem. Repeat the procedure "Step Through the Suspect Components Listed in the Fault Record", this time investigating the next suspect component listed in the fault record.
If the fault record is Closed and the library Health status is now Operational, stop here. The library is again operating normally.
If the fault record has been closed but the library Health status is still Degraded, check for remaining, open faults. See "Use the Health Table".
If the Library Health table lists an event with fault code 9108 and includes one or more tape drives in the Suspect Components field for the event, a tape cartridge is probably stuck in the drive. To resolve the problem, proceed as follows:
Note the order and identities of the components listed in the Suspect Components field of the Library Health table row that corresponds to the fault event.
Go to directly to "Getting Parts and Technical Support", and Check for Relevant Knowledge Articles. If you cannot find a resolution, Create a Service Request.
If the Library Health table lists a fault code in the range 9102-9107 or code 9109, a mispositioned cartridge or similar obstruction is probably interfering with the robot.
To locate and clear the obstruction, proceed as follows:
Select the magazine that will give you the best view of the interior of the library.
If you felt any significant resistance when attempting to remove the magazine, you may have found the obstruction stop (a tape cartridge may be protruding from one of the slots in this magazine and catching on the library structure). Select a magazine on the opposite side of the library, and repeat step 2.
Using the empty magazine bay as a window, look inside the library and try to locate obstructions, particularly loose or protruding cartridges. Look across, above, and below.
If you cannot see any obstructions, gain additional visibility by removing additional magazines.
If you see a cartridge protruding from a storage slot in one of the magazines or lying on the bottom of the library, reach in through the open magazine bay and remove it, if you can. Provide additional clearance, if necessary, by removing additional magazines.
If you cannot remove the problem cartridge, stop here, go to "Getting Parts and Technical Support", and Check for Relevant Knowledge Articles. If you cannot find a resolution, Create a Service Request.
If you successfully removed the problem cartridge, remove the source magazine and check the source storage cell for defects.
If you find any damage to the source magazine, request a replacement. Go to "Getting Parts and Technical Support", and Create a Service Request.
Otherwise, reseat the errant cartridge in its storage cell.
Reinstall all removed magazines in their original bays.
Once you have reinserted the magazines, go to "Restart the Library". Then go to "Check the Library Health Log".
If the Type field of the Health Log record shows that the record is still Open, you have not yet isolated the problem. Repeat the procedure "Step Through the Suspect Components Listed in the Fault Record", this time investigating the next suspect component listed in the fault record.
If the fault record has been closed and the library Health status is now Operational, stop here.
If the fault record has been closed but the library Health status is still Degraded, check for remaining open faults. See "Use the Health Table".
To release the automatic magazine latches during troubleshooting, manually override the software-controlled latches using the following procedure.
|
Caution: Use this procedure only during specified installation, maintenance, and troubleshooting procedures or under the direction of an Oracle service representative. Do not routinely override the automated magazine latches.To release a magazine during normal operation, use the user interface software: see "Unlatch the Magazine". |
If the library is online, carry out the steps under "Take the Library Off Line" before proceeding further.
Insert a 1/64-in Allen wrench into the hole at the lower inside corner of the cartridge magazine (a large paperclip that has been straightened into an L shape will do in an emergency).

Align the Allen wrench parallel to the magazine face and slowly push it into the hole.
Grasp the handle on the front of the magazine and slowly and gently pull the magazine a short distance out of the library. Stop if you feel any resistance other than the usual slight friction.
Remove the wrench.
Support the bottom of the magazine with your other hand.
Slowly and gently start to pull the magazine free of the module.
If you feel any significant resistance, stop immediately, particularly if you are looking for an obstruction (a tape cartridge may be protruding from one of the slots in this magazine and catching on the library structure).
If the magazine slides freely, draw it out of the library, handling it carefully so that cartridges do not spill out.
Set the magazine aside on a flat, stable, uncluttered surface, and make a note of the module that you removed it from.
Return to what you were doing, either "Locate and Remove a Cartridge that is Stuck in a Magazine Slot" or "Locate and Clear Obstructions, Such as Loose or Protruding Cartridges".
In a troubleshooting situation, you may have to restart the library more than once. Proceed as follows:
Select Service from the menu on the left side of the browser-based user interface, and select the Health Log tab from the tabbed property sheet on the screen.
In the Health Log table, locate the row for the fault that you have been working on.
Note the value of the Type field.
Return to what you were doing:
The library has a self-test feature that exercises all of the library's capabilities and can thus assist with fault isolation and validation of repair actions. The self-test moves a diagnostic cartridge between the reserve slot, storage slots, drives, and mailslot. A Basic Self Test performs seven moves. A Full Self Test moves the diagnostic tape to every empty slot in the library and performs a get/put to every occupied slot. The exact number of moves is, in this case, proportional to the size of the library.
To carry out a self test, proceed as follows:
If you have not already done so, take the library offline. See "Take the Library Off Line".
Select Library from the menu on the left side of the interface. The graphical library map appears.
Click the Library control at the top left of the Library screen.
Select Run Self Test from the Library control's context menu.
Select Basic Self Test or Full Self Test from the menu. A basic test will take a few minutes. A full test will take considerably longer.
To run the specified test, press OK when the confirmation bar appears at the top of the screen. When the test completes, the result bar will display the number of moves performed. Press OK to close the result bar.
To skip the test press Cancel.
To end a running test prematurely, press the Stop the Test button that appears after the test starts. It may take a short while for the test to stop. When it does, press OK.
If the diagnostic cartridge was not returned to its reserved cell at the end of the test, move the diagnostic cartridge now. See "Move Tape Cartridges Using the Browser-Based User Interface".
If you need to work on diagnostics off-line or if you need to open a service request, save the contents of the system health log to a file. Proceed as follows:
Log in to the browser-based user interface under the Administrator or Service role.
Select Service from the menu on the left side of the interface. A set of tabbed property sheets appears.
Select the Health Log tab.
Select Export from the control bar at the top of the tab.
When the download dialog appears, save the file to disk.
The HTML-based SL150_HealthLog.xls file is compatible with current spreadsheet applications, such as Microsoft Excel 2010 and Apache OpenOffice Calc 3.4. You can also view these files in web browsers: simply rename the file using the .html file extension in place of .xsl.
When you need firmware updates, upgrades, technical assistance, service, or replacement parts, proceed as follows:
Open a web browser window, and log in to My Oracle Support at https://support.oracle.com.
Generally, you should start by checking for knowledge articles that address your issue. From the tabbed menu bar that runs across the top of the page, select Knowledge and Check for Relevant Knowledge Articles.
If you need to check or upgrade the library firmware, select Patches & Updates. Then Get Firmware Updates.
For all other issues, select Service Requests. Then Create a Service Request.
To find relevant knowledge articles, go the Select a product line or a product area of the Knowledge tab.
Using the Browse menu, select Sun Microsystems > Storage-Tape > Libraries—SL-series > StorageTek SL150 Modular Tape Library System.
Go to the What do you want to do? area.
Select an option from the list: Overview, Use Product, Troubleshoot, Patching and Maintenance, Install and Configure, Upgrade, Optimize Performance, Security, or Certify (not all options may be relevant to the SL150 library).
Go to the Patch Search area of the Patches & Updates tab.
Click on the Product or Family (Advanced) link.
Check the Include all products in a family checkbox.
In the Product is field, enter SL, and then select StorageTek SL150 Modular Tape Library System from the list of search results.
Check the Exclude all superseded patches text box.
Press the Search button.
In the Patch Search Results table, click on the hyperlinked Patch Name of a patch that you need to download.
When the patch description appears, note the details. Then press the Download button if you want to download the file immediately or Press the Add to Plan button to add it to a group of patches that will be applied together.
Follow the instructions on screen.
Then carry out the steps listed in "Update the Firmware".
If you have not already done so, carry out the steps in "Export the Health Log" so that you can include full status information with your request when prompted.
Click the Help control at the upper right corner of the Service Requests tab.
From the drop-down menu, select Creating Service Requests (video) or the text article How do I create a new SR?