This appendix provides troubleshooting information for installing Oracle Clusterware.
See Also:
The Oracle Database 11g Oracle RAC documentation set included with the installation media in the Documentation directory:Oracle Clusterware Administration and Deployment Guide
Oracle Real Application Clusters Administration and Deployment Guide
This appendix contains the following topics:
To address troubleshooting issues, Oracle recommends that you install OS Watcher, and if you intend to install an Oracle RAC database, RACDDT. You must have access to OracleMetaLink to download OS Watcher and RACDDT.
OS Watcher (OSW) is a collection of UNIX/Linux shell scripts that collect and archive operating system and network metrics to aid Oracle Support in diagnosing various issues related to system and performance. OSW operates as a set of background processes on the server and gathers operating system data on a regular basis. The scripts use common utilities such as vmstat
, netstat
and iostat
.
RACDDT is a data collection tool designed and configured specifically for gathering diagnostic data related to Oracle RAC technology. RACDDT is a set of scripts and configuration files that is run on one or more nodes of an Oracle RAC cluster. The main script is written in Perl, while a number of proxy scripts are written using Korn shell. RACDDT will run on all supported UNIX and Linux platforms, but is not supported on any Windows platforms.
OSW is also included in the RACDDT script file, but is not installed by RACDDT. OSW must be installed on each node where data is to be collected.
To download binaries for OS Watcher and RACDDT, go to the following URL:
https://metalink.oracle.com
Download OSW by searching for OS Watcher, and downloading the binaries from the User Guide bulletin. Installation instructions for OSW are provided in the user guide. Download RACDDT by searching for RACDDT, and downloading the binaries from the RACDDT User Guide bulletin.
The following is a list of examples of types of errors that can occur during installation. It contains the following issues:
Failed to connect to server, Connection refused by server, or Can't open display
Nodes unavailable for selection from the OUI Node Selection screen
PROT-8: Failed to import data from specified file to the cluster registry
/etc/oratab
pointing to a non-existent Oracle home. The OUI error file should show the following error: "java.io.IOException: /home/oracle/OraHome//bin/kfod: not found" (OracleMetalink bulletin 276454.1)/etc/oratab
pointing to a non-existing Oracle home.$ xhost fully_qualified_remote_host_name
For example:
$ xhost somehost.example.com
Then, enter the following commands, where workstation_name
is the host name or IP address of your workstation.
Bourne or Korn shell:
$ DISPLAY=workstation_name:0.0
$ export DISPLAY
To determine whether X Window applications display correctly on the local system, enter the following command:
$ xclock
The X clock should appear on your monitor.If the X clock appears, then close the X clock and start Oracle Universal Installer again.
Run the shell command ifconfig -a
. Compare the output of this command with the contents of the /etc/hosts
file to ensure that the node IP is listed.
Run the shell command nslookup
to see if the host is reachable.
As the oracle
user, attempt to connect to the node with ssh
or rsh
. If you are prompted for a password, then user equivalence is not set up properly. Review the section "Configuring SSH or RCP on All Cluster Nodes".
rootupgrade
. To confirm, look for the error "utopen:12:Not enough space in the backing store" in the log file $ORA_CRS_HOME/log/
hostname/client/ocrconfig_
pid
.log
./var/opt/oracle/srvConfig.loc
, and copy the contents of this raw device to the new device using the command dd
.time stamp 2005-04-04 14:49:49 is 106 s in the future
If Oracle Universal Installer (OUI) does not display the Node Selection page, then perform clusterware diagnostics by running the olsnodes
-v
command from the binary directory in your Oracle Clusterware home (CRS_home
/bin
on Linux and UNIX-based systems, and CRS_home
\BIN
on Windows-based systems) and analyzing its output. Refer to your clusterware documentation if the detailed output indicates that your clusterware is not running.
In addition, use the following command syntax to check the integrity of the Cluster Manager:
cluvfy comp clumgr -n node_list -verbose
In the preceding syntax example, the variable node_list
is the list of nodes in your cluster, separated by commas.
If you use more than one NIC for the interconnect, then you must use NIC bonding, or the interconnect will fail.
If you install Oracle Clusterware and Oracle RAC, then they must use the same NIC or bonded NIC cards for the interconnect.
If you use bonded NIC cards, then they must be on the same subnet.