Troubleshooting root.sh problems

------for 10g and 11.1------

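1. Verify that the public and private node names can ping each other.
2. Verify that the OCR/Voting files are readable and writable by both the oracle and root users (check the device permissions).
    dd if=/dev/raw/raw1 of=/dev/null   -- verifies that the device is readable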

Pre Install:
 OCR    - root:oinstall   - 640
 Voting - oracle:oinstall - 660
Post Install:
 OCR    - root:oinstall   - 640
 Voting - oracle:oinstall - 644

In RHAS 4.0, permissions should be added to /etc/rc.d/rc.local.  See
Note 293819.1 for more information.
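
The permission table above can be checked with a small helper. This is only a sketch: it assumes GNU stat (the -c option), and /dev/raw/raw2 is a hypothetical placeholder for the voting device, not a path taken from these notes.

```shell
#!/bin/sh
# Print "owner:group mode" for a file or device (GNU stat syntax assumed).
perm_of() {
    stat -c '%U:%G %a' "$1"
}

# Compare the actual permissions with an expected "owner:group mode" string.
# Returns 0 on match, 1 on mismatch, 2 if the path cannot be read by stat.
check_perm() {
    actual=$(perm_of "$1") || return 2
    if [ "$actual" = "$2" ]; then
        echo "OK   $1 $actual"
    else
        echo "FAIL $1 is $actual, expected $2"
        return 1
    fi
}

# Example calls (device paths are placeholders, adjust to your layout):
# check_perm /dev/raw/raw1 "root:oinstall 640"      # OCR
# check_perm /dev/raw/raw2 "oracle:oinstall 660"    # Voting, pre install
```

Run it as root on every node, since the raw device bindings and their permissions are set per node.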

3. Before running root.sh, make sure the OCR and voting disks are clean (first-time installation only).
Example: clear the disk headers

 dd if=/dev/zero of=/dev/traindata_dg/ocrV1064_100m.dbf bs=8192 count=12800
 dd if=/dev/zero of=/dev/traindata_dg/V1064_vote_01_20m.dbf bs=8192 count=2560
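
After zeroing, it can be useful to confirm that the device really reads back as zeros. A minimal sketch, using the same 8192-byte block size as the dd commands above; the function name header_is_zero is made up for illustration.

```shell
#!/bin/sh
# Succeeds when the first $2 blocks (8192 bytes each) of $1 contain only
# zero bytes. Reads with dd, strips NUL bytes, and counts what is left.
header_is_zero() {
    nonzero=$(dd if="$1" bs=8192 count="$2" 2>/dev/null \
        | tr -d '\000' | wc -c | tr -d ' ')
    [ "$nonzero" -eq 0 ]
}

# Example call (path taken from the dd example above):
# header_is_zero /dev/traindata_dg/ocrV1064_100m.dbf 12800 && echo clean
```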


4. Verify that the Oracle user has permissions on /var/tmp (specifically /var/tmp/.oracle).

5. Is pam being used?  Look for pam_unix messages in the messages file.  The pam configuration might need to be altered to allow root.sh to complete.  (pam_unix is the traditional password authentication module.)

6. Verify that the correct vendor clusterware version is being used (if vendor clusterware is being used).  If on Sun, make sure you are using the latest UDLM.
If on Sun, make sure the udlm has the keyword "reentrant".  Example:

 > more /var/sadm/pkg/ORCLudlm/pkginfo | grep VERSION
 VERSION=Dev Release 10/29/03, 64bit 3.3.4.7 reentrant

7. Verify that crs, css, or evm is not already running ( ps -ef | grep d.bin )
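
Checks 1 and 7 from the list above lend themselves to a small script. The following is a sketch under assumptions: it uses the Linux-style "ping -c 1" (Solaris ping takes different arguments), and the node names in the example calls are hypothetical.

```shell
#!/bin/sh
# Command used to probe a node; override PING to substitute another tool.
PING=${PING:-ping}

# Ping every node name given as an argument once; return 1 if any fails.
check_nodes() {
    rc=0
    for node in "$@"; do
        if $PING -c 1 "$node" >/dev/null 2>&1; then
            echo "OK   $node"
        else
            echo "FAIL $node"
            rc=1
        fi
    done
    return $rc
}

# Filter ps output (read from stdin) down to clusterware daemon lines.
clusterware_procs() {
    grep -E '(crsd|ocssd|evmd)\.bin'
}

# Read ps output from stdin; return 1 and list the daemons if any are found.
no_clusterware_running() {
    if found=$(clusterware_procs); then
        echo "still running:"
        echo "$found"
        return 1
    fi
    echo "no crs/css/evm daemons found"
}

# Example calls (node names are placeholders):
# check_nodes node1 node2 node1-priv node2-priv
# ps -e | no_clusterware_running
```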


------debug root.sh------

1. crsctl stop crs  (as the root user)

2. Backup the entire Oracle Clusterware home.

3. Execute <CRS_HOME>/install/rootdelete.sh on all nodes

4. Execute <CRS_HOME>/install/rootdeinstall.sh on the installing node

5. The following commands should return nothing:
* ps -e | grep -i 'ocs[s]d'
* ps -e | grep -i 'cr[s]d.bin'
* ps -e | grep -i 'ev[m]d.bin'
If any of them are still running, kill those processes or reboot the node.

6. Remove all files from /tmp/.oracle and /var/tmp/.oracle

7. Edit root.sh and add 'sh -x' before the two commands executed by it, e.g.
#!/bin/sh
sh -x /u01/app/oracle/product/crs102/install/rootinstall
sh -x /u01/app/oracle/product/crs102/install/rootconfig
8. Collect the output, e.g.
script /tmp/rootsh-node1.log
./root.sh
exit
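
The manual edit in step 7 can also be done with sed. This is a sketch only: it assumes root.sh calls rootinstall and rootconfig by absolute path, one per line, as in the example above, and the function name add_trace is made up for illustration.

```shell
#!/bin/sh
# Print the given root.sh with 'sh -x ' prefixed to the lines that invoke
# .../install/rootinstall and .../install/rootconfig. The input file is not
# modified, and lines that are already prefixed are left unchanged.
add_trace() {
    sed -e '/^\/.*\/install\/rootinstall[[:space:]]*$/s/^/sh -x /' \
        -e '/^\/.*\/install\/rootconfig[[:space:]]*$/s/^/sh -x /' "$1"
}

# Example (keeps the original as root.sh.orig):
# cp root.sh root.sh.orig
# add_trace root.sh.orig > root.sh
```

Writing to a copy first keeps the untouched root.sh available for restoring once the trace has been collected.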

9. Send rootsh-node1.log to Oracle Support for analysis.

- In some cases, these messages can be found in the rootsh-<node_name>.log file: