Re: Oracle 10g RAC on AIX

  • From: Sanjay Mishra <smishra_97@xxxxxxxxx>
  • To: Jeremy Schneider <jeremy.schneider@xxxxxxxxxxxxxx>
  • Date: Mon, 28 Jan 2008 13:56:08 -0800 (PST)

Jeremy

This is dev environment. I had patched the CRS/RDBMS to 10203 and then applied 
6440669 merge Patch. I had now even rebooted all nodes and now all daemon are 
back. ocrcheck is also showing everything is fine but now crs_stat -t is 
hanging on all of them. When it was rebooted, it came fine bit after 15-20 
minutes, when I tried again, it is hanging

While applying the 6440669 patch, it has the last part on AIx it has special 
instrction
To try and identify the likely cause please execute the following  commands  
# and provide the output to your support representative, who will be  able to  
# identify the corrective steps.
#     genld -l | grep <CRS_HOME>
#     genkld | grep <CRS_HOME>    ( full or partial path will do )

I had few rows coming in the output. I had even ran the /usr/sbin/slibclean 
again and ran the above gen... command bit still see it like 
$genkld | grep "/u01/app/crs/10.2"
 900000003d75100    103e0 /u01/app/crs/10.2/lib/libdbcfg10.a[shr_dbcfg10.o]
 900000002691100  162fbbd /u01/app/crs/10.2/lib/libttsh10.a[shr_ttsh10.o]
 900000003d2c100    40de1 /u01/app/crs/10.2/lib/libocrb10.a[shr_ocrb10.o]
 900000003d28100     162d /u01/app/crs/10.2/lib/libskgxn2.a[shr_skgxn2.o]
 900000003d1d100     a1cc /u01/app/crs/10.2/lib/libocrutl10.a[shr_ocrutl10.o]
 900000003cc1100    5b130 /u01/app/crs/10.2/lib/libocr10.a[shr_ocr10.o]
 9000000025eb100    a5459 /u01/app/crs/10.2/lib/libhasgen10.a[shr_hasgen10.o]
$genld -l | grep "/u01/app/crs/10.2"
  900000003d28100    162d  /u01/app/crs/10.2/lib/libskgxn2.a[shr_skgxn2.o]
  900000003d2c100   40de1  /u01/app/crs/10.2/lib/libocrb10.a[shr_ocrb10.o]
  900000003d1d100    a1cc  /u01/app/crs/10.2/lib/libocrutl10.a[shr_ocrutl10.o]
  900000002691100  162fbbd  /u01/app/crs/10.2/lib/libttsh10.a[shr_ttsh10.o]
  9000000025eb100   a5459  /u01/app/crs/10.2/lib/libhasgen10.a[shr_hasgen10.o]
  900000003cc1100   5b130  /u01/app/crs/10.2/lib/libocr10.a[shr_ocr10.o]
  900000003d2c100   40de1  /u01/app/crs/10.2/lib/libocrb10.a[shr_ocrb10.o]
  900000003d28100    162d  /u01/app/crs/10.2/lib/libskgxn2.a[shr_skgxn2.o]
  900000003d1d100    a1cc  /u01/app/crs/10.2/lib/libocrutl10.a[shr_ocrutl10.o]
  900000003cc1100   5b130  /u01/app/crs/10.2/lib/libocr10.a[shr_ocr10.o]
  900000002691100  162fbbd  /u01/app/crs/10.2/lib/libttsh10.a[shr_ttsh10.o]
  9000000025eb100   a5459  /u01/app/crs/10.2/lib/libhasgen10.a[shr_hasgen10.o]
  900000003d1d100    a1cc  /u01/app/crs/10.2/lib/libocrutl10.a[shr_ocrutl10.o]
  900000003d2c100   40de1  /u01/app/crs/10.2/lib/libocrb10.a[shr_ocrb10.o]
  900000003cc1100   5b130  /u01/app/crs/10.2/lib/libocr10.a[shr_ocr10.o]
  900000003d28100    162d  /u01/app/crs/10.2/lib/libskgxn2.a[shr_skgxn2.o]
  900000002691100  162fbbd  /u01/app/crs/10.2/lib/libttsh10.a[shr_ttsh10.o]
  9000000025eb100   a5459  /u01/app/crs/10.2/lib/libhasgen10.a[shr_hasgen10.o]
  900000003d1d100    a1cc  /u01/app/crs/10.2/lib/libocrutl10.a[shr_ocrutl10.o]
  900000003d2c100   40de1  /u01/app/crs/10.2/lib/libocrb10.a[shr_ocrb10.o]
  900000003d28100    162d  /u01/app/crs/10.2/lib/libskgxn2.a[shr_skgxn2.o]
  900000003cc1100   5b130  /u01/app/crs/10.2/lib/libocr10.a[shr_ocr10.o]
  900000002691100  162fbbd  /u01/app/crs/10.2/lib/libttsh10.a[shr_ttsh10.o]
  9000000025eb100   a5459  /u01/app/crs/10.2/lib/libhasgen10.a[shr_hasgen10.o]

I don't know about this and so as per patch instruction , open the issue to 
Oracle support. 

There is no error during the patch application on any node.

----- Original Message ----
From: Jeremy Schneider <jeremy.schneider@xxxxxxxxxxxxxx>
To: smishra_97@xxxxxxxxx
Cc: oracle-l@xxxxxxxxxxxxx
Sent: Monday, January 28, 2008 4:36:09 PM
Subject: Re: Oracle 10g RAC on AIX

Is this a production system?  If it's not production, then I'm curious - does 
the system consistently come up in this state when you reboot everything?  
Also, I couldn't tell from your email - did you run those commands (ps, crsctl, 
init.d stop) on node 1 or on the other nodes?



On 1/28/08, Sanjay Mishra <smishra_97@xxxxxxxxx> wrote:
Hi
 
I am working on 10g RAC on AIX and having the strange problem. I applied 
Patchset 2 and now getting some problem in access crs
 
On Node 1, it shows in crs_stat -t that all Resources gsd/ons/vip are up on all 
5 modes while on all other nodes, it shows the process are running but crs_stat 
-t gives the following error
CRS-0184: Cannot communicate with the CRS daemon.

 
ps -ef|grep crs gives the output as 
    root 565460 716832   0 14:12:33      -  0:22 /u01/app/crs/10.2/bin/crsd.bin 
reboot
    root 589936      1   0 14:12:46      -  0:00 /u01/app/crs/10.2/bin/racgmain 
ora.gkd122.vip rundetach ora.gkd122.vip 1 startorp gkd122
  oracle 614456 860378   0 14:12:41      -  0:00 
/u01/app/crs/10.2/bin/evmlogger.bin -o /u01/app/crs/10.2/evm/log/evmlogger.info 
-l /u01/app/crs/10.2/evm/log/evmlogger.log
  oracle 635124 426140   0 14:12:36      -  0:05 /u01/app/crs/10.2/bin/ocssd.bin
  oracle 675854      1   0 14:12:47      -  0:00 /u01/app/crs/10.2/opmn/bin/ons 
-d
    root 716832      1   0 14:12:31      -  0:00 /bin/sh /etc/init.crsd run
  oracle 733338 684146   0 14:12:34      -  0:00 /bin/sh -c cd 
/u01/app/crs/10.2/log/gkd122/cssd/oclsomon; ulimit -c unlimited; 
/u01/app/crs/10.2/bin/oclsomon  || exit $?
  oracle 737488 675854   0 14:12:47      -  0:00 /u01/app/crs/10.2/opmn/bin/ons 
-d
    root 843828 667806   0 14:12:34      -  0:00 
/u01/app/crs/10.2/bin/oprocd.bin run -t 1000 -m 500 -f
  oracle 860378 761930   0 14:12:33      -  0:00 /u01/app/crs/10.2/bin/evmd.bin
  oracle 864454 733338   0 14:12:34      -  0:01 
/u01/app/crs/10.2/bin/oclsomon.bin

 
I tried to stop crsctl check crs
$crsctl check crs
CSS appears healthy
Try againEVM appears healthy

I tried to stop crs using crsctl stop crs or /etc/init.crs stop as root but the 
processes are still running.
 
Any advice to look into it
 
Sanjay



Be a better friend, newshound, and know-it-all with Yahoo! Mobile. Try it now.



-- 
Jeremy Schneider
Chicago, IL
http://www.ardentperf.com/category/technical


      
____________________________________________________________________________________
Never miss a thing.  Make Yahoo your home page. 
http://www.yahoo.com/r/hs

Other related posts: