Re: Oracle 10g "Ghost" SID Eating up CPU

From: Martin Berger <martin.a.berger@xxxxxxxxx>
To: kjped1313@xxxxxxxxx
Date: Wed, 2 Jun 2010 20:19:00 +0200

Kellyn,

As you still have a sid and serial#, have you tried 'alter system disconnect session' or 'alter system kill session'? Maybe you should note the saddr and paddr.

Do you have a paddr for this sid?
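
A minimal sketch of what that could look like in SQL*Plus, using the SID and serial# from the original post (540, 20564). The outer join is only there to show whether the session's paddr still matches a row in v$process. Either ALTER SYSTEM may well fail with the same ORA-00030 already reported, so treat this as something to try, not a fix.

```sql
-- Note the session and process addresses before trying anything.
-- SID 540 / serial# 20564 are taken from the original post.
SELECT s.sid, s.serial#, s.saddr, s.paddr, p.pid, p.spid
FROM   v$session s
LEFT JOIN v$process p ON p.addr = s.paddr
WHERE  s.sid = 540;

-- Ask Oracle to disconnect the session and terminate its server process.
ALTER SYSTEM DISCONNECT SESSION '540,20564' IMMEDIATE;

-- Alternative: mark the session as killed and roll back its work.
ALTER SYSTEM KILL SESSION '540,20564' IMMEDIATE;
```

If the first query returns NULL for pid and spid, the session points at a process address that v$process no longer exposes.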

Even if v$process does not show anything, maybe x$ksupgp or x$ksupr, or other structures which show (parallel) processes, can give you some more details.
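
One way to look, as a rough sketch: the documented v$px_* views first, then x$ksupr directly. The x$ query has to run as SYS, and the column names used there (ksuprpid, ksuprpnm, ksspaflg) are my assumption, based on how v$process is usually defined on top of x$ksupr in 10g. Check them against v$fixed_view_definition on your system before relying on them.

```sql
-- Documented views: what does Oracle think its PX servers and PX sessions are?
SELECT server_name, status, pid, spid, sid, serial#
FROM   v$px_process
ORDER  BY server_name;

SELECT sid, serial#, qcsid, server_group, server_set, degree, req_degree
FROM   v$px_session;

-- Shows how v$process is built on this version, including any filter
-- that could hide a row of x$ksupr from v$process.
SELECT view_definition
FROM   v$fixed_view_definition
WHERE  view_name = 'GV$PROCESS';

-- As SYS only. Column names are assumed, see the note above.
-- Looks up the process state object the ghost session (SID 540) points to.
SELECT addr, indx, ksuprpid, ksuprpnm, ksspaflg
FROM   x$ksupr
WHERE  addr = (SELECT paddr FROM v$session WHERE sid = 540);
```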


but if you can avoid bouncing the DB it would be worth investigating!

I'd also second Tim Gorman: if you identify all OS process IDs which eat CPU, can you identify all of these within your database(s)? Or are some processes remaining that you cannot link to any purpose?
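
A small sketch of that cross-check, to be run in each database on the host. The PIDs in the IN list are hypothetical placeholders, to be replaced with whatever top or ps reports as the CPU consumers.

```sql
-- Map OS process IDs to Oracle processes and, where one exists, to a session.
-- v$process.spid is a character column, so the PIDs are quoted.
SELECT p.spid, p.pid, p.program, s.sid, s.serial#, s.username, s.status
FROM   v$process p
LEFT JOIN v$session s ON s.paddr = p.addr
WHERE  p.spid IN ('12345', '23456');   -- hypothetical PIDs, replace with real ones
```

Any PID that comes back with no row at all is not known to that instance's v$process. A row with NULL session columns is a process without a session attached.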


sorry, more questions than answers.

Martin


On 01.06.2010 at 20:25, Kellyn Pedersen wrote:

The last two weekends, due to some new code, my main datawarehouse/OLTP (yes, I know it's an oxymoron, and it's 10.2.0.4 on Linux, 64bit, with a number of one-off patches for parallel bugs...) has been overwhelmed by 32 CTAS concurrently running, all requesting 4 parallel on large table selects. Parallel was downgraded a number of times, 75% or more, during this step in their package.

This is the second time I've come back in after the occurrence to find one parallel coordinator session running on its own: no other producers/consumers, no parent SID, just this one process eating up CPU.

SID             : 540
SERIAL#         : 20564
STATUS          : ACTIVE
OSUSER          : sdev_user
PROCESS         : 31988
MACHINE         : appmachine
PROGRAM         : prodmachine (P039)
ROW_WAIT_OBJ#   : 2815532
PDDL_STATUS     : ENABLED
PQ_STATUS       : ENABLED
EVENT           : PX Deq: Execution
P1TEXT          : sleeptime/senderid
SECONDS_IN_WAIT : 141170

If you try to search for the OS process (31988), it doesn't exist. The SQL_ID is unknown, but I can see it was sitting on the primary key for a particular table (although a different one than the last time this ghost was present last week!). What I believe happened is that the parallel query died, but the coordinator is still out there.
ERROR at line 1:
ORA-12805: parallel query server died unexpectedly
I found a couple of these errors (12805) in trace files from the times that parallel was downgraded. The process doesn't exist on the app server, I don't have an OS PID to kill, and I can't kill it at the Oracle session level (ORA-00030: User session ID does not exist.). Last time we had a maintenance window and solved the problem quickly with a database cycle, but here I am again. HOW DO I GET RID of this thing!?!? It's starting to eat up CPU and won't die... :(

SID  PID  Coordinator  SPID   Group  Set  Degree  Req Degree  Wait Event
540  134  540          31988                                  PX Deq:Execution Msg

Anybody have any ideas? I actually have two P039 processes in my database right now! This cannot be good! :(

Kellyn Pedersen

Sr. Database Administrator

I-Behavior Inc.

http://www.linkedin.com/in/kellynpedersen

www.dbakevlar.blogspot.com
"Go away before I replace you with a very small and efficient shell script..."
