[Linuxtrent] Re: kernel crash

  • From: Emanuele Olivetti <olivetti@xxxxxx>
  • To: linuxtrent@xxxxxxxxxxxxx
  • Date: Wed, 08 Nov 2006 14:33:15 +0100

Altro crash proprio ora. In realta' non si e' bloccato tutto subito: mi sono
tenuto una console di root aperta e un sistema per essere avvertito al volo
(ok, questa e' un po' lunga da spiegare) e ho provato immediatamente a lanciare
un reboot remoto (l'altra volta ho avuto qualche secondo e ho lanciato dmesg
ma il sistema si e' bloccato definitivamente, quindi stavolta ho provato solo
il 'reboot'). Comunque niente da fare, pure il reboot si e' bloccato ad un 
qualche
punto. Questo il (lungo) log di stavolta:

---------------------------------------------------------------------------------------
Nov  8 14:01:17 localhost kernel: BUG: unable to handle kernel paging request 
at virtual address 00020000
Nov  8 14:01:17 localhost kernel:  printing eip:
Nov  8 14:01:17 localhost kernel: c0141339
Nov  8 14:01:17 localhost kernel: *pde = 00000000
Nov  8 14:01:17 localhost kernel: Oops: 0000 [#1]
Nov  8 14:01:17 localhost kernel: SMP
Nov 8 14:01:17 localhost kernel: Modules linked in: w83627hf hwmon_vid i2c_isa i2c_dev sg scsi_mod xt_tcpudp xt_limit ipt_MASQUERADE xt_state iptable_nat ip_nat ip_conntrack nfnetlink iptable_filter ip_tables x_tables ipv6 ext2 mbcache dm_snapshot dm_mirror dm_mod psmouse ide_generic snd_intel8x0 snd_ac97_codec snd_ac97_bus snd_pcm snd_timer snd_mpu401 snd_mpu401_uart snd_page_alloc shpchp pci_hotplug analog snd_rawmidi snd_seq_device snd floppy i2c_sis96x i810_audio sis_agp agpgart 8250_pnp i2c_core ac97_codec evdev soundcore rtc pcspkr parport_pc parport ns558 gameport reiserfs usbhid ide_cd cdrom ide_disk generic ohci_hcd sis5513 ide_core 8139cp ehci_hcd sis900 usbcore 8139too mii thermal processor fan
Nov  8 14:01:17 localhost kernel: CPU:    0
Nov  8 14:01:17 localhost kernel: EIP:    0060:[<c0141339>]    Not tainted VLI
Nov  8 14:01:17 localhost kernel: EFLAGS: 00010006   (2.6.17-2-vserver-k7 #1)
Nov  8 14:01:17 localhost kernel: EIP is at find_get_page+0x1e/0x38
Nov  8 14:01:17 localhost kernel: eax: 00020000   ebx: 0000f335   ecx: 00020000 
  edx: 00000000
Nov  8 14:01:17 localhost kernel: esi: f5cebf00   edi: 0000f335   ebp: 0000f335 
  esp: d783be40
Nov  8 14:01:17 localhost kernel: ds: 007b   es: 007b   ss: 0068
Nov  8 14:01:17 localhost kernel: Process rsync (pid: 5369[#0], 
threadinfo=d783a000 task=da28b570)
Nov  8 14:01:17 localhost kernel: Stack: 00001000 c10fcd80 c01419e5 dfca6300 
dfca6348 f5cebf00 f5cebe50 000316ff
Nov  8 14:01:17 localhost kernel:        00000000 0000f340 0000f340 0000f334 
316ff858 00000000 00000000 00001000
Nov  8 14:01:17 localhost kernel:        0000f32a 00000020 00000000 00000000 
0000f33f 0000f34a 00000020 00000020
Nov  8 14:01:17 localhost kernel: Call Trace:
Nov  8 14:01:17 localhost kernel:  <c01419e5> do_generic_mapping_read+0x148/0x41f  
<c0142514> __generic_file_aio_read+0x16f/0x1b6
Nov  8 14:01:17 localhost kernel:  <c01411e3> file_read_actor+0x0/0xca  
<c014349e> generic_file_read+0x0/0xac
Nov  8 14:01:17 localhost kernel:  <c0143536> generic_file_read+0x98/0xac  
<c012cd3b> autoremove_wake_function+0x0/0x2d
Nov  8 14:01:17 localhost kernel:  <c015a45e> vfs_read+0x9f/0x13e  <c015a8a7> 
sys_read+0x3c/0x63
Nov  8 14:01:17 localhost kernel:  <c0102af3> sysenter_past_esp+0x54/0x75
Nov 8 14:01:17 localhost kernel: Code: 90 ff 47 10 fb 5a 89 d8 5b 5e 5f 5d c3 56 89 c6 8d 40 10 53 89 d3 e8 23 d2 13 00 8d 46 04 89 da e8 91 8d 07 00 85 c0 89 c1 74 10 <8b> 00 89 ca f6 c4 40 74 03 8b 51 0c 90 ff 42 04 90 ff 46 10 fb
Nov  8 14:01:17 localhost kernel: EIP: [<c0141339>] find_get_page+0x1e/0x38 
SS:ESP 0068:d783be40
Nov  8 14:01:17 localhost kernel:  BUG: warning at 
kernel/softirq.c:141/local_bh_enable()
Nov  8 14:01:17 localhost kernel:  <c012107f> local_bh_enable+0x25/0x64  
<c0220c6d> lock_sock+0x85/0x8d
Nov  8 14:01:17 localhost kernel:  <c021ea69> sock_fasync+0x5c/0x111  
<c021f9dc> sock_close+0x1e/0x2a
Nov  8 14:01:17 localhost kernel:  <c015ab44> __fput+0x8d/0x16e  <c01584ea> 
filp_close+0x4e/0x54
Nov  8 14:01:17 localhost kernel:  <c011e259> put_files_struct+0x64/0xc0  
<c011f1c7> do_exit+0x19f/0x703
Nov  8 14:01:17 localhost kernel:  <c0114974> bust_spinlocks+0x3a/0x43  
<c0103eee> die+0x1d3/0x288
Nov  8 14:01:17 localhost kernel:  <c0103f7e> die+0x263/0x288  <c0114ea6> 
do_page_fault+0x441/0x526
Nov  8 14:01:17 localhost kernel:  <c0105026> do_IRQ+0x1e/0x24  <c0114a65> 
do_page_fault+0x0/0x526
Nov  8 14:01:17 localhost kernel:  <c01036f7> error_code+0x4f/0x54  <c0141339> 
find_get_page+0x1e/0x38
Nov  8 14:01:17 localhost kernel:  <c01419e5> do_generic_mapping_read+0x148/0x41f  
<c0142514> __generic_file_aio_read+0x16f/0x1b6
Nov  8 14:01:17 localhost kernel:  <c01411e3> file_read_actor+0x0/0xca  
<c014349e> generic_file_read+0x0/0xac
Nov  8 14:01:17 localhost kernel:  <c0143536> generic_file_read+0x98/0xac  
<c012cd3b> autoremove_wake_function+0x0/0x2d
Nov  8 14:01:17 localhost kernel:  <c015a45e> vfs_read+0x9f/0x13e  <c015a8a7> 
sys_read+0x3c/0x63
Nov  8 14:01:17 localhost kernel:  <c0102af3> sysenter_past_esp+0x54/0x75
Nov  8 14:01:17 localhost kernel: BUG: warning at 
kernel/softirq.c:141/local_bh_enable()
Nov  8 14:01:17 localhost kernel:  <c012107f> local_bh_enable+0x25/0x64  
<c021eb12> sock_fasync+0x105/0x111
Nov  8 14:01:17 localhost kernel:  <c021f9dc> sock_close+0x1e/0x2a  <c015ab44> 
__fput+0x8d/0x16e
Nov  8 14:01:17 localhost kernel:  <c01584ea> filp_close+0x4e/0x54  <c011e259> 
put_files_struct+0x64/0xc0
Nov  8 14:01:17 localhost kernel:  <c011f1c7> do_exit+0x19f/0x703  <c0114974> 
bust_spinlocks+0x3a/0x43
Nov  8 14:01:17 localhost kernel:  <c0103eee> die+0x1d3/0x288  <c0103f7e> 
die+0x263/0x288
Nov  8 14:01:17 localhost kernel:  <c0114ea6> do_page_fault+0x441/0x526  
<c0105026> do_IRQ+0x1e/0x24
Nov  8 14:01:17 localhost kernel:  <c0114a65> do_page_fault+0x0/0x526  
<c01036f7> error_code+0x4f/0x54
Nov  8 14:01:17 localhost kernel:  <c0141339> find_get_page+0x1e/0x38  
<c01419e5> do_generic_mapping_read+0x148/0x41f
Nov  8 14:01:17 localhost kernel:  <c0142514> __generic_file_aio_read+0x16f/0x1b6  
<c01411e3> file_read_actor+0x0/0xca
Nov  8 14:01:17 localhost kernel:  <c014349e> generic_file_read+0x0/0xac  
<c0143536> generic_file_read+0x98/0xac
Nov  8 14:01:17 localhost kernel:  <c012cd3b> autoremove_wake_function+0x0/0x2d  
<c015a45e> vfs_read+0x9f/0x13e
Nov  8 14:01:17 localhost kernel:  <c015a8a7> sys_read+0x3c/0x63  <c0102af3> 
sysenter_past_esp+0x54/0x75
Nov  8 14:01:17 localhost kernel: BUG: warning at 
kernel/softirq.c:141/local_bh_enable()
Nov  8 14:01:17 localhost kernel:  <c012107f> local_bh_enable+0x25/0x64  
<c027363f> unix_release_sock+0x5c/0x1bf
Nov  8 14:01:17 localhost kernel:  <c021f706> sock_release+0x11/0x85  
<c021f9e4> sock_close+0x26/0x2a
Nov  8 14:01:17 localhost kernel:  <c015ab44> __fput+0x8d/0x16e  <c01584ea> 
filp_close+0x4e/0x54
Nov  8 14:01:17 localhost kernel:  <c011e259> put_files_struct+0x64/0xc0  
<c011f1c7> do_exit+0x19f/0x703
Nov  8 14:01:17 localhost kernel:  <c0114974> bust_spinlocks+0x3a/0x43  
<c0103eee> die+0x1d3/0x288
Nov  8 14:01:17 localhost kernel:  <c0103f7e> die+0x263/0x288  <c0114ea6> 
do_page_fault+0x441/0x526
Nov  8 14:01:17 localhost kernel:  <c0105026> do_IRQ+0x1e/0x24  <c0114a65> 
do_page_fault+0x0/0x526
Nov  8 14:01:17 localhost kernel:  <c01036f7> error_code+0x4f/0x54  <c0141339> 
find_get_page+0x1e/0x38
Nov  8 14:01:17 localhost kernel:  <c01419e5> do_generic_mapping_read+0x148/0x41f  
<c0142514> __generic_file_aio_read+0x16f/0x1b6
Nov  8 14:01:17 localhost kernel:  <c01411e3> file_read_actor+0x0/0xca  
<c014349e> generic_file_read+0x0/0xac
Nov  8 14:01:17 localhost kernel:  <c0143536> generic_file_read+0x98/0xac  
<c012cd3b> autoremove_wake_function+0x0/0x2d
Nov  8 14:01:17 localhost kernel:  <c015a45e> vfs_read+0x9f/0x13e  <c015a8a7> 
sys_read+0x3c/0x63
Nov  8 14:01:17 localhost kernel:  <c0102af3> sysenter_past_esp+0x54/0x75
Nov  8 14:01:17 localhost kernel: BUG: warning at 
kernel/softirq.c:141/local_bh_enable()
Nov  8 14:01:17 localhost kernel:  <c012107f> local_bh_enable+0x25/0x64  
<c0220c6d> lock_sock+0x85/0x8d
Nov  8 14:01:17 localhost kernel:  <c0222d0a> skb_dequeue+0x39/0x3f  <c021ea69> 
sock_fasync+0x5c/0x111
Nov  8 14:01:17 localhost kernel:  <c021f9dc> sock_close+0x1e/0x2a  <c015ab44> 
__fput+0x8d/0x16e
Nov  8 14:01:17 localhost kernel:  <c01584ea> filp_close+0x4e/0x54  <c011e259> 
put_files_struct+0x64/0xc0
Nov  8 14:01:17 localhost kernel:  <c011f1c7> do_exit+0x19f/0x703  <c0114974> 
bust_spinlocks+0x3a/0x43
Nov  8 14:01:17 localhost kernel:  <c0103eee> die+0x1d3/0x288  <c0103f7e> 
die+0x263/0x288
Nov  8 14:01:17 localhost kernel:  <c0114ea6> do_page_fault+0x441/0x526  
<c0105026> do_IRQ+0x1e/0x24
Nov  8 14:01:17 localhost kernel:  <c0114a65> do_page_fault+0x0/0x526  
<c01036f7> error_code+0x4f/0x54
Nov  8 14:01:17 localhost kernel:  <c0141339> find_get_page+0x1e/0x38  
<c01419e5> do_generic_mapping_read+0x148/0x41f
Nov  8 14:01:17 localhost kernel:  <c0142514> __generic_file_aio_read+0x16f/0x1b6  
<c01411e3> file_read_actor+0x0/0xca
Nov  8 14:01:17 localhost kernel:  <c014349e> generic_file_read+0x0/0xac  
<c0143536> generic_file_read+0x98/0xac
Nov  8 14:01:17 localhost kernel:  <c012cd3b> autoremove_wake_function+0x0/0x2d  
<c015a45e> vfs_read+0x9f/0x13e
Nov  8 14:01:17 localhost kernel:  <c015a8a7> sys_read+0x3c/0x63  <c0102af3> 
sysenter_past_esp+0x54/0x75
Nov  8 14:01:17 localhost kernel: BUG: warning at 
kernel/softirq.c:141/local_bh_enable()
Nov  8 14:01:17 localhost kernel:  <c012107f> local_bh_enable+0x25/0x64  
<c021eb12> sock_fasync+0x105/0x111
Nov  8 14:01:17 localhost kernel:  <c021f9dc> sock_close+0x1e/0x2a  <c015ab44> 
__fput+0x8d/0x16e
Nov  8 14:01:17 localhost kernel:  <c01584ea> filp_close+0x4e/0x54  <c011e259> 
put_files_struct+0x64/0xc0
Nov  8 14:01:17 localhost kernel:  <c011f1c7> do_exit+0x19f/0x703  <c0114974> 
bust_spinlocks+0x3a/0x43
Nov  8 14:01:17 localhost kernel:  <c0103eee> die+0x1d3/0x288  <c0103f7e> 
die+0x263/0x288
Nov  8 14:01:17 localhost kernel:  <c0114ea6> do_page_fault+0x441/0x526  
<c0105026> do_IRQ+0x1e/0x24
Nov  8 14:01:17 localhost kernel:  <c0114a65> do_page_fault+0x0/0x526  
<c01036f7> error_code+0x4f/0x54
Nov  8 14:01:17 localhost kernel:  <c0141339> find_get_page+0x1e/0x38  
<c01419e5> do_generic_mapping_read+0x148/0x41f
Nov  8 14:01:17 localhost kernel:  <c0142514> __generic_file_aio_read+0x16f/0x1b6  
<c01411e3> file_read_actor+0x0/0xca
Nov  8 14:01:17 localhost kernel:  <c014349e> generic_file_read+0x0/0xac  
<c0143536> generic_file_read+0x98/0xac
Nov  8 14:01:17 localhost kernel:  <c012cd3b> autoremove_wake_function+0x0/0x2d  
<c015a45e> vfs_read+0x9f/0x13e
Nov  8 14:01:17 localhost kernel:  <c015a8a7> sys_read+0x3c/0x63  <c0102af3> 
sysenter_past_esp+0x54/0x75
Nov  8 14:01:17 localhost kernel: BUG: warning at 
kernel/softirq.c:141/local_bh_enable()
Nov  8 14:01:17 localhost kernel:  <c012107f> local_bh_enable+0x25/0x64  
<c027363f> unix_release_sock+0x5c/0x1bf
Nov  8 14:01:17 localhost kernel:  <c021f706> sock_release+0x11/0x85  
<c021f9e4> sock_close+0x26/0x2a
Nov  8 14:01:17 localhost kernel:  <c015ab44> __fput+0x8d/0x16e  <c01584ea> 
filp_close+0x4e/0x54
Nov  8 14:01:17 localhost kernel:  <c011e259> put_files_struct+0x64/0xc0  
<c011f1c7> do_exit+0x19f/0x703
Nov  8 14:01:17 localhost kernel:  <c0114974> bust_spinlocks+0x3a/0x43  
<c0103eee> die+0x1d3/0x288
Nov  8 14:01:17 localhost kernel:  <c0103f7e> die+0x263/0x288  <c0114ea6> 
do_page_fault+0x441/0x526
Nov  8 14:01:17 localhost kernel:  <c0105026> do_IRQ+0x1e/0x24  <c0114a65> 
do_page_fault+0x0/0x526
Nov  8 14:01:17 localhost kernel:  <c01036f7> error_code+0x4f/0x54  <c0141339> 
find_get_page+0x1e/0x38
Nov  8 14:01:17 localhost kernel:  <c01419e5> do_generic_mapping_read+0x148/0x41f  
<c0142514> __generic_file_aio_read+0x16f/0x1b6
Nov  8 14:01:17 localhost kernel:  <c01411e3> file_read_actor+0x0/0xca  
<c014349e> generic_file_read+0x0/0xac
Nov  8 14:01:17 localhost kernel:  <c0143536> generic_file_read+0x98/0xac  
<c012cd3b> autoremove_wake_function+0x0/0x2d
Nov  8 14:01:17 localhost kernel:  <c015a45e> vfs_read+0x9f/0x13e  <c015a8a7> 
sys_read+0x3c/0x63
Nov  8 14:01:17 localhost kernel:  <c0102af3> sysenter_past_esp+0x54/0x75
Nov  8 14:02:26 localhost kernel: Kernel logging (proc) stopped.
Nov  8 14:02:26 localhost kernel: Kernel log daemon terminating.
-----------------------------------------------------------------------------------------------------------------

Alcune note interessanti. Per stressare il sistema (visto che al momento 
_voglio_
capire il problema) ieri sera avevo lanciato l'upload di tre ISO da 700 Mb verso
un'altro server, raggiungibile su internet al 30Kb/sec (circa 20 ore di upload).
Avevo notato da tempo che i crash erano in concomitanza di carichi di rete 
maggiori
della norma e infatti oggi, a a circa due terzi dell'upload e' successo il 
crash.
Al momento avevo anche alcuni ssh sul PC in questione con poco traffico.

Come appare dai log, il processo che e' morto e' rsync (che ho usato per fare
l'upoload descritto prima).

Provo a cambiare kernel. Che mi suggerite? Al momento e' 2.6.17-2-vserver-k7
e potrei tornare a un 2.6.17-2-386. Non vedo altre versioni di linux-image-*
sulla debian testing...

Emanuele


--
Per iscriversi  (o disiscriversi), basta spedire un  messaggio con OGGETTO
"subscribe" (o "unsubscribe") a mailto:linuxtrent-request@xxxxxxxxxxxxx


Other related posts: