[pskmail] Re: Aw: Re: Server stops working randomly

  • From: "Gunnar Bulukin" <gunnar@xxxxxxxxxx>
  • To: <pskmail@xxxxxxxxxxxxx>
  • Date: Fri, 13 Apr 2012 16:20:39 +0200

Fantastic!!!!
 
//gunnar

  _____  

From: pskmail-bounce@xxxxxxxxxxxxx [mailto:pskmail-bounce@xxxxxxxxxxxxx] On
Behalf Of Rein Couperus
Sent: Friday, April 13, 2012 4:15 PM
To: pskmail@xxxxxxxxxxxxx
Subject: [pskmail] Aw: Re: Server stops working randomly


I have probably found the error...

The sysread from fldigi hangs when it gets no data. Unfortunately there 

is no timeout, so I have to think of something else, at least there should
be an 

error message ("fldigi is dead Jim...")...


Still this error only occurs when the fldigi ARQ socket in disfunctional (I
HAD to use this word...).


Rein "I dont believe in friday the 13th..." PA0R


-- 
http://pa0r.blogspirit.com


Hi,

I have had this issue up tree to four times and the answer I get from Rein
is this:

>The server is freezing because the ARQ interface in fldigi crashed, and the

>server gets a buffer overflow when trying to write to fldigi.
>The messages you see are from the internet APRS backbone.
>Rein PA0R

So gentlemen, this is a fldigi problem and nothing else!?
I restarted my server yesterday at 13.15 UTC it was working last night until
30m went down now it is still running but 30m is not quite open yet so I can
not see if it gets any replies.

73 de Gunnar


-----Original Message-----
From: pskmail-bounce@xxxxxxxxxxxxx [mailto:pskmail-bounce@xxxxxxxxxxxxx] On
Behalf Of remi.chateauneu@xxxxxxxxx
Sent: Friday, April 13, 2012 1:50 AM
To: pskmail@xxxxxxxxxxxxx
Cc: John Douyere
Subject: [pskmail] Re: Server stops working randomly

Can you run it under gdb please ? It should not prevent execution during
days.

Thanks

R

Le 12.04.2012 23:53, John Douyere a écrit :
> Issue is as described before. Here is a log when the sever stops
responding:
>
> !. Server log
>
> ### Server v. Pskmail_server 1.5.1. (C) 2011 PA0R
> 05:01 UTC Apr-12-2012: Program start
> 05:01 UTC Apr-12-2012: Connected to netherlands.aprs2.net 
> <http://netherlands.aprs2.net> port 1314 POS=33S:150E Outside BigEar 
> geo area...
> BigEar serverport:10148
> BigEar not available
> Scanning: 3534500,3540000,7192000,7195500,14111000,
> Offset = 0 minute(s)
> Pskmodes:MFSK32,XXXX,XXXX,XXXX,XXXX,
> .
> 05:01 UTC Apr-12-2012:
> Listening to the radio
> initialized
> 05:01 UTC Apr-12-2012:
> Send>APRS-IS:VK2ETA-1>PSKAPR:@120501z3350.15SP15057.44E&PSKmail 1.5.1
> server
>
> DL8AH->SM0RWO
> DL8AH->SM0RWO
> 9A3ATZ->SM0RWO
> 9A3ATZ->SM0RWO
> DL9YCS-5->9A1CRA
> DL9YCS-5->9A1CRA
> DL9YCS-5->SM0RWO
> DL9YCS-5->SM0RWO
> DL9YCS-5->SM0RWO
> DL9YCS-5->OE5RTL
> DH5JF-8->9A1CRA
> DL9YCS-5->OE5RTL
> DL9YCS-5->OE5RTL
> OE3WGW-10->OE3WGW
> OE3WGW-10->OE3WGW
> OE3WGW-10->OE3WGW
> OE3WGW-10->OE3WGW
> OE3WGW-10->OE3WGW
> OE3WGW-10->OE3WGW
> OE3WGW-10->OE3WGW
> OE3WGW-10->OE3WGW
> OE3WGW-10->OE3WGW
> OE3WGW-10->OE3WGW
> OE3WGW-10->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> F1GHX->9A1CRA
> F1GHX->DK4XI-3
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> PA0R->PA0R-1
> PA0R->PA0R-1
> SP8DXZ->SM0RWO
> SP8DXZ->SM0RWO
> PA0R->PA0R-1
> SP8DXZ->SA6BQZ-1
> PA0R->PA0R-1
> DL9YCS-5->OE5RTL
> DH5JF-8->SA6BQZ-1
> DH5JF-8->OE5RTL
> DH5JF-8->SA6BQZ-1
> DH5JF-8->9A1CRA
> DF7AET->SM0RWO
> SP8DXZ->SA6BQZ-1
> OE3GRC->DK4XI-3
> OE3GRC->DK4XI-3
> OE3GRC->DK4XI-3
> SP8DXZ->SM0RWO
> OE3GRC->DK4XI-3
> OE3GRC->DK4XI-3
> SP8DXZ->SM0RWO
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> SP8DXZ->SM0RWO
> SP8DXZ->SM0RWO
> OE3WGW-8->OE3WGW
> OE3WGW-8->OE3WGW
> OE5RPP->SA6BQZ-1
> <End of Log>
>
> 2. Here is a copy of the TOP output:
>
> 4033 pts/1 S+ 0:01 /bin/bash ./web2email 120
> 4038 pts/2 Ss 0:00 bash
> 4133 ? S 0:00 [usbhid_resumer]
> 4140 ? S< 0:00 udevd --daemon
> 4141 ? S< 0:00 udevd --daemon
> 4174 pts/2 Sl+ 650:15 src/fldigi
> 4190 pts/3 Ss 0:00 bash
> 13204 tty1 Sl 0:12 /usr/lib/firefox-3.6.24/firefox-bin
> 13274 tty1 Sl 0:01 /usr/lib/firefox-3.6.24/plugin-container
> /usr/lib/fla
> 13281 pts/1 S+ 0:00 sleep 120
> 13292 pts/4 Ss 0:00 bash
> 13310 pts/4 R+ 0:00 ps ax
> 28329 pts/3 S+ 0:00 /bin/sh /usr/local/bin/pskmail_server
> 28330 pts/3 S+ 0:23 /usr/bin/perl -w?
> /usr/local/share/pskmail_server/rfl
> 28344 pts/3 S+ 0:10 /usr/bin/perl -w?
> /usr/local/share/pskmail_server/rfl
> 28345 pts/3 S+ 0:20 /usr/bin/perl -w?
> /usr/local/share/pskmail_server/rfl
> 28354 pts/3 S+ 0:32 /usr/bin/perl -w?
> /usr/local/share/pskmail_server/rfl
> 28355 pts/3 S+ 0:19 /usr/bin/perl -w?
> /usr/local/share/pskmail_server/rfl
> 28365 pts/3 S+ 4:51 /usr/bin/perl -w?
> /usr/local/share/pskmail_server/rfl
> jdouyere@Pskmail:~$
>
> 3. I noticed that I can use CTRL-C to stop the server. I have to do a 
> killall rflinkserver.pl <http://rflinkserver.pl> from another terminal 
> to stop the server
>
>
> 4. The server.log file contains the following (consistent with the 
> terminal display of the server):
>
>
> 05:01 UTC Apr-12-2012: Program start
> Program start: 05:01 UTC Apr-12-2012
> 05:01 UTC Apr-12-2012: Connected to netherlands.aprs2.net 
> <http://netherlands.aprs2.net> port 1314
> 05:01 UTC Apr-12-2012:
> Listening to the radio
> 05:01 UTC Apr-12-2012:
> Send>APRS-IS:VK2ETA-1>PSKAPR:@120501z3350.15SP15057.44E&PSKmail 1.5.1
> server
>
> 5. I checked the Fldigi log and there was no activity at all (not even
> unproto) on the server before it stopped responding.
>
> 6. Running 1.5.1 under Ubuntu 10.04 LTS
>
> A real mystery and a shame now that my Fldigi does not lock-up anymore.
> It happens randomly, sometimes after a week or very shortly like in 
> this case.
>
> Next week I will try to run the server under Puppy Linux to see if it 
> makes any difference.
>
> Regards,
>
> John





Other related posts: