<div dir="ltr">Hi Răzvan, It was happening at least once a day. It started to happen when we reach ~80 registered terminals, not a big load. Now I'm using the event_rabbitmq module with 400 terminals and it's working fine.<div><br></div><div>The event is a custom one, I called "E_REGISTERED", it's just to notify an external process that a particular terminal has registered, this is part of the configuration that crashes:<div><div><br></div><div><div><i>startup_route {</i></div><div><i> subscribe_event("E_REGISTERED", "xmlrpc:10.10.11.2:10080:OSEvent");</i></div><div><i>}</i></div><div><i>...</i></div><div><i>...</i></div><div><i>route {</i></div><div><i>...</i></div><div><i>...</i></div><div><i> if (is_method("REGISTER")) {<br></i></div><div><div><i>...</i></div><div><i>...</i></div><div><i> $avp(attr-name) = "username";<br></i></div><div><i> $avp(attr-val) = $tU;</i></div><div><i> $avp(attr-name) = "domain";</i></div><div><i> $avp(attr-val) = $td;</i></div><div><i> raise_event("E_REGISTERED", $avp(attr-name), $avp(attr-val));</i></div></div><div><i>...</i></div><div><i>...</i></div><div><i>}</i></div><div class="gmail_extra"><div><div class="gmail_signature"><div dir="ltr"><br></div></div></div>
<div class="gmail_extra">Another thing: when I compiled with DBG_QM_MALLOC instead of F_MALLOC to debug, I didn't have any crashes for about 5 days. Maybe I should have waited more time to confirm, but it seems that the first memory manager solved the issue.</div><div class="gmail_extra"><br></div><div class="gmail_extra">Regarding to the core files, it seems than some module (even_xmlrpc for me..) it's freeing memory that it should not. After this issue I realized that the module was in beta, so I moved to the rabbitmq</div><div class="gmail_extra"><br></div><div class="gmail_extra">Thanks for your reply</div><div class="gmail_extra">Federico</div><div class="gmail_extra"><br></div><br><div class="gmail_quote">On Fri, May 8, 2015 at 7:47 AM, Răzvan Crainea <span dir="ltr"><<a href="mailto:razvan@opensips.org" target="_blank">razvan@opensips.org</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">
<div text="#000000" bgcolor="#FFFFFF">
<tt>Hi, Federico!<br>
<br>
Is this easily replicating, or it happens once in a while? Also,
what events are you raising?<br>
<br>
Best regards,<br>
</tt>
<pre cols="72">Răzvan Crainea
OpenSIPS Solutions
<a href="http://www.opensips-solutions.com" target="_blank">www.opensips-solutions.com</a></pre><div><div class="h5">
<div>On 04/24/2015 05:44 PM, Federico Edorna
wrote:<br>
</div>
</div></div><blockquote type="cite"><div><div class="h5">
<div dir="ltr">
<div class="gmail_extra">Just in case somebody deal with the
same issue, the problem seems to be event_xmlrpc module. I
tried with the event_datagram to notify the external process
and I got no more crashes for a couple of weeks.</div>
<div class="gmail_extra">Now I'm using event_rabbit module
instead of datagram without problems for a couple of days.</div>
<div class="gmail_extra"><br>
</div>
<div class="gmail_extra">
<br>
<div class="gmail_quote">On Mon, Apr 6, 2015 at 4:46 PM,
Federico Edorna <span dir="ltr"><<a href="mailto:fedorna@anura.com.ar" target="_blank">fedorna@anura.com.ar</a>></span>
wrote:<br>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">
<div dir="ltr">
<div class="gmail_extra"><span><span style="font-size:12.8000001907349px">Hello, I'm
getting core dumps in version 1.11.3.</span></span>
<div><span style="font-size:12.8000001907349px">Unlike
other opensips we are running without problems,
we're using some extra modules in this config
because opensips needs to notify an external
process (via event_xmlrpc) when a terminal
registers, and that external process afterwards
sends opensips (via mi_datagram/t_uac_dlg) a MWI
NOTIFY for the terminal.</span>
<div><br>
<div style="font-size:12.8000001907349px">I'm
pasting 3 backtraces (commit cbaf569, but it
happened for previous commits too)</div>
<div style="font-size:12.8000001907349px"><br>
</div>
<div><span style="font-size:12.8000001907349px"><a href="http://pastebin.com/xZ2zqJ0F" target="_blank">http://pastebin.com/xZ2zqJ0F</a></span><br>
</div>
<div><span style="font-size:12.8000001907349px"><a href="http://pastebin.com/8DWhsMfK" target="_blank">http://pastebin.com/8DWhsMfK</a></span><br>
</div>
<div><span style="font-size:12.8000001907349px"><a href="http://pastebin.com/9ERCD3mZ" target="_blank">http://pastebin.com/9ERCD3mZ</a></span><br>
</div>
<div>
<div>
<div style="font-size:12.8000001907349px"><br>
</div>
<div style="font-size:12.8000001907349px">This
is what syslog shows:</div>
<div style="font-size:12.8000001907349px"><br>
</div>
<div style="font-size:12.8000001907349px">
<div>2015-04-03T13:45:16.228227-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[24272]:
CRITICAL:core:recv_all: 1st recv on 36
failed: Connection reset by peer</div>
<div>2015-04-03T13:45:16.228249-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[24272]:
CRITICAL:core:handle_tcp_child: read from
tcp child 0 (pid 24240, no 0) Connection
reset by</div>
<div> peer [104]</div>
<div>2015-04-03T13:45:16.228260-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[24272]:
CRITICAL:core:receive_fd: EOF on 38</div>
<div>2015-04-03T13:45:16.250712-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[24214]:
<a>INFO:core:handle_sigs</a>: child process 24240
exited by a signal 11</div>
<div>2015-04-03T13:45:16.250727-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[24214]:
<a>INFO:core:handle_sigs</a>: core was generated</div>
<div>2015-04-03T13:45:16.250735-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[24214]:
<a>INFO:core:handle_sigs</a>: terminating due to
SIGCHLD</div>
<div>2015-04-03T13:45:16.250800-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[24270]:
<a>INFO:core:sig_usr</a>: signal 15 received</div>
</div>
<div style="font-size:12.8000001907349px"><br>
</div>
<div style="font-size:12.8000001907349px">----</div>
<div style="font-size:12.8000001907349px"><br>
</div>
<div style="font-size:12.8000001907349px">
<div>2015-04-03T13:54:48.179260-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[21747]:
CRITICAL:core:recv_all: 1st recv on 36
failed: Connection reset by peer</div>
<div>2015-04-03T13:54:48.179289-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[21747]:
CRITICAL:core:handle_tcp_child: read from
tcp child 0 (pid 21715, no 0) Connection
reset by</div>
<div> peer [104]</div>
<div>2015-04-03T13:54:48.179307-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[21747]:
CRITICAL:core:receive_fd: EOF on 38</div>
<div>2015-04-03T13:54:48.179373-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[21688]:
<a>INFO:core:handle_sigs</a>: child process 21715
exited by a signal 11</div>
<div>2015-04-03T13:54:48.179388-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[21688]:
<a>INFO:core:handle_sigs</a>: core was generated</div>
<div>2015-04-03T13:54:48.179402-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[21688]:
<a>INFO:core:handle_sigs</a>: terminating due to
SIGCHLD</div>
<div>2015-04-03T13:54:48.179417-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[21746]:
<a>INFO:core:sig_usr</a>: signal 15 received</div>
<div>2015-04-03T13:54:48.179426-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[21745]:
<a>INFO:core:sig_usr</a>: signal 15 received</div>
<div>2015-04-03T13:54:48.179435-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[21743]:
<a>INFO:core:sig_usr</a>: signal 15 received</div>
</div>
<div style="font-size:12.8000001907349px"><br>
</div>
<div style="font-size:12.8000001907349px">----</div>
<div style="font-size:12.8000001907349px"><br>
</div>
<div style="font-size:12.8000001907349px">
<div>2015-04-03T14:44:01.064875-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[31736]:
CRITICAL:core:recv_all: 1st recv on 36
failed: Connection reset by peer</div>
<div>2015-04-03T14:44:01.064898-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[31736]:
CRITICAL:core:handle_tcp_child: read from
tcp child 0 (pid 31704, no 0) Connection
reset by</div>
<div> peer [104]</div>
<div>2015-04-03T14:44:01.064922-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[31736]:
CRITICAL:core:receive_fd: EOF on 38</div>
<div>2015-04-03T14:44:01.064943-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[31678]:
<a>INFO:core:handle_sigs</a>: child process 31704
exited by a signal 11</div>
<div>2015-04-03T14:44:01.064954-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[31678]:
<a>INFO:core:handle_sigs</a>: core was generated</div>
<div>2015-04-03T14:44:01.064963-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[31678]:
<a>INFO:core:handle_sigs</a>: terminating due to
SIGCHLD</div>
<div>2015-04-03T14:44:01.066539-03:00
bermeja
/home/gc/local/opensips/sbin/opensips[31736]:
<a>INFO:core:sig_usr</a>: signal 15 received</div>
</div>
<div style="font-size:12.8000001907349px"><br>
</div>
<div style="font-size:12.8000001907349px">Thanks
in advance</div>
</div>
</div>
</div>
<span><font color="#888888">
<div style="font-size:12.8000001907349px">Federico</div>
</font></span></div>
</div>
</div>
</blockquote>
</div>
<br>
</div>
</div>
<br>
<fieldset></fieldset>
<br>
</div></div><pre>_______________________________________________
Users mailing list
<a href="mailto:Users@lists.opensips.org" target="_blank">Users@lists.opensips.org</a>
<a href="http://lists.opensips.org/cgi-bin/mailman/listinfo/users" target="_blank">http://lists.opensips.org/cgi-bin/mailman/listinfo/users</a>
</pre>
</blockquote>
<br>
</div>
<br>_______________________________________________<br>
Users mailing list<br>
<a href="mailto:Users@lists.opensips.org">Users@lists.opensips.org</a><br>
<a href="http://lists.opensips.org/cgi-bin/mailman/listinfo/users" target="_blank">http://lists.opensips.org/cgi-bin/mailman/listinfo/users</a><br>
<br></blockquote></div><br></div></div></div></div></div>