<html><body>
<p>Hi Bogdan,<br>
<br>
Thanks for explaining the child processes involved -- I misunderstood what was happening.<br>
<br>
Unfortunately, I don't have the core anymore. My recollection is that I couldn't print anything useful due to compiler optimization. That said, the crash should be easy to reproduce, and I'll capture more dumps the next time it happens.<br>
<br>
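For what it's worth, here is a minimal sketch of one way the line flagged in the trace could fault if the timer list were left in a bad state. The struct is a reduced, hypothetical stand-in for the tm timer link (only the two fields that appear in the snippet), and link_same_timeout is a name I made up for illustration; the real insert_timer_unsafe does more than this. If ptr->ld_tl were ever null when the two timeouts match, the last assignment would write through a null pointer, so the sketch puts a guard at that point. I have not confirmed that this is what actually happened in the core.<br>
<br>

```c
/* Hypothetical, reduced stand-in for the tm timer link: only the two
 * fields used by the snippet quoted in this thread are modelled. */
struct timer_link {
    unsigned long long time_out;
    struct timer_link *ld_tl;
};

/* Performs the same steps as the quoted snippet, plus a guard.
 * Returns 0 on success, or -1 when the timeouts match but ptr->ld_tl
 * is null, which is the case where the unguarded code would write
 * through a null pointer. */
int link_same_timeout(struct timer_link *tl, struct timer_link *ptr)
{
    if (tl->time_out == ptr->time_out) {
        if (ptr->ld_tl == 0)
            return -1;              /* unguarded code would fault below */
        tl->ld_tl = ptr->ld_tl;
        ptr->ld_tl = 0;
        tl->ld_tl->ld_tl = tl;      /* the line flagged in the trace */
    } else {
        tl->ld_tl = tl;
    }
    return 0;
}
```

<br>
A guard like this would only hide the symptom; if the list really is corrupt, the underlying cause is elsewhere.<br>
<br>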
Regards,<br>
Kennard<br>
<br>
<br>
<font size="2" color="#5F5F5F">From:        </font><font size="2">Bogdan-Andrei Iancu &lt;bogdan@voice-system.ro&gt;</font><br>
<font size="2" color="#5F5F5F">To:        </font><font size="2">OpenSIPS users mailling list &lt;users@lists.opensips.org&gt;</font><br>
<font size="2" color="#5F5F5F">Date:        </font><font size="2">10/05/2010 01:41 AM</font><br>
<font size="2" color="#5F5F5F">Subject:        </font><font size="2">Re: [OpenSIPS-Users] opensips tm timer core dump</font><br>
<font size="2" color="#5F5F5F">Sent by:        </font><font size="2">users-bounces@lists.opensips.org</font><br>
<hr width="100%" size="2" align="left" noshade style="color:#8091A5; "><br>
<br>
<br>
<tt>Hi Kennard,<br>
<br>
The core was generated by process 22255:<br>
[22238]: INFO:core:handle_sigs: child process 22255 exited by a <br>
signal 11<br>
<br>
and this process also reported mem problems:<br>
[22255]: ERROR:tm:new_t: out of mem<br>
<br>
Can you print the "tl" or "ptr" variables in frame 0?<br>
<br>
Regards,<br>
Bogdan<br>
<br>
Kennard_White@logitech.com wrote:<br>
><br>
> Running against opensips HEAD, I got a segfault in the tm timer code. <br>
> I believe this is triggered by running out of shared memory.<br>
><br>
><br>
> The stack trace:<br>
><br>
> (gdb) where<br>
> #0 0x00007fe8f8d96212 in insert_timer_unsafe (new_tl=0x7fe8f66337b0,<br>
> list_id=WT_TIMER_LIST, ext_timeout=&lt;value optimized out&gt;) at timer.c:731<br>
> #1 set_1timer (new_tl=0x7fe8f66337b0, list_id=WT_TIMER_LIST,<br>
> ext_timeout=&lt;value optimized out&gt;) at timer.c:904<br>
> #2 0x00007fe8f8d78ac8 in t_release_transaction (trans=0x7fe8f6633730)<br>
> at t_funcs.c:122<br>
> #3 0x00007fe8f8d808e5 in t_unref (p_msg=&lt;value optimized out&gt;)<br>
> at t_lookup.c:1152<br>
> #4 0x0000000000483ae5 in exec_post_req_cb ()<br>
> #5 0x000000000046c1e4 in receive_msg ()<br>
> #6 0x00000000004bc77c in udp_rcv_loop ()<br>
> #7 0x000000000042de9c in main ()<br>
><br>
> The offending code (I believe):<br>
> if (tl->time_out==ptr->time_out) {<br>
> tl->ld_tl = ptr->ld_tl;<br>
> ptr->ld_tl = 0;<br>
> tl->ld_tl->ld_tl = tl; &lt;-- SEG FAULT HERE (according to trace)<br>
> } else {<br>
> tl->ld_tl = tl;<br>
> }<br>
><br>
> Unfortunately, due to optimization I cannot dump anything useful, and <br>
> I'm not convinced the actual fault is on the line indicated. Note that <br>
> the core dump is not one of the processes that reported out of memory. <br>
> Maybe one of the other processes left the timer list in a corrupt state?<br>
><br>
> The log file:<br>
> Sep 29 11:43:36 org-sip01 /var/run/openser/opensips-pres[22255]: <br>
> ERROR:tm:sip_msg_cloner: no more share memory<br>
> Sep 29 11:43:36 org-sip01 /var/run/openser/opensips-pres[22255]: <br>
> ERROR:tm:new_t: out of mem<br>
> Sep 29 11:43:36 org-sip01 /var/run/openser/opensips-pres[22255]: <br>
> ERROR:tm:t_newtran: new_t failed<br>
> Sep 29 11:43:36 org-sip01 /var/run/openser/opensips-pres[22254]: <br>
> WARNING:core:fm_malloc: Not enough free memory, will atempt defragmenation<br>
> Sep 29 11:43:36 org-sip01 /var/run/openser/opensips-pres[22254]: <br>
> ERROR:tm:sip_msg_cloner: no more share memory<br>
> Sep 29 11:43:36 org-sip01 /var/run/openser/opensips-pres[22254]: <br>
> ERROR:tm:new_t: out of mem<br>
> Sep 29 11:43:36 org-sip01 /var/run/openser/opensips-pres[22254]: <br>
> ERROR:tm:t_newtran: new_t failed<br>
> Sep 29 11:43:36 org-sip01 /var/run/openser/opensips-pres[22238]: <br>
> INFO:core:handle_sigs: child process 22255 exited by a signal 11<br>
> Sep 29 11:43:36 org-sip01 /var/run/openser/opensips-pres[22238]: <br>
> INFO:core:handle_sigs: core was generated<br>
> Sep 29 11:43:36 org-sip01 /var/run/openser/opensips-pres[22238]: <br>
> INFO:core:handle_sigs: terminating due to SIGCHLD<br>
> Sep 29 11:43:36 org-sip01 /var/run/openser/opensips-pres[22256]: <br>
> INFO:core:sig_usr: signal 15 received<br>
><br>
> ------------------------------------------------------------------------<br>
><br>
> _______________________________________________<br>
> Users mailing list<br>
> Users@lists.opensips.org<br>
> </tt><tt><a href="http://lists.opensips.org/cgi-bin/mailman/listinfo/users">http://lists.opensips.org/cgi-bin/mailman/listinfo/users</a></tt><tt><br>
> <br>
<br>
<br>
-- <br>
Bogdan-Andrei Iancu<br>
OpenSIPS Bootcamp<br>
15 - 19 November 2010, Edison, New Jersey, USA<br>
</tt><tt>www.voice-system.ro</tt><tt><br>
<br>
<br>
</tt><br>
</body></html>