Random issues in SIP profile (suspect SQLite DB corruption)
With 1.10.10, after operating for a few days, one of my SIP profiles will develop corruption. Initially it shows up as BLF "sticking" on SOME (and only SOME, not all) domains on that profile. BLF will correctly update to a ringing state with the caller ID, and it will stop blinking (ringing) correctly, but it will then stay stuck on "in use" indefinitely. The problem escalates until that profile suddenly stops responding to SIP requests (INVITE, OPTIONS, etc.), which immediately stops call processing and causes all devices on that profile to eventually show as unregistered. This only affects one profile (I have 4 running on this server).
So far, it appears to be random. It has occurred on two servers (both in a BDC cluster), and it does not show up immediately. Stopping FreeSWITCH, deleting the SQLite DBs, and restarting fixes it for a while, but the problem eventually recurs. No specific triggers have been found thus far.
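Since corruption is only suspected, one way to test that suspicion before deleting the files would be to run SQLite's built-in `PRAGMA integrity_check` against each database. Below is a minimal sketch of that idea, not a confirmed diagnostic procedure. The helper names (`check_sqlite_file`, `check_directory`) are made up for illustration, and the default directory `/var/lib/freeswitch/db` is an assumption about a typical package install; the actual location depends on how FreeSWITCH was built and configured.

```python
import sqlite3
import sys
from pathlib import Path


def check_sqlite_file(path):
    """Return the rows reported by PRAGMA integrity_check for one database file."""
    # Open read-only so the check itself cannot modify the database.
    uri = f"{Path(path).resolve().as_uri()}?mode=ro"
    conn = sqlite3.connect(uri, uri=True, timeout=5)
    try:
        # A healthy database yields a single row containing "ok".
        return [row[0] for row in conn.execute("PRAGMA integrity_check")]
    except sqlite3.DatabaseError as exc:
        # Raised, for example, when the file is not a readable SQLite database.
        return [f"error: {exc}"]
    finally:
        conn.close()


def check_directory(db_dir):
    """Map each *.db file name in db_dir to its integrity_check result."""
    return {p.name: check_sqlite_file(p) for p in sorted(Path(db_dir).glob("*.db"))}


if __name__ == "__main__":
    # ASSUMPTION: default path for a packaged install; pass your own as argv[1].
    db_dir = sys.argv[1] if len(sys.argv) > 1 else "/var/lib/freeswitch/db"
    for name, result in check_directory(db_dir).items():
        if result == ["ok"]:
            print(f"{name}: ok")
        else:
            print(f"{name}: PROBLEM {result}")
```

Running this right after stopping the service and before deleting the files would show whether any database is structurally damaged. A result of `ok` only rules out file-level corruption; it would not detect stale or inconsistent rows (for example, leftover subscription or dialog entries), so it narrows the suspicion rather than settling it.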
I am not a programmer, so I have no idea how to do any kind of debug traces. The servers in question are in production with a total of about 700 extensions subscribed, so any issues affecting performance have to be corrected immediately (restart, clear, etc); time taken to further troubleshoot is unacceptable, as it prolongs the service outage.
We have not been able to reproduce this on a lab server; only the production servers, under production traffic volumes, are affected.