Issues with this error - freenas kernel: sonewconn: pcb 0xfffff801c8f631d0: Listen queue overflow

Noctrin

Dabbler
Joined
Dec 27, 2017
Messages
11
Feb 27 17:07:09 freenas kernel: sonewconn: pcb 0xfffff801c8f631d0: Listen queue overflow: 151 already in queue awaiting acceptance (12 occurrences)
Feb 27 17:08:29 freenas kernel: sonewconn: pcb 0xfffff801c8f631d0: Listen queue overflow: 151 already in queue awaiting acceptance (2 occurrences)
Feb 27 17:09:58 freenas kernel: sonewconn: pcb 0xfffff801c8f631d0: Listen queue overflow: 151 already in queue awaiting acceptance (3 occurrences)
...

I've been seeing this for a while, since 11.1 I think. It goes away after a server restart and stays away for a few weeks. I have tried looking at incoming connections and I don't see anything out of the ordinary, but I also don't fully understand how to debug this properly. I googled furiously and followed a number of threads on similar issues, but I cannot get to the bottom of it. Here's hoping someone can help me with this.

Hardware:
Dell R510
2 x 8 core Intel
32 GB RAM
1 NIC (onboard) connected to the network (2nd port available)
1 VM running Ubuntu with ZoneMinder, Nextcloud and Pi-hole; other than Nextcloud, nothing is open to the outside.
4 jails: 1 for Plex (open), 1 for Radarr, Sonarr, etc., 1 for Grafana and a few other monitoring tools, and 1 for Deluge

FreeNAS 11.2 (Stable)

Probably running a few other things as well.

I can't say I see any errors or abnormal behavior when these messages start, and I can't find a trigger for them. They seem to start at random and will not go away, even if I start turning off the VM or the jails. After a reboot all is well for a few weeks.

Any help appreciated, thank you!
 

Chris Moore

Hall of Famer
Joined
May 2, 2015
Messages
10,080

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
I get that error on a system with 128GB of RAM.

I get it fairly regularly, but can't put it together with anything that's an actual problem.

I tried in the past in another thread to work on tunables to increase the queue length, but even massive increases there didn't make a difference at all.

I stopped worrying about it.
 

Noctrin

Dabbler
Joined
Dec 27, 2017
Messages
11
I'm replacing my Dell R510 with a ThinkServer RD450. Once I do, if the error persists, I can at least say with decent confidence that the issue is software related.
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
I already saw the problem on a completely different mobo and NIC combination, so I can already confirm it's something to do with software.
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
Oops, I posted a response to the wrong thread here... sorry, have removed it now.
 

Noctrin

Dabbler
Joined
Dec 27, 2017
Messages
11
Did you manage to find the device ID causing the queue overflow? In my case, I cannot find "0xfffff801c8f631d0" listed anywhere. I assume it's virtual, but I have no clue where it comes from.

The other thing I can think of is that we figure out what we're both running and find the overlap; maybe that will point something out. I'm fairly out of ideas, but as you said, it doesn't seem to cause any problems for me either, so I might just forget about it.
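As a rough sketch of how the address in the message could be traced: the value after "pcb" is a kernel address of the socket's protocol control block, not a device ID. FreeBSD's netstat prints control block addresses with -A, and sockstat -l lists listening sockets with their owning processes. The helper names below are made up for illustration, and the address column netstat shows is not guaranteed to be the same structure the kernel logged, so an empty result does not prove the socket is gone.

```shell
#!/bin/sh
# Hypothetical helpers; only netstat(1), grep(1) and sed(1) are real commands.

# Read sonewconn log lines on stdin and print the pcb address from each.
pcb_from_log() {
    sed -n 's/.*sonewconn: pcb \(0x[0-9a-fA-F]*\):.*/\1/p'
}

# Search the open sockets for a control block address (FreeBSD netstat).
# netstat -A prints addresses without the 0x prefix, so strip it first.
find_socket_by_pcb() {
    netstat -Aan | grep -i "${1#0x}"
}
```

Possible usage on the FreeNAS host: `grep sonewconn /var/log/messages | pcb_from_log | sort -u` to collect the addresses, then `find_socket_by_pcb 0xfffff801c8f631d0`, and `sockstat -l` to map the port or socket path that turns up to a process.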
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
No, I could never find a still-running process to match to the error.
 

Dudleydogg

Explorer
Joined
Aug 30, 2014
Messages
50
The error message is caused by ZoneMinder. Did you guys ever resolve this? It's kind of a pain to restart the plugin on a daily basis.
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
I have it on a box that doesn't run that plugin. What makes you think it's the cause?

The error doesn't seem to align with any actual problem in system performance for me.
 

GEOptic

Dabbler
Joined
Apr 18, 2018
Messages
42
The "freenas kernel: sonewconn" messages are flooding here as well, on 11.2-U6.
But I had no issues for weeks after I went from 11.1-U7 to 11.2-U6.
 

anodos

Sambassador
iXsystems
Joined
Mar 6, 2014
Messages
9,554
Are you using Samba? I think in something like 11.1-U7 I added the ability to modify Samba's socket listen backlog by changing the sysctl kern.ipc.soacceptqueue and restarting Samba. The default listen backlog in Samba was 50 (for smbd processes). In 11.1-U7 the change to use the sysctl value bumped it to 128. Generally, bumping up this value may help with handling sudden bursts of connections, but if the server is totally overloaded you may end up just shifting the problem somewhere else.
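To relate these backlog values to the count in the log message, here is a small sketch. It assumes the kernel starts logging once the accept queue grows past 1.5 times the socket's listen backlog (integer arithmetic). That appears to match sonewconn in FreeBSD 11's uipc_socket.c, but treat the factor as an assumption and check it against your kernel source. The function name is made up for illustration.

```shell
#!/bin/sh
# Assumption: overflow is logged when queue length > 3 * backlog / 2.
# Prints the largest queue length that does NOT trigger the message.
overflow_threshold() {
    echo $(( 3 * $1 / 2 ))
}

overflow_threshold 50    # old smbd default backlog
overflow_threshold 100
overflow_threshold 128   # value used since the 11.1-U7 change
```

Under that assumption a backlog of 128 would typically be reported as "193 already in queue", while the "151" in the first post lines up with a backlog of 100. That would suggest the overflowing listener there is something other than an smbd using the 128 default.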
 

GEOptic

Dabbler
Joined
Apr 18, 2018
Messages
42
Indeed, it started after I began using an SMB share. But I did not make it publicly available...

Now I think it's the source of my "pools not found" problem. I changed permissions (wrongly on the pool, thinking it was on the SMB share), then reset the permissions from the command line with chown + setfacl, then removed the old unwanted share (but forgot to remove the periodic snapshot first). Et voilà... the pool disappeared.
The source, but not the fault... Now I'm trying to remove the periodic snapshot from the command line (the GUI only shows an endlessly spinning wheel).
Oh well... bummer... I had lots of other stuff to do...
 

G8One2

Patron
Joined
Jan 2, 2017
Messages
248
I get these errors if I restart the Plex jail while streams are playing. I think it's because the buffer in Plex runs ahead of the stream, so restarting the jail mid-stream resets the buffer, while Plex can keep playing the stream until it runs out of what it already buffered from the server. If that's the case, stopping all streams and then restarting the jail or plugin usually stops the error for me. That doesn't mean it's your issue, but it might be worth a try.
 

GEOptic

Dabbler
Joined
Apr 18, 2018
Messages
42
Thanks for your input G8One2 !!! ;)

We have no jails or plugins at all!!!
I think the UI has some issues reading the configuration files, but I don't know where to double-check this.
It must be related to SMB/AD permissions, but then I'm not that sure...
Since the UI won't let me do anything, I'm trying from the command line! :)
No solution found yet, and still a lot of sonewconn queue overflow messages...
 

anodos

Sambassador
iXsystems
Joined
Mar 6, 2014
Messages
9,554
Try increasing the value of the sysctl I mentioned: sysctl kern.ipc.soacceptqueue=512. Once you do this, you will need to restart Samba: service samba_server onerestart
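The two steps above could be wrapped as follows. This is only a sketch: the function name and the DRY_RUN switch are made up for illustration, while the sysctl and service invocations are the ones quoted above. Run it as root from the FreeNAS shell.

```shell
#!/bin/sh
# Hypothetical wrapper around the two commands from the post above.
# Set DRY_RUN=1 to print the commands instead of executing them.
set_accept_queue() {
    case "$1" in
        ''|*[!0-9]*)
            echo "usage: set_accept_queue <positive integer>" >&2
            return 1
            ;;
    esac
    run=${DRY_RUN:+echo}
    # Raise the system-wide limit, then restart Samba so smbd
    # re-creates its listening sockets with the new backlog.
    $run sysctl kern.ipc.soacceptqueue="$1" &&
    $run service samba_server onerestart
}
```

For example, `DRY_RUN=1 set_accept_queue 512` prints the two commands without running them. A value set this way does not survive a reboot; on FreeNAS a persistent setting would normally be added as a sysctl-type entry under System, Tunables.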
 
Joined
Jan 18, 2017
Messages
525
I have this recurring nightmare as well: one jail with a Java applet running now does this once a month. I kept raising the number and stopped at 4096. I can handle restarting the jail once a month, but I would really like to figure out how to solve this....
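Before raising the limit further, it may help to check whether the listener actually picks up the new value. On FreeBSD, netstat -Lan prints each listening socket's queue as qlen/incqlen/maxqlen. The filter below is a sketch that assumes that three-field format, and its name is made up for illustration. It prints the sockets whose accept queue is at or above its limit.

```shell
#!/bin/sh
# Hypothetical filter for `netstat -Lan` output (FreeBSD format assumed):
#   Proto Listen(qlen/incqlen/maxqlen) Local-Address
full_listen_queues() {
    awk '$2 ~ /^[0-9]+\/[0-9]+\/[0-9]+$/ {
        split($2, q, "/")
        # Report only non-empty queues that have reached their limit.
        if (q[1] + 0 > 0 && q[1] + 0 >= q[3] + 0)
            print $1, $3, "qlen=" q[1], "maxqlen=" q[3]
    }'
}
```

Possible usage: `netstat -Lan | full_listen_queues`. If the maxqlen shown for the jail's port stays the same after the sysctl is raised and the service restarted, the application is probably asking for a smaller backlog itself, and a larger system-wide limit would not be expected to help.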
 