Linux – How to check if a process is non-blocking in linux without using a stack tracer

blockingkernellinuxsocket

A multi-cpu server is running several processes. One process has a thread that should always be in a spinning state, using 100% of the CPU it's been assigned. My current method (besides asking the developer…) is using strace on the process which waits for information to arrive at it's open file descriptor and checks it continuously using recvfrom(2) where erno is set to EAGAIN and method is returning -1 when no packets are to be read from network socket.

I'm not comfortable stack tracing production set-ups, and it's a unwieldy way of determining this information at best. I was poking about proc(5) and thought that the value of the flags field in /proc/[pid]/fdinfo might be useful to check if that process was using a socket that called open(2) with the O_NONBLOCK mode.

I'm struggling to reverse engineer this value at the moment. I know it represents the bitwise OR of the file status and file mode. So I think I can check the source headers for the value of constants open(2) uses on that particular kernel and then bitwise OR them until I find a value that matched what's in fdinfo. That seems rather clunky, if anybody can validate the above method (I can't yet) or provide a more elegant solution I'd be much obliged.

I also know fnctl(2) can set a file descriptor to a non-blocking state, but am treating that equivalent to open for the moment

Best Answer

Yes, this is a valid way to check that the socket is non-blocking.

The value for a non-blocking socket is 04000, non-blocking sockets in /proc/<pid>/fdinfo are represented in octal.

You can validate this behaviour with python.

Python 2.7.5 (default, Feb 19 2014, 13:47:28) 
[GCC 4.8.2 20131212 (Red Hat 4.8.2-7)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> from socket import *
>>> import os
>>> from os import O_NONBLOCK
>>> s = socket(AF_INET, SOCK_STREAM)
>>> s.setblocking(0)
>>> print open("/proc/self/fdinfo/{0}".format(s.fileno())).read(4096)
pos:    0
flags:  04002

>>> if 04002 & O_NONBLOCK:
...   print "yes"
... else:
...   print "no"
... 
yes

So, now you know how, I must point out that your developer is doing it wrong. If non-blocking sockets are something they want to use, thats fine - however they should setup an epoll(2) on the socket and block on the poll instead.

The program gains nothing from read(2) on a non blocking socket that produces EAGAIN -- as a matter of fact, the result is worse because nearly all system calls are a preemption point where the kernel can context switch you anyway.

This developer is wasting power, CPU cycles that could be used for idling threads and is not actually gaining any benefits he/she things they are from doing it this way.

If the developer wants to be 'cache-line' friendly, pin his tasks to a particular CPU and be done with it.

Related Solutions

Does Mac OS X throttle the RATE of socket creation

So it turns out the Mac OS X ephemeral port range is fairly low.

Wikipedia informs me that IANA suggests 49152 to 65535 as "dynamic and/or private ports" while many Linux kernels use 32768 to 61000. OS X uses the IANA range. This means Linux has almost twice the available ephemeral ports. Since each closed socket goes through a TIME_WAIT state (that I didn't know about) the rate is just overwhelming my system.

How to fix?

sudo sysctl -w net.inet.ip.portrange.first=32768
sudo sysctl -w net.inet.ip.portrange.hifirst=32768

This will give about double the range.

(Thanks to Spiff who answered in more detail here: https://superuser.com/questions/145989/does-mac-os-x-throttle-the-rate-of-socket-creation)

Linux – VFS: file-max limit 1231582 reached

After a little more testing, I believe this to be an NFS server bug. When a process on an NFS client places a write lock on a file, the server reserves an open file handle (this may be the wrong terminology -- my apologies to any actual kernel gurus reading this). This would probably be OK if the server released the handle when the lock is released, but it apparently doesn't.

My original problem occurred with rrdtool. rrdtool opens a file for read/write, locks the file for writing, makes its changes, and exits. Each time I run rrdtool, the number of open files on the server increases by one. (Nitpicky detail -- the server actually allocates in chunks of 32, so it's more like "32 runs make 32 open file descriptors", but that's an insignificant detail in the long run)

I wrote a minimal test program to verify this behavior. Indeed, opening the file, locking it, then exiting is sufficient to trigger this. Explicitly releasing the lock before exiting does not help in any way. Opening the file without locking it does not trigger the problem.

So far, I still have not found a way to release the resources on the server, other than rebooting. Restarting the NFS service is insufficient, as noted above.

I still haven't tested NFS version 3. Perhaps it works better.

Anyway, thanks for trying. Hopefully my experiences can be of some help to someone else in the future.

One last update: J. Bruce Fields, one of the NFSv4 developers, has confirmed that this is a bug, and says it's limited to NFSv4. Apparently I was the first to report it. He's hoping to have a patch soon.

Remember, kids: When you find a bug, find the proper place to report it, and there's a good chance it'll get fixed. Hurray for open source. :-)

Best Answer

Related Solutions

Does Mac OS X throttle the RATE of socket creation

Linux – VFS: file-max limit 1231582 reached

Related Topic