
Will write()s larger than PIPE_BUF_MAX be interleaved? #1

Closed
jstorimer opened this issue Apr 8, 2013 · 9 comments

Comments

@jstorimer

This implementation is actually identical to Ruby's Logger, but without the mutex. So I wondered why Logger used a mutex there.

A call to write() is atomic, but AFAIK only for writes up to PIPE_BUF_MAX. I couldn't reproduce it, but technically, if you're logging something bigger than that, it could interleave with what other threads are writing.

On my system this value is 512 bytes (ulimit reports the pipe size in 512-byte blocks, so the 1 below means one block):

$ ulimit -a | grep pipe
pipe size (512 bytes, -p) 1
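
Here's the rough shape of the test I had in mind (just a throwaway script, not part of mono_logger; the chunk size and counts are arbitrary): two threads each push chunks much larger than PIPE_BUF into one pipe, and a reader checks whether any chunk came through torn.

# Throwaway sketch: try to provoke interleaving by writing chunks much larger
# than PIPE_BUF from two threads into the same pipe.
reader, writer = IO.pipe
chunk = 100_000 # well above PIPE_BUF on typical systems

writers = %w[a b].map do |char|
  Thread.new { 10.times { writer.syswrite(char * chunk) } }
end

collector = Thread.new do
  data = String.new
  begin
    data << reader.readpartial(65_536) while true
  rescue EOFError
  end
  data
end

writers.each(&:join)
writer.close

# If every write landed intact, every run of identical characters is a
# multiple of the chunk size; a shorter run means a write was torn.
torn = collector.value.scan(/a+|b+/).any? { |run| run.length % chunk != 0 }
puts torn ? "writes interleaved" : "no interleaving observed"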

@jstorimer
Author

I looked around and found this, which suggests that this is only an issue when writing to a pipe.

Can anyone with a longer unix beard than me comment on this? Why else would Logger need a mutex?

@jacknagel

My unix beard is certainly not longer than yours, but from my reading, it appears that POSIX only guarantees atomicity when dealing with pipes or FIFOs, and does not say anything about regular files. It would seem there are no guarantees wrt atomicity when writing to files, and even if there were, there are a lot of filesystems out there that do not fully conform to POSIX anyway.

@jstorimer
Author

@jacknagel My research pointed in the opposite direction. It seems that MonoLogger (and Logger for that matter) are doing the right thing wrt atomically appending to files.

I'll close this for now until something more concrete comes up on the other side.

@jacknagel

The syscall itself may be atomic, but I don't think that has anything to do with whether the write of the data is actually atomic. write(2) can be interrupted by a signal, in which case it returns the number of bytes written, which may or may not be the number of bytes we tried to write. One must then call write again with the correct number of bytes and offset into the data in order to complete the operation, and there are ample opportunities in this sequence for interleaving to occur.
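
Roughly, the retry a caller ends up doing looks like this (write_fully is a made-up name; this is just the general pattern, not anything mono_logger actually does):

# Sketch of the partial-write retry described above.
def write_fully(io, data)
  offset = 0
  while offset < data.bytesize
    offset += io.syswrite(data.byteslice(offset, data.bytesize - offset))
    # another thread's write can land right here, between our two syscalls
  end
end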

I'd love to be wrong about this, but it seems that any stronger guarantees are getting into ACID territory.

Really, as long as this library is using IO#write, discussing this may be pointless. There are potentially two layers of buffering (Ruby's internal buffering, and the libc functions that Ruby wraps) here. I think one has to use IO#syswrite to get anywhere close to any guarantees that write(2) may or may not provide. By doing so you lose the benefits of buffering, of course, so perhaps that is why the stdlib logger was using a mutex in the first place.

@jstorimer
Author

I forgot about the signal interaction; that could definitely cause issues. This alone warrants the use of a mutex.

Ruby's buffering is out of the question here. IO#sync is set to true on the open file, so Ruby's buffering is skipped (which is equivalent to using IO#syswrite). I don't know enough about the lower layers of buffering to know if write order will be preserved. I would think that by the time the write(2) function returns that the data has occupied its proper place in the kernel buffer, but I don't know.
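
Concretely, the setup I mean amounts to something like this (the filename is just a placeholder):

# Open the log device in append mode and turn off Ruby's userspace buffering,
# so each #write maps to (roughly) one write(2) on the fd.
log = File.open("app.log", "a")
log.sync = true
log.write("one log line\n") # comparable to log.syswrite at this point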

@steveklabnik
Owner

How the hell am I not watching my own repository? So sorry about this; because I never got an email, I never saw that this discussion was happening.


So, basically, yes, I just took this from OpsCode, who took this from the standard library. I am unsure if it's a 100% perfect solution, but I'm very interested in making it one.

We're using it in Resque and nobody has reported any problems so far. mono_logger has had 100k downloads without any real reported problems yet.

@mperham

mperham commented May 30, 2013

I've found mono_logger to not be thread-safe when used under heavy load with Sidekiq but could not figure out why.

See #2

@jstorimer
Author

Eric Wong got in touch after he saw my article and tried to set the record straight. I posted his messages on the usp.ruby mailing list at http://librelist.com/browser//usp.ruby/2013/6/5/o-append-atomicity/.

Relevant excerpts:

Signals can't interrupt writes to regular files on reasonable FSes.

As long as:

1) you're on a reasonably POSIX-compliant FS[1]
2) you have enough free space
3) your writes are reasonably-sized[2]
4) you're not exceeding RLIMIT_FSIZE

(maybe some other rare cases I've forgotten at this hour of the night)

Then O_APPEND is atomic and write-in-full for all reasonably-sized
writes to regular files.
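
A quick way to exercise that claim (a throwaway sketch, not from Eric's messages; the path, line size, and counts are arbitrary): several threads append whole lines to the same file opened in append mode, then we check that nothing came out torn or lost.

# Standalone sketch: hammer one file opened in append mode ("a" implies
# O_APPEND) from several threads and check that every line arrives intact.
path = "append-test.log"      # placeholder path
line = ("x" * 200) + "\n"     # a "reasonably-sized" write
File.write(path, "")          # start from an empty file

threads = 8.times.map do
  Thread.new do
    File.open(path, "a") do |f|
      f.sync = true
      1_000.times { f.write(line) }
    end
  end
end
threads.each(&:join)

lines = File.foreach(path).to_a
if lines.size == 8 * 1_000 && lines.all? { |l| l == line }
  puts "all lines intact"
else
  puts "lost or torn lines"
end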

@cbandy

cbandy commented May 8, 2014

Ruby's buffering is out of the question here. IO#sync is set to true on the open file, so Ruby's buffering is skipped (which is equivalent to using IO#syswrite).

I wanted to chime in that STDOUT has sync set to false.

% ruby -e 'puts STDOUT.sync.inspect'
false

% ruby -e 'puts STDERR.sync.inspect'
true

In fact, OpsCode has committed a change[1] to ensure that all log devices have sync set to true. Perhaps this discussion is related?

[1] chef/chef@b7d801c
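
For what it's worth, the shape of that fix is tiny (paraphrasing the idea, not the actual commit):

device = $stdout    # or whatever IO the log device wraps
device.sync = true  # now STDOUT behaves like STDERR does by default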
