pipe writes, ERESTARTSYS and SA_RESTART
Yann Droneaud
ydroneaud at opteya.com
Thu Aug 4 14:38:06 EDT 2022
Hi,
Le 04/08/2022 à 20:01, Viacheslav Biriukov a écrit :
>
> But what I can't understand is how and where the kernel modifies the
> arguments of the write system call and where it collects the return
> values of all these restarts, thus the userspace caller ultimately
> sees the correct number of written bytes.
>
> With strace I can see all that retries, for example:
>
> write(1, ""..., 33554431) = 65536
> write(1, ""..., 33488895) = ? ERESTARTSYS (To be
> restarted if SA_RESTART is set)
> write(1, ""..., 33488895) = ? ERESTARTSYS (To be
> restarted if SA_RESTART is set)
> write(1, ""..., 33488895) = ? ERESTARTSYS (To be
> restarted if SA_RESTART is set)
> write(1, ""..., 33488895) = 33488895
>
> Here there were 4 restarts (I sent 4 signals), 3 of them returned
> ERESTARTSYS and 2 were able to write to the pipe. Also for restarts
> strace shows the correct 3rd argument, which is decrementing.
>
> The caller in the userspace in the end sees that it was able to write
> 65536+33488895 bytes. Which is correct and what the man 7 pipe describes.
>
> My question is how and where it does that. I tried to dig in the
> kernel source code but can't find the place where this tracking occurs.
>
It doesn't. SA_RESTART is only meant to retry syscall that would have
returned EINTR.
In such case, there's no tracking to do, because nothing was actually
written, so the syscall can be restarted with the same parameters.
Regards.
--
Yann Droneaud
OPTEYA
More information about the Kernelnewbies
mailing list