瀏覽代碼

[AF_UNIX]: Fix stream recvmsg() race.

A recv() on an AF_UNIX, SOCK_STREAM socket can race with a
send()+close() on the peer, causing recv() to return zero, even though
the sent data should be received.

This happens if the send() and the close() is performed between
skb_dequeue() and checking sk->sk_shutdown in unix_stream_recvmsg():

process A  skb_dequeue() returns NULL, there's no data in the socket queue
process B  new data is inserted onto the queue by unix_stream_sendmsg()
process B  sk->sk_shutdown is set to SHUTDOWN_MASK by unix_release_sock()
process A  sk->sk_shutdown is checked, unix_release_sock() returns zero

I'm surprised nobody noticed this, it's not hard to trigger.  Maybe
it's just (un)luck with the timing.

It's possible to work around this bug in userspace, by retrying the
recv() once in case of a zero return value.

Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: David S. Miller <davem@davemloft.net>
Miklos Szeredi 18 年之前
父節點
當前提交
3c0d2f3780
共有 1 個文件被更改,包括 10 次插入3 次删除
  1. 10 3
      net/unix/af_unix.c

+ 10 - 3
net/unix/af_unix.c

@@ -1744,20 +1744,23 @@ static int unix_stream_recvmsg(struct kiocb *iocb, struct socket *sock,
 		int chunk;
 		int chunk;
 		struct sk_buff *skb;
 		struct sk_buff *skb;
 
 
+		unix_state_lock(sk);
 		skb = skb_dequeue(&sk->sk_receive_queue);
 		skb = skb_dequeue(&sk->sk_receive_queue);
 		if (skb==NULL)
 		if (skb==NULL)
 		{
 		{
 			if (copied >= target)
 			if (copied >= target)
-				break;
+				goto unlock;
 
 
 			/*
 			/*
 			 *	POSIX 1003.1g mandates this order.
 			 *	POSIX 1003.1g mandates this order.
 			 */
 			 */
 
 
 			if ((err = sock_error(sk)) != 0)
 			if ((err = sock_error(sk)) != 0)
-				break;
+				goto unlock;
 			if (sk->sk_shutdown & RCV_SHUTDOWN)
 			if (sk->sk_shutdown & RCV_SHUTDOWN)
-				break;
+				goto unlock;
+
+			unix_state_unlock(sk);
 			err = -EAGAIN;
 			err = -EAGAIN;
 			if (!timeo)
 			if (!timeo)
 				break;
 				break;
@@ -1771,7 +1774,11 @@ static int unix_stream_recvmsg(struct kiocb *iocb, struct socket *sock,
 			}
 			}
 			mutex_lock(&u->readlock);
 			mutex_lock(&u->readlock);
 			continue;
 			continue;
+ unlock:
+			unix_state_unlock(sk);
+			break;
 		}
 		}
+		unix_state_unlock(sk);
 
 
 		if (check_creds) {
 		if (check_creds) {
 			/* Never glue messages from different writers */
 			/* Never glue messages from different writers */