It's a higher bandwidth signal. There's a lot of information conveyed in a person's face and posture. Seeing their reaction to what you say tells you more than just hearing their voice. It's also useful for avoiding collisions where everybody tries to talk at once, and even for identifying who clearly wants to say something, but may be hesitant to do so.