Multi-modal Interaction: Speech, Gesture, VisionSynthesizing audio and visual cues for intuitive human-robot communication.