2 docs tagged with "robotics"

Chapter 6 - Conversational Robotics: VLA & Multimodal AI

Chapter 6 explores conversational robotics, integrating GPT, Whisper speech recognition, and multimodal interactions to create Visual-Language-Action (VLA) pipelines.

Speech Recognition and Natural Language Understanding

Implementing robust voice interaction using OpenAI Whisper and local NLU techniques.