- Realtime video understanding
- Live speech conversation
- Multimodal streaming
- Voice customization
Gemini Multimodal Live (Coming Soon)
Google’s multimodal live streaming model (Coming Soon)
Gemini Multimodal Live is Google’s model for realtime multimodal interactions. It supports: