A Python application that demonstrates Google's Gemini Live API capabilities with real-time audio, camera/screen capture, and text interaction.

Features:

- Real-time audio input/output
- Camera video feed integration
- Screen capture capability
- Text-based interaction
- Voice responses using Gemini's prebuilt Puck voice
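
The voice feature above corresponds to a session configuration in the `google-genai` SDK. The sketch below builds such a configuration as a plain dictionary. The helper name `build_live_config` is hypothetical, and the key names are assumed to mirror the fields of the SDK's `LiveConnectConfig` type, so they should be checked against the SDK version you install.

```python
def build_live_config(voice_name: str = "Puck") -> dict:
    """Return a Live API session config that requests audio replies.

    Assumption: the keys follow the google-genai SDK's LiveConnectConfig
    field names; "Puck" is the voice named in the feature list.
    """
    return {
        # Ask the model to answer with audio rather than text.
        "response_modalities": ["AUDIO"],
        "speech_config": {
            "voice_config": {
                "prebuilt_voice_config": {"voice_name": voice_name},
            },
        },
    }


config = build_live_config()
print(config["speech_config"]["voice_config"]["prebuilt_voice_config"]["voice_name"])
```

A dictionary like this can typically be passed as the `config` argument when opening a Live session through the SDK's `client.aio.live.connect(...)`. Which model name to use is not specified here.
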

Requirements:

- Python 3.9+
- Google Gemini API key

Installation:

- Clone the repository
- Install the required dependencies:
pip install google-genai opencv-python pyaudio pillow mss
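
After installing, it can help to confirm that each dependency is importable before running the application. The following is a minimal sketch using only the standard library. The function name `missing_packages` is hypothetical, and the mapping from pip package names to import names reflects the usual module names for these packages.

```python
import importlib.util

# pip package name -> module name used in `import` statements
REQUIRED = {
    "google-genai": "google.genai",
    "opencv-python": "cv2",
    "pyaudio": "pyaudio",
    "pillow": "PIL",
    "mss": "mss",
}


def missing_packages(required=REQUIRED):
    """Return the pip names of packages whose modules cannot be found."""
    missing = []
    for pip_name, module_name in required.items():
        try:
            found = importlib.util.find_spec(module_name) is not None
        except (ImportError, ValueError):
            # Raised when a parent package (e.g. `google`) is absent.
            found = False
        if not found:
            missing.append(pip_name)
    return missing


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages: pip install " + " ".join(missing))
    else:
        print("All dependencies are installed.")
```

The check only looks up module specs without importing them, so it does not open audio devices or cameras.
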