๐ค Multimodal Chatbot with Gemma 3n
This chatbot can process multiple types of input:
- Text: Regular text messages
- PDF: Extract and analyze document content
- Audio: Transcribe speech to text (supports WAV, MP3, M4A, FLAC, recorded audio)
Setup: Enter your OpenRouter API key below to get started
๐ฏ How to Use Each Tab:
๐ฌ Text Chat: Simple text conversations with the AI
๐ PDF Chat: Upload a PDF and ask questions about its content
๐ค Audio Chat: Upload or record audio files for transcription and analysis
- Supports: WAV, MP3, M4A, FLAC, OGG formats for uploads
- Recorded audio is processed directly from your microphone
- Best results with clear speech and minimal background noise
๐ Combined Chat: Use multiple input types together for comprehensive analysis
๐ Getting an API Key:
- Go to OpenRouter.ai
- Sign up for an account
- Navigate to the API Keys section
- Create a new API key
- Copy and paste it in the field above
โ ๏ธ Current Limitations:
- Audio transcription requires internet connection for best results
- Large files may take longer to process
- Recorded audio quality depends on your microphone