๐Ÿค– Multimodal Chatbot with Gemma 3n

This chatbot can process multiple types of input:

  • Text: Regular text messages
  • PDF: Extract and analyze document content
  • Audio: Transcribe speech to text (supports WAV, MP3, M4A, FLAC, recorded audio)

Setup: Enter your OpenRouter API key below to get started

๐ŸŽฏ How to Use Each Tab:

๐Ÿ’ฌ Text Chat: Simple text conversations with the AI

๐Ÿ“„ PDF Chat: Upload a PDF and ask questions about its content

๐ŸŽค Audio Chat: Upload or record audio files for transcription and analysis

  • Supports: WAV, MP3, M4A, FLAC, OGG formats for uploads
  • Recorded audio is processed directly from your microphone
  • Best results with clear speech and minimal background noise

๐ŸŒŸ Combined Chat: Use multiple input types together for comprehensive analysis

๐Ÿ”‘ Getting an API Key:

  1. Go to OpenRouter.ai
  2. Sign up for an account
  3. Navigate to the API Keys section
  4. Create a new API key
  5. Copy and paste it in the field above

โš ๏ธ Current Limitations:

  • Audio transcription requires internet connection for best results
  • Large files may take longer to process
  • Recorded audio quality depends on your microphone