All-Model-Chat by yeahhe365

Multimodal chatbot interface for Google Gemini API

Created 2 months ago
289 stars

Top 90.9% on SourcePulse

Project Summary

All Model Chat is a powerful, multimodal chat interface designed for seamless interaction with Google's Gemini API family. It offers a rich user experience for developers and AI enthusiasts, enabling advanced features like dynamic model selection, multimodal input, code execution, and voice interaction, all within a customizable web application.

How It Works

All Model Chat is built with React 19 and TypeScript on the frontend and uses the @google/genai SDK to talk to Gemini models. It supports several Gemini models (2.5 Pro, Flash, Flash Lite) as well as Imagen image generation. Key features include web search, code execution, URL context, adjustable AI parameters (Temperature, Top-P), and a "thought process" visualization for Gemini models. It also integrates speech-to-text and text-to-speech, along with a "Canvas Assistant" for generating frontend code such as ECharts and Graphviz diagrams.
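
To make the adjustable parameters concrete, here is a minimal sketch of how user settings might be turned into a request object. It does not call the SDK and is not taken from the project's source. The ChatSettings type, the buildRequest helper, the clamping ranges, and the model identifier are all assumptions for illustration, and the field names (temperature, topP, tools, thinkingConfig) only mirror what a Gemini request config is commonly understood to look like.

```typescript
// Hypothetical settings shape; names are illustrative, not the project's own.
interface ChatSettings {
  model: string;
  temperature: number; // assumed valid range: 0 to 2
  topP: number; // assumed valid range: 0 to 1
  useWebSearch: boolean;
  showThoughts: boolean;
}

// Assumed request config shape, modelled loosely on a Gemini request.
interface RequestConfig {
  temperature: number;
  topP: number;
  tools: Array<Record<string, object>>;
  thinkingConfig?: { includeThoughts: boolean };
}

// Keep a numeric setting inside its allowed range.
function clamp(value: number, min: number, max: number): number {
  return Math.min(max, Math.max(min, value));
}

// Build a plain request object from the user's settings and prompt.
function buildRequest(settings: ChatSettings, prompt: string) {
  const config: RequestConfig = {
    temperature: clamp(settings.temperature, 0, 2),
    topP: clamp(settings.topP, 0, 1),
    // Only attach the search tool when the user has enabled it.
    tools: settings.useWebSearch ? [{ googleSearch: {} }] : [],
  };
  if (settings.showThoughts) {
    config.thinkingConfig = { includeThoughts: true };
  }
  return { model: settings.model, contents: prompt, config };
}

// Example: out-of-range values are pulled back into range.
const exampleRequest = buildRequest(
  {
    model: "gemini-2.5-flash", // assumed model identifier
    temperature: 3,
    topP: -0.5,
    useWebSearch: true,
    showThoughts: false,
  },
  "Summarize this page."
);
console.log(JSON.stringify(exampleRequest, null, 2));
```

Clamping at the point where the request is built keeps a mistyped slider value from reaching the API, whatever the UI allows.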

Quick Start & Requirements

  • Access: The application is designed to be used directly in a web browser at all-model-chat.pages.dev.
  • API Key: A Google Gemini API key is required. Users can obtain this from Google AI Studio and input it into the application's settings. Multiple API keys can be entered, one per line, for key rotation.
  • Setup: No backend or installation is necessary. Setup involves accessing the URL and configuring the API key in the settings.
  • Documentation: Further details and features can be explored via the provided links in the README.
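
The one-key-per-line input described above could be handled along these lines. This is a sketch under assumptions, not the project's implementation: the summary does not say which rotation strategy is used, so simple round-robin is assumed here, and parseApiKeys and KeyRotator are hypothetical names.

```typescript
// Split the settings textarea into keys: one per line, blank lines ignored.
function parseApiKeys(raw: string): string[] {
  return raw
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line.length > 0);
}

// Round-robin rotation (an assumed strategy): each call returns the next key.
class KeyRotator {
  private readonly keys: string[];
  private index = 0;

  constructor(keys: string[]) {
    if (keys.length === 0) {
      throw new Error("At least one API key is required");
    }
    this.keys = keys;
  }

  next(): string {
    const key = this.keys[this.index];
    // Wrap around to the first key after the last one.
    this.index = (this.index + 1) % this.keys.length;
    return key;
  }
}

// Example with placeholder values, not real keys.
const rotator = new KeyRotator(parseApiKeys("KEY_A\n\n  KEY_B  \nKEY_C\n"));
const firstFour = [rotator.next(), rotator.next(), rotator.next(), rotator.next()];
console.log(firstFour.join(", "));
```

Round-robin spreads requests evenly across keys. A real implementation might instead skip keys that recently returned a quota error, but that is beyond what the summary describes.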

Highlighted Details

  • Multimodal Input: Supports image, video, audio, PDF, code, and text files with various upload methods (drag-and-drop, paste, file selector, camera, microphone).
  • Enhanced Chat Experience: Features persistent chat history in local storage, conversation grouping, scene management for custom prompts, message editing/deletion/retry, and export options (PNG, HTML, TXT).
  • Developer Tools: Includes a built-in log viewer for inspecting application behavior, API calls, and API key usage, aiding in debugging and understanding.
  • Customization: Offers extensive customization options, including adjustable AI parameters, custom system prompts, and theming via CSS variables.
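
As an illustration of the persistent chat history mentioned above, the following sketch stores sessions as JSON behind a small storage interface. The storage key, the ChatSession shape, and the function names are hypothetical. In a browser, window.localStorage satisfies the same getItem/setItem interface; an in-memory stand-in is used here so the example also runs outside a browser.

```typescript
// Minimal subset of the Web Storage API that the sketch relies on.
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

// Hypothetical data shapes; the project's real schema may differ.
interface ChatMessage {
  role: "user" | "model";
  text: string;
}

interface ChatSession {
  id: string;
  title: string;
  messages: ChatMessage[];
}

const HISTORY_KEY = "chat-history"; // hypothetical storage key

// In-memory stand-in for localStorage, for use outside a browser.
class MemoryStore implements KeyValueStore {
  private data = new Map<string, string>();

  getItem(key: string): string | null {
    return this.data.get(key) ?? null;
  }

  setItem(key: string, value: string): void {
    this.data.set(key, value);
  }
}

// Read all sessions; missing, corrupt, or non-array data yields an empty list.
function loadSessions(store: KeyValueStore): ChatSession[] {
  const raw = store.getItem(HISTORY_KEY);
  if (raw === null) return [];
  try {
    const parsed: unknown = JSON.parse(raw);
    return Array.isArray(parsed) ? (parsed as ChatSession[]) : [];
  } catch {
    return [];
  }
}

// Insert or update a session, keeping the most recently saved one first.
function saveSession(store: KeyValueStore, session: ChatSession): void {
  const sessions = loadSessions(store).filter((s) => s.id !== session.id);
  sessions.unshift(session);
  store.setItem(HISTORY_KEY, JSON.stringify(sessions));
}

// Example usage with the in-memory store.
const store = new MemoryStore();
saveSession(store, { id: "a", title: "First chat", messages: [] });
saveSession(store, { id: "b", title: "Second chat", messages: [] });
saveSession(store, {
  id: "a",
  title: "First chat",
  messages: [{ role: "user", text: "Hello" }],
});
console.log(loadSessions(store).map((s) => s.id));
```

Because everything lives in one JSON value per browser profile, history saved this way stays on the device where it was created.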

Maintenance & Community

The project appears to be actively maintained by yeahhe365. Specific details on community channels, partnerships, or a roadmap are not explicitly detailed in the provided README excerpt.

Licensing & Compatibility

The README does not explicitly state the project's license. Compatibility for commercial use or closed-source linking would depend on the underlying license of the project and its dependencies.

Limitations & Caveats

The application's functionality is dependent on the availability and performance of the Google Gemini API. The README does not detail any known bugs, unsupported platforms, or specific performance limitations beyond the inherent characteristics of the AI models used. The reliance on browser localStorage for API keys means keys are not synchronized across devices or browsers.

Health Check

  • Last Commit: 1 day ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 2
  • Issues (30d): 0

Star History

90 stars in the last 30 days
