Workshop for building with Gemini using the Gemini API and Python SDK
This repository provides a hands-on workshop for developers to learn how to build applications using Google's Gemini API and Python SDK. It covers text prompting, multimodal understanding, and agentic capabilities, targeting developers looking to integrate advanced AI features into their projects.
How It Works
The workshop is structured into three parts, guiding users through Gemini's capabilities. It emphasizes practical application via notebooks and exercises, progressing from basic text prompting and streaming to complex multimodal inputs (image, video, audio, documents, code) and agentic features like tool usage and function calling.
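The progression described above (text prompting, then streaming, then function calling) might look roughly like the following. This is a minimal sketch, not code taken from the workshop notebooks. It assumes the `google-genai` Python package, an API key in a `GEMINI_API_KEY` environment variable, and the model name `gemini-2.0-flash`. The `get_weather` tool and the `run_demo` helper are hypothetical names introduced only for illustration.

```python
import os

MODEL = "gemini-2.0-flash"  # assumed model name; substitute the one the workshop uses


def get_weather(city: str) -> dict:
    """Hypothetical tool: return a canned forecast for a city."""
    return {"city": city, "forecast": "sunny", "temp_c": 21}


def run_demo(api_key: str) -> None:
    # Imported here so the file still loads when the SDK is not installed.
    from google import genai
    from google.genai import types

    client = genai.Client(api_key=api_key)

    # 1. Basic text prompting: one request, one complete response.
    response = client.models.generate_content(
        model=MODEL,
        contents="Explain what a token is in one sentence.",
    )
    print(response.text)

    # 2. Streaming: print partial chunks as they arrive.
    for chunk in client.models.generate_content_stream(
        model=MODEL,
        contents="Write a haiku about notebooks.",
    ):
        print(chunk.text or "", end="")
    print()

    # 3. Function calling: pass a plain Python function as a tool and let
    #    the SDK invoke it when the model requests it.
    response = client.models.generate_content(
        model=MODEL,
        contents="What is the weather in Paris?",
        config=types.GenerateContentConfig(tools=[get_weather]),
    )
    print(response.text)


if __name__ == "__main__":
    key = os.environ.get("GEMINI_API_KEY")
    if key:
        run_demo(key)
    else:
        print("Set GEMINI_API_KEY to run the live calls.")
```

Multimodal inputs follow the same request shape, with image, audio, video, or document parts supplied in `contents` alongside the text prompt.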
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The repository is maintained by Patrick Loeber. Further resources and related projects are linked in the README, including Gemini 2.0 features and a Live API cookbook.
Licensing & Compatibility
The README does not specify a license, so the terms for commercial use or for linking from closed-source projects are unclear.
Limitations & Caveats
The README specifies neither a required Python version nor installation steps beyond needing the Gemini API SDK. The project appears to be a workshop guide rather than a runnable application with a defined setup process.