crewai-experiments  by majacinka

CrewAI experiments for startup analysis and newsletter creation

created 1 year ago
993 stars

Top 38.1% on sourcepulse

GitHubView on GitHub
Project Summary

This repository showcases experiments with the CrewAI framework, demonstrating how to build AI agent teams for tasks like startup idea examination and newsletter generation using both API-based and local LLMs. It targets developers and researchers interested in practical applications of multi-agent AI systems and provides insights into the performance of various LLMs with CrewAI.

How It Works

The project utilizes CrewAI to orchestrate a team of AI agents, assigning them specific roles and tasks. It explores different LLMs, including OpenAI's GPT-4, Gemini Pro, and various models run locally via Ollama, to assess their effectiveness in executing complex workflows and their ability to utilize provided tools. The experiments highlight the challenges and successes in getting agents to adhere to task requirements and leverage their tool-use capabilities.

Quick Start & Requirements

  • Install: pip install crewai
  • Prerequisites: Python 3.8+, Ollama for local models, API keys for OpenAI/Gemini.
  • Setup: Requires configuring API keys and potentially downloading local LLM models via Ollama.

Highlighted Details

  • Evaluates OpenAI GPT-4, Gemini Pro, and local models (Mistral 7B, OpenChat 3.5, Nous Hermes, OpenHermes 2.5, Starling 7B, Llama 2 variants, Phi-2) with CrewAI.
  • Compares agent performance on tasks like startup idea analysis and newsletter generation with web scraping.
  • Documents specific LLM behaviors, including tool usage failures and output quality variations.

Maintenance & Community

This is a personal experiment repository, not an official project with dedicated maintenance or community channels.

Licensing & Compatibility

The licensing is not specified in the README. Compatibility for commercial use or closed-source linking is undetermined.

Limitations & Caveats

The experiments are focused on specific use cases and LLM configurations, and the results may not generalize to all scenarios. Several local models struggled with tool usage, producing generic text instead of executing assigned tasks. The project is marked as "WIP" for the email classifier task.

Health Check
Last commit

1 year ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
15 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Victor Taelin Victor Taelin(Author of Bend, Kind, HVM), and
4 more.

AgentBench by THUDM

0.6%
3k
Benchmark for evaluating LLMs as agents across diverse environments
created 2 years ago
updated 6 months ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Taranjeet Singh Taranjeet Singh(Cofounder of Mem0), and
1 more.

awesome-llm-apps by Shubhamsaboo

6.4%
54k
LLM app collection with AI agents and RAG examples
created 1 year ago
updated 2 days ago
Feedback? Help us improve.