edgartools by dgunning

Python SDK for SEC EDGAR data extraction and analysis

created 2 years ago
1,105 stars

Top 35.2% on sourcepulse

Project Summary

EdgarTools is a Python library that simplifies extracting and analyzing financial data from SEC EDGAR filings. It targets financial analysts, researchers, and developers who need to access and process this data efficiently, and it requires considerably less code than scraping EDGAR pages or parsing filings by hand.

How It Works

EdgarTools provides a Python API for the SEC EDGAR database and can retrieve filings from 1994 onwards. It automatically parses common filing types (10-K, 10-Q, Form 4, 13F, and others) into structured Python objects and pandas DataFrames, relying on efficient data-handling libraries under the hood. The library also extracts clean text suitable for LLM pipelines and rate-limits its own requests automatically so that the SEC does not block the client.
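
To illustrate the idea behind automatic rate limiting, here is a minimal sliding-window limiter written with the standard library only. It is a sketch, not EdgarTools' actual implementation: the `RateLimiter` class, its parameters, and the limit of 10 requests per second are all assumptions made for this example.

```python
import time
from collections import deque


class RateLimiter:
    """Allow at most `max_calls` calls in any window of `period` seconds.

    Hypothetical helper for illustration; not part of EdgarTools.
    """

    def __init__(self, max_calls, period=1.0, clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self._clock = clock      # injectable so the limiter can be tested without waiting
        self._sleep = sleep
        self._stamps = deque()   # timestamps of calls still inside the window

    def _drop_expired(self, now):
        # Forget calls that have aged out of the sliding window.
        while self._stamps and now - self._stamps[0] >= self.period:
            self._stamps.popleft()

    def wait(self):
        """Block until another call is allowed, then record it."""
        now = self._clock()
        self._drop_expired(now)
        if len(self._stamps) >= self.max_calls:
            # Sleep until the oldest recorded call leaves the window.
            self._sleep(self.period - (now - self._stamps[0]))
            now = self._clock()
            self._drop_expired(now)
        self._stamps.append(now)


# Assumed limit for this sketch: 10 requests per second.
limiter = RateLimiter(max_calls=10, period=1.0)
```

A caller would invoke `limiter.wait()` immediately before each HTTP request, so that bursts are spread out instead of being rejected by the server.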

Quick Start & Requirements

Highlighted Details

  • Access any SEC filing since 1994.
  • Automatic parsing into pandas DataFrames.
  • Specialized data objects for each form type.
  • Clean text extraction for AI/LLM pipelines.
  • Insider transaction monitoring and fund holdings analysis.
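
As a rough picture of what "clean text extraction" involves, the sketch below strips markup from an HTML fragment using only the standard library. It is not how EdgarTools does it; the `html_to_text` function, the tag sets, and the sample markup are hypothetical and chosen only for illustration.

```python
import re
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect visible text, skipping scripts and styles (illustrative only)."""

    SKIP = {"script", "style"}
    BLOCK = {"p", "div", "br", "tr", "li", "table", "h1", "h2", "h3", "h4"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag in self.BLOCK:
            self.parts.append("\n")   # block-level tags start a new line

    def handle_endtag(self, tag):
        if tag in self.SKIP:
            if self._skip_depth:
                self._skip_depth -= 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def html_to_text(html):
    """Return the visible text of `html`, one non-empty line per block."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    text = "".join(parser.parts)
    # Collapse runs of whitespace (including non-breaking spaces) per line.
    lines = (re.sub(r"\s+", " ", line).strip() for line in text.splitlines())
    return "\n".join(line for line in lines if line)


sample = (
    "<html><head><style>p{color:red}</style></head><body>"
    "<h1>Item 1.  Business</h1><p>We make&nbsp;widgets.</p>"
    "<script>var x=1;</script></body></html>"
)
print(html_to_text(sample))
```

Text in this form can be chunked and passed to an LLM without the layout markup that filings typically carry.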

Maintenance & Community

  • Active development with a roadmap including enhanced visualizations and ML integrations.
  • Community support via GitHub Issues and Discussions.
  • Contributing guide available for community involvement.

Licensing & Compatibility

  • MIT License.
  • Permissive for commercial use and integration with closed-source projects.

Limitations & Caveats

The library is actively under development, with planned features like interactive dashboards and ML integrations still in progress.

Health Check

  • Last commit: 2 days ago
  • Responsiveness: Inactive
  • Pull requests (30d): 11
  • Issues (30d): 28
  • Star history: 210 stars in the last 90 days
