edgartools by dgunning

Python SDK for SEC EDGAR data extraction and analysis

Created 2 years ago
1,244 stars

Top 31.7% on SourcePulse

Project Summary

EdgarTools is a Python library that simplifies extracting and analyzing financial data from SEC EDGAR filings. It targets financial analysts, researchers, and developers who need efficient access to this data, and it requires far less code than scraping or parsing filings by hand.

How It Works

EdgarTools provides a Python API for the SEC EDGAR database and retrieves filings from 1994 onwards. It automatically parses common filing types (10-K, 10-Q, Form 4, 13F, etc.) into structured Python objects and pandas DataFrames. The library also offers clean text extraction suitable for LLM pipelines and automatic rate limiting so that clients are not blocked by the SEC.
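
To illustrate what automatic rate limiting involves, here is a minimal sketch of a client-side limiter that spaces calls at least 1/rate seconds apart. It is not EdgarTools' implementation; the class name `MinIntervalLimiter` and its parameters are hypothetical, and the figure of 10 requests per second is an assumption based on the SEC's published fair-access guidance.

```python
import time


class MinIntervalLimiter:
    """Hypothetical helper: keep successive calls at least 1/rate seconds apart."""

    def __init__(self, max_per_second, clock=time.monotonic, sleep=time.sleep):
        if max_per_second <= 0:
            raise ValueError("max_per_second must be positive")
        self.min_interval = 1.0 / max_per_second
        # clock and sleep are injectable so the limiter can be tested without waiting
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = None  # earliest time the next call may proceed

    def wait(self):
        """Sleep if the previous call was too recent. Returns the seconds slept."""
        now = self._clock()
        delay = 0.0
        if self._next_allowed is not None and now < self._next_allowed:
            delay = self._next_allowed - now
            self._sleep(delay)
            now = self._next_allowed
        # Reserve the next slot relative to when this call actually proceeds.
        self._next_allowed = now + self.min_interval
        return delay
```

A caller would create one limiter, for example `MinIntervalLimiter(10)`, and call `wait()` immediately before each HTTP request to EDGAR. A fixed minimum interval is simpler than a token bucket but does not allow short bursts.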

Quick Start & Requirements

Highlighted Details

  • Access any SEC filing since 1994.
  • Automatic parsing into pandas DataFrames.
  • Specialized data objects for each form type.
  • Clean text extraction for AI/LLM pipelines.
  • Insider transaction monitoring and fund holdings analysis.
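
As a rough idea of what clean text extraction for an LLM pipeline means, the sketch below strips markup from an HTML filing fragment using only the Python standard library. It is an illustration under stated assumptions, not the method EdgarTools uses; `html_to_text` and `_TextExtractor` are hypothetical names, and the list of block-level tags is a simplification.

```python
import re
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect visible text, dropping script/style content (hypothetical helper)."""

    SKIP = {"script", "style"}
    BLOCK = {"p", "div", "br", "tr", "li", "table", "h1", "h2", "h3", "h4"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag in self.BLOCK:
            self.chunks.append("\n")  # block-level tags start a new line

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1
        elif tag in self.BLOCK:
            self.chunks.append("\n")

    def handle_data(self, data):
        if not self._skip_depth:
            # Collapse runs of whitespace (including non-breaking spaces) to one space.
            self.chunks.append(re.sub(r"[\s\xa0]+", " ", data))


def html_to_text(html):
    """Return the visible text of an HTML string, one non-empty line per block."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    lines = (line.strip() for line in "".join(parser.chunks).split("\n"))
    return "\n".join(line for line in lines if line)
```

Real filings also contain tables, XBRL tags, and page-break markup that a production extractor would need to handle; this sketch ignores all of those.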

Maintenance & Community

  • Active development with a roadmap including enhanced visualizations and ML integrations.
  • Community support via GitHub Issues and Discussions.
  • Contributing guide available for community involvement.

Licensing & Compatibility

  • MIT License.
  • Permissive for commercial use and integration with closed-source projects.

Limitations & Caveats

The library is under active development; planned features such as interactive dashboards and ML integrations are not yet complete.

Health Check

  • Last Commit: 20 hours ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 3
  • Issues (30d): 37

Star History

89 stars in the last 30 days

Explore Similar Projects

Starred by John Resig (Author of jQuery; Chief Software Architect at Khan Academy), Sasha Rush (Research Scientist at Cursor; Professor at Cornell Tech), and 2 more.

llmparser by kyang6

0%
426
LLM tool for structured data extraction and classification
Created 2 years ago
Updated 2 years ago