Tool for stock investment strategy via LLM analysis of annual reports
Top 98.4% on sourcepulse
This repository provides tools for enhancing stock investment strategies by analyzing company annual reports using Large Language Models (LLMs). It targets quantitative analysts, researchers, and investors seeking to leverage AI for financial data processing and predictive modeling, aiming to improve portfolio performance against benchmarks like the S&P 500.
How It Works
The project follows a pipeline: downloading 10-K filings from the SEC, converting them to PDF for token efficiency, generating embeddings using ChromaDB, and then querying these embeddings with an LLM (like GPT-3.5) to extract scores as features. These features are used in a Linear Regression model within a Jupyter Notebook to predict stock returns and construct investment portfolios.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project is associated with a published paper, indicating academic backing. No specific community channels (Discord, Slack) or active maintenance signals are provided in the README.
Licensing & Compatibility
The repository's code is not explicitly licensed. The associated paper is available under a Creative Commons license (implied by arXiv). Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The project relies on external LLM APIs, which may incur costs and have usage limitations. The effectiveness of the predictive model is dependent on the quality of LLM embeddings and the chosen features, and no performance benchmarks are provided. The setup process requires managing multiple complex dependencies in separate environments.
1 day ago
1 day