Discover and explore top open-source AI tools and projects—updated daily.
SDK for internet-scale financial data
Top 31.3% on SourcePulse
FinNLP provides a comprehensive toolkit for accessing and processing internet-scale financial data, specifically targeting researchers and developers interested in applying Large Language Models (LLMs) and Natural Language Processing (NLP) to financial markets. It offers pipelines for data acquisition and LLM training/finetuning, aiming to democratize financial data access.
How It Works
The project focuses on data ingestion from diverse sources, including financial news (Finnhub, Sina, Eastmoney), social media (Stocktwits, Reddit, Weibo), and company announcements (SEC, Juchao). It employs a modular design with specific classes for each data source, allowing users to download and process data via Python scripts. The use of proxies is integrated to mitigate IP blocking during scraping.
Quick Start & Requirements
pip install finnlp
Highlighted Details
Maintenance & Community
The project is part of the AI4Finance Foundation. Further community engagement details are not explicitly provided in the README.
Licensing & Compatibility
Limitations & Caveats
The project is primarily for academic and research purposes, explicitly stating it is not financial advice. Some data sources may require specific configurations (e.g., Weibo cookies) or are marked as "Soon" for full support. Proxy usage is recommended but not guaranteed to prevent all IP blocks.
1 year ago
Inactive