LingyvKong
AI model for extracting structured data from charts
Summary
OneChart is the official codebase for a chart structural extraction method presented at ACM Multimedia 2024. It introduces a single auxiliary token intended to make the extraction of structured information from charts cleaner and more reliable. The project targets researchers and developers working on visual question answering, information extraction from documents, and chart understanding, and it aims to offer a more robust way to convert visual chart data into structured formats such as Python dictionaries.
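As an illustration of the target format, the sketch below shows what a simple bar chart might look like once converted into a Python dictionary, and how such a dictionary can be flattened into rows. The key names (`title`, `x_title`, `y_title`, `values`), the sample numbers, and the helper `chart_dict_to_rows` are assumptions made for this example only, not OneChart's documented output schema.

```python
from typing import Any

# Hypothetical extraction result for a simple bar chart.
# Key names and values are illustrative assumptions, not OneChart's schema.
chart = {
    "title": "Quarterly revenue",
    "x_title": "Quarter",
    "y_title": "Revenue (USD millions)",
    "values": {"Q1": 12.5, "Q2": 14.0, "Q3": 13.2, "Q4": 16.8},
}


def chart_dict_to_rows(chart: dict[str, Any]) -> list[tuple[str, float]]:
    """Flatten the hypothetical chart dictionary into (label, value) rows."""
    return [(str(label), float(value)) for label, value in chart["values"].items()]


rows = chart_dict_to_rows(chart)
for label, value in rows:
    print(f"{label}: {value}")
```

Once a chart is in this kind of form, downstream code can filter, aggregate, or compare it against ground truth without touching the image again.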
How It Works
The core idea of OneChart is captured in its title, "Purify the Chart Structural Extraction via One Auxiliary Token." The README does not elaborate on the algorithm in depth, but the approach adds one auxiliary token to the model's processing pipeline. This token is meant to guide and refine the extraction of chart structure, which may improve accuracy and robustness over methods that lack such a mechanism. The project builds on the "Vary" codebase and its initial weights, which suggests a foundation in existing large vision-language model architectures.
Quick Start & Requirements
[...] OneChart_code/, create a conda environment (conda create -n onechart python=3.10 -y, conda activate onechart), and install dependencies (pip install -e ., pip install -r requirements.txt, pip install ninja).
[...] kppkkp/OneChart) and a local demo script (vary/demo/run_opt_v1.py) is provided.
[...] https://huggingface.co/kppkkp/OneChart.
Highlighted Details
[...] ChartSE).
Maintenance & Community
The project's authors come from multiple institutions. No community channels (such as Discord or Slack) or roadmap links are provided. The release is dated September 2024.
Licensing & Compatibility
The data, code, and checkpoints are explicitly stated to be "intended and licensed for research use only." Usage is further restricted by the license agreement of "Vary, Opt," on which the project is built. The project may therefore inherit copyleft or non-commercial restrictions from those base projects, so commercial use or integration into closed-source products should not be assumed permissible without further clarification.
Limitations & Caveats
The research-only license is a significant barrier to commercial adoption. The README also notes that the project builds on "Vary" and "Opt," whose inherited dependencies and license terms are not fully detailed here.