ST-Raptor  by weAIDB

Answering questions over complex semi-structured tables

Created 2 months ago
276 stars

Top 93.8% on SourcePulse

GitHubView on GitHub
Project Summary

ST-Raptor is a tool designed for answering natural language questions over semi-structured tables, supporting diverse formats like HTML, CSV, and Markdown. It targets users who need precise answers from complex tables without the overhead of fine-tuning existing models. The primary benefit is its ability to handle intricate table layouts and integrate flexibly with various LLMs and VLMs, offering a no-fine-tuning approach.

How It Works

ST-Raptor employs a novel approach combining a Vision-Language Model (VLM) with a hierarchical organization (HO-Tree) construction algorithm. This VLM-LLM integration allows it to interpret complex table structures and extract relevant information. A two-stage validation mechanism is utilized to ensure the reliability and accuracy of the generated answers. The system's advantage lies in its ability to process tables without requiring task-specific fine-tuning, making it adaptable to new datasets and table types.

Quick Start & Requirements

Installation involves cloning the repository, setting up a conda environment (conda create -n straptor python=3.10, conda activate straptor, pip install -r requirements.txt), and installing wkhtmltox and font packages (fonts-noto-cjk, fonts-wqy-microhei). Model configuration requires significant resources: the recommended setup (Deepseek-V3, InternVL2.5 26B, Multilingual-E5-Large-Instruct) demands approximately 160GB of GPU memory. Alternatively, API calls can be configured for LLM, VLM, and Embedding models. Configuration details for models and API endpoints are managed in ./utils/constnts.py.

Highlighted Details

  • Achieves state-of-the-art performance on the SSTQA benchmark, with 72.39% accuracy and 52.19% ROUGE-L score, outperforming various NL2SQL, fine-tuning, agent, and VLM-based methods.
  • Supports a wide array of input table formats including HTML, CSV, and Markdown, alongside Excel files.
  • Requires no additional fine-tuning for question-answering tasks.
  • Features a visualization platform based on Gradio for inspecting HO-Tree structures and interacting with the model.

Maintenance & Community

The project maintains an active community through a WeChat group for discussions on complex semi-structured table analysis.

Licensing & Compatibility

ST-Raptor is released under the MIT License, permitting broad use and compatibility with closed-source applications.

Limitations & Caveats

The project roadmap indicates planned support for image inputs and expansion of the table extraction module to handle table types beyond the current scope. The high GPU memory requirement (160GB) for the recommended local model configuration presents a significant barrier to entry for users without substantial hardware resources.

Health Check
Last Commit

1 day ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
2
Star History
21 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Andreas Jansson Andreas Jansson(Cofounder of Replicate).

natural-sql by cfahlgren1

0%
865
Text-to-SQL LLMs with strong performance
Created 1 year ago
Updated 1 year ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
1 more.

tapas by google-research

0.2%
1k
Table QA models for end-to-end neural table-text understanding
Created 5 years ago
Updated 1 year ago
Feedback? Help us improve.