weAIDB / ST-Raptor: Answering questions over complex semi-structured tables
Top 93.8% on SourcePulse
ST-Raptor is a tool for answering natural language questions over semi-structured tables in diverse formats such as HTML, CSV, and Markdown. It targets users who need precise answers from complex tables, handling intricate layouts and integrating flexibly with various LLMs and VLMs, all without requiring any model fine-tuning.
How It Works
ST-Raptor combines a Vision-Language Model (VLM) with a hierarchical organization tree (HO-Tree) construction algorithm. This VLM-LLM integration lets it interpret complex table structures and extract the relevant cells, while a two-stage validation mechanism checks the reliability and accuracy of generated answers. Because no task-specific fine-tuning is required, the system adapts readily to new datasets and table types.
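To make the pipeline concrete, here is a toy sketch of the HO-Tree idea and the two-stage validation step. All names (`HONode`, `build_ho_tree`, `answer`, `validate`) and the flat-row input are illustrative assumptions, not ST-Raptor's actual API; the real system builds the tree from VLM output over a table image.

```python
from dataclasses import dataclass, field


@dataclass
class HONode:
    """A node in a hierarchical organization (HO) tree: a header or a cell value."""
    label: str
    children: list = field(default_factory=list)


def build_ho_tree(rows):
    """Toy HO-Tree construction: nest cell values under their row header.

    ST-Raptor derives this structure from a VLM's reading of the table layout;
    here we start from plain (header, value) pairs for illustration.
    """
    root = HONode("table")
    by_header = {}
    for header, value in rows:
        node = by_header.get(header)
        if node is None:
            node = HONode(header)
            by_header[header] = node
            root.children.append(node)
        node.children.append(HONode(value))
    return root


def answer(tree, question_key):
    """Stage 1: traverse the tree to collect values under the queried header."""
    for node in tree.children:
        if node.label == question_key:
            return [child.label for child in node.children]
    return []


def validate(tree, question_key, candidate):
    """Stage 2 of a two-stage validation sketch: independently re-derive the
    answer and only accept the candidate if the two derivations agree."""
    return candidate == answer(tree, question_key)


rows = [("Revenue", "2.4M"), ("Revenue", "3.1M"), ("Staff", "120")]
tree = build_ho_tree(rows)
ans = answer(tree, "Revenue")
assert validate(tree, "Revenue", ans)
```

The design point this illustrates: once the table is lifted into a tree, answering becomes structured traversal rather than free-form text generation, and validation can cheaply re-check the traversal.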
Quick Start & Requirements
Installation involves cloning the repository, creating a conda environment (conda create -n straptor python=3.10; conda activate straptor; pip install -r requirements.txt), and installing wkhtmltox plus the fonts-noto-cjk and fonts-wqy-microhei font packages. Model configuration requires significant resources: the recommended local setup (Deepseek-V3, InternVL2.5 26B, Multilingual-E5-Large-Instruct) demands roughly 160GB of GPU memory. Alternatively, API calls can be configured for the LLM, VLM, and embedding models. Model and API-endpoint settings are managed in ./utils/constants.py.
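As a rough sketch of what the configuration module might hold, the fragment below shows the two deployment choices side by side: local model names versus API endpoints. Every identifier and value here is an assumption for illustration only, not the repo's actual constants file.

```python
# Hypothetical sketch of settings like those in ./utils/constants.py.
# All names and values below are illustrative assumptions.

# Local-model route (recommended setup, ~160GB GPU memory total):
LLM_MODEL = "deepseek-v3"                       # reasoning model
VLM_MODEL = "internvl2.5-26b"                   # vision model for table parsing
EMBED_MODEL = "multilingual-e5-large-instruct"  # embedding model

# API route (avoids the local GPU requirement):
LLM_API_BASE = "https://api.example.com/v1"  # placeholder endpoint
LLM_API_KEY = "YOUR_KEY_HERE"                # supplied by the user
```

Whichever route is chosen, the rest of the pipeline reads these settings rather than hard-coding model choices, which is what makes the LLM/VLM backends swappable.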
Maintenance & Community
The project maintains an active community through a WeChat group for discussions on complex semi-structured table analysis.
Licensing & Compatibility
ST-Raptor is released under the MIT License, permitting broad use and compatibility with closed-source applications.
Limitations & Caveats
The project roadmap indicates planned support for image inputs and expansion of the table extraction module to handle table types beyond the current scope. The high GPU memory requirement (160GB) for the recommended local model configuration presents a significant barrier to entry for users without substantial hardware resources.