SmartResume by alibaba

AI-powered resume parsing system

Created 3 months ago

335 stars

Top 82.4% on SourcePulse

Project Summary

SmartResume is a layout-aware resume parsing system. It ingests resumes in PDF, image, and Office formats, extracts clean text, and reconstructs the reading order. It then uses LLMs to convert that content into structured fields such as basic info, education, and work experience, giving engineers and researchers structured data they can analyze directly.

How It Works

SmartResume processes a resume in three stages. It first extracts clean text using OCR and PDF metadata. It then applies layout detection to reconstruct the correct reading order. Finally, a Large Language Model (LLM) converts the ordered content into structured data fields. This layout-aware approach helps with resumes whose meaning depends on visual formatting, for example multi-column layouts.
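The last two stages can be sketched in a few lines of Python. This is a minimal illustration, not SmartResume's code: the project uses a layout detection model, whereas the sketch substitutes a naive two-column heuristic, and every name here (TextBlock, reading_order, build_prompt, parse_resume, fake_llm) is hypothetical. The LLM is passed in as a plain callable so the example runs without any API key.

```python
import json
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class TextBlock:
    """A piece of extracted text with its horizontal extent and top edge (hypothetical type)."""
    text: str
    x0: float  # left edge
    y0: float  # top edge
    x1: float  # right edge


def reading_order(blocks: List[TextBlock], page_width: float) -> List[TextBlock]:
    """Naive stand-in for layout detection: left column top-to-bottom, then right column."""
    mid = page_width / 2
    left = [b for b in blocks if (b.x0 + b.x1) / 2 < mid]
    right = [b for b in blocks if (b.x0 + b.x1) / 2 >= mid]
    return sorted(left, key=lambda b: b.y0) + sorted(right, key=lambda b: b.y0)


def build_prompt(blocks: List[TextBlock]) -> str:
    """Number the ordered blocks and ask for structured fields (prompt wording is an assumption)."""
    lines = [f"[{i}] {b.text}" for i, b in enumerate(blocks)]
    header = "Extract basic_info, education, and work_experience as JSON."
    return header + "\n" + "\n".join(lines)


def parse_resume(blocks: List[TextBlock], page_width: float,
                 llm: Callable[[str], str]) -> Dict:
    """Order the blocks, prompt the LLM, and decode its JSON reply."""
    ordered = reading_order(blocks, page_width)
    return json.loads(llm(build_prompt(ordered)))


def fake_llm(prompt: str) -> str:
    """Placeholder for a real LLM call; always returns an empty structure."""
    return json.dumps({"basic_info": {}, "education": [], "work_experience": []})


blocks = [
    TextBlock("Skills: Python", 320, 50, 580),
    TextBlock("Work Experience", 20, 120, 280),
    TextBlock("Jane Doe", 20, 40, 280),
]
ordered = reading_order(blocks, page_width=600)
print([b.text for b in ordered])  # ['Jane Doe', 'Work Experience', 'Skills: Python']
result = parse_resume(blocks, 600, fake_llm)
```

Keeping the LLM behind a callable means the ordering step can be tested on its own, which is where layout mistakes usually show up first.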

Quick Start & Requirements

Installation: Clone the repository, create and activate a conda environment (conda create -n resume_parsing python=3.9, then conda activate resume_parsing), and install the package with its dependencies in editable mode (pip install -e .).
Prerequisites: Python >= 3.9, CUDA >= 11.0 (optional for GPU acceleration), Memory >= 8GB, Storage >= 10GB.
Configuration: Requires editing configs/config.yaml to add API keys.
Links: Code repository (implied), Model, Demo, Technical Report (English/Chinese).
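Before installing, the prerequisites listed above can be checked with a short script. This is a sketch using only the Python standard library; the function names and the idea of passing the CUDA version as a string are assumptions for illustration, and nothing here is part of SmartResume. The memory requirement is not checked because the standard library has no portable call for it.

```python
import shutil
import sys

MIN_PYTHON = (3, 9)      # Python >= 3.9
MIN_CUDA = (11, 0)       # CUDA >= 11.0, only relevant for GPU acceleration
MIN_FREE_DISK_GB = 10    # Storage >= 10GB


def python_ok(version=sys.version_info) -> bool:
    """True if the interpreter's major.minor version meets the minimum."""
    return tuple(version[:2]) >= MIN_PYTHON


def cuda_ok(version_string: str) -> bool:
    """True if a CUDA version string such as '11.8' meets the minimum."""
    parts = tuple(int(p) for p in version_string.split(".")[:2])
    parts = (parts + (0, 0))[:2]  # pad '12' to (12, 0)
    return parts >= MIN_CUDA


def disk_ok(path: str = ".", min_gb: float = MIN_FREE_DISK_GB) -> bool:
    """True if the filesystem holding `path` has at least `min_gb` GiB free."""
    free_gb = shutil.disk_usage(path).free / 1024 ** 3
    return free_gb >= min_gb


if __name__ == "__main__":
    print("Python version OK:", python_ok())
    print("Free disk space OK:", disk_ok())
```

The CUDA check takes a string because how that version is discovered (for example from a driver tool or a deep learning framework) depends on the local setup.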

Highlighted Details

Layout Detection achieves an mAP@0.5 of 92.1%.
Information Extraction reports an Overall Accuracy of 93.1%.
Processing Speed is reported at 1.22 s per page.
Supports many major global languages.
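Two of these figures are easier to read with a little arithmetic. In mAP@0.5, a predicted layout box counts as a match only when its intersection-over-union (IoU) with a ground-truth box is at least 0.5. The sketch below shows that test and converts the per-page time into an hourly rate. The function names are illustrative, and the throughput figure assumes pages are processed one after another at a constant 1.22 s each, which the source does not state.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1)


def area(box: Box) -> float:
    """Area of an axis-aligned box; zero if the box is degenerate."""
    return max(0.0, box[2] - box[0]) * max(0.0, box[3] - box[1])


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = area(a) + area(b) - inter
    return inter / union if union > 0 else 0.0


def is_match(pred: Box, truth: Box, threshold: float = 0.5) -> bool:
    """A prediction matches at mAP@0.5 when IoU >= 0.5."""
    return iou(pred, truth) >= threshold


def pages_per_hour(seconds_per_page: float) -> float:
    """Sequential throughput, assuming a constant per-page time."""
    return 3600.0 / seconds_per_page


truth = (0.0, 0.0, 10.0, 10.0)
print(is_match((2.0, 0.0, 12.0, 10.0), truth))  # True: IoU = 80 / 120
print(is_match((5.0, 0.0, 15.0, 10.0), truth))  # False: IoU = 50 / 150
print(int(pages_per_hour(1.22)))                # 2950
```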

Maintenance & Community

The project includes a TODO list indicating ongoing development, such as optimizing model loading and enhancing vLLM deployment. No community channels (e.g., Discord, Slack), notable contributors, or sponsorships are listed.

Licensing & Compatibility

The project refers only to its "LICENSE" file for terms, without naming a specific license here, and states plans to adopt more permissive licenses. The published codebase is a refactored version: to meet open-source compliance requirements, the internal PDF parsing and OCR components were replaced with open-source alternatives. Review the LICENSE file and the licenses of those replacement components before commercial use or closed-source integration.

Limitations & Caveats

This release is a refactored version of the original system. To meet open-source compliance requirements, the internal PDF parsing and OCR components were replaced with open-source alternatives, so behavior may differ from the original implementation, and some features may not be fully functional. The TODO list, which includes optimizing model loading and enhancing vLLM deployment support, indicates that development is ongoing.

Explore Similar Projects

ferrules by AmineDiro

Versatile-OCR-Program by ses4255

yomitoku by kotaro-kinoshita

api-llm-ocr by yigitkonur

HunyuanOCR by Tencent-Hunyuan

mPLUG-DocOwl by X-PLUG

nlm-ingestor by nlmatics

AdvancedLiterateMachinery by AlibabaResearch

llm_aided_ocr by Dicklesworthstone

dots.ocr by rednote-hilab

olmocr by allenai

langextract by google