GPT-3-based resume parser as a REST API
This project provides a REST API that parses resume PDFs into structured JSON, using GPT-3 for natural language understanding. It targets developers and HR professionals who want to automate resume screening and data extraction, and it is presented as a cost-effective, accurate way to turn unstructured resume content into structured data.
How It Works
The system uses GPT-3's text-davinci-002 engine to interpret resume content extracted from PDFs. It first converts each PDF to text with pdftotext, then sends that text to the OpenAI API, which parses it into predefined JSON fields. This approach copes with varied resume formats and extracts details such as job titles, education, and projects.
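The two-step flow described above (PDF to text, then text to JSON fields) can be sketched as follows. This is a minimal illustration under stated assumptions, not the project's actual code: the field list, the prompt wording, and the helper names `pdf_to_text`, `build_completion_request`, and `parse_completion` are all hypothetical. The request body targets OpenAI's legacy completions endpoint, and sending it over HTTP is left out.

```python
import json
import subprocess


def pdf_to_text(pdf_path: str) -> str:
    """Extract text from a PDF by shelling out to the pdftotext CLI."""
    # Passing "-" as the output file makes pdftotext write to stdout.
    result = subprocess.run(
        ["pdftotext", "-layout", pdf_path, "-"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


# Hypothetical field list; the project's real JSON schema is not shown here.
FIELDS = ["name", "email", "job_titles", "education", "projects"]


def build_completion_request(resume_text: str, fields=FIELDS) -> dict:
    """Build a JSON body for OpenAI's legacy /v1/completions endpoint."""
    # Assumed prompt wording: ask for one JSON object with fixed keys.
    prompt = (
        "Extract the following fields from the resume below and answer "
        "with a single JSON object using exactly these keys: "
        + ", ".join(fields)
        + ".\n\nResume:\n"
        + resume_text
        + "\n\nJSON:"
    )
    return {
        "model": "text-davinci-002",
        "prompt": prompt,
        "max_tokens": 1024,  # assumed cap on the reply length
        "temperature": 0,    # favor repeatable output for extraction
    }


def parse_completion(completion_text: str, fields=FIELDS) -> dict:
    """Parse the model's reply, keeping only the expected keys."""
    data = json.loads(completion_text)
    # Missing keys become None; unexpected keys are dropped.
    return {key: data.get(key) for key in fields}
```

Keeping request construction and reply parsing as pure functions makes them easy to test without a PDF, an API key, or network access.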
Quick Start & Requirements
Install the pdftotext dependencies.
Run ./build.sh.
Provide your OpenAI API key in a .env file or as an environment variable.
Run ./run.sh to start the Flask server on localhost:5001.
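For the API key step, here is a minimal sketch of how a key could be read from an environment variable with a fallback to a .env file. The variable name `OPENAI_API_KEY` and the helper `load_api_key` are assumptions for illustration; the README summary does not say which name the project expects or how it loads the file.

```python
import os
from typing import Optional


def load_api_key(env_path: str = ".env",
                 name: str = "OPENAI_API_KEY") -> Optional[str]:
    """Return the key from the environment, else from a KEY=value .env file."""
    value = os.environ.get(name)
    if value:
        return value
    try:
        with open(env_path, encoding="utf-8") as handle:
            for raw in handle:
                line = raw.strip()
                # Skip blanks, comments, and lines without an assignment.
                if not line or line.startswith("#") or "=" not in line:
                    continue
                key, _, val = line.partition("=")
                if key.strip() == name:
                    # Drop surrounding whitespace and optional quotes.
                    return val.strip().strip("\"'")
    except FileNotFoundError:
        pass
    return None
```

In this sketch the environment variable takes precedence over the file, so a deployment can override a checked-out .env without editing it.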
Highlighted Details
Uses GPT-3's text-davinci-002 engine.
Maintenance & Community
No specific information on contributors, community channels, or roadmap is provided in the README.
Licensing & Compatibility
The README does not specify a license. Compatibility for commercial use or closed-source linking is not detailed.
Limitations & Caveats
The project relies on an external OpenAI API key, incurring costs per parse. The absence of a live demo is attributed to these API costs. No specific limitations on PDF formats or parsing accuracy are detailed beyond the general capabilities of GPT-3.
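Because each parse is billed by token count, a rough per-resume estimate can help with budgeting. The sketch below assumes the common rule of thumb of about four characters per token for English text. The helper `estimate_parse_cost` is hypothetical, and the price is a required parameter because actual rates depend on the model and on OpenAI's current pricing.

```python
import math


def estimate_parse_cost(resume_text: str,
                        reply_tokens: int,
                        usd_per_1k_tokens: float,
                        chars_per_token: float = 4.0) -> float:
    """Roughly estimate the USD cost of one parse request."""
    # Heuristic token count for the prompt; real tokenization will differ.
    prompt_tokens = math.ceil(len(resume_text) / chars_per_token)
    # Prompt and reply tokens are both billed.
    total_tokens = prompt_tokens + reply_tokens
    return total_tokens / 1000 * usd_per_1k_tokens
```

For example, a 6,000-character resume with a 500-token reply comes to roughly 2,000 tokens, so at a placeholder rate of 0.02 USD per 1,000 tokens the estimate is about 0.04 USD.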
2 years ago
Inactive