The openFDA project provides open APIs, data downloads, and a developer community for FDA public datasets, including drugs, foods, and medical devices. It aims to make this data accessible for research and development, empowering users to build applications and gain insights into FDA-regulated products.
How It Works
This project utilizes a Python-based Luigi pipeline to process raw FDA data into a JSON format suitable for Elasticsearch. An Elasticsearch cluster stores this processed data, and a Node.js API server, built with Express and Elasticsearch.js, serves the data via a documented JSON interface (api.fda.gov). This architecture allows for efficient data ingestion, storage, and querying of large public health datasets.
Quick Start & Requirements
bootstrap.sh
for Python virtualenv and Node.js package setup. Docker is recommended via docker-compose up
.vm.max_map_count
to 262144. Windows users should use git clone ... --config core.autocrlf=input
.http://localhost:8000/status
.Highlighted Details
Maintenance & Community
The project is an FDA initiative. Community interest may drive the addition of more data pipelines.
Licensing & Compatibility
The repository does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The project explicitly states: "Do not rely on openFDA to make decisions regarding medical care." Only a subset of pipelines (NSDE, CAERS, Substance Data, Device Clearance, Device PMA, Device Event) are included in the Docker setup due to complexity and network access requirements.
2 years ago
1 week