openfda by FDA

Open FDA data APIs and pipelines

created 11 years ago
620 stars

Top 54.0% on sourcepulse

Project Summary

The openFDA project provides open APIs, data downloads, and a developer community for FDA public datasets, including drugs, foods, and medical devices. It aims to make this data accessible for research and development, empowering users to build applications and gain insights into FDA-regulated products.

How It Works

The project uses a Python-based Luigi pipeline to convert raw FDA data into JSON suitable for Elasticsearch. An Elasticsearch cluster stores the processed data, and a Node.js API server, built with Express and Elasticsearch.js, serves it through a documented JSON interface (api.fda.gov). This architecture supports efficient ingestion, storage, and querying of large public health datasets.
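
The transformation step can be illustrated with a minimal sketch that turns raw records into lines for the Elasticsearch bulk API (an action line followed by a document line). It uses only the standard library rather than Luigi, and the function name `to_bulk_lines`, the index name, and the record fields are hypothetical; the actual openFDA pipelines may be organized quite differently.

```python
import json
from typing import Iterable, Iterator


def to_bulk_lines(records: Iterable[dict], index: str, id_field: str) -> Iterator[str]:
    """Yield Elasticsearch bulk-API lines: one action line, then one document line."""
    for record in records:
        # Drop empty-string and None values so blank fields are not indexed.
        doc = {key: value for key, value in record.items() if value not in ("", None)}
        # The action line tells Elasticsearch which index and document ID to use.
        action = {"index": {"_index": index, "_id": str(doc[id_field])}}
        yield json.dumps(action, sort_keys=True)
        yield json.dumps(doc, sort_keys=True)


if __name__ == "__main__":
    # Hypothetical raw records; real FDA source data has different fields.
    raw = [
        {"report_id": 1, "product": "Example Product", "outcome": ""},
        {"report_id": 2, "product": "Another Product", "outcome": "recovered"},
    ]
    print("\n".join(to_bulk_lines(raw, index="example-index", id_field="report_id")))
```

Emitting newline-delimited JSON keeps the transform independent of any Elasticsearch client, so the output can be inspected or written to a file before it is loaded.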

Quick Start & Requirements

  • Installation: Run bootstrap.sh to set up the Python virtualenv and install the Node.js packages. The recommended alternative is Docker: run docker-compose up.
  • Prerequisites: Elasticsearch 7, Python 3.6+, Node.js 14+. Linux users may need to raise vm.max_map_count to 262144. Windows users should clone with git clone ... --config core.autocrlf=input.
  • Resources: The Docker setup includes Elasticsearch, API, and Python containers. The API becomes available once the pipelines complete; check http://localhost:8000/status.
  • Documentation: https://open.fda.gov
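
Because the API only becomes available after the pipelines finish, a small polling helper can be handy. The sketch below assumes that an HTTP 200 response from the status URL means the API is ready; the format of the status response is not described here, so the body is not inspected. The helper names `http_ok` and `wait_until_ready` are hypothetical.

```python
import time
import urllib.error
import urllib.request
from typing import Callable

STATUS_URL = "http://localhost:8000/status"  # URL given in the Quick Start notes


def http_ok(url: str, timeout: float = 5.0) -> bool:
    """Return True if the URL answers with HTTP 200; False on HTTP or network errors."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Covers refused connections, timeouts, and non-2xx HTTP responses.
        return False


def wait_until_ready(probe: Callable[[], bool], attempts: int = 30, delay: float = 10.0) -> bool:
    """Call probe() up to `attempts` times, sleeping `delay` seconds between tries."""
    for attempt in range(attempts):
        if probe():
            return True
        if attempt < attempts - 1:
            time.sleep(delay)
    return False
```

Calling `wait_until_ready(lambda: http_ok(STATUS_URL))` would then block until the local API responds or the attempts run out. The default of 30 attempts at 10-second intervals is an arbitrary choice, not a documented pipeline duration.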

Highlighted Details

  • Powers the official api.fda.gov endpoints.
  • Includes pipelines for drugs, foods, and medical devices.
  • Provides Elasticsearch schemas for data sets.
  • Supports querying via standard openFDA syntax.
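
As an illustration of the query syntax, the sketch below builds a request URL from the `search`, `count`, `limit`, and `skip` parameters used by api.fda.gov. The helper name `build_query_url` is hypothetical, and pointing `base` at a local instance assumes that instance exposes the same paths as the public API.

```python
from typing import Optional
from urllib.parse import urlencode

API_BASE = "https://api.fda.gov"


def build_query_url(
    endpoint: str,
    search: Optional[str] = None,
    count: Optional[str] = None,
    limit: Optional[int] = None,
    skip: Optional[int] = None,
    base: str = API_BASE,
) -> str:
    """Build an openFDA-style query URL for an endpoint such as 'drug/event'."""
    params = {}
    if search is not None:
        params["search"] = search  # e.g. field:"term"
    if count is not None:
        params["count"] = count  # field to aggregate on
    if limit is not None:
        params["limit"] = limit  # number of records to return
    if skip is not None:
        params["skip"] = skip  # offset used for paging
    query = urlencode(params)  # percent-encodes colons, quotes, and spaces
    url = f"{base}/{endpoint.strip('/')}.json"
    return f"{url}?{query}" if query else url


if __name__ == "__main__":
    print(build_query_url(
        "drug/event",
        search='patient.reaction.reactionmeddrapt:"fatigue"',
        limit=1,
    ))
```

Keeping URL construction separate from the HTTP call makes the query easy to log and to test without network access.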

Maintenance & Community

The project is an FDA initiative. Community interest may drive the addition of more data pipelines.

Licensing & Compatibility

The repository does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project explicitly states: "Do not rely on openFDA to make decisions regarding medical care." The Docker setup includes only a subset of pipelines (NSDE, CAERS, Substance Data, Device Clearance, Device PMA, Device Event); the rest are omitted because of their complexity and network access requirements.

Health Check

  • Last commit: 2 years ago
  • Responsiveness: 1 week
  • Pull requests (30d): 0
  • Issues (30d): 1
  • Star history: 13 stars in the last 90 days
