Discover and explore top open-source AI tools and projects—updated daily.
laminlabsData framework for scalable biological R&D
Top 97.4% on SourcePulse
<2-3 sentences summarising what the project addresses and solves, the target audience, and the benefit.> LaminDB is an open-source data framework for biological R&D, addressing the critical need for reproducible, traceable, and validated datasets and models at scale. It targets scientists and engineers in academia and biotech, providing essential context and memory for complex biological data, transforming fragmented research into a scalable, compounding process.
How It Works
The project implements a lineage-native lakehouse architecture, leveraging Postgres/SQLite for metadata and supporting bio-formats like AnnData and .zarr. It integrates directly with the pydata stack, offering a unified API for querying, tracing, and validating data. This approach provides crucial context and memory, auto-tracking code, compute environments, and data lineage with minimal code changes, simplifying complex biological data management and enabling agentic R&D.
Quick Start & Requirements
pip install lamindb (full dependencies) or pip install lamindb-core (minimal).Highlighted Details
bionty plugin for programmatic experimental design and semantic data management.Maintenance & Community
LaminDB is adopted by researchers at leading institutions like Pfizer, scverse, Harvard, and MIT. LaminHub serves as a collaboration platform. Specific community links (Discord, Slack) or roadmap details are not provided in the README.
Licensing & Compatibility
The README does not specify a software license. This omission requires clarification for commercial use or integration into closed-source projects.
Limitations & Caveats
No explicit limitations, alpha status, or known bugs are detailed in the provided README content.
9 hours ago
Inactive
hbctraining