Discover and explore top open-source AI tools and projects—updated daily.
Binary pattern analyzer for malware research, reverse engineering, and threat hunting
Top 60.8% on SourcePulse
Binlex is a framework for malware analysis and reverse engineering that treats binary code as "DNA" by breaking it down into hierarchical genetic traits: genomes, chromosomes, allele pairs, and genes. It enables pattern detection, similarity analysis, and threat hunting across large malware datasets, offering a fast, flexible, and extensible alternative to Python-only tools.
How It Works
Binlex disassembles binary files (PE, MachO, ELF) into instructions, basic blocks, and functions, representing these as "genomes." Within each genome, it extracts patterns as "chromosomes," composed of "allele pairs" (bytes) and "genes" (nibbles). This structured representation allows for similarity hashing (Minhash, TLSH, SHA256) and feature extraction, facilitating the identification of commonalities and variations in malware. It also incorporates a vector database with a Graph Neural Network (GNN) for advanced function similarity matching.
Quick Start & Requirements
cargo build --release
. Python bindings require maturin
.~/.idapro/plugins/
and install requirements. Run docker-compose up -d
for the binlex server.cargo doc --open
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 month ago
1 day