Discover and explore top open-source AI tools and projects—updated daily.
AIDC-AIFrameworks and benchmarks for challenging agentic search
Top 91.6% on SourcePulse
This project addresses critical limitations in AI agents for real-world applications, focusing on domain-specific reasoning, hierarchical rule application, and large-scale information seeking. It introduces benchmarks and frameworks like HSCodeComp and DeepWideSearch to evaluate and advance agent capabilities, benefiting researchers and developers aiming for more robust AI agents.
How It Works
The initiative introduces several key components: HSCodeComp for hierarchical rule application, DeepWideSearch for deep-and-wide information seeking, Table-as-Search for a hierarchical multi-agent framework formalizing search as table completion, and UMEM for a self-evolving memory system that jointly optimizes memory extraction and management. These approaches aim to tackle complex decision-making, broad exploration with deep reasoning, structured information synthesis, and generalizable long-term memory.
Quick Start & Requirements
pip install -r requirements.txt for HSCodeComp, DeepWideSearch, Table-as-Search, and pip install -e . for UMEM.Marco-DeepResearch-Family/HSCodeComp/README.mdMarco-DeepResearch-Family/DeepWideSearch/README.mdMarco-DeepResearch-Family/Table-as-Search/README.mdMarco-DeepResearch-Family/UMEM/README.mdMarco-DeepResearch-Family/README.mdHighlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
HSCodeComp shows a significant gap remains versus human experts (95.0% vs. 65.0% for Marco Agent), indicating room for improvement. Datasets are constructed from publicly accessible data, with a disclaimer about potential copyright issues or improper content.
5 days ago
Inactive
open-thought
agno-agi