Osintgraph  by XD-MHLOO

Instagram data analysis tool with AI agent and Neo4j visualization

Created 6 months ago
456 stars

Top 66.2% on SourcePulse

GitHubView on GitHub
Project Summary

OSINTGraph is an open-source tool designed for in-depth social network analysis and OSINT investigations specifically targeting Instagram. It assists users in mapping and analyzing a target's social connections, revealing interests and affiliations through data collection, AI-powered analysis, and Neo4j visualization. The tool is beneficial for security researchers, digital forensic analysts, and anyone needing to understand complex social dynamics on Instagram.

How It Works

OSINTGraph operates in two phases: Reconnaissance and Analysis & Investigation. During Reconnaissance, it scrapes public Instagram data, including profile metadata, followers, followees, posts, and comments, storing this information in a Neo4j graph database. Optionally, it leverages the Gemini API for AI-powered pre-analysis of posts and accounts, generating structured reports. The Analysis & Investigation phase allows users to query the collected data using a natural language AI Agent or directly through Neo4j visualizations, enabling deep dives into relationships, activities, and patterns.

Quick Start & Requirements

  • Installation: pipx install osintgraph or pip install osintgraph (within a virtual environment).
  • Prerequisites:
    • Instagram Account (preferably not the main account).
    • Neo4j Database instance (free tier available).
    • Gemini API Key (free tier available).
    • Optional: Firefox browser User Agent string.
  • Setup: Run osintgraph setup to configure credentials.
  • Data Collection: osintgraph discover TARGET_INSTAGRAM_USERNAME --limit follower=100 followee=100 post=2
  • Analysis: osintgraph agent
  • Visualization: Access Neo4j Console.
  • Documentation: See "Getting Started" and "OSINTGraph AI Agent – Getting Started Guide" sections in the README.

Highlighted Details

  • AI Agent supports natural language queries, keyword/semantic searches, and template-based analyses for accelerated investigations.
  • Neo4j visualization allows interactive exploration of social networks and relationships.
  • AI-powered pre-analysis of posts (image and content) and accounts provides early insights.
  • Supports custom template creation for repeatable and controlled investigations.

Maintenance & Community

  • Dependencies include Instaloader, Neo4j, LangGraph, and Gemini/Google Generative AI.
  • Community insights on avoiding account suspension are provided, crediting @ahmdrz.
  • Links to a GitHub repository for templates are available.

Licensing & Compatibility

  • The README does not explicitly state the license type. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

  • When scraping followers/followees, only usernames and basic profile info are collected; full profiles require separate discover runs.
  • Instagram's limitations may result in partial collection of likes and comments.
  • The tool relies on external APIs (Gemini), and template runs are internally rate-limited.
  • Account suspension can occur if Instagram's terms of service are violated; the README provides tips to mitigate this risk.
Health Check
Last Commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
2
Star History
33 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.