open-source-research-notebooks  by bellingcat

Jupyter notebooks for digital investigations

created 1 year ago
258 stars

Top 98.6% on sourcepulse

GitHubView on GitHub
Project Summary

This repository provides a collection of Jupyter notebooks designed to assist open-source researchers, journalists, and fact-checkers in conducting digital investigations using command-line tools and code. It offers a structured way to learn and apply these techniques, simplifying complex workflows and enhancing documentation.

How It Works

The project leverages Jupyter Notebooks for their interactive nature, combining documentation with executable code. This approach simplifies software installation and execution, reduces "works on my machine" issues, and provides a safer environment for running unfamiliar code, especially when hosted on platforms like Google Colaboratory or Binder. Notebooks can execute both bash commands and Python code, allowing for versatile digital investigation tasks.

Quick Start & Requirements

Notebooks can be run directly via services like Google Colab, Binder, or Kaggle Notebooks. A basic understanding of Jupyter Notebooks is recommended.

Highlighted Details

  • Notebooks cover a range of tools including Telegram phone number checking, Wayback Machine analytics, geoclustering, VKontakte scraping, and search grid generation.
  • Advanced notebooks include AI-powered audio/video transcription (OpenAI Whisper), email-to-account tracing (Holehe), username-to-account tracing (Maigret), Wayback Machine website downloading, video downloading (YT-DLP), metadata analysis (ExifTool), and AI face comparison (Deepface).
  • Notebooks are categorized by required technical expertise, from entry-level to those requiring higher technical know-how or code adaptation.

Maintenance & Community

This project is associated with Bellingcat, a well-known investigative journalism group. Further community-developed tools and methods are also included.

Licensing & Compatibility

The repository's license is not explicitly stated in the provided README. Compatibility for commercial use or closed-source linking would require clarification of the licensing terms.

Limitations & Caveats

The README does not specify the exact license, which may impact commercial use. Some notebooks may require specific technical understanding or code adaptation beyond basic execution.

Health Check
Last commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
14 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Carol Willing Carol Willing(Core Contributor to CPython, Jupyter).

genai by rgbkrk

0%
352
IPython extension for generative AI assistance in Jupyter notebooks
created 3 years ago
updated 1 year ago
Feedback? Help us improve.