File Fisher - DOJ Epstein Disclosures Downloader

░▒█▀▀▀░░▀░░█░░█▀▀░░░▒█▀▀▀░░▀░░█▀▀░█░░░░█▀▀░█▀▀▄
░▒█▀▀░░░█▀░█░░█▀▀░░░▒█▀▀░░░█▀░▀▀▄░█▀▀█░█▀▀░█▄▄▀
░▒█░░░░▀▀▀░▀▀░▀▀▀░░░▒█░░░░▀▀▀░▀▀▀░▀░░▀░▀▀▀░▀░▀▀

File Fisher - DOJ Epstein Disclosures Downloader

Download documents from the Department of Justice's Epstein Files Transparency Act disclosure portal.

🔰 Never Used GitHub or Command Line Before?

👉 START HERE - Complete Beginner's Guide 👈

This guide explains EVERYTHING step-by-step:

✅ How to download from GitHub (with exact clicks)
✅ Where to put the files
✅ How to open command line (Windows/Mac/Linux)
✅ How to install Python
✅ How to run the program
✅ What to do if something goes wrong

No technical knowledge needed! Written for complete beginners.

🌟 Two Download Methods

Method 1: CSV Downloader (RECOMMENDED) ⭐

✅ Direct download links - no bot detection!
✅ Reliable and fast (~2 sec/file)
✅ 575 files across 12 data sets
📖 CSV Method Guide

Method 2: Web Scraper

⚠️ May encounter bot detection
✅ Auto-discovers new files

🚀 Quick Start

# Setup (run once)
./scripts/setup.sh          # Linux/Mac
scripts\setup.bat           # Windows

# Run
./run.sh                    # Linux/Mac
run.bat                     # Windows

# Or manually
source venv/bin/activate
python src/csv_downloader.py

🎯 Interactive Menu

The menu allows you to:

Select individual data sets (1-12)
Download all data sets at once (option 99)
Add multiple data sets in one session
See your download configuration before starting

📖 Documentation

Beginner's Guide - Step-by-step instructions
CSV Method - Recommended download method
Interactive Menu - Menu guide
Quick Reference - All commands
Test Suite - Testing documentation
Validation Report - Ship readiness validation

📁 Project Structure

Epstein_File_fisher/
├── src/                 # Source code
│   ├── csv_downloader.py   # CSV downloader (recommended)
│   ├── scraper.py          # Web scraper
│   └── config.py           # Settings
├── tests/               # Test suite (21 tests)
│   ├── test_config.py      # Config tests
│   ├── test_csv_downloader.py  # CSV tests
│   ├── test_scraper.py     # Scraper tests
│   ├── test_integration.py # Integration tests
│   └── run_tests.py        # Test runner
├── scripts/             # Setup scripts
│   ├── setup.sh
│   └── setup.bat
├── docs/                # Documentation
├── run.sh / run.bat     # Quick run scripts
└── requirements.txt

💾 Output

Files download to: ~/Documents/Epstein/

Documents/Epstein/
├── data_set_1/
│   ├── documents/  # PDFs
│   ├── videos/     # MP4, MOV
│   ├── audio/      # MP3
│   ├── images/     # JPG, PNG
│   └── archives/   # ZIP
├── data_set_2/
└── ... (12 total)

🛠️ Requirements

Python 3.8+
pip
~150 GB free space (for all files)

📝 Usage Examples

# Interactive menu
python src/csv_downloader.py

# Specific data sets
python src/csv_downloader.py --data-sets 1 2 3

# Custom CSV file
python src/csv_downloader.py /path/to/links.csv --data-sets 8

# Metadata only
python src/csv_downloader.py --no-download

🧪 Testing

The project includes a comprehensive test suite with 21 tests:

# Run all tests
python3 tests/run_tests.py

# Run individual test files
python3 tests/test_config.py
python3 tests/test_csv_downloader.py
python3 tests/test_scraper.py
python3 tests/test_integration.py

Test Coverage:

✅ Configuration validation
✅ CSV downloader functionality
✅ Web scraper initialization
✅ Error handling
✅ End-to-end workflows

See tests/README.md for details.

⚠️ Legal Notice

These are public records from the U.S. Department of Justice. Use responsibly for research, journalism, or public interest purposes.

I deff used AI for this. :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

File Fisher - DOJ Epstein Disclosures Downloader

🔰 Never Used GitHub or Command Line Before?

👉 START HERE - Complete Beginner's Guide 👈

🌟 Two Download Methods

Method 1: CSV Downloader (RECOMMENDED) ⭐

Method 2: Web Scraper

🚀 Quick Start

🎯 Interactive Menu

📖 Documentation

📁 Project Structure

💾 Output

🛠️ Requirements

📝 Usage Examples

🧪 Testing

⚠️ Legal Notice

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
docs		docs
logs		logs
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
GETTING_STARTED.md		GETTING_STARTED.md
README.md		README.md
VALIDATION_REPORT.md		VALIDATION_REPORT.md
requirements.txt		requirements.txt
run.bat		run.bat
run.sh		run.sh

KowaiAI/Epstein_File_fisher

Folders and files

Latest commit

History

Repository files navigation

File Fisher - DOJ Epstein Disclosures Downloader

🔰 Never Used GitHub or Command Line Before?

👉 START HERE - Complete Beginner's Guide 👈

🌟 Two Download Methods

Method 1: CSV Downloader (RECOMMENDED) ⭐

Method 2: Web Scraper

🚀 Quick Start

🎯 Interactive Menu

📖 Documentation

📁 Project Structure

💾 Output

🛠️ Requirements

📝 Usage Examples

🧪 Testing

⚠️ Legal Notice

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages