░▒█▀▀▀░░▀░░█░░█▀▀░░░▒█▀▀▀░░▀░░█▀▀░█░░░░█▀▀░█▀▀▄
░▒█▀▀░░░█▀░█░░█▀▀░░░▒█▀▀░░░█▀░▀▀▄░█▀▀█░█▀▀░█▄▄▀
░▒█░░░░▀▀▀░▀▀░▀▀▀░░░▒█░░░░▀▀▀░▀▀▀░▀░░▀░▀▀▀░▀░▀▀
Download documents from the Department of Justice's Epstein Files Transparency Act disclosure portal.
This guide explains EVERYTHING step-by-step:
- ✅ How to download from GitHub (with exact clicks)
- ✅ Where to put the files
- ✅ How to open command line (Windows/Mac/Linux)
- ✅ How to install Python
- ✅ How to run the program
- ✅ What to do if something goes wrong
No technical knowledge needed! Written for complete beginners.
- ✅ Direct download links - no bot detection!
- ✅ Reliable and fast (~2 sec/file)
- ✅ 575 files across 12 data sets
- 📖 CSV Method Guide
Web scraper (src/scraper.py):
- ⚠️ May encounter bot detection
- ✅ Auto-discovers new files
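As a rough illustration of the CSV method, the sketch below reads a links CSV and groups the URLs by data set. It is not the code in src/csv_downloader.py: the function name `load_links` and the column names `data_set` and `url` are assumptions made for this example, and the real CSV layout may differ.

```python
import csv
import io
from typing import Dict, List


def load_links(csv_text: str, wanted_sets: List[int]) -> Dict[int, List[str]]:
    """Group download URLs by data set number, keeping only the requested sets."""
    links: Dict[int, List[str]] = {}
    reader = csv.DictReader(io.StringIO(csv_text))
    for row in reader:
        data_set = int(row["data_set"])  # hypothetical column name
        if data_set in wanted_sets:
            # "url" is also a hypothetical column name
            links.setdefault(data_set, []).append(row["url"].strip())
    return links


# Tiny in-memory sample with placeholder URLs, so nothing is downloaded
sample = (
    "data_set,url\n"
    "1,https://example.com/a.pdf\n"
    "2,https://example.com/b.mp4\n"
    "1,https://example.com/c.zip\n"
)
selected = load_links(sample, wanted_sets=[1])
print(selected)  # {1: ['https://example.com/a.pdf', 'https://example.com/c.zip']}
```

Because the links come from a file rather than from crawling pages, a downloader built this way only has to fetch each URL directly.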
# Setup (run once)
./scripts/setup.sh # Linux/Mac
scripts\setup.bat # Windows
# Run
./run.sh # Linux/Mac
run.bat # Windows
# Or manually
source venv/bin/activate
python src/csv_downloader.py
The menu allows you to:
- Select individual data sets (1-12)
- Download all data sets at once (option 99)
- Add multiple data sets in one session
- See your download configuration before starting
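The selection rules above (sets 1-12, option 99 for everything, several entries per session) could be handled along these lines. This is only a sketch: `add_selection` and the constants are names invented for this example, and the actual menu code may behave differently, for instance in how it treats invalid input.

```python
from typing import Set

ALL_SETS = list(range(1, 13))  # data sets 1-12, per the menu description
ALL_OPTION = 99                # "download all" option, per the menu description


def add_selection(current: Set[int], choice: str) -> Set[int]:
    """Return the selection after one menu entry; invalid input is ignored (an assumption)."""
    try:
        number = int(choice.strip())
    except ValueError:
        return current
    if number == ALL_OPTION:
        return set(ALL_SETS)
    if number in ALL_SETS:
        return current | {number}
    return current


# Simulate a session: two valid picks, one non-number, one out-of-range number
selection: Set[int] = set()
for entry in ["3", "8", "abc", "13"]:
    selection = add_selection(selection, entry)
print(sorted(selection))  # [3, 8]
```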
- Beginner's Guide - Step-by-step instructions
- CSV Method - Recommended download method
- Interactive Menu - Menu guide
- Quick Reference - All commands
- Test Suite - Testing documentation
- Validation Report - Ship readiness validation
Epstein_File_fisher/
├── src/ # Source code
│ ├── csv_downloader.py # CSV downloader (recommended)
│ ├── scraper.py # Web scraper
│ └── config.py # Settings
├── tests/ # Test suite (21 tests)
│ ├── test_config.py # Config tests
│ ├── test_csv_downloader.py # CSV tests
│ ├── test_scraper.py # Scraper tests
│ ├── test_integration.py # Integration tests
│ └── run_tests.py # Test runner
├── scripts/ # Setup scripts
│ ├── setup.sh
│ └── setup.bat
├── docs/ # Documentation
├── run.sh / run.bat # Quick run scripts
└── requirements.txt
Files download to: ~/Documents/Epstein/
Documents/Epstein/
├── data_set_1/
│ ├── documents/ # PDFs
│ ├── videos/ # MP4, MOV
│ ├── audio/ # MP3
│ ├── images/ # JPG, PNG
│ └── archives/ # ZIP
├── data_set_2/
└── ... (12 total)
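To show how a file might end up in one of these folders, here is a minimal sketch that picks the subfolder from the file extension. The folder names and extensions come from the tree above; the function `target_path`, the lookup table, and the fallback to `documents/` for unknown extensions are assumptions for illustration, not the project's actual logic.

```python
from pathlib import Path

# Folder names and extensions from the tree above; the mapping itself is an assumption
FOLDER_BY_EXTENSION = {
    ".pdf": "documents",
    ".mp4": "videos",
    ".mov": "videos",
    ".mp3": "audio",
    ".jpg": "images",
    ".png": "images",
    ".zip": "archives",
}


def target_path(base_dir: Path, data_set: int, filename: str) -> Path:
    """Build the destination path for a downloaded file."""
    extension = Path(filename).suffix.lower()
    # Falling back to "documents" for unknown types is an assumption
    folder = FOLDER_BY_EXTENSION.get(extension, "documents")
    return base_dir / f"data_set_{data_set}" / folder / filename


base = Path.home() / "Documents" / "Epstein"
print(target_path(base, 1, "report.pdf"))
```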
- Python 3.8+
- pip
- ~150 GB free space (for all files)
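If you want to check these requirements before starting a full download, a small script like the one below could help. It is a sketch using only the standard library; the helper names are made up for this example, and the 150 GB figure is the approximate number from the list above.

```python
import shutil
import sys
from pathlib import Path

REQUIRED_PYTHON = (3, 8)
REQUIRED_FREE_GB = 150  # approximate figure from the requirements list


def python_ok(version_info=sys.version_info) -> bool:
    """True if the interpreter is at least Python 3.8."""
    return tuple(version_info[:2]) >= REQUIRED_PYTHON


def free_space_gb(path: Path) -> float:
    """Free space, in GiB, on the drive that holds `path`."""
    return shutil.disk_usage(path).free / 1024**3


print("Python version OK:", python_ok())
free_gb = free_space_gb(Path.home())
print(f"Free space: {free_gb:.1f} GB")
if free_gb < REQUIRED_FREE_GB:
    print("Warning: probably not enough room for all 12 data sets.")
```

You only need the full amount of free space if you plan to download every data set.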
# Interactive menu
python src/csv_downloader.py
# Specific data sets
python src/csv_downloader.py --data-sets 1 2 3
# Custom CSV file
python src/csv_downloader.py /path/to/links.csv --data-sets 8
# Metadata only
python src/csv_downloader.py --no-download
The project includes a comprehensive test suite with 21 tests:
# Run all tests
python3 tests/run_tests.py
# Run individual test files
python3 tests/test_config.py
python3 tests/test_csv_downloader.py
python3 tests/test_scraper.py
python3 tests/test_integration.py
Test coverage:
- ✅ Configuration validation
- ✅ CSV downloader functionality
- ✅ Web scraper initialization
- ✅ Error handling
- ✅ End-to-end workflows
See tests/README.md for details.
These are public records from the U.S. Department of Justice. Use responsibly for research, journalism, or public interest purposes.
I definitely used AI for this. :)
