CongressionalEpstein Files Community Platforms & Research Tools
0 EFTA citations1,583 words1 persons referenced
A comprehensive directory of platforms, tools, and resources for searching, exploring, and analyzing the Jeffrey Epstein DOJ files released under the Epstein Files Transparency Act (EFTA), 2025-2026.
Epstein Files Community Platforms & Research Tools
A comprehensive directory of platforms, tools, and resources for searching, exploring, and analyzing the Jeffrey Epstein DOJ files released under the Epstein Files Transparency Act (EFTA), 2025-2026.
Last updated: 2026-02-15
Official Government Sources
| ---------- | ----- | ------------- |
| DOJ Epstein Library | justice.gov/epstein | Official EFTA repository. Datasets 1-12, court records, FOIA. Bulk downloads removed Feb 6, 2026. |
| House Oversight Committee | oversight.house.gov | Estate documents (20,000+ pages), DOJ-provided records, photo releases. |
| Platform | URL | Coverage | Description |
| ---------- | ----- | ---------- | ------------- |
| Epstein Exposed | epsteinexposed.com | 1.5M+ docs, 1,480 persons | Interactive network graph, flight map, timeline, degrees-of-separation pathfinding. Blog describes methodology. |
| Epstein Suite | epsteinsuite.com | DOJ + emails + flights | AI summaries, OCR search, entity extraction. "Type two names and trace the shortest path." |
| EFTA Search | eftasearch.com | EFTA releases | AI summaries, thumbnail previews, semantic filters. Built for journalists. |
| Sifter Labs | epstein-files.org | 33,891 docs | Semantic search, entity extraction, embeddings. 200+ hours AI/OCR processing. Creator: Andrew Walsh MD, PhD. |
| Epstein Secrets | epsteinsecrets.com | 33,682 docs, 70K entities | Network visualization (5,500 nodes), 30 content categories, hop distance from Epstein. |
| SearchTheFiles | searchthefiles.com | Flights, black book, court docs | Focus on flight logs, address book, arrest warrants, trial transcripts. 10GB+ unsealed docs. |
| EpsteinWeb | epsteinweb.org | Oversight emails | OCR search on scanned documents, degrees-of-separation visualization. |
| Epstein Unboxed | epsteinunboxed.com | Document corpus | AI-powered search with answers and citations. |
Jmail Ecosystem
| ---------- | ----- | ------------- |
| Jmail | jmail.world | Gmail-like interface to browse Epstein's emails. CJR-recognized. Creators: Riley Walz & Luke Igel. |
| JWiki | jmail.world/wiki | AI-generated encyclopedia grounded in Jmail data and government releases. |
| Jemini | jmail.world (integrated) | Natural-language AI search across EFTA text releases. |
| Jamazon | jmail.world (integrated) | Epstein's Amazon order history. |
| Jacebook, Jmessage, Jotify, Jdrive | jmail.world (integrated) | Facebook, iMessage, music, and Google Drive viewers. |
| ---------- | ----- | ------------- |
| Zeteo | zeteo.com | 26,039 searchable documents. Active coverage by Mehdi Hasan. |
| Boston Globe | bostonglobe.com | Oversight JPGs converted to PDFs on DocumentCloud with OCR. |
| CBS News | cbsnews.com | Searchable interface with Data Set and document type filtering. |
| Al Jazeera | aljazeera.com | Visual navigation guide to datasets, people, and timeline. |
| DocumentCloud | documentcloud.org | Multiple journalist-uploaded collections with full-text search. |
Network Visualization & Analysis
| ---------- | ----- | ------------- |
| SvetimFM Visualizations | GitHub / Live | 68,798 docs. Entity network, timeline, financials (16,169 txns), UMAP embeddings. Claude 3 Haiku NER. |
| Alvaro Franco Martins | alvarofrancomartins.com | K-core graph decomposition, AI relationship summaries. Academic analysis. |
| Epstein Network (phelix001) | GitHub | 19,154 docs. Extracted hidden/redacted content from DS8 annotation overlays. |
| SomaliScan | somaliscan.com | Cross-references Epstein entities against PPP loans, FEC contributions, federal grants. |
GitHub Repositories
| ---------- | ----- | ------------- |
| Epstein-research-data (ours) | GitHub | All 12 DS processed. 1.38M docs, 2.73M pages. Full-text corpus, redaction DB, transcripts, person registry. |
| Epstein-research (ours) | GitHub | 100+ forensic investigation reports, analysis scripts, replication package. |
| Epstein File Explorer (Donnadieu) | GitHub / Live | All 12 DS. PDF viewer, people directory, D3 network, timeline, full-text search. |
| michelcrypt4d4mus | GitHub / PyPI | Color-coded highlighting. Text messages, chronological emails, regex search. Installable Python package. |
| EpsteinFiles (markramm) | GitHub | Whoosh-based full-text search for 2,895 Oversight docs. Flask web UI. |
| epstein-document-search (paulgp) | GitHub | Download, process, index court docs into Meilisearch. |
| FULL_EPSTEIN_INDEX (theelderemo) | GitHub / HuggingFace | Unified multi-source archive. Living archive, MIT License. |
| EES (Dicklesworthstone) | GitHub | Client-side zero-backend email search. Two-index system for instant search. |
| ---------- | ----- | ------------- |
| EpsteinFiles-RAG | GitHub | RAG pipeline on 20K dataset. Chroma vector DB, Groq LLM, Streamlit UI. |
| Epstein RAG MCP | GitHub | MCP server for local LLMs. Qdrant vector store. Works with Claude Desktop. |
| Epstein Email MCP | GitHub | MCP demo with 2,322 emails. OpenAI Apps SDK integration. |
| Epsteingate | epsteingate.org | AI-ranked analysis of 20K+ docs. GPT scoring 0-100 per passage. Power linkage analysis. |
Wiki / Encyclopedia
| ---------- | ----- | ------------- |
| Epstein Wiki | epsteinwiki.com | Community-driven collaborative research platform. Cross-references filings, depositions, FOIA, journalism. |
Datasets (Hugging Face)
| --------- | ----- | ------------- |
| EPSTEIN_FILES_20K (tensonaut) | HuggingFace | 25K+ text files from Oversight releases. Tesseract OCR. |
| epstein-files-20k (teyler) | HuggingFace | 20K dataset for RAG evaluation. |
| Post-OCR Embeddings (svetfm) | HuggingFace | 69,290 document embeddings (nomic-embed-text, 768-dim). CC-BY-4.0. |
| FBI Files (svetfm) | HuggingFace | FBI-specific files dataset. |
| Parsed Emails (notesbymuneeb) | HuggingFace | 5,082 threads, 16,447 messages. Structured metadata. |
| FULL_EPSTEIN_INDEX (theelderemo) | HuggingFace | Mirror of GitHub unified archive. |
| ---------- | ----- | ------------- |
| yung-megafone | GitHub | Primary community mirror. Torrent magnets, Archive.org mirrors, integrity verification. |
| Surebob downloader | GitHub | CLI download/archive tool with torrent support and checksums. |
| efgrabber (segin) | GitHub | High-performance multithreaded downloader. Up to 1,000 concurrent. Qt5 GUI. |
| EpsteinLibraryMediaScraper | GitHub | Scrapes media URLs from DOJ Epstein Library. Node.js. |
| Apify Scraper API | apify.com | Commercial scraping API. Programmatic HTTP access. |
Archive.org Collections
| ------------ | ----- | ------------- |
| Datasets 1-5 Combined | archive.org | Complete searchable collection of first 5 DOJ Data Sets. |
| Dataset 10 (incomplete) | archive.org | Incomplete DS10 mirror. |
| ---------- | ----- | ------------- |
| EpsteIn (LinkedIn scanner) | GitHub | Scans your LinkedIn connections against the Epstein files. Local privacy. Uses DugganUSA API. Covered by 404 Media. |
| DocETL Explorer | docetl.org | AI analysis of 2,322 emails. Tone/topic analysis, entity extraction. Built for $8.04. |
| Epstein Files Browser | GitHub / Live | Browse/view files with celebrity recognition filtering. |
| ----------- | ----- | ------------- |
| r/Epstein | reddit.com/r/Epstein | Active research community. Rep. Maxwell Frost engaged directly for file identification help. |
| ---------- | ----- | ------------- |
| Erkan's Field Diary | erkansaka.net | Curated tool directory with use-case recommendations. |
| SOCRadar Guide | socradar.io | Cybersecurity-focused guide. Warns about malware in torrents. |
| Eren Ozkaradeniz | medium.com | Technical walkthrough of tools and DOJ structure. |
| Axios | axios.com | Overview of available tools and search methods. |
Summary
| Official Government Sources | 3 |
| Comprehensive Search Platforms | 9 |