Skip to main content
Skip to content
Reference

Data Dictionary

A complete reference for every field, classification, and category used in the Epstein Exposed database. For researchers building on our data or journalists verifying our methodology.

1,626,107
Documents
1,593
Persons
1,708
Flights
9,961
Emails

Persons

Each person in the database is identified by a unique slug and categorized by their primary role relevant to the Epstein case. Person data includes biographical information, document associations, flight records, and known connections.

Fields

FieldTypeDescription
idstringUnique identifier (e.g., 'jeffrey-epstein')
slugstringURL-safe identifier used in page URLs
namestringFull display name
aliasesstring[]Known alternative names or spellings
categoryenumPrimary classification (see categories below)
descriptiontextDetailed biography and Epstein case involvement
shortBiostringOne-line summary for listings and cards
nationalitystringPrimary nationality
notablePositionsstring[]Professional titles and roles
blackBookEntrybooleanWhether they appear in Epstein's contact book
blackBookPhonesnumberCount of phone numbers in the black book
tagsstring[]Freeform labels (e.g., 'victim', 'inner-circle')
imageUrlstring?Headshot URL (Wikimedia Commons with attribution)
flightCountcomputedNumber of flight log appearances
documentCountcomputedNumber of associated documents
connectionCountcomputedNumber of known connections to other persons
emailCountcomputedNumber of associated emails

Person Categories

politician

Elected officials, government appointees, diplomats, political operatives

business

Business executives, entrepreneurs, financial professionals, hedge fund managers

royalty

Members of royal families and aristocratic lineages

celebrity

Entertainers, actors, musicians, media personalities, public figures

associate

Known associates of Epstein, staff members, personal contacts, recruiters

legal

Attorneys, judges, prosecutors, law enforcement involved in Epstein cases

academic

Professors, researchers, university administrators, scientists

socialite

High-society figures, philanthropists, socialites

military-intelligence

Military officers, intelligence operatives, national security officials

other

Persons who don't fit neatly into other categories

Documents

Documents range from single-page court orders to multi-thousand-page depositions. Each document is categorized by source, linked to mentioned persons, and full-text searchable via OCR extraction.

Fields

FieldTypeDescription
idstringUnique document identifier (e.g., 'gov.uscourts.nysd.447706.1090.0')
titlestringDocument title or filename
categorystringDocument type classification
sourceenumOrigin source (see sources below)
datestring?Document date (ISO format when available)
summarytextAI-generated or manually written summary
personIdsstring[]IDs of persons mentioned (auto-linked + manual)
pdfUrlstring?Direct URL to original PDF
pageCountnumber?Number of pages (when known)
ocrTexttext?Full OCR-extracted text (156K+ documents)
redactionScorefloat?Percentage of document that is redacted (0-100)

Document Sources

court-filing

Primary court documents filed in federal and state proceedings

deposition

Sworn testimony taken under oath during pre-trial discovery

fbi

FBI investigative reports and memoranda obtained via FOIA or court releases

doj

Department of Justice official releases, indictments, plea agreements

foia

Freedom of Information Act responses from federal agencies

financial

Financial records, wire transfers, bank statements, tax documents

media

Journalistic investigations, news reports, interview transcripts

victim-statement

Victim impact statements, survivor testimonies

police-report

Law enforcement investigation reports and records

subpoena

Grand jury and trial subpoenas (257 subpoenas indexed)

efta

Electronic File Transfer Agreement documents from SDNY prosecution (28,942 PDFs)

other

Documents that don't fit standard categories

Flights

Flight records from Epstein's private aircraft, primarily the Boeing 727 ("Lolita Express") and various Gulfstream jets. Sourced from FAA records and pilot logbooks entered into court evidence.

Fields

FieldTypeDescription
idstringUnique flight identifier
datestringFlight date (ISO format)
aircraftstringAircraft type (e.g., 'Boeing 727-31')
tailNumberstringFAA registration number (e.g., 'N908JE')
originstringDeparture location
destinationstringArrival location
passengersarrayList of passengers with name and person ID
sourcestringData source (e.g., 'pilot-log', 'faa-record')

Connections

Connections represent relationships between persons, derived from shared documents, co-flights, court testimony, and manual research. Each connection has an evidence-based strength rating.

Connection Strength Levels

strong

Direct evidence from court documents: co-defendants, employer/employee, victim/perpetrator, family, or 3+ shared documents

moderate

Circumstantial evidence: shared flights, phone contacts, mentioned together in depositions, 1-2 shared documents

weak

Tangential connections: same social circles, one-time mentions, disputed or unconfirmed links

Fields

FieldTypeDescription
personId1stringFirst person in the pair
personId2stringSecond person in the pair
strengthenumstrong | moderate | weak
summarytextDescription of the relationship and evidence
coFlightsnumberNumber of shared flights
coDocumentsnumberNumber of shared documents
relationshipTypestring?Typed classification (e.g., traveled_with, associated_with)
weightnumber?Numeric weight for network graph rendering
dateFirststring?Earliest evidence date
dateLaststring?Latest evidence date

Trust Levels

Every piece of data is assigned a trust level based on its source reliability and verification status. This helps researchers assess the confidence level of any claim.

Court Document

Primary sources filed with courts — highest reliability. Includes unsealed filings, exhibits, and judicial rulings.

Official Record

Government records: DOJ releases, FBI reports, FOIA responses, flight logs. Authenticated by issuing agencies.

Sworn Testimony

Depositions and trial testimony given under oath. Subject to perjury penalties but represents one party's account.

Investigative

Journalist investigations and researcher findings. Cross-referenced where possible but not primary sources.

Unverified

Uncorroborated claims, anonymous tips, or single-source information. Included for completeness with clear labeling.

Person Matching Methodology

Documents are automatically linked to persons using a multi-step matching pipeline:

  1. Multi-word name matching — Names with 2+ words are matched using word-boundary regex against document OCR text. This prevents false positives (e.g., "Prince" alone matching Prince Andrew).
  2. Single-word name exclusion — Single-word names (e.g., "Maxwell") are never auto-matched due to legal liability. These are only linked through manual review.
  3. Co-occurrence analysis — When two persons appear in the same document, a co-occurrence record is created. The database tracks 25,700+ person pairs.
  4. Mention context extraction — For each person-document link, a snippet of surrounding text is stored to show how the person is referenced.

Data Access

All data is available for researchers and journalists:

This data dictionary was last updated on February 17, 2026. If you have questions about our methodology or data structure, please visit our contribute page or reach out through the community forum.