Headline
The picture changes when the FBI vault is OCR'd
The original textual subset (Department of War mission reports + NASA debriefings + State cables) contained one explicit "extraterrestrial" reference and zero claims of recovered material, biologics, or non-human intelligence. With OCR applied to the FBI 62-HQ-83894 vault sections, that picture shifts: contactee-era vocabulary, claims of recovered magnesium alloy, "humanoid space beings", "interplanetary" references, and Cold-War-era discussion of UFO origin appear repeatedly.
Important caveat: almost all FBI-vault hits sit inside incoming public correspondence — citizens writing to J. Edgar Hoover claiming UFO encounters, or media clippings the Bureau filed for reference. They are not government conclusions. Hoover's outgoing replies are bureaucratic non-answers ("the FBI is an investigative agency and does not draw conclusions"). The hits document what people claimed and what the FBI was tracking, not what the government acknowledged.
NHI · biologics · occupant lexicon — side by side
Hits per pattern · Modern subset (2020s) vs FBI vault (1947–1968)
Same patterns evaluated against both corpora. Modern subset = the 91 OCR'd Department of War / NASA / State documents. FBI vault = sections OCR'd by the new tesseract pipeline.
Physics-anomaly lexicon — side by side
Hits per pattern · Modern subset vs FBI vault
Both corpora are sparse on physics-defying language. The FBI vault contains contactee-era claims of unusual capability, but the explicit "right-angle turn / instantaneous acceleration / no propulsion" structured-form language is absent.
NHI references by FBI section
Per-section NHI breakdown
Stacked bar showing which OCR'd FBI sections contribute which NHI categories. As more sections complete OCR, this chart grows.
Recovered testimony — verbatim NHI quotes
Extraterrestrial / interplanetary / planetary origin
Recovery / biologics / occupant / contactee
Quotes pulled verbatim from OCR text. Some are noisy due to scan quality; click through to the section dashboard for the source page.
Recovered testimony — physics-anomaly quotes
Pattern matches with surrounding sentence context. False positives are likely from OCR noise — review the underlying page in the section dashboard before treating any single quote as a finding.
Honest read
What the new OCR adds
OCR'ing the FBI vault scans turns ~1,800 pages of previously unsearchable historical record into machine-readable text. The first section to clear the pipeline (Section 10, 184 pages) yielded 78,556 words, ten "extraterrestrial" references, four "contactee" references, four planetary-origin references, and one explicit recovered-alloy claim. The historical record is materially richer in disclosure-era vocabulary than the modern operational subset.
What the FBI vault is, on inspection
The FBI 62-HQ-83894 file is a public-correspondence archive: civilians writing to Hoover with theories, sightings, and concerns; media clippings the Bureau filed for awareness; Hoover's standard non-committal replies. The presence of NHI language tracks who the public was when they wrote, not what the FBI investigated or concluded. The investigative substance — if any — is in the case files, not the file itself.
One concrete material claim worth tracing
Section 10 contains a verbatim claim of UFO debris recovered "on at least three occasions, in one case described as a magnesium alloy, in another as pure magnesium, and in a third case, attributed to an official of the Canadian government." This is a traceable historical claim — possibly referencing the 1947 Maury Island incident or related episodes. It belongs in the file as testimony / reference; whether it survives independent corroboration is a separate exercise.
What this does NOT change
The modern subset (DoW mission reports 2020-2024, NASA debriefings, State cables) still contains zero NHI/biologics/recovery vocabulary in its searchable text. The Greece 2023 "right-angle turns" remains an 80-mph object. The 1994 Kazakhstan cable remains one captain's opinion. The procedural release of the FBI vault adds historical breadth, not contemporary disclosure.