Massive OSINT index goes public

A new open-source OSINT search engine indexes more than 16 billion rows across roughly 1,300 distinct databases, giving researchers and hunters a single place to query disparate public data. (x.com)

Open-source intelligence is the practice of finding clues in public places — court dockets, company records, social posts, maps, and breach data — and a new search engine says it has put much of that sprawl behind one query box. (github.com, github.com) The project surfaced publicly this week through posts from Anastasis Vasileiadis, who said the index is open source and searchable across more than 16 billion rows from about 1,300 databases. The original post was on X, though the page did not render through this search tool. (24vids.com, x.com) In plain terms, a row is one record in one table — like a single breached account, property filing, or company entry — and the pitch is consolidation. Investigators usually hop between separate tools for usernames, emails, domains, and public records, then correlate the matches by hand. (github.com, github.com) That workflow has produced a crowded market of directories and point tools rather than one standard index. GitHub’s main OSINT topic page lists thousands of projects, while large curated lists still send users out to dozens or hundreds of separate services. (github.com, github.com) The closest commercial comparison is the breach-search business, where companies already advertise giant unified datasets. HackCheck says it searches 16,285,718,986 records, but it is a paid service, not an open-source codebase published for inspection and reuse. (hackcheck.io) That distinction matters for researchers who want to see how sources are joined, filtered, and updated. Open-source projects let other developers audit the code, self-host the stack, and test whether a result came from public records, scraped web data, or breach material. (github.com, github.com) It also sharpens the old argument inside open-source intelligence over what counts as “open.” The United States intelligence definition cited on GitHub’s OSINT page refers to publicly available information, while many practitioner toolkits mix in leaked or breached data that may be widely circulating but not lawfully public. (github.com, github.com) That legal line changes by country and by dataset. European privacy law, United States state breach-notification rules, and website terms of service can all affect whether collecting, indexing, or redistributing a database is allowed. (github.com, github.com) For now, the public facts are the size claim, the open-source framing, and the promise of one place to search many databases at once. Whether the tool becomes a staple for journalists, threat hunters, and amateur investigators will depend on source quality, update cadence, and how carefully it handles data that was public, leaked, or somewhere in between. (hackcheck.io, github.com, github.com)

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.