Sourcebot Takes Aim at the Real Bottleneck in Software: Understanding Code, Not Writing It
Sourcebot, an AI code search tool built for massive codebases, is emerging from Y Combinator’s Fall 2025 batch with a message the rest of the AI tooling industry has been slow to embrace. Writing code is no longer the hard part – understanding it is. And for both human engineers and the AI agents increasingly working alongside them, that understanding gap is the real thing holding large engineering organizations back.
Founded in 2024 by Michael Sukkarieh and Brendan Kellam, Sourcebot is already deployed at some of the world’s largest and most technically demanding companies, including NVIDIA, Red Hat, Wikimedia, and Arista Networks. The San Francisco-based company gives developers and AI agents instant regex search across millions of lines of code, plus the ability to ask questions spanning thousands of repositories using any flagship reasoning model. Because the product is open source and runs on-prem, enterprises can get it deployed in minutes — often before other code intelligence tools have even made it through security review.
The founding story is almost as old as the product. Sukkarieh and Kellam met a decade ago at McGill University and have been building together full-time for more than two years. Before Sourcebot, they wrote and debugged code across some of the most complex systems on the planet, doing tours at Microsoft, EA, Ubisoft, Google, and Meta. That combined experience left them with a strong conviction: at scale, developer productivity is rarely limited by how quickly someone can type, and increasingly rarely by how quickly an AI can autocomplete. It is limited by how long it takes to figure out what already exists, how it connects, and what can safely be changed.
That conviction has only gotten more relevant in the last year. The explosion of coding agents has shifted more and more of the writing to machines, but those machines still hallucinate, still invent non-existent functions, and still fail to fit gracefully into unfamiliar repos. Sourcebot’s thesis is that richer, faster code context is the highest-leverage fix. For developers, it means onboarding into unfamiliar systems dramatically faster. For AI agents, it means generating code that lines up with the conventions and architecture of the wider codebase rather than contradicting it.
Sourcebot’s open-source roots are also part of its moat. Thousands of engineers already rely on the tool in their day-to-day work, many of them at companies whose security posture makes proprietary, cloud-hosted offerings a non-starter. Being able to point to deployments at NVIDIA, Red Hat, Wikimedia, and Arista Networks does two things at once: it reassures risk-averse buyers, and it keeps Sourcebot’s roadmap anchored to what the hardest customers actually need.
The company is tagged by YC in Artificial Intelligence, Developer Tools, DevSecOps, B2B, and Open Source – a rare combination that reflects just how broad the implications of code understanding have become. As AI agents take on a larger share of implementation work, the tools that let them reason over an entire organization’s code will become some of the most strategic software in the stack.