Lightcone Podcast

Anthropic Co-founder: Building Claude Code, Lessons From GPT-3 & LLM System Design

Aug 21, 2025

Summary

The podcast episode features Tom Brown, co-founder of Anthropic, who traces his path from a modest academic record to significant contributions to AI breakthroughs, including the development of GPT-3 at OpenAI and Anthropic's Claude models. The discussion opens with Anthropic's uncertain, mission-driven beginnings: the company started during the COVID-19 pandemic with seven co-founders, limited resources, and established AI giants as competitors. Brown describes the survivalist 'wolf mindset' he sees as essential in startups, in contrast to the task-driven roles of school and large companies. Reflecting on his early experience at the YC startup Grouper, he shows how iterative product development and user-centered design laid the groundwork for his later AI work.

Brown's transition into AI research required intense self-study and overcoming skepticism about AI safety research as a viable career path. A key technical insight came from scaling laws, which revealed that increasing compute predictably improves model capability and guided the strategic emphasis on infrastructure at both OpenAI and Anthropic. Anthropic's culture of radical transparency, built on fully public Slack channels and mission alignment, supported rapid growth and cohesion. The company took a cautious approach to product launches, focusing first on infrastructure until the market expanded after ChatGPT; Claude later evolved into specialized models notably proficient at coding. Brown notes an industry-wide gap between benchmark results and real-world developer preferences, hinting at an 'X factor' in user experience and model capability, and describes how building internal tools like Claude Code, which treats the AI itself as a direct user, boosted both productivity and market success.

The episode closes with reflections on the unprecedented scale of AI infrastructure, hardware-software trade-offs, and career advice favoring intrinsic motivation over traditional credentials, underscoring the dynamic and uncertain landscape of AI innovation.

Key Takeaways

  1. Anthropic's founding was characterized by uncertainty and mission-driven resolve: seven co-founders started the company during the COVID pandemic, without a clear product vision, while facing well-funded competitors like OpenAI.
  2. Tom Brown's path into AI was highly non-traditional, combining startup experience with rigorous self-study to overcome his modest academic record, exemplifying how practical entrepreneurship and self-education can substitute for formal credentials in AI careers.
  3. The discovery and application of scaling laws underpin much of the progress in large language models: with the right training recipe, capability improves predictably (following a smooth power-law trend) as compute increases, which shaped strategic decisions at both OpenAI and Anthropic.
  4. Anthropic's organizational culture of radical transparency, using fully public Slack channels and mission-centric alignment, fostered effective collaboration and cohesion during rapid scaling.
  5. Anthropic's initial product development prioritized robust infrastructure and compute acquisition over immediate market releases; early tools like the Claude Slack bot preceded the widespread commercial adoption catalyzed by ChatGPT.
  6. Focusing on coding capabilities became a deliberate strategic niche for Anthropic, improving product-market fit and enabling emergent agentic coding functionalities that resonated strongly with developers.
  7. Real-world developer preference for Anthropic's coding models significantly outstrips what benchmarks would predict, suggesting that conventional quantitative benchmarks fail to capture key qualitative features valued by users.
  8. Claude Code, developed internally with the AI itself treated as a primary user, demonstrates a user-centric, iterative approach that contributed significantly to the product's external market success.
  9. The podcast highlights the unprecedented scale and rapid growth of AI infrastructure investment, described as on track to be the largest infrastructure build-out in history, surpassing historic projects like Apollo and Manhattan.
  10. Brown's career advice encourages young engineers to value intrinsic motivation, practical skills, and risk-taking over traditional markers like formal degrees or employment at major tech firms, reflecting shifts in AI talent pathways.
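The scaling-law relationship in takeaway 3 can be sketched numerically. This is a minimal illustration of the well-known power-law form, loss falling as L(C) = a * C**(-b); the coefficients here are invented for illustration and are not Anthropic's or OpenAI's actual fitted values:

```python
# Hypothetical power-law scaling of loss with compute: L(C) = a * C**(-b).
# Coefficients a and b are illustrative only.
a, b = 10.0, 0.05

def loss(compute):
    """Illustrative loss for a given compute budget (arbitrary units)."""
    return a * compute ** (-b)

# Each 10x increase in compute multiplies loss by the same constant
# factor, 10**(-b), which is why the curve is a straight line on a
# log-log plot: "reliably you get more intelligence if you spend
# more compute with the right recipe."
ratios = [loss(10 ** (k + 1)) / loss(10 ** k) for k in range(1, 5)]
print(ratios)  # every ratio equals 10**(-b), about 0.891
```

The "predictable" part of the takeaway is exactly this property: the improvement per order of magnitude of compute is constant, so the trend can be extrapolated.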

Notable Quotes

"When we started out, we didn't seem like we were going to be successful at all. OpenAI had a billion dollars and like all of these, all of the star power. And we had seven co-founders in COVID, like trying to build something. And we didn't know if we were necessarily going to make a product or what the products would look like."

"I think by being there with the other co-founders without anyone telling us what to do, basically we had to figure out how to live. The company would die by default. In school there was more of a feeling of people would give me tasks and I would do the tasks. It's kind of like a dog waiting for food to be fed to them in their bowl or something like that. And for that company, it was more like wolves, and we have to hunt our food, otherwise our kids are going to starve or something like that. That mindset has been the most valuable mindset shift that I've had for trying to do bigger, more exciting things."

"At the time I was like okay it seems like sometime in our lifetimes we might end up making transformative AI. If we do that would be the biggest thing. Maybe there's some way that I could help out. But also I got like a B- in linear algebra in college. And so it seemed like at the time you needed to be just top superstar in order to try to help out with that at all. And so I think I had like a lot of uncertainty about whether I would be able to help."

"So I spent like a whole summer, like three months after Grouper, doing that, because honestly I was kind of burned out from Grouper. You know, with startups the highs are high and the lows are low, and we weren't working at the end. Our business wasn't succeeding, our revenue was going down, but my main job was still recruiting engineers. And so I had to pitch them on the dream that I'd had, but it no longer really... it sounds like a death march. Yeah. And so I was super burnt out and I was like, okay Tom, chill out, do some yoga, do some CrossFit, build an art car."

"The main problem that I think we were solving was that it's hard to go and put yourself out there and talk to someone new, and they might just be like, I don't want to talk to you. You seem weird. And so we solved that by just blind matching. Tinder came out while we were doing Grouper, and Tinder solved that same problem with both people having to show interest before you get matched, so there's also no worries about getting rejected. And I think they just had a better solution to that same problem. So good work Tinder. Good work all the swipers."

"There's a paucity of people who know both machine learning and distributed systems. So like yes you should do that."

"The big breakthrough in GPT-3 was like use more compute and using GPUs."

"Definitely like seeing that line of reliably you get more intelligence if you spend more compute with the right recipe was the main thing that, at least for me, was like: this is a thing that's happening, happening now."

"I think we had a culture where like everything is on Slack. 100% of things on Slack. And within that, all public channels. Great communication."

"And so I think that group was very focused on like how do we make sure that that's taken seriously enough, and that we've built an institution that can handle the weight of that."

"Yeah, I wish that we had like more foresight on that. But no, I think it was surprising for us too, how big of a deal it was. And then I think 3.7 Sonnet also surprised us by how much it unlocked agentic coding. I think for each of these things, yeah, we move quite fast in rolling them out. And so we really often don't know what the results are going to be there."

"If you poll like the YC founders, they prefer using Anthropic models for coding by like a huge margin. It's much larger than what you would predict if you just looked at the benchmark results. So there seems to be some X factor that makes people really like these models for coding."

"She had the compiled binary and she was like, Claude, can you decompile this? Like, yeah, can you disassemble the assembly? And Claude chewed on it for 10 minutes and like made a C version of it. And so then she had the thing that she can modify. It's insane."

"We don't teach to the test. Because I do feel like if you start doing that, then it has weird bad incentives. Maybe we could put that team under marketing or something like that and then ignore all the benchmarks. But I think that that's one reason why there's some train test mismatch there."

"One thing that's interesting to look at is just that humanity is on track for like the largest infrastructure build-out of all time. This is gonna be larger than the Apollo project, larger than the Manhattan project. It'll be bigger than both of them next year, if it keeps on the current trajectory, which is like roughly three X per year increase in spending on AGI compute, which is just bonkers."
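The quoted trajectory compounds quickly. A minimal sketch of what roughly 3x per year implies, using an illustrative starting spend of 1 unit rather than any actual dollar figure:

```python
# Compounding the quoted ~3x-per-year growth in AGI compute spending.
# The starting spend of 1 unit is illustrative, not a real dollar amount.
spend = [3 ** year for year in range(5)]
print(spend)  # [1, 3, 9, 27, 81]: nearly two orders of magnitude in four years
```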