

SWE Automation Is the Killer App

Why coding agents will be the sticky application that remains after the current wave of AI hype passes

The UI Moment, Again

In November 2022, ChatGPT's launch felt like a paradigm shift. Suddenly, everyone was talking about AI. The technology press couldn't stop writing about it. Your relatives asked you about it at Thanksgiving. It seemed to appear overnight.

But it didn't appear overnight. Large language models had existed for years. GPT-3 launched in 2020. BERT came out in 2018. The transformer architecture that powers all of this dates back to 2017. Researchers had been steadily improving these systems, publishing papers, iterating on architectures, scaling up training runs.

What changed in November 2022 wasn't the underlying technology. It was the interface. ChatGPT made a simple bet: put a language model behind a chat interface and let anyone use it for free. No API keys, no Python notebooks, no papers to read. Just a text box and a conversation.

That interface unlocked something. It made the capability legible to non-researchers. People could feel what the model could do by talking to it. The chat format was familiar—everyone texts, everyone uses messaging apps. Within that familiar frame, the AI's capabilities became visible. It could write, reason, explain, code, analyze. The technology had been there, but the interface made it undeniable.

This is how technologies break out of research labs. Not through incremental improvements to benchmarks, but through interfaces that make capabilities viscerally obvious to normal humans.

The Industry Is Stumbling Around in the Dark

Since ChatGPT's launch, we've seen an explosion of AI applications. Every software category has been AI-ified: writing assistants, image and video generators, AI search, customer-support bots, voice transcription.

The list goes on. There's an AI tool for everything. Venture funding has flooded into AI startups. Every company is adding "AI-powered" to their feature list. We're in the hype phase.

What Sticks?

But hype cycles pass. The question isn't what's hot right now, but what remains when the novelty wears off. What application of LLMs is so fundamentally valuable that it becomes infrastructure? What's the killer app that justifies all this investment and attention?

I think the answer is software engineering automation.

Not because it's the flashiest application. Not because it's the most consumer-facing. But because it's the one where all the fundamentals line up:

The Value Is Measurable

Most AI applications exist in fuzzy value territory. Is an AI-generated image "better" than one you'd commission from an artist? Depends on your taste. Does an AI writing assistant make you a better writer? Hard to say; maybe you'd have written something better yourself, given time.

Software engineering is different. The value is concrete: features shipped, bugs fixed, hours saved.

There's no subjectivity. If an AI coding assistant lets you ship twice as fast with the same quality, that's not a matter of taste. It's a measurable business outcome.

And the business outcomes are enormous. Software engineering is expensive. Senior engineers at tech companies make $300K–500K+ total comp. At scale, even modest productivity gains are worth millions. If an AI tool makes a 100-person engineering team 20% more productive, that's like adding 20 engineers—$6–10M/year in value.
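A back-of-the-envelope check on that math. All the figures here are the illustrative assumptions from the paragraph above, not measurements from any particular company:

```python
# Value of a 20% productivity gain for a 100-person team,
# using illustrative cost figures ($300K-500K fully loaded).
team_size = 100
productivity_gain = 0.20
cost_per_engineer_low = 300_000   # $/engineer/year, low end
cost_per_engineer_high = 500_000  # high end

effective_engineers_added = team_size * productivity_gain
value_low = effective_engineers_added * cost_per_engineer_low
value_high = effective_engineers_added * cost_per_engineer_high

print(f"Equivalent headcount: {effective_engineers_added:.0f}")      # 20
print(f"Annual value: ${value_low/1e6:.0f}M-${value_high/1e6:.0f}M")  # $6M-$10M
```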

Companies will pay for that. They already are.

Users Who Are Willing to Pay

Consumer AI applications face a harsh reality: users expect AI to be free. In the long term, it largely will be. You'll pay for power and perhaps some hardware, but you won't subscribe to a service. Fundamentally, there are few moats in AI. The data is largely public (give or take debates over fair use), and the model architectures for LLMs are openly published. The real gatekeeper is access to compute, and even that is eroding: through techniques like model distillation and quantization, the vast majority of capability can be captured and run on very limited hardware. The remaining barriers for self-hosted consumer AI come down to ease of setup, use, and maintenance.

But as many who have instituted paywalls (newspapers, apps, etc.) have found, charging even one cent for a service causes a drastic reduction in the userbase. I fully expect that in the long term, consumer AI will be served by something akin to a personal AI computer: a device you buy from a manufacturer like Apple that gives you a simple interface to a local model, one that feels no worse than the trillion-parameter Opus-class models running in datacenters.

Businesses have a different value calculus and are far less cost-sensitive. Time and human capital matter as much as dollars, and being able to move faster with fewer resources is worth the extra monetary cost. SaaS as a business model has demonstrated this: Workday, Justworks, Jira, Slack, and Tableau all automate common business functions, letting companies run with less support staff, move faster, and coordinate better. That doesn't mean you have no HR specialists, project managers, accountants, or data analysts, but you certainly don't need as many.

For a long time, software engineering was an exception. It was a high-skill job, and for the most part you could never have enough great engineers. But with tools like Claude Code, one senior engineer with clear intention can implement a feature that used to take a month and a few junior implementers. A $20/month AI coding assistant that saves even an hour per week is worth 10x its cost for an engineer making $150K/year. There's also a management aspect: regulating access to codebases and infrastructure, enforcing best practices. Having a company-wide provider of AI services allows some semblance of control and guarantees about how employees conduct their work.
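The per-seat version of the same arithmetic, again with the assumed figures from the paragraph above:

```python
# Rough ROI of a $20/month assistant that saves one hour per
# week for an engineer costing $150K/year. Illustrative numbers.
salary = 150_000
hours_per_year = 52 * 40                 # ~2080 working hours
hourly_cost = salary / hours_per_year    # ~ $72/hour

hours_saved_per_month = 52 / 12          # one hour per week
monthly_value = hours_saved_per_month * hourly_cost  # ~ $312
tool_cost = 20

roi_multiple = monthly_value / tool_cost  # comfortably above 10x
```

At roughly $72/hour, a single saved hour per week is worth about $312/month, so the "10x its cost" claim is if anything conservative.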

The Technology Is Good Enough

Many AI applications are still in the "impressive demo, rough product" phase. AI-generated video is stunning for 10 seconds, then the physics break. AI music sounds plausible until you listen carefully. AI search is great until it confidently hallucinates.

Code generation has crossed a threshold. It's not perfect, but it's useful right now. Why is code different? A few reasons:

Formal verification loops: Code either runs or it doesn't. Tests pass or fail. The compiler catches type errors. There's immediate, deterministic feedback that lets the AI iterate toward correctness. This is unlike creative tasks where "correctness" is subjective.

Massive training data: Code is public (GitHub, Stack Overflow), structured, and abundant. LLMs have seen millions of repositories and billions of lines of code. They've learned patterns deeply.

Clear intent from context: When writing code, the surrounding context—existing functions, type signatures, comments, variable names—provides strong signal about what should come next. The AI isn't generating from a blank slate.

Tolerance for iteration: Developers already iterate. Writing code is revising code. An AI that generates something 80% right that you refine is still a massive win. You don't need perfection, just a good starting point.
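The verification-loop idea above can be sketched in a few lines. This is a hypothetical sketch, not any particular tool's implementation; `generate_patch`, `apply_patch`, and `run_tests` stand in for your model call, your repo tooling, and your test runner (e.g. pytest invoked via subprocess, where exit code 0 means all tests passed):

```python
# Sketch of the generate-test-iterate loop. The test runner's
# pass/fail result is the only feedback signal; no subjective
# human judgment enters the loop.

def agent_loop(task, generate_patch, apply_patch, run_tests, max_iters=5):
    """Iterate until the test suite (the ground-truth judge)
    passes, feeding failures back into the next generation."""
    feedback = ""
    for _ in range(max_iters):
        patch = generate_patch(task, feedback)  # LLM call
        apply_patch(patch)                      # modify the repo
        passed, output = run_tests()            # deterministic signal
        if passed:
            return True
        feedback = output  # failures become the next prompt's context
    return False
```

The design point is that "correctness" never depends on taste: the loop terminates when the compiler and tests say so, which is exactly the property creative-generation tasks lack.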

This combination means code generation is past the toy phase. It's delivering real value in production workflows today.

Feedback Loops Provide Leniency to Imperfect Systems

This might be the most important factor. AI coding assistants work through feedback: wherever there's a strong pass-or-fail signal, an agent with enough iterations can make real progress toward solving the problem.

Compare this to other AI applications, where the only feedback signal is subjective human preference.

Even then, you get what we see in consumer-facing chat interfaces: degenerate behaviors like sycophancy get reinforced even when humans know they're suboptimal. People love having their egos stroked, and because models are trained against human feedback, responses that flatter them get rewarded and the behavior propagates.

Code has ground truth. The compiler and tests are the judge. This lets AI coding tools improve faster than other applications.

Additionally, the code generation task aligns with LLM strengths. LLMs are pattern-matching engines trained on vast amounts of text. Code is structured, pattern-rich, and well-represented in training data.

It Compounds—Better Tools Make Better Tools

AI coding assistants make themselves better.

The teams building Cursor, Claude Code, and GitHub Copilot use their own tools. This use case is being dogfooded more than any other at the companies developing LLMs. As people say in product design, the best products are the ones you make for yourself.

This is a compounding loop that doesn't exist for most AI applications. AI-generated art doesn't help you build better art generators—it probably makes them worse. AI writing assistants don't make it easier to train better writing models. But AI code generation directly accelerates AI research and development.

This suggests a future where AI-powered software development accelerates—not because models get bigger (though they will), but because the tools for building models get more targeted on this use case, where the feedback signals are most observable.

The Staying Power

When the current AI hype cycle cools, what remains?

Some consumer AI apps may stick. Chat-mediated research probably survives. AI-generated art may become a standard tool for certain creative workflows, particularly in marketing where art is commodified. Voice AI for transcription and customer service likely becomes commodity infrastructure.

But I think the most durable, highest-value application is software engineering automation, because the value is measurable, the buyers are willing to pay, the technology is good enough, the feedback loops are verifiable, and the improvements compound.

This doesn't mean programmers disappear. It means programming changes. The job shifts from writing every line by hand to directing AI systems, reviewing their output, and focusing on architecture and intent. The cognitive load moves up the stack.

But unlike other AI applications that might be fads, coding automation has staying power because it's embedded in the critical path of building everything else. Every software company, every tech product, every digital service—they all need code. Making that code faster and cheaper to write is valuable to everyone.

The Interface Moment for Coding Agents

Just like ChatGPT made LLMs legible to the general public through a simple chat interface, we're approaching an interface moment for coding agents.

Until recently, AI coding tools were mostly autocomplete-plus. Copilot suggests the next line. Cursor helps you edit faster. These are powerful, but they're incremental improvements to existing workflows.

The interface moment is here: the abstraction level is shifting. Instead of writing code and having AI assist, you describe intent and AI generates working systems. The unit of work isn't "write this function" but "build this feature" or "refactor this architecture."

When this interface matures, when the abstraction is natural and the reliability is high enough that you trust it, that's when coding automation becomes truly sticky. Not just for early adopters and AI enthusiasts, but for every engineer, every team, every company, and eventually for people who aren't engineers at all.

Conclusion

ChatGPT's launch wasn't about new technology. It was about making existing technology legible through the right interface. The chat box made LLM capabilities undeniable to millions of people.

Coding agents follow the same pattern. The underlying technology isn't new. What's coming is the interface refinement that makes software engineering automation natural and reliable, and it will open software engineering to people who have never written a line of code or touched a command line.

Coding automation won't remain a novelty or a power-user tool. It will be infrastructure: the default way software gets built.

That's the killer app. The one that remains when the hype fades. The one that justifies the investment, the attention, and the transformation.