Four Kinds of AI in Chip Design, and Only Two of Them Pay the Bills

Summary. People say “AI in chip design” like it means one thing. It doesn’t. It means at least four things, separated by roughly a decade of maturity. Two of them quietly run in production and make real money. The other two own the headlines and the funding rounds while shipping a fraction of the value. Here is how to tell them apart, and why the people who win never pretend one is the other.

The question that exposes everyone

Somebody’s VP reads a LinkedIn post about an AI that designed a chip, walks into Monday standup, and asks why the team hasn’t done that yet. If you have worked in this industry for more than a year, you have lived this exact moment, and watched the room go quiet, because the honest answer is complicated and the VP wants a yes or no.

Here is the thing nobody puts on a slide. “AI in chip design” is not one technology. It is four, and smashing them into a single buzzword is how programs get funded on a demo and then blindsided when tapeout slips two quarters. At an advanced node, a slipped quarter is a number with a lot of zeros and a very awkward board update.

So let me split them the way an engineer who has actually shipped silicon thinks about it. Not by where they sit in the flow. By whether they survive contact with a real chip.

Type one: optimization, the one that quietly works

Reinforcement learning and brute force search pointed at problems with a clean number to chase. Floorplanning, placement, routing, the whole RTL to GDSII knob twisting marathon. Most mature category, and nobody posts about it, because working software is boring and outrage is not.

It is real. DSO.ai has crossed three hundred commercial tapeouts. AlphaChip has placed macros in multiple TPU generations and an Arm server CPU, and Google open sourced the method. Cerebrus customers buy it again, which in this industry is the only review that counts.

Why it works matters more than that it works. Physical design has a scoreboard: power, timing, area, congestion. A machine that tries a billion configurations while you sleep beats the human who got through forty before the standup. Nothing to misinterpret, no intent to guess. That narrow setup is exactly where today’s AI is strong, and surprise, that is exactly where the money is. Anyone who has babysat a long run knows it is not magic. You still set the recipe and catch the run that wandered off. But it earns its seat, and that is the bar.

Type two: predictive ML, the boring one that prints money

Plain predictive machine learning, stitched into test and yield and verification flows since long before “agentic” was a word. Defect classification on wafer maps. Scan compression settings. Coverage prediction. Triaging which of four hundred failing seeds actually matter. Aging forecasts from on chip monitors.

None of it is exciting. All of it makes money. KLA built an eleven billion dollar revenue base partly on models that sort defects faster than any human squinting at a SEM image. PDF Solutions sells yield analytics into basically every serious fab. This wins because it eats narrow, repetitive, high volume problems where a one percent accuracy bump multiplies across millions of die into real margin.

The unsexy lesson every grizzled person in this field already knows: boring prediction at scale beats a jaw dropping generation demo every quarter, on the only document that matters.

Type three: generative AI, the one your manager saw on Twitter

Large language models spitting out RTL, testbenches, assertions, and docs from a prompt. Loudest category by far. Least proven in a real flow by a wider margin.

The demos are legitimately cool, and that is the trap. A team generated a working RISC V core from a two hundred word spec in twelve hours and the internet lost its mind. Then you read the footnotes. Simulation only. Academic process kit. Never fabricated. Tens of billions of tokens for a core about as fancy as something from 2011. Nice research result. Not a product.

And the part the hype crowd never mentions, the part every verification engineer is screaming about in the replies: “it compiles” is not “it works.” Machine written RTL looks perfectly reasonable and then synthesizes to garbage or quietly drifts off spec in the corner case that ships to your biggest customer. The model writing code faster does not delete the work. It shoves it downstream into verification, already 70% of the schedule and the least appreciated job in the building, and upstream into your spec being airtight, which it never is.

Where it actually helps: autocompleting boilerplate, translating between languages, drafting the first ugly version of docs, explaining some cursed uncommented module. Engineers like it. It is also a long way from autonomous, and everyone doing the real work knows it even when their slides say otherwise.

Type four: agentic AI, the promise everyone is fronting on

Agents that orchestrate the other three, chaining tools and decisions with a human supposedly stepping back. Every big vendor shipped one in early 2026. Startups are raising eye watering rounds on “autonomous engineering workforce.” The direction is right. That is not the issue.

The receipts are. Nobody has shown a full, zero human in the loop tapeout of a real industrial chip. Not a toy, not a sim, a shipped and signed off chip. The serious surveys say this plainly. What agents do well today is compress the soul crushing middle of a workflow, the glue work, the chasing. Valuable, but not a teammate you hand a spec and trust to return silicon you would sign.

Confusing the promise with the present is the most expensive mistake a good org can make right now, because it tempts you to pull real engineers off the org chart against a capability that has not landed. Then first silicon comes back, the bug is in the block the agent owned, and nobody can explain it to the customer.

The wall nobody is selling tickets to: signoff

Signoff needs accuracy that probabilistic models cannot promise yet. A billion transistor chip that is 99% correct is not 99% done. It is broken, and you find out which 1% the expensive way. Timing signoff, DRC, functional closure want determinism. A model that is usually right is, by definition, not allowed near the last mile.

This is not a gap one more training run closes. It is what happens when statistics meet a domain that turns a single escaped bug into a respin and a slipped launch. Until that changes, a human signs the silicon. So stop asking whether AI removes the signature. Ask how much of the misery before it AI can take off your plate. That question has real answers. The other one is a fantasy your CFO should not be funding.

So what do you actually do

If you sell optimization or prediction, just sell the results. They are real and provable. The moment you wrap them in autonomy cosplay, the people who know start tuning you out.

If you build generation or agents, be brutally clear about the line between helping an engineer go faster and replacing the engineer. These buyers have been burned by EDA marketing for thirty years. They smell an inflated number from the parking lot, and the second they catch one, every other claim gets mentally deleted. The fastest way to torch your credibility is to promise the fourth type and ship the third.

The teams that win the next five years are not the ones with the loudest autonomy story. They are the ones who know exactly which type they are shipping, charge for the value that genuinely exists, and quietly let the boring money printing categories fund the ambitious ones until those grow up. The revolution is already here. It just walked in through the unglamorous door marked optimization and prediction, while the crowd kept staring at the window where the autonomous agent is supposed to make its entrance any minute now.