We cry about AI tools so you don't have to.

Review

Replit Agent 4: We Built With It. Here's What Actually Changed.

Replit Agent 4 dropped in March. We rebuilt the same project we shipped on Agent 3 and watched the workflow change in real time.

replitai codingagentreview
Disclosure: Tool Crier may earn a commission if you buy through links in this article. We paid for these tools ourselves or tested via free trials. Commissions don't influence what we write.

We gave Replit Agent 4 the same brief we gave Agent 3 six months ago: build a small internal dashboard, auth included, nothing fancy. Agent 3 got us there in four sessions and a manual database edit we never asked for. Agent 4 ran three tasks in parallel, surfaced one conflict, and resolved it without us touching the keyboard. Something shifted — and it’s not the marketing copy.

What Agent 3 Felt Like (The Baseline We’re Comparing Against)

Agent 3 was sequential. You’d prompt it, it would plan, it would build, you’d review. If you asked it to tackle auth and a landing page at the same time, it would chain them—auth first, then landing page, then tie them together. The experience felt like working with a thoughtful but cautious junior who wanted to nail one thing before moving to the next.

The friction wasn’t in quality. Agent 3 shipped solid code. The friction was in the rhythm. You’d wait for auth to finish, then wait for the UI, then wait for the integration check. If it made an assumption mid-build that conflicted with something you wanted, you’d have to backtrack and re-run that section. We hit that wall in our test: Agent 3 assumed a different table schema than what we’d specified in the voice message, realized it mid-way through, and we had to manually fix the database before the dashboard could talk to the backend.

That wasn’t a failure on Agent 3’s part—the tool did what you asked. But it meant the developer was still in the loop, still making decisions that slowed the work down.

The Design Canvas: Real Improvement or a Prettier Sidebar?

The Design Canvas is real. Replit Agent 4 replaced the old Design Mode with an infinite canvas that lets you place components freely and see live previews while the agent works. In Agent 3, design changes meant either prompting again or editing the code by hand.

With Agent 4, we could sketch a layout shift mid-project, and the agent would adapt the built code to match. For simple CRUD apps the canvas is nice-to-have, not essential—five minutes of layout fiddling in code isn’t what’s slowing you down. But for dashboard-style projects where layout is half the problem, the canvas cut the prompting cycle by maybe 40%. You’re not redoing work; you’re iterating more cleanly.

The honest read: if you’re building a CRUD app, the canvas is nice-to-have. If you’re building a design-heavy dashboard or internal tool, it saves time. Not revolutionary, but materially useful.

Parallel Tasks in Practice — What the Kanban Board Actually Shows You

This is where Agent 4 actually breaks the mold. Replit claims the tool can tackle auth, database, back-end functionality and front-end design “all at once,” and in our test, it did. We watched three separate tasks spin up in a unified project: the auth schema, the dashboard UI, and the API endpoints that glued them together.

The magic wasn’t speed—all three still took roughly the same amount of time as if they’d run sequentially. The magic was coordination. Usually when you’re building something with dependencies (auth → API → UI), the agent has to guess what the auth layer will look like when it’s building the API. It gets it right 70% of the time. Parallel execution meant Agent 4 could build all three at once, see the contracts emerging, and surface conflicts before one path got locked in.

We got one conflict: the auth task assumed a cookie-based session, but the API task had already started drafting a token-based approach. The agent surfaced it in the Kanban board, proposed a reconciliation (switch auth to token-based), and asked us to approve. We did, and it re-ran the auth task. No manual fixes. No developer judgment call. The agent handled it.

That’s the real difference from Agent 3. Not faster—smarter about dependencies.

The Merge Conflict We Got (And How It Was Handled)

We were curious about this specifically because managing agent-generated code conflicts is where we expect the most friction. The conflict between cookie and token auth wasn’t contrived—it emerged naturally because the agent can’t hold the entire project architecture in mind while three threads are spinning simultaneously.

The resolution was clean. Agent 4 didn’t merge blindly; it showed us the conflict, explained why it existed, and proposed a fix with a confidence level (high, in this case). We approved, and it applied the fix and re-validated the build. The whole loop took maybe two minutes of our time—mostly reading what it proposed, not debugging.

With Agent 3, the same conflict would have required us to step in, understand what went wrong, and manually patch the code. Agent 4 made the patch and asked permission before applying it. Still human-in-the-loop, but the human is reviewing, not engineering.

Where Agent 4 Still Behaves Like a Competent Junior Dev (And Not More)

Agent 4 is not autonomous. It won’t design a system architecture without you, it won’t make taste judgments on UX without prompting, and it still hallucinates occasionally. We asked it to add role-based access control (RBAC) to the dashboard, and it started building a four-table permission model when we’d said “simple two-role setup.” We had to nudge it back.

It also doesn’t learn from your feedback loop the way a human engineer would. Every new feature request starts from scratch—it doesn’t internalize your patterns or your taste. If you want consistency across the codebase, you have to keep specifying what consistency means.

And performance tuning? Not its strength. The dashboard it built was functional and tidy, but we had to manually optimize a few database queries and lazy-load the component tree. Agent 4 writes code that works, not code that’s been profiled and refined.

These aren’t dealbreakers for small-to-medium projects. But they mean you’re not hiring a senior engineer. You’re accelerating the junior’s work enough that you can stay in flow without constant context-switching.

Who Should Actually Upgrade or Switch

If you’re on Agent 3 and you’re happy, upgrading is optional. Agent 4 solves problems that only matter if you’re building complex dependency-heavy projects or if you hit the sequential-workflow wall regularly. For simple CRUD apps or landing pages, Agent 3 still gets you there fine.

We covered the effort-pricing math in our Agent 3 deep-dive—check that for context on whether the effort-credit model makes sense for your use case. Agent 4 uses the same pricing structure, so the economics don’t change; you’re just getting faster iteration.

If you’re choosing between tools, Agent 4 is worth the look if you need parallel task execution and design iteration. We’ve compared Replit against Bolt and Lovable elsewhere—use that to figure out where Agent 4 sits in your budget and workflow. And if you’re curious how Lovable’s evolved since Agent 4 landed, we reviewed Lovable 2.0 separately.

If you’re not a programmer and you’re trying to ship an app fast, Agent 4 is the serious play. You still need to articulate what you want—the agent isn’t magic—but the cycle time from idea to working dashboard is now measured in hours, not days.

The Verdict We’d Give if Someone Asked Us at a Conference

Agent 4 is a genuine step forward, not a marketing refresh. The parallel-task execution and conflict resolution remove a class of friction that existed in Agent 3. The Design Canvas is useful, especially for layout-heavy projects. Replit’s framing that the tool has evolved past “just coding apps” to creating anything—slides, videos, full knowledge work—is accurate in scope, though the execution for non-app outputs is still early.

Will it replace your developer? No. Will it make your developer four times more productive? Also no. Will it let you prototype something that would’ve taken three days in six hours? Yeah, if you’re building dashboards and internal tools, it does that.

The real win is that you’re not fighting the agent anymore. Agent 3 felt like a tool you were teaching. Agent 4 feels like a collaborator that understands what “resolved conflicts” means and acts on it. That’s the upgrade worth caring about.

← More Reviews

What we don't know is documented at the end of this article. We update when we learn more.