NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
Show HN: Plandex v2 – open source AI coding agent for large projects and tasks (github.com)
maxwelljoslyn 13 hours ago [-]
It seems that much like Aider, you use separate models for creating code edits and validating them. That's a win in my book. It seems Claude Code does not do that, which is part of the reason it racks up a (relatively) large bill for long work sessions (that and the cost of sonnet-3.7).

I bounce back and forth between Aider, Claude Code, and Simon Willison's LLM tool ("just" a GOOD wrapper for using LLMs at the CLI, unlike the other two which are agent-y.) LLM is my favorite because I usually don't need/want full autonomy, but Claude Code has started to win me over for straightforward stuff. Plandex looks cool enough to throw into the rotation!

My main concern at this point is that I use a Mac and as far as I understand it Docker containers can have pretty poor performance on the Mac, so I'm wondering if that will carry over to performance of Plandex. (I don't use Docker at all so I'm not sure if that's outdated info.)

danenania 12 hours ago [-]
> It seems that much like Aider, you use separate models for creating code edits and validating them.

That's right. To apply edits, Plandex first attempts a deterministic edit based on the edit snippet. In some cases this can be used without validation, and in others a validation step is needed. A "race" is then orchestrated with o3-mini between an aider-style diff edit, a whole file build, and (on the cloud service) a specialized model. I actually wrote a comment about how this works (while maintaining efficiency/cost-effectiveness) a couple days ago: https://news.ycombinator.com/item?id=43673412

And on the Docker question, it should be working well on Mac.

Onawa 12 hours ago [-]
Docker containers can be somewhat slower due to most available Docker images targeting x86. If you build for Arm, it should be better.
erikcelander 2 hours ago [-]
orbstack > docker on mac
sagarpatil 4 hours ago [-]
I spend hours everyday researching AI products. Why am I only hearing about plandex now? Looks very promising, I’ll give it a try. Please up your marketing game, the product looks solid!
viraptor 4 hours ago [-]
Maybe because they had a release hiatus for a bit. V1 was cool and had great ideas / interface, but needed some polish in practice (new models weren't well supported, it for in endless cycles, etc.) The explicit planning, server model, visible steps and a few other novel things were impressive though.

Hopefully the V2 will bring the polish and reliability.

LeonidBugaev 4 hours ago [-]
You should check https://probeai.dev/ too. Thats one of those building blocks which makes AI trully understand the code.
jmcpheron 11 hours ago [-]
Plandex was one of the first agentic style coating system to I tried several months ago, and it worked very well. But I've been using the cursor and windsurf style editors more recently because of their popularity. And their effectiveness is honestly pretty great.

Would you classify Plandex as more similar to a terminal interface like Claude Code? Also it looks like Open AI released a similar terminal based tool today. https://github.com/openai/codex

Do you see an obvious distinctions or pros/cons between the terminal tools and the IDE systems?

danenania 10 hours ago [-]
> Would you classify Plandex as more similar to a terminal interface like Claude Code? Also it looks like Open AI released a similar terminal based tool today. https://github.com/openai/codex

Yes, I would say Plandex is generally similar in spirit to both Claude Code and OpenAI's new Codex tool. All three are agentic coding tools with a CLI interface.

A couple areas where I think Plandex can have an edge:

- Most importantly, it's almost never the case these days that a single provider offers the best models across the board for coding. Instead, each provider has their strengths and weaknesses. By slotting in the best model for each role, regardless of provider, Plandex is able to get the best of all worlds. For example, it currently uses Sonnet 3.7 by default for planning and coding, which by most accounts is still the best coding model. But for the narrow task of file edits, o3-mini offers drastically better speed, cost, and overall results. Similarly, if you go above Sonnet 3.7's context limit (200k tokens), Plandex can seamlessly move you over to a Gemini model.

- It offers some unique features, like writing all changes to a sandbox by default instead of directly to project files, that in my experience make a big difference for getting usable results and not leaving behind a mess by accident. I won't list all the features again here, but if you go through the README, I think you'll find a number of capabilities are quite helpful and aren't offered by other tools.

> Do you see an obvious distinctions or pros/cons between the terminal tools and the IDE systems?

I'm a Cursor subscriber and I use both Cursor and Plandex regularly for different kinds of tasks. For me, Cursor works better for smaller, more localized changes, while Plandex offers a better workflow for tasks that involve many steps, many files, or need many attempts to figure out the right prompt (since Plandex has more robust version control). I think once you are editing many files in one go, the IDE tab-based paradigm starts to break down a bit and it can become difficult to keep a high level perspective on everything that's changing.

Also, I'd say the terminal is naturally a better fit for executing scripts, installing dependencies, running tests and so on. It has your environment already configured, and it's able to control execution in a much more structured and reliable way. Plandex, for example, can tentatively apply a bunch of pending changes to your project, execute an LLM-generated script, and then roll back everything if the script fails. It's pretty hard to achieve that kind of low-level process control from an IDE.

MadsRC 3 hours ago [-]
This looks great!

With the self-host option, it’s not really clear through the docs if one is able to override the base url of the different model providers?

I’m running my own OpenAI, Anthropic, Vertex and Bedrock compatible API, can I have it use that instead?

ako 3 hours ago [-]
Interesting to see that even with these type of tools coding it takes 8 months. That is not the general impression people have of ai assisted coding. Any thoughts on how you could improve plandex to bring down 8 months to 1 month or less?
jtwaleson 7 hours ago [-]
Nice! I tried it out when you launched last year but found it pretty expensive to use. I believe I spent $5 for half an hour of coding or so. Can you share what the typical costs are now, since the model prices have changed significantly?
chrisweekly 13 hours ago [-]
This looks powerful and useful. Also, IMHO it's an exemplary "Show HN"; nice job w/ the description.
danenania 12 hours ago [-]
Thanks for the kind words! I really appreciate it.
iambateman 12 hours ago [-]
Really cool! Looking forward to checking this out.

I really like my IDE (PHPStorm) but I want Cursor-like functionality, where it’s aware of my codebase and able to make changes iteratively. It sounds like this is what I need?

Excited to give this a go, thanks for sharing.

Btw one of the videos is private.

danenania 12 hours ago [-]
Thanks! I'd love to hear your feedback.

> I want Cursor-like functionality, where it’s aware of my codebase and able to make changes iteratively. It sounds like this is what I need?

Yes, Plandex uses a tree-sitter project map to identify relevant context, then makes a detailed plan, then implements each step in the plan.

> Btw one of the videos is private.

Oops, which video did you mean? Just checked them all on the README and website in incognito mode and they all seem to be working for me.

iambateman 11 hours ago [-]
danenania 10 hours ago [-]
Oh I see, in the HN post above. Sorry about that! Seems it's too late for me to edit, but here's the correct URL - https://youtu.be/g-_76U_nK0Y

I'll ping the mods to see if they can edit it.

elliot07 12 hours ago [-]
Congrats on the V2 launch. Does Plandex support MCP? Will take it for a test drive tonight.
danenania 12 hours ago [-]
Thanks! It doesn't support MCP yet, but it has some MCP-like features built-in. For example, it can launch a browser, pull in console logs or errors, and send them to model for debugging (either step-by-step or fully automated).
ErikBjare 12 hours ago [-]
Awesome to see you're still at it. v2 looks great, I will take it for a spin.
danenania 11 hours ago [-]
Thanks Erik! I'd love to hear your thoughts.
esafak 13 hours ago [-]
I think you should have put the "terminal-based" qualifier in the title and lede.
danenania 12 hours ago [-]
Yeah that's fair enough. The way I look at it though, a lot of the value of Plandex is in the underlying infrastructure. While I've stuck with the terminal so far in order to try to keep a narrower focus, I plan to add other clients in the future.
handfuloflight 12 hours ago [-]
How efficient is it in constructing context?
danenania 12 hours ago [-]
It constructs context to maximize cacheability. The context window is also carefully managed for efficiency and focus. During implementation steps, only the relevant files are loaded.
sreejithr 6 hours ago [-]
we cooked?
ramesh31 7 hours ago [-]
CLI interfaces are not where it's at for this stuff.

What makes Cline the king of codegen agents right now IMO (from a UI/UX perspective) is how well they handle displaying the code, opening files, and scrolling the cursor as it changes. Even in a fully autonomous agentic flow, you still really want to be reading the code as it is written, to maintain context and keep steering it correctly. Having to go back and look at a huge diff after all of your updates is a real pain and slows things down.

victor9000 7 hours ago [-]
UI/UX is one of the last things I'm worried about when leveraging generative tools. When I need to inspect what the model has done, a quick git diff does the trick.
6 hours ago [-]
longmabacgi4 4 hours ago [-]
Cách hack trong roblox
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 10:19:55 GMT+0000 (Coordinated Universal Time) with Vercel.