PaPoo
cover

Brick tries to route each prompt to the right model

For Claude and Claude Code users, this is interesting because it attacks a very real pain point: once you have multiple models in play, the “which one should handle this?” problem becomes annoying and expensive fast. Brick is basically trying to sit in front of that mess and make the choice for you, with a strong emphasis on keeping the Claude Code workflow intact.

image_0001.png

Key Points

image_0002.png

image_0004.svg

image_0003.svg

My Take

image_0005.svg

What strikes me is that this is less about “smart routing” as a flashy idea and more about brutally practical cost control. If you’ve spent time around Claude Code, you know the temptation is to leave it on the strongest model and stop thinking about it. That works until the bill shows up, or until you realize half your prompts were simple enough for something cheaper.

image_0007.svg

image_0006.svg

I think the most compelling part here is the Claude Code integration. A lot of tooling in this space is technically clever but operationally annoying. Brick seems aimed at the opposite: keep the familiar UX, change the backend logic, and don’t force the user to become a routing expert. That’s the right instinct. Developers will tolerate a lot if the tool stays out of the way.

image_0008.svg

At the same time, I’d be curious whether the “capability and complexity extraction” is as reliable as the README makes it sound. This might be genuinely strong, or it might work well on the obvious cases and get fuzzy on borderline prompts. Routing is one of those problems where the demo can look magical and the long tail can be messy. I’d want to see how it behaves on real coding sessions: short follow-ups, ambiguous bug reports, mixed reasoning-and-implementation tasks, and prompts that need a strong model for one tiny part but not the rest.

image_0010.svg

image_0009.svg

The “no cascades” part is also interesting. I like the discipline of making one decision and living with it. Cascades sound elegant on paper, but in practice they can feel like token compost. Still, a single-shot router only helps if its judgment is good enough. If it misroutes too often, you save money in the wrong places and lose time in the right ones.

image_0011.svg

I’d actually try this in a private Claude Code setup if I were juggling multiple models, especially if I were paying for more than one frontier model and wanted a cleaner default. The observability angle matters too. If Brick can show me what it routed where, and whether “savings” are real rather than hand-wavy, that’s much more interesting than yet another routing abstraction with a glossy pitch.

image_0013.png

image_0012.svg

The big takeaway: Brick is trying to make model selection boring, and that’s probably the right ambition. If it works as advertised, it could be one of those utilities you stop noticing because it quietly saves money and keeps you on the right model.

image_0015.png

Reference: GitHub - regolo-ai/brick-SR1: brick is a smart AI Models router, based on complexity & capabilities extraction from the query to the models via proprietary spatial embedding algorythm

image_0016.svg

同じ著者の記事