For Claude and Claude Code users, this is interesting because it attacks a very real pain point: once you have multiple models in play, the “which one should handle this?” problem becomes annoying and expensive fast. Brick is basically trying to sit in front of that mess and make the choice for you, with a strong emphasis on keeping the Claude Code workflow intact.

model: "brick" style OpenAI-compatible endpoint.brick claude on to set ANTHROPIC_BASE_URL in Claude Code settings and start the router.eco, lite, mid, pro, and max, which map your cost/quality preference to different model tiers.opus, sonnet, or haiku bypasses Brick entirely.brick claude status gives a live dashboard with routing counts, effort distribution, difficulty mix, and estimated savings.What strikes me is that this is less about “smart routing” as a flashy idea and more about brutally practical cost control. If you’ve spent time around Claude Code, you know the temptation is to leave it on the strongest model and stop thinking about it. That works until the bill shows up, or until you realize half your prompts were simple enough for something cheaper.
I think the most compelling part here is the Claude Code integration. A lot of tooling in this space is technically clever but operationally annoying. Brick seems aimed at the opposite: keep the familiar UX, change the backend logic, and don’t force the user to become a routing expert. That’s the right instinct. Developers will tolerate a lot if the tool stays out of the way.
At the same time, I’d be curious whether the “capability and complexity extraction” is as reliable as the README makes it sound. This might be genuinely strong, or it might work well on the obvious cases and get fuzzy on borderline prompts. Routing is one of those problems where the demo can look magical and the long tail can be messy. I’d want to see how it behaves on real coding sessions: short follow-ups, ambiguous bug reports, mixed reasoning-and-implementation tasks, and prompts that need a strong model for one tiny part but not the rest.
The “no cascades” part is also interesting. I like the discipline of making one decision and living with it. Cascades sound elegant on paper, but in practice they can feel like token compost. Still, a single-shot router only helps if its judgment is good enough. If it misroutes too often, you save money in the wrong places and lose time in the right ones.
I’d actually try this in a private Claude Code setup if I were juggling multiple models, especially if I were paying for more than one frontier model and wanted a cleaner default. The observability angle matters too. If Brick can show me what it routed where, and whether “savings” are real rather than hand-wavy, that’s much more interesting than yet another routing abstraction with a glossy pitch.

The big takeaway: Brick is trying to make model selection boring, and that’s probably the right ambition. If it works as advertised, it could be one of those utilities you stop noticing because it quietly saves money and keeps you on the right model.
