For Claude Code & Codex users

The platform your agent already knows how to drive.

Benmore is one binary plus a hosted dev environment. Install it, bootstrap once, launch Claude Code - then just say what you want. Your app is live on a dev URL while the agent builds it, verifies it in a real browser, records demo videos, and publishes when you say go.

$ brew install benmore-studio/benmore/benmore
$ benmore bootstrap   # sign in / sign up + installs the agent skill
$ cd ~/Benmore
$ claude              # that's it - start talking
macOS & Linux · signed & notarized · framework source on GitHub

How it works

No local server, no Docker, no build step. The dev environment is hosted - you watch the app take shape in a browser tab while the agent works.

1. Install & bootstrap

benmore bootstrap signs you in (or creates your account), sets up your ~/Benmore workspace, and installs the agent skill - the operating manual Claude Code loads automatically. After that, your agent knows the entire platform surface.

2. Say what you want - it's live while you build

Every app is born live in a hosted dev environment. Each edit the agent makes ships there in about a second. Open the dev URL on your laptop or phone and watch it evolve in real time.

While you build: https://<app>-dev.benmore.ai
Your sandbox. The agent builds, tests, and iterates here.

When you say "publish": https://<app>.benmore.ai
Production. Code promotes dev → prod; data is migrated with a backup.
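
The two URLs follow one naming pattern, which can be sketched in a few lines of TypeScript. This is an illustration only: `appUrls` is a hypothetical helper, not part of the Benmore CLI, and the rule it uses for valid app names (lowercase letters, digits, hyphens) is an assumption.

```typescript
// Hypothetical helper: derives the dev and production URLs from the
// pattern shown above. The app-name rule is an assumption, not a
// documented Benmore constraint.
type AppUrls = { dev: string; prod: string };

function appUrls(app: string): AppUrls {
  if (!/^[a-z0-9]+(?:-[a-z0-9]+)*$/.test(app)) {
    throw new Error(`unexpected app name: ${app}`);
  }
  return {
    dev: `https://${app}-dev.benmore.ai`, // sandbox the agent builds in
    prod: `https://${app}.benmore.ai`,    // what "publish" promotes to
  };
}

const urls = appUrls("invoices");
console.log(urls.dev);  // https://invoices-dev.benmore.ai
console.log(urls.prod); // https://invoices.benmore.ai
```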

3. Publish when you're ready

Say "publish it" and the agent promotes your code to production with auto-SSL - dev stays your sandbox. Every change is a git commit, so "undo that" always works too.

Don't learn commands. Say what you want.

These are real prompts. The skill teaches your agent the tools behind them - you never have to.

Build me an invoicing app with client logins and overdue reminders. Show me when it's live.
Behind the scenes: the agent writes a Prisma schema + TSX frontend and pushes. The runtime generates the REST API, auth, and real-time updates - no handler code. Live on your dev URL in minutes.
Open the app in a real browser, sign up as a test user, and verify onboarding end-to-end. Record it so I can watch.
Behind the scenes: browser_check drives headless Chrome against your dev app - JS executed, clicks and typing included - and records a scrubbable session replay. You get a link and literally watch what the agent did.
Make a shareable demo video of the checkout flow and put it on a page I can send to a client.
Behind the scenes: a shared recording mints a public, login-free replay link plus an embed. The agent pastes it into an edge - an isolated shareable page on its own URL. Inputs are masked at capture.
POST /api/invoices returns 403 for managers - figure out why and fix it. Prove it works after.
Behind the scenes: the agent probes the route as a real signed-in user, tails server logs, queries the live database, fixes the access rule, and re-probes to show the 200.
Undo whatever you just did.
Behind the scenes: every pushed file auto-commits to the app's own git history. One revert and the app is back - instantly, including from your phone at 2am.
Before we publish: run a security scan and fix anything it finds.
Behind the scenes: the agent runs the platform's security scan on top of the write-time validators. The runtime already enforces parameterized SQL, CSRF protection, per-user scoping, and rate limits by default - the agent can't ship past the gate.
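
For a feel of what "probe, fix, re-probe" means in the 403 prompt above, here is a minimal offline sketch. Only the route `POST /api/invoices` and the 403/200 status codes come from this page. The cookie-based session, the payload fields, and the `fakeServer` stand-in are assumptions made so the example runs without a network; the agent's actual tooling is not shown here.

```typescript
// Sketch of a probe against POST /api/invoices. Assumptions: the session
// travels in a Cookie header and the payload fields are illustrative.
type FetchLike = (
  url: string,
  init: { method: string; headers: Record<string, string>; body: string },
) => Promise<{ status: number }>;

async function probeCreateInvoice(
  fetchImpl: FetchLike,
  baseUrl: string,
  sessionCookie: string,
): Promise<number> {
  const res = await fetchImpl(`${baseUrl}/api/invoices`, {
    method: "POST",
    headers: { "Content-Type": "application/json", Cookie: sessionCookie },
    body: JSON.stringify({ client: "Acme", amountCents: 12000 }),
  });
  return res.status;
}

// Stand-in for the server so the sketch runs offline: it rejects managers
// until the (hypothetical) access rule is fixed.
function fakeServer(managersAllowed: boolean): FetchLike {
  return async (_url, init) => {
    const isManager = (init.headers["Cookie"] ?? "").includes("role=manager");
    return { status: isManager && !managersAllowed ? 403 : 200 };
  };
}

async function demo(): Promise<void> {
  const base = "https://invoices-dev.benmore.ai"; // hypothetical app name
  const cookie = "session=test; role=manager";
  console.log(await probeCreateInvoice(fakeServer(false), base, cookie)); // 403
  console.log(await probeCreateInvoice(fakeServer(true), base, cookie));  // 200
}

demo();
```

Passing the fetch function in as a parameter keeps the probe testable: the same function can be pointed at a stub, as here, or at a real HTTP client.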

Why agents ship faster here

The runtime does the undifferentiated work, so the agent's edits are small, declarative, and verifiable. Less surface to get wrong, more it can prove.

auto-CRUD API
Every model gets REST endpoints - list, create, batch, search, pagination - with zero handler code to hallucinate.
auth built in
Sessions, signup, OAuth, MFA, roles & RBAC, CSRF - enforced by the runtime, not agent-written code.
browser verification
Headless-Chrome checks with screenshots and recorded session replays - the agent proves the UI works, you watch.
real-time
SSE and WebSocket events for every table, scoped per user and tenant.
write-time validation
Every push runs framework validators first - antipatterns are rejected with a fix hint the agent acts on.
git history per app
Every push is a commit. Roll back any change instantly - the agent's safety net is built in.
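
The real-time item above mentions SSE. As a rough illustration of what a client sees on the wire, the sketch below parses the standard `text/event-stream` format. The event name `invoice.updated` and its JSON payload are hypothetical; this page does not describe Benmore's actual event shapes. The parser also assumes it receives whole events, not chunks split mid-line.

```typescript
// Sketch of parsing a text/event-stream body (the standard SSE wire format).
// Event names and payloads used below are hypothetical examples.
type SseEvent = { event: string; data: string };

function parseSse(chunk: string): SseEvent[] {
  const events: SseEvent[] = [];
  let event = "message"; // default event type in the SSE format
  let data: string[] = [];

  for (const line of chunk.split(/\r\n|\n|\r/)) {
    if (line === "") {
      // A blank line dispatches the buffered event, if it carries data.
      if (data.length > 0) events.push({ event, data: data.join("\n") });
      event = "message";
      data = [];
      continue;
    }
    if (line.startsWith(":")) continue; // comment / keep-alive line
    const colon = line.indexOf(":");
    const field = colon === -1 ? line : line.slice(0, colon);
    let value = colon === -1 ? "" : line.slice(colon + 1);
    if (value.startsWith(" ")) value = value.slice(1); // strip one leading space
    if (field === "event") event = value;
    else if (field === "data") data.push(value);
    // "id" and "retry" fields are ignored in this sketch.
  }
  return events;
}

const sample =
  'event: invoice.updated\ndata: {"id":1,"status":"paid"}\n\n: ping\n\ndata: hello\n\n';
for (const e of parseSse(sample)) {
  console.log(e.event, e.data);
}
```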

Not in a terminal? Same platform.

Claude Code & Codex (this page)
benmore bootstrap installs the skill automatically - the full operating manual for building, verifying, recording, and publishing. Nothing else to configure.
Claude.ai, Cursor & MCP clients
Add the Benmore MCP server as a connector and drive the same platform from chat.
MCP server URL
https://benmore.ai/mcp

Three commands, then it's conversation.

Install, bootstrap, launch your agent - your first app can be live before the coffee's done.

Read the docs · GitHub