kimiflare FAQ

Common questions about kimiflare — the open-source, Cloudflare-native AI coding agent and Claude Code alternative.

What is kimiflare?

kimiflare is an open-source, terminal-based AI coding agent — a self-hosted alternative to Claude Code — that runs on your own Cloudflare account. It is powered by the Kimi K2.7 model on Cloudflare Workers AI, with every request routed through your own Cloudflare AI Gateway for per-request logs, response caching, and authoritative cost.

Is kimiflare a Claude Code alternative?

Yes. kimiflare is an open-source Claude Code alternative. It offers the same terminal-first agentic workflow — reading and editing files, running commands, web research, and image understanding — but runs on Cloudflare Workers AI using Kimi K2.7 on your own account, and is MIT-licensed. See the full Claude Code alternative breakdown.

How do I run an AI coding agent on my own Cloudflare account?

Install kimiflare with npm install -g kimiflare (or run npx kimiflare), then run kimiflare. The onboarding flow connects your Cloudflare account and AI Gateway. All inference runs on Cloudflare Workers AI under your credentials — no third-party API keys are required.

What model does kimiflare use?

kimiflare uses Kimi K2.7 (@cf/moonshotai/kimi-k2.7-code) on Cloudflare Workers AI, with a 262K-token context window. You can override the model with the --model flag.

What is an AI coding harness?

An AI coding harness is the program that wraps a large language model with tools, permissions, context management, and a user interface so it can act as an autonomous coding agent — reading and editing files, running commands, and researching the web. kimiflare is a Cloudflare-native, open-source coding harness for the terminal. Read more: What is an AI coding harness?

How much does kimiflare cost?

kimiflare itself is free and open source (MIT). You pay only Cloudflare's usage for Workers AI inference on your own account. Because every request flows through your Cloudflare AI Gateway, kimiflare shows authoritative per-turn cost from the gateway logs in the /cost command.

Does kimiflare work on Windows, macOS, and Linux?

Yes. kimiflare runs on macOS, Linux, and Windows. It is distributed as the kimiflare npm package and requires Node.js 20 or newer.

Is kimiflare private? Does it read my whole conversation?

Inference runs on your own Cloudflare account, and kimiflare's cross-session memory is explicit and opt-in — memories are stored only when you ask via the remember, recall, and forget tools. The agent does not surveil your conversation.

Can I extend kimiflare with my own tools?

Yes. kimiflare supports the Model Context Protocol (MCP), so you can plug in external tool servers — local stdio servers or remote SSE endpoints — for GitHub, Sentry, docs search, databases, and more.

Install kimiflare   Why a Claude Code alternative?