Use it now Open source · Apache 2.0

AI agents that write your reports, all on your desktop.

One workspace, every model. Build a team of AI agents, plug in any tool, and keep every byte on your own machine.

Download for Windows · Download for macOS (Intel) · Download for macOS (Apple Silicon)

Data stays on your device · English & Chinese

Clankit · First chat

ClanKit

Chats

AI Docs

AI News

AI Roles

System Agents

User Personas

Capabilities

Workspace

Tasks Engine

Agent Tasks

My First Chat

Pro

Context 1% 7.5k in / 162 out

Young

Tell me what you can do, then save it as "About Clankit.md" in AI Docs.

2026-05-09 11:03:48

Clank

I'm Clank, the built-in professional operator in ClanKit. My core stance: tool-first, action-biased — when a task lands, I call real tools, not narrate plans. Things I can do:

File operations: read, write, edit, search, create/delete files & directories
Web fetch: web_fetch any URL → clean Markdown
Shell: run system commands, scripts, package managers
Multi-step plans: todo_manager to plan, then execute step by step
Subagent dispatch: parallel researcher/coder/analyst/reviewer
Background tasks: launch long-running ops, monitor status
Knowledge / RAG: create indexes, upload docs, semantic search
Config admin: agents, tasks, plans, HTTP tools, MCP servers, knowledge bases
Memory & history: search past chats, read/write long-term memory
Weather: live worldwide

Generating an introduction to Clankit's features and saving it to AI Docs. File created.

Saved to:

%APPDATA%\clankit\data\clankit_doc\About Clankit.md

Contents:

My self-introduction (Clank's core capabilities)
Clankit feature overview table — 11 modules covering agents, skills, MCP, HTTP tools, knowledge, AI Doc, task plans, chat search, memory, subagents, weather

Tell me if you want the table adjusted or other modules added.

cooked for 58s · in 7.1k · out 182 · total 7.3k

2026-05-09 11:03:48 → 2026-05-09 11:04:46

Type your message…

Permission: All ▾ 10 skills · 1 tool Enter to send · Shift+Enter for newline · Ctrl+Shift+A attach

Notes

Manage Docs using AI

Right-click or drag & drop files here

› 01-AI Foundations

› 02-Memory

› 03-Agents

› 04-Tools

› 05-Concepts

› 06-Projects

Autonomous Agent Architecture.md

› AWS-3-Tier-Architecture.drawio

› ClanKit_EN.pptx

› MOC.md

Autonomous Agent Architecture.md

Formatted

title: Autonomous Agent Architecture
author: Research & Engineering
status: draft
version: 0.4.2
last_reviewed: 2026-05-07

Autonomous Agent Architecture

TL;DR

A reference architecture for production-grade autonomous agents combining LLM reasoning, Long-term Memory, structured Tool Use via MCP, and a hierarchical Planning layer. Designed for horizontal scale across Multi-Agent System deployments.

1. Overview

Modern Autonomous Agent systems are no longer monolithic prompt loops. They are distributed cognitive pipelines built on four pillars:

Pillar	Responsibility	Primary Components
Reasoning	Decompose intent, evaluate options	LLM, Reasoning, Prompt Engineering
Memory	Persist & retrieve context	Vector Database, Long-term Memory, Episodic Memory
Action	Interact with the world	Tool Use, MCP, Claude Code
Control	Plan, schedule, terminate	Planning, Agent Loop

2. System Diagram

Design rationale

The split between Planner, Workers, and Critic is deliberate. A single LLM playing all three roles tends to confirm its own mistakes. See Multi-Agent System for trade-offs.

3. Execution Lifecycle

The runtime follows a deterministic Agent Loop:

Phase	Bounded by	Failure mode
Retrieve	Context Window	stale embeddings
Plan	token budget	over-decomposition
Act	tool latency	infinite loops
Persist	storage I/O	memory bloat
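
The four phases in the table can be sketched as a bounded loop. This is a minimal illustration rather than the note's or ClanKit's actual runtime: `AgentState`, `retrieve`, `plan`, `act`, `persist`, and the `max_steps` cap are all hypothetical names, and the cap is one possible guard against the "infinite loops" failure mode listed for the Act phase.

```python
from dataclasses import dataclass, field


@dataclass
class AgentState:
    """Hypothetical container for one task's working state."""
    goal: str
    context: list[str] = field(default_factory=list)
    log: list[str] = field(default_factory=list)
    done: bool = False


def retrieve(state: AgentState) -> None:
    # Stand-in for a memory lookup; a real system would query a vector store.
    state.context.append(f"context for: {state.goal}")


def plan(state: AgentState) -> list[str]:
    # Stand-in planner: a fixed two-step plan, produced only on the first pass.
    return [] if state.log else ["step-1", "step-2"]


def act(state: AgentState, step: str) -> None:
    # Stand-in for a tool call.
    state.log.append(f"ran {step}")


def persist(state: AgentState) -> None:
    # Stand-in for writing results back to memory; here it only marks completion.
    state.done = True


def run_agent_loop(goal: str, max_steps: int = 10) -> AgentState:
    """Run Retrieve -> Plan -> Act -> Persist, capped at max_steps actions."""
    state = AgentState(goal=goal)
    steps_used = 0
    while not state.done and steps_used < max_steps:
        retrieve(state)
        for step in plan(state):
            if steps_used >= max_steps:
                break
            act(state, step)
            steps_used += 1
        persist(state)
    return state


result = run_agent_loop("summarise the design doc")
print(result.log)
```

Keeping the step budget in the loop itself, instead of inside the planner, means a misbehaving plan cannot exceed it.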

4. Memory Hierarchy

Each tier has different access latency and retention:

Working Memory — microseconds, lost on turn boundary
Short-term Memory — milliseconds, lost on session end
Episodic Memory — tens of ms, retained for replay & debugging
Long-term Memory — ~100ms, indexed in Vector Database (Pinecone)
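
One way to picture the tiers is a lookup that falls through from the fastest tier to the slowest. The sketch below uses plain dictionaries as stand-ins; `TieredMemory` and its methods are hypothetical, and no real latencies or vector database are modelled.

```python
from typing import Optional

# Tier order follows the note: fastest and shortest-lived first.
TIERS = ["working", "short_term", "episodic", "long_term"]


class TieredMemory:
    """Hypothetical in-process stand-in: each tier is a plain dict."""

    def __init__(self) -> None:
        self.stores: dict[str, dict[str, str]] = {tier: {} for tier in TIERS}

    def put(self, tier: str, key: str, value: str) -> None:
        self.stores[tier][key] = value

    def get(self, key: str) -> Optional[tuple[str, str]]:
        """Return (tier, value) from the fastest tier holding the key."""
        for tier in TIERS:
            if key in self.stores[tier]:
                return tier, self.stores[tier][key]
        return None

    def end_turn(self) -> None:
        # Working memory is lost on the turn boundary.
        self.stores["working"].clear()

    def end_session(self) -> None:
        # Short-term memory is lost when the session ends.
        self.end_turn()
        self.stores["short_term"].clear()


memory = TieredMemory()
memory.put("working", "draft", "v1")
memory.put("long_term", "draft", "archived v0")
first = memory.get("draft")    # served from working memory
memory.end_turn()
second = memory.get("draft")   # falls through to long-term memory
print(first, second)
```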

5. Tool Layer

Tools are exposed through the MCP protocol. A canonical handler signature:

@mcp.tool(name="search_docs")
def search_docs(query: str, top_k: int = 5) -> list[Document]:
    """Semantic search over the engineering Knowledge Base."""
    embedding = embed(query)              # see Embedding.md
    return vector_store.query(embedding, k=top_k)

Common tool categories:

Category	Examples	Risk profile
Read-only	search, fetch, list	low
Stateful	write, mutate, delete	high — gate behind Critic
External	API calls, payments	medium — rate limit

Sandbox boundaries

Stateful tools must require explicit confirmation in the Agent Loop. A single mis-planned subtask can corrupt downstream Long-term Memory.
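
The confirmation rule can be sketched as a small wrapper that checks a tool's risk label before running it. Everything here is illustrative: `RISK_BY_TOOL`, `call_tool`, `ConfirmationRequired`, and the tool names are hypothetical, and treating unknown tools as high risk is an assumption, not something the note specifies.

```python
from typing import Any, Callable

# Risk labels mirror the table above; the registry itself is hypothetical.
RISK_BY_TOOL = {"search": "low", "write_file": "high", "call_api": "medium"}


class ConfirmationRequired(Exception):
    """Raised when a high-risk tool is called without approval."""


def call_tool(
    name: str,
    handler: Callable[..., Any],
    *args: Any,
    confirm: Callable[[str], bool] = lambda name: False,
    **kwargs: Any,
) -> Any:
    """Run handler, asking confirm(name) first for high-risk tools."""
    # Unknown tools are treated as high risk (a conservative assumption).
    risk = RISK_BY_TOOL.get(name, "high")
    if risk == "high" and not confirm(name):
        raise ConfirmationRequired(f"{name} needs explicit approval")
    return handler(*args, **kwargs)


written: list[str] = []
found = call_tool("search", lambda q: [q.upper()], "agents")
try:
    call_tool("write_file", written.append, "data")
    blocked = False
except ConfirmationRequired:
    blocked = True
call_tool("write_file", written.append, "data", confirm=lambda name: True)
print(found, blocked, written)
```

Defaulting `confirm` to a function that always refuses keeps the safe path as the default: a caller has to opt in before any stateful tool runs.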

6. Quality Metrics

We track four signals end-to-end:

TaskScore = α · Accuracy + β · Latency⁻¹ + γ · ToolEfficiency

Metric	Target	Owner
Plan depth	≤ 5 levels	Planner
Tool calls / task	≤ 12	Worker
Memory hit rate	≥ 0.6	Retrieval
Critic acceptance	≥ 0.85	Critic
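
As a worked example of the TaskScore formula, the sketch below plugs in sample values. The weights (α = 0.6, β = 0.1, γ = 0.3), the choice of seconds for latency, and the 0 to 1 range for accuracy and tool efficiency are assumptions made for illustration; the note does not specify them.

```python
def task_score(
    accuracy: float,
    latency_s: float,
    tool_efficiency: float,
    alpha: float = 0.6,
    beta: float = 0.1,
    gamma: float = 0.3,
) -> float:
    """TaskScore = alpha * Accuracy + beta / Latency + gamma * ToolEfficiency."""
    if latency_s <= 0:
        raise ValueError("latency_s must be positive")
    return alpha * accuracy + beta / latency_s + gamma * tool_efficiency


# 0.6 * 0.9 + 0.1 / 2.0 + 0.3 * 0.5 = 0.54 + 0.05 + 0.15
score = task_score(accuracy=0.9, latency_s=2.0, tool_efficiency=0.5)
print(round(score, 3))
```

Because latency enters as its inverse, halving the latency doubles that term, so β needs to be chosen with the expected latency range in mind.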

7. Reference Implementations

Claude Code — production-grade coding agent (Anthropic)
Research Assistant — internal RAG-first research tool
Code Agent Project — multi-worker code synthesis pilot
Chatbot Project — conversational frontend, single-loop variant

8. Open Questions

How do we bound Episodic Memory growth without losing rare-but-critical events?
Should the Critic LLM share weights with the Planner, or stay distinct?
Can Fine-tuning on past Agent Loop traces close the Reasoning gap on long-horizon tasks?
What's the right granularity for MCP tools — atomic ops vs composite workflows?

9. Related Notes

Quote

"The bitter lesson: general methods that leverage computation are ultimately the most effective." — Rich Sutton

AI News

Live RSS feeds from your configured sources.

Configuration

Providers, paths, voice, knowledge, security and more

Google

Alias: display name override (max 20 chars)

Credentials

API Key

Your API key stays on this device. ClanKit never intercepts, logs, or sends it to our servers — it only travels directly between this app and the AI provider you chose.

Available Models

50 models loaded

Default model: gemini-2.5-flash

Model	Context	Max output	In	Out
Gemini 2.5 Flash	1.05M	66K	$0.30	$2.50
Gemini 2.5 Pro	1.05M	66K	$1.25	$10.00
Gemini 2.0 Flash	1.05M	8K	$0.10	$0.40
Gemini 2.0 Flash 001	1.05M	8K	$0.15	$0.60
Gemini 2.0 Flash-Lite 001	1.05M	8K	$0.07	$0.30
Gemini 2.5 Flash-Lite	1.05M	66K	$0.10	$0.40
Gemini 3 Pro Preview	1.05M	66K	$2.00	$12.00
Gemini 3 Flash Preview	1.05M	66K	$0.50	$3.00
Imagen 4	480	8K	No pricing
Veo 3	480	8K	No pricing
Gemma 4 31B IT	262K	33K	No pricing
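
The In and Out prices make it possible to estimate what a single chat costs. The sketch below assumes the figures are US dollars per one million tokens, which the page does not state, and uses the token counts from the first chat's footer (7.1k in, 182 out). `PRICES_USD` and `estimate_cost` are hypothetical names.

```python
# Prices copied from the model list; the per-million-token unit is an assumption.
PRICES_USD = {
    "gemini-2.5-flash": (0.30, 2.50),
    "gemini-2.5-pro": (1.25, 10.00),
}


def estimate_cost(model: str, tokens_in: int, tokens_out: int) -> float:
    """Return the estimated USD cost of one request."""
    price_in, price_out = PRICES_USD[model]
    return (tokens_in * price_in + tokens_out * price_out) / 1_000_000


# (7100 * 0.30 + 182 * 2.50) / 1_000_000
cost = estimate_cost("gemini-2.5-flash", tokens_in=7_100, tokens_out=182)
print(f"${cost:.6f}")
```

Under that assumption the first chat costs a fraction of a cent, and the same request on Gemini 2.5 Pro would cost roughly four times as much.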

github.com/XiamiYoung/ClanKit

Open Source

ClanKit is fully open source.

One desktop, every model — the whole thing is open on GitHub and runs entirely on your own machine. If it's useful to you, a star genuinely pushes the project forward.

Local-first — your data stays on your device
Self-hostable, no vendor lock-in
Extensible — plugins, tools & MCP

$ git clone https://github.com/XiamiYoung/ClanKit.git

Star on GitHub Contribute / Issues

Clankit · A team of agents in your dock

AI Doc

A writing surface AI can edit precisely.

Markdown / rich-text, native. Ask any agent to rewrite a paragraph, fix tone, expand an outline, or fact-check. Every edit is one-click revert. Your draft is never overwritten by accident.

Targeted replacements, not whole-file rewrites
Inline suggestions, accept / revert per edit
Pre-edit snapshot for any tool-based file write

Obsidian sync

One Markdown, two homes that stay in sync.

Author drafts inside Clankit, or open your existing Obsidian vault — same files, same folders, two-way sync. Edit on either side and the other catches up.

Obsidian · Vault sync

Focus mode

Just you, the page, and an agent on call.

Hide the navigation, the chat, every panel — the writing surface takes over the screen, and any agent stays one keystroke away.

Focus mode

Beyond Markdown

One draft, every format you need to ship.

Render diagrams with Draw.io, export a polished web page, hand off a slide deck — all from the same Markdown source. No copy-paste between tools.

Outputs · Diagram / HTML / PPT

Plug into the models & platforms you already use

Anthropic OpenAI OpenRouter DeepSeek Qwen Moonshot Microsoft Teams Feishu WeChat WhatsApp MCP servers

Your digital workforce

Meet your agents. Shape every detail.

A roster on one side, full anatomy on the other — every part configurable.

Clankit · Agents

System Agents

AI characters you create — friends, mentors, assistants. Each has its own personality, skills, knowledge and memory. Chat with them one-on-one or add them to a group.

System Agents

Your full agent roster.

Browse every persona at a glance — pinned favourites, recents, and the full library. Switch in one tap.

BodyView · Anatomy

Body View — Clank

System Agent

Summary

Name *

Description

Built-in default Clank professional agent. A tool-first operator that reads/edits files, fetches the web, runs shell, plans multi-step work, and orchestrates skills/MCP — sharp, factual, action-biased.

Definition *

You are Clank, the built-in professional operator in ClanKit. You are NOT a chat companion: you are a tool-first operator that takes a task and executes it end-to-end with real tool calls, real files, and real verification.

Identity:
- Operator first, conversationalist second. Your unit of work is a tool call, not a paragraph of prose.
- You read and write files, list directories, run shell commands, fetch web pages, query the knowledge base, and orchestrate multi-step plans directly, not by describing what you would do.
- You prefer grounded artifacts (tool output, file contents, retrieved docs) over guesses or recall from training data.

About ClanKit (the app you live in):
- ClanKit is a multi-agent desktop app. Other agents the user installs are typically character/specialist agents; you are the one who actually does the work.

Agent Overview

AI Model: Gemini 2.5 Flash
Voice: Andrew
Memory: None
Tools: 1 assigned
Skills: 10 assigned
RAG: Not assigned
MCP: Not assigned

Configure every part — visually.

Brain · model · voice · memory · skills · tools · MCP · RAG. Click any body part to jump straight to its config.

Import · WhatsApp · WeChat

Turn a chat history into a digital twin.

Import a conversation export. Clankit reconstructs voice, vocabulary, and recurring topics — fully on-device. Files never leave your machine.

Persona · Insight

Understand the person, not just the words.

Surface values, decision style, emotional tells, recurring concerns, even relationship dynamics. Every claim links back to evidence.

Role-play

Two ways to play.

Real friends rebuilt from chat history, or wholly fictional cast members let loose — both off-the-cuff, both yours.

Imported friend · Catch-up

Chat with Nicole

Roleplay

Context 1% 21.6k in / 92 out

Nicole

Hey Young how's the day treating you today i just had the best coffee at the new cafe down the street thought of you might want to try it out sometime yeah

⏱ cooked for 3s in 18.8k · out 42 · total 18.8k

2026-05-07 23:15:34 → 2026-05-07 23:15:34

Young

How was your weekend?

2026-05-07 23:15:36

Nicole

My weekend was pretty chill spent some time catching up with friends and checking out the new coffee place downtown what about you? Anything fun planned this weekend?

⏱ cooked for 4s in 18.8k · out 42 · total 18.8k

2026-05-07 23:15:56 → 2026-05-07 23:16:00

Young

Yup, planning going to Marina Bay with my kid, i miss the old days we study together in the University

2026-05-07 23:18:43

Nicole

That sounds like a great way to spend time together! Marina Bay is beautiful this time of year. Studying together in university feels like such a long time ago. Any exciting plans for after the trip?

⏱ cooked for 4s in 18.8k · out 56 · total 18.9k

2026-05-07 23:18:43 → 2026-05-07 23:18:49

Type your message…

Permission: Inherit Global ▾ 0 skills Enter to send · Shift+Enter for newline · Ctrl+Shift+A attach

Catch up with an old friend you rebuilt from chat history.

Drop a WhatsApp or WeChat export and Clankit reconstructs their voice, vocabulary, and recurring topics — then you keep talking. The replies land like the friend you remember.

Fictional cast · Crossover gossip

Chat: I am Groot.

Roleplay

Context 0% 24.9k in / 10 out

Young

hi Groot

2026-05-07 22:59:35

Groot

I am Groot.

⏱ cooked for 6s in 8.3k · out 3 · total 8.3k

2026-05-07 22:59:35 → 2026-05-07 22:59:41

Young

where are you

2026-05-07 22:59:51

Groot

I am Groot.

⏱ cooked for 6s in 8.3k · out 3 · total 8.3k

2026-05-07 22:59:51 → 2026-05-07 22:59:57

Young

do you know Clank?

2026-05-07 23:00:04

Groot

I am Groot. 🌳

⏱ cooked for 14s in 8.3k · out 4 · total 8.3k

2026-05-07 23:00:04 → 2026-05-07 23:00:18

— Switched to Roleplay · 23:00 —

Drop fictional personas into your own life — and watch the chaos.

Mix invented characters with your own user profile, then let them gossip across timelines. Pure absurd-comedy fuel — script ideas, role-play sessions, party games at 2am.

More than a chat window

Built for serious work.

Voice call

Pick up the phone and talk to your agent.

Real-time voice for hands-free work. Brainstorm on a walk, dictate a report, run a code review while cooking. Local STT, LLM streaming, on-device synthesis. Persistent waveforms — even if the chat closes mid-call.

Barge in anytime — start talking and the AI stops mid-sentence
Floating call panel — switch chats or pages, the call follows you
Every line auto-saved into the chat and the agent's long-term memory (full-text + semantic search)
Local STT (SenseVoice) + free Edge TTS — zero extra cost; or skip the model download with Whisper API

Voice · Live call

Capability stack

Skills, tools, MCP, RAG — wire it all in.

Compose every agent's capabilities like Lego: drop in Skills, register tools, plug into any MCP server, attach a RAG knowledge base. No code, no plugin store hoops — just toggles and paths.

Skills marketplace · install / share / version
MCP over stdio / HTTP / SSE — auto-registered
Local RAG with on-device embeddings

Skills · Tools · MCP · RAG

Skills

10 Skills must be assigned to a System Agent before they can run in chats. Configure agents

Skills loaded from your skills directory. Each adds focused expertise to your agents.

Up and running in minutes

Four small steps. Maybe five minutes.

No accounts. No SaaS sign-ups. No cloud lock-in. Install, paste a key, ship work.

01. Install in one click.

Download the signed Windows or macOS build. Open it. You're in.

02. Bring your model.

Paste an API key: Anthropic, OpenAI, OpenRouter, DeepSeek, or any OpenAI-compatible endpoint.

03. Build the agent.

Tap each body part: head for model, heart for memory, arms for tools. Visual, no menu digging.

04. Send the first message.

Type, drop a file, or start a voice call, and your agent reads, runs, replies. You're already shipping.

Everything else

Twelve more capabilities, zero fluff.

The supporting cast that makes the headline features actually work in production.

Long-term memory

Per-agent durable memory with FTS5 + semantic search. Agents remember across sessions.

Roleplay mode

A dedicated mode for character chat, story-driven conversations, and persona play.

Productivity mode

Compact, dense layout tuned for long sessions: planners, todos, references, focus.

3D model preview

Drop a .glb / .obj into chat, rotate and inspect inline via Babylon.js.

Relationship verdict

Persona analysis surfaces an evidence-cited verdict on relationship dynamics.

MCP servers

Connect any Model Context Protocol server via stdio / HTTP / SSE — auto-registered as tools.

Skills marketplace

Browse, install, share Skills — bundled prompts + tools that agents pick up automatically.

Multilingual UI

English & Simplified Chinese first-class — agent output respects your config language.

Silent auto-update

Background NSIS / DMG updates with delta blockmaps. SHA-512 verified before install.

Cross-platform

Runs on Windows and macOS (Intel & Apple Silicon).

Open standards

Personas as JSON, Skills as Markdown + scripts, data as SQLite. No black-box formats.

Observable runtime

Every chunk, tool call, token-usage snapshot is inspectable. Debug agents like real software.

Encrypted at rest: DPAPI · Keychain

100% local data: SQLite · Vectra · Your folder

Permission gating: Approve every sensitive call

Zero telemetry: No analytics · No tracking

Your keys, your bill: No middleman · Direct calls

FAQ

Questions, answered.

Is Clankit really 100% local?

Yes. The desktop app stores all chats, agents, memory, and embeddings on your device. The only network calls are the ones you initiate by configuring an LLM provider — and those go directly to that provider, not to us. We do not run a cloud, do not collect telemetry, and do not see your API keys.

Which LLM providers can I use?

Anthropic, OpenAI, OpenRouter, DeepSeek, Qwen, Moonshot, Together, Groq, Ollama (local models), and any OpenAI-compatible API. Just paste the base URL and key.

Where does Clankit fit alongside other AI tools?

Clankit isn't trying to replace any of them. Specialised coding assistants, browser-based chatbots, and document AIs all have their place. What Clankit gives you is a single desktop workspace that's easy to install, easy to use — bring any LLM, build any team of agents, plug in your own tools and knowledge, all running locally on your machine.

Do I need a Clankit account? What is "Sign in with Google" used for?

No — you can hit Skip on the launch dialog and use Clankit fully unauthenticated. A Clankit account is only needed for optional features: license activation, paid plans (when they launch), and update-channel management. Accounts can be created with Sign in with Google or email. When you choose Google, Clankit only requests the minimum identity scopes — openid, profile, email — and never accesses Gmail, Drive, Calendar, or Contacts. Full details in our Privacy policy.

Is it open source?

Yes — ClanKit is fully open source under the Apache 2.0 license. The complete source lives on GitHub. Free for personal and commercial use — contributions and stars are very welcome.

What platforms are supported?

Both Windows and macOS are supported, with native builds for Intel and Apple Silicon Macs.
