Vibe Coding with Specification

Published on 2025-09-03

Last updated on 2026-02-26

Technical

Over the past few days, I decided to revisit and improve ankihelper, an Android application for creating Anki flashcards. Originally developed by mmjang, development stopped in 2021. Despite this, I still use it to create flashcards for new words, so continuing its development seemed worthwhile. However, as someone without much experience in software development, I found myself struggling with basic principles. That’s when I turned to AI programming as my last resort. Last year, I managed to update the codebase using zed, which took me a week to make it run on Android 11 and above, but it’s still not perfect.

Writing Specifications for Code

Last month, I came across The New Code — Sean Grove, OpenAI on Twitter, and I became obsessed with the idea. The concept is simple: create a specification to guide AI in implementing features for you. It’s feasible, so I started working on it.

Since I’ve used AnkiHelper for a long time, I know what features I need. Here’s my plan:

Fix permission requests on first launch (new Android versions changed something, and it can’t request permission now)
Update the UI to Material 3
Popup Edit Mode Design Documentation
Add LLM feature (sometimes the built-in dictionary doesn’t have a word, so we can use LLM as a dictionary)
Remove unused dictionaries
Other UI improvements

Choosing Code Tools

CLI tools

Commercials
- Claude Code
- OpenAI Codex CLI
- Qwen Code / iFlow CLI / Qoder CLI
- Gemini CLI
- Copilot CLI
- kimi cli
- …
Opensource
- opencode
- code
- nanocoder
- octofriend
orchestrator
- https://github.com/moazbuilds/CodeMachine-CLI

claude code

This year, new code tools have emerged rapidly. Anthropic released claude, a command-line tool for coding, which is a game changer. OpenAI followed with Codex, Google released Gemini CLI, and Qwen team introduced qwen-code, among others.

Since I live in China, Qwen is my best choice. It may not have the best performance, but it’s free for developers, offering 2,000 requests per day with decent results.

make sure to configure context7 mcp server,

using nvm

1
curl -o- https://raw.githubusercontent.com/nvm-sh/nvm/v0.40.3/install.sh | bash
2
nvm install 22
3
nvm alias default 22

installation agent tools

1
npm install -g @anthropic-ai/claude-code

1
npm install -g @qwen-code/qwen-code@latest

1
npm install -g @google/gemini-cli

~/.qwen/settings.json

1
{
2
  "mcpServers": {
3
      "context7": {
4
        "httpUrl": "https://mcp.context7.com/mcp",
5
        "headers": {
6
          "CONTEXT7_API_KEY": "xxx"
7
        }
8
      },
9
      "deepwiki":{
10
          "httpUrl": "https://mcp.deepwiki.com/mcp"
11
      }
12
  },
13
  "selectedAuthType": "qwen-oauth"
14
}

I’d like to use claude, but it’s a bit expensive and requires some extra steps to access. now domestic comanpy also support cluade, I requested to use with claude

1
claude --dangerously-skip-permissions

~/.claude/settings.json

zhipu

1
{
2
  "env": {
3
    "ANTHROPIC_BASE_URL": "https://api.z.ai/api/anthropic", // or https://open.bigmodel.cn/api/anthropic
4
    "API_TIMEOUT_MS": "3000000",
5
    "ANTHROPIC_DEFAULT_HAIKU_MODEL": "glm-4.5-air",
6
    "ANTHROPIC_DEFAULT_SONNET_MODEL": "glm-4.6",
7
    "ANTHROPIC_DEFAULT_OPUS_MODEL": "glm-4.6",
8
    "ANTHROPIC_API_KEY": "xxx",
9
  }
10
}

deepseek

1
{
2
  "env": {
3
    "ANTHROPIC_BASE_URL":"https://api.deepseek.com/anthropic",
4
    "ANTHROPIC_API_KEY": "xxx",
5
    "ANTHROPIC_MODEL": "deepseek-chat",
6
    "ANTHROPIC_SMALL_FAST_MODEL": "deepseek-chat"
7
  }
8
}

moonshot

1
{
2
  "env": {
3
    "ANTHROPIC_BASE_URL":"https://api.moonshot.ai/anthropic",
4
    "ANTHROPIC_API_KEY": "xxx",
5
    "ANTHROPIC_MODEL": "kimi-k2-0905-turbo-preview",
6
    "ANTHROPIC_SMALL_FAST_MODEL": "kimi-k2-0905-turbo-preview"
7
  }
8
}

add some environment variables,
- MAX_MCP_OUTPUT_TOKENS: this will increase the allowed token from MCP server, sometimes MCP tools (such as figma) will respond with large content which is essential
```
1
"MAX_MCP_OUTPUT_TOKENS": 100000
```
- CLAUDE_CODE_MAX_OUTPUT_TOKENS: max output claude code
```
1
"CLAUDE_CODE_MAX_OUTPUT_TOKENS": 50000
```
- “ENABLE_TOOL_SEARCH”: “true”,
- “CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS”: “0”

set default mode

1
...
2
"permissions": {
3
  "defaultMode": "bypassPermissions",
4
  "allow": [
5
    "mcp__pencil"
6
  ]
7
},
8
...

onboarding
- ~/.claude.json "hasCompletedOnboarding": true, make sure claude will not ask to onboard again.
  Terminal window
```
1
jq -r '.hasCompletedOnboarding' ~/.claude.json
2
true
```
status line
- CCometixLine
- ccstatusline
  - configure status line show more info, especially for context, claude will compact the context when the context reached about 95%,
memory management
- nowledge mem
  - first install nowledge app on your mac, it will start mcp server locally
  - install the plugins in claude code
    1 /plugin marketplace add nowledge-co/community 2 /plugin install nowledge-mem@nowledge-community
  - add mcp server to claude code
    Terminal window
    1 claude mcp add --transport http nowledge-mem http://localhost:14242/mcp --scope user
    or
    Terminal window
    1 claude mcp add --transport http nowledge-mem http://localhost:14242/mcp
  - save/retrieve the memeory via slash commands or via mcp server
    - slash command /save or /sum
    - use mcp tools
- specstory: Turn your AI development conversations into searchable, shareable knowledge.
CLAUDE.md

init CLAUDE.md using zcf
- https://agents.md/
- https://www.builder.io/blog/agents-md
- https://www.aitmpl.com/agents
model switch/apihub tools
Claude Code API Switcher
cc-switch
cc-mirror
Claude Code Model Switcher
claude_code_router
CC Mate
Claude Relay Service: 自行搭建Claude API中转服务，支持多账户管理
axonhub: is an all-in-one AI development platform that provides unified API gateway, project management, and comprehensive development tools. It offers OpenAI, Anthropic, and AI SDK compatible API layers, transforming requests to various AI providers through a transformer pipeline architecture. The platform features comprehensive tracing capabilities, project-based organization, and integrated playground for rapid prototyping, helping developers and enterprises better manage AI development workflows.
quotio: Quotio is a native macOS application for managing CLIProxyAPI - a local proxy server that powers your AI coding agents. It helps you manage multiple AI accounts, track quotas, and configure CLI tools in one place.
AIClient-2-API:A powerful proxy that can unify the requests of various client-only large model APIs (Gemini CLI, Antigravity, Qwen Code, Kiro …), simulate requests, and encapsulate them into a local OpenAI-compatible interface.
AntigravityManager: Professional multi-account manager for Google Gemini & Claude AI
add mcp server

following the commands to add mcp tools to claude, you can use same command to add them in qwen, gemini, just replace claude with the desired command
- mcp registry
  - https://github.com/mcp
- context7
  
  create context7 api on https://context7.com/dashboard
  Terminal window
```
1
claude mcp add -s user -t http context7 https://mcp.context7.com/mcp --header "CONTEXT7_API_KEY: YOUR_API_KEY"
```
- deepwiki
  Terminal window
```
1
claude mcp add -s user -t http deepwiki https://mcp.deepwiki.com/mcp
```
- chrome-devtools-mcp lets your coding agent (such as Gemini, Claude, Cursor or Copilot) control and inspect a live Chrome browser
  Terminal window
```
1
claude mcp add -s user -t stdio chrome-devtools npx chrome-devtools-mcp@latest
```
  it can check and page, get screenshot to find if the page behaves like what you want
- figma, use official mcp, see https://www.figma.com/mcp-catalog/
  Terminal window
```
1
claude mcp add -s user -t http figma-remote-mcp https://mcp.figma.com/mcp
```
  add more rules to ~/.claude/CLAUDE.md for user scope and also project level CLAUDE.md, especially https://developers.figma.com/docs/figma-mcp-server/add-custom-rules#rules-to-ensure-consistently-good-output
- playwright
  Terminal window
```
1
claude mcp add -s user -t stdio playwright npx @playwright/mcp@latest
```
- exa: create api keys on https://dashboard.exa.ai/api-keys
  Terminal window
```
1
claude mcp add -s user -t http exa "https://mcp.exa.ai/mcp" --header "EXA_API_KEY: xxx"
```
- astro mcp
  Terminal window
```
1
claude mcp add -s user  --transport http astro-docs https://mcp.docs.astro.build/mcp
```
- time mcp
  Terminal window
```
1
claude mcp add -s user -t stdio time-mcp npx time-mcp
```
  add a rule to ~/.claude/CLAUDE.md or project CLAUDE.md which make claude also find the latest materials when search/find documents/materials
```
1
## Time MCP rules (MUST follow)
2

3
- in every prompt, add the current date and time as an extra info for context
```
- Claude Code with GLM coding plan to process images with its MCP(only work when claude code with glm 4.6 model)
  Terminal window
```
1
claude mcp add -s user  --env Z_AI_API_KEY=api_key Z_AI_MODE=ZAI -t stdio zai-mcp-server  npx  "@z_ai/mcp-server"
```
- stackoverflow
  Terminal window
```
1
claude mcp add -s user -t stdio stack-mcp-server npx  mcp-remote https://mcp.stackoverflow.com
```
some other configs for claude, may still need more configuration
- zcf
- spec-kit
- ruler: create CLAUDE.md
- dotclaude: multiple config, maybe only reference CLAUDE.md
- claude-code-templates: also maybe only reference CLAUDE.md
- CodeRabbit CLI: Free AI code reviews in your CLI

Getting Started

Initially, i just prompt for every spec

In the codebase, I created a specification directory. For each feature, I made a subdirectory for the AI to generate the spec.

For each feature, I simply describe the requirements, including UI layout (I don’t use Figma, just words), functions, and input/output formats.

1
Please create a new subdirectory under specifications for the following features/fixes:
2
- fix issue 1
3
- fix issue 2
4
- implment feature 1
5

6
<add more description or requirements here>
7

8
Follow this systematic approach:
9
1. Research Phase: Conduct comprehensive research including:
10
   - Best practices and design patterns
11
   - Official documentation and API references
12
   - Informative blog posts and tutorials
13
   - Relevant GitHub issues and discussions
14
   - Performance considerations and edge cases
15
   - Use MCP tools such `websearch`, `exa`, `context7`, `deepwiki` to find all kinds of materials
16
   - <other specific documents, such api and documents for a lib/package>
17
2. Specification Phase: Create comprehensive documents:
18
   - Technical specification with architecture decisions
19
   - Detailed implementation plan with milestones
20
   - Task list with prioritized subtasks
21
   - Add a final task to commit and push changes
22

23
3. after specification is created, do a second validation check
24

25
4. Execution Phase: Create and coordinate three specialized sub-agents in parallel:
26

27
   Development Agent responsibilities:
28
   - Generate code following established patterns
29
   - Apply linting and formatting
30
   - Build after each code generation
31
   - Document implementation choices
32

33
   Testing Agent responsibilities:
34
   - Validate builds run without errors
35
   - Execute CodeRabbit analysis with `coderabbit --prompt-only`, let it run as long as it needs (run it in the background) and fix any issues.
36
   - Write and run unit/integration tests
37
   - Document edge cases and test coverage
38
   - Use MCP tools to conduct additional testing wherever possible
39
   - For website pages project, use MCP tools such as `playwright` and `chrome-devtools` to test and validate the website pages: exercise inputs, checks, buttons, and navigation; do not invoke npm or playwright directly
40

41
   Documentation Agent responsibilities:
42
   - Track implementation progress
43
   - Create implementation-summary.md with:
44
     * Technical decisions rationale
45
     * Code structure overview
46
     * Challenges encountered and solutions
47
     * Performance metrics
48

49
5. Coordination: Progress through tasks sequentially:
50
   - Assign next task only after previous completion
51
   - Stop for human review at major checkpoints
52
   - Ensure all agents work with consistent context
53
   - Maintain a single source of truth for specifications
54

55
Upon completion of all tasks, commit with descriptive message and push changes to repository.

Update: on 2025-11-07

In the last couple of days, i improved the process by creating a cumstom commands, which fit call these process into a slash command, each time i just supply the required error info, or feature requirements, and additional documents

i think this is not perfect, i will create more process for different project with different command

create file under ~/.claude/commands/dev/

1
---
2
name: dev: impl-and-fix
3
description: fix the error or implement new functions during the development
4
category: dev
5
---
6
whenever user ask you to do following things
7
- fix a issue or bug
8
- fix build warnings or errors
9
- implement a new feature
10
- improve a feature
11
- improve the performance
12
- resolve the deprecation
13
- refactor the code
14

15
Follow this systematic approach:
16

17
1. Analyze the information user provided, identify which specification to
18
   - If found related specs, one or more choices, ask user to confirm
19
   - If not found any related specs, create one sub directory under specification, the name pattern of the sub directory is [index(number)]-[feature name of fix name]
20
   - Never create documents under project root or directly under specification, documents must be created under sub directory of specification
21
   - In the following phases, if the phases have output files, they should always be stored using in the sub specification directory [index(number)]-[feature name or fix name], with the document named [index(number)]-[document name].md
22

23
2. Clarify the requirements
24
   - If it is a feature, ask questions like following but not limited:
25
     - If it has design, such Figma design, if yes, ask user to provide
26
     - What technical stack will be used
27
     - What programming languages it will use
28
     - What is the structure
29
     - You can also find the latest best practices question/clarification and ask user to provide
30
     - what programming languages will be used
31

32
   - If it is a bug fix, ask questions like following but not limited:
33
     - What environment, mobile or desktop
34
     - OS
35
     - Browser
36
     - Screenshot with error message
37
     - Logs (build logs, runtime logs, debug logs)
38
     - You can also find the latest best practices question/clarification and ask user to provide
39

40
3. Research Phase: Conduct comprehensive research including:
41
   - When doing research, add current time from time MCP to context
42
   - Best practices and design patterns
43
   - Official documentation and API references
44
   - Informative blog posts and tutorials
45
   - Relevant GitHub issues and discussions
46
   - Performance considerations and edge cases
47
   - Use MCP tools like `websearch`, `exa`, `context7`, `deepwiki` to find materials
48
   - Use all available skills
49
   - Use specialist agents like search-expecialist, search-specialist
50
  - Find latest documents for latest versions of libraries/tools for development
51
   - Create one file:
52
     - Research report
53

54
4. Debug Phase: Conduct comprehensive research including (apply to error or bug):
55
   - Analyze the codebase based on information from the Research Phase
56
    - If you need to find any pattern in the code, you can use the `ast-grep` skill to identify and locate it
57
   - Propose root cause analysis
58
   - Utilize multiple skills
59
   - use specialist agents like debugging specialist,
60
   - Create files:
61
     - Debug analysis document
62

63
5. Assessment Phase: Assess the current code including architecture, code style, and frameworks:
64
   - Evaluate if current architecture is optimal compared to best practices
65
   - Determine if we are using the latest code rules and formatting standards
66
   - Check if we are using the latest packages, libraries, and frameworks
67
   - Identify if there are better options available
68
   - Create files:
69
     - Assessment document
70

71
6. Specification Phase: Create comprehensive documents:
72
   - Technical specification with architecture decisions by referencing the documents created by previous phases
73
    - Following the api documents, make sure use aligned api specification
74
   - Detailed implementation plan with milestones
75
   - Task list with prioritized subtasks
76
   - Add a final task to commit and push changes
77
   - Use specialist agents like backend-architect, database-architect, cloud-architect, and other relevant specialists to help create the specification
78
   - Create 3 files:
79
     - One specification
80
     - One implementation plan based on the specification
81
     - One task list broken down from the implementation plan
82
7. Review Phase: Review the specification, plan, and task list:
83

84
   - Ensure they are aligned with our requirements
85
   - Verify they follow current best practices
86
   - Confirm they include proper code constraints for industrial standards and best practices
87
   - Validate the specification is executable and testable
88

89
8. Execution Phase: Create and coordinate three specialized sub-agents simultaneously, running in parallel using `subagent-driven-development` skill:
90

91
   Prohibit to pause or stop during this phase. If there are multiple implementation options, always choose the one that continues to implement all tasks, unless it is not feasible or there is a better solution.
92

93
   During the execution phase, when you need to locate specific code patterns for modification (add/delete/edit operations), utilize the `ast-grep` skill to efficiently search and filter through the codebase.
94

95
   Development Agent responsibilities:
96
   - Generate code following established patterns
97
   - Apply linting and formatting
98
   - Build after each code generation
99
   - Document implementation choices
100
   - Make sure to use the latest libraries and tools
101
   - Ensure no build warnings or errors exist; all issues must be fixed, not suppressed
102
   - Organize code in modular, loosely-coupled components
103
   - Maintain consistent data schemas across the codebase
104
   - Employ specialist agents like rust-pro, backend-developer, frontend-developer,
105
     mobile-developer, ios-developer, and other relevant specialists
106

107
   Testing Agent responsibilities:
108
   - Validate builds run without errors
109
   - Execute CodeRabbit analysis with `coderabbit --prompt-only`, let it run as long as it needs (run it in the background) and fix any issues
110
   - Write and run unit/integration tests
111
   - Document edge cases and test coverage
112
   - Use MCP tools to conduct additional testing wherever possible
113
   - For website pages project, use playwright-skills and MCP tools such as `playwright` and `chrome-devtools` to test and validate the website pages: exercise inputs, checks, buttons, and navigation; do not invoke npm or playwright directly
114
   - Employ specilist agent like superpowers:code-reviewer and  other relevant specialists
115

116
   Documentation Agent responsibilities:
117
   - Track implementation progress
118
   - Create implementation-summary.md with:
119
     * Technical decisions rationale
120
     * Code structure overview
121
     * Challenges encountered and solutions
122
     * Performance metrics
123
   - Employ specialist agents like documentation-expert and api-documenter and other relevant specialists
124

125
   After execution, in addition to the code and test, create one more file:
126
   - One implementation summary
127

128
9. Coordination: Progress through tasks sequentially:
129
   - Assign next task only after previous completion
130
   - Ensure all agents work with consistent context
131
   - Maintain a single source of truth for specifications
132
   - Only request user confirmation when encountering significant architectural changes or blocking obstacles
133

134

135
10. Cleanup Phase: Perform comprehensive cleanup:
136
    - Remove any temporary files or code created during the process
137
    - Delete obsolete code that was replaced during implementation
138
    - Remove unused imports and dependencies
139
    - Clean up debug logs and comments that are no longer needed
140
    - Ensure no leftover development artifacts remain in the codebase
141

142
11. Upon completion of all tasks, commit with descriptive message and push changes to repository.

Implementation

After several iterations, the spec is ready. I ask Qwen Code to implement the feature, strictly following the spec.

During the process, it will ask for permissions such as create files, execute commands, or build the project and so on. You can approve these requests. Any errors that occur are captured and fixed automatically. This saves a lot of time compared to when I used Zed for development.

sometimes, it may distract from the spec, you still need to stop it, ask to stickly follow your spec.

Debugging

Sometimes, the implementation and build process succeed, but when installed on the phone, the app crashes. In that case, I post the crash log, and the AI analyzes and fixes it.

1
adb logcat | grep ankihelper

This command retrieves logs from the application.

also we can start the app from adb

1
adb shell am start -n com.mmjang.ankihelper/.ui.LauncherActivity

this will start the laucher activity of the application.

It’s often helpful to ask the AI to add more debug logs to identify problems. Adding more log.d() statements in the code helps:

1
log.d("the response of the function");

using adb to show the memory leak when build and deployed app is a debug app

1
adb logcat -s LeakCanary:*

Takeaways

Here’s what I learned from the process:

Communication is key. Make sure the AI fully understands your requirements and creates an executable spec.
Have a clear blueprint for the feature. If your vision is ambiguous, the implementation will likely fail.

After updating AnkiHelper, I tried to use the same approach to integrate Marp presentations into Astro. I wanted to create an integration to convert Marp Markdown presentations to html and served in astro project, but after several rounds, I still couldn’t succeed. This was because I didn’t fully understand Astro’s internal build pipeline, including collections and integrations.

Today, I used Gemini Guided Learning to try to understand the process and key points of Astro integration. Maybe I’ll find a new angle to create a separate spec and try again. Using what I learned from Gemini Guided Learning, I’ve updated my site to better handle Marp presentations.

Tools

CodeRabbit CLI

install
Terminal window
```
1
curl -fsSL https://cli.coderabbit.ai/install.sh | sh
```
then execute to login to get authenticate token

use it

with claude

1
implement the tasks and run `coderabbit --prompt-only`, let it run as long as it needs (run it in the background) and fix any issues.

or use it separately in a code base by simply run
Terminal window
```
1
coderabbit
```

spec-kit: Toolkit to help you get started with Spec-Driven Development,

here is the steps to use it

install uv

1
curl -LsSf https://astral.sh/uv/install.sh | bash

install specify with uv

1
uv tool install specify-cli --from git+https://github.com/github/spec-kit.git

install and initialize

1
specify init my-project --ai claude  --script sh

or initialize in the current directory

1
specify init  --here --ai claude  --script sh

ask llm to fill the constitution

1
/constitution fill the constitution wiwth the basre minumun requirement for <xxxx app> based on the template

create the spec

1
/specify <description of the requirement, not the technical details>

create plan from the spec

1
/plan <add some technical requirements, ask it to create the plans>

breakdown the plan to tasks
Terminal window
```
1
/tasks break down the plan to tasks
```

implement, work with coderabbit

1
/implement implement the tasks one by one and run `coderabbit --prompt-only`, let it run as long as it needs (run it in the background) and fix any issues.

openspec: Toolkit to help you get started with Spec-Driven Development,
- init openspec in your project
  Terminal window
```
1
$ openspec init
```
- draft a proposal via /openspec:proposal add <new feature>
- check and review what openspec creates
  Terminal window
```
1
$ openspec list                       # Confirm the change folder exists
2
$ openspec validate <new feature>     # Validate spec formatting
3
$ openspec show <new feature>         # Review proposal, tasks, and spec delta
```
- Refine the Specs, if required
- Implement the Change via /openspec:apply <new feature>
Hooks
- https://github.com/disler/claude-code-hooks-mastery
- https://github.com/diet103/claude-code-infrastructure-showcase
- pre-compact Hooks
each time, when the context window is full, claude will auto compact the session, so some info will be lost, so i created a pre-compact hook to save the memory and thread when this event happend
- add hooks config in ~/.claude/settings.json
```
1
"hooks": {
2
  "PreCompact": [
3
    {
4
      "hooks": [
5
        {
6
          "type": "command",
7
          "command": "node $CLAUDE_HOME/hooks/pre-compact.js"
8
        }
9
      ]
10
    }
11
  ]
12
}
```
- save the hook at ~/.claude/hooks/pre-compact.js this hook uses mcporter to call

Appendix

config for linux to limit resource usage

/etc/systemd/system/user@1001.service.d/override.conf

1
[Service]
2
User=jenningsl
3
Slice=user-10001.slice
4
Delegate=pids memory cpu cpuset io

/etc/systemd/system/user-1001.slice.d/override.conf

1
[Unit]
2
Description=Resource limits  for user jenningsl
3
DefaultDependencies=no
4
Before=slices.target
5
Requires=-.slice
6
After=-.slice
7
[Slice]
8
MemoryAccounting=true
9
MemoryHigh=75%
10
MemoryMax=85%
11
IOAccounting=true
12
IOReadBandwidthMax=/dev/vda 70M
13
IOWriteBandwidthMax=/dev/vda 80M
14
CPUQuota=150%
15
AllowedCPUs=0-1

Back to Blog