Getting Started: Setting Up Your Instruction File¶

The instruction file is the highest-leverage artifact in agent-assisted development: it gives agents the context to navigate your codebase, follow conventions, and run your toolchain.

Pick your file¶

Each tool reads a different file. Pick the one that matches your tool, or keep several if your team uses more than one:

Tool	File	Location
Claude Code	`CLAUDE.md`	Repo root or `.claude/CLAUDE.md`
GitHub Copilot	`copilot-instructions.md`	`.github/copilot-instructions.md`
Any AGENTS.md-compatible tool	`AGENTS.md`	Repo root

Using more than one tool? See how to converge multiple instruction files. The rest of this page is tool-agnostic -- the principles apply whatever the file name.

Bootstrap or start from scratch¶

Claude Code users: run /init. Claude reads your codebase -- build systems, test frameworks, code patterns -- and generates a starting file (Anthropic, Best Practices for Claude Code). If a CLAUDE.md already exists, /init suggests improvements rather than overwriting it.

Everyone else: create the file by hand. A blank file with four sections beats no file at all.

Minimal viable structure¶

Start with these four sections. Each one answers a question the agent asks within its first few actions:

# project-name

Brief description of what the project does and its primary language/framework.

## Commands

- Build: `npm run build`
- Test: `npm test`
- Lint: `npm run lint`
- Single test: `npm test -- path/to/file.test.ts`

## Conventions

- Commits follow Conventional Commits format
- Use 2-space indentation
- Error handling: return errors, don't throw

## Architecture

- API handlers: `src/api/handlers/`
- Database models: `src/models/`
- Shared utilities: `src/lib/`
- See `docs/architecture.md` for full system design

That is roughly 20 lines. It gives the agent enough to navigate, build, test, and follow your conventions from the first interaction.

What to include and what to leave out¶

Include	Exclude
Build, test, lint commands	Full documentation (link instead)
Conventions that deviate from defaults	Generic advice ("write clean code")
Architectural constraints and navigation pointers	Task-specific instructions (put in the prompt)
Things the agent gets wrong repeatedly	Knowledge the agent can discover from code

Pruning test: for each line, ask "Would removing this cause the agent to make mistakes?" If not, cut it.

Iterate, do not prewrite¶

The most effective instruction files are grown, not designed upfront. The progression we recommend:

graph LR
    A["Week 1<br/>Navigation only"] --> B["Week 2<br/>Add one linter rule"]
    B --> C["Week 3<br/>Add one feedback loop"]
    C --> D["Week 4+<br/>Encode principles<br/>as lint rules"]

Week 1: project identity, build/test commands, directory layout. About 20 lines. Enough to stop the agent from guessing where things live.

Week 2: add the convention the agent breaks most often. One rule, stated specifically. "Use unknown over any in TypeScript" is useful; "follow TypeScript best practices" is not. GitHub's guidance for copilot-instructions.md says the same: "start small and iterate based on results, beginning with 10–20 specific instructions that address your most common review needs, then test whether these are influencing Copilot" (GitHub, Adding repository custom instructions for GitHub Copilot).

Week 3: add a feedback loop -- a command the agent should run to check its own output. "Run npm test before committing" or "Run npx eslint --fix after editing .ts files."

Week 4 onward: when you find yourself adding the same instruction again and again, encode it as a lint rule or pre-commit hook instead. Deterministic enforcement beats probabilistic compliance.

Keep it short¶

Target under 200 lines per file. Every line consumes context budget before the agent starts working on your actual task. Long instruction files reduce adherence — instruction-following accuracy degrades as instruction density increases, with even leading frontier models achieving only 68% accuracy at 500 instructions (IFScale, 2025).

When you outgrow 200 lines:

Claude Code: split into @path imports or .claude/rules/ files with path-scoped frontmatter
Copilot: use .github/instructions/*.instructions.md files with applyTo globs
AGENTS.md: break into multiple AGENTS.md files in subdirectories

Instructions are context, not enforcement¶

Agents read instruction files on a best-effort basis. They are not configuration. Specificity and conciseness improve compliance, but they cannot guarantee it — adherence is bounded by the instruction compliance ceiling.

For rules that must never be violated, use deterministic mechanisms:

Pre-commit hooks
CI checks
Linter rules
File permission restrictions

The instruction file tells the agent what to aim for. Hooks and CI tell it what it cannot ship. Both are necessary; neither is sufficient alone. Anthropic's own guidance agrees: "Use hooks for actions that must happen every time with zero exceptions. Unlike CLAUDE.md instructions which are advisory, hooks are deterministic and guarantee the action happens" (Anthropic, Best Practices for Claude Code).

Let the agent write its own file¶

Ask the agent to draft or improve the instruction file after it has explored the codebase. It surfaces context it actually needs rather than what you guess:

Analyze this codebase and draft a CLAUDE.md covering build/test
commands, key conventions, and directory layout. Keep it under 50 lines.

Review and trim the output -- the agent often discovers conventions you follow implicitly but never documented. Treat it as a first draft, not a finished file.

Example: from zero to effective¶

A real progression for a TypeScript API project:

Day 1Week 2Month 2

# billing-api

TypeScript + Express API for subscription billing.

## Commands

- Test: `pnpm test`
- Build: `pnpm build`
- Dev: `pnpm dev`

# billing-api

TypeScript + Express API for subscription billing.

## Commands

- Test: `pnpm test`
- Build: `pnpm build`
- Dev: `pnpm dev`
- Single test: `pnpm test -- --testPathPattern=<file>`

## Conventions

- Use `unknown` over `any`
- Errors return Result types, never throw
- All handlers in `src/handlers/` export a single default function

# billing-api

TypeScript + Express API for subscription billing. Monorepo with
`packages/api`, `packages/shared`, `packages/worker`.

## Commands

- Test: `pnpm test`
- Build: `pnpm build`
- Lint: `pnpm lint` (run before committing)
- Single test: `pnpm test -- --testPathPattern=<file>`

## Conventions

- Use `unknown` over `any`
- Errors return Result types, never throw
- All handlers in `src/handlers/` export a single default function
- Database queries use the repository pattern in `src/repos/`
- Shared types live in `packages/shared/src/types/`

## Architecture

- See `docs/architecture.md` for system design
- Webhook processing: `packages/worker/src/webhooks/`
- Do not modify `src/generated/` — these files are auto-generated

Each version adds only what the agent needed and did not have. Nothing is added speculatively.

When this backfires¶

Instruction files create value when they are maintained. They create liability when they are not:

Stale structural references mislead. Directory paths, file names, and module boundaries change. An instruction file that documents src/api/handlers/ after a refactor actively directs the agent to the wrong place. Update the file or remove the reference when the codebase changes.
Auto-generated files underperform. Asking the agent to draft its own instruction file is a useful bootstrapping technique, but LLM-generated context files tend to be generic and verbose. The output works as a first draft — not a finished file; shipping it unread is the cargo-cult agent setup failure. Review and trim hard before committing.
Over-specification reduces adherence. Adding more rules does not guarantee more compliance. Instruction-following accuracy degrades as instruction density increases. A file with 30 specific, high-signal rules outperforms one with 150 that includes noise.

Key Takeaways¶

Start with a ~20-line file covering project identity, build/test commands, and directory layout — enough to stop the agent from guessing
Use tool-provided bootstraps (/init for Claude Code) rather than hand-writing from scratch
Grow the file in response to observed failures; do not prewrite rules the agent has not yet violated
Keep total length under 200 lines — instruction-following accuracy degrades as instruction density climbs
When the same correction repeats, encode it as a pre-commit hook, linter rule, or CI check — deterministic enforcement beats probabilistic compliance
Prune stale structural references (paths, file names) when the codebase changes — outdated instructions actively misdirect the agent