Agent Loop - Coding Agent

Overview

This project is a simple coding agent implemented with an Agent Loop architecture.

It calls an LLM in a loop using Vercel's AI SDK v5. Depending on each response, it can read files, search the repo, write patches, run commands, and evaluate work quality. The loop continues until the agent reaches a final answer or hits the maximum number of iterations.
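The loop described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual implementation: the `Decision` shape, the `DecideFn` and `ToolFn` types, and the `runAgentLoop` name are all hypothetical, and the LLM call is abstracted behind a `decide` callback so the sketch does not depend on any SDK.

```typescript
// Hypothetical shapes for illustration; the project's real types may differ.
interface Decision {
  action: string; // tool name, or "final_answer"
  input?: unknown; // tool-specific arguments
  summary?: string; // only meaningful for "final_answer"
}

type DecideFn = (transcript: string[]) => Promise<Decision>;
type ToolFn = (input: unknown) => Promise<string>;

interface LoopResult {
  status: "done" | "max_steps";
  steps: number;
  answer?: string;
  transcript: string[];
}

async function runAgentLoop(
  goal: string,
  decide: DecideFn,
  tools: Map<string, ToolFn>,
  maxSteps = 20,
): Promise<LoopResult> {
  const transcript: string[] = [`goal: ${goal}`];

  for (let step = 1; step <= maxSteps; step++) {
    // Ask the model (here: any callback) what to do next.
    const decision = await decide(transcript);

    if (decision.action === "final_answer") {
      return { status: "done", steps: step, answer: decision.summary ?? "", transcript };
    }

    // Run the requested tool, or record an error for an unknown tool name.
    const tool = tools.get(decision.action);
    const result = tool
      ? await tool(decision.input)
      : `error: unknown tool "${decision.action}"`;

    // Feed the outcome back so the next decision can see it.
    transcript.push(`${decision.action}: ${result}`);
  }

  return { status: "max_steps", steps: maxSteps, transcript };
}
```

Keeping the model call behind a callback makes the loop easy to exercise with a scripted sequence of decisions before wiring in a real provider.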

Supported AI Providers

The agent supports multiple AI providers through Vercel's AI SDK v5:

OpenAI (GPT-5-mini, GPT-5, etc.)
Anthropic (Claude 4.x Sonnet, etc.)
Google (Gemini 2.5 Flash, etc.)

You can switch between providers using the --provider CLI option or by setting the appropriate environment variables.

Key Features

Manual Tool Calls: Uses manual tool calls instead of OpenAI function calling for more control
Dual Patch Formats: Supports both full-file patches (for new files) and unified diff patches (for incremental improvements)
AST-Based Refactoring: Advanced TypeScript refactoring using the TypeScript compiler API for symbol renaming, import management, and structural changes
Structured Patch Generation: Generate precise patches from natural language instructions with line-specific edits
Work Evaluation: Built-in evaluation tool that analyzes created files and provides structured feedback with scores, strengths, improvements, and specific suggestions
Diff Parsing: Unified diff patch parsing with comprehensive error handling
Iterative Workflow: Agent follows a structured workflow: create → evaluate → improve with diff patches → re-evaluate
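To make the diff parsing feature more concrete, here is a small sketch of parsing a unified diff hunk header with explicit error handling. It follows the standard unified diff format (`@@ -start,count +start,count @@`, where an omitted count means 1). The `HunkHeader` type and `parseHunkHeader` function are hypothetical names, and the project's own parser may be structured differently.

```typescript
// Hypothetical type for illustration.
interface HunkHeader {
  oldStart: number;
  oldCount: number;
  newStart: number;
  newCount: number;
}

// Parses a line such as "@@ -12,5 +12,7 @@ optional section text".
function parseHunkHeader(line: string): HunkHeader {
  const match = /^@@ -(\d+)(?:,(\d+))? \+(\d+)(?:,(\d+))? @@/.exec(line);
  if (!match) {
    throw new Error(`Malformed hunk header: ${line}`);
  }
  const [, oldStart, oldCount, newStart, newCount] = match;
  return {
    oldStart: Number(oldStart),
    // In unified diff format, a missing count defaults to 1.
    oldCount: oldCount === undefined ? 1 : Number(oldCount),
    newStart: Number(newStart),
    newCount: newCount === undefined ? 1 : Number(newCount),
  };
}
```

Failing loudly on a malformed header lets the agent report the problem back to the model instead of applying a patch at the wrong location.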

Tools Available

read_files: Read and analyze existing files
search_repo: Search the repository for patterns or content
write_patch: Apply patches in unified diff format (preferred) or full-file format
generate_patch: Generate structured patches from natural language instructions
ast_refactor: Perform AST-based refactoring operations using TypeScript compiler API
run_cmd: Execute shell commands
evaluate_work: Analyze files and provide structured feedback for improvements
final_answer: Complete the task and generate a summary
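With manual tool calls, the model's reply has to be parsed and mapped to one of the tools above. The following sketch shows one way that could look. The JSON field name `action`, the `ParsedDecision` type, and the `parseDecision` function are assumptions for illustration; the fallback to final_answer on a parse error mirrors the behavior shown in the detailed architecture diagram, and treating non-object JSON as a parse error is an additional assumption.

```typescript
const TOOL_NAMES = [
  "read_files",
  "search_repo",
  "write_patch",
  "generate_patch",
  "ast_refactor",
  "run_cmd",
  "evaluate_work",
  "final_answer",
] as const;

type ToolName = (typeof TOOL_NAMES)[number];

// Hypothetical result shape for illustration.
type ParsedDecision =
  | { kind: "tool"; action: ToolName; payload: Record<string, unknown> }
  | { kind: "unknown"; action: string };

function isToolName(value: string): value is ToolName {
  return (TOOL_NAMES as readonly string[]).includes(value);
}

function fallbackDecision(): ParsedDecision {
  return { kind: "tool", action: "final_answer", payload: {} };
}

function parseDecision(responseText: string): ParsedDecision {
  let value: unknown;
  try {
    value = JSON.parse(responseText);
  } catch {
    // Unparseable reply: default to final_answer.
    return fallbackDecision();
  }
  if (typeof value !== "object" || value === null || Array.isArray(value)) {
    // Assumption: valid JSON that is not an object is handled like a parse error.
    return fallbackDecision();
  }
  const payload = value as Record<string, unknown>;
  const action = payload["action"];
  if (typeof action === "string" && isToolName(action)) {
    return { kind: "tool", action, payload };
  }
  // Unrecognized tool name: the caller can log it and add it to the transcript.
  return { kind: "unknown", action: String(action) };
}
```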

High-Level Agent Loop

flowchart LR
    A[User Goal] --> B[Agent Loop]
    B --> C[LLM Decision]
    C --> D[Execute Tool]
    D --> E[Update Context]
    E --> F{Task Complete?}
    F -->|No| C
    F -->|Yes| G[Final Answer]
    
    subgraph "Available Tools"
        H[read_files]
        I[search_repo]
        J[write_patch]
        K[generate_patch]
        L[ast_refactor]
        M[run_cmd]
        N[evaluate_work]
    end
    
    D -.-> H
    D -.-> I
    D -.-> J
    D -.-> K
    D -.-> L
    D -.-> M
    D -.-> N
    
    style A fill:#e1f5fe
    style B fill:#fff3e0
    style C fill:#f3e5f5
    style D fill:#e8f5e8
    style E fill:#fff9c4
    style F fill:#ffecb3
    style G fill:#c8e6c9
    style H fill:#f0f8ff
    style I fill:#f0f8ff
    style J fill:#f0f8ff
    style K fill:#f0f8ff
    style L fill:#f0f8ff
    style M fill:#f0f8ff
    style N fill:#f0f8ff

Detailed Architecture Diagram

flowchart TD
    A[Start Agent] --> B[Initialize Config & Reset Token Stats]
    B --> C[Setup Safety Caps & Transcript]
    C --> D[Step Counter: 1 to maxSteps]
    D --> E[Make LLM API Call with Retries]
    
    E --> F{API Call Success?}
    F -->|Failed| G[Log Error & Return with Token Stats]
    F -->|Success| H[Parse JSON Response]
    
    H --> I{Parse Success?}
    I -->|Parse Error| J[Default to final_answer]
    I -->|Success| K{Decision Type?}
    
    K -->|read_files| L[Read Files Handler]
    K -->|search_repo| M[Search Repository Handler]
    K -->|write_patch| N[Write Patch Handler]
    K -->|generate_patch| O[Generate Patch Handler]
    K -->|ast_refactor| P[AST Refactor Handler]
    K -->|run_cmd| Q[Run Command Handler]
    K -->|evaluate_work| R[Evaluate Work Handler]
    K -->|final_answer| S[Generate Summary with LLM]
    K -->|unknown| T[Log Error & Add to Transcript]
    
    L --> U[Update Transcript with Results]
    M --> U
    N --> V{Check Write Limit}
    O --> W{Check Write Limit}
    P --> X{Check Write Limit}
    Q --> Y{Check Command Limit}
    R --> Z[Analyze Files & Generate Structured Feedback]
    T --> U
    
    V -->|Within Limit| U
    V -->|Exceeded| AA[Stop: Write Limit Reached]
    W -->|Within Limit| U
    W -->|Exceeded| AA
    X -->|Within Limit| U
    X -->|Exceeded| AA
    Y -->|Within Limit| U
    Y -->|Exceeded| BB[Stop: Command Limit Reached]
    
    Z --> CC[Add Evaluation Results to Transcript]
    CC --> U
    
    U --> DD{Step < maxSteps?}
    DD -->|Yes| D
    DD -->|No| EE[Stop: Max Steps Reached]
    
    S --> FF[Display Token Summary & Return Result]
    AA --> FF
    BB --> FF
    EE --> FF
    G --> FF
    FF --> GG[Process Exit]
    
    style A fill:#e1f5fe
    style FF fill:#c8e6c9
    style GG fill:#ffcdd2
    style E fill:#fff3e0
    style K fill:#f3e5f5
    style R fill:#e8f5e8
    style Z fill:#e8f5e8
    style CC fill:#e8f5e8
    style V fill:#fff9c4
    style W fill:#fff9c4
    style X fill:#fff9c4
    style Y fill:#fff9c4
    style O fill:#f0f8ff
    style P fill:#f0f8ff
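The write and command limit checks in the diagram can be illustrated with a small counter. This is a sketch under stated assumptions: the `Usage`, `Caps`, and `checkLimit` names are hypothetical, and it assumes that exactly `maxWrites` writes (or `maxCommands` commands) are allowed and the next one stops the run. The real agent may count or check differently.

```typescript
// Hypothetical types for illustration.
interface Usage {
  writes: number;
  commands: number;
}

interface Caps {
  maxWrites: number;
  maxCommands: number;
}

type LimitCheck =
  | { ok: true; usage: Usage }
  | { ok: false; usage: Usage; reason: string };

// Records one write or command and reports whether the run may continue.
function checkLimit(usage: Usage, caps: Caps, kind: "write" | "command"): LimitCheck {
  const next: Usage = { ...usage };
  if (kind === "write") {
    next.writes += 1;
  } else {
    next.commands += 1;
  }
  if (next.writes > caps.maxWrites) {
    return { ok: false, usage: next, reason: "Write Limit Reached" };
  }
  if (next.commands > caps.maxCommands) {
    return { ok: false, usage: next, reason: "Command Limit Reached" };
  }
  return { ok: true, usage: next };
}
```

Returning a new `Usage` object instead of mutating the input keeps each step's accounting easy to log alongside the transcript.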

CLI Usage

Installation

Install globally to use from anywhere on your system:

npm install -g .

If the installed CLI file is not executable, add execute permission:

chmod +x <path>/agent-loop/dist/src/cli.js

You can then run it with the agent-loop command, or use it directly without installation:

npx agent-loop

Basic Usage

Run the CLI in interactive mode:

agent-loop

Or provide a prompt directly:

agent-loop --prompt "Create a simple HTML page with CSS styling"

CLI Options

agent-loop [options]

Options:
  -p, --prompt <prompt>           Direct prompt to execute (skips interactive mode)
  -m, --max-steps <number>        Maximum number of steps to execute (default: 20)
  -w, --max-writes <number>       Maximum number of file writes (default: 10)
  -c, --max-commands <number>     Maximum number of commands to run (default: 20)
  --provider <provider>           AI provider to use (openai, anthropic, google) (default: openai)
  --model <model>                 Specific model to use (optional)
  --no-console-log                Disable console logging
  --file-log                      Enable file logging
  --log-file <path>               Log file path (default: agent-log.txt)
  --test-command <command>        Test command to run (default: npm test --silent)
  --test-args <args>              Test command arguments (comma-separated)
  -h, --help                      Display help for command
  -V, --version                   Display version number
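As an illustration of how the numeric limits and the comma-separated --test-args value might be interpreted, here is a small sketch. The helper names `parseLimit` and `parseTestArgs` are hypothetical, and rejecting non-positive or non-integer limits is an assumption rather than documented behavior of the CLI.

```typescript
// Parses a numeric option such as --max-steps, falling back to the default
// when the option was not given. Rejecting invalid values is an assumption.
function parseLimit(raw: string | undefined, fallback: number): number {
  if (raw === undefined) {
    return fallback;
  }
  const value = Number(raw);
  if (!Number.isInteger(value) || value <= 0) {
    throw new Error(`Expected a positive integer, got "${raw}"`);
  }
  return value;
}

// Splits a comma-separated --test-args value into individual arguments.
function parseTestArgs(raw: string | undefined): string[] {
  if (!raw) {
    return [];
  }
  return raw
    .split(",")
    .map((part) => part.trim())
    .filter((part) => part.length > 0);
}
```

For example, `parseLimit(undefined, 20)` yields the documented default of 20 steps, and `parseTestArgs("test,run")` yields the two arguments `test` and `run`.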

Examples

# Interactive mode
agent-loop

# Direct prompt
agent-loop --prompt "Create a React component for a todo list"

# With custom limits
agent-loop --prompt "Build a calculator app" --max-steps 30 --max-writes 15

# With custom test command
agent-loop --prompt "Create a Node.js API" --test-command "npm" --test-args "test,run"

# With file logging
agent-loop --prompt "Create a website" --file-log --log-file my-agent.log

# Using different AI providers
agent-loop --prompt "Create a React app" --provider anthropic
agent-loop --prompt "Build a Python API" --provider google --model gemini-1.5-pro
agent-loop --prompt "Write TypeScript types" --provider openai --model gpt-4

Environment Variables:

AI Provider API Keys (choose one):

OPENAI_API_KEY : Your OpenAI API key
ANTHROPIC_API_KEY : Your Anthropic API key
GOOGLE_API_KEY : Your Google API key

Agent Configuration:

AGENT_CONSOLE_LOGGING=false : Disable console logging (default: true)
AGENT_FILE_LOGGING=true : Enable file logging (default: false)
AGENT_LOG_FILE=path/to/log : Log file path (default: agent-log.txt)
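The logging variables above could be read along these lines. This is a sketch, not the project's code: `LoggingConfig` and `loggingConfigFromEnv` are hypothetical names, and the exact string comparisons (only "false" disables console logging, only "true" enables file logging) are assumptions based on the defaults listed above.

```typescript
type Env = Record<string, string | undefined>;

// Hypothetical config shape for illustration.
interface LoggingConfig {
  console: boolean;
  file: boolean;
  logFile: string;
}

function loggingConfigFromEnv(env: Env): LoggingConfig {
  return {
    // Console logging defaults to true; only the literal "false" disables it.
    console: env["AGENT_CONSOLE_LOGGING"] !== "false",
    // File logging defaults to false; only the literal "true" enables it.
    file: env["AGENT_FILE_LOGGING"] === "true",
    // An unset or empty path falls back to the documented default.
    logFile: env["AGENT_LOG_FILE"] || "agent-log.txt",
  };
}
```

In a Node.js program this would typically be called with `process.env`; taking the environment as a parameter keeps the function easy to test.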

Provider Selection:

You can specify which AI provider to use via CLI options:

--provider openai : Use OpenAI (default)
--provider anthropic : Use Anthropic
--provider google : Use Google
--model <model-name> : Specify a specific model (optional)
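Provider selection and the API key variables listed earlier fit together roughly as in the sketch below. The `resolveProvider` function and its error messages are hypothetical; only the provider names, the openai default, and the environment variable names come from this README.

```typescript
// Maps each supported provider to the environment variable holding its key.
const API_KEY_ENV = {
  openai: "OPENAI_API_KEY",
  anthropic: "ANTHROPIC_API_KEY",
  google: "GOOGLE_API_KEY",
} as const;

type Provider = keyof typeof API_KEY_ENV;

function isProvider(value: string): value is Provider {
  return Object.prototype.hasOwnProperty.call(API_KEY_ENV, value);
}

// Hypothetical helper: validates the --provider value and looks up its key.
function resolveProvider(
  flag: string | undefined,
  env: Record<string, string | undefined>,
): { provider: Provider; apiKey: string } {
  const name = flag ?? "openai"; // openai is the documented default
  if (!isProvider(name)) {
    throw new Error(`Unknown provider "${name}" (expected openai, anthropic, or google)`);
  }
  const variable = API_KEY_ENV[name];
  const apiKey = env[variable];
  if (!apiKey) {
    throw new Error(`Missing ${variable} for provider "${name}"`);
  }
  return { provider: name, apiKey };
}
```

Checking for the key up front gives a clear error before the first model call, rather than a failed request partway into a run.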

Installation

npm install
npm start
