wylab/nanobot

Fork 0

Files

T

code-server 8cb5d93005

Build Nanobot OAuth / cleanup (push) Has been cancelled

Details

Build Nanobot OAuth / build (push) Has been cancelled

Details

docs: add PR testing workflow guide

Comprehensive guide for using the staging environment:
- Quick start with test-pr.sh script
- Manual testing methods
- Cache verification procedures
- Session management
- Troubleshooting tips

Includes examples for multi-turn testing and cache validation.

2026-03-09 18:07:57 +01:00

5.8 KiB

Raw Blame History

PR Testing Workflow

Guide for testing Pull Requests using the local staging environment.

Quick Start

./test-pr.sh <pr-number> "test message"

Staging Environment

Location: /config/workspace/.nanobot-staging/

Components:

config.json — Staging configuration (channels disabled, shared OAuth)
workspace/ — Isolated workspace for tool operations
workspace/sessions/ — Session storage (separate from production)

Key differences from production:

No external channels (Telegram disabled)
Uses NANOBOT_CONFIG environment variable
Gateway runs on localhost:18791 (vs production's 18790)
restrictToWorkspace: true for safety

Testing a PR

Method 1: Helper Script (Recommended)

# Test PR with default message
./test-pr.sh 31

# Test with custom message
./test-pr.sh 31 "test the hidden message feature"

What it does:

Fetches PR from wylab remote (force updates if branch exists)
Checks out PR branch locally
Installs in editable mode with uv pip install -e .
Runs test with staging config via NANOBOT_CONFIG env var
Leaves branch checked out for further testing

After testing:

git checkout main  # Return to main branch

Method 2: Manual Testing

# 1. Fetch and checkout PR
cd /config/workspace/nanobot-oauth-port/nanobot-fork
git fetch wylab pull/<N>/head:pr-<N>
git checkout pr-<N>

# 2. Install in editable mode
uv pip install -e .

# 3. Test with staging config
NANOBOT_CONFIG=/config/workspace/.nanobot-staging/config.json \
  .venv/bin/nanobot agent -m "test message"

# 4. For multi-turn testing
NANOBOT_CONFIG=/config/workspace/.nanobot-staging/config.json \
  .venv/bin/nanobot agent  # Interactive mode

# 5. Return to main
git checkout main

Method 3: Gateway Validation

Test that gateway starts without errors:

NANOBOT_CONFIG=/config/workspace/.nanobot-staging/config.json \
  .venv/bin/nanobot gateway

# Kill with Ctrl+C when validated

Verifying Cache Behavior

To verify prompt caching works correctly (important for performance):

# Enable logs to see cache metrics
NANOBOT_CONFIG=/config/workspace/.nanobot-staging/config.json \
  .venv/bin/nanobot agent --logs -m "Turn 1: list files"

# Look for cache metrics in output:
# - cache_write: New cache entries created
# - cache_read: Tokens read from cache

What to look for:

Turn 1: High cache_write, moderate cache_read
Turn 2+: Low cache_write, high cache_read (reusing cache)
cache_read should increase across turns as context grows

Example healthy pattern:

Turn 1: cache_write=354 cache_read=3563
Turn 2: cache_write=255 cache_read=3917  ← Same as Turn 1 end
Turn 3: cache_write=113 cache_read=4172  ← Growing with context

Session Management

Clear session for fresh test

rm -f /config/workspace/.nanobot-staging/workspace/sessions/cli_direct.jsonl

View session contents

cat /config/workspace/.nanobot-staging/workspace/sessions/cli_direct.jsonl | jq

Check for specific features (e.g., hidden signatures)

cat /config/workspace/.nanobot-staging/workspace/sessions/cli_direct.jsonl | grep "_hidden_sig"

Common Testing Scenarios

Test tool execution

./test-pr.sh 31 "List all Python files in the current directory"

Test multi-turn conversation

NANOBOT_CONFIG=/config/workspace/.nanobot-staging/config.json \
  .venv/bin/nanobot agent

# Then interact naturally:
> list files in current directory
> how many python files are there?
> what's the total size?

Test error handling

./test-pr.sh 31 "Try to read a file that doesn't exist: /nonexistent.txt"

Test with thinking mode

The staging config has thinking_budget: 10000 enabled by default, so all tests use extended thinking.

Troubleshooting

"No API key configured" error

Cause: NANOBOT_CONFIG env var not set
Fix: Ensure you're using NANOBOT_CONFIG=/config/workspace/.nanobot-staging/config.json

"Module not found" after checkout

Cause: Need to reinstall after switching branches
Fix: Run uv pip install -e . after checkout

Changes not applying

Cause: Using cached .pyc files
Fix: Clear pycache: find . -type d -name __pycache__ -exec rm -rf {} + 2>/dev/null || true

Session has stale data

Cause: Previous test left session data
Fix: rm /config/workspace/.nanobot-staging/workspace/sessions/cli_direct.jsonl

Best Practices

Clear session between PR tests to avoid cross-contamination
Test with tool use to trigger agentic behavior (not just simple Q&A)
Check cache metrics for performance-sensitive PRs
Run with --logs to see detailed behavior during development
Return to main after testing to avoid accidental commits on PR branches

Integration with CI/CD

The staging environment is currently manual-only. Future enhancements:

Automated PR testing via Gitea Actions
Cache validation in CI pipeline
Multi-PR parallel testing using git worktrees
Regression test suite against production behavior

File Locations Reference

Path	Purpose
`/config/workspace/nanobot-oauth-port/nanobot-fork/`	Local nanobot repository
`/config/workspace/.nanobot-staging/`	Staging environment root
`/config/workspace/.nanobot-staging/config.json`	Staging configuration
`/config/workspace/.nanobot-staging/workspace/`	Staging workspace
`/config/workspace/.nanobot-staging/workspace/sessions/`	Session storage
`/config/workspace/nanobot-oauth-port/nanobot-fork/test-pr.sh`	Helper script

nanobot README - Main project documentation
CLAUDE.md - Development guide for Claude Code
config/schema.py - Configuration schema

5.8 KiB Raw Blame History