cc agent init

chitalian · chitalian · commit 099a3ed9eb8b · 2025-10-21T18:40:29.000-07:00
diff --git a/cc-agent/.gitignore b/cc-agent/.gitignore
@@ -0,0 +1 @@
+private
diff --git a/cc-agent/README.md b/cc-agent/README.md
@@ -0,0 +1,69 @@
+This is an internal tool for running agents autonomously in a loop in GitHub Codespaces.
+
+# Context
+
+GitHub Codespaces is an amazing product, and we periodically build codespaces with the latest Helicone changes (see ../.devcontainer/\* for more information on how these codespaces are generated).
+
+_The idea_ is to spawn isolated GitHub codespaces that can easily be used for agent loops, where agents can run with `--dangerously-skip-permissions`.
+
+GitHub Codespaces is a great fit because it has a CLI well suited to managing your environments, and codespaces come preconfigured with git creds so you can push changes.
+
+```bash
+gh cs --help
+Connect to and manage codespaces
+
+USAGE
+  gh codespace [flags]
+
+ALIASES
+  cs
+
+AVAILABLE COMMANDS
+  code:        Open a codespace in Visual Studio Code
+  cp:          Copy files between local and remote file systems
+  create:      Create a codespace
+  delete:      Delete codespaces
+  edit:        Edit a codespace
+  jupyter:     Open a codespace in JupyterLab
+  list:        List codespaces
+  logs:        Access codespace logs
+  ports:       List ports in a codespace
+  rebuild:     Rebuild a codespace
+  ssh:         SSH into a codespace
+  stop:        Stop a running codespace
+  view:        View details about a codespace
+
+INHERITED FLAGS
+  --help   Show help for command
+
+LEARN MORE
+  Use `gh <command> <subcommand> --help` for more information about a command.
+  Read the manual at https://cli.github.com/manual
+```
+
+# Getting started
+
+```bash
+# create a codespace with gh cs create (interactive prompts shown below)
+> gh cs create
+? Repository: Helicone/helicone
+  ✓ Codespaces usage for this repository is paid for by chitalian
+? Branch (leave blank for default branch):
+? Choose Machine Type: 4 cores, 16 GB RAM, 32 GB storage (Prebuild ready)
+```
+
+Connect to it and open VS Code with:
+
+```bash
+gh cs code
+# or ssh instead if that's more your thing
+gh cs ssh
+```
+
+Note that this will run the build script, which may take up to 5 minutes. Be patient.
+
+Edit the `src/task.md` file with the task you want the agent to complete.
+
+# Claude code agent in a loop
+
+The idea here is to run
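
Note on `prompt.md`: `run.sh` reads `prompt.md` from the working directory, while this commit adds `src/base_prompt.md` and `src/task.md`, and the step that produces `prompt.md` is not shown. A minimal sketch, assuming the prompt is simply the base prompt followed by the task (the `build_prompt` helper and the output layout are hypothetical, not part of this commit):

```shell
#!/bin/bash

# Hypothetical helper (not in the commit): concatenate a base prompt and a
# task file into the prompt file that run.sh pipes into claude.
# Assumption: prompt.md = base prompt, a blank line, then the task.
build_prompt() {
  local base="$1" task="$2" out="$3"

  # Fail before touching the output if either source file is missing.
  if [ ! -f "$base" ] || [ ! -f "$task" ]; then
    echo "build_prompt: missing source file" >&2
    return 1
  fi

  { cat "$base"; printf '\n'; cat "$task"; } > "$out"
}

# Example (paths taken from the diff, output name taken from run.sh):
#   build_prompt src/base_prompt.md src/task.md prompt.md
```

Keeping the two source files separate means only `src/task.md` changes between runs, which matches the README's instruction to edit that file.
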
diff --git a/cc-agent/run.sh b/cc-agent/run.sh
@@ -0,0 +1,66 @@
+#!/bin/bash
+
+# Configuration
+SLEEP_TIME_SECONDS=0    # Time to sleep between iterations, in seconds (0 = no sleep; e.g. 60 for 1 minute)
+DONE_FILE="./.agent/DONE.md"  # File to check for completion
+mkdir -p ./.agent
+
+# Run the agent loop until the DONE.md file is created
+
+# Remove the DONE file if it exists from a previous run
+if [ -f "$DONE_FILE" ]; then
+  rm "$DONE_FILE"
+  echo "Removed existing $DONE_FILE from previous run"
+fi
+
+ITERATION=0
+START_TIME=$(date +%s)
+
+echo "========================================"
+echo "Starting Admin Wallet Testing Loop"
+echo "Start time: $(date)"
+echo "Will run until $DONE_FILE is created"
+echo "========================================"
+echo ""
+
+while [ ! -f "$DONE_FILE" ]; do
+  ITERATION=$((ITERATION + 1))
+
+  echo "========================================"
+  echo "Iteration #$ITERATION at $(date)"
+  echo "========================================"
+
+  # Run Claude Code with the prompt
+  if [ "$ITERATION" -eq 1 ]; then
+    # First iteration: start fresh without --continue
+    claude -p --dangerously-skip-permissions < prompt.md
+  else
+    # Subsequent iterations: resume the previous conversation with --continue
+    claude -p --dangerously-skip-permissions --continue < prompt.md
+  fi
+
+  # Check if DONE file was created
+  if [ -f "$DONE_FILE" ]; then
+    echo ""
+    echo "DONE file detected. Exiting loop."
+    break
+  fi
+
+  # Sleep before next iteration
+  if [ "$SLEEP_TIME_SECONDS" -gt 0 ]; then
+    echo ""
+    echo "Sleeping for $SLEEP_TIME_SECONDS seconds..."
+    sleep "$SLEEP_TIME_SECONDS"
+    CURRENT_TIME=$(date +%s)
+    ELAPSED=$((CURRENT_TIME - START_TIME))
+    echo "Elapsed time: $((ELAPSED / 60)) minutes"
+    echo ""
+  fi
+done
+
+echo ""
+echo "========================================"
+echo "Testing Loop Complete"
+echo "End time: $(date)"
+echo "Total iterations: $ITERATION"
+echo "========================================"
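
The loop above has no upper bound: it stops only when the agent writes the DONE file. A minimal sketch of the same sentinel-file pattern with an iteration cap, assuming a cap is wanted (`run_until_done`, `ITERATIONS_RUN`, and the cap value are hypothetical and not part of this commit):

```shell
#!/bin/bash

# Hypothetical helper (not in the commit): run a command repeatedly until a
# sentinel file exists or an iteration cap is reached.
# Sets ITERATIONS_RUN to the number of runs performed.
# Returns 0 if the sentinel file exists at the end, 1 otherwise.
run_until_done() {
  local done_file="$1" max_iterations="$2"
  shift 2

  ITERATIONS_RUN=0
  while [ ! -f "$done_file" ] && [ "$ITERATIONS_RUN" -lt "$max_iterations" ]; do
    ITERATIONS_RUN=$((ITERATIONS_RUN + 1))
    # Assumption: a failed run should not abort the loop, so its exit
    # status is ignored and the next iteration retries.
    "$@" || true
  done

  [ -f "$done_file" ]
}

# Example (hypothetical wiring; the claude flags are the ones used in run.sh):
#   run_until_done ./.agent/DONE.md 50 \
#     bash -c 'claude -p --dangerously-skip-permissions --continue < prompt.md'
```

The return status lets a caller tell "finished" apart from "gave up after the cap", which the unbounded loop never needs to do.
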
diff --git a/cc-agent/src/base_prompt.md b/cc-agent/src/base_prompt.md
@@ -0,0 +1,23 @@
+**Environment Note:** The development environment is already configured. All environment variables and dependencies are set up.
+
+## Prerequisites - Start Required Services
+
+Before running the tests, ensure these services are running in the background:
+
+1. **Workers** - Run in background: `./run_all_workers.sh`
+2. **Jawn (Backend API)** - Run in background from `/valhalla/jawn`: `yarn dev`
+3. **Web (Frontend)** - Run in background from `/web`: `yarn dev:local -p 3000`
+
+**IMPORTANT: Use the Playwright MCP tools to automate browser interactions. You have access to:**
+
+Here are keys you can use to send test requests to the models through the worker so you can visualize them in Helicone
+
+Also, please make sure we are testing this by sending a request to OpenAI and reproducing the issue that we can see in Playwright. Take screenshots for proof.
+
+You will run in a loop for the next few hours. Good luck!
+
+If everything is working and you were able to test it manually, please create a doc in the scratchpad named "./.agent/DONE.md" with proof that everything works, that you were able to reproduce the error, and that the error is now fixed, along with a summary of the fixes and confirmation that all the tests and builds are working.
+
+^ It's okay if you did not finish, we will re-run you in a min. Only write the DONE.md if you are 1000000% done
+
+## MAKE SURE YOU ARE 100% done, if not DO NOT write the done file... really make sure...
diff --git a/cc-agent/src/task.md b/cc-agent/src/task.md
@@ -0,0 +1,13 @@
+# Task to complete....
+
+Can you run `act workflow_dispatch -W .github/workflows/e2e-test-suite.yml -j e2e-tests`
+
+## and please get it to work...
+
+You will run in a loop for the next 3 hours. Good luck!
+
+If everything is working and you were able to test it manually, please create a doc in the scratchpad named "./.agent/DONE.md" with proof that everything works, that you were able to reproduce the error, and that the error is now fixed, along with a summary of the fixes and confirmation that all the tests and builds are working.
+
+^ It's okay if you did not finish, we will re-run you in a min. Only write the DONE.md if you are 1000000% done
+
+## MAKE SURE YOU ARE 100% done, if not DO NOT write the done file... really make sure...