Add CodeAct module #8222

TomeHirata · 2025-05-15T13:30:07Z

This PR add a new module called CodeAct, which adopts the CodeAct architecture to manipulate the Python interpreter with predefined tools.

from dspy.predict import CodeAct
def factorial(n):
    if n == 1:
        return 1
    return n * factorial(n-1)

act = CodeAct("n->factorial", tools=[factorial])
act(n=5) # 120

…eat/codeact/init

dspy/predict/code_act.py

okhat · 2025-05-17T16:46:11Z

dspy/predict/react.py

@@ -47,6 +47,7 @@ def __init__(self, signature, tools: list[Callable], max_iters=5):

        for idx, tool in enumerate(tools.values()):
            instr.append(f"({idx + 1}) {tool}")
+        instr.append("You should pass the tool argument in JSON format")


Maybe revise to "When providing next_tool_args, the value inside the field must be in JSON format".

Also I'm confused about line 37, which has

"When selecting the next_tool_name and its next_tool_args, the tool must be one of:\n",

I thought that was removed? Am I thinking of a different PR?

Line 37 will be removed in #8190 👍

dspy/predict/code_act.py

chenmoneygithub · 2025-05-19T18:29:04Z

dspy/predict/code_act.py

+        Example:
+
+            ```python
+            from dspy.predict import CodeAct


Can we follow up with a code tutorial demonstrating a use case of CodeAct?

Yup, I will file a follow up PR for documentation

chenmoneygithub · 2025-05-19T18:31:09Z

dspy/predict/code_act.py

+        if any(
+            not inspect.isfunction(tool.func) for tool in tools
+        ):
+            raise ValueError("CodeAct only accepts functions and not callable objects.")


curious - why do we have this constraint?

Since it is hard to define the callable object in the python interpreter. Unlike functions, we need to define class and what parameters are passed to the object initialization, which is not visible.

dspy/predict/code_act.py

chenmoneygithub

Awesome work!

chenmoneygithub · 2025-05-19T18:44:55Z

dspy/predict/code_act.py

+            f"Your goal is to generate executable Python code that collects any necessary information for producing {outputs}.\n"
+            "For each iteration, you will generate a code snippet that either solves the task or progresses towards the solution.\n"
+            "Ensure any output you wish to extract from the code is printed to the console. The code should be enclosed in a fenced code block.\n"
+            f"When all information for producing the outputs ({outputs}) are available to be extracted, mark `finished=True` besides the final Python code.\n"


should that be "besides the final Python code" or finished=True and generated_code is not None be mutual exclusive?

Since we have an extractor at last, I designed it in a way that the code passed with finished=True is executed to minimize the interaction count.

chenmoneygithub · 2025-05-19T18:45:58Z

dspy/predict/code_act.py

+    def forward(self, **kwargs):
+        # Define the tool funcitons in the interpreter
+        for tool in self.tools.values():
+            self.interpreter(inspect.getsource(tool.func))


shall we move it to init, and only run this line for call-time tools? (the new change in ReAct)

Since python interpreter is shutdown every forward, this needs to happen in forward too.

chenmoneygithub · 2025-05-19T18:50:14Z

tests/predict/test_code_act.py

+        return a + b
+
+@skip_if_deno_not_available
+def test_codeact_tool_validation():


We may also want to test the input flow - use a mock litellm.completion to capture the prompt, and validate our tool + code information are correctly included in the prompt.

chenmoneygithub

Code looks good to me! I will need to play with this new Module a bit today or tomorrow.

chenmoneygithub · 2025-05-24T23:27:10Z

I encountered an error when the function arg is of pydantic type:

import dspy

lm_4o_mini = dspy.LM("openai/gpt-4o-mini")
dspy.configure(
    lm=lm_4o_mini,
)


from pydantic import BaseModel

from dspy.datasets import DataLoader

kwargs = dict(fields=("claim", "supporting_facts", "hpqa_id", "num_hops"), input_keys=("claim",))
hover = DataLoader().from_huggingface(dataset_name="hover-nlp/hover", split="train", trust_remote_code=True, **kwargs)

hpqa_ids = set()
filtered_hover = []
for x in hover:
    if x["num_hops"] == 3 and x["hpqa_id"] not in hpqa_ids:
        hpqa_ids.add(x["hpqa_id"])
        filtered_hover.append(
            dspy.Example(claim=x.claim, titles=list(set([y["key"] for y in x.supporting_facts]))).with_inputs("claim")
        )
hover = filtered_hover

trainset, devset, testset = hover[:100], hover[100:150], hover[650:]

example = trainset[0]

print("Claim:", example.claim)
print("Pages that must be retrieved:", example.titles)

DOCS = {}


class SearchInput(BaseModel):
    query: str


def search(query: str, k: int) -> list[str]:
    results = dspy.ColBERTv2(url="http://20.102.90.50:2017/wiki17_abstracts")(query, k=k)
    results = [x["text"] for x in results]

    for result in results:
        title, text = result.split(" | ", 1)
        DOCS[title] = text

    return results


def search_wikipedia(query: SearchInput) -> list[str]:
    """Returns top-5 results and then the titles of the top-5 to top-30 results."""

    topK = search(query.query, 30)
    titles, topK = [f"`{x.split(' | ')[0]}`" for x in topK[5:30]], topK[:5]
    return topK + [f"Other retrieved pages have titles: {', '.join(titles)}."]


def lookup_wikipedia(title: str) -> str:
    """Returns the text of the Wikipedia page, if it exists."""

    if title in DOCS:
        return DOCS[title]

    results = [x for x in search(title, 10) if x.startswith(title + " | ")]
    if not results:
        return f"No Wikipedia page found for title: {title}"
    return results[0]


instructions = "Find all Wikipedia titles relevant to verifying (or refuting) the claim."
signature = dspy.Signature("claim -> titles: list[str]", instructions)

tools = [dspy.Tool(search_wikipedia), dspy.Tool(lookup_wikipedia)]

codeact = dspy.CodeAct(signature, tools=tools, max_iters=20)

output = codeact(claim="David Gregory was born in 1625.")

Error:

dspy.primitives.python_interpreter.InterpreterError: NameError: ["name 'SearchInput' is not defined"]

Looks like we also need to register the custom types to the interpreter.

I tried to change query: SearchInput to query: str, but doesn't work either with the following error:

[[ ## observation_18 ## ]]
Failed to execute the generated code: NameError: ["name 'search' is not defined"]

search is a helper function called inside search_wikipedia(query: str), and the registration misses it.

TomeHirata · 2025-05-26T07:42:58Z

@chenmoneygithub Thanks for trying this out. That is a known limitation where Tool functions can only depend on built-in naming. In other words, tools should be self-contained. We could re-register non-tool variables in locals(), but I worry that this causes unnecessary latency or failures of the re-registration due to external packages or definition ordering. Since LLM can generate code rather than json to pass args to the tools, I don't think this is the blocking limitation. The langgraph-codeact also has this limitation.

add CodeAct

f62bc60

TomeHirata requested review from chenmoneygithub and okhat May 15, 2025 13:30

fix type

5ab5d99

TomeHirata force-pushed the feat/codeact/init branch from a6ef78b to 5ab5d99 Compare May 15, 2025 13:32

TomeHirata added 5 commits May 16, 2025 13:08

change arg format

4b6ae68

fix test

be7b428

Merge branch 'main' into feat/codeact/init

a6cc1d0

remove unused f string

0aa7319

Merge branch 'feat/codeact/init' of github.com:TomeHirata/dspy into f…

2ad8696

…eat/codeact/init

okhat reviewed May 17, 2025

View reviewed changes

dspy/predict/code_act.py Outdated Show resolved Hide resolved

okhat reviewed May 17, 2025

View reviewed changes

address comments

7288cb4

chenmoneygithub reviewed May 19, 2025

View reviewed changes

address comments

192d0f2

chenmoneygithub reviewed May 21, 2025

View reviewed changes

Merge branch 'main' into feat/codeact/init

38d7e8e

chenmoneygithub merged commit 4ea5041 into stanfordnlp:main May 29, 2025
3 checks passed

Add CodeAct module #8222

Add CodeAct module #8222

Conversation

TomeHirata commented May 15, 2025

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

TomeHirata May 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

chenmoneygithub left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

chenmoneygithub left a comment

Choose a reason for hiding this comment

Uh oh!

chenmoneygithub commented May 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

TomeHirata commented May 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

TomeHirata May 19, 2025 •

edited

Loading

chenmoneygithub commented May 24, 2025 •

edited

Loading

TomeHirata commented May 26, 2025 •

edited

Loading