🧠 OuroTrace

OuroTrace is a framework for evaluating Chain-of-Thought (CoT) reasoning in Ouroboros (UT) models. It provides tools for structured prompting, dataset creation, experiment management, and result analysis.

Key Features

Flexible Experimentation: Load experiment configs from JSON, run single or batch experiments, and analyze results.
Dataset Utilities: Generate and preprocess datasets for tasks like N-ary Addition, P-hop Induction, Symbolic i-GSM, and more.
Evaluation: Tools for analyzing and visualizing experiment outcomes.
Environment Setup: Utilities for path configuration and Colab integration.

Quick Start

Import main components:

from src import (
    load_config_from_json, create_test_datasets,
    OuroThinkingExperiment, OuroBatchExperiment,
    analyze_experiment_results
)

🗂️ Modules Overview

config_loader.py: Loads and parses experiment configuration files (JSON).
data_generator.py: Generates and preprocesses datasets for supported reasoning tasks.
model.py: Defines experiment classes and manages CoT evaluation logic.
evaluation_analysis.py: Provides functions for analyzing and visualizing experiment results.
utils.py: Contains helper functions for environment setup and path management.
runner.py: Orchestrates batch experiment execution and result collection.

License

Apache-2.0

Tasks Evaluation

You can run end-to-end using one of the notebooks below to perform evaluation for this model on several tasks including:

Simple reasoning tasks: n-ary addition, p-hop induction and i-GSM problems.
Perplexity calculation: calculate the perplexity which measure the uncertainty of the model of predicting the next token, which has a strong connection to cross entropy loss.
Reasoning primitives: variable assignment in code, math and equation of level 0 and 1 using 5-shot prompting to instruct the model.

Name		Name	Last commit message	Last commit date
Latest commit History 508 Commits
configs		configs
results		results
results_UT_all		results_UT_all
src		src
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
main.py		main.py
ouro_trace.ipynb		ouro_trace.ipynb
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 OuroTrace

Key Features

Quick Start

🗂️ Modules Overview

License

Tasks Evaluation

Notebooks

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🧠 OuroTrace

Key Features

Quick Start

🗂️ Modules Overview

License

Tasks Evaluation

Notebooks

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages