Add support for newer OpenAI models (GPT-5, O1, O3, O4) by cmungall · Pull Request #11 · ai4curation/ai-blame

cmungall · 2026-03-29T00:27:16Z

Summary

This PR extends model detection to support newer OpenAI models including GPT-5, O1, O3, and O4 series, while also improving test reliability and fixing a default model reference.

Key Changes

Model Detection: Updated is_codex_model() function in both extractor.rs and parsers/common.rs to recognize:
- GPT-5 models (gpt-5-*)
- O1 series models (o1-* and o1)
- O3 series models (o3-* and o3)
- O4 series models (o4-* and o4)
Test Reliability: Added git configuration to disable commit signing during tests in git_batch.rs to prevent test failures in environments with GPG signing enabled
Default Model: Updated default model in parsers/codex.rs from gpt-5.2-codex to codex-mini-latest for better accuracy
Documentation: Updated comments to clarify support for "Codex CLI / GitHub Copilot" instead of just "GitHub Copilot"

Implementation Details

The model detection checks use both prefix matching (starts_with()) for versioned models and exact matching for base model names (e.g., o1, o3, o4) to handle both naming conventions used by OpenAI.

https://claude.ai/code/session_01MyDdEXVauCTcrkk6zBfK5J

Recognize o1, o3, o4 and gpt-5 model prefixes used by Codex CLI, and update default fallback model to codex-mini-latest. https://claude.ai/code/session_01MyDdEXVauCTcrkk6zBfK5J

The test_batch_reader_valid_file test was failing because global git commit signing configuration caused commits in the temp test repo to fail. https://claude.ai/code/session_01MyDdEXVauCTcrkk6zBfK5J

claude · 2026-03-29T00:27:36Z

Claude finished @cmungall's task —— View job

Code Review Complete ✅

I've thoroughly reviewed this PR that adds support for newer OpenAI models. Overall, this is a well-structured and safe enhancement that follows the project's coding standards.

✅ Summary of Changes

The PR successfully adds support for:

GPT-5 series: gpt-5-*
O1 series: o1-* and o1
O3 series: o3-* and o3
O4 series: o4-* and o4

Changes were made consistently in both src/extractor.rs:68-81 and src/parsers/common.rs:24-37 with proper dual approach (prefix + exact matching).

✅ Code Quality Assessment

Excellent Practices Observed:

Consistent Implementation: Model detection logic is duplicated correctly in both locations with identical patterns
Comprehensive Coverage: Both prefix matching (starts_with()) and exact matching handle different OpenAI naming conventions
Test Reliability: Git signing disabled in tests prevents GPG-related failures in CI environments
Appropriate Default: Updated default model from gpt-5.2-codex to codex-mini-latest for better accuracy

Follows Repository Standards:

✅ Uses proper Rust formatting and naming conventions
✅ No .unwrap() or panic-prone code in production paths
✅ Maintains existing error handling patterns
✅ Comments updated to reflect "Codex CLI / GitHub Copilot" scope

🔍 Detailed Analysis

Security: ✅ No Concerns

No new attack vectors introduced
Model detection is purely string matching
Git configuration changes are test-only and safe

Performance: ✅ Excellent

starts_with() operations are O(1) for prefixes
Exact equality checks are O(1)
No performance regression from additional model checks

Maintainability: ✅ Good

Model detection centralized in reusable functions
Clear naming patterns make future additions straightforward
Proper abstraction between core logic and parser implementations

Test Coverage: ✅ Adequate

Git test fix ensures reliable CI execution
Existing model detection tests will cover new patterns through current test cases
Consider adding explicit test cases for new model names in future iterations

🎯 Specific Technical Feedback

Model Detection Logic (`src/extractor.rs:68-81`, `src/parsers/common.rs:24-37`)

The dual approach is well-designed:

model.starts_with("gpt-5")        // Catches gpt-5-turbo, gpt-5-preview, etc.
|| model.starts_with("o1-")       // Catches o1-mini, o1-preview, etc.  
|| model == "o1"                  // Catches exact base model names

This handles both OpenAI's versioned models and base model references correctly.

Git Test Configuration (`src/git_batch.rs:155-159`)

The addition of commit.gpgsign=false is exactly the right fix for CI environments where GPG signing would cause test failures. This is a common and necessary pattern for reliable git testing.

Default Model Update (`src/parsers/codex.rs:204`)

Changing from gpt-5.2-codex to codex-mini-latest makes sense as the former appears to be a non-existent model identifier.

🚀 Recommendations

Consider Future-Proofing: The current pattern works well. For future model additions, consider if a regex-based approach might be cleaner, but the current string matching is perfectly adequate and more readable.
Documentation: Consider updating any user-facing documentation to mention the expanded model support, though this may be outside the scope of this PR.
Monitoring: Consider logging when new model names are encountered to help identify future models to add support for.

✅ Approval Recommendation

This PR demonstrates excellent engineering practices:

Safe and conservative changes with no breaking modifications
Consistent implementation across the codebase
Proper testing considerations with the git signing fix
Clear intent and scope focused solely on model support expansion

The changes are well-implemented, follow project conventions, and pose no security or performance risks. Recommended for approval and merge.

Copilot

Pull request overview

This PR updates the Rust parsing/extraction logic to recognize newer OpenAI model naming schemes (GPT-5 + O-series) when detecting Codex/Copilot traces, and tweaks related defaults/test setup to reduce misclassification and CI flakiness.

Changes:

Extend is_codex_model() matching to include gpt-5*, o1*, o3*, and o4* (plus bare o1/o3/o4).
Update the Codex CLI session parser default model string to codex-mini-latest.
In git-related tests, disable commit signing to avoid failures when GPG signing is enabled by default.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.

File	Description
`src/parsers/common.rs`	Expands shared Codex/OpenAI model detection to include GPT-5 and O-series names.
`src/extractor.rs`	Expands extractor-side Codex/OpenAI model detection used for attribution/format inference.
`src/parsers/codex.rs`	Adjusts the default model string used when parsing Codex CLI sessions.
`src/git_batch.rs`	Updates test repo initialization to disable commit signing during tests.

Copilot · 2026-03-29T00:30:23Z

+        || model.starts_with("gpt-5")
+        || model.starts_with("o1-")
+        || model.starts_with("o3-")
+        || model.starts_with("o4-")
+        || model == "o1"


New model-prefix support was added here (gpt-5*, o1*, o3*, o4*), but the existing tests only exercise gpt-4/codex-* model strings. Please add/extend tests (e.g., in tests/extractor_test.rs) to assert that these newer model names are recognized and correctly attributed as Codex/Copilot traces.

Copilot · 2026-03-29T00:30:24Z

+        // Disable commit signing for tests
+        Command::new("git")
+            .args(["config", "commit.gpgsign", "false"])
+            .current_dir(&repo_path)
+            .output()?;


The new git config commit.gpgsign false call ignores the command result. Please check the exit status (and surface stderr) so test failures are easier to diagnose if git isn't available or the config write fails (same applies to the other git config/init calls in this helper).

Copilot · 2026-03-29T00:30:24Z

+    // Check for known OpenAI Codex and GPT model prefixes used by Codex CLI / GitHub Copilot
    model.starts_with("codex-")
        || model.starts_with("gpt-4")
        || model.starts_with("gpt-3.5")
        || model.starts_with("gpt-35")
+        || model.starts_with("gpt-5")
+        || model.starts_with("o1-")
+        || model.starts_with("o3-")
+        || model.starts_with("o4-")
+        || model == "o1"
+        || model == "o3"
+        || model == "o4"
 }



is_codex_model is duplicated here and in src/parsers/common.rs. Since extractor.rs already depends on crate::parsers (e.g., ParserRegistry), consider calling crate::parsers::common::is_codex_model(model) instead and removing this local helper to avoid the two implementations drifting over time.

Suggested change

// Check for known OpenAI Codex and GPT model prefixes used by Codex CLI / GitHub Copilot

model.starts_with("codex-")

|| model.starts_with("gpt-4")

|| model.starts_with("gpt-3.5")

|| model.starts_with("gpt-35")

|| model.starts_with("gpt-5")

|| model.starts_with("o1-")

|| model.starts_with("o3-")

|| model.starts_with("o4-")

|| model == "o1"

|| model == "o3"

|| model == "o4"

}

// Delegate to the shared helper to avoid duplicated model-prefix logic.

crate::parsers::common::is_codex_model(model)

}

Inspired by git-ai's /ask skill. Users can select code and type /ask to find the original AI conversation that produced it, with the agent answering in first person about its decisions and intent. https://claude.ai/code/session_01MyDdEXVauCTcrkk6zBfK5J

claude · 2026-04-12T18:39:21Z

Claude finished @cmungall's task —— View job

Comprehensive PR Review ✅

I've thoroughly reviewed this PR that adds support for newer OpenAI models (GPT-5, O1, O3, O4) to the ai-blame codebase. Overall, this is a well-implemented enhancement with good attention to quality and consistency.

Review Tasks Completed:

Read and analyze changed files
Review model detection implementation
Assess code quality and best practices
Check for potential bugs and issues
Evaluate performance considerations
Review security aspects
Analyze test coverage
Provide comprehensive feedback

✅ Summary of Changes

The PR successfully extends model detection to support:

GPT-5 series: gpt-5-* patterns
O1 series: o1-* and o1 (exact match)
O3 series: o3-* and o3 (exact match)
O4 series: o4-* and o4 (exact match)

Additional improvements:

Updated default model from gpt-5.2-codex to codex-mini-latest
Fixed git test reliability by disabling commit signing
Added /ask skill for querying AI code authorship

🔍 Code Quality Assessment

Strengths:

Consistent Implementation: Model detection logic is properly duplicated in both src/extractor.rs:68-81 and src/parsers/common.rs:24-37 with identical patterns
Smart Detection Strategy: Uses both prefix matching (starts_with()) and exact matching to handle different OpenAI naming conventions appropriately
Test Infrastructure: Git signing disabled in tests (src/git_batch.rs:155-159) prevents GPG-related CI failures
Appropriate Default: Changed default model to codex-mini-latest which is more accurate than the non-existent gpt-5.2-codex
Documentation Updates: Comments properly reflect "Codex CLI / GitHub Copilot" scope

Repository Standards Compliance:

✅ Follows Rust formatting and naming conventions
✅ No .unwrap() or panic-prone code in production paths
✅ Maintains existing error handling patterns
✅ Uses crate-relative imports appropriately
✅ No global mutable state introduced

🚨 Issues and Recommendations

Critical - Code Duplication (Addresses Copilot Review)

Problem: is_codex_model() function exists in both src/extractor.rs:68-81 and src/parsers/common.rs:24-37

Impact: The two implementations can drift apart over time, leading to inconsistent model detection

Recommendation: Remove the duplicate in extractor.rs and call the shared version:

// In src/extractor.rs:68-81, replace the function with:
fn is_codex_model(model: &str) -> bool {
    crate::parsers::common::is_codex_model(model)
}

This follows the DRY principle and ensures a single source of truth for model detection logic.

Important - Missing Test Coverage (Addresses Copilot Review)

Problem: Existing tests only cover gpt-4/codex-* models. New model patterns (GPT-5, O1, O3, O4) are not tested.

Recommendation: Add test cases in tests/extractor_test.rs (or similar) to verify:

#[test]
fn test_new_model_detection() {
    assert!(is_codex_model("gpt-5-turbo"));
    assert!(is_codex_model("o1"));
    assert!(is_codex_model("o1-preview"));
    assert!(is_codex_model("o3-mini"));
    assert!(is_codex_model("o4"));
}

Minor - Git Command Error Handling (Addresses Copilot Review)

Issue: In src/git_batch.rs:156-159, git config commands don't check exit status

Current:

Command::new("git")
    .args(["config", "commit.gpgsign", "false"])
    .current_dir(&repo_path)
    .output()?;  // Only checks if command runs, not if it succeeds

Recommendation: Check command status to surface errors:

let output = Command::new("git")
    .args(["config", "commit.gpgsign", "false"]) 
    .current_dir(&repo_path)
    .output()?;
    
if !output.status.success() {
    eprintln!("Warning: Failed to disable git signing: {}", 
              String::from_utf8_lossy(&output.stderr));
}

🔒 Security Review

✅ No Security Concerns:

Model detection is purely string matching with no external inputs
Git configuration changes are test-only and safe
No new attack vectors introduced
Path validation maintained in existing git operations

🚀 Performance Analysis

✅ Excellent Performance:

starts_with() operations are O(1) for short prefixes
Exact equality checks are O(1)
Additional model checks add negligible overhead
No performance regression anticipated

📋 Detailed Technical Feedback

Model Detection Logic (`src/extractor.rs:68-81`, `src/parsers/common.rs:24-37`)

The dual approach handles OpenAI's naming patterns well:

model.starts_with("gpt-5")    // gpt-5-turbo, gpt-5-preview, etc.
|| model.starts_with("o1-")   // o1-mini, o1-preview, etc.
|| model == "o1"              // Base model name

Git Test Configuration (`src/git_batch.rs:155-159`)

The addition of commit.gpgsign=false correctly fixes CI environments where GPG signing would cause test failures.

Default Model Update (`src/parsers/codex.rs:204`)

Changing to codex-mini-latest makes sense - gpt-5.2-codex appears to be non-existent.

Ask Skill Addition (`.claude/skills/ask.md`)

Well-designed skill for querying AI code authorship context. Properly scoped subagent with appropriate tool restrictions.

🎯 Recommendations Summary

Must Fix:

Eliminate code duplication - consolidate is_codex_model() functions
Add test coverage for new model patterns

Should Fix:

Improve git command error handling in test setup

Future Considerations:

Consider logging when unknown model names are encountered
Document expanded model support in user-facing materials

✅ Overall Assessment

This PR demonstrates solid engineering practices:

Conservative, safe changes with no breaking modifications
Consistent implementation across the codebase
Proper attention to testing infrastructure
Clear scope focused on model support expansion

The changes are well-implemented, follow project conventions, and pose no security or performance risks. With the code duplication and test coverage issues addressed, this would be an excellent addition to the codebase.

Recommendation: Approve with minor changes requested

claude added 2 commits March 28, 2026 15:06

Add support for latest OpenAI model identifiers in Codex detection

c523003

Recognize o1, o3, o4 and gpt-5 model prefixes used by Codex CLI, and update default fallback model to codex-mini-latest. https://claude.ai/code/session_01MyDdEXVauCTcrkk6zBfK5J

Fix git_batch test failure by disabling commit signing in test repo

702b720

The test_batch_reader_valid_file test was failing because global git commit signing configuration caused commits in the temp test repo to fail. https://claude.ai/code/session_01MyDdEXVauCTcrkk6zBfK5J

Copilot AI review requested due to automatic review settings March 29, 2026 00:27

Copilot started reviewing on behalf of cmungall March 29, 2026 00:27 View session

Copilot AI reviewed Mar 29, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for newer OpenAI models (GPT-5, O1, O3, O4)#11

Add support for newer OpenAI models (GPT-5, O1, O3, O4)#11
cmungall wants to merge 3 commits into
mainfrom
claude/update-claude-codex-af1vB

cmungall commented Mar 29, 2026

Uh oh!

claude Bot commented Mar 29, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 29, 2026

Uh oh!

Copilot AI Mar 29, 2026

Uh oh!

Copilot AI Mar 29, 2026

Uh oh!

claude Bot commented Apr 12, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

cmungall commented Mar 29, 2026

Summary

Key Changes

Implementation Details

Uh oh!

claude Bot commented Mar 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review Complete ✅

✅ Summary of Changes

✅ Code Quality Assessment

Excellent Practices Observed:

Follows Repository Standards:

🔍 Detailed Analysis

Security: ✅ No Concerns

Performance: ✅ Excellent

Maintainability: ✅ Good

Test Coverage: ✅ Adequate

🎯 Specific Technical Feedback

Model Detection Logic (src/extractor.rs:68-81, src/parsers/common.rs:24-37)

Git Test Configuration (src/git_batch.rs:155-159)

Default Model Update (src/parsers/codex.rs:204)

🚀 Recommendations

✅ Approval Recommendation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot commented Apr 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Comprehensive PR Review ✅

✅ Summary of Changes

🔍 Code Quality Assessment

Strengths:

Repository Standards Compliance:

🚨 Issues and Recommendations

Critical - Code Duplication (Addresses Copilot Review)

Important - Missing Test Coverage (Addresses Copilot Review)

Minor - Git Command Error Handling (Addresses Copilot Review)

🔒 Security Review

🚀 Performance Analysis

📋 Detailed Technical Feedback

Model Detection Logic (src/extractor.rs:68-81, src/parsers/common.rs:24-37)

Git Test Configuration (src/git_batch.rs:155-159)

Default Model Update (src/parsers/codex.rs:204)

Ask Skill Addition (.claude/skills/ask.md)

🎯 Recommendations Summary

Must Fix:

Should Fix:

Future Considerations:

✅ Overall Assessment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

claude Bot commented Mar 29, 2026 •

edited

Loading

Model Detection Logic (`src/extractor.rs:68-81`, `src/parsers/common.rs:24-37`)

Git Test Configuration (`src/git_batch.rs:155-159`)

Default Model Update (`src/parsers/codex.rs:204`)

claude Bot commented Apr 12, 2026 •

edited

Loading

Model Detection Logic (`src/extractor.rs:68-81`, `src/parsers/common.rs:24-37`)

Git Test Configuration (`src/git_batch.rs:155-159`)

Default Model Update (`src/parsers/codex.rs:204`)

Ask Skill Addition (`.claude/skills/ask.md`)