Wrap file contents in XML tags #21

Rolland-He · 2025-07-08T21:06:53Z

helpers/template_utils.py:
=== {filename} === -> <submission file="correct_submission.py">

Example:

Compare the student's code and solution code...

The student's submission file is correct_submission.py.
The instructor's solution file is solution.py.

Files to Reference:
<submission file="correct_submission.py">
(Line 1) def fizzbuzz(n: int) -> list:
...
</submission>

<solution file="solution.py">
(Line 1) def fizzbuzz(n: int) -> list:
...
</solution>

test_output is also included here.

integration_test.py: Update tests.

I have also cleaned the commit history.

for more information, see https://pre-commit.ci

wkukka1

Great job, @Rolland-He! I’ve added a couple of minor suggestions around code style.

wkukka1 · 2025-07-09T13:41:48Z

ai_feedback/helpers/template_utils.py

+            lines = text_content.split('\n')
            for i, line in enumerate(lines, start=1):
-                stripped_line = line.rstrip('\n').rstrip()
+                stripped_line = line.rstrip()


Duplicated logic for wrapping lines with XML tags, both for PDFs and regular text files. Consider refactoring the repeated line formatting into a helper function—e.g., _wrap_lines_with_xml(lines, tag_name, filename)

It was actually part of the original code, I’ve moved it into a helper function already. I’ll work on simplifying the logic further. Thanks for pointing that out!

wkukka1 · 2025-07-09T13:46:34Z

ai_feedback/helpers/template_utils.py


    Args:
-        assignment_files (list[str]): List of file paths to process
+        submission (Path): Student's submission file path


should be Optional[Path] for all these args

…grading-feedback into xml-tag-clean

for more information, see https://pre-commit.ci

…grading-feedback into xml-tag-clean

for more information, see https://pre-commit.ci

wkukka1

Looks Good! @Rolland-He

david-yz-liu · 2025-07-10T02:08:30Z

ai_feedback/helpers/template_utils.py

+    Returns:
+        str: Formatted content with XML tags and line numbers
+    """
+    content = f"<{tag_name} file=\"{filename}\">\n"


Use the attribute name filename instead of file, here and throughout all XML tags that have to do with files

david-yz-liu · 2025-07-10T02:14:44Z

ai_feedback/helpers/template_utils.py

+    content = f"<{tag_name} file=\"{filename}\">\n"
+
+    for i, line in enumerate(lines, start=1):
+        if is_pdf:


Okay looking at this more carefully, I'm realizing that actually for text extracted from PDFs I don't think we need to do the line numbering (which was really to support precise annotations for code). I'm not sure we need to call this function at all when the given text is from a PDF, and instead we can just pass the raw text directly as the file contents to the LLM.

…grading-feedback into xml-tag-clean

Rolland-He and others added 3 commits July 8, 2025 16:13

Copy changes of template_utils from xml-tag branch

b25e216

Copy changes of integeration_tests from xml-tag branch

9852a58

[pre-commit.ci] auto fixes from pre-commit.com hooks

b95fb53

for more information, see https://pre-commit.ci

Rolland-He requested a review from wkukka1 July 8, 2025 21:15

Rolland-He closed this Jul 9, 2025

Rolland-He deleted the xml-tag-clean branch July 9, 2025 12:51

Rolland-He restored the xml-tag-clean branch July 9, 2025 12:51

Rolland-He reopened this Jul 9, 2025

wkukka1 reviewed Jul 9, 2025

View reviewed changes

Rolland-He added 4 commits July 9, 2025 10:06

Make submission optional

654a8ef

Merge branch 'xml-tag-clean' of https://github.com/Rolland-He/ai-auto…

73b482b

…grading-feedback into xml-tag-clean

Make submission optional

c35dbe0

Add helper func to reduce duplicated logic

ce8ba7b

Rolland-He requested a review from wkukka1 July 9, 2025 14:19

pre-commit-ci bot and others added 4 commits July 9, 2025 14:20

[pre-commit.ci] auto fixes from pre-commit.com hooks

2f2a01c

for more information, see https://pre-commit.ci

Fix Optional format

7902ecb

Merge branch 'xml-tag-clean' of https://github.com/Rolland-He/ai-auto…

10e5804

…grading-feedback into xml-tag-clean

[pre-commit.ci] auto fixes from pre-commit.com hooks

a14c3b5

for more information, see https://pre-commit.ci

wkukka1 reviewed Jul 9, 2025

View reviewed changes

Rolland-He requested a review from david-yz-liu July 9, 2025 14:47

david-yz-liu reviewed Jul 10, 2025

View reviewed changes

Rolland-He added 4 commits July 10, 2025 09:32

Update attribute name

c83d556

Update tests accordingly

c654457

Merge branch 'xml-tag-clean' of https://github.com/Rolland-He/ai-auto…

2b8ee04

…grading-feedback into xml-tag-clean

Remove logic of line numbering for pdf

2da9921

Rolland-He requested a review from david-yz-liu July 10, 2025 13:58

david-yz-liu approved these changes Jul 10, 2025

View reviewed changes

david-yz-liu merged commit 67b089f into MarkUsProject:main Jul 10, 2025
2 checks passed

wkukka1 pushed a commit to wkukka1/ai-autograding-feedback that referenced this pull request Aug 29, 2025

Wrap file contents in XML tags for prompts (MarkUsProject#21)

3cf7e6c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Wrap file contents in XML tags #21

Wrap file contents in XML tags #21

Uh oh!

Rolland-He commented Jul 8, 2025

Uh oh!

wkukka1 left a comment

Uh oh!

wkukka1 Jul 9, 2025

Uh oh!

Rolland-He Jul 9, 2025

Uh oh!

wkukka1 Jul 9, 2025

Uh oh!

Rolland-He Jul 9, 2025

Uh oh!

wkukka1 left a comment

Uh oh!

david-yz-liu Jul 10, 2025

Uh oh!

Rolland-He Jul 10, 2025

Uh oh!

david-yz-liu Jul 10, 2025

Uh oh!

Rolland-He Jul 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Wrap file contents in XML tags #21

Wrap file contents in XML tags #21

Uh oh!

Conversation

Rolland-He commented Jul 8, 2025

Uh oh!

wkukka1 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wkukka1 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants