Updated parsing of problem.yaml by using checks as described in the specification #295

Zazmuz · 2025-04-10T10:29:26Z

Added checks for legacy and 2023-07 for parsing problem.yaml, making sure its correct and defaulting values.
Made sure to keep the code itself simple so that eventual changes and upkeep to it will be easy to go through with.
It has been tested on most PO problems from the last 5 years as well as islandic problems.

For the new format it has not been tested as much (due to lack of available configs), only a few edgecases and few self made configs.

If there is anything that looks weird or any questions I will be available.

…m.yaml. Co-authored-by: square-cylinder <[email protected]> Cleaned up and added more checks for problem.yaml Co-authored-by: square-cylinder <[email protected]> detect unknown fields f-strings compatible with older python versions fix bug

made it more obvious if old folder name is used, or folder is misspelled made check error messages clearer for people newer to problemtools closest word functionality bugfix index error

readded comment stating weird it is Name change and pytest fix

…roduced in python 3.12

Merged conflict in hello test

…ystem

gkreitz

I started looking at this PR, and I think it needs relatively large amount of work. I don't understand several of the approach changes from #294 at all (not trying to merge the data into a common structure, putting all code in verifyproblem instead of a separate module, not having any test cases).

I ran out of time while reviewing this, but setting this as request changes to not delay giving you the feedback I had for the part I had time to review.

gkreitz · 2025-04-12T08:36:45Z

problemtools/verifyproblem.py

@@ -262,7 +269,7 @@ def check(self, context: Context) -> bool:
        self.check_size_limits(self.ansfile)
        self._problem.getProblemPart(InputValidators).validate(self)
        anssize = os.path.getsize(self.ansfile) / 1024.0 / 1024.0
-        outputlim = self._problem.get(ProblemConfig)['limits']['output']
+        outputlim = self._problem.get(ProblemConfigLegacy)['limits']['output']


Changing ProblemConfig to ProblemConfigLegacy like this feels dangerous, and makes a reader worry about how this works when reading in a new problem config. I'd advice at the very least signaling some intent here by accessing via ProblemConfigBase for fields that are the same for both formats.

A better solution would be to ensure the config as read by the rest of the program looks the same (has the same keys, and the same types) regardless of what version was used. I believe you did this in #294.

gkreitz · 2025-04-12T08:37:44Z

problemtools/verifyproblem.py

        # Some deprecated properties are inherited from problem config during a transition period
-        problem_grading = problem.get(ProblemConfig)['grading']
+        problem_grading = problem.get(ProblemConfigLegacy)['grading']


How does this work with a new config file, which lacks the grading key? (This question applies to many places where you access keys which are named differently, or have different value types in the two versions.)

gkreitz · 2025-04-12T09:00:45Z

problemtools/verifyproblem.py

+            self.error(f'License needs to be one of: {self._VALID_LICENSES}')
+        if self._data['license'] == 'unknown':
+            self.warning("License is 'unknown'")
+    def fix_config_structure(self):


It looks to me like ProblemConfigBase is an abstract base class, and that this is an abstract method? If so, consider inheriting from ABC and use the decorator @abstractmethod here.

gkreitz · 2025-04-12T09:05:31Z

problemtools/verifyproblem.py

+
+
+    def check(self, context: Context) -> bool:
+        if self._check_res is True:


Why did you rewrite the memoization of _check_res this way? It looks like the memoization is now broken.

gkreitz · 2025-04-12T09:06:37Z

problemtools/verifyproblem.py

+            return self._check_res
+        elif self._check_res is not False:
+            self._check_res = True
+        to_check = [prop for prop in dir(self) if prop.startswith('_check_') and callable(getattr(self, prop))]


I'm not a fan of trying to find all methods with a special name and running them like this. Why was this approach taken? Do you end up with so many check methods that you can't cleanly list them?

gkreitz · 2025-04-12T09:07:03Z

problemtools/verifyproblem.py

+            self._check_res = True
+        to_check = [prop for prop in dir(self) if prop.startswith('_check_') and callable(getattr(self, prop))]
+        for prop in to_check:
+            self.info(f'Checking "{prop}"')


Internal methods called should not be logged at info level.

gkreitz · 2025-04-12T09:14:24Z

problemtools/verifyproblem.py

+        return best
+
+class ProblemConfigLegacy(ProblemConfigBase):
+    DEFAULT_LIMITS = {


These defaults used to be specified in a problem.yaml config file (and user-configurable, as is documented in README.md). Does the fact that you list the defaults here imply that you dropped support for having the defaults configurable via config file (and user-configurable via a config file in the home directory)? If so, it feels like a big step backwards to hardcode these values rather than have them in a config file.

gkreitz · 2025-04-12T09:16:33Z

problemtools/verifyproblem.py

+
+    def fix_config_structure(self):
+        self._data.setdefault('problem_format_version', formatversion.VERSION_LEGACY)
+        if self._data.get('problem_format_version') != formatversion.VERSION_LEGACY:


These checks (checking that something is one of a few allowed values) get very repetitive. It looks like you ened a helper function, something like
check_allowed_values(self, key: str, values: list, error_fn = self.error)

gkreitz · 2025-04-12T09:20:46Z

problemtools/verifyproblem.py

+
+        if 'name' in self._data:
+            val = self._data['name']
+            if type(val) is not str:


Similarly, here, you should have a check_type() function to make the code less repetitive.

Alternatively, both such check functions can probably be implemented using json-schemas.

Zazmuz added 7 commits April 10, 2025 10:36

make statement dir not found error more obvious

ae32854

made it more obvious if old folder name is used, or folder is misspelled made check error messages clearer for people newer to problemtools closest word functionality bugfix index error

Field hints, updated usage of config and readded comment

75707ad

readded comment stating weird it is Name change and pytest fix

removal of possessive quantifiers in regex statement since it was int…

cf4eb8d

…roduced in python 3.12

mypy error fixes and merge

928cfc1

Merged conflict in hello test

Merge remote-tracking branch 'upstream/develop' into new-new-config-s…

8d3829c

…ystem

fixed some CodeFactor issues

9aee251

gkreitz requested changes Apr 12, 2025

View reviewed changes

gkreitz closed this May 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Updated parsing of problem.yaml by using checks as described in the specification #295

Updated parsing of problem.yaml by using checks as described in the specification #295

Uh oh!

Zazmuz commented Apr 10, 2025

Uh oh!

gkreitz left a comment

Uh oh!

gkreitz Apr 12, 2025

Uh oh!

gkreitz Apr 12, 2025

Uh oh!

gkreitz Apr 12, 2025

Uh oh!

gkreitz Apr 12, 2025

Uh oh!

gkreitz Apr 12, 2025

Uh oh!

gkreitz Apr 12, 2025

Uh oh!

gkreitz Apr 12, 2025

Uh oh!

gkreitz Apr 12, 2025

Uh oh!

gkreitz Apr 12, 2025

Uh oh!

Uh oh!



		def check(self, context: Context) -> bool:
		if self._check_res is True:

Updated parsing of problem.yaml by using checks as described in the specification #295

Updated parsing of problem.yaml by using checks as described in the specification #295

Uh oh!

Conversation

Zazmuz commented Apr 10, 2025

Uh oh!

gkreitz left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!