Update Regexp for Ending #588

mscuthbert · 2025-05-23T20:05:14Z

This updates the regular expression for the <ending number="..."> attribute, to allow for multiple spaces after the comma: <ending number="2, 3">

The schema change is finished -- I want to integrate the doctools better into a testing eco-system.

See #580

This updates the regular expression for the `<ending number="...">` attribute, to allow for multiple spaces after the comma: `<ending number="2, 3">` The schema change is finished -- I want to integrate the doctools better into a testing eco-system.

mdgood · 2025-05-23T20:24:12Z

This is wrong. Tokens "have no internal sequences of two or more spaces." See the definition at https://www.w3.org/TR/xmlschema-2/#token.

mdgood

Please close or delete this PR as it violates the definition of an XML Schema token.

mscuthbert · 2025-05-23T22:16:05Z

Hi Michael -- I'll do that, but can you explain why the first half of the regex makes sense then:

([ ]*)|([1-9][0-9]*(, ?[1-9][0-9]*)*)

As I now understand it, if "1, 2" is not a valid token then neither is " " -- as it was the first half is validating against the pre-normalized version and the second half is validating against the normalized version? That does not make sense to me.

If these regex's are going on post normalized, should it not be

()|([1-9][0-9]*(, ?[1-9][0-9]*)*)

mdgood · 2025-05-23T23:27:16Z

That case is to cover the empty string. I don't remember why it was specified that way instead of what you propose - if that wasn't a valid schema regex, if there was a problem with parser software handling it, or I just used the first thing I thought of. It may not be ideal, but at least let's not make it any worse. 😀

mscuthbert · 2025-05-24T00:11:00Z

That case is to cover the empty string. I don't remember why it was specified that way instead of what you propose - if that wasn't a valid schema regex, if there was a problem with parser software handling it, or I just used the first thing I thought of. It may not be ideal, but at least let's not make it any worse. 😀

Agreed -- at least you can see that I wanted to make both sides consistent. We'll go the opposite direction.

But this ends up not being a change requiring a XST since the parser will trim it in 4.0 and in 4.1

mscuthbert · 2025-05-24T00:30:41Z

The docs changes got caught up in the move from Windows line-endings to Unix. Here's the relevant added lines in the docs:

<h3 id="values">Changed Attributes/Values</h3>
<ul>
<li>The <a href="../../musicxmlreference/data-types/ending-number/">ending-number</a> type used
    in the <code>number</code> attribute of the
    <a href="../../musicxml-reference/elements/ending/">&lt;ending&gt;</a>, previously had a regex
    that implied that empty spaces were different from the empty string.  This has been clarified.
</li>
</ul>


<h2 id="documentation">Documentation Changes</h2>
<li>The <a href="../../musicxmlreference/data-types/ending-number/">ending-number</a> type used
    in the <code>number</code> attribute of the
    <a href="../../musicxml-reference/elements/ending/">&lt;ending&gt;</a>, previously had documentation
    implying that empty spaces were different from the empty string.  This has been clarified.
</li>

lemzwerg · 2025-05-24T04:04:09Z

schema/musicxml.xsd

 		</xs:annotation>
 		<xs:restriction base="xs:token">
-			<xs:pattern value="([ ]*)|([1-9][0-9]*(, ?[1-9][0-9]*)*)"/>
+			<xs:pattern value="()|([1-9][0-9]*(, ?[1-9][0-9]*)*)"/>


I still think it makes sense to change ' ?' to '[ ]?' to increase readability.

lemzwerg · 2025-05-24T04:08:06Z

docs/version-history/41/index.html

+<li>The <a href="../../musicxmlreference/data-types/ending-number/">ending-number</a> type used
+    in the <code>number</code> attribute of the
+    <a href="../../musicxml-reference/elements/ending/">&lt;ending&gt;</a>, previously had a regex
+    that implied that empty spaces were different from the empty string.  This has been clarified.


'Empty spaces' sounds like a pleonasm 🙂

I suggest to replace this with 'a sequence of consecutive space characters' or something like that.

mdgood · 2025-06-09T21:21:03Z

I still don't think we should make this change as I don't see it adding value, only risk. It doesn't increase the power of what MusicXML can represent and this hasn't been a significant point of confusion in 17 years. Any change requires testing to make sure it works and documentation to explain the change. The cost isn't worth it.

mscuthbert · 2025-08-04T20:33:42Z

Agreeing with Michael -- not sufficient improvement to make a change that could break some (non-conforming) code out there. Closing.

Update Regexp for Ending

2d468bb

This updates the regular expression for the `<ending number="...">` attribute, to allow for multiple spaces after the comma: `<ending number="2, 3">` The schema change is finished -- I want to integrate the doctools better into a testing eco-system.

mdgood requested changes May 23, 2025

View reviewed changes

Opposite direction, thanks mdgood.

e00f53e

has -> had

075b6d0

mscuthbert mentioned this pull request May 24, 2025

ending-number documentation doesn't fit shown regular expression #580

Open

lemzwerg reviewed May 24, 2025

View reviewed changes

mscuthbert closed this Aug 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update Regexp for Ending #588

Update Regexp for Ending #588

Uh oh!

mscuthbert commented May 23, 2025 •

edited

Loading

Uh oh!

mdgood commented May 23, 2025

Uh oh!

mdgood left a comment

Uh oh!

mscuthbert commented May 23, 2025 •

edited

Loading

Uh oh!

mdgood commented May 23, 2025

Uh oh!

mscuthbert commented May 24, 2025

Uh oh!

mscuthbert commented May 24, 2025

Uh oh!

lemzwerg May 24, 2025

Uh oh!

lemzwerg May 24, 2025

Uh oh!

mdgood commented Jun 9, 2025

Uh oh!

mscuthbert commented Aug 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Update Regexp for Ending #588

Update Regexp for Ending #588

Uh oh!

Conversation

mscuthbert commented May 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mdgood commented May 23, 2025

Uh oh!

mdgood left a comment

Choose a reason for hiding this comment

Uh oh!

mscuthbert commented May 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mdgood commented May 23, 2025

Uh oh!

mscuthbert commented May 24, 2025

Uh oh!

mscuthbert commented May 24, 2025

Uh oh!

lemzwerg May 24, 2025

Choose a reason for hiding this comment

Uh oh!

lemzwerg May 24, 2025

Choose a reason for hiding this comment

Uh oh!

mdgood commented Jun 9, 2025

Uh oh!

mscuthbert commented Aug 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mscuthbert commented May 23, 2025 •

edited

Loading

mscuthbert commented May 23, 2025 •

edited

Loading