Added default replacement words #2097
Open
First I also added this (but removed it again for this PR):
But this also changes the words when using "say...", which is unwanted. With "say...", you normally dictate more than one word, so the surrounding words are likely to give enough context for the correct spelling. "word...", however, is contextless, so it would be useful for it to produce the more common spelling by default. If the user needs another spelling, "phones..." must be used. Are y'all generally open to differentiating between contextless and contextful word replacements? If so, I would open an issue for this.
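To make the proposed distinction concrete, here is a minimal sketch of how it might look. It is not code from this PR or from the repository: the table names, the `replace_words` helper, and the example entries are all hypothetical.

```python
from typing import Dict, List

# Hypothetical tables; the entries are illustrative, not taken from this PR.
# Replacements that are safe in any context (applied for "say..." and "word...").
GENERAL_REPLACEMENTS: Dict[str, str] = {"i": "I"}
# Replacements that only make sense when there is no surrounding context
# (applied for "word..." only): map a rare spelling to the more common one.
CONTEXTLESS_REPLACEMENTS: Dict[str, str] = {"whey": "way", "thyme": "time"}


def replace_words(words: List[str], contextless: bool) -> List[str]:
    """Apply the contextless table only when asked, then the general table."""
    result = []
    for word in words:
        if contextless:
            word = CONTEXTLESS_REPLACEMENTS.get(word, word)
        result.append(GENERAL_REPLACEMENTS.get(word, word))
    return result


# "word whey": no context, so prefer the more common spelling.
single = replace_words(["whey"], contextless=True)
# "say curds and whey": context is trusted, so the recognized word is kept.
phrase = replace_words(["curds", "and", "whey"], contextless=False)
```

With this split, "say..." would keep relying on the recognizer's context, while "word..." would fall back to the more common spelling.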
Regarding American spellings: In my experience, the speech recognition models are already likely to produce American spellings. The list should contain corrections of other spellings to avoid inconsistency. To cater to users who need other spelling systems, an additional replacement layer after the current one would be needed. The lists for this would probably need to be very large (all verbs ending in "ize", and so on) and may be out of scope for this repository.
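As a rough illustration of the "additional replacement layer" idea, the sketch below applies a list of replacement tables in order. The `apply_layers` helper and both tables are hypothetical and tiny. A real spelling-system layer would need far more entries, as noted above.

```python
from typing import Dict, List

# Hypothetical default layer: normalize stray non-American spellings.
DEFAULT_LAYER: Dict[str, str] = {"colour": "color"}
# Hypothetical optional layer for users who want British spellings.
BRITISH_LAYER: Dict[str, str] = {"color": "colour", "recognize": "recognise"}


def apply_layers(words: List[str], layers: List[Dict[str, str]]) -> List[str]:
    """Run every word through each replacement layer, in the given order."""
    for layer in layers:
        words = [layer.get(word, word) for word in words]
    return words


# Default behavior: output is consistently American.
american = apply_layers(["colour", "recognize"], [DEFAULT_LAYER])
# Opt-in second layer: the consistent output is converted afterwards.
british = apply_layers(["colour", "recognize"], [DEFAULT_LAYER, BRITISH_LAYER])
```

Running the optional layer after the default one means it only has to handle one consistent input spelling.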