Adding transliterate feature by zyronix · Pull Request #69 · NetherlandsForensicInstitute/demeuk

zyronix · 2025-07-22T16:44:45Z

No description provided.

Copilot

Pull Request Overview

This pull request adds a transliterate feature to the application, which converts text from one script to another (e.g., Cyrillic to Latin characters). The feature accepts a 2-letter ISO language code and can be combined with the --non-ascii option for further character conversion.

Adds transliterate functionality with language code parameter
Includes test coverage for Serbian transliteration
Documents the new feature with usage examples
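To make the described behaviour concrete, here is a minimal sketch of script-to-script conversion followed by ASCII folding. It is not the code from this pull request: the review comments suggest the feature relies on the `transliterate` package, whereas this sketch uses only the standard library. The mapping table and the function names `transliterate_sr` and `fold_to_ascii` are hypothetical and exist only for illustration.

```python
import unicodedata

# Hypothetical Serbian Cyrillic-to-Latin table, for illustration only.
# This is NOT the table used by demeuk or by the transliterate package.
SR_CYRILLIC_TO_LATIN = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "ђ": "đ",
    "е": "e", "ж": "ž", "з": "z", "и": "i", "ј": "j", "к": "k",
    "л": "l", "љ": "lj", "м": "m", "н": "n", "њ": "nj", "о": "o",
    "п": "p", "р": "r", "с": "s", "т": "t", "ћ": "ć", "у": "u",
    "ф": "f", "х": "h", "ц": "c", "ч": "č", "џ": "dž", "ш": "š",
}


def transliterate_sr(text):
    """Map Serbian Cyrillic letters to Latin; leave other characters as-is."""
    out = []
    for ch in text:
        latin = SR_CYRILLIC_TO_LATIN.get(ch.lower())
        if latin is None:
            out.append(ch)                   # not in the table: keep unchanged
        elif ch.isupper():
            out.append(latin.capitalize())   # preserve an initial capital
        else:
            out.append(latin)
    return "".join(out)


def fold_to_ascii(text):
    """Stand-in for a non-ASCII conversion step: strip combining marks.

    Characters without an ASCII decomposition are dropped.
    """
    decomposed = unicodedata.normalize("NFKD", text)
    return decomposed.encode("ascii", "ignore").decode("ascii")


latin = transliterate_sr("шљива")    # Latin script, still with diacritics
ascii_only = fold_to_ascii(latin)    # diacritics removed in a second step
```

The two steps are kept separate on purpose: transliteration changes the script, while the folding step is an optional second pass, mirroring how the overview describes combining the new option with --non-ascii.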

Reviewed Changes

Copilot reviewed 3 out of 5 changed files in this pull request and generated 2 comments.

File	Description
tests/test_app.py	Adds test case for transliterate feature using Serbian test data
tests/conftest.py	Creates test input file with Cyrillic text for transliteration testing
docs/usage.rst	Documents the new transliterate feature with usage instructions and examples

Comments suppressed due to low confidence (1)

tests/test_app.py:1017

The test should verify that the output file was created successfully and handle potential file I/O errors. Consider adding error handling or using a context manager with proper cleanup.

    with open('testdata/output55') as f:
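One way to act on this suggestion is sketched below. The helper name `read_output` is hypothetical and not part of the demeuk test suite; it only shows the idea of asserting that the output file exists before reading it inside a context manager.

```python
from pathlib import Path


def read_output(path):
    """Hypothetical test helper: fail clearly if the output file is missing."""
    output = Path(path)
    # Assert first, so a missing file produces a readable test failure
    # instead of an unhandled FileNotFoundError.
    assert output.is_file(), f"expected output file {output} was not created"
    # The context manager closes the file even if reading fails.
    with output.open(encoding="utf-8") as f:
        return f.read()
```

A test could then call `read_output('testdata/output55')` and run its content assertions on the returned string.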

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Wineh

A few small points; otherwise approved!

Wineh · 2025-07-23T08:49:46Z

                                        becomes u, ç becomes c.
        --trim                          Enables removing newlines representations from end and beginning. Newline
                                        representations detected are '\\n', '\\r', '\n', '\r', '<br>', and '<br />'.
+        --transliterate <language>      Transliterate a strings, for example "ipsum" becomes "իպսում". Language is iso


Here as well: drop the "s" from "strings".

Wineh · 2025-07-23T09:01:00Z

        --trim                          Enables removing newlines representations from end and beginning. Newline
                                        representations detected are '\\n', '\\r', '\n', '\r', '<br>', and '<br />'.
+        --transliterate <language>      Transliterate a strings, for example "ipsum" becomes "իպսում". Language is iso
+                                        2 letter code. Examples: ru, sr, ua


Perhaps add the full output of the following call here?

    >>> transliterate.get_available_language_codes()
    ['ka', 'sr', 'l1', 'ru', 'mn', 'uk', 'mk', 'el', 'hy', 'bg']

Adding transliterate feature

a280bf5

zyronix requested review from Wineh, Copilot and gochujang-c July 22, 2025 16:44

Copilot AI reviewed Jul 22, 2025

Comment thread docs/usage.rst Outdated

Comment thread docs/usage.rst Outdated

zyronix and others added 2 commits July 22, 2025 18:46

Apply suggestions from code review

9657f92

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

❤️ tox

7110ad4

Wineh approved these changes Jul 23, 2025

Applying feedback

4ea00ac

zyronix merged commit fbea76c into master Jul 24, 2025
4 checks passed

zyronix deleted the feature/transliterate branch July 24, 2025 08:33

zyronix merged 4 commits into master from feature/transliterate


3 participants
