Skip to content

Sentence splitting and lists #16

@dchest

Description

@dchest

In plain text and Markdown it's common to use lists like this:

This is a simple text. It uses lists:
- This is a list item.
- This is a second list item.

Unfortunately, the model doesn't consider such list items as separate sentences. Is there a possibility for improvement here? For example, consider a single line break as an indicator that the sentence could end? I assume there's a collision with some other sentence structure that makes it necessary to consider those as a single sentence?

Sentences:

1. This is a simple text.
2. It uses lists: - This is a list item. - This is a second list item.
Screenshot 2024-06-17 at 11 36 27

Two line breaks, however, make it work:

This is a simple text. It uses lists:

- This is a list item.

- This is a second list item.

Sentences:

1. This is a simple text.
2. It uses lists:
3. - This is a list item.
4. - This is a second list item.
Screenshot 2024-06-17 at 11 36 46

Thank you!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions