Skip to content

Processing xml input files #139

@yashkens

Description

@yashkens

Hi,
I am using ufal.udpipe for Python for processing files encoded in xml format. There is an example part of my input file:

<p>Звери</p>

<p>***</p>
<p>Посадили утку в курятник. Она ходит себе, оглядывается. Потом спрашивает:</p>
<p>-- Господа, а где здесь пруд?</p>
<p>А сверху, с насеста:</p>
<p>-- Мадам, здесь не прут, здесь топчут!</p>

Is it possible to store xml tags in SpaceAfter and SpaceBefore fields? The manual mentions:

Note that in theory not only spaces, but also other original content can be saved in this way (for example XML tags if the input was encoded in a XML file).

Unfortunately, I couldn't find any directions on how to do that. Please, can anyone give any suggestions?

Thanks in advance!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions