Skip to content

[API Proposal]: Get/Enumerate line spans from TextReader #114987

Open
@pawchen

Description

@pawchen

Background and motivation

When parsing a file line by line, and there's no need to keep the whole line as a string, span based APIs will be helpful for reducing unnecessary allocations up to the file size, as we already have span based Regex APIs.

API Proposal

namespace System.IO;

public class TextReader
{
    public ReadOnlySpan<char> ReadLineAsSpan();
    public TextReaderLineEnumerator EnumerateLines();
}

public ref struct TextReaderLineEnumerator
{
    // Similar to SpanLineEnumerator
}

API Usage

ReadLineAsSpan:

using var reader = new StreamReader(file);

while (!reader.EndOfStream)
{
    ReadOnlySpan<char> line = reader.ReadLineAsSpan();   // <----- gets the span

    if (line.TrimStart().StartsWith('['))
    {
        // continue parsing with Regex
        foreach (var match in OpenBlockRegex.EnumerateMatches(line))
        {
        }
    }
        // or ignore the line
}

EnumerateLines:

using var reader = new StreamReader(file);

foreach (ReadOnlySpan<char> line in reader.EnumerateLines())
{
    if (line.TrimStart().StartsWith('['))
    {
        // continue parsing with Regex
        foreach (var match in OpenBlockRegex.EnumerateMatches(line))
        {
        }
    }
        // or ignore the line
}

Alternative Designs

/// <returns> false when no more lines can be read (EndOfStream)</returns>
/// <param name="length"> -1 when the span.Length is less than the line</param>
bool TryReadLine(Span<char> destination, out int length);

Risks

For the span returning proposal, subsequent read will invalidate the span, user should not keep the value before the next call.

For the TryReadLine design, user need to manage the buffer by themselves.

Metadata

Metadata

Assignees

No one assigned

    Labels

    api-suggestionEarly API idea and discussion, it is NOT ready for implementationarea-System.IOuntriagedNew issue has not been triaged by the area owner

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions