Implementation of csvkit for stream merging csv #136

@tillenglert

Description of feature

Currently, some processes, such as MERGE_PREDICTIONS, use a custom concat_tsv.py script to merge many CSV/TSV files in a streamed fashion. This is needed because prediction tables tend to get very large and several hundred files are concatenated at once.

To reduce the maintenance load, we could instead implement an nf-core module for csvkit (see its notes on streaming versus buffering):
https://csvkit.readthedocs.io/en/latest/contributing.html#streaming-versus-buffering
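
For reference, the streamed merging described above can be sketched roughly as follows. This is a minimal illustration, not the actual concat_tsv.py: the function name `concat_tsv_streamed`, the header-mismatch check, and the tab delimiter are assumptions. It reads each input row by row and writes the header only once, so memory use stays flat regardless of how many files are merged. On the csvkit side, `csvstack` is the tool that stacks rows from multiple files; whether it meets the streaming requirement should be checked against the linked page.

```python
import csv
from pathlib import Path
from typing import Iterable


def concat_tsv_streamed(inputs: Iterable[Path], output: Path) -> int:
    """Concatenate TSV files row by row, writing the header only once.

    Hypothetical helper for illustration; returns the number of data rows written.
    """
    rows_written = 0
    header = None
    with open(output, "w", newline="") as out_handle:
        writer = csv.writer(out_handle, delimiter="\t", lineterminator="\n")
        for path in inputs:
            with open(path, newline="") as in_handle:
                reader = csv.reader(in_handle, delimiter="\t")
                file_header = next(reader, None)
                if file_header is None:
                    continue  # empty file: nothing to merge
                if header is None:
                    # The first non-empty file defines the output header.
                    header = file_header
                    writer.writerow(header)
                elif file_header != header:
                    # Assumption: all inputs share identical columns.
                    raise ValueError(f"Header mismatch in {path}")
                # Only one row is held in memory at a time.
                for row in reader:
                    writer.writerow(row)
                    rows_written += 1
    return rows_written
```

Because `inputs` is any iterable, the file list itself can also be produced lazily (for example from a generator over a directory), which matters when hundreds of files are involved.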

Metadata

Labels

enhancement: Improvement for existing functionality
