Skip to content

in2csv: Importing messy data #1289

@hseg

Description

@hseg

I have multiple spreadsheets with a few metadata rows before the actual data. This trips up csvkit's automatic header detection. I'm processing them in bulk, so I can't rely on pre-counting the number of metadata lines for each.
In particular, this means that in2csv's behaviour where it automatically fills in the first line expecting it to be the header is counterproductive here -- I'm forced to invoke it as in2csv -H | tail -n+2.
Can in2csv be given an --ignore-missing-headers option or somesuch? I'll be piping it into xan or miller for further cleanup, so I don't need a cleverer handling method than that.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions