When running ‘deduplicate’ on a file that contains a sequence_id column, it would be nice if the first-occurring sequence identifier of each set of duplicates were retained, instead of the column being removed. Maybe also add a general warning stating that only the first ID is retained when such an input file is supplied.
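
To make the request concrete, here is a minimal sketch of the behaviour I have in mind. It is not the tool's actual implementation: the function name `deduplicate_keep_first_id`, the list-of-dicts input, and the wording of the warning are all assumptions made for illustration. Only the `sequence_id` column name comes from the case described above.

```python
import warnings


def deduplicate_keep_first_id(rows, id_column="sequence_id"):
    """Drop duplicate rows, ignoring id_column when comparing rows.

    Hypothetical helper: `rows` is a list of dicts, one per input record.
    For each set of duplicates the first row seen is kept, including its
    id_column value, so the column is not removed from the output.
    """
    # Warn once if the identifier column is present in the input.
    if any(id_column in row for row in rows):
        warnings.warn(
            f"Input contains a '{id_column}' column; only the first "
            f"{id_column} of each set of duplicates is retained.",
            stacklevel=2,
        )

    seen = set()
    kept = []
    for row in rows:
        # Build a hashable key from every column except the identifier.
        key = tuple(sorted((k, v) for k, v in row.items() if k != id_column))
        if key not in seen:
            seen.add(key)
            kept.append(row)
    return kept


records = [
    {"sequence_id": "seq1", "sequence": "ACGT"},
    {"sequence_id": "seq2", "sequence": "ACGT"},
    {"sequence_id": "seq3", "sequence": "TTGA"},
]
unique_records = deduplicate_keep_first_id(records)
```

With the sample records above, the rows for seq1 and seq3 are kept and seq2 is dropped as a duplicate of seq1. Input without a sequence_id column is deduplicated the same way, just without the warning.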