Corrected usage in cdhit-utility.c++ #51

Open
ucpete wants to merge 1 commit into weizhongli:master from ucpete:ucpete-usage-correction

@ucpete ucpete commented Aug 1, 2017

The usage text printed by the tools in the CD-HIT suite that take nucleotide sequences as input is incorrect. For example, the usage for cd-hit-est (v4.7) refers to amino acids:

   -c	sequence identity threshold, default 0.9
 	this is the default cd-hit's "global sequence identity" calculated as:
 	number of identical amino acids in alignment
 	divided by the full length of the shorter sequence

There are several small inconsistencies of this kind throughout the usage. I have fixed them in cdhit-utility.c++ so that a nucleotide sequence tool prints 'nucleotide' or 'nt', and a protein sequence tool prints 'amino acid' or 'aa'.

These changes are all cosmetic, but as a user of the tool, I have been confused in the past about which tool I was using, or wanted to use, after reading the usage.
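To illustrate the idea described above, here is a minimal sketch of choosing the residue wording by tool type. It is not the code in cdhit-utility.c++: the function names `residue_label` and `identity_option_help` and the `is_nucleotide_tool` flag are hypothetical, introduced only for this example, and the help text is copied from the cd-hit-est usage quoted above.

```cpp
#include <string>

// Hypothetical helper, not taken from cdhit-utility.c++.
// Returns the residue wording that matches the kind of tool being run.
std::string residue_label(bool is_nucleotide_tool, bool abbreviated = false)
{
    if (is_nucleotide_tool) {
        return abbreviated ? "nt" : "nucleotides";
    }
    return abbreviated ? "aa" : "amino acids";
}

// Hypothetical helper: builds the help text for -c so that the wording
// follows the tool type instead of always saying "amino acids".
std::string identity_option_help(bool is_nucleotide_tool)
{
    return
        "   -c\tsequence identity threshold, default 0.9\n"
        " \tthis is the default cd-hit's \"global sequence identity\" calculated as:\n"
        " \tnumber of identical " + residue_label(is_nucleotide_tool) + " in alignment\n"
        " \tdivided by the full length of the shorter sequence\n";
}
```

With this sketch, `identity_option_help(true)` would read "number of identical nucleotides in alignment", while `identity_option_help(false)` keeps the "amino acids" wording for the protein tools.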

The usage presented in the tools in the CD-HIT suite that take
nucleotide sequences as input is incorrect. For example, when one calls
cd-hit-est (v4.7), there are references to amino acids:

<pre>
   -c	sequence identity threshold, default 0.9
 	this is the default cd-hit's "global sequence identity" calculated as:
 	number of identical <b>amino acids</b> in alignment
 	divided by the full length of the shorter sequence
</pre>

There are several small inconsistencies throughout the usage; I have
fixed them throughout the cdhit-utility.c++ file.