Thanks for your earlier replies. Really enjoying this software. One thing I have noticed is that outputs seem biased towards returning top results for proteins that have longer sequence lengths. It also appears that there is a filter for sequence length available. What solutions/workflows have you found to be most useful for eliminating this bias, i.e. what cutoffs to use, which proteins to eliminate from index, etc. Thanks in advance.