Dear,
In the Supplementary Materials, it is stated:
Next, 5 sequences of length N were randomly chosen from within the region and matched
back to the sequence using the R function matchPattern (max.mismatch=N/3 with.indels=T). For each
set of matches we identified overlapping repeats. If the overlap was less than 10 nucleotides, the overlap
was divided at the midpoint between the repeats. If the overlap was 10 nucleotides or greater, the larger
repeat was kept. The set of non-overlapping matches with the highest number was kept for further
analysis.
Would it be possible to explain more in details how the set of matches obtained with matchPattern was processed.
Thank you in advance for your help!
Dear,
In the Supplementary Materials, it is stated:
Next, 5 sequences of length N were randomly chosen from within the region and matched
back to the sequence using the R function matchPattern (max.mismatch=N/3 with.indels=T). For each
set of matches we identified overlapping repeats. If the overlap was less than 10 nucleotides, the overlap
was divided at the midpoint between the repeats. If the overlap was 10 nucleotides or greater, the larger
repeat was kept. The set of non-overlapping matches with the highest number was kept for further
analysis.
Would it be possible to explain more in details how the set of matches obtained with matchPattern was processed.
Thank you in advance for your help!