sequenceSniffer

flags members of sequences of at least the specified length that appear at least twice in the specified data columns.

reformats specified columns to long format
calculates n-grams starting at the specified minimum sequence length
marks data points that are part of an n-gram that occurs at least twice
increases n-gram length until no more duplicate n-grams are found.
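
the steps above can be sketched as follows. this is a minimal illustration in Python, not the R code in this repository: the function name `flag_repeated_sequences` and the choice to return the covering n-gram length per data point are assumptions, and the input is assumed to be a single column already flattened to a list (the long-format step is skipped).

```python
from collections import Counter


def flag_repeated_sequences(values, min_len):
    """For each position, return the length of the longest repeated
    n-gram that covers it, or 0 if it is in no repeated n-gram.

    Hypothetical helper for illustration only.
    """
    marks = [0] * len(values)
    n = min_len
    while n <= len(values):
        # all n-grams of the current length, in order of their start index
        grams = [tuple(values[i:i + n]) for i in range(len(values) - n + 1)]
        counts = Counter(grams)
        found = False
        for start, gram in enumerate(grams):
            if counts[gram] >= 2:
                found = True
                # longer n-grams overwrite marks left by shorter ones
                for pos in range(start, start + n):
                    marks[pos] = n
        if not found:
            break  # no duplicates at this length, so stop growing n
        n += 1
    return marks


marks = flag_repeated_sequences([1, 2, 3, 9, 1, 2, 3, 7], min_len=3)
print(marks)  # [3, 3, 3, 0, 3, 3, 3, 0]
```

the loop stops at the first length with no duplicates, because a repeated n-gram always contains repeated shorter n-grams, so nothing longer can turn up later.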

please note

the function does not yet check for overlapping sequences within a specific n-gram length, i.e., a sequence "A A B A A B A" will count and mark the n-gram "A A B A" as duplicated, even though its two occurrences share the middle "A".
longer sequences will overwrite shorter sequences that they overlap with, i.e., in the above example, the 3-gram "A B A" will be overwritten by the 4-gram "A A B A".
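
to see why the example behaves this way, it helps to list where each repeated n-gram starts. the sketch below is a Python illustration under the same assumptions as the description above. `repeated_ngram_positions` is a hypothetical helper, not a function from this repository.

```python
def repeated_ngram_positions(values, n):
    """Map each n-gram that occurs at least twice to its start indices.

    Overlapping occurrences are counted, mirroring the behaviour
    described in the note above. Hypothetical helper for illustration.
    """
    positions = {}
    for start in range(len(values) - n + 1):
        positions.setdefault(tuple(values[start:start + n]), []).append(start)
    return {gram: starts for gram, starts in positions.items() if len(starts) >= 2}


seq = "A A B A A B A".split()

# the 4-gram "A A B A" starts at index 0 and index 3; the two
# occurrences cover indices 0-3 and 3-6, so they share index 3
print(repeated_ngram_positions(seq, 4))
# {('A', 'A', 'B', 'A'): [0, 3]}

# the 3-grams "A A B" and "A B A" each occur twice without overlapping
print(repeated_ngram_positions(seq, 3))
# {('A', 'A', 'B'): [0, 3], ('A', 'B', 'A'): [1, 4]}
```

every data point covered by the repeated 3-grams is also covered by one of the two "A A B A" occurrences, which is why the 4-gram marks replace the 3-gram marks in this example.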

also please note

use your brain when interpreting the results.
finding sequences in data with very low cardinality is normal.
finding sequences in censored data is normal.
if you pick odd grouping levels, the randomisation test will give you similarly odd results.

static version

supply csv filename
provide column range
set minimum sequence length
knit document

-> displays your data with identified repeat-sequence membership colour-coded

-> randomly reorders data within a specified grouping level (e.g., original data columns & a treatment column in the original data) 1000 times and calculates the distribution of counts of data points that are part of a repeated sequence
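
the randomisation step can be sketched roughly as below. this is a Python illustration and not the R Markdown code in this repository: the names `count_flagged`, `shuffle_within_groups` and `randomisation_counts`, the `seed` parameter, and the use of one group label per data point are all assumptions made for the example.

```python
import random
from collections import Counter


def count_flagged(values, min_len):
    """Count data points that lie in an n-gram of length min_len occurring
    at least twice. Any longer repeated n-gram is made of repeated
    min_len-grams, so checking min_len alone gives the same count."""
    grams = [tuple(values[i:i + min_len]) for i in range(len(values) - min_len + 1)]
    counts = Counter(grams)
    flagged = set()
    for start, gram in enumerate(grams):
        if counts[gram] >= 2:
            flagged.update(range(start, start + min_len))
    return len(flagged)


def shuffle_within_groups(values, groups, rng):
    """Return a copy of values reordered at random, but only within
    positions that share the same group label."""
    shuffled = list(values)
    by_group = {}
    for idx, label in enumerate(groups):
        by_group.setdefault(label, []).append(idx)
    for idxs in by_group.values():
        vals = [values[i] for i in idxs]
        rng.shuffle(vals)
        for i, v in zip(idxs, vals):
            shuffled[i] = v
    return shuffled


def randomisation_counts(values, groups, min_len, n_iter=1000, seed=0):
    """Observed flagged count plus the counts from n_iter group-wise shuffles."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    observed = count_flagged(values, min_len)
    null = [
        count_flagged(shuffle_within_groups(values, groups, rng), min_len)
        for _ in range(n_iter)
    ]
    return observed, null


values = [1, 2, 3, 9, 1, 2, 3, 7]
groups = ["ctrl"] * 4 + ["treated"] * 4
observed, null = randomisation_counts(values, groups, min_len=3)
print(observed, len(null))  # 6 1000
```

comparing `observed` with the spread of `null` shows whether the original ordering contains more repeated-sequence members than reshuffled data usually does at that grouping level.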

dynamic version

run document
enter filename etc. in the UI
