Skip to content

totalNumberOfWords #2

@patrickjae

Description

@patrickjae

In mallet branch, HDPGibbsSampler line 71-73
simply adding the length of data to totalNumberOfWords should be faster than traversing an index and adding one each time, especially for very large documents
might look like this:

totalNumberOfWords += ((FeatureSequence) corpus.get(d).getData()).getLength();

Metadata

Metadata

Assignees

Labels

No labels
No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions