Open
Description
Hello!
I'm currently using text2vec to create embeddings for a dataset of tweets. Since each document is pretty short, I want to implement a Simple Word Embedding Model (SWEM), specifically SWEM-concat, whereby I concatenate together the average of all word vectors in a document, and the result from max-pooling. This method is discussed in this paper.
I can get the document averages by normalizing the dtm
object and then taking the dot product of that with the word vector object, but I'm struggling with how to get the max-pooling results. Any help would be much appreciated!
Thanks!
Metadata
Metadata
Assignees
Labels
No labels