Open
Description
From #603 (comment)
Would be nice to try some real-world validation of the encoder.
@crimsoncress is working on classification in the vision domain (using the MNIST examples), you could do similar with a text-classification dataset.
- spam/ham is the classics of text-classification
- we might look for some interesting NLP datasets, such as "topic classification", "document summary"