Missing Models #493

jaseweston · 2018-01-10T02:03:28Z

This is a list of models not yet in ParlAI that would be great to have. Feel free to add more to the list also! We will remove individual items when they are done.

BiDaf model for QA: https://allenai.github.io/bi-att-flow/
model in decaNLP that are missing: https://github.com/salesforce/decaNLP
Hierarchical Encoder Decoder for Dialog Modelling https://github.com/julianser/hed-dlg
A general-purpose encoder-decoder framework for Tensorflow https://github.com/google/seq2seq
Seq2Seq from http://opennmt.net
Utilities from AI2 toolkit?
ELMo word embeddings: https://github.com/allenai/allennlp/blob/master/tutorials/how_to/elmo.md

jsedoc · 2018-05-14T05:20:33Z

@jaseweston for HRED there's already changes made in Julian's fork of ParlAI (https://github.com/julianser/ParlAI).

theSage21 · 2018-10-09T10:21:52Z

If nobody is doing bidaf I can add it. It will take me some time though.

jaseweston · 2018-10-12T14:12:02Z

If nobody is doing bidaf I can add it. It will take me some time though.

@theSage21 sure that would be great!

theSage21 · 2018-10-12T14:21:53Z

I can lay out the code like in the drqa system?

jaseweston · 2018-10-12T14:29:49Z

@alexholdenmiller can give advice, perhaps

uralik · 2018-10-12T15:46:07Z

@theSage21, I guess right now the best thing is to use TorchAgent as a base class, check seq2seq agent for example

alexholdenmiller · 2018-10-12T18:28:20Z

Yes we definitely prefer using the TorchAgent parent class e.g. how seq2seq or memnn or example_seq2seq are set up. It eliminates a lot of copy-pasta from the model.

alexholdenmiller · 2018-10-12T20:30:40Z

I should caveat my recommendation: if you're not using pyorch then there will be a few inefficiencies (e.g. casting the torch tensors into another format), but it will still likely simplify the code. You're certainly welcome to roll it from scratch, the TorchAgent (parlai/core/torch_agent) just includes a lot of basic stuff like remembering the conversation history, vectorizing the text, and putting it into batches to feed into the model.

ricsinaruto · 2018-11-30T12:18:25Z

Commenting here to let interested people know, that I have a somewhat working integration of the VHCR model on this fork: https://github.com/Mrpatekful/ParlAI/tree/dialogwae.
VHCR is a state-of-the-art dialog model, and I used the official implementation (https://github.com/ctr4si/A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling).

The model is far from done however, I haven't really tested it yet (the loss seems to at least go down). I am still working on the generating function at test time, and I haven't even thought about how to integrate beam search yet. I plan to send a PR when I finish these tasks. I am happy to collaborate if anyone is up to it.

alexholdenmiller · 2018-11-30T20:08:07Z

Thanks for the updates @ricsinaruto!

I wanted to quickly note that @stephenroller landed #1260 two weeks ago, which provides a lot of the wrapping around typical generator code. This makes the seq2seq code at parlai/agents/seq2seq/seq2seq.py remarkably short in the current version, and includes functionality for doing beam search for you. You might find it quite a bit easier to rebase and subclass this new TorchGeneratorAgent (parlai/core/torch_generator_agent.py).

ricsinaruto · 2018-12-01T11:00:34Z

Yeah I knew about that but thanks for bringing it to my attention. So far I actually subclassed the seq2seq because it has a lot of funcionality, but I will change it to this new generator agent as it would be cleaner.

alexholdenmiller · 2018-12-03T15:40:43Z

Yes in the master branch nearly all of the functionality you were using in your fork has been moved to the TorchGeneratorAgent, actually!

github-actions · 2020-06-03T00:11:49Z

This issue has not had activity in 30 days. Marking as stale.

agilebean · 2021-09-19T03:23:24Z

In my experiments with BlenderBot 1.0, the 1B was nearly as fast as the 400M model but showed much better conversational performance. The 1B was also much faster than the 3B model.

Therefore, may I ask the BlenderBot 2.0 team @stephenroller @alexholdenmiller et al.:
Any chance you consider releasing a 1B model also for BlenderBot 2.0 soon?

I guess this might benefit many other people as well :)

jaseweston added the Help Wanted label Jan 10, 2018

jaseweston mentioned this issue Jun 6, 2018

ELMo word embeddings #822

Closed

github-actions bot added the stale-issue label Jun 3, 2020

github-actions bot closed this as completed Jun 3, 2020

stephenroller reopened this Jun 3, 2020

stephenroller added donotreap Avoid automatically marking as stale. and removed stale-issue labels Jun 3, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Missing Models #493

Missing Models #493

jaseweston commented Jan 10, 2018 •

edited

Loading

jsedoc commented May 14, 2018

theSage21 commented Oct 9, 2018

jaseweston commented Oct 12, 2018

theSage21 commented Oct 12, 2018

jaseweston commented Oct 12, 2018

uralik commented Oct 12, 2018

alexholdenmiller commented Oct 12, 2018

alexholdenmiller commented Oct 12, 2018

ricsinaruto commented Nov 30, 2018

alexholdenmiller commented Nov 30, 2018

ricsinaruto commented Dec 1, 2018

alexholdenmiller commented Dec 3, 2018

github-actions bot commented Jun 3, 2020

agilebean commented Sep 19, 2021

Missing Models #493

Missing Models #493

Comments

jaseweston commented Jan 10, 2018 • edited Loading

jsedoc commented May 14, 2018

theSage21 commented Oct 9, 2018

jaseweston commented Oct 12, 2018

theSage21 commented Oct 12, 2018

jaseweston commented Oct 12, 2018

uralik commented Oct 12, 2018

alexholdenmiller commented Oct 12, 2018

alexholdenmiller commented Oct 12, 2018

ricsinaruto commented Nov 30, 2018

alexholdenmiller commented Nov 30, 2018

ricsinaruto commented Dec 1, 2018

alexholdenmiller commented Dec 3, 2018

github-actions bot commented Jun 3, 2020

agilebean commented Sep 19, 2021

jaseweston commented Jan 10, 2018 •

edited

Loading