Open
Description
Describe the feature
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation
with Linear Computational Complexity
https://arxiv.org/pdf/2412.09856
LinGen is non-transformer based SSM Mamba2 based model. We are working with the first author to implement this LinGen MATE architecture for video generation. We like to add LinGen MATE architecture as a model option to InternLM EVO. We would like to discuss with internlm EVO team about this implementation
Will you implement it?
- I would like to implement this feature and create a PR!
Activity