Skip to content

mamei16/MADLAD-400-WebUI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

50 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Image

Features

This app automatically splits longer texts into paragraphs and sentences before translating, thus overcoming the short maximum text length of the MADLAD-400 models.

How to run

  1. Install https://github.com/mamei16/llama-cpp-binaries (This will install a fork of llama.cpp, which adds support for madlad models to the llama-server.)
  2. Install the other requirements
  3. Download a GGUF version of a madlad400 model. Note that GGUF models from the official repo do not work. Models that are confirmed to work:
  4. Launch the app with: python app.py <path_to_your_gguf>

VRAM Requirements

If you run an 8-bit quantized version of the 10B model, it can be run with 11GB of VRAM. The Q8_0 version of the 3B model can be run with 6GB of VRAM.

About

A simple Gradio app for local translation using the GGUF versions of MADLAD-400

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages