We have access to HBS bi-text data on websites such as:

* Opus: https://opus.nlpl.eu/
* MaCoCu: https://macocu.eu/

I'm having conversations with local NLP experts, non-profits, and organizations that might help us acquire much higher quality bi-text.

[Related issue](https://github.com/gordicaleksa/Open-NLLB/issues/10) (but not quite the same).