Repositories list
41 repositories
cisnlp.github.io (Public): Homepage of cisnlp
GlotLID (Public): 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
GlotWeb (Public): 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)
GlotScript (Public): 🖋 Resource and Tool for Writing System Identification (Unicode 17.0) -- LREC 2024
- This is the codebase for "Large Reasoning Models Are (Not Yet) Multilingual Latent Reasoners"
multypo (Public)
KLAR-CLC (Public)
Language-Mixing (Public)
manchu-in-context-mt (Public)
MIB-circuit-track (Public)
spatial_intuitions (Public)
- Tracing Multilingual Factual Knowledge Acquisition in Pretraining
MEXA (Public): 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
GlotCC (Public): 🕸 GlotCC Dataset and Pipeline -- NeurIPS 2024
code-specific-neurons (Public): 💻🔍 How Programming Concepts and Neurons Are Shared in Code Language Models
oscar-io (Public)
ungoliant (Public)
oscar-tools (Public)
LangSAMP (Public): LangSAMP: Language-Script Aware Multilingual Pretraining
analogical_reasoning (Public)
Transliteration-PPA (Public): Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment
lohoravens-webpage (Public)
MaskLID (Public): 💬 MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024
Taxi1500 (Public)
TransMI (Public): TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data
TransliCo (Public): TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models
Spatial_Schemas (Public)
XAMPLER (Public)