awesome-turkish-vlm

A curated list of models, datasets and other useful resources for Turkish Vision-Language Models (VLM).

Content

awesome-turkish-vlm

🚀 Introduction

This repository is an awesome list of curated resources dedicated to Turkish Vision-Language Models (VLM). It includes pretrained/fine-tuned models, datasets, and related works you need to get started or deepen your research in Turkish multimodal AI.

Contributions and suggestions are warmly welcomed! 🌟

🤖 Models

Note

We prioritize adding models that explicitly state in their documentation or model card that they have been trained for Turkish. However, other models that are observed to perform well with Turkish in practice may also be included.

Name	Size	License
ytu-ce-cosmos/Turkish-LLaVA-v0.1	8B	MIT
TraVisionLM-base	875M	Apache 2.0
TraVisionLM-DPO	875M	Apache 2.0
TraVisionLM-Object-Detection-ft	875M	Apache 2.0
mistralai/Mistral-Small-3.2-24B-Instruct-2506	24B	Apache-2.0
mistralai/Mistral-Small-3.1-24B-Instruct-2503	24B	Apache-2.0
CohereLabs/aya-vision-8b	8B	CC-BY-NC-4.0
CohereLabs/aya-vision-32b	32B	CC-BY-NC-4.0
utter-project/EuroVLM-1.7B-Preview	1.7B	Apache-2.0
utter-project/EuroVLM-9B-Preview	9B	Apache-2.0
Gemma 3	4B-12B-27B	Gemma
Qwen2-VL	2B-7B-72B	Apache-2.0
Qwen2.5-VL	3B-7B-32B-72B	Apache-2.0

📚 Datasets

Name	Description	Size	License
ytu-ce-cosmos/turkce-kitap	Book cover collection designed to improve the Turkish OCR ability.	108k
ytu-ce-cosmos/Turkish-LLaVA-Pretrain	Pretraining dataset for Turkish base VLM.	595k
ytu-ce-cosmos/Turkish-LLaVA-Finetune	Fine-tuning dataset for Turkish VLM.	522k
ucsahin/COCO-OD-TR-Single-Objects-v2	Combined dataset for object detection task.	153k
ucsahin/Turkish-VLM-Mix-Benchmark	Combined dataset for benchmark.	35k
ucsahin/TR-VLM-DPO-Dataset	DPO formatted instruction-following dataset.	10k
atasoglu/flickr30k-turkish	Turkish translation of the Flickr30k dataset.	30k
atasoglu/flickr8k-turkish-mt	Turkish translation of the Flickr8k dataset.	8k
atasoglu/flickr8k-turkish	Native Turkish version of the Flickr8k dataset.	8k	CC0 1.0
atasoglu/flickr8k-turkish-detailed-captions	Flickr8k dataset with long captions.	8k	CC0 1.0
99eren99/LLaVA1.5-Data-Turkish	Combined dataset for instruction-following.		CC BY 4.0
mcemilg/laion2B-multi-turkish-subset	Turkish subset of the Laion2b-multi.	34M	CC BY 4.0
NexusAI-tddi/VisIT-Bench-tr	Bench dataset for instruction-following.	574
umarigan/turkish_clip_dataset_with_text_embeddings	Turkish-English caption pairs with embeddings.	410k	CreativeML Open RAIL-M
YxBxRyXJx/cut_TRV_ver2_1019	Turkish object detection dataset.	2k
selimc/tr-textbook-ColPali	Dataset for document retrieving from Turkish textbook.	3k
muhammetfatihaktug/bilim_teknik_mini_colpali	Dataset for document retrieving from Turkish science magazine.	4k	MIT
umarigan/PD12M-Turkish	A large Turkish captioning dataset.	12M	CDLA-Permissive-2.0

🧑‍💻 Code

📓 Notebooks

🔗 Repositories

smol-vision: Recipes for shrinking, optimizing, customizing cutting edge vision models.
turkish-llava-notebooks: Collection of notebooks for Turkish LLaVA model on various on various use-cases.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

awesome-turkish-vlm

Content

🚀 Introduction

🤖 Models

📚 Datasets

🧑‍💻 Code

📓 Notebooks

🔗 Repositories

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

awesome-turkish-vlm

Content

🚀 Introduction

🤖 Models

📚 Datasets

🧑‍💻 Code

📓 Notebooks

🔗 Repositories

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages