pretraining
Here are 222 public repositories matching this topic...
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Updated Apr 24, 2024 - Python
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Updated Apr 2, 2025 - Python
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNNs).
Updated Feb 2, 2024 - Python
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; PyTorch implementation of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
Updated Jan 23, 2024 - Python
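The entry above describes the BERT/MAE-style masked-modeling objective applied to convolutional backbones. Below is a minimal, generic sketch of that objective in PyTorch; it is not the official SparK code (which relies on sparse convolutions and a hierarchical decoder), and the toy encoder, patch size, and mask ratio are illustrative assumptions.

```python
# Generic masked-image-modeling sketch (NOT the official SparK implementation):
# zero out random patches, encode the masked image, and regress the hidden pixels.
import torch
import torch.nn as nn
import torch.nn.functional as F

PATCH, MASK_RATIO = 16, 0.6  # assumed values for illustration

class TinyConvEncoder(nn.Module):
    """Toy stand-in for a real convolutional backbone; downsamples by PATCH."""
    def __init__(self, dim=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, dim, kernel_size=PATCH, stride=PATCH),  # one feature vector per patch
            nn.GELU(),
            nn.Conv2d(dim, dim, 3, padding=1),
        )
    def forward(self, x):
        return self.net(x)

encoder = TinyConvEncoder()
decoder = nn.Conv2d(128, 3 * PATCH * PATCH, kernel_size=1)  # predicts the pixels of each patch

def masked_modeling_loss(imgs):
    B, _, H, W = imgs.shape
    ph, pw = H // PATCH, W // PATCH
    # keep[i, 0, y, x] == 1 means patch (y, x) stays visible
    keep = (torch.rand(B, 1, ph, pw, device=imgs.device) > MASK_RATIO).float()
    pixel_mask = F.interpolate(keep, scale_factor=PATCH, mode="nearest")
    feats = encoder(imgs * pixel_mask)            # encode only the visible content
    pred = decoder(feats)                         # (B, 3*P*P, ph, pw)
    target = F.unfold(imgs, PATCH, stride=PATCH).reshape(B, 3 * PATCH * PATCH, ph, pw)
    per_patch_err = ((pred - target) ** 2).mean(dim=1, keepdim=True)
    # reconstruction loss is computed on the masked patches only
    return (per_patch_err * (1 - keep)).sum() / (1 - keep).sum().clamp(min=1)

loss = masked_modeling_loss(torch.randn(4, 3, 224, 224))
loss.backward()
```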
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
Updated Aug 19, 2022
EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
Updated Nov 30, 2023 - Jupyter Notebook
X-modaler is a versatile, high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Updated Feb 27, 2023 - Python
Official Repository for the Uni-Mol Series Methods
Updated May 29, 2025 - Python
Pretraining and inference code for a large-scale depth-recurrent language model
Updated Sep 5, 2025 - Python
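The entry above refers to a model whose depth comes from recurrence: one weight-tied block is applied repeatedly, so the amount of computation can be chosen at inference time. The following is a minimal, hypothetical PyTorch sketch of that idea, not the repository's actual architecture; the block choice, dimensions, and the omission of positional encodings are simplifying assumptions.

```python
# Minimal sketch of depth recurrence (NOT the referenced repository's model):
# a single weight-tied Transformer block is applied `depth` times, so compute
# can be scaled up or down at inference without changing the parameters.
import torch
import torch.nn as nn

class DepthRecurrentLM(nn.Module):
    def __init__(self, vocab_size=32000, dim=512, heads=8):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)   # positional encodings omitted for brevity
        self.block = nn.TransformerEncoderLayer(     # the one shared block, reused at every step
            dim, heads, dim * 4, batch_first=True, norm_first=True
        )
        self.lm_head = nn.Linear(dim, vocab_size, bias=False)

    def forward(self, tokens, depth: int = 8):
        T = tokens.size(1)
        # standard causal mask so each position only attends to the past
        causal = torch.triu(torch.full((T, T), float("-inf"), device=tokens.device), diagonal=1)
        h = self.embed(tokens)
        for _ in range(depth):                       # recur in depth with tied weights
            h = self.block(h, src_mask=causal)
        return self.lm_head(h)

model = DepthRecurrentLM()
tokens = torch.randint(0, 32000, (2, 16))
logits_cheap = model(tokens, depth=2)    # shallow unroll: less compute
logits_deep = model(tokens, depth=16)    # deeper unroll: more compute, same weights
```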
[ICLR 2024🔥] Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Updated Mar 25, 2024 - Python
Official PyTorch implementation of the "ImageNet-21K Pretraining for the Masses" (NeurIPS 2021) paper
Updated Jan 11, 2023 - Python
A curated list of 3D vision papers related to the robotics domain in the era of large models (LLMs/VLMs), inspired by awesome-computer-vision; includes papers, code, and related websites
Updated Jul 19, 2025
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
Updated Oct 6, 2025 - Python
Best practices for training LLaMA models in Megatron-LM
Updated Jan 2, 2024 - Python
Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
Updated Feb 24, 2025 - Python
Saprot: Protein Language Model with Structural Alphabet (AA+3Di)
Updated Oct 9, 2025 - Python
PITI: Pretraining is All You Need for Image-to-Image Translation
Updated Jun 2, 2024 - Python
Personal Project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Train your own 8B/14B LLaVA-style MLLM on an RTX 3090/4090 with 24 GB.
Updated Mar 10, 2025 - Jupyter Notebook