1+ Entry , ID , Keywords
2+ 1 , https://arxiv.org/abs/1310.4546 , Improved Skip-gram
3+ 2 , https://arxiv.org/abs/1406.2661 , " GAN, Generative Adverserial Networks"
4+ 3 , https://arxiv.org/abs/1409.0473 , Encoder-Decoder Translation
5+ 4 , https://arxiv.org/abs/1409.3215 , " seq2seq LSTM, sequence LSTM"
6+ 5 , https://arxiv.org/abs/1409.1556 , " VGGNet, VGG-Net"
7+ 6 , https://arxiv.org/abs/1502.03167 , " BatchNorm, Batch Norm, Batch Normalization"
8+ 7 , https://arxiv.org/abs/1505.04597 , " U-Net, unet"
9+ 8 , https://arxiv.org/abs/1512.03385 , " ResNet, Residual Network, Residual Paths"
10+ 9 , https://arxiv.org/abs/1608.06993 , " DenseNet, Dense Neural Network"
11+ 10 , https://arxiv.org/abs/1506.02640 , " YOLO, You Only Look Once"
12+ 11 , https://arxiv.org/abs/1412.6980 , " Adam, Adam optimizer"
13+ 12 , https://arxiv.org/abs/1706.03762 , " Attention, Transformer, Attention Mechanism"
14+ 13 , https://arxiv.org/abs/1703.06870 , Mask R-CNN
15+ 14 , https://arxiv.org/abs/1812.04948 , " StyleGAN, style gan, style generative adversial network"
16+ 15 , https://arxiv.org/abs/1810.04805 , " BERT, Bidirectional Transformer"
17+ 16 , https://arxiv.org/abs/1803.03635 , " Lottery Ticket Hypothesis, Neural Network Pruning"
18+ 17 , https://arxiv.org/abs/2003.08934 , " Neural Radiance Field, NeRF"
19+ 18 , https://arxiv.org/abs/2010.11929 , " Vision Transformer, ViT, "
20+ 19 , https://arxiv.org/abs/2001.08361 , Scaling Laws
21+ 20 , https://arxiv.org/abs/2112.10752 , " Latent Diffusion Model, LDM, Latent Diffusion"
22+ 21 , https://arxiv.org/abs/1301.3781 , " Word2Vec, Bag of word"
23+ 22 , https://arxiv.org/abs/1312.6114 , " VAE, Variational Autoencoder"
24+ 23 , https://arxiv.org/abs/1312.5602 , " Atari Reinforcment Learning, Q-learning"
25+ 24 , https://arxiv.org/abs/2005.14165 , " GPT3, GPT-3"
26+ 25 , https://arxiv.org/abs/1807.03039 , " Glow, Generative Flow"
27+ 26 , https://arxiv.org/abs/2102.12092 , DALLE
28+ 27 , https://arxiv.org/abs/1907.11692 , " RoBERTa, Robust BERT pretraining"
29+ 28 , https://arxiv.org/abs/2402.03300 , " GRPO, Deepseek Math"
30+ 29 , https://arxiv.org/abs/2203.02155 , " RLHF, InstructGPT"
31+ 30 , https://arxiv.org/abs/2103.00020 , " CLIP, Contrastive Language Image Pretraining"
32+ 31 , https://arxiv.org/abs/2103.14030 , " SWIN, SWIN Transformer, Shifted Windows "
33+ 32 , https://arxiv.org/abs/2204.02311 , " PaLM, Pathways Language Model"
34+ 33 , https://arxiv.org/abs/2004.10934 , YOLOv4
35+ 34 , https://arxiv.org/abs/1910.10683 , " T5, text-to-text transformer"
36+ 35 , https://arxiv.org/abs/2204.08583 , " VQGAN, vector quantized gan, vector quantized generative adverserial network"
37+ 36 , https://arxiv.org/abs/1611.07004 , " pix2pix, cGAN image to image, PatchGAN discrimnator"
38+ 37 , https://arxiv.org/abs/1707.06347 , " PPO, Proximal Policy Optimization"
39+ 38 , https://arxiv.org/abs/2111.06377 , " MAE, Masked AE, Masked Autoencoder"
40+ 39 , https://arxiv.org/abs/1409.4842 , " Inception, Inceptionv1, Inception CNN"
41+ 40 , https://arxiv.org/abs/1602.07261 , Inceptionv4
42+ 41 , https://arxiv.org/abs/1512.00567 , Inceptionv3
43+ 42 , https://arxiv.org/abs/1406.1078 , " GRU, Gated Recurrent Unit"
44+ 43 , https://arxiv.org/abs/1411.4038 , " FCN, fully convolution network"
45+ 44 , https://arxiv.org/abs/2304.07193 , " DINOv2, improved self distillation with no labels"
46+ 45 , https://arxiv.org/abs/2104.14294 , " DINO, self-distillation with no labels"
47+ 46 , https://arxiv.org/abs/2312.00752 , mamba
48+ 47 , https://arxiv.org/abs/2106.09685 , " LoRA, low rank adaptation, low rank adapter"
49+ 48 , https://arxiv.org/abs/2305.14314 , " QLoRA, quantized lora, quantized low rank adapter"
50+ 49 , https://arxiv.org/abs/2205.14135 , " FlashAttention, Flash Attention"
51+ 50 , https://arxiv.org/abs/2304.02643 , " SAM, segment anything, segment anything model"
52+ 51 , https://arxiv.org/abs/2302.05543 , " ControlNET, "
53+ 52 , https://arxiv.org/abs/2303.08774 , " GPT4, GPT-4, openai gpt"
54+ 53 , https://arxiv.org/abs/2005.11401 , " rag, retrieval augmented generation"
55+ 54 , https://arxiv.org/abs/1910.01108 , " DistilBert, distilled bert"
56+ 55 , https://arxiv.org/abs/1802.05365 , " ELMo, embeddings from language model"
57+ 56 , https://arxiv.org/abs/1711.00937 , " vq-vae, vector quantized vae, vector quantized variational autoencoder"
58+ 57 , https://arxiv.org/abs/2006.11239 , " Diffusion Probabalistic Model, Denoising Diffusion Probabalistic, Diffusion Probabalistic, DDPM"
59+ 58 , https://arxiv.org/abs/2010.02502 , " Difussion Implicit Model, Denosining Diffusion Implcit, Diffusion Implicit, DDIM"
60+ 59 , https://arxiv.org/abs/1701.07875 , " WGAN, Wassterstein GAN"
61+ 60 , https://arxiv.org/abs/1709.01507 , " SE block, SE layer, Squeeze-and-Excitation"
0 commit comments