|
59 | 59 | - Adobe releases **Firefly 2**. |
60 | 60 |
|
61 | 61 | ## November |
62 | | -- **Stable Diffusion XL Turbo** is released - A fast model that allows the creation of an image in one step in real time. |
| 62 | +- **Stable Diffusion XL Turbo** is released - A fast model that allows the creation of an image in one step in real-time. |
63 | 63 |
|
64 | 64 | ## December |
65 | 65 | - **Midjourney v6** is launched. |
|
244 | 244 | ## June 2025 |
245 | 245 | - Google releases **Gemini 2.5 Pro** (final production-ready version), which leads benchmarks across the board. |
246 | 246 | - ElevenLabs rolls out **Eleven v3 (alpha)** TTS with fine grained emotion control and support for 70+ languages. |
247 | | -- OpenAI debuts **o3 pro**, an enhanced reasoning model offering extended context and real time tool integrations. |
| 247 | +- OpenAI debuts **o3 pro**, an enhanced reasoning model offering extended context and real-time tool integrations. |
248 | 248 |
|
249 | 249 | ## July 2025 |
250 | 250 | - xAI releases **Grok 4**, achieving a new SOTA of 15.9% on ARC-AGI v2 and 25.4% on Humanity’s Last Exam. (*special*) |
|
253 | 253 | - Google introduces **Gemini Deep Think**, which also earns an IMO 2025 gold by solving five of six problems with parallel reasoning. (*special*) |
254 | 254 | - Alibaba open-sources two variants, **Qwen3-235B-A22B-Instruct-2507** (instruction-tuned) and **Qwen3-Coder**, for general LLM use and automated code generation. |
255 | 255 | - Moonshot AI debuts **Kimi K2**, a Chinese LLM praised for its open-research focus and robust performance. |
256 | | -- Chinese startup Zhipu open-sources **GLM-4.5**, a 130 B-parameter model tailored for intelligent-agent applications. |
| 256 | +- Chinese startup Zhipu open-sources **GLM-4.5**, a 130 B-parameter model tailored for intelligent-agent applications. |
| 257 | + |
| 258 | +## August 2025 |
| 259 | +- Google introduced **Gemini 2.5 Deep Think**, a special "extended thinking" mode for solving complex problems and exploring alternatives. (*special*) |
| 260 | +- Anthropic released **Claude Opus 4.1**, an upgrade focused on improving agentic capabilities and real-world coding. |
| 261 | +- Google DeepMind announced **Genie 3.0**, a "world model" for creating interactive 3D environments from text, maintaining consistency for several minutes. (*special*) |
| 262 | +- OpenAI released **gpt-oss-120b** and **gpt-oss-20b**, a family of open-source models with high reasoning capabilities, optimized to run on accessible hardware. |
| 263 | +- OpenAI launched **GPT-5**, the company's next-generation model, with significant improvements in coding and a dynamic "thinking" mode to reduce hallucinations. |
| 264 | +- DeepSeek released **DeepSeek V3.1**, a hybrid model combining fast and slow "thinking" modes to improve performance in agentic tasks and tool use. |
| 265 | +- Google launched a preview of **Gemini 2.5 Flash Image** (showcased as *nano-banana*), an advanced model for precise image editing, merging, and maintaining character consistency. (*special*) |
0 commit comments