Skip to content

owainlewis/awesome-artificial-intelligence

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

219 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Awesome Artificial Intelligence

A curated collection of must-use, actively maintained resources for building and shipping AI systems.

Focus: AI engineering (RAG, agents, evals, guardrails, deploy) plus the best books, guides, papers, and a carefully selected set of tools.


πŸ“š Learn

Deep, durable knowledge β€” still valuable five years from now.

Books

Modern & Practical

Foundational

Courses

Beginner

Intermediate / Advanced

Focused

Landmark Papers

Research that shaped modern AI β€” worth reading to understand the "why" behind today's architectures.


πŸ›  Build

The toolchain for building with AI. Personal note: you don't need tons of frameworks β€” start with simple LLM calls and work up.

Guides & Playbooks

Frameworks

  • PocketFlow β€” Extremely minimalist AI agent framework in just 100 lines of code. Fantastic way to learn.
  • Google ADK β€” Google's Agent Development Kit (Python, Java). Great local development experience + A2A + MCP.
  • Pydantic-AI β€” Typed, structured LLM orchestration framework built on Pydantic models for safe, predictable outputs.
  • LangGraph β€” Build multi-agent workflows with stateful graphs on top of LangChain.
  • CrewAI β€” Agent orchestration with structured tasks and human-in-the-loop controls.
  • AutoGen β€” Microsoft's framework for multi-agent conversation and collaboration.
  • LlamaIndex β€” Data framework for ingesting, indexing, and querying private data with LLMs.
  • Haystack β€” Open-source search/RAG framework with modular pipelines.
  • Docling β€” Great library for ingesting any kind of document for RAG ⭐

Evals

IDEs

  • Cursor β€” LLM-powered IDE for multi-file edits and codebase-aware chat.
  • GitHub Copilot β€” In-IDE code completion, chat, and refactors.

πŸ€– Agents

Harnesses that turn LLMs into autonomous workers. The model is swappable; the harness is the product.

Coding

For live capability comparison, see Terminal-Bench and SWE-bench.

  • Claude Code β€” Anthropic's CLI agent; multi-file codebase refactoring with long context.
  • Codex CLI β€” OpenAI's Rust-based local terminal agent; lightweight and fast.
  • Gemini CLI β€” Google's official open-source terminal agent; long-context repo exploration.
  • Cursor CLI β€” Cursor's terminal-native agent with sandboxed permissions.
  • Aider β€” Git-integrated pair programming with surgical edits and undo.
  • OpenCode β€” Provider-agnostic terminal harness with a strong TUI.
  • OpenHands β€” Open-source autonomous SWE platform; browser + shell + editor loop.
  • Cline β€” Open-source agentic IDE extension with strong multi-provider support.
  • Continue β€” Open-source IDE + CLI assistant with source-controlled rules.
  • Goose β€” Block's extensible, MCP-driven local agent.
  • Factory Droid β€” Benchmark-leading multi-model harness with BYOK local execution.
  • Amp β€” Sourcegraph's commercial agentic coding tool with strong product UX.
  • Mistral Vibe β€” Mistral's agentic coding CLI, powered by Devstral.
  • Qwen Code β€” Alibaba's terminal coding agent, optimized for Qwen models.
  • Pi β€” Highly customizable terminal harness; minimal base prompt, extension-driven.
  • Nanocoder β€” Private, local-first agent for Ollama and LM Studio.
  • Kilo CLI β€” Multi-mode agent with a unified gateway to 500+ models.

🧠 Models

State-of-the-art models by modality.

πŸ’¬ Language

The major frontier labs.

  • ChatGPT β€” Best for general reasoning, tool use, and the broadest ecosystem.
  • Claude β€” Best for long-context analysis, coding, and structured thinking.
  • Gemini β€” Best for multimodal tasks and Google ecosystem integration.
  • Grok β€” Best for real-time information via X and very long context.
  • Llama β€” Best open-weight family for self-hosting and fine-tuning.
  • Mistral β€” Best for lightweight, high-performance open-weight models.
  • DeepSeek β€” Best for cost-efficient reasoning with open weights.
  • Qwen β€” Best for multilingual and Chinese-first applications.
  • Kimi β€” Best for long-context instruction following.
  • GLM β€” Frontier-tier Chinese model with open weights.
  • Cohere β€” Best for enterprise LLMs with strong retrieval-augmented generation APIs.

πŸ–Ό Image

  • GPT Image β€” OpenAI's integrated image generation with near-perfect text rendering.
  • Midjourney β€” Artistic and photorealistic images.
  • Adobe Firefly β€” Integrated into Creative Cloud; commercial-safe.
  • Ideogram β€” Precise, legible text in generated images.
  • Flux β€” High-res, prompt-editable, open-weight images.

πŸŽ₯ Video

  • Google Veo β€” High-quality video with synchronized audio.
  • Runway β€” Video editing + generation with granular creative control.
  • Kling β€” Cinematic, realistic video generation.

πŸŽ™ Audio

  • ElevenLabs β€” High-quality text-to-speech and voice cloning.
  • Suno β€” AI music from text prompts.

πŸ“Š Compare

Live benchmarks, pricing, and the latest model versions.

  • OpenRouter β€” Unified API + live pricing across ~300 models.
  • LMArena β€” Human-preference Elo rankings for text, image, and video.
  • Artificial Analysis β€” Speed, price, and quality benchmarks across providers.

πŸ“‘ Follow

Stay current without drowning in noise.

Newsletters


Releases

No releases published

Packages

 
 
 

Contributors