Skip to content

From Prompt to Production: Building Real LLM Apps with RAG & Agents #408

@anujsainimca

Description

@anujsainimca

Talk title

From Prompt to Production: Building Real LLM Apps with RAG & Agents

Short talk description

This talk focuses on how to move beyond simple prompts and build real-world LLM-powered applications using Retrieval-Augmented Generation (RAG) and agentic workflows. We will explore how LLMs interact with external data, tools, and memory to solve practical problems. Through a live demo, attendees will learn how to build a document Q&A system and extend it into a simple agent capable of multi-step reasoning. This session is designed for Python developers and data enthusiasts looking to understand how modern AI systems are actually built and deployed in production.

Long talk description

Large Language Models are powerful, but real-world applications require more than just prompting. This talk bridges the gap between experimentation and production by introducing two key paradigms: Retrieval-Augmented Generation (RAG) and agent-based systems.

We will begin by understanding the limitations of standalone LLMs, including hallucinations and lack of real-time knowledge. From there, we introduce RAG — a technique that connects LLMs with external data sources using embeddings and vector databases, enabling accurate and context-aware responses.

Next, we explore agentic systems, where LLMs can reason, plan, and use tools to complete multi-step tasks. We will discuss how agents differ from simple pipelines and when to use them effectively.

The session includes a live coding demo where we:

Build a document-based Q&A system using RAG
Extend it into a simple agent that can perform multi-step tasks

By the end, attendees will have a clear understanding of how to design, build, and extend LLM applications for real-world use cases.

What format do you have in mind?

Workshop (45-60 minutes, hands-on)

Talk outline / Agenda

  • Introduction: From prompts to real applications (5 mins)
  • Limitations of plain LLMs (hallucination, no memory) (5 mins)
  • RAG fundamentals (embeddings, retrieval, generation) (10 mins)
  • Live Demo: Build a document Q&A system (10 mins)
  • Introduction to agents (tools, planning, workflows) (10 mins)
  • Live Demo: Simple agent with multi-step reasoning (10 mins)
  • Best practices + pitfalls (5 mins)
  • Q&A (5–10 mins)

Key takeaways

  • Understanding of how RAG improves LLM accuracy using external data
  • Practical knowledge of building a document Q&A system in Python
  • Clear distinction between pipelines and agent-based systems
  • Best practices for designing scalable LLM applications
  • Awareness of common pitfalls like hallucination and poor retrieval

What domain would you say your talk falls under?

Data Science and Machine Learning

Duration (including Q&A)

45 minutes (35 min talk + 10 min Q&A)

Prerequisites and preparation

  • Basic Python knowledge
  • Familiarity with APIs (helpful but not required)
  • No prior experience with LLMs required

Resources and references

No response

Link to slides/demos (if available)

No response

Twitter/X handle (optional)

No response

LinkedIn profile (optional)

No response

Profile picture URL (optional)

No response

Speaker bio

I am an AI practitioner with 15+ years of experience working across machine learning, NLP, and large-scale data systems. I have been actively working on Generative AI applications, including LLM-based systems, RAG pipelines, and agentic workflows. I enjoy mentoring students and professionals in building practical, production-ready AI solutions. I have also conducted multiple sessions and workshops focused on applied AI and real-world problem solving.

Availability

09/

Accessibility & special requirements

No response

Speaker checklist

  • I have read and understood the PyDelhi guidelines for submitting proposals and giving talks
  • I have read and acknowledged the PyDelhi accessibility guidelines and will ensure my presentation materials (slides, videos, demos) follow these recommendations
  • I will make my talk accessible to all attendees and will proactively ask for any accommodations or special requirements I might need
  • I agree to share slides, code snippets, and other materials used during the talk with the community
  • I will follow PyDelhi's Code of Conduct and maintain a welcoming, inclusive environment throughout my participation
  • I understand that PyDelhi meetups are community-centric events focused on learning, knowledge sharing, and networking, and I will respect this ethos by not using this platform for self-promotion or hiring pitches during my presentation, unless explicitly invited to do so by means of a sponsorship or similar arrangement
  • If the talk is recorded by the PyDelhi team, I grant permission to release the video on PyDelhi's YouTube channel under the CC-BY-4.0 license, or a different license of my choosing if I am specifying it in my proposal or with the materials I share

Additional comments

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    awaiting response from authorThis proposal needs a response from the author in order for a decision to be made on its schedulingneeds more informationThis proposal needs more information in order for a decision to be made on its acceptanceproposalWish to present at PyDelhi? This label gets added when the "Talk Proposal" option is chosen.

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions