You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
From Prompt to Production: Building Real LLM Apps with RAG & Agents
Short talk description
This talk focuses on how to move beyond simple prompts and build real-world LLM-powered applications using Retrieval-Augmented Generation (RAG) and agentic workflows. We will explore how LLMs interact with external data, tools, and memory to solve practical problems. Through a live demo, attendees will learn how to build a document Q&A system and extend it into a simple agent capable of multi-step reasoning. This session is designed for Python developers and data enthusiasts looking to understand how modern AI systems are actually built and deployed in production.
Long talk description
Large Language Models are powerful, but real-world applications require more than just prompting. This talk bridges the gap between experimentation and production by introducing two key paradigms: Retrieval-Augmented Generation (RAG) and agent-based systems.
We will begin by understanding the limitations of standalone LLMs, including hallucinations and lack of real-time knowledge. From there, we introduce RAG — a technique that connects LLMs with external data sources using embeddings and vector databases, enabling accurate and context-aware responses.
Next, we explore agentic systems, where LLMs can reason, plan, and use tools to complete multi-step tasks. We will discuss how agents differ from simple pipelines and when to use them effectively.
The session includes a live coding demo where we:
Build a document-based Q&A system using RAG
Extend it into a simple agent that can perform multi-step tasks
By the end, attendees will have a clear understanding of how to design, build, and extend LLM applications for real-world use cases.
What format do you have in mind?
Workshop (45-60 minutes, hands-on)
Talk outline / Agenda
Introduction: From prompts to real applications (5 mins)
Limitations of plain LLMs (hallucination, no memory) (5 mins)
Introduction to agents (tools, planning, workflows) (10 mins)
Live Demo: Simple agent with multi-step reasoning (10 mins)
Best practices + pitfalls (5 mins)
Q&A (5–10 mins)
Key takeaways
Understanding of how RAG improves LLM accuracy using external data
Practical knowledge of building a document Q&A system in Python
Clear distinction between pipelines and agent-based systems
Best practices for designing scalable LLM applications
Awareness of common pitfalls like hallucination and poor retrieval
What domain would you say your talk falls under?
Data Science and Machine Learning
Duration (including Q&A)
45 minutes (35 min talk + 10 min Q&A)
Prerequisites and preparation
Basic Python knowledge
Familiarity with APIs (helpful but not required)
No prior experience with LLMs required
Resources and references
No response
Link to slides/demos (if available)
No response
Twitter/X handle (optional)
No response
LinkedIn profile (optional)
No response
Profile picture URL (optional)
No response
Speaker bio
I am an AI practitioner with 15+ years of experience working across machine learning, NLP, and large-scale data systems. I have been actively working on Generative AI applications, including LLM-based systems, RAG pipelines, and agentic workflows. I enjoy mentoring students and professionals in building practical, production-ready AI solutions. I have also conducted multiple sessions and workshops focused on applied AI and real-world problem solving.
Availability
09/
Accessibility & special requirements
No response
Speaker checklist
I have read and understood the PyDelhi guidelines for submitting proposals and giving talks
I have read and acknowledged the PyDelhi accessibility guidelines and will ensure my presentation materials (slides, videos, demos) follow these recommendations
I will make my talk accessible to all attendees and will proactively ask for any accommodations or special requirements I might need
I agree to share slides, code snippets, and other materials used during the talk with the community
I will follow PyDelhi's Code of Conduct and maintain a welcoming, inclusive environment throughout my participation
I understand that PyDelhi meetups are community-centric events focused on learning, knowledge sharing, and networking, and I will respect this ethos by not using this platform for self-promotion or hiring pitches during my presentation, unless explicitly invited to do so by means of a sponsorship or similar arrangement
If the talk is recorded by the PyDelhi team, I grant permission to release the video on PyDelhi's YouTube channel under the CC-BY-4.0 license, or a different license of my choosing if I am specifying it in my proposal or with the materials I share
Talk title
From Prompt to Production: Building Real LLM Apps with RAG & Agents
Short talk description
This talk focuses on how to move beyond simple prompts and build real-world LLM-powered applications using Retrieval-Augmented Generation (RAG) and agentic workflows. We will explore how LLMs interact with external data, tools, and memory to solve practical problems. Through a live demo, attendees will learn how to build a document Q&A system and extend it into a simple agent capable of multi-step reasoning. This session is designed for Python developers and data enthusiasts looking to understand how modern AI systems are actually built and deployed in production.
Long talk description
Large Language Models are powerful, but real-world applications require more than just prompting. This talk bridges the gap between experimentation and production by introducing two key paradigms: Retrieval-Augmented Generation (RAG) and agent-based systems.
We will begin by understanding the limitations of standalone LLMs, including hallucinations and lack of real-time knowledge. From there, we introduce RAG — a technique that connects LLMs with external data sources using embeddings and vector databases, enabling accurate and context-aware responses.
Next, we explore agentic systems, where LLMs can reason, plan, and use tools to complete multi-step tasks. We will discuss how agents differ from simple pipelines and when to use them effectively.
The session includes a live coding demo where we:
Build a document-based Q&A system using RAG
Extend it into a simple agent that can perform multi-step tasks
By the end, attendees will have a clear understanding of how to design, build, and extend LLM applications for real-world use cases.
What format do you have in mind?
Workshop (45-60 minutes, hands-on)
Talk outline / Agenda
Key takeaways
What domain would you say your talk falls under?
Data Science and Machine Learning
Duration (including Q&A)
45 minutes (35 min talk + 10 min Q&A)
Prerequisites and preparation
Resources and references
No response
Link to slides/demos (if available)
No response
Twitter/X handle (optional)
No response
LinkedIn profile (optional)
No response
Profile picture URL (optional)
No response
Speaker bio
I am an AI practitioner with 15+ years of experience working across machine learning, NLP, and large-scale data systems. I have been actively working on Generative AI applications, including LLM-based systems, RAG pipelines, and agentic workflows. I enjoy mentoring students and professionals in building practical, production-ready AI solutions. I have also conducted multiple sessions and workshops focused on applied AI and real-world problem solving.
Availability
09/
Accessibility & special requirements
No response
Speaker checklist
Additional comments
No response