From Prompt to Production: Building Real LLM Apps with RAG & Agents

### Talk title

From Prompt to Production: Building Real LLM Apps with RAG & Agents

### Short talk description

This talk focuses on how to move beyond simple prompts and build real-world LLM-powered applications using Retrieval-Augmented Generation (RAG) and agentic workflows. We will explore how LLMs interact with external data, tools, and memory to solve practical problems. Through a live demo, attendees will learn how to build a document Q&A system and extend it into a simple agent capable of multi-step reasoning. This session is designed for Python developers and data enthusiasts looking to understand how modern AI systems are actually built and deployed in production.

### Long talk description

Large Language Models are powerful, but real-world applications require more than just prompting. This talk bridges the gap between experimentation and production by introducing two key paradigms: Retrieval-Augmented Generation (RAG) and agent-based systems.

We will begin by understanding the limitations of standalone LLMs, including hallucinations and lack of real-time knowledge. From there, we introduce RAG — a technique that connects LLMs with external data sources using embeddings and vector databases, enabling accurate and context-aware responses.

Next, we explore agentic systems, where LLMs can reason, plan, and use tools to complete multi-step tasks. We will discuss how agents differ from simple pipelines and when to use them effectively.

The session includes a live coding demo where we:

Build a document-based Q&A system using RAG
Extend it into a simple agent that can perform multi-step tasks

By the end, attendees will have a clear understanding of how to design, build, and extend LLM applications for real-world use cases.



### What format do you have in mind?

Workshop (45-60 minutes, hands-on)

### Talk outline / Agenda


- Introduction: From prompts to real applications (5 mins)
- Limitations of plain LLMs (hallucination, no memory) (5 mins)
- RAG fundamentals (embeddings, retrieval, generation) (10 mins)
- Live Demo: Build a document Q&A system (10 mins)
- Introduction to agents (tools, planning, workflows) (10 mins)
- Live Demo: Simple agent with multi-step reasoning (10 mins)
- Best practices + pitfalls (5 mins)
- Q&A (5–10 mins)

### Key takeaways

- Understanding of how RAG improves LLM accuracy using external data
- Practical knowledge of building a document Q&A system in Python
- Clear distinction between pipelines and agent-based systems
- Best practices for designing scalable LLM applications
- Awareness of common pitfalls like hallucination and poor retrieval

### What domain would you say your talk falls under?

Data Science and Machine Learning

### Duration (including Q&A)

45 minutes (35 min talk + 10 min Q&A)

### Prerequisites and preparation

- Basic Python knowledge
- Familiarity with APIs (helpful but not required)
- No prior experience with LLMs required

### Resources and references

_No response_

### Link to slides/demos (if available)

_No response_

### Twitter/X handle (optional)

_No response_

### LinkedIn profile (optional)

_No response_

### Profile picture URL (optional)

_No response_

### Speaker bio

I am an AI practitioner with 15+ years of experience working across machine learning, NLP, and large-scale data systems. I have been actively working on Generative AI applications, including LLM-based systems, RAG pipelines, and agentic workflows. I enjoy mentoring students and professionals in building practical, production-ready AI solutions. I have also conducted multiple sessions and workshops focused on applied AI and real-world problem solving.

### Availability

09/

### Accessibility & special requirements

_No response_

### Speaker checklist

- [x] I have read and understood the [PyDelhi guidelines](https://github.com/pydelhi/talks/blob/main/guidelines/speaking.md) for submitting proposals and giving talks
- [x] I have read and acknowledged the [PyDelhi accessibility guidelines](https://github.com/pydelhi/talks/blob/main/guidelines/accessibility.md) and will ensure my presentation materials (slides, videos, demos) follow these recommendations
- [x] I will make my talk accessible to all attendees and will proactively ask for any accommodations or special requirements I might need
- [x] I agree to share slides, code snippets, and other materials used during the talk with the community
- [x] I will follow PyDelhi's Code of Conduct and maintain a welcoming, inclusive environment throughout my participation
- [x] I understand that PyDelhi meetups are community-centric events focused on learning, knowledge sharing, and networking, and I will respect this ethos by not using this platform for self-promotion or hiring pitches during my presentation, unless explicitly invited to do so by means of a sponsorship or similar arrangement
- [x] If the talk is recorded by the PyDelhi team, I grant permission to release the video on PyDelhi's YouTube channel under the CC-BY-4.0 license, or a different license of my choosing if I am specifying it in my proposal or with the materials I share

### Additional comments

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

From Prompt to Production: Building Real LLM Apps with RAG & Agents #408

Talk title

Short talk description

Long talk description

What format do you have in mind?

Talk outline / Agenda

Key takeaways

What domain would you say your talk falls under?

Duration (including Q&A)

Prerequisites and preparation

Resources and references

Link to slides/demos (if available)

Twitter/X handle (optional)

LinkedIn profile (optional)

Profile picture URL (optional)

Speaker bio

Availability

Accessibility & special requirements

Speaker checklist

Additional comments

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

From Prompt to Production: Building Real LLM Apps with RAG & Agents #408

Description

Talk title

Short talk description

Long talk description

What format do you have in mind?

Talk outline / Agenda

Key takeaways

What domain would you say your talk falls under?

Duration (including Q&A)

Prerequisites and preparation

Resources and references

Link to slides/demos (if available)

Twitter/X handle (optional)

LinkedIn profile (optional)

Profile picture URL (optional)

Speaker bio

Availability

Accessibility & special requirements

Speaker checklist

Additional comments

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions