📝 Background & Objective
We have a vast collection of podcast content on our website. Currently, our workflow involves a community manager manually copying transcripts into AI chatbots to generate show notes, timestamps, and summaries.
To improve efficiency and discoverability, we want to fully automate this pipeline. The goal is to optimize our podcast pages so they are easily ranked and understood by both search engines and modern AI chatbots/crawlers.
We need to update or create the following podcast elements to enhance our site. You can take as many parts to work on as you want.
🛠 Developer & Contribution Guidelines
We want to give you as much creative and technical freedom as possible to solve this!
- Tech Stack & LLM: You are free to use any programming language (Python, Node.js, etc.) and any LLM provider (OpenAI, Gemini, Anthropic, local models, etc.) of your choice.
- Execution Environment: You can build this however you see fit. It could be a local CLI tool, a standalone script, or a fully automated CI/CD pipeline (like GitHub Actions).
- Scope: Choose your own scope! You can submit a PR for just one single feature (e.g., only automating timestamps) or tackle multiple parts at once.
📂 Resources You Can Use
1. Title (& Subtitle)
We currently write podcast titles manually. We want to optimize them for SEO and clarity so they immediately signal the main themes to search engines. You can find them in the front matter of each podcast as a title: entry.
Example: title: "How to Navigate Career Transitions, Overcome Imposter Syndrome, and Build for _podcast/s07e04-career-coaching.md
Suggested Prompt:
Based on the podcast timestamps and guest bio, generate an SEO-optimized title:
- Should be under 100 characters.
- Focus primarily on the main themes and topics in the timestamps, not the guest’s current role.
- Mention the guest’s name and company/role at the time of recording (if available).
- Make it clear, keyword-rich, and engaging.
2. Meta-description
Currently, we do not have podcast meta-descriptions for most episodes. We want to automate writing and optimizing them to improve search engine snippet results. Find or add these in the front matter as a description: entry.
Suggested Prompt:
Based on the podcast transcript, make an SEO-optimized meta description:
- Length: 140-155 characters.
- Formula Structure: Problem/Benefit + What's Inside + Soft CTA.
- No formatting/markup - plain text only.
- Primary keyword early (first 60 characters for search bolding).
- Use action verbs: Learn, Discover, Master, Build, etc.
- Benefit-focused with specific details.
- Match content accurately.
- Emotional triggers: essential, proven, expert, comprehensive, etc.
- Compelling: Appeals to the target audience.
3. Timestamps
Currently, our community manager manually creates timestamps using AI chatbots. We want to automate this to improve SEO and clarity. You can find these in the front matter of each podcast as a header: entry inside the transcript: entry.
Note: Your script will need to remove existing timestamps and inject new ones in the correct locations inside the transcript based on the specified timing.
Suggested Prompt:
You are given a transcript of a podcast episode.
Your task is to create timestamps — a structured outline that highlights major topic shifts and tools, written in concise, nominative phrases.
Follow these rules:
- One timestamp per topic shift (roughly every 3–8 minutes).
- Use nominative phrasing — describe what’s covered, not what’s asked.
❌ Don’t write: “How to contribute to open source?”
✅ Do write: “Open Source Contribution: Starting Small & Building Confidence.”
- Separate multiple subtopics, not full sentences. Example: “Tool: Evaluating Models; Responsible AI & Feedback Loops.”
- Keep each line ≤ 12 words.
Format as:
00:00:00 Title of Topic
00:04:32 Tool: Evaluating Models Beyond Accuracy
Do not include speaker names or questions unless essential to the context. Focus on clarity, parallel grammar, and information hierarchy.
4. Short Intro Summary
Most podcasts lack a short intro summary. A good intro helps both listeners and search engines understand the core problem and solution presented in the episode. Find or add these in the front matter as an intro: entry.
Suggested Prompt:
Based on the podcast timestamps and guest bio, generate an SEO-optimized intro summary, one paragraph starting with “In this episode…”:
- Add one clear sentence stating the main challenge or question the episode explores.
- Follow with one sentence summarizing what listeners will gain or learn.
- Naturally include the primary keyword and 1–2 related keywords.
- Briefly describe what the guest discusses in the transcript: focus on key topics, insights, and takeaways, not filler conversation.
- Use the guest’s title and company as of the podcast recording (ignore later updates).
- Exclude any mention of recent career changes or details unrelated to the episode’s timeframe.
5. Guest Bio (Long and Short Versions)
Each guest has a dedicated page (e.g., _people/jeffkatz.md). We want to automate the creation of two distinct bios: a short one for the podcast show notes (bio_short: "") and a long one for their dedicated _people page.
⚠️ Important Note on LinkedIn Data: We know that scraping LinkedIn profiles is notoriously difficult due to anti-bot protections. If you choose to tackle this task, you are welcome to use any API or solution you find to gather this data.
Alternative Approach: Because of the complexity of fetching data, this might be better suited as a completely separate automation task down the line (e.g., a script dedicated entirely to creating guest pages, fetching profile images, structuring content, and generating bios). If you'd rather skip this step for now, that is completely fine!
Suggested Prompt:
I’ll provide you with detailed information about a podcast guest, including their experience, roles, companies, speaking engagements, and areas of expertise.
Your task is to write two versions of their bio optimized for SEO and Google E-E-A-T:
Short Bio (1 paragraph) – concise, keyword-rich, suitable for YouTube descriptions or podcast show notes.
Long Bio (2-3 paragraphs) – comprehensive and human-sounding, showcasing the guest’s experience, education, credibility, and authority in their field.
Each bio should:
- Include relevant SEO keywords.
- Highlight the guest’s experience, expertise, and achievements (E-E-A-T signals).
- Sound professional yet approachable.
- Don't use em dashes.
Format your response exactly as:
Short Bio (SEO-Optimized)
[1-paragraph version here]
Long Bio (SEO-Optimized)
[2-3 paragraph version here]
6. Key Takeaways / Episode Highlights
We currently do not have this content on any of the podcast pages. We aim to extract the specific highlights of the episode to show key topics, who said what, a small summary, and specific quotes.
Structure Requirements:
- Semantic Headings: Use H2/H3 like “How to Contribute to Open Source”.
- Short Intro: A short intro paragraph for each topic (2-3 sentences) using relevant keywords.
- Key Takeaways: Bulleted list of concise, concrete insights.
- Quotes: Optional, but highly encouraged to capture the essence of the speaker's point.
- Context: Include timestamps, guest names, and specific entities/tools (e.g., GitHub, TensorFlow) wherever possible.
Example Output Format:
## How to Contribute to Open Source
Open-source projects can feel intimidating for beginners, but Hugo Bowne-Anderson shared a practical framework for getting started. He emphasized starting small, focusing on learning from the community, and treating collaboration as a skill.
**Key Takeaways:**
* Begin with simple contributions like fixing documentation or improving examples.
* Choose projects that align with your personal interests — motivation drives consistency.
* Read the project’s *CONTRIBUTING.md* file before submitting a pull request to understand the workflow.
> “Contributing to open source isn’t about code; it’s about learning how to collaborate at scale.” — Hugo Bowne-Anderson
*(Timestamp: 24:18)*
You can find the example of the key takeaways section on this page: https://ai.hubermanlab.com/d/64c11778-b80a-11ef-bf69-a3d8ec277d29


📝 Background & Objective
We have a vast collection of podcast content on our website. Currently, our workflow involves a community manager manually copying transcripts into AI chatbots to generate show notes, timestamps, and summaries.
To improve efficiency and discoverability, we want to fully automate this pipeline. The goal is to optimize our podcast pages so they are easily ranked and understood by both search engines and modern AI chatbots/crawlers.
We need to update or create the following podcast elements to enhance our site. You can take as many parts to work on as you want.
🛠 Developer & Contribution Guidelines
We want to give you as much creative and technical freedom as possible to solve this!
📂 Resources You Can Use
_podcastfolder. They are formatted in YAML/JSON, with each line recording the time and who said what: https://github.com/DataTalksClub/datatalksclub.github.io/tree/main/_podcast_peoplefolder: https://github.com/DataTalksClub/datatalksclub.github.io/tree/main/_people1. Title (& Subtitle)
We currently write podcast titles manually. We want to optimize them for SEO and clarity so they immediately signal the main themes to search engines. You can find them in the front matter of each podcast as a
title:entry.Example:
title: "How to Navigate Career Transitions, Overcome Imposter Syndrome, and Buildfor_podcast/s07e04-career-coaching.mdSuggested Prompt:
2. Meta-description
Currently, we do not have podcast meta-descriptions for most episodes. We want to automate writing and optimizing them to improve search engine snippet results. Find or add these in the front matter as a
description:entry.Suggested Prompt:
3. Timestamps
Currently, our community manager manually creates timestamps using AI chatbots. We want to automate this to improve SEO and clarity. You can find these in the front matter of each podcast as a
header:entry inside thetranscript:entry.Note: Your script will need to remove existing timestamps and inject new ones in the correct locations inside the transcript based on the specified timing.
Suggested Prompt:
4. Short Intro Summary
Most podcasts lack a short intro summary. A good intro helps both listeners and search engines understand the core problem and solution presented in the episode. Find or add these in the front matter as an
intro:entry.Suggested Prompt:
5. Guest Bio (Long and Short Versions)
Each guest has a dedicated page (e.g.,
_people/jeffkatz.md). We want to automate the creation of two distinct bios: a short one for the podcast show notes (bio_short: "") and a long one for their dedicated_peoplepage.Alternative Approach: Because of the complexity of fetching data, this might be better suited as a completely separate automation task down the line (e.g., a script dedicated entirely to creating guest pages, fetching profile images, structuring content, and generating bios). If you'd rather skip this step for now, that is completely fine!
Suggested Prompt:
6. Key Takeaways / Episode Highlights
We currently do not have this content on any of the podcast pages. We aim to extract the specific highlights of the episode to show key topics, who said what, a small summary, and specific quotes.
Structure Requirements:
Example Output Format:
You can find the example of the key takeaways section on this page: https://ai.hubermanlab.com/d/64c11778-b80a-11ef-bf69-a3d8ec277d29