|
8 | 8 | "\n", |
9 | 9 | "## Introduction\n", |
10 | 10 | "\n", |
11 | | - "**DeepSeek R1** has gained widespread attention for its advanced reasoning capabilities, excelling in language processing, scientific problem-solving, and coding. With 671B total parameters, 37B active parameters, and a 128K context length, it pushes the boundaries of AI-driven reasoning ([Explore DeepSeek R1 on Azure AI Foundry](https://ai.azure.com/explore/models/DeepSeek-R1/version/1/registry/azureml-deepseek)). Benchmarking and evaluation results highlight its performance against other models, showcasing its effectiveness in reasoning tasks ([Evaluation Results](https://github.com/deepseek-ai/DeepSeek-R1/tree/main?tab=readme-ov-file#4-evaluation-results)). Building on prior models, DeepSeek R1 integrates Chain-of-Thought (CoT) reasoning, reinforcement learning (RL), and fine-tuning on curated datasets to achieve state-of-the-art performance. This tutorial will walk you through how to deploy DeepSeek R1 from [Azure AI Foundry's model catalog](https://ai.azure.com/explore/models/) and integrate it with [Gradio](https://www.gradio.app/) to build a real-time streaming chatbot specifically for thinking LLMs like **DeepSeek R1**.\n", |
12 | | - "\n", |
| 11 | + "**DeepSeek R1** has gained widespread attention for its advanced reasoning capabilities, excelling in language processing, scientific problem-solving, and coding. With 671B total parameters, 37B active parameters, and a 128K context length, it pushes the boundaries of AI-driven reasoning ([Explore DeepSeek R1 on Azure AI Foundry](https://ai.azure.com/explore/models/DeepSeek-R1/version/1/registry/azureml-deepseek)). Benchmarking and evaluation results highlight its performance against other models, showcasing its effectiveness in reasoning tasks ([Evaluation Results](https://github.com/deepseek-ai/DeepSeek-R1/tree/main?tab=readme-ov-file#4-evaluation-results)). Building on prior models, DeepSeek R1 integrates Chain-of-Thought (CoT) reasoning, reinforcement learning (RL), and fine-tuning on curated datasets to achieve state-of-the-art performance. This tutorial will walk you through how to deploy DeepSeek R1 from [Azure AI Foundry's model catalog](https://ai.azure.com/explore/models/) and integrate it with [Gradio](https://www.gradio.app/) to build a real-time streaming chatbot specifically for thinking LLMs like **DeepSeek R1**." |
| 12 | + ] |
| 13 | + }, |
| 14 | + { |
| 15 | + "cell_type": "markdown", |
| 16 | + "metadata": {}, |
| 17 | + "source": [ |
13 | 18 | "### DeepSeek R1 on Azure AI Foundry\n", |
14 | 19 | "\n", |
15 | | - "On **January 29, 2025**, Microsoft announced that **DeepSeek R1** is now available on **Azure AI Foundry** and **GitHub**, making it part of a growing portfolio of over **1,800 AI models** available for enterprise use. With this integration, businesses can deploy DeepSeek R1 using **serverless APIs**, ensuring seamless scalability, security, and compliance with Microsoft’s responsible AI principles. ([Azure AI Foundry announcement](https://azure.microsoft.com/en-us/blog/deepseek-r1-on-azure-ai-foundry))\n", |
16 | | - "\n", |
| 20 | + "On **January 29, 2025**, Microsoft announced that **DeepSeek R1** is now available on **Azure AI Foundry** and **GitHub**, making it part of a growing portfolio of over **1,800 AI models** available for enterprise use. With this integration, businesses can deploy DeepSeek R1 using **serverless APIs**, ensuring seamless scalability, security, and compliance with Microsoft’s responsible AI principles. ([Azure AI Foundry announcement](https://azure.microsoft.com/en-us/blog/deepseek-r1-on-azure-ai-foundry))" |
| 21 | + ] |
| 22 | + }, |
| 23 | + { |
| 24 | + "cell_type": "markdown", |
| 25 | + "metadata": {}, |
| 26 | + "source": [ |
17 | 27 | "### Benefits of Using DeepSeek R1 on Azure AI Foundry\n", |
18 | 28 | "\n", |
19 | 29 | "- **Enterprise-Ready AI:** DeepSeek R1 is available as a trusted, scalable, and secure AI model, backed by Microsoft's infrastructure.\n", |
|
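Consuming the serverless deployment is a one-client affair: point the `azure-ai-inference` package's `ChatCompletionsClient` at your deployment's endpoint and key. Below is a minimal sketch of a one-shot (non-streaming) call, assuming a deployed DeepSeek R1 serverless endpoint and the same `AZURE_INFERENCE_ENDPOINT` / `AZURE_INFERENCE_CREDENTIAL` environment variables the notebook's code uses; the example prompt is illustrative.

```python
# Minimal sketch: one-shot (non-streaming) call to a DeepSeek R1 serverless
# deployment on Azure AI Foundry. Assumes `pip install azure-ai-inference`
# and the endpoint/key environment variables used elsewhere in this notebook.
import os

from azure.ai.inference import ChatCompletionsClient
from azure.ai.inference.models import SystemMessage, UserMessage
from azure.core.credentials import AzureKeyCredential

client = ChatCompletionsClient(
    endpoint=os.environ["AZURE_INFERENCE_ENDPOINT"],  # e.g. https://<deployment>.<region>.inference.ai.azure.com
    credential=AzureKeyCredential(os.environ["AZURE_INFERENCE_CREDENTIAL"]),
)

response = client.complete(
    messages=[
        SystemMessage(content="You are a helpful assistant."),
        UserMessage(content="Why is the sky blue?"),
    ],
    max_tokens=2048,
)

# DeepSeek R1 prefixes its answer with a <think>...</think> reasoning trace,
# which the chatbot later in this notebook renders as a collapsible section.
print(response.choices[0].message.content)
```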
203 | 213 | "- Implementing real-time streaming responses.\n", |
204 | 214 | "- Deploying the chatbot using Gradio.\n", |
205 | 215 | "\n", |
206 | | - "Get started today by visiting **[Azure AI Foundry](https://azure.microsoft.com/en-us/products/ai-services/ai-foundry)** and **[DeepSeek on GitHub](https://github.com/DeepSeekAI/DeepSeek-R1)**.\n", |
| 216 | + "Get started today by visiting **[DeepSeek R1 on Azure AI Foundry Model Catalog](https://ai.azure.com/explore/models/DeepSeek-R1/version/1/registry/azureml-deepseek)** and **[DeepSeek on GitHub Models](https://github.com/marketplace/models/azureml-deepseek/DeepSeek-R1)**.\n", |
207 | 217 | "\n", |
208 | 218 | "Happy coding! 🚀" |
209 | 219 | ] |
210 | | - }, |
211 | | - { |
212 | | - "cell_type": "code", |
213 | | - "execution_count": null, |
214 | | - "metadata": {}, |
215 | | - "outputs": [ |
216 | | - { |
217 | | - "name": "stdout", |
218 | | - "output_type": "stream", |
219 | | - "text": [ |
220 | | - "* Running on local URL: http://127.0.0.1:7910\n", |
221 | | - "\n", |
222 | | - "To create a public link, set `share=True` in `launch()`.\n" |
223 | | - ] |
224 | | - }, |
225 | | - { |
226 | | - "data": { |
227 | | - "text/html": [ |
228 | | - "<div><iframe src=\"http://127.0.0.1:7910/\" width=\"100%\" height=\"500\" allow=\"autoplay; camera; microphone; clipboard-read; clipboard-write;\" frameborder=\"0\" allowfullscreen></iframe></div>" |
229 | | - ], |
230 | | - "text/plain": [ |
231 | | - "<IPython.core.display.HTML object>" |
232 | | - ] |
233 | | - }, |
234 | | - "metadata": {}, |
235 | | - "output_type": "display_data" |
236 | | - }, |
237 | | - { |
238 | | - "name": "stdout", |
239 | | - "output_type": "stream", |
240 | | - "text": [ |
241 | | - "Gradio ChatMessage: {'role': 'user', 'metadata': {}, 'content': 'hey there', 'options': []}\n", |
242 | | - "Converted to Azure Message: {'role': 'user', 'content': 'hey there'}\n", |
243 | | - "Final Azure Messages: [{'role': 'system', 'content': 'You are a helpful assistant.'}, {'role': 'user', 'content': 'hey there'}]\n", |
244 | | - "Using parameters - Temperature: 0.7, Top P: 1, Max Tokens: 2048\n", |
245 | | - "Entering thought processing mode.\n" |
246 | | - ] |
247 | | - } |
248 | | - ], |
249 | | - "source": [ |
250 | | - "import os\n", |
251 | | - "import gradio as gr\n", |
252 | | - "from azure.ai.inference import ChatCompletionsClient\n", |
253 | | - "from azure.ai.inference.models import SystemMessage, UserMessage, AssistantMessage\n", |
254 | | - "from azure.core.credentials import AzureKeyCredential\n", |
255 | | - "from gradio import ChatMessage\n", |
256 | | - "from typing import Iterator\n", |
257 | | - "\n", |
258 | | - "###############################################################################\n", |
259 | | - "# 1) Create the ChatCompletionsClient\n", |
260 | | - "###############################################################################\n", |
261 | | - "client = ChatCompletionsClient(\n", |
262 | | - " endpoint=os.environ[\"AZURE_INFERENCE_ENDPOINT\"], # e.g. \"https://my-r1-endpoint.eastus2.inference.ai.azure.com\"\n", |
263 | | - " credential=AzureKeyCredential(os.environ[\"AZURE_INFERENCE_CREDENTIAL\"])\n", |
264 | | - " # If you're authenticating with Microsoft Entra ID, use DefaultAzureCredential() \n", |
265 | | - " # or other supported credentials instead of AzureKeyCredential.\n", |
266 | | - ")\n", |
267 | | - "\n", |
268 | | - "###############################################################################\n", |
269 | | - "# 2) Stream response function for Gradio\n", |
270 | | - "###############################################################################\n", |
271 | | - "def stream_response(user_message: str, messages: list, temperature: float, top_p: float, max_tokens: int) -> Iterator[list]:\n", |
272 | | - " if not messages:\n", |
273 | | - " messages = []\n", |
274 | | - " \n", |
275 | | - " # Convert Gradio chat history into Azure AI Inference messages\n", |
276 | | - " azure_messages = [SystemMessage(content=\"You are a helpful assistant.\")]\n", |
277 | | - " for msg in messages:\n", |
278 | | - " print(f\"Gradio ChatMessage: {msg}\") # Debug print\n", |
279 | | - " if isinstance(msg, ChatMessage):\n", |
280 | | - " azure_msg = UserMessage(content=msg.content) if msg.role == \"user\" else AssistantMessage(content=msg.content)\n", |
281 | | - " elif isinstance(msg, dict) and \"role\" in msg and \"content\" in msg:\n", |
282 | | - " azure_msg = UserMessage(content=msg[\"content\"]) if msg[\"role\"] == \"user\" else AssistantMessage(content=msg[\"content\"])\n", |
283 | | - " else:\n", |
284 | | - " continue\n", |
285 | | - " print(f\"Converted to Azure Message: {azure_msg}\") # Debug print\n", |
286 | | - " azure_messages.append(azure_msg)\n", |
287 | | - " \n", |
288 | | - " # Ensure only serializable objects are sent to Azure\n", |
289 | | - " azure_messages = [msg.dict() if hasattr(msg, \"dict\") else msg for msg in azure_messages]\n", |
290 | | - " \n", |
291 | | - " print(f\"Final Azure Messages: {azure_messages}\") # Debug print\n", |
292 | | - " print(f\"Using parameters - Temperature: {temperature}, Top P: {top_p}, Max Tokens: {max_tokens}\")\n", |
293 | | - " \n", |
294 | | - " response = client.complete(messages=azure_messages, stream=True, temperature=temperature, top_p=top_p, max_tokens=max_tokens)\n", |
295 | | - " \n", |
296 | | - " # Initialize buffers\n", |
297 | | - " thought_buffer = \"\"\n", |
298 | | - " response_buffer = \"\"\n", |
299 | | - " inside_thought = False\n", |
300 | | - " \n", |
301 | | - " for update in response:\n", |
302 | | - " if update.choices:\n", |
303 | | - " current_chunk = update.choices[0].delta.content\n", |
304 | | - " \n", |
305 | | - " if \"<think>\" in current_chunk:\n", |
306 | | - " inside_thought = True\n", |
307 | | - " print(\"Entering thought processing mode.\")\n", |
308 | | - " messages.append(ChatMessage(role=\"assistant\", content=\"\", metadata={\"title\": \"🧠 R1 Thinking...\", \"status\": \"pending\"}))\n", |
309 | | - " yield messages\n", |
310 | | - " continue\n", |
311 | | - " elif \"</think>\" in current_chunk:\n", |
312 | | - " inside_thought = False\n", |
313 | | - " messages[-1] = ChatMessage(\n", |
314 | | - " role=\"assistant\",\n", |
315 | | - " content=thought_buffer.strip(),\n", |
316 | | - " metadata={\"title\": \"🧠 R1 Thinking...\", \"status\": \"done\"}\n", |
317 | | - " )\n", |
318 | | - " yield messages # Yield the thought message immediately\n", |
319 | | - " thought_buffer = \"\"\n", |
320 | | - " continue\n", |
321 | | - " \n", |
322 | | - " if inside_thought:\n", |
323 | | - " thought_buffer += current_chunk\n", |
324 | | - " messages[-1] = ChatMessage(\n", |
325 | | - " role=\"assistant\",\n", |
326 | | - " content=thought_buffer,\n", |
327 | | - " metadata={\"title\": \"🧠 R1 Thinking...\", \"status\": \"pending\"}\n", |
328 | | - " )\n", |
329 | | - " yield messages # Yield the thought message as it updates\n", |
330 | | - " else:\n", |
331 | | - " response_buffer += current_chunk\n", |
332 | | - " if messages and isinstance(messages[-1], ChatMessage) and messages[-1].role == \"assistant\" and (not messages[-1].metadata or \"title\" not in messages[-1].metadata):\n", |
333 | | - " messages[-1] = ChatMessage(role=\"assistant\", content=response_buffer)\n", |
334 | | - " else:\n", |
335 | | - " messages.append(ChatMessage(role=\"assistant\", content=response_buffer))\n", |
336 | | - " yield messages\n", |
337 | | - "\n", |
338 | | - "###############################################################################\n", |
339 | | - "# 3) Gradio UI\n", |
340 | | - "###############################################################################\n", |
341 | | - "brand_theme = gr.themes.Default(\n", |
342 | | - " primary_hue=\"blue\",\n", |
343 | | - " secondary_hue=\"blue\",\n", |
344 | | - " neutral_hue=\"gray\",\n", |
345 | | - " font=[\"Segoe UI\", \"Arial\", \"sans-serif\"],\n", |
346 | | - " font_mono=[\"Courier New\", \"monospace\"]\n", |
347 | | - ").set(\n", |
348 | | - " button_primary_background_fill=\"#0f6cbd\",\n", |
349 | | - " button_primary_background_fill_hover=\"#115ea3\",\n", |
350 | | - " button_primary_background_fill_hover_dark=\"#4f52b2\",\n", |
351 | | - " button_primary_background_fill_dark=\"#5b5fc7\",\n", |
352 | | - " button_primary_text_color=\"#ffffff\",\n", |
353 | | - " body_background_fill=\"#f5f5f5\",\n", |
354 | | - " block_background_fill=\"#ffffff\",\n", |
355 | | - " body_text_color=\"#242424\",\n", |
356 | | - " body_text_color_subdued=\"#616161\",\n", |
357 | | - " block_border_color=\"#d1d1d1\",\n", |
358 | | - " input_background_fill=\"#ffffff\",\n", |
359 | | - " input_border_color=\"#d1d1d1\",\n", |
360 | | - " input_border_color_focus=\"#0f6cbd\",\n", |
361 | | - ")\n", |
362 | | - "\n", |
363 | | - "with gr.Blocks(title=\"DeepSeek R1 with Azure AI Foundry\", theme=brand_theme, css=\"footer {visibility: hidden}\", fill_height=True, fill_width=True) as demo:\n", |
364 | | - " title = gr.Markdown(\"## DeepSeek R1 with Azure AI Foundry 🤭\")\n", |
365 | | - " chatbot = gr.Chatbot(\n", |
366 | | - " type=\"messages\",\n", |
367 | | - " label=\"DeepSeek-R1\",\n", |
368 | | - " render_markdown=True,\n", |
369 | | - " show_label=False,\n", |
370 | | - " scale=1,\n", |
371 | | - " )\n", |
372 | | - " \n", |
373 | | - " input_box = gr.Textbox(\n", |
374 | | - " lines=1,\n", |
375 | | - " submit_btn=True,\n", |
376 | | - " show_label=False,\n", |
377 | | - " )\n", |
378 | | - " \n", |
379 | | - " with gr.Accordion(\"Model Parameters\", open=False):\n", |
380 | | - " temperature = gr.Slider(0.0, 2.0, value=0.7, step=0.1, label=\"Temperature\", info=\"Controls randomness\", interactive=True)\n", |
381 | | - " top_p = gr.Slider(0.0, 1.0, value=1.0, step=0.1, label=\"Top P\", info=\"Nucleus sampling\", interactive=True)\n", |
382 | | - " max_tokens = gr.Slider(0, 4096, value=2048, step=128, label=\"Max Tokens\", info=\"Limits response length\", interactive=True)\n", |
383 | | - " reset_button = gr.Button(\"Reset Defaults\")\n", |
384 | | - " \n", |
385 | | - " reset_button.click(lambda: (0.7, 1.0, 2048), outputs=[temperature, top_p, max_tokens])\n", |
386 | | - " \n", |
387 | | - " msg_store = gr.State(\"\")\n", |
388 | | - " input_box.submit(lambda msg: (msg, msg, \"\"), inputs=[input_box], outputs=[msg_store, input_box, input_box], queue=False)\n", |
389 | | - " input_box.submit(lambda msg, chat: (ChatMessage(role=\"user\", content=msg), chat + [ChatMessage(role=\"user\", content=msg)]), inputs=[msg_store, chatbot], outputs=[msg_store, chatbot], queue=False).then(\n", |
390 | | - " stream_response, inputs=[msg_store, chatbot, temperature, top_p, max_tokens], outputs=chatbot\n", |
391 | | - " )\n", |
392 | | - " \n", |
393 | | - " demo.launch()\n" |
394 | | - ] |
395 | 220 | } |
396 | 221 | ], |
397 | 222 | "metadata": { |
|
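The removed cell above also shows the core streaming pattern for thinking LLMs: iterate over the chunked response and route tokens between a "thinking" buffer and an answer buffer using DeepSeek R1's `<think>`/`</think>` sentinels. Here is a condensed, Gradio-free sketch of that pattern; it reuses the `client` from the earlier sketch, and the `or ""` guard on `delta.content` is an added assumption, since trailing stream updates may carry no content.

```python
# Condensed sketch of the streaming pattern used by the notebook's chatbot:
# tokens between <think> and </think> go to a reasoning buffer, the rest to
# the answer buffer. Reuses `client` from the sketch above.
from azure.ai.inference.models import SystemMessage, UserMessage

stream = client.complete(
    messages=[
        SystemMessage(content="You are a helpful assistant."),
        UserMessage(content="hey there"),
    ],
    stream=True,
    temperature=0.7,
    top_p=1.0,
    max_tokens=2048,
)

thought, answer = "", ""
inside_thought = False
for update in stream:
    if not update.choices:
        continue
    chunk = update.choices[0].delta.content or ""  # guard: last updates may be empty
    if "<think>" in chunk:
        inside_thought = True
    elif "</think>" in chunk:
        inside_thought = False
    elif inside_thought:
        thought += chunk
    else:
        answer += chunk

print("R1 thinking:", thought.strip())
print("Final answer:", answer.strip())
```

In the full notebook UI, these same two buffers drive Gradio `ChatMessage` updates, with `metadata={"title": "🧠 R1 Thinking...", "status": ...}` marking the collapsible reasoning bubble while the answer streams beneath it.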