
Commit e4efd3c

Authored by caitlinwheeless
docs: Small template fixes (#6666)
Co-authored-by: caitlinwheeless <[email protected]>
1 parent 98dbd7d · commit e4efd3c

File tree

2 files changed, +2 -2 lines changed


docs/source/templates/generative-pairwise-human-preference.md

+1 -1
@@ -15,7 +15,7 @@ This project will help you to get up your LLM to the ChatGPT quality level throu
 
 Through ranking multiple responses based on quality, you can train a reward model that effectively captures human preferences. This reward model plays a crucial role in Reinforcement Learning, optimizing the performance of the fine-tuned foundational model.
 
-### Further Reading and Resources
+#### Further Reading and Resources
 
 - [Gathering Human Feedback Tutorial](https://github.com/heartexlabs/RLHF/blob/master/tutorials/RLHF_with_Custom_Datasets.ipynb) A Jupyter Notebook tutorial that will guide you through the step-by-step process of collecting comparison data, establishing human preferences, and incorporating this feedback into the reward model training.
 - [RLHF Resources](https://github.com/heartexlabs/RLHF): A collection of links, tutorials and best practices on how collect data and build an end-to-end Reinforcement Learning from Human Feedback (RLHF) system to fine-tune Generative AI models.
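
The context paragraph in the hunk above summarizes how ranked response pairs are used to train a reward model for RLHF. Purely as an illustrative aside (not part of this commit or of the Label Studio template), here is a minimal sketch of the pairwise preference objective such a reward model is commonly trained with, assuming PyTorch and hypothetical score tensors:

```python
import torch
import torch.nn.functional as F

def pairwise_preference_loss(chosen_scores: torch.Tensor,
                             rejected_scores: torch.Tensor) -> torch.Tensor:
    # Bradley-Terry style objective: minimizing it pushes the reward of the
    # human-preferred response above the reward of the rejected response.
    return -F.logsigmoid(chosen_scores - rejected_scores).mean()

# Hypothetical reward-model scores for three annotated comparison pairs.
chosen = torch.tensor([1.2, 0.7, 2.1])
rejected = torch.tensor([0.3, 0.9, 1.5])
print(pairwise_preference_loss(chosen, rejected))
```

In a real pipeline the scores would come from a reward head on top of the LLM, and this loss would be minimized over the annotated comparisons before the reinforcement learning stage; the tutorial linked in the diff covers that workflow end to end.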

docs/source/templates/index.ejs

+1 -1
@@ -51,6 +51,6 @@ cards:
     url: "/templates/gallery_generative_ai.html"
   - title: LLM Evaluations
     categories: generative ai, llm
-    image: "/images/templates/generative-pairwise-human-preference.png"
+    image: "/images/templates/response-moderation.png"
     url: "/templates/gallery_llm_evals.html"
 ---

0 commit comments
