Di world of open-source LLMs dey exciting and e dey always dey change. Dis lesson wan give una deep look inside open source models. If you dey find info on how proprietary models take compare to open source models, waka go di "Exploring and Comparing Different LLMs" lesson. Dis lesson go also talk about fine-tuning but if you want beta explanation, you fit find am for di "Fine-Tuning LLMs" lesson.
- Make you sabi open source Models well well
- Understand di benefits wey dey work with open source Models
- Explore di open models wey dey for Hugging Face and di Azure AI Studio
Open source software don play big role for di growth of technology for different areas. Di Open Source Initiative (OSI) don define 10 criteria for software wey make am open source. Di source code suppose dey openly shared under license wey OSI approve.
Even though di way dem dey develop LLMs get some tins wey resemble software development, di process no be exactly di same. Dis one don cause plenty talk for di community about wetin open source mean for LLMs. For model to follow di traditional open source definition, dis info suppose dey public:
- Datasets wey dem use train di model.
- Full model weights as part of di training.
- Di evaluation code.
- Di fine-tuning code.
- Full model weights and training metrics.
Right now, only small models dey wey fit dis criteria. Di OLMo model wey Allen Institute for Artificial Intelligence (AllenAI) create na one wey fit dis category.
For dis lesson, we go dey call di models "open models" as dem fit no too match di criteria well well as we dey write dis.
Highly Customizable - Because open models dey come with detailed training info, researchers and developers fit change di inside parts of di model. Dis one fit make dem create models wey dey very specialized and fine-tuned for one particular work or study area. Some examples na code generation, mathematical operations and biology.
Cost - Di cost per token to use and deploy dis models dey cheaper pass proprietary models. When you dey build Generative AI apps, you suppose check performance vs price when you dey use dis models for your own use case.
Flexibility - To work with open models go give you chance to dey flexible to use different models or join dem together. One example na di HuggingChat Assistants wey user fit choose di model wey dem wan use directly for di user interface:
LLama2, wey Meta develop, na open model wey dem optimize for chat-based apps. Dis one na because of di fine-tuning method wey dem use, wey get plenty dialogue and human feedback. With dis method, di model dey give results wey match human expectation well well, wey dey give better user experience.
Some fine-tuned versions of Llama na Japanese Llama, wey specialize for Japanese and Llama Pro, wey be beta version of di base model.
Mistral na open model wey focus well well on high performance and efficiency. E dey use Mixture-of-Experts approach wey join group of specialized expert models into one system, so depending on di input, certain models go dey selected to use. Dis one dey make computation more effective as models dey only handle di inputs wey dem sabi well.
Some fine-tuned versions of Mistral na BioMistral, wey focus on medical domain and OpenMath Mistral, wey dey do mathematical computation.
Falcon na LLM wey Technology Innovation Institute (TII) create. Di Falcon-40B train on 40 billion parameters and e don show say e fit perform better pass GPT-3 with less compute budget. Dis na because e dey use FlashAttention algorithm and multiquery attention wey help reduce memory wey e need when e dey do inference. With dis reduced inference time, Falcon-40B good for chat apps.
Some fine-tuned versions of Falcon na OpenAssistant, assistant wey dem build on open models and GPT4ALL, wey get higher performance pass di base model.
No be only one answer dey for how to choose open model. One good place to start na to use Azure AI Studio's filter by task feature. Dis one go help you sabi di kind tasks wey di model train for. Hugging Face still get LLM Leaderboard wey dey show di best performing models based on certain metrics.
If you wan compare LLMs across different types, Artificial Analysis na another beta resource:
If you dey work on one specific use case, to find fine-tuned versions wey focus on di same area fit work well. To try different open models to see how dem perform based on your and your users' expectations na another good way.
Di best part about open models be say you fit start to work with dem quick quick. Check out di Azure AI Foundry Model Catalog, wey get one special Hugging Face collection with di models wey we talk about here.
After you finish dis lesson, check out our Generative AI Learning collection to continue to improve your Generative AI knowledge!
Disclaimer: Dis document don translate wit AI translation service Co-op Translator. Even though we dey try make am correct, abeg sabi say automated translation fit get some mistakes or wrong tins. Di original document wey e dey for im own language na di correct one. If na serious matter, e better make human professional translate am. We no go responsible for any misunderstanding or wrong meaning wey fit come from dis translation.



