çæAIã¯ãæ°ããã³ã³ãã³ããçæããèœåãæã€ã·ã¹ãã ãäœæããããšã«çŠç¹ãåœãŠã人工ç¥èœã®é åçãªåéã§ãããã®ã³ã³ãã³ãã¯ãããã¹ããç»åãã鳿¥œãããã«ã¯ä»®æ³ç°å¢å šäœã«è³ããŸã§å€å²ã«ããããŸããçæAIã®æãè峿·±ãå¿çšã®äžã€ã¯ãèšèªã¢ãã«ã®é åã«ãããŸãã
å°åèšèªã¢ãã«ïŒSLMïŒã¯ã倧åèšèªã¢ãã«ïŒLLMïŒã®çž®å°çã衚ããLLMã®å€ãã®ã¢ãŒããã¯ãã£ååãšæè¡ã掻çšãã€ã€ãèšç®ãããããªã³ããå€§å¹ ã«åæžããŠããŸããSLMã¯ã人éã®ãããªããã¹ããçæããããã«èšèšãããèšèªã¢ãã«ã®ãµãã»ããã§ããGPT-4ã®ãããªå€§ããªã¢ãã«ãšã¯ç°ãªããSLMã¯ããã³ã³ãã¯ãã§å¹ççã§ãããèšç®è³æºãéãããŠããã¢ããªã±ãŒã·ã§ã³ã«æé©ã§ãããã®å°ããªãµã€ãºã«ãããããããããŸããŸãªã¿ã¹ã¯ãå®è¡ããããšãã§ããŸããéåžžãSLMã¯LLMãå§çž®ãŸãã¯èžçããããšã«ãã£ãŠæ§ç¯ãããå ã®ã¢ãã«ã®æ©èœãšèšèªèœåã®å€§éšåãä¿æããããšãç®æããŠããŸãããã®ã¢ãã«ãµã€ãºã®åæžã¯ãå šäœçãªè€éããæžå°ãããSLMãã¡ã¢ãªäœ¿çšéãšèšç®èŠä»¶ã®äž¡æ¹ã«ãããŠããå¹ççã«ããŸãããããã®æé©åã«ãããããããSLMã¯äŸç¶ãšããŠåºç¯ãªèªç¶èšèªåŠçïŒNLPïŒã¿ã¹ã¯ãå®è¡ããããšãã§ããŸãïŒ
- ããã¹ãçæïŒäžè²«æ§ããããæèã«é¢é£ããæã段èœãäœæããã
- ããã¹ãè£å®ïŒäžããããããã³ããã«åºã¥ããŠæãäºæž¬ãè£å®ããã
- 翻蚳ïŒããã¹ããããèšèªããå¥ã®èšèªã«å€æããã
- èŠçŽïŒé·ãããã¹ããçããããæ¶åããããèŠçŽã«åçž®ããã
ãã ããæ§èœãçè§£ã®æ·±ãã«ãããŠããã倧ããªã¢ãã«ãšæ¯ã¹ãŠããã€ãã®ãã¬ãŒããªãããããŸãã
SLMã¯èšå€§ãªéã®ããã¹ãããŒã¿ãçšããŠèšç·ŽãããŸããèšç·Žäžã«ãèšèªã®ãã¿ãŒã³ãšæ§é ãåŠã³ãææ³çã«æ£ãããæèã«é©ããããã¹ããçæããèœåã身ã«ã€ããŸããèšç·Žããã»ã¹ã«ã¯ä»¥äžãå«ãŸããŸãïŒ
- ããŒã¿åéïŒããŸããŸãªãœãŒã¹ãã倧éã®ããã¹ãããŒã¿ãåéããã
- ååŠçïŒããŒã¿ãã¯ãªãŒãã³ã°ããèšç·Žã«é©ããåœ¢ã«æŽçããã
- èšç·ŽïŒæ©æ¢°åŠç¿ã¢ã«ãŽãªãºã ãçšããŠã¢ãã«ã«ããã¹ããçè§£ãçæããæ¹æ³ãæããã
- 埮調æŽïŒç¹å®ã®ã¿ã¹ã¯ã§ã®æ§èœãåäžãããããã«ã¢ãã«ã調æŽããã
SLMã®éçºã¯ãã¢ãã€ã«ããã€ã¹ããšããžã³ã³ãã¥ãŒãã£ã³ã°ãã©ãããã©ãŒã ãªã©ãè³æºãå¶çŽãããç°å¢ã§å±éå¯èœãªã¢ãã«ã®å¢å ããããŒãºã«åèŽããŠããŸããå¹çã«çŠç¹ãåœãŠãããšã§ãSLMã¯æ§èœãšã¢ã¯ã»ã¹æ§ã®ãã©ã³ã¹ãåããããŸããŸãªãã¡ã€ã³ã§ã®åºç¯ãªå¿çšãå¯èœã«ããŸãã
ãã®ã¬ãã¹ã³ã§ã¯ãSLMã®ç¥èã玹ä»ããMicrosoft Phi-3ãšçµã¿åãããŠããã¹ãã³ã³ãã³ããããžã§ã³ãMoEã®ç°ãªãã·ããªãªãåŠã¶ããšãç®æããŸããã¬ãã¹ã³ã®çµãããŸã§ã«ã以äžã®è³ªåã«çããããããã«ãªãããšãæåŸ ããŠããŸãïŒ
- SLMãšã¯äœã
- SLMãšLLMã®éãã¯äœã
- Microsoft Phi-3/3.5ãã¡ããªãŒãšã¯äœã
- Microsoft Phi-3/3.5ãã¡ããªãŒãã©ã®ããã«æšè«ããã
æºåã¯ããã§ããïŒå§ããŸãããã
LLMãšSLMã¯ã©ã¡ãã確ççæ©æ¢°åŠç¿ã®åºæ¬ååã«åºã¥ããŠæ§ç¯ãããŠãããã¢ãŒããã¯ãã£èšèšãèšç·Žæ¹æ³è«ãããŒã¿çæããã»ã¹ãã¢ãã«è©äŸ¡æè¡ã«ãããŠé¡äŒŒã®ã¢ãããŒããæ¡çšããŠããŸãããããããããã®2ã€ã®ã¢ãã«ã«ã¯ããã€ãã®éèŠãªéãããããŸãã
SLMã¯ä»¥äžãå«ãåºç¯ãªå¿çšãæã£ãŠããŸãïŒ
- ãã£ãããããïŒã«ã¹ã¿ããŒãµããŒããæäŸãããŠãŒã¶ãŒãšäŒè©±åœ¢åŒã§äº€æµããã
- ã³ã³ãã³ãäœæïŒã¢ã€ãã¢ãçæããããèšäºå šäœãå·çãããããããšã§ã©ã€ã¿ãŒãæ¯æŽããã
- æè²ïŒåŠçã®äœæèª²é¡ãå©ããããæ°ããèšèªãåŠã¶ã®ãå©ãããããã
- ã¢ã¯ã»ã·ããªãã£ïŒé³å£°èªã¿äžãã·ã¹ãã ãªã©ãé害ã®ããå人ã®ããã®ããŒã«ãäœæããã
ãµã€ãº
LLMãšSLMã®äž»ãªéãã¯ã¢ãã«ã®èŠæš¡ã«ãããŸããChatGPTïŒGPT-4ïŒã®ãããªLLMã¯æšå®1.76å ã®ãã©ã¡ãŒã¿ãå«ãããšãã§ããŸãããMistral 7Bã®ãããªãªãŒãã³ãœãŒã¹SLMã¯çŽ70åã®ãã©ã¡ãŒã¿ã§èšèšãããŠããŸãããã®éãã¯äž»ã«ã¢ãã«ã¢ãŒããã¯ãã£ãšèšç·Žããã»ã¹ã®éãã«ãããã®ã§ããäŸãã°ãChatGPTã¯ãšã³ã³ãŒããŒãã³ãŒããŒãã¬ãŒã ã¯ãŒã¯å ã§èªå·±æ³šæã¡ã«ããºã ãæ¡çšããŠããŸãããMistral 7Bã¯ã¹ã©ã€ãã£ã³ã°ãŠã£ã³ããŠæ³šæã䜿çšããŠããããã³ãŒããŒã®ã¿ã®ã¢ãã«å ã§ããå¹ççãªèšç·Žãå¯èœã«ããŸãããã®ã¢ãŒããã¯ãã£ã®éãã¯ããããã®ã¢ãã«ã®è€éããšæ§èœã«é倧ãªåœ±é¿ãäžããŸãã
çè§£
SLMã¯éåžžãç¹å®ã®ãã¡ã€ã³å ã§ã®æ§èœãæé©åãããŠãããé«åºŠã«å°éåãããŠããŸãããè€æ°ã®ç¥èåéã«ãããåºç¯ãªæèçè§£ãæäŸããèœåãå¶éãããå¯èœæ§ããããŸããäžæ¹ãLLMã¯ããå æ¬çãªã¬ãã«ã§äººéã®ãããªç¥èœãã·ãã¥ã¬ãŒãããããšãç®æããŠããŸããèšå€§ã§å€æ§ãªããŒã¿ã»ããã§èšç·ŽãããLLMã¯ãããŸããŸãªãã¡ã€ã³ã§è¯å¥œãªæ§èœãçºæ®ããããã«èšèšãããŠãããããé«ãæ±çšæ§ãšé©å¿æ§ãæäŸããŸãããããã£ãŠãLLMã¯èªç¶èšèªåŠçãããã°ã©ãã³ã°ãªã©ãããåºç¯ãªäžæµã¿ã¹ã¯ã«é©ããŠããŸãã
èšç®
LLMã®èšç·Žãšå±éã¯è³æºéçŽçãªããã»ã¹ã§ãããå€§èŠæš¡ãªGPUã¯ã©ã¹ã¿ãŒãå«ãé倧ãªèšç®ã€ã³ãã©ã¹ãã©ã¯ãã£ãå¿ èŠãšããããšããããããŸããäŸãã°ãChatGPTã®ãããªã¢ãã«ããŒãããèšç·Žããã«ã¯ãäœåãã®GPUãé·æéã«ããã£ãŠå¿ èŠã«ãªããããããŸãããäžæ¹ãSLMã¯ãã®ãã©ã¡ãŒã¿æ°ãå°ãªããããèšç®è³æºã®èгç¹ã§ããã¢ã¯ã»ã¹ããããã§ããMistral 7Bã®ãããªã¢ãã«ã¯ãäžçšåºŠã®GPUèœåãåããããŒã«ã«ãã·ã³ã§èšç·Žããã³å®è¡ããããšãã§ããŸãããèšç·Žã«ã¯äŸç¶ãšããŠè€æ°ã®GPUã䜿çšããŠæ°æéããããŸãã
ãã€ã¢ã¹
ãã€ã¢ã¹ã¯LLMã«ãããæ¢ç¥ã®åé¡ã§ãããäž»ã«èšç·ŽããŒã¿ã®æ§è³ªã«ãããã®ã§ãããããã®ã¢ãã«ã¯ãã°ãã°ã€ã³ã¿ãŒãããããã®çã®ãªãŒãã³ããŒã¿ã«äŸåããŠãããç¹å®ã®ã°ã«ãŒããéå°è©äŸ¡ãŸãã¯èª€ã£ã衚çŸããããã誀ã£ãã©ãã«ä»ããå°å ¥ããããæ¹èšãå°ççå€åãããã³ææ³èŠåã«ãã£ãŠåœ±é¿ãåããèšèªãã€ã¢ã¹ãåæ ãããããå¯èœæ§ããããŸããããã«ãLLMã¢ãŒããã¯ãã£ã®è€éãã¯ãæ³šææ·±ã埮調æŽããªããã°æ°ä»ãããã«ãã€ã¢ã¹ãæªåãããå¯èœæ§ããããŸããäžæ¹ãSLMã¯ããå¶çŽããããã¡ã€ã³ç¹åã®ããŒã¿ã»ããã§èšç·ŽãããŠããããããã®ãããªãã€ã¢ã¹ã«å¯ŸããŠæ¬è³ªçã«åœ±é¿ãåãã«ããã§ãããå ç«ã§ã¯ãããŸããã
æšè«
SLMã®ãµã€ãºãå°ãããããæšè«é床ã«ãããŠå€§ããªå©ç¹ãæäŸããåºç¯ãªäžŠååŠçãå¿ èŠãšããã«ããŒã«ã«ããŒããŠã§ã¢äžã§å¹ççã«åºåãçæããããšãã§ããŸããå¯Ÿç §çã«ãLLMã¯ãã®ãµã€ãºãšè€éãã®ããã«ã蚱容å¯èœãªæšè«æéãéæããããã«å€å€§ãªäžŠåèšç®è³æºãå¿ èŠãšããããšããããããŸããè€æ°ã®åæãŠãŒã¶ãŒã®ååšã¯ãç¹ã«ã¹ã±ãŒã«ã§å±éãããå ŽåãLLMã®å¿çæéãããã«é ãããŸãã
ãŸãšãããšãLLMãšSLMã¯ã©ã¡ããæ©æ¢°åŠç¿ã®åºç€ã«åºã¥ããŠããŸãããã¢ãã«ãµã€ãºãè³æºèŠä»¶ãæèçè§£ããã€ã¢ã¹ã®åœ±é¿ãæšè«é床ã®èгç¹ã§å€§ããç°ãªããŸãããããã®éãã¯ãç°ãªã䜿çšäŸã«å¯Ÿããããããã®é©æ§ãåæ ããŠãããLLMã¯ããæ±çšæ§ãé«ããè³æºéçŽçã§ãããSLMã¯ãããã¡ã€ã³ç¹åã®å¹çãæäŸããèšç®èŠæ±ãå°ãªãã§ãã
泚æïŒãã®ç« ã§ã¯ãMicrosoft Phi-3 / 3.5ãäŸãšããŠSLMã玹ä»ããŸãã
Phi-3 / 3.5ãã¡ããªãŒã¯äž»ã«ããã¹ããããžã§ã³ãããã³ãšãŒãžã§ã³ãïŒMoEïŒã¢ããªã±ãŒã·ã§ã³ã·ããªãªã察象ãšããŠããŸãïŒ
äž»ã«ããã¹ãçæããã£ããè£å®ãããã³ã³ã³ãã³ãæ å ±æœåºãªã©ã«äœ¿çšãããŸãã
Phi-3-mini
3.8Bã®èšèªã¢ãã«ã¯Microsoft Azure AI StudioãHugging Faceãããã³Ollamaã§å©çšå¯èœã§ããPhi-3ã¢ãã«ã¯ãåçããã³ãã倧ããªãµã€ãºã®èšèªã¢ãã«ãäž»èŠãªãã³ãããŒã¯ã§å€§å¹ ã«äžåããŸãïŒä»¥äžã®ãã³ãããŒã¯æ°å€ãåç §ãæ°å€ãé«ãã»ã©è¯ãïŒãPhi-3-miniã¯ãã®ãµã€ãºã®2åã®ã¢ãã«ãäžåããPhi-3-smallãšPhi-3-mediumã¯GPT-3.5ãå«ããã倧ããªã¢ãã«ãäžåããŸãã
Phi-3-small & medium
ããã7Bã®ãã©ã¡ãŒã¿ã§ãPhi-3-smallã¯ããŸããŸãªèšèªãæšè«ãã³ãŒãã£ã³ã°ãããã³æ°åŠã®ãã³ãããŒã¯ã§GPT-3.5TãäžåããŸãã14Bã®ãã©ã¡ãŒã¿ãæã€Phi-3-mediumã¯ãã®åŸåãç¶ããGemini 1.0 ProãäžåããŸãã
Phi-3.5-mini
Phi-3-miniã®ã¢ããã°ã¬ãŒããšèããããšãã§ããŸãããã©ã¡ãŒã¿ã¯å€ãããŸããããè€æ°ã®èšèªããµããŒãããèœåãåäžããïŒ20以äžã®èšèªããµããŒãïŒã¢ã©ãã¢èªãäžåœèªããã§ã³èªããã³ããŒã¯èªããªã©ã³ãèªãè±èªããã£ã³ã©ã³ãèªããã©ã³ã¹èªããã€ãèªãããã©ã€èªããã³ã¬ãªãŒèªãã€ã¿ãªã¢èªãæ¥æ¬èªãéåœèªããã«ãŠã§ãŒèªãããŒã©ã³ãèªããã«ãã¬ã«èªããã·ã¢èªãã¹ãã€ã³èªãã¹ãŠã§ãŒãã³èªãã¿ã€èªããã«ã³èªããŠã¯ã©ã€ãèªïŒââãé·ãã³ã³ããã¹ãã®ãµããŒãã匷åããŸãã3.8Bã®ãã©ã¡ãŒã¿ãæã€Phi-3.5-miniã¯åããµã€ãºã®èšèªã¢ãã«ãäžåãããã®ãµã€ãºã®2åã®ã¢ãã«ãšåçã§ãã
Phi-3/3.5ã®Instructã¢ãã«ãPhiã®çè§£èœåãšèããããšãã§ããVisionã¯Phiã«äžçãçè§£ããç®ãäžããŸãã
Phi-3-Vision
ããã4.2Bã®ãã©ã¡ãŒã¿ãæã€Phi-3-visionã¯ãã®åŸåãç¶ããäžè¬çãªèŠèŠæšè«ã¿ã¹ã¯ãOCRã衚ããã³å³ã®çè§£ã¿ã¹ã¯ã§Claude-3 HaikuãGemini 1.0 Pro Vãªã©ã®ãã倧ããªã¢ãã«ãäžåããŸãã
Phi-3.5-Vision
Phi-3-Visionã®ã¢ããã°ã¬ãŒãã§ããããè€æ°ã®ç»åããµããŒãããããã«æ¹è¯ãããŠããŸããç»åã ãã§ãªãããããªãèŠãããšãã§ããããžã§ã³ã®æ¹åãšèããããšãã§ããŸããPhi-3.5-visionã¯OCRã衚ããã³ãã£ãŒãçè§£ã¿ã¹ã¯ã§Claude-3.5 SonnetãGemini 1.5 Flashãªã©ã®ãã倧ããªã¢ãã«ãäžåããäžè¬çãªèŠèŠç¥èæšè«ã¿ã¹ã¯ã§åçã§ããè€æ°ãã¬ãŒã å ¥åããµããŒãããããªãã¡è€æ°ã®å ¥åç»åã«å¯ŸããŠæšè«ãè¡ããŸãã
***Mixture of Experts(MoE)***ã¯ãã¢ãã«ãäºåèšç·Žããéã«èšç®éãå€§å¹ ã«åæžããããšãå¯èœã«ããå¯éã¢ãã«ãšåãèšç®äºç®ã§ã¢ãã«ãŸãã¯ããŒã¿ã»ãããµã€ãºãåçã«æ¡å€§ããããšãã§ããŸããç¹ã«ãMoEã¢ãã«ã¯äºåèšç·Žäžã«å¯éã¢ãã«ãšåãå質ãã¯ããã«æ©ãéæããã¹ãã§ããPhi-3.5-MoEã¯16x3.8Bã®ãšãã¹ããŒãã¢ãžã¥ãŒã«ã§æ§æãããŠããŸããPhi-3.5-MoEã¯ããã6.6Bã®ã¢ã¯ãã£ããã©ã¡ãŒã¿ã§ããã倧ããªã¢ãã«ãšåæ§ã®æšè«ãèšèªçè§£ãããã³æ°åŠãéæããŸãã
Phi-3/3.5ãã¡ããªãŒã¢ãã«ãç°ãªãã·ããªãªã«åºã¥ããŠäœ¿çšããããšãã§ããŸããLLMãšã¯ç°ãªããPhi-3/3.5-miniãŸãã¯Phi-3/3.5-Visionããšããžããã€ã¹ã«å±éããããšãã§ããŸãã
ç°ãªãã·ããªãªã§Phi-3/3.5ã䜿çšããããšãæãã§ããŸããæ¬¡ã«ãç°ãªãã·ããªãªã«åºã¥ããŠPhi-3/3.5ã䜿çšããŸãã
ã¯ã©ãŠãã®API
GitHubã¢ãã«
GitHub
Modelsã¯æãçŽæ¥çãªæ¹æ³ã§ããGitHub ModelsãéããŠPhi-3/3.5-Instructã¢ãã«ã«è¿
éã«ã¢ã¯ã»ã¹ã§ããŸããAzure AI Inference SDK / OpenAI SDKãšçµã¿åãããããšã§ãã³ãŒããéããŠAPIã«ã¢ã¯ã»ã¹ããPhi-3/3.5-InstructåŒã³åºããå®äºã§ããŸãããŸããPlaygroundãéããŠç°ãªã广ããã¹ãããããšãã§ããŸãã- ãã¢: äžåœã®ã·ããªãªã«ãããPhi-3-miniãšPhi-3.5-miniã®å¹æã®æ¯èŒ
Azure AI Studio ãããã¯ãããžã§ã³ãMoEã¢ãã«ã䜿çšãããå Žåã¯ãAzure AI Studioã䜿çšããŠåŒã³åºããå®äºã§ããŸããèå³ãããå Žåã¯ãPhi-3 Cookbookãèªãã§ãAzure AI StudioãéããŠPhi-3/3.5 InstructãVisionãMoEãåŒã³åºãæ¹æ³ãåŠã¶ããšãã§ããŸã ãã®ãªã³ã¯ãã¯ãªã㯠NVIDIA NIM AzureãGitHubãæäŸããã¯ã©ãŠãããŒã¹ã®Model Catalogãœãªã¥ãŒã·ã§ã³ã«å ããŠãNivida NIMã䜿çšããŠé¢é£ããåŒã³åºããå®äºããããšãã§ããŸããNIVIDA NIMã蚪åããŠPhi-3/3.5 Familyã®APIåŒã³åºããå®äºã§ããŸããNVIDIA NIMïŒNVIDIA Inference MicroservicesïŒã¯ãéçºè
ãAIã¢ãã«ãå¹ççã«å±éã§ããããã«èšèšãããé«éåãããæšè«ãã€ã¯ããµãŒãã¹ã®ã»ããã§ãã¯ã©ãŠããããŒã¿ã»ã³ã¿ãŒãã¯ãŒã¯ã¹ããŒã·ã§ã³ãªã©ããŸããŸãªç°å¢ã§å©çšã§ããŸããNVIDIA NIMã®äž»ãªç¹åŸŽã¯æ¬¡ã®ãšããã§ã: - å±éã®å®¹æã: NIMã¯AIã¢ãã«ã®å±éãåäžã®ã³ãã³ãã§å¯èœã«ããæ¢åã®ã¯ãŒã¯ãããŒã«ç°¡åã«çµ±åã§ããŸãã - æé©åãããããã©ãŒãã³ã¹: TensorRTãTensorRT-LLMãªã©ã®NVIDIAã®äºåæé©åãããæšè«ãšã³ãžã³ã掻çšããäœã¬ã€ãã³ã·ãŒãšé«ã¹ã«ãŒããããå®çŸããŸãã - ã¹ã±ãŒã©ããªãã£: NIMã¯Kubernetesã§ã®èªåã¹ã±ãŒãªã³ã°ããµããŒãããå€åããã¯ãŒã¯ããŒãã«å¹æçã«å¯Ÿå¿ã§ããŸãã - ã»ãã¥ãªãã£ãšã³ã³ãããŒã«: çµç¹ã¯èªç€Ÿã®ç®¡çã€ã³ãã©ã¹ãã©ã¯ãã£ã§NIMãã€ã¯ããµãŒãã¹ãèªå·±ãã¹ãã£ã³ã°ããããšã§ãããŒã¿ãšã¢ããªã±ãŒã·ã§ã³ã®ç®¡çãç¶æã§ããŸãã - æšæºAPI: NIMã¯æ¥çæšæºã®APIãæäŸãããã£ããããããAIã¢ã·ã¹ã¿ã³ããªã©ã®AIã¢ããªã±ãŒã·ã§ã³ãç°¡åã«æ§ç¯ããã³çµ±åã§ããŸããNIMã¯NVIDIA AI Enterpriseã®äžéšã§ãããAIã¢ãã«ã®å±éãšéçšãç°¡çŽ åããNVIDIA GPUäžã§å¹ççã«åäœãããããšãç®çãšããŠããŸãã- ãã¢: Nividia NIMã䜿çšããŠPhi-3.5-Vision-APIãåŒã³åºã [ãã®ãªã³ã¯ãã¯ãªãã¯] ### ããŒã«ã«ç°å¢ã§ã®Phi-3/3.5ã®æšè« Phi-3ãGPT-3ã®ãããªèšèªã¢ãã«ã«é¢é£ããæšè«ã¯ãåãåã£ãå
¥åã«åºã¥ããŠå¿çãäºæž¬ãçæããããã»ã¹ãæããŸããPhi-3ã«ããã³ããã質åãæäŸãããšããããèšç·Žããããã¥ãŒã©ã«ãããã¯ãŒã¯ã䜿çšããŠãèšç·ŽãããããŒã¿ã®ãã¿ãŒã³ãé¢ä¿ãåæããããšã§æãå¯èœæ§ãé«ãé¢é£æ§ã®ããå¿çãæšæž¬ããŸãã Hugging Face Transformer Hugging Face Transformersã¯ãèªç¶èšèªåŠçïŒNLPïŒããã®ä»ã®æ©æ¢°åŠç¿ã¿ã¹ã¯ã®ããã«èšèšããã匷åãªã©ã€ãã©ãªã§ãããã®äž»ãªãã€ã³ãã¯ä»¥äžã®éãã§ã: 1. äºåèšç·Žæžã¿ã¢ãã«: ããã¹ãåé¡ãååä»ããšã³ãã£ãã£èªèã質åå¿çãèŠçŽã翻蚳ãããã¹ãçæãªã©ã®ããŸããŸãªã¿ã¹ã¯ã«äœ¿çšã§ããæ°åã®äºåèšç·Žæžã¿ã¢ãã«ãæäŸããŸãã 2. ãã¬ãŒã ã¯ãŒã¯ã®äºææ§: PyTorchãTensorFlowãJAXãªã©ã®è€æ°ã®æ·±å±€åŠç¿ãã¬ãŒã ã¯ãŒã¯ããµããŒãããŠããã1ã€ã®ãã¬ãŒã ã¯ãŒã¯ã§ã¢ãã«ãèšç·Žããå¥ã®ãã¬ãŒã ã¯ãŒã¯ã§äœ¿çšã§ããŸãã 3. ãã«ãã¢ãŒãã«æ©èœ: NLPã®ä»ã«ããHugging Face Transformersã¯ã³ã³ãã¥ãŒã¿ãŒããžã§ã³ïŒäŸ: ç»ååé¡ãç©äœæ€åºïŒãé³å£°åŠçïŒäŸ: é³å£°èªèãé³å£°åé¡ïŒã®ã¿ã¹ã¯ããµããŒãããŠããŸãã 4. 䜿ãããã: ã©ã€ãã©ãªã¯ã¢ãã«ã®ããŠã³ããŒããšåŸ®èª¿æŽãç°¡åã«è¡ãããã®APIãšããŒã«ãæäŸããåå¿è
ããå°éå®¶ãŸã§ã¢ã¯ã»ã¹ããããããŠããŸãã 5. ã³ãã¥ããã£ãšãªãœãŒã¹: Hugging Faceã¯æŽ»æ°ããã³ãã¥ããã£ãæã¡ããŠãŒã¶ãŒãã©ã€ãã©ãªãå§ããã®ã«åœ¹ç«ã€è±å¯ãªããã¥ã¡ã³ãããã¥ãŒããªã¢ã«ãã¬ã€ããæäŸããŠããŸããå
¬åŒããã¥ã¡ã³ããŸãã¯ãã®GitHubãªããžããªãããã¯æãäžè¬çã«äœ¿çšãããæ¹æ³ã§ãããGPUã¢ã¯ã»ã©ã¬ãŒã·ã§ã³ãå¿
èŠã§ããçµå±ãVisionãMoEã®ãããªã·ãŒã³ã¯å€ãã®èšç®ãå¿
èŠãšããéååãããŠããªãå ŽåãCPUã§ã¯éåžžã«å¶éãããŸãã- ãã¢: Transformerã䜿çšããŠPhi-3.5-InstuctãåŒã³åºã ãã®ãªã³ã¯ãã¯ãªã㯠- ãã¢: Transformerã䜿çšããŠPhi-3.5-VisionãåŒã³åºããã®ãªã³ã¯ãã¯ãªã㯠- ãã¢: Transformerã䜿çšããŠPhi-3.5-MoEãåŒã³åºããã®ãªã³ã¯ãã¯ãªã㯠Ollama Ollamaã¯ãå€§èŠæš¡èšèªã¢ãã«ïŒLLMïŒãããŒã«ã«ã§å®è¡ããã®ãç°¡åã«ããããã«èšèšããããã©ãããã©ãŒã ã§ããLlama 3.1ãPhi 3ãMistralãGemma 2ãªã©ãããŸããŸãªã¢ãã«ããµããŒãããŠããŸãããã®ãã©ãããã©ãŒã ã¯ãã¢ãã«ã®éã¿ãæ§æãããŒã¿ãåäžã®ããã±ãŒãžã«ãŸãšããããšã§ããã»ã¹ãç°¡çŽ åãããŠãŒã¶ãŒãèªåã®ã¢ãã«ãã«ã¹ã¿ãã€ãºãäœæããã®ãããã¢ã¯ã»ã¹ããããããŸããOllamaã¯macOSãLinuxãWindowsã§å©çšå¯èœã§ããã¯ã©ãŠããµãŒãã¹ã«é Œããã«LLMãå®éšãŸãã¯å±éãããå Žåã«ã¯çŽ æŽãããããŒã«ã§ããOllamaã¯æãçŽæ¥çãªæ¹æ³ã§ãããæ¬¡ã®ã¹ããŒãã¡ã³ããå®è¡ããã ãã§ãã ```bash
ollama run phi3.5
**ONNX Runtime for GenAI** [ONNX Runtime](https://github.com/microsoft/onnxruntime-genai?WT.mc_id=academic-105485-koreyst)ã¯ãã¯ãã¹ãã©ãããã©ãŒã ã®æšè«ããã³ãã¬ãŒãã³ã°ã®æ©æ¢°åŠç¿ã¢ã¯ã»ã©ã¬ãŒã¿ã§ããONNX Runtime for Generative AI (GENAI)ã¯ãããŸããŸãªãã©ãããã©ãŒã ã§çæAIã¢ãã«ãå¹ççã«å®è¡ããã®ãå©ãã匷åãªããŒã«ã§ãã ## ONNX Runtimeãšã¯äœã§ããïŒ ONNX Runtimeã¯ãæ©æ¢°åŠç¿ã¢ãã«ã®é«æ§èœæšè«ãå¯èœã«ãããªãŒãã³ãœãŒã¹ãããžã§ã¯ãã§ããããã¯ãæ©æ¢°åŠç¿ã¢ãã«ã衚çŸããæšæºã§ããOpen Neural Network Exchange (ONNX)圢åŒã®ã¢ãã«ããµããŒãããŠããŸããONNX Runtimeæšè«ã¯ãPyTorchãTensorFlow/Kerasãªã©ã®æ·±å±€åŠç¿ãã¬ãŒã ã¯ãŒã¯ããã®ã¢ãã«ããŸãã¯scikit-learnãLightGBMãXGBoostãªã©ã®å€å
žçãªæ©æ¢°åŠç¿ã©ã€ãã©ãªããµããŒãããããéã顧客äœéšãšã³ã¹ãåæžãå¯èœã«ããŸããONNX Runtimeã¯ç°ãªãããŒããŠã§ã¢ããã©ã€ããŒããªãã¬ãŒãã£ã³ã°ã·ã¹ãã ã«å¯Ÿå¿ããŠãããã°ã©ãæé©åã倿ãšãšãã«ããŒããŠã§ã¢ã¢ã¯ã»ã©ã¬ãŒã¿ã掻çšããããšã§æé©ãªããã©ãŒãã³ã¹ãæäŸããŸãã ## Generative AIãšã¯äœã§ããïŒ Generative AIã¯ãèšç·ŽãããããŒã¿ã«åºã¥ããŠããã¹ããç»åã鳿¥œãªã©ã®æ°ããã³ã³ãã³ããçæã§ããAIã·ã¹ãã ãæããŸããäŸãšããŠã¯ãGPT-3ã®ãããªèšèªã¢ãã«ãStable Diffusionã®ãããªç»åçæã¢ãã«ããããŸããONNX Runtime for GenAIã©ã€ãã©ãªã¯ãONNXã¢ãã«ã®ããã®çæAIã«ãŒããæäŸããONNX Runtimeã«ããæšè«ãããžããåŠçãæ€çŽ¢ãšãµã³ããªã³ã°ãKVãã£ãã·ã¥ç®¡çãå«ã¿ãŸãã ## ONNX Runtime for GENAI ONNX Runtime for GENAIã¯ãçæAIã¢ãã«ããµããŒãããããã«ONNX Runtimeã®æ©èœãæ¡åŒµããŸãã以äžã¯ãã®äž»ãªç¹åŸŽã§ã: - **åºç¯ãªãã©ãããã©ãŒã ãµããŒã:** WindowsãLinuxãmacOSãAndroidãiOSãªã©ãããŸããŸãªãã©ãããã©ãŒã ã§åäœããŸãã - **ã¢ãã«ãµããŒã:** LLaMAãGPT-NeoãBLOOMãªã©ã®å€ãã®äººæ°ã®ããçæAIã¢ãã«ããµããŒãããŠããŸãã - **ããã©ãŒãã³ã¹æé©å:** NVIDIA GPUãAMD GPUãªã©ã®ç°ãªãããŒããŠã§ã¢ã¢ã¯ã»ã©ã¬ãŒã¿ã«å¯Ÿããæé©åãå«ãã§ããŸãã - **䜿ãããã:** ã¢ããªã±ãŒã·ã§ã³ãžã®ç°¡åãªçµ±åãå¯èœã«ããAPIãæäŸããæå°éã®ã³ãŒãã§ããã¹ããç»åããã®ä»ã®ã³ã³ãã³ããçæã§ããŸãã - ãŠãŒã¶ãŒã¯é«ã¬ãã«ã®generate()ã¡ãœãããåŒã³åºãããã¢ãã«ã®åã€ãã¬ãŒã·ã§ã³ãã«ãŒãã§å®è¡ãã1åã®ããŒã¯ã³ãçæãããªãã·ã§ã³ã§ã«ãŒãå
ã§çæãã©ã¡ãŒã¿ãæŽæ°ããããšãã§ããŸãã - ONNX Runtimeã¯ãŸããããŒã¯ã³ã·ãŒã±ã³ã¹ãçæããããã®è²ªæ¬²/ããŒã æ€çŽ¢ãšTopPãTopKãµã³ããªã³ã°ããµããŒãããŠãããå埩ããã«ãã£ã®ãããªçµã¿èŸŒã¿ã®ããžããåŠçãæäŸããŸããã«ã¹ã¿ã ã¹ã³ã¢ãªã³ã°ãç°¡åã«è¿œå ã§ããŸãã ## å§ããã«ã¯ ONNX Runtime for GENAIãå§ããã«ã¯ã以äžã®æé ãå®è¡ã§ããŸã: ### ONNX Runtimeã®ã€ã³ã¹ããŒã«:Python
pip install onnxruntime
### Generative AI Extensionsã®ã€ã³ã¹ããŒã«:Python
pip install onnxruntime-genai
### ã¢ãã«ã®å®è¡: ããã«Pythonã®ç°¡åãªäŸããããŸã:Python
import onnxruntime_genai as og
model = og.Model('path_to_your_model.onnx')
tokenizer = og.Tokenizer(model)
input_text = "Hello, how are you?"
input_tokens = tokenizer.encode(input_text)
output_tokens = model.generate(input_tokens)
output_text = tokenizer.decode(output_tokens)
print(output_text)
### ãã¢: ONNX Runtime GenAIã䜿çšããŠPhi-3.5-VisionãåŒã³åºãpython
import onnxruntime_genai as og
model_path = './Your Phi-3.5-vision-instruct ONNX Path'
img_path = './Your Image Path'
model = og.Model(model_path)
processor = model.create_multimodal_processor()
tokenizer_stream = processor.create_stream()
text = "Your Prompt"
prompt = "<|user|>\n"
prompt += "<|image_1|>\n"
prompt += f"{text}<|end|>\n"
prompt += "<|assistant|>\n"
image = og.Images.open(img_path)
inputs = processor(prompt, images=image)
params = og.GeneratorParams(model)
params.set_inputs(inputs)
params.set_search_options(max_length=3072)
generator = og.Generator(model, params)
while not generator.is_done():
generator.compute_logits()
generator.generate_next_token()
new_token = generator.get_next_tokens()[0]
code += tokenizer_stream.decode(new_token)
print(tokenizer_stream.decode(new_token), end='', flush=True)
**å
責äºé
**:
ãã®ææžã¯AI翻蚳ãµãŒãã¹[Co-op Translator](https://github.com/Azure/co-op-translator)ã䜿çšããŠç¿»èš³ãããŠããŸããæ£ç¢ºããæããŠãããŸãããèªå翻蚳ã«ã¯èª€ããäžæ£ç¢ºããå«ãŸããå¯èœæ§ãããããšããæ¿ç¥ãããã ãããå
ã®èšèªã«ããåæãæš©åšããæ
å ±æºãšèŠãªãããã¹ãã§ããéèŠãªæ
å ±ã«ã€ããŠã¯ãå°éã®äººéã«ãã翻蚳ããå§ãããŸãããã®ç¿»èš³ã®äœ¿çšã«ããçãã誀解ã誀蚳ã«ã€ããŠã¯ãäžåã®è²¬ä»»ãè² ããããŸãã

