Question
This is my first time working with a multimodal Mixture of Experts (MoE) model, and I would like to understand the full inference scheduling process by reading through the complete large-model code. In this project, which file or files contain the main inference scheduling or expert scheduling functions?