add _argparse_forward()

Tongjilibo · Tongjilibo · commit 75e77fd7674b · 2023-09-08T15:04:15.000+08:00
diff --git a/README.md b/README.md
@@ -85,7 +85,7 @@ pip install git+https://github.com/Tongjilibo/torch4keras.git
 - **v0.0.1**：20221019 初始版本
 
 ## 5. 更新：
-- **20230907**: 增加from_pretrained和save_pretrained方法，增加log_warn_once方法，compile()中可设置成员变量，默认move_to_model_device设置为True, 增加JsonConfig
+- **20230907**: 增加from_pretrained和save_pretrained方法，增加log_warn_once方法，compile()中可设置成员变量，默认move_to_model_device设置为True, 增加JsonConfig，增加_argparse_forward()方便下游继承改写Trainer
 - **20230901**: compile()可不传参，interval不一致报warning, 去除部分self.vars, 调整move_to_model_device逻辑，DDP每个epoch重新设置随机数，save_weights()和load_weights()可以按照`pretrained`格式
 - **20230821**: 代码结构调整，增加trainer.py文件，方便下游集成
 - **20230812**: 修复DeepSpeedTrainer，修复DDP
diff --git a/torch4keras/trainer.py b/torch4keras/trainer.py
@@ -148,18 +148,24 @@ def _log_first_step(self, resume_step, train_X):
             print(colorful('[Label]: ', color='green'), + train_X)
 
     def _forward(self, *inputs, **input_kwargs):
+        '''调用模型的forward，方便下游继承的时候可以自定义使用哪个模型的forward
+        '''
+        return self._argparse_forward(self.unwrap_model(), *inputs, **input_kwargs)
+
+    @staticmethod
+    def _argparse_forward(model, *inputs, **input_kwargs):
         '''调用模型的forward
         如果传入了网络结构module，则调用module的forward；如果是继承方式，则调用自身的forward
         '''
         if (len(inputs)==1) and isinstance(inputs[0], (tuple,list)):  # 防止([])嵌套
             inputs = inputs[0]
         
         if isinstance(inputs, torch.Tensor):  # tensor不展开
-            return self.unwrap_model().forward(inputs, **input_kwargs)
+            return model.forward(inputs, **input_kwargs)
         elif isinstance(inputs, (tuple, list)):
-            return self.unwrap_model().forward(*inputs, **input_kwargs)
+            return model.forward(*inputs, **input_kwargs)
         else:
-            return self.unwrap_model().forward(inputs, **input_kwargs)
+            return model.forward(inputs, **input_kwargs)
 
     def train_step(self, train_X, train_y):
         ''' Perform a training step on a batch of inputs. '''