The summarization service (and possibly other services) has its own, separate code to load the models. However, the model used by the chain in the notebook runs using the definitions in the utils.py file.
To keep maintenance simple and avoid duplicated code, we should have a single point for model loading. In my opinion, using only utils.py for this makes the most sense.
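To make the proposal concrete, here is a rough sketch of what a single loading entry point in utils.py could look like. All names here (`load_model`, `register_loader`, the `"summarization"` key) are hypothetical and not taken from the current code, and the loader body is a placeholder for whatever the summarization service does today.

```python
# utils.py (sketch): hypothetical single entry point for model loading.
from functools import lru_cache
from typing import Any, Callable, Dict

# Hypothetical registry mapping a model name to the function that builds it.
_LOADERS: Dict[str, Callable[[], Any]] = {}


def register_loader(name: str) -> Callable[[Callable[[], Any]], Callable[[], Any]]:
    """Register a loader function under a model name."""
    def decorator(fn: Callable[[], Any]) -> Callable[[], Any]:
        _LOADERS[name] = fn
        return fn
    return decorator


@lru_cache(maxsize=None)
def load_model(name: str) -> Any:
    """Load the named model once and reuse the same instance on later calls."""
    try:
        loader = _LOADERS[name]
    except KeyError:
        raise ValueError(f"Unknown model: {name!r}") from None
    return loader()


@register_loader("summarization")
def _load_summarization_model() -> Any:
    # Placeholder (assumption): the loading code that currently lives in the
    # summarization service would move here.
    return object()
```

With something like this, both the notebook and the services would call `load_model("summarization")` instead of keeping their own loading code, and each model would be loaded only once per process.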