🚀 The feature, motivation and pitch
Dear @pmeier,
As a student who is learning about the Faster R-CNN family and trying to understand the implementation provided in the torchvision.models.detection module, I find the code difficult to read and comprehend. Specifically, some blocks of code could be simplified further, such as the code block shown below:
# https://github.com/pytorch/vision/blob/2b25d67925df9741ba2a75a07bc3046302969e87/torchvision/models/detection/_utils.py#L162
box_sum = 0
for val in boxes_per_image:
    box_sum += val
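For instance, assuming boxes_per_image is a plain list of Python integers, the loop could probably be replaced with the built-in sum(). The values below are made up for illustration, and this is only a sketch: I have not verified whether the explicit loop is required for TorchScript or ONNX export.

```python
# Hypothetical per-image box counts, used only for illustration.
boxes_per_image = [3, 5, 2]

# Current form: explicit accumulation loop.
box_sum_loop = 0
for val in boxes_per_image:
    box_sum_loop += val

# Possible simplification: the built-in sum() computes the same total.
box_sum = sum(boxes_per_image)
```

Both forms give the same total for a list of integers, so the change would affect readability only, not behavior.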
Therefore, I would like to propose some improvements to enhance the code's readability and make it more accessible to beginners like myself. I believe these changes will improve the overall quality of the module. Here are some possible improvements that I could make:
- Add more comments and docstrings to explain the purpose and functionality of each function and module.
- Simplify and reorganize complex code blocks.
I would be happy to collaborate with other contributors to implement these changes. Please let me know if you have any feedback or suggestions.
Thank you for your consideration.
Alternatives
No response
Additional context
No response