Improving code readability of torchvision.models.detection 

### 🚀 The feature, motivation and pitch

Dear @pmeier,

As a student who is learning about the Faster R-CNN family and trying to understand the implementation provided in the torchvision.model.detection module, I find the code difficult to read and comprehend. Specifically, some blocks of code could be simplified further, such as the code block shown below:

```python
#https://github.com/pytorch/vision/blob/2b25d67925df9741ba2a75a07bc3046302969e87/torchvision/models/detection/_utils.py#L162
box_sum = 0
for val in boxes_per_image:
    box_sum += val
```
Therefore, I would like to propose some improvements to enhance the code's readability and make it more accessible to beginners like myself. I believe these changes will improve the overall quality of the module. Here are some possible improvements that I could make:

* Add more comments and docstrings to explain the purpose and functionality of each function and module.
Simplify and reorganize complex code blocks.
* I would be happy to collaborate with other contributors to implement these changes. Please let me know if you have any feedback or suggestions.

Thank you for your consideration.

### Alternatives

_No response_

### Additional context

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improving code readability of torchvision.models.detection #7458

🚀 The feature, motivation and pitch

Alternatives

Additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Improving code readability of torchvision.models.detection #7458

Description

🚀 The feature, motivation and pitch

Alternatives

Additional context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions