
[CodeCamp #15] Add sigmoid focal loss cpu impl #2536


Open
wants to merge 4 commits into
base: master

Conversation


@XULU42 XULU42 commented Jan 9, 2023

Motivation

This PR adds a sigmoid focal loss CPU implementation based on the existing CUDA implementation.

Modification

  • Add sigmoid focal loss CPU implementation

  • Add focal loss unit test on CPU (sketched below)

  • Add ops to the EN/ZH documents
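
For illustration, a rough sketch of the kind of CPU unit test this adds (the shapes, the tolerance, the positional (pred, target, gamma, alpha, weight, reduction) call, and the 'sum' reduction are assumptions here, not the exact test in the PR):

```python
import torch
import torch.nn.functional as F
from mmcv.ops import sigmoid_focal_loss


def test_sigmoid_focal_loss_cpu():
    # Illustrative shapes; compares the new CPU path against the
    # element-wise focal loss formula written with plain torch ops.
    pred = torch.randn(4, 19)
    target = torch.randint(0, 19, (4,))

    # (pred, target, gamma, alpha, weight, reduction) passed positionally.
    loss = sigmoid_focal_loss(pred, target, 2.0, 0.25, None, 'sum')

    prob = pred.sigmoid()
    one_hot = F.one_hot(target, num_classes=19).to(pred.dtype)
    p_t = prob * one_hot + (1 - prob) * (1 - one_hot)
    alpha_t = 0.25 * one_hot + 0.75 * (1 - one_hot)
    expected = (-alpha_t * (1 - p_t) ** 2 * p_t.log()).sum()
    assert torch.allclose(loss, expected, atol=1e-4)
```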

Checklist

Before PR:

  • I have read and followed the workflow indicated in the CONTRIBUTING.md to create this PR.
  • Pre-commit or other linting tools indicated in CONTRIBUTING.md are used to fix potential lint issues.
  • Bug fixes are covered by unit tests; the case that causes the bug should be added to the unit tests.
  • New functionalities are covered by complete unit tests. If not, please add more unit tests to ensure correctness.
  • The documentation has been modified accordingly, including docstring or example tutorials.

After PR:

  • If the modification has potential influence on downstream or other related projects, this PR should be tested with some of those projects, like MMDet or MMCls.
  • CLA has been signed and all committers have signed the CLA in this PR.

* Add sigmoid focal loss cpu implementation

* Add focal loss unit test on cpu

* Add ops to EN/ZH documents
@XULU42 XULU42 changed the title from "CodeCamp #15" to "[CodeCamp #15] Add sigmoid focal loss cpu impl" on Jan 10, 2023
Member

@grimoire grimoire left a comment

Since this op can be implemented with native torch, please provide a benchmark comparing this op with the torch implementation.
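
For reference, one possible native torch formulation is sketched below (the function name and signature are illustrative, not an mmcv API; torchvision.ops.sigmoid_focal_loss is another ready-made baseline):

```python
import torch
import torch.nn.functional as F


def sigmoid_focal_loss_torch(pred, target, gamma=2.0, alpha=0.25):
    """Sigmoid focal loss written with plain torch ops.

    pred:   (N, C) raw logits
    target: (N,) integer class labels, as mmcv.ops.sigmoid_focal_loss expects
    """
    one_hot = F.one_hot(target, num_classes=pred.size(1)).to(pred.dtype)
    prob = pred.sigmoid()
    # Element-wise BCE term, modulated by (1 - p_t)^gamma and alpha_t.
    ce = F.binary_cross_entropy_with_logits(pred, one_hot, reduction='none')
    p_t = prob * one_hot + (1 - prob) * (1 - one_hot)
    alpha_t = alpha * one_hot + (1 - alpha) * (1 - one_hot)
    return alpha_t * (1 - p_t).pow(gamma) * ce  # (N, C) loss map
```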

@XULU42
Author

XULU42 commented Jan 11, 2023

Thanks for the reviewer's feedback. I have some questions about the benchmark; could you please help me figure them out?

  1. Is there an existing benchmark example for this op? I would guess the answer is YES, given the existing CUDA op.
  2. What is mmcv's goal in reimplementing this op? I would guess performance, given the benchmark requirement. If the benchmark shows the current implementation has no performance advantage over the torch implementation, would the merge be delayed until optimization is complete, or could it be merged before optimization? (HAHA, CodeCamp has a deadline ^_^)
  3. If there is no existing benchmark, here is my plan; could you please check it? I would test different batch sizes (2, 4, 8, ..., 8192) and different num_classes values (2, 4, 8, ..., 8192) through the Python API, run each case 10 times, and compute the average latency of both implementations (sketched below). Where should the benchmark code be put?

Thanks again^_^
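
Roughly, the timing loop I have in mind for point 3 is sketched below (the helper name and shapes are illustrative only, and the mmcv call assumes the positional (pred, target, gamma, alpha, weight, reduction) signature; the torch side would be an analogous call to a native torch implementation):

```python
import time

import torch
from mmcv.ops import sigmoid_focal_loss


def average_latency(batch, num_classes, repeats=10):
    # Illustrative helper: time the CPU op over `repeats` runs.
    pred = torch.randn(batch, num_classes)
    target = torch.randint(0, num_classes, (batch,))
    start = time.perf_counter()
    for _ in range(repeats):
        sigmoid_focal_loss(pred, target, 2.0, 0.25, None, 'mean')
    return (time.perf_counter() - start) / repeats


for batch in [2 ** i for i in range(1, 14)]:            # 2 .. 8192
    for num_classes in [2 ** i for i in range(1, 14)]:  # 2 .. 8192
        print(batch, num_classes, average_latency(batch, num_classes))
```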

@grimoire
Member

  1. Yes, it would help us find the bottleneck of the op.
  2. It would be cool if the benchmark can guide you to improve the performance of the op. And we will merge this PR even if the performance is not the best (something is better than nothing). Your benchmark would help us optimize it in the future.
  3. The plan sounds cool. It would be even better if you could also run some tests with real-world data sizes. pytest-benchmark or the PyTorch profiler can help you build the benchmark. You can place the benchmark in this PR.
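
For instance, a minimal pytest-benchmark sketch could look like the following (the parametrized sizes are placeholders, real detection-scale shapes would be better, and the mmcv op is called with its positional (pred, target, gamma, alpha, weight, reduction) arguments):

```python
import pytest
import torch
from mmcv.ops import sigmoid_focal_loss


@pytest.mark.parametrize('batch', [8, 512, 8192])
@pytest.mark.parametrize('num_classes', [8, 80, 1024])
def test_benchmark_sigmoid_focal_loss_cpu(benchmark, batch, num_classes):
    pred = torch.randn(batch, num_classes)
    target = torch.randint(0, num_classes, (batch,))
    # pytest-benchmark repeats the call and reports min/mean/median latency.
    benchmark(sigmoid_focal_loss, pred, target, 2.0, 0.25, None, 'mean')
```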

According to the reviewer's comments:
1. replace expf with exp
2. directly bind the KernelLauncher function to the impl function
data case:
    batch: [2, 4, 8, ..., 8096]
    num_classes: [2, 4, 8, ..., 4096]
preliminary conclusion:
    1. In small data sizes, the implemented op is superior.
    2. In medium and large data sizes, the implemented op is superior.
Note:
    1. The implemented op is compared to torchvision.ops.sigmoid_focal_loss,
and the latter differs in its 'targets' arg. In the benchmark, different
data is used for each implementation to account for this.
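
For illustration, the interface difference mentioned in the note is roughly the following (shapes are arbitrary; mmcv's op takes integer class indices while torchvision's takes a one-hot float tensor with the same shape as the logits):

```python
import torch
import torch.nn.functional as F
from mmcv.ops import sigmoid_focal_loss as mmcv_sigmoid_focal_loss
from torchvision.ops import sigmoid_focal_loss as tv_sigmoid_focal_loss

pred = torch.randn(32, 80)           # (N, C) logits
label = torch.randint(0, 80, (32,))  # mmcv: integer class indices

loss_mmcv = mmcv_sigmoid_focal_loss(pred, label, 2.0, 0.25, None, 'mean')

# torchvision: binary targets with the same shape as the logits.
one_hot = F.one_hot(label, num_classes=80).to(pred.dtype)
loss_tv = tv_sigmoid_focal_loss(pred, one_hot, alpha=0.25, gamma=2.0,
                                reduction='mean')
# Note: the two 'mean' reductions may normalize differently, so this is
# for timing / interface comparison rather than value comparison.
```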
@XULU42
Author

XULU42 commented Jan 16, 2023

I have used pytest-benchmark to benchmark the implemented sigmoid focal loss op; the benchmark code is in the latest commit. On my 8-core notebook, the conclusions are: 1. In small data sizes, the implemented op is superior. 2. In medium and large data sizes, the torch implementation is superior. The processed benchmark data can be seen in benchmark_round10.csv and the raw benchmark data is in
0003_4e4fa93e7c5ee5a15dd63068c0fe9992068bc881_20230116_012432_uncommited-changes.txt

The next step is to optimize this op for medium and large data sizes, and the torchvision implementation may be a good starting point. Any suggestions are welcome ^_^

@grimoire
Member

Cool.
Do remember to remove the benchmark test code before the final commit. Benchmark tests might slow down GitHub Actions.
