
Commit d23da20

committed
first commit
0 parents  commit d23da20


1,273 files changed, with 133,851 additions and 0 deletions.


CITATION.cff

+8
@@ -0,0 +1,8 @@
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - name: "MMDetection Contributors"
title: "OpenMMLab Detection Toolbox and Benchmark"
date-released: 2018-08-22
url: "https://github.com/open-mmlab/mmdetection"
license: Apache-2.0

LICENSE

+203
@@ -0,0 +1,203 @@
Copyright 2018-2023 OpenMMLab. All rights reserved.

Apache License
Version 2.0, January 2004
http://www.apache.org/licenses/

TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION

1. Definitions.

"License" shall mean the terms and conditions for use, reproduction,
and distribution as defined by Sections 1 through 9 of this document.

"Licensor" shall mean the copyright owner or entity authorized by
the copyright owner that is granting the License.

"Legal Entity" shall mean the union of the acting entity and all
other entities that control, are controlled by, or are under common
control with that entity. For the purposes of this definition,
"control" means (i) the power, direct or indirect, to cause the
direction or management of such entity, whether by contract or
otherwise, or (ii) ownership of fifty percent (50%) or more of the
outstanding shares, or (iii) beneficial ownership of such entity.

"You" (or "Your") shall mean an individual or Legal Entity
exercising permissions granted by this License.

"Source" form shall mean the preferred form for making modifications,
including but not limited to software source code, documentation
source, and configuration files.

"Object" form shall mean any form resulting from mechanical
transformation or translation of a Source form, including but
not limited to compiled object code, generated documentation,
and conversions to other media types.

"Work" shall mean the work of authorship, whether in Source or
Object form, made available under the License, as indicated by a
copyright notice that is included in or attached to the work
(an example is provided in the Appendix below).

"Derivative Works" shall mean any work, whether in Source or Object
form, that is based on (or derived from) the Work and for which the
editorial revisions, annotations, elaborations, or other modifications
represent, as a whole, an original work of authorship. For the purposes
of this License, Derivative Works shall not include works that remain
separable from, or merely link (or bind by name) to the interfaces of,
the Work and Derivative Works thereof.

"Contribution" shall mean any work of authorship, including
the original version of the Work and any modifications or additions
to that Work or Derivative Works thereof, that is intentionally
submitted to Licensor for inclusion in the Work by the copyright owner
or by an individual or Legal Entity authorized to submit on behalf of
the copyright owner. For the purposes of this definition, "submitted"
means any form of electronic, verbal, or written communication sent
to the Licensor or its representatives, including but not limited to
communication on electronic mailing lists, source code control systems,
and issue tracking systems that are managed by, or on behalf of, the
Licensor for the purpose of discussing and improving the Work, but
excluding communication that is conspicuously marked or otherwise
designated in writing by the copyright owner as "Not a Contribution."

"Contributor" shall mean Licensor and any individual or Legal Entity
on behalf of whom a Contribution has been received by Licensor and
subsequently incorporated within the Work.

2. Grant of Copyright License. Subject to the terms and conditions of
this License, each Contributor hereby grants to You a perpetual,
worldwide, non-exclusive, no-charge, royalty-free, irrevocable
copyright license to reproduce, prepare Derivative Works of,
publicly display, publicly perform, sublicense, and distribute the
Work and such Derivative Works in Source or Object form.

3. Grant of Patent License. Subject to the terms and conditions of
this License, each Contributor hereby grants to You a perpetual,
worldwide, non-exclusive, no-charge, royalty-free, irrevocable
(except as stated in this section) patent license to make, have made,
use, offer to sell, sell, import, and otherwise transfer the Work,
where such license applies only to those patent claims licensable
by such Contributor that are necessarily infringed by their
Contribution(s) alone or by combination of their Contribution(s)
with the Work to which such Contribution(s) was submitted. If You
institute patent litigation against any entity (including a
cross-claim or counterclaim in a lawsuit) alleging that the Work
or a Contribution incorporated within the Work constitutes direct
or contributory patent infringement, then any patent licenses
granted to You under this License for that Work shall terminate
as of the date such litigation is filed.

4. Redistribution. You may reproduce and distribute copies of the
Work or Derivative Works thereof in any medium, with or without
modifications, and in Source or Object form, provided that You
meet the following conditions:

(a) You must give any other recipients of the Work or
Derivative Works a copy of this License; and

(b) You must cause any modified files to carry prominent notices
stating that You changed the files; and

(c) You must retain, in the Source form of any Derivative Works
that You distribute, all copyright, patent, trademark, and
attribution notices from the Source form of the Work,
excluding those notices that do not pertain to any part of
the Derivative Works; and

(d) If the Work includes a "NOTICE" text file as part of its
distribution, then any Derivative Works that You distribute must
include a readable copy of the attribution notices contained
within such NOTICE file, excluding those notices that do not
pertain to any part of the Derivative Works, in at least one
of the following places: within a NOTICE text file distributed
as part of the Derivative Works; within the Source form or
documentation, if provided along with the Derivative Works; or,
within a display generated by the Derivative Works, if and
wherever such third-party notices normally appear. The contents
of the NOTICE file are for informational purposes only and
do not modify the License. You may add Your own attribution
notices within Derivative Works that You distribute, alongside
or as an addendum to the NOTICE text from the Work, provided
that such additional attribution notices cannot be construed
as modifying the License.

You may add Your own copyright statement to Your modifications and
may provide additional or different license terms and conditions
for use, reproduction, or distribution of Your modifications, or
for any such Derivative Works as a whole, provided Your use,
reproduction, and distribution of the Work otherwise complies with
the conditions stated in this License.

5. Submission of Contributions. Unless You explicitly state otherwise,
any Contribution intentionally submitted for inclusion in the Work
by You to the Licensor shall be under the terms and conditions of
this License, without any additional terms or conditions.
Notwithstanding the above, nothing herein shall supersede or modify
the terms of any separate license agreement you may have executed
with Licensor regarding such Contributions.

6. Trademarks. This License does not grant permission to use the trade
names, trademarks, service marks, or product names of the Licensor,
except as required for reasonable and customary use in describing the
origin of the Work and reproducing the content of the NOTICE file.

7. Disclaimer of Warranty. Unless required by applicable law or
agreed to in writing, Licensor provides the Work (and each
Contributor provides its Contributions) on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
implied, including, without limitation, any warranties or conditions
of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
PARTICULAR PURPOSE. You are solely responsible for determining the
appropriateness of using or redistributing the Work and assume any
risks associated with Your exercise of permissions under this License.

8. Limitation of Liability. In no event and under no legal theory,
whether in tort (including negligence), contract, or otherwise,
unless required by applicable law (such as deliberate and grossly
negligent acts) or agreed to in writing, shall any Contributor be
liable to You for damages, including any direct, indirect, special,
incidental, or consequential damages of any character arising as a
result of this License or out of the use or inability to use the
Work (including but not limited to damages for loss of goodwill,
work stoppage, computer failure or malfunction, or any and all
other commercial damages or losses), even if such Contributor
has been advised of the possibility of such damages.

9. Accepting Warranty or Additional Liability. While redistributing
the Work or Derivative Works thereof, You may choose to offer,
and charge a fee for, acceptance of support, warranty, indemnity,
or other liability obligations and/or rights consistent with this
License. However, in accepting such obligations, You may act only
on Your own behalf and on Your sole responsibility, not on behalf
of any other Contributor, and only if You agree to indemnify,
defend, and hold each Contributor harmless for any liability
incurred by, or claims asserted against, such Contributor by reason
of your accepting any such warranty or additional liability.

END OF TERMS AND CONDITIONS

APPENDIX: How to apply the Apache License to your work.

To apply the Apache License to your work, attach the following
boilerplate notice, with the fields enclosed by brackets "[]"
replaced with your own identifying information. (Don't include
the brackets!) The text should be enclosed in the appropriate
comment syntax for the file format. We also recommend that a
file or class name and description of purpose be included on the
same "printed page" as the copyright notice for easier
identification within third-party archives.

Copyright 2018-2023 OpenMMLab.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

MANIFEST.in

+6
@@ -0,0 +1,6 @@
include requirements/*.txt
include mmdet/VERSION
include mmdet/.mim/model-index.yml
include mmdet/.mim/demo/*/*
recursive-include mmdet/.mim/configs *.py *.yml
recursive-include mmdet/.mim/tools *.sh *.py

README.md

+75
@@ -0,0 +1,75 @@
## Introduction

This is an unofficial replication of "Pix2seq: A Language Modeling Framework for Object Detection", with a pretrained model, built on MMDetection.

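As background on the "language modeling" framing: in the Pix2Seq paper, each object is written as five discrete tokens, four quantized box coordinates followed by a class token. The sketch below illustrates that idea only. The function names, the rounding rule, the 1000-bin vocabulary, and the offset used for class tokens are assumptions made for illustration and are not taken from this repository's code.

```python
def quantize_box(box, image_size, num_bins=1000):
    """Map a pixel box (xmin, ymin, xmax, ymax) to four integer tokens.

    Hypothetical helper: each coordinate is scaled to [0, 1] and rounded
    to one of `num_bins` bins. The paper's exact binning may differ.
    """
    width, height = image_size
    xmin, ymin, xmax, ymax = box

    def to_bin(value, extent):
        frac = min(max(value / extent, 0.0), 1.0)  # clamp to the image
        return int(frac * (num_bins - 1) + 0.5)    # round half up

    # The paper orders coordinates as ymin, xmin, ymax, xmax.
    return [to_bin(ymin, height), to_bin(xmin, width),
            to_bin(ymax, height), to_bin(xmax, width)]


def box_to_tokens(box, label, image_size, num_bins=1000):
    """Append a class token after the coordinate tokens.

    Assumption: class ids are placed after the coordinate bins so that
    both share a single vocabulary.
    """
    return quantize_box(box, image_size, num_bins) + [num_bins + label]


tokens = box_to_tokens((320, 240, 640, 480), label=3, image_size=(640, 480))
print(tokens)
```

A detector built this way predicts the token sequence autoregressively and converts the coordinate tokens back to pixel values by inverting the binning.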
## License

This project is released under the [Apache 2.0 license](LICENSE).

## Installation

Please refer to [get_started.md](docs/get_started.md) for installation.

## Train & Evaluation

Train by running the following (about 10 days on 8 × V100 32 GB GPUs):

```bash
python -m torch.distributed.launch --nproc_per_node=8 --master_port=5003 \
    tools/train.py configs/pix2seq/pix2seq_r50_8x4_50e_coco.py --work-dir pix2seq-output --gpus 8 --launcher pytorch
```

or download the [pretrained pix2seq weights](https://drive.google.com/file/d/1Ku8ZORiLtMs66uleS3aXId7pxlJrTK9d/view?usp=sharing).

Evaluate with a single GPU:

```bash
python tools/test.py configs/pix2seq/pix2seq_r50_8x4_300_coco.py \
    weights/checkpoints.pth --work-dir pix2seq-output --eval bbox --show-dir pix2seq-vis
```

Evaluate with 8 GPUs:

```bash
python -m torch.distributed.launch --nproc_per_node=8 --master_port=5003 \
    tools/test.py configs/pix2seq/pix2seq_r50_8x4_300_coco.py weights/checkpoints.pth \
    --work-dir pix2seq-output --eval bbox --launcher pytorch
```

| Method | Backbone | Epochs | Batch Size | AP | AP50 | AP75 |
| :----: | :------: | :----: | :--------: | :--: | :--: | :--: |
| Ours | R50 | 300 | 32 | 36.4 | 52.8 | 38.5 |
| Paper | R50 | 300 | 128 | 43.0 | 61.0 | 45.6 |

## Visualization

![](https://github.com/Sharpiless/mmdet-Pix2Seq/blob/main/resources/007114.jpg)

![](https://github.com/Sharpiless/mmdet-Pix2Seq/blob/main/resources/007351.jpg)

![](https://github.com/Sharpiless/mmdet-Pix2Seq/blob/main/resources/008322.jpg)

![](https://github.com/Sharpiless/mmdet-Pix2Seq/blob/main/resources/000000289393.jpg)

![](https://github.com/Sharpiless/mmdet-Pix2Seq/blob/main/resources/000000212559.jpg)

![](https://github.com/Sharpiless/mmdet-Pix2Seq/blob/main/resources/000000255664.jpg)

## TO-DO

- [x] random shuffle targets
- [x] training from scratch
- [x] drop class token
- [x] stochastic depth
- [x] large scale jittering
- [ ] support for custom dataset
- [x] two independent augmentations for each image
- [x] FrozenBatchNorm2d in backbones
- [x] auto-augment
- [x] nucleus sampling

## Acknowledgement

[https://github.com/gaopengcuhk/Pretrained-Pix2Seq](https://github.com/gaopengcuhk/Pretrained-Pix2Seq)

[https://github.com/open-mmlab/mmdetection](https://github.com/open-mmlab/mmdetection)

README_zh-CN.md

+14
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
$$
2+
\begin{array}{l}
3+
P(z_{n} \geq z_{n^{\prime}} ; \forall n^{\prime} \neq n \mid\{\pi_{n^{\prime}}\}_{n^{\prime}=1}^{N})\\
4+
=\int \prod_{n^{\prime} \neq n} e^{-e^{-(z_{n}-\pi_{n^{\prime}})}} \cdot e^{-(z_{n}-\pi_{n})-e^{-(z_{n}-\pi_{n})}} d z_{n}\\
5+
=\int e^{-\sum_{n^{\prime} \neq n} e^{-(z_{n}-\pi_{n})}-(z_{n}-\pi_{n})-e^{-(z_{n}-\pi_{n})}} d z_{n}\\
6+
=\int e^{-\sum_{n=1}^{N} e^{-(z_{n}-\pi_{n^{\prime}})}-(z_{n}-\pi_{n})} d z_{n}\\
7+
=\int e^{-(\sum_{n=1}^{N} e^{\pi_{n^{\prime}}}) e^{-z_{n}}-z_{n}+\pi_{n}} d z_{n}\\
8+
=\int e^{-e^{-z_{n}+\ln (\sum_{n=1}^{N}} e^{\pi^{\pi} n})_{-z_{n}+\pi_{n}} d z_{n}}\\
9+
=\int e^{-e^{-(z_{n}-\ln (\sum_{n=1}^{N}} e^{\pi_{n^{\prime}}}))}(z_{n}-\ln (\sum_{n^{\prime}=1}^{N} e^{\pi_{n^{\prime}}}))-\ln (\sum_{n^{\prime}=1}^{N} e^{\pi^{\prime}} n^{\prime})+\pi_{n} d z_{n}\\
10+
=e^{-\ln (\sum_{n^{\prime}}^{N} e^{e} e^{\pi_{\prime}})+\pi_{n}} \int e^{-e^{-(z_{n}-\ln (\sum_{n}^{N}=1} e^{\pi_{n^{\prime}}}))}(z_{n}-\ln (\sum_{n^{\prime}=1}^{N} e^{\pi_{n^{\prime}})} d z_{n}\\
11+
=\frac{e^{\pi_{n}}}{\sum_{n^{\prime}=1}^{N} e^{\pi_{n^{\prime}}}} \int e^{-e^{-(z_{n}-\ln (\sum_{n}^{N}=1} e^{\pi^{\prime}}{ }_{n}^{\prime}))}(z_{n}-\ln (\sum_{n^{\prime}=1}^{N} e^{.\pi_{n^{\prime}})}) d z_{n}\\
12+
=\frac{e^{\pi_{n}}}{\sum_{n=1}^{N} e^{\pi_{n^{\prime}}}} \int e^{-(z_{n}-\ln (\sum_{n=1}^{N} e^{\pi_{n^{\prime}}}))-e^{-(z_{n}-\ln (\sum_{n}^{N}=1} e^{\pi_{n^{\prime}}})} d z_{n}
13+
\end{array}
14+
$$
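The remaining integrand in the last line is the density of a Gumbel distribution with location ln(Σ e^{π_{n'}}), which integrates to 1, so the probability reduces to the softmax of the π values. This is the Gumbel-max trick. The sketch below is a rough empirical check under the assumption that each z_n is π_n plus independent standard Gumbel noise; the function names, the example logits, and the sample count are illustrative choices, not part of the commit.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def gumbel_argmax(logits, rng):
    """Return the index n that maximizes z_n = pi_n + Gumbel(0, 1) noise."""
    best_idx, best_val = 0, -math.inf
    for i, pi in enumerate(logits):
        u = rng.random()
        while u == 0.0:  # avoid log(0); random() can return exactly 0.0
            u = rng.random()
        z = pi - math.log(-math.log(u))  # inverse-CDF Gumbel sample
        if z > best_val:
            best_idx, best_val = i, z
    return best_idx


def empirical_frequencies(logits, num_samples=100_000, seed=0):
    """Fraction of draws in which each index wins the argmax."""
    rng = random.Random(seed)
    counts = [0] * len(logits)
    for _ in range(num_samples):
        counts[gumbel_argmax(logits, rng)] += 1
    return [c / num_samples for c in counts]


logits = [1.0, 0.0, -1.0, 2.0]  # arbitrary example values of pi_n
expected = softmax(logits)
observed = empirical_frequencies(logits)
for n, (p, q) in enumerate(zip(expected, observed)):
    print(f"n={n}: softmax={p:.4f}  gumbel-max frequency={q:.4f}")
```

With enough samples the two columns should agree to within sampling noise.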
@@ -0,0 +1,56 @@
# dataset settings
dataset_type = 'CityscapesDataset'
data_root = 'data/cityscapes/'
img_norm_cfg = dict(
    mean=[123.675, 116.28, 103.53], std=[58.395, 57.12, 57.375], to_rgb=True)
train_pipeline = [
    dict(type='LoadImageFromFile'),
    dict(type='LoadAnnotations', with_bbox=True),
    dict(
        type='Resize', img_scale=[(2048, 800), (2048, 1024)], keep_ratio=True),
    dict(type='RandomFlip', flip_ratio=0.5),
    dict(type='Normalize', **img_norm_cfg),
    dict(type='Pad', size_divisor=32),
    dict(type='DefaultFormatBundle'),
    dict(type='Collect', keys=['img', 'gt_bboxes', 'gt_labels']),
]
test_pipeline = [
    dict(type='LoadImageFromFile'),
    dict(
        type='MultiScaleFlipAug',
        img_scale=(2048, 1024),
        flip=False,
        transforms=[
            dict(type='Resize', keep_ratio=True),
            dict(type='RandomFlip'),
            dict(type='Normalize', **img_norm_cfg),
            dict(type='Pad', size_divisor=32),
            dict(type='ImageToTensor', keys=['img']),
            dict(type='Collect', keys=['img']),
        ])
]
data = dict(
    samples_per_gpu=1,
    workers_per_gpu=2,
    train=dict(
        type='RepeatDataset',
        times=8,
        dataset=dict(
            type=dataset_type,
            ann_file=data_root +
            'annotations/instancesonly_filtered_gtFine_train.json',
            img_prefix=data_root + 'leftImg8bit/train/',
            pipeline=train_pipeline)),
    val=dict(
        type=dataset_type,
        ann_file=data_root +
        'annotations/instancesonly_filtered_gtFine_val.json',
        img_prefix=data_root + 'leftImg8bit/val/',
        pipeline=test_pipeline),
    test=dict(
        type=dataset_type,
        ann_file=data_root +
        'annotations/instancesonly_filtered_gtFine_test.json',
        img_prefix=data_root + 'leftImg8bit/test/',
        pipeline=test_pipeline))
evaluation = dict(interval=1, metric='bbox')
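To make three settings in this config concrete, here is a small pure-Python sketch of what `Normalize`, a keep-ratio `Resize`, and `RepeatDataset` amount to numerically. It does not call MMDetection. The helper names are hypothetical, the rounding rule is an assumption about how the resize is implemented, and the 1000-image dataset size is a made-up figure.

```python
# Values copied from img_norm_cfg in the config above.
MEAN = [123.675, 116.28, 103.53]
STD = [58.395, 57.12, 57.375]


def normalize_pixel(bgr_pixel, mean=MEAN, std=STD, to_rgb=True):
    """Normalize one pixel given as (B, G, R) values in the 0-255 range.

    Assumes images are loaded in BGR order and that mean/std are listed
    in RGB order, so the channels are reversed first when to_rgb is set.
    """
    channels = list(reversed(bgr_pixel)) if to_rgb else list(bgr_pixel)
    return [(c - m) / s for c, m, s in zip(channels, mean, std)]


def keep_ratio_size(width, height, scale):
    """Largest size with the original aspect ratio that fits inside `scale`.

    `scale` is a (long_edge, short_edge) pair such as (2048, 1024).
    """
    long_edge, short_edge = max(scale), min(scale)
    factor = min(long_edge / max(width, height),
                 short_edge / min(width, height))
    return int(width * factor + 0.5), int(height * factor + 0.5)


def repeated_length(num_images, times=8):
    """Samples seen per epoch when the training set is repeated `times` times."""
    return num_images * times


print(normalize_pixel((103.53, 116.28, 123.675)))    # a pixel equal to the mean
print(keep_ratio_size(2048, 1024, (2048, 800)))      # smallest training scale
print(keep_ratio_size(2048, 1024, (2048, 1024)))     # largest training scale
print(repeated_length(1000))                         # hypothetical dataset size
```

Repeating the dataset eight times makes each nominal epoch eight passes over the training images, which reduces per-epoch overhead on a small dataset.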
