@@ -2,7 +2,7 @@ TrackML utility library
2
2
=======================
3
3
4
4
A python library to simplify working with the
5
- [ High Energy Physics Tracking Machine Learning challenge] ( kaggle_trackml )
5
+ [ High Energy Physics Tracking Machine Learning challenge] [ kaggle_trackml ]
6
6
dataset.
7
7
8
8
Installation
@@ -96,11 +96,10 @@ some hits can be left unassigned). The training dataset contains the recorded
96
96
hits, their truth association to particles, and the initial parameters of those
97
97
particles. The test dataset contains only the recorded hits.
98
98
99
- The dataset is provided as a set of plain ` .csv ` files (` .csv.gz ` or ` .csv.bz2 `
100
- are also allowed). Each event has four associated files that contain hits, hit
101
- cells, particles, and the ground truth association between them. The common
102
- prefix (like ` event000000000 ` ) is fully constrained to be ` event ` followed by 9
103
- digits.
99
+ The dataset is provided as a set of plain ` .csv ` files. Each event has four
100
+ associated files that contain hits, hit cells, particles, and the ground truth
101
+ association between them. The common prefix, e.g. ` event000000010 ` , is always
102
+ ` event ` followed by 9 digits.
104
103
105
104
event000000000-hits.csv
106
105
event000000000-cells.csv
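The naming rule described above (`event` followed by 9 digits, then a per-file suffix) can be checked with a small helper. This is a minimal sketch using only the standard library, not part of the library's API. The function names `split_event_filename` and `event_id_from_prefix` are hypothetical, and the pattern assumes the suffix is a lowercase word such as `hits` or `cells` followed by `.csv`.

```python
import re

# Assumed pattern: "event" + exactly 9 digits, a dash, a lowercase file kind, ".csv".
EVENT_FILE = re.compile(r"^(event\d{9})-([a-z]+)\.csv$")


def split_event_filename(name):
    """Return (prefix, kind) for a per-event file name, or None if it does not match."""
    match = EVENT_FILE.match(name)
    if match is None:
        return None
    return match.group(1), match.group(2)


def event_id_from_prefix(prefix):
    """Return the integer event number encoded in a prefix such as 'event000000010'."""
    return int(prefix[len("event"):])
```

For example, `split_event_filename("event000000010-hits.csv")` yields the prefix and the file kind, while a name with fewer than 9 digits is rejected.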
@@ -122,7 +121,7 @@ a name starting with `submission`, e.g.
122
121
The hits file contains the following values for each hit/entry:
123
122
124
123
* ** hit_id** : numerical identifier of the hit inside the event.
125
- * ** x, y, z** : measured x, y, z position (in millimeters ) of the hit in
124
+ * ** x, y, z** : measured x, y, z position (in millimeter ) of the hit in
126
125
global coordinates.
127
126
* ** volume_id** : numerical identifier of the detector group.
128
127
* ** layer_id** : numerical identifier of the detector layer inside the
@@ -159,7 +158,7 @@ The particles files contains the following values for each particle/entry:
159
158
coordinates.
160
159
* ** px, py, pz** : initial momentum (in GeV/c) along each global axis.
161
160
* ** q** : particle charge (as multiple of the absolute electron charge).
162
- * ** nhits** : number of hits generated by this particle
161
+ * ** nhits** : number of hits generated by this particle.
163
162
164
163
All entries contain the generated information or ground truth.
165
164
@@ -171,7 +170,8 @@ particle/track.
171
170
172
171
* ** hit_id** : numerical identifier of the hit as defined in the hits file.
173
172
* ** particle_id** : numerical identifier of the generating particle as defined
174
- in the particles file.
173
+ in the particles file. A value of 0 means that the hit did not originate
174
+ from a reconstructible particle, but e.g. from detector noise.
175
175
* ** tx, ty, tz** true intersection point in global coordinates (in
176
176
millimeters) between the particle trajectory and the sensitive surface.
177
177
* ** tpx, tpy, tpz** true particle momentum (in GeV/c) in the global
@@ -186,14 +186,57 @@ The submission file must associate each hit in each event to one and only one
186
186
reconstructed particle track. The reconstructed tracks must be uniquely
187
187
identified only within each event. Participants are advised to compress the
188
188
submission file (with zip, bzip2, gzip) before submission to the
189
- [ Kaggle site] ( kaggle_trackml ) .
189
+ [ Kaggle site] [ kaggle_trackml ] .
190
190
191
191
* ** event_id** : numerical identifier of the event; corresponds to the number
192
192
found in the per-event file name prefix.
193
193
* ** hit_id** : numerical identifier of the hit inside the event as defined in
194
194
the per-event hits file.
195
195
* ** track_id** : user-defined numerical identifier (non-negative integer) of
196
- the track
196
+ the track.
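The submission columns listed above can be written with the standard `csv` module. The following is a minimal sketch, not the library's own writer. The function name `write_submission` and the input layout (a mapping from event id to `(hit_id, track_id)` pairs) are hypothetical, and the header row is an assumption. It enforces the two constraints stated in the text: each hit appears only once per event, and `track_id` is a non-negative integer.

```python
import csv
import io


def write_submission(stream, assignments):
    """Write event_id, hit_id, track_id rows for all events to a text stream.

    `assignments` maps an event id to an iterable of (hit_id, track_id) pairs.
    """
    writer = csv.writer(stream, lineterminator="\n")
    # Assumed header row with the three documented column names.
    writer.writerow(["event_id", "hit_id", "track_id"])
    for event_id, pairs in assignments.items():
        seen = set()  # hit ids already written for this event
        for hit_id, track_id in pairs:
            if hit_id in seen:
                raise ValueError(f"hit {hit_id} assigned more than once in event {event_id}")
            if track_id < 0:
                raise ValueError("track_id must be a non-negative integer")
            seen.add(hit_id)
            writer.writerow([event_id, hit_id, track_id])


buffer = io.StringIO()
write_submission(buffer, {0: [(1, 0), (2, 0), (3, 1)]})
submission_text = buffer.getvalue()
```

Track ids only need to be unique within an event, so the same `track_id` values can be reused across events.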
197
+
198
+ ### Additional detector geometry information
199
+
200
+ The detector modules that measure particles and generate the hits are organized
201
+ into detector groups or volumes identified by a volume id. Inside a volume they
202
+ are further grouped into layers identified by a layer id. Each layer can contain
203
+ an arbitrary number of detector modules, the smallest geometrically distinct
204
+ detector object, each identified by a module_id. Within each group detector
205
+ modules are of the same type and have, e.g., the same granularity. All simulated
206
+ detector modules are so-called semiconductor sensors that are built from thin
207
+ silicon sensor chips. Each module can be represented by a two-dimensional,
208
+ planar, bounded sensitive surface. These sensitive surfaces are subdivided into
209
+ regular grids that define the detector cells, the smallest granularity within
210
+ the detector.
211
+
212
+ Each module has a different position and orientation described in the detectors
213
+ file. A local, right-handed coordinate system is defined on each sensitive
214
+ surface such that the first two coordinates u and v are on the sensitive surface
215
+ and the third coordinate w is normal to the surface. The orientation and
216
+ position are defined by the following transformation
217
+
218
+ pos_xyz = rotation_matrix * pos_uvw + offset
219
+
220
+ that transforms a position described in local coordinates u,v,w into the
221
+ equivalent position x,y,z in global coordinates using a rotation matrix and
222
+ an offset.
223
+
224
+ * ** volume_id** : numerical identifier of the detector group.
225
+ * ** layer_id** : numerical identifier of the detector layer inside the
226
+ group.
227
+ * ** module_id** : numerical identifier of the detector module inside
228
+ the layer.
229
+ * ** cx, cy, cz** : position of the local origin described in the global
230
+ coordinate system (in millimeter).
231
+ * ** rot_xu, rot_xv, rot_xw, rot_yu, ...** : components of the rotation matrix
232
+ to rotate from local u,v,w to global x,y,z coordinates.
233
+ * ** module_t** : thickness of the detector module (in millimeter).
234
+ * ** module_minhu, module_maxhu** : the minimum/maximum half-length of the
235
+ module boundary along the local u direction (in millimeter).
236
+ * ** module_hv** : the half-length of the module boundary along the local v
237
+ direction (in millimeter).
238
+ * ** pitch_u, pitch_v** : the size of detector cells along the local u and v
239
+ directions (in millimeter).
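The transformation `pos_xyz = rotation_matrix * pos_uvw + offset` and its inverse can be sketched in plain Python. This is an illustrative sketch under stated assumptions, not the library's implementation. The function names are hypothetical, the rotation matrix is assumed to be given as three rows for x, y, z with columns for u, v, w (so the first row would hold `rot_xu, rot_xv, rot_xw`), and the offset is assumed to be `(cx, cy, cz)`.

```python
def local_to_global(rotation, offset, pos_uvw):
    """Apply pos_xyz = rotation * pos_uvw + offset for one position."""
    return tuple(
        sum(rotation[i][j] * pos_uvw[j] for j in range(3)) + offset[i]
        for i in range(3)
    )


def global_to_local(rotation, offset, pos_xyz):
    """Invert the transformation above.

    A rotation matrix is orthogonal, so its inverse is its transpose.
    """
    shifted = [pos_xyz[i] - offset[i] for i in range(3)]
    return tuple(
        sum(rotation[i][j] * shifted[i] for i in range(3))
        for j in range(3)
    )


# Example module: rotated by 90 degrees around the global z axis.
rotation = [
    [0, -1, 0],
    [1, 0, 0],
    [0, 0, 1],
]
offset = (10, 20, 30)

pos_xyz = local_to_global(rotation, offset, (1, 2, 3))
pos_uvw = global_to_local(rotation, offset, pos_xyz)
```

Since the sensitive surface lies in the local u-v plane, a point on the surface has w = 0 in local coordinates.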
197
240
198
241
199
242
[ cern ] : https://home.cern