-
Notifications
You must be signed in to change notification settings - Fork 13
Open
Description
Develop a node to use a ML model to recognize activities from video
-
Use this model with the pretrained weights for the charades dataset
-
Class labels for the Charades dataset can be found in the file 'Charades_v1_classes.txt' here
The node should:
- subscribe to a camera topic (for rgb images)
- accumulate N frames (for this model it looks like they use 64 frames)
- transform them as expected by the model (resized to 224x224 and reshaped to 3x64x224x224)
- classify the sequence of frames
- Repeat for next sequence of frames
Metadata
Metadata
Assignees
Labels
No labels