-
Notifications
You must be signed in to change notification settings - Fork 22
Open
Description
This is a vision/language RED model with low PCC lately that our team owns and needs some debug.
Some more details
This test tests/runner/test_models.py::test_all_models_torch[mistral/pixtral/pytorch-single_device-inference] was passing prior to ~Dec 17 but then regressed to 0.96 PCC and then to 0.93 around Dec 25 where PCC was disabled via #2680
specific_test_case model_type group arch bringup_status model_status pcc pcc_thres pcc_en parallelism time guidance
test_all_models_torch[mistral/pixtral/pytorch-single_device-inference] vision red p150 INCORRECT_RESULT PASSING 0.9310660697632953 0.99 PCC_DIS single_device 160.218 N/A
test_all_models_torch[mistral/pixtral/pytorch-tensor_parallel-inference] vision red n300-llmbox INCORRECT_RESULT PASSING 0.9760126530058753 0.99 PCC_DIS tensor_parallel 189.437 N/A
It's failing for p150 (single_device) and n300-llmbox (tensor_parallel) but single_device will be easier to debug.
Assign to @devisettymahidhar608 can you please take a look? Some bisecting could make sense based on above datapoints, maybe.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels