Do you use the entire dataset to test the metrics?
Do you use the entire dataset to test the metrics?