Refactor generation of reports in clients by vpetrovicTT · Pull Request #1969 · tenstorrent/tt-inference-server

vpetrovicTT · 2026-02-02T09:46:50Z

Currently, all media clients (audio_client.py, image_client.py, cnn_client.py, tts_client.py, embedding_client.py) implement their own _generate_report() methods with significant code duplication. Refactor report generation into a scalable and reusable mechanism.

github-actions · 2026-02-02T10:51:48Z

✅ Test Coverage Report

Coverage of Changed Lines

Metric	Value
Coverage	%
Threshold	50%
Status	✅ PASSED

💡 This checks coverage of newly added/modified lines only, not total codebase coverage.

github-actions · 2026-02-02T10:51:57Z

✅ Test Results - PASSED

Summary

Component	Total	Passed	Status
tt-inference-server	385	385	✅
tt-media-server	464	464	✅
Overall	849	849	✅

Details

Python Version: 3.10
Workflow: Test Gate
Commit: 687179a
Run ID: 21898176042

🎉 All tests passed! This PR is ready for review.

…ents

fivanovicTT

I like the idea of having metrics_utils since we can utilize these across all clients.

However, I think we can improve upon this refactoring. Here are my suggestions:

metric_utils.py

We can extend BaseTestStatus to have an abstract method / interface so that each status class (eg: ImageGenerationTestStatus) can declare which metrics it provides. Image client will have different methods than audio client for example

Bonus points: we can have a single method in BaseTestStatus:

def get_metrics(self) -> Dict[str, float]:
    """Return available metrics for this status type.
    
    Returns dict with keys like 'elapsed', 'ttft', 'rtr', 'tsu', etc.
    Only includes metrics that have values (not None).
    """
    pass

Each subclass implements it, returning only non-None metrics

report generation

While looking at the codebase, I saw that also we should extract eval report generation functionality. Having this also in mind, I think it would be really good to separate this from base_strategy_interface.py (you can create a new file report_utils.py)

We will have these 2 methods:

_generate_benchmark_report(status_list, task_type, extra_data=None)
_generate_eval_report(status_list, task_type, extra_data=None)

where each client would only provide:

task_type (string like "image", "audio", etc.)
extra_benchmarks dict for client-specific fields

Bonus points: We should utilize Dependency Inversion and use ReportContext object for example to pass necessary data to ReportGenerator

With these changes we will have: testability (easy to test each component), single responsibility, loose coupling and reusability

utils/media_clients/base_strategy_interface.py

…ents

fivanovicTT

General comments and improvements:

Let's keep base_strategy_interface.py clean -> don't introduce report related methods which are not part of that class. We are breaking single responsibility here and bloating the class (they should be in report related class).

I think you can just make ReportGenerator available to clients and let them handle the rest.

We can have simpler clients with something like this:

class TtsClientStrategy(BaseMediaStrategy):
    def run_benchmark(self, num_calls: int) -> list[TtsTestStatus]:
        # ... run benchmark ...
        status_list = self._run_tts_benchmark(num_calls)
        
        # Client builds its own extras and calls ReportGenerator directly
        context = ReportContext(
            model_name=self.config.model_spec.model_name,
            device_name=self.config.device_name,
            output_path=self.config.output_path,
            model_id=self.config.model_spec.model_id,
        )
        extras = {"ttft": calculate_ttft(status_list) / 1000, ...}
        self.report_generator.generate_benchmark_report(context, status_list, "tts", extras)
        
        return status_list

Could you think of way to even more simplify metrics_utils.py - seems to me there are too many granularity and it could be even more simplified? If TestStatus._METRIC_ATTRS already defines what fields exist and get_metrics() returns them, why do we need separate calculate_ttft(), calculate_rtr() functions?

Let's see how can we improve current implementation. Keep up the great work! 🚀

utils/media_clients/utils/__init__.py

utils/media_clients/utils/metrics_utils.py

utils/media_clients/utils/report_utils.py

utils/media_clients/tts_client.py

…ents

feat(reports): Early refactoring of report generating

594ca7e

vpetrovicTT changed the title ~~Refactor generate report in clients~~ Refactor generation of reports in clients Feb 2, 2026

fivanovicTT changed the base branch from main to dev February 2, 2026 10:47

vpetrovicTT added 4 commits February 2, 2026 10:47

feat(rep): Delegate metric calcs to util, test on TTS workflow

c5df682

feat(rep): Add utils for metric calcs

8d838d6

feat(rep): Refactor tts cli to work with suggested arc

f140c79

fix(rep): Fix Ruff format errs

6f01823

vpetrovicTT linked an issue Feb 2, 2026 that may be closed by this pull request

Tech Debt - _generate_report in clients #1807

Open

Merge branch 'dev' into vpetrovic/feature/1807-generate-report-in-cli…

d79e783

…ents

fivanovicTT requested changes Feb 2, 2026

View reviewed changes

vpetrovicTT added 18 commits February 3, 2026 08:34

fix(tts): Refactor client to suit new suggested arc

01e3640

test(tts): Update tests

505b368

feat(tts): Early look of metric calc funcs

7ce37ae

feat(tts): Early take on metrics aggregation (exploratory)

21ee351

feat(rep): Explore new arc in test status for TTS

51b2d72

feat(rep): Test suggested arc

b68c0ed

test(tts): Update test for tts cli

fc62b2c

fix(rep): Fix Ruff format errs

a5c861e

fix(tts): Refactor report generating into class

50a209f

fix(tts): Move to different folder utils for media report generating

b7225a7

fix(tts): Refactor base status class

75fbc4c

fix(rep): Add metrics aggregator and rep generator as DI's to base strat

c40d966

fix(rep): Minor refactor in metric utils

173de24

fix(tts): Fix tests after refactoring

b26ec22

Merge branch 'dev' into vpetrovic/feature/1807-generate-report-in-cli…

a879a3b

…ents

fix(tts): Fix Ruff errs

566ea46

fix(tts): Resolve conflicts and fix tests

7d7ddaa

fix(ruff): Fix errs

0681860

Merge branch 'dev' into vpetrovic/feature/1807-generate-report-in-cli…

8fcb15c

…ents

fivanovicTT requested changes Feb 4, 2026

View reviewed changes

vpetrovicTT added 16 commits February 4, 2026 13:31

fix(tts): Minor comm fixes

9e74099

fix(rep): Cleanup of base strat cls

43dff98

fix(rep): Refactor Report utility class

4213874

Final refactor of metric utilities

e8a75ee

feat(rep): Introduce new constants file

7d6d8dc

fix(strat): Final touches on base strat

c575d83

fix(tts): Refactor client

61368cb

fix(tts): Clean status cls

0599027

Merge branch 'dev' into vpetrovic/feature/1807-generate-report-in-cli…

f2c407d

…ents

fix(tts): Fix tests

1bb377e

fix(metrics): Remove redudant comms

36591b4

fix(tts): Final code polishing

25b9121

Merge branch 'dev' into vpetrovic/feature/1807-generate-report-in-cli…

ffb80bb

…ents

Merge branch 'dev' into vpetrovic/feature/1807-generate-report-in-cli…

0d2d7d3

…ents

Merge branch 'dev' into vpetrovic/feature/1807-generate-report-in-cli…

f743cb2

…ents

Merge branch 'dev' into vpetrovic/feature/1807-generate-report-in-cli…

7ba48ea

…ents

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor generation of reports in clients#1969

Refactor generation of reports in clients#1969
vpetrovicTT wants to merge 41 commits intodevfrom
vpetrovic/feature/1807-generate-report-in-clients

vpetrovicTT commented Feb 2, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 2, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 2, 2026 •

edited

Loading

Uh oh!

fivanovicTT left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fivanovicTT left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

vpetrovicTT commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Test Coverage Report

Coverage of Changed Lines

Uh oh!

github-actions bot commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Test Results - PASSED

Summary

Details

Uh oh!

fivanovicTT left a comment

Choose a reason for hiding this comment

metric_utils.py

report generation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fivanovicTT left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vpetrovicTT commented Feb 2, 2026 •

edited

Loading

github-actions bot commented Feb 2, 2026 •

edited

Loading

github-actions bot commented Feb 2, 2026 •

edited

Loading