Skip to content

feat(metrics): add /metrics which returns prometheus metrics #95

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Nov 4, 2024

Conversation

co42
Copy link
Contributor

@co42 co42 commented Oct 23, 2024

Will be useful for Dedicated Endpoints to improve request forwarding fairness and scaling

@co42 co42 force-pushed the feat/metrics-concurrent-connection branch 2 times, most recently from 8ec058c to 551d454 Compare October 23, 2024 15:12
Comment on lines +79 to +80
f"inf_batch_current_size {batch_current_size}\n" +
f"inf_queue_size {queue_size}\n"
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure the naming is the best, I copied TGI

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are we okay that this will always be 1?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If it bother you or makes it unclear for customers I can remove inf_batch_current_size

Copy link
Member

@alvarobartt alvarobartt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, maybe let's wait and see if @philschmid agrees on the metrics reported and we can merge 🤗 Thanks!

@alvarobartt alvarobartt added the enhancement New feature or request label Oct 28, 2024
@co42 co42 force-pushed the feat/metrics-concurrent-connection branch from 551d454 to 05868e5 Compare November 4, 2024 09:13
Copy link
Contributor

@philschmid philschmid left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Comment on lines +79 to +80
f"inf_batch_current_size {batch_current_size}\n" +
f"inf_queue_size {queue_size}\n"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are we okay that this will always be 1?

@co42 co42 merged commit 6b17e6c into main Nov 4, 2024
6 checks passed
@co42 co42 deleted the feat/metrics-concurrent-connection branch November 4, 2024 10:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants