Skip to content

update inferentia2 image #106

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
Apr 14, 2025
Merged

update inferentia2 image #106

merged 4 commits into from
Apr 14, 2025

Conversation

oOraph
Copy link
Contributor

@oOraph oOraph commented Mar 13, 2025

  • Update base image to ubuntu 22.04 (20.04 reaches end of support this year anyway)
  • Remove some non required dependencies for the webservice to start
  • Remove unused build arg from docker file

@oOraph
Copy link
Contributor Author

oOraph commented Mar 13, 2025

side note: I may be wrong in removing all the dependencies (openjdk and stuff). I did so because I do not see where they are used, so no need to embed them, but I may miss the whole picture about the image usage. I just built the inf2 image as if I were to deploy it on HF endpoints, for the encoder models it now supports on inf2

@oOraph oOraph requested review from ErikKaum and alvarobartt March 13, 2025 09:43
@oOraph
Copy link
Contributor Author

oOraph commented Mar 13, 2025

for the inf2_env.py helper script, strongly inspired from the script we made on tgi neuronx side

@oOraph oOraph marked this pull request as ready for review March 13, 2025 09:59
@oOraph oOraph force-pushed the dev/inf2_image branch 3 times, most recently from 038e9eb to 78a245c Compare March 13, 2025 15:34
Copy link
Member

@ErikKaum ErikKaum left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should be good, thank you 👍

oOraph added 4 commits April 2, 2025 16:06
20.04 is reaching end of support this year anyway

remove some unused dependencies for the inference web service to run

Signed-off-by: Raphael Glon <[email protected]>
- + fix broken dep install (kenlm)

Signed-off-by: Raphael Glon <[email protected]>
use the same HF_MODEL_ID env var as the main server

Signed-off-by: Raphael Glon <[email protected]>
@alvarobartt alvarobartt merged commit 8c3562d into main Apr 14, 2025
6 checks passed
@alvarobartt alvarobartt deleted the dev/inf2_image branch April 14, 2025 07:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants