Llama 3.2 1B Instruct on TPU v4, bumping transformers to 4.45.2 by artus-LYTiQ · Pull Request #109 · huggingface/optimum-tpu

artus-LYTiQ · 2024-10-20T13:33:43Z

Added llama3 rope_type implementation and changed default model to Llama 3.2 1B Instruct.

Create an adaptation of the HF transformer's llama3 rope_type implementation in modeling_llama.py.

Updated the dependency to the current transformer library version 4.45.2.

Added more logging to distributed_model.py as the TPU v4-8 vms love to hang at random places when running this code.

Fixes #80

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Note that the two new test are just manual test, not pytests. The rope implementation is unvalidated - we just pray and are happy that it still generates tokens XD

tengomucho

We will wait for the other contribution to be merged before merging this one, but thank you for contributing! Can you confirm the models you have tested with your changes?

tengomucho · 2024-10-21T08:35:01Z

+    next_token_id = torch.argmax(next_logits, dim=-1)[:, None].int()
+    return next_token_id
+
+def _test_distributed_model_generation(model_id, max_new_tokens=20):


for tests, please create one test similar to tests/test_distributed_model.py (or modify the existing one). To launch it, you can use pytest: python -m pytest -sv /path/to/test_mytest.py::test_my_test_function.

artus-LYTiQ added 8 commits October 20, 2024 01:12

Installation guide for TPU v4.

3caeae8

Forgot pyproject.toml

446bb2c

Llama 3.2 readiness, current transformer version

1ea17d0

Note that the two new test are just manual test, not pytests. The rope implementation is unvalidated - we just pray and are happy that it still generates tokens XD

Fix - again forgot a file due to rename

7210745

Changed logging back to debug

caab98c

removed restore of hf hub from gs bucket

d7ea97a

change default model for generation, updated logging

f0237d1

Changed default models

1588182

tengomucho reviewed Oct 21, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Llama 3.2 1B Instruct on TPU v4, bumping transformers to 4.45.2#109

Llama 3.2 1B Instruct on TPU v4, bumping transformers to 4.45.2#109
artus-LYTiQ wants to merge 8 commits into
huggingface:mainfrom
artus-LYTiQ:llama3-tpu

artus-LYTiQ commented Oct 20, 2024

Uh oh!

tengomucho left a comment

Uh oh!

tengomucho Oct 21, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

artus-LYTiQ commented Oct 20, 2024

Added llama3 rope_type implementation and changed default model to Llama 3.2 1B Instruct.

Before submitting

Uh oh!

tengomucho left a comment

Choose a reason for hiding this comment

Uh oh!

tengomucho Oct 21, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants