
Add HF_TRUST_REMOTE_CODE environment variable #78


Merged: alvarobartt merged 10 commits into main from trust-remote-code-env on Aug 9, 2024

Conversation

@alvarobartt (Member)

Description

As flagged by Changyu Zhu from Google, there was no way of setting trust_remote_code=True when loading a model from the Hugging Face Hub that requires remote code execution, whereas for Text Generation Inference (TGI) this is possible via the TRUST_REMOTE_CODE environment variable (see https://huggingface.co/docs/text-generation-inference/en/basic_tutorials/safety).

This PR adds the HF_TRUST_REMOTE_CODE environment variable so that it can be set for transformers, sentence-transformers, and diffusers pipelines. Additionally, this PR fixes the **kwargs propagation for both sentence-transformers and diffusers. Finally, it updates the README.md to reflect the HF_TRUST_REMOTE_CODE addition, while also fixing some typos and aligning the formatting.

Note

As of the recent merges of #76 and #77, the version in setup.py has also been bumped to 0.4.1 to include those changes and generate the wheel accordingly. So, once this PR is merged, the version on the main branch should point to 0.4.2.dev0 instead.

The `strtobool` function had to be defined within `huggingface_inference_toolkit`,
since it comes from `distutils`, which is deprecated as of Python 3.10 and has
since been removed from the standard library.
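A vendored replacement matching the semantics of `distutils.util.strtobool` might look like the sketch below (the toolkit's actual implementation isn't shown in this thread, so the details are assumptions):

```python
def strtobool(val: str) -> bool:
    """Minimal stand-in for distutils.util.strtobool, which is no longer
    available on recent Python versions. Accepts the same truthy/falsy
    strings; note the original returned the ints 1/0 rather than a bool,
    and this sketch additionally strips surrounding whitespace."""
    val = val.strip().lower()
    if val in ("y", "yes", "t", "true", "on", "1"):
        return True
    if val in ("n", "no", "f", "false", "off", "0"):
        return False
    raise ValueError(f"invalid truth value {val!r}")
```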
@alvarobartt alvarobartt added the enhancement New feature or request label Aug 8, 2024
@alvarobartt alvarobartt self-assigned this Aug 8, 2024
@oOraph (Contributor) left a comment


LGTM, question: under the hood, this trust_remote_code bypasses security flags like weights_only=True in pytorch and stuff right ? (allowing for arbitrary pickle loading)

Where do you intend to activate it ? In one of your hf endpoints using a custom env variable ? (just want to make sure we do not activate it by default there :) )

@alvarobartt (Member, Author)

> LGTM, question: under the hood, this trust_remote_code bypasses security flags like weights_only=True in pytorch and stuff right ? (allowing for arbitrary pickle loading)
>
> Where do you intend to activate it ? In one of your hf endpoints using a custom env variable ? (just want to make sure we do not activate it by default there :) )

It's indeed disabled by default, and must be explicitly enabled on the user's side if desired; it's meant for loading models whose modeling code lives within the Hub repository rather than being supported / integrated directly within transformers, i.e. custom modeling. The use case for this is when users run a job in either Vertex AI or GKE using our custom DLCs, since the PyTorch Inference DLC runs huggingface_inference_toolkit under the hood, so that users can load models such as https://huggingface.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct 🤗

P.S. Thanks for the review!
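The user-side opt-in described above could look like this (a sketch; the container image name is illustrative, not a real DLC reference):

```shell
# Hypothetical: enable remote-code loading for a container running the
# inference toolkit. Without -e HF_TRUST_REMOTE_CODE=true, the flag
# stays disabled and custom-modeling repos will fail to load.
# docker run -e HF_TRUST_REMOTE_CODE=true <pytorch-inference-dlc-image>

# Locally, the same opt-in is just an environment variable:
export HF_TRUST_REMOTE_CODE=true
echo "HF_TRUST_REMOTE_CODE=$HF_TRUST_REMOTE_CODE"
```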

@alvarobartt alvarobartt force-pushed the trust-remote-code-env branch from 9cb211b to c9384c2 on August 8, 2024 10:15
@alvarobartt (Member, Author)

For additional context on this PR, see huggingface/Google-Cloud-Containers#64.

@alvarobartt alvarobartt merged commit d9ae3d9 into main Aug 9, 2024
0 of 6 checks passed
@alvarobartt alvarobartt deleted the trust-remote-code-env branch August 9, 2024 08:15