@spmex commented Dec 19, 2025

Summary:

Context

In BaseEmbeddingConfig, the default weight min/max is now determined by sqrt(1 / self.embedding_dim). Previously it was sqrt(1 / self.num_embeddings), which can become very small when the number of embeddings is large.
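
To make the difference concrete, here is a minimal sketch comparing the two defaults. It does not reproduce the `BaseEmbeddingConfig` code: `default_weight_bounds` is a hypothetical helper, the table shape is made up for illustration, and the sketch assumes the min/max form a symmetric range `[-bound, +bound]`.

```python
import math
from typing import Tuple


def default_weight_bounds(size: int) -> Tuple[float, float]:
    """Return (min, max) of a symmetric range with half-width sqrt(1 / size).

    Hypothetical helper for illustration only; the symmetric range is an
    assumption, not taken from the BaseEmbeddingConfig source.
    """
    bound = math.sqrt(1 / size)
    return -bound, bound


# Hypothetical table shape: 10 million rows, 64-dimensional embeddings.
num_embeddings = 10_000_000
embedding_dim = 64

# Previous default: scales with the number of rows in the table.
old_min, old_max = default_weight_bounds(num_embeddings)
# Default after this change: scales with the embedding dimension.
new_min, new_max = default_weight_bounds(embedding_dim)

print(f"old: [{old_min:.6f}, {old_max:.6f}]")
print(f"new: [{new_min:.6f}, {new_max:.6f}]")
```

For this shape the old bound is roughly 3.2e-4 while the new bound is 0.125, and the new bound no longer shrinks as the table grows.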

References

See more discussions in the GitHub PR (meta-pytorch#1902).

Differential Revision: D89562350

@meta-cla meta-cla bot added the CLA Signed label Dec 19, 2025
meta-codesync bot commented Dec 19, 2025

@spmex has exported this pull request. If you are a Meta employee, you can view the originating Diff in D89562350.

