Releases: google/flax
Version 0.12.2
What's Changed
- [flax:examples:wmt] Small linter fixes. by @copybara-service[bot] in #5012
- [flax:examples:seq2seq] Create main and default config based on seq2seq.ipynb. by @copybara-service[bot] in #5119
- [flax:examples:vae] Small linter fixes. by @copybara-service[bot] in #5014
- [flax:examples:gemma] Fixing linter errors. by @copybara-service[bot] in #5013
- [flax:examples:sst2] Fix pytype errors. by @copybara-service[bot] in #5118
- Allow substring matching in `nnx.PathContains` by @thijs-vanweezel in #5094
- [flax:examples:sst2] Fix notebook error. by @copybara-service[bot] in #5122
- [flax:examples:ppo] Fix some linter / import issues. #jax-fixit by @copybara-service[bot] in #5120
- Avoid passing `concrete` argument to `jax.remat` by @copybara-service[bot] in #5121
- [flax:examples:lm1b_nnx] Update example to work internally. #jax-fixit. by @copybara-service[bot] in #5125
- [flax:examples:nlp_seq] Create a main.py file to run tests with config files to match other examples. #jax-fixit by @copybara-service[bot] in #5126
- [jax:benchmarks] Add tracing/lowering benchmarks for a few flax examples. by @copybara-service[bot] in #4911
- remove abstracted_axes from nnx.jit by @copybara-service[bot] in #5132
- Pooling operation by @jorisSchaller in #5057
- Added is_causal mask argument to flax.nnx.dot_product_attention by @ibbyml in #5093
- Add out_sharding argument to call methods for layers with jax calls that support it by @samanklesaria in #5102
- Temporary fix for failing CI by @vfdev-5 in #5144
- New release 0.12.2 by @IvyZX in #5149
New Contributors
- @thijs-vanweezel made their first contribution in #5094
- @ibbyml made their first contribution in #5093
Full Changelog: v0.12.1...v0.12.2
v0.12.1
Deprecations
Variable.value
Variable.value is now deprecated. Consider the following example:

```python
import jax.numpy as jnp
import jax
from flax import nnx

my_param = nnx.Param({'a': 0.0})

@nnx.jit
def f(m):
  m.value['a'] = 1.0
  return m
```

Running `f(my_param)` produces `Param(value={'a': 0.0})`, not `Param(value={'a': 1.0})` as before. This is because getting the `value` property now returns a copy of the pytree values (like `dict` / `list`), so mutating the copy has no effect on the Variable. Instead, use the `__setitem__` method to update the value:
```python
@nnx.jit
def f(m):
  m['a'] = 1.0
  return m
```
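A quick check of the fixed version; a minimal sketch, assuming reads mirror writes, i.e. `__getitem__` works alongside `__setitem__` for pytree values:

```python
out = f(my_param)
print(out['a'])  # 1.0, the update is now reflected in the Variable
```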
nnx.Data and nnx.Static
`nnx.Data` and `nnx.Static` annotations are now deprecated. To create `nnx.Pytree` or `nnx.Module` dataclasses, use the new `nnx.dataclass` with `nnx.data` and `nnx.static` as field descriptors.
```python
# old
@dataclasses.dataclass
class Foo(nnx.Pytree):
  a: nnx.Data[int]
  b: nnx.Static[str]

# new
@nnx.dataclass
class Foo(nnx.Pytree):
  a: int = nnx.data()
  b: str = nnx.static()
```
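As a quick sanity check of the new style above (a sketch, assuming data fields appear as pytree leaves while static fields are folded into the treedef):

```python
import jax

foo = Foo(a=1, b='name')     # Foo as defined in the "new" snippet above
print(jax.tree.leaves(foo))  # expected: [1]; b='name' rides along as static metadata
```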
Pull Requests
- Clarify `*Norm` layer docstrings: `axis_index_groups` is unused under SPMD jit. by @copybara-service[bot] in #4940
- Move `ArrayRef` creation to the end of `Variable` creation by @IvyZX in #4980
- clean up jax.Ref-related names by @copybara-service[bot] in #4988
- Add compute_flops and compute_vjp_flops options to `nnx.tabulate` by @samanklesaria in #4948
- Fix nnx.tabulate crash with empty dict/None values (fixes #4889) by @mohsinm-dev in #4891
- Future-proof imports of jax.new_ref / jax.Ref. by @copybara-service[bot] in #4986
- Use `jnp.stack` instead of `np.stack` in `flax.training.common_utils.stack_forest` by @vfdev-5 in #4991
- Fixed broken nnx.statelib.diff by @vfdev-5 in #4992
- Implemented spectral norm in NNX by @mattbahr in #4623
- Improve Variable.{get,set}_metadata by @cgarciae in #4985
- Move iter_children and iter_modules to functions by @samanklesaria in #4961
- Avoid install, import, or tests with tensorflow-text under Python 3.13+. by @jburnim in #5001
- disallow setting metadata through settattr by @cgarciae in #4993
- Use sphinx 6.2+ for docs, which works with Python 3.13. by @jburnim in #5009
- Removed kernel_init/bias_init attributes from popular layers by @vfdev-5 in #4998
- Migrate from `jax.experimental.enable_x64` to `jax.enable_x64`. by @copybara-service[bot] in #5011
- Add Rngs KeylessInitializers by @cgarciae in #5017
- optimize scan transpositions by @cgarciae in #5015
- Variable refactor by @cgarciae in #5006
- Remove invalid gymnasium dependency in pyproject.toml by @IvyZX in #5016
- Use jax.shard_map in flax by @copybara-service[bot] in #5020
- use jax.shard_map by @copybara-service[bot] in #5018
- Fix formatting in PR template checklist by @rapsealk in #5024
- Fixed attribute visualization in treescope_repr by @vfdev-5 in #5022
- feat: add `nnx.set_metadata` to in-place change metadata of the state variables of `nnx.Module`s by @pfackeldey in #5007
- Update README to use fully qualified `nnx.Linear` in example by @rapsealk in #5023
- Fix nnx tabulate variable hooks by @mohsinm-dev in #5008
- python 3.13 support by @cgarciae in #4987
- Added a note in nnx.jit about arg donation by @vfdev-5 in #5031
- Add flip doc link to eager sharding error message by @IvyZX in #5033
- fix reseed for abstract values by @cgarciae in #5034
- Deduplicate `Variable` nodes in `iter_graph` and eliminate recursion. by @copybara-service[bot] in #5035
- Support for python 3.14 by @vfdev-5 in #5032
- [docs] Exposed more helper functions/classes in state.rst by @vfdev-5 in #5037
- Copybara import of the project: by @copybara-service[bot] in #5041
- Internal change by @copybara-service[bot] in #5048
- filter grad state in nnx.Optimizer by @copybara-service[bot] in #5049
- Add NNX WeightNorm (update of #4568) by @samanklesaria in #5043
- Fix shard_map documentation link in compilation.py by @vfdev-5 in #5038
- Fix ValueError when `nnx.jit` is used with `nnx.custom_vjp` by @samanklesaria in #5045
- Recursive map by @chapman20j in #5042
- Convert linen pytorch guide to nnx by @samanklesaria in #4999
- Set Mode with Tests by @chapman20j in #5056
- Fixing Optimizer docstring - fixing #5060 by @Lucas-Fernandes-Martins in #5061
- Update tutorial examples to thread explicit RNGs by @samanklesaria in #4975
- Fix NNX jit static args with in_shardings issue #4989 by @mohsinm-dev in #4996
- support explicit sharding in eager sharding by @cgarciae in #5070
- Added missing LayerNorm test case into TestLayersSameGraph by @vfdev-5 in #5076
- fix main by @cgarciae in #5081
- docs: Document `allow_duplicates` argument of `nnx.to_arrays`. by @dan-zheng in #5083
- add promote_dtype to all standard layers by @cgarciae in #5080
- add nnx.dataclass by @cgarciae in #5066
- Expand ConvTranspose padding documentation by @samanklesaria in #4990
- Added kernel_metadata/bias_metadata args to nnx layers by @vfdev-5 in #5074
- Add nnx.use_eager_sharding context manager by @samanklesaria in #5079
- fix main by @cgarciae in #5090
- Adding set_mode_info by @chapman20j in #5071
- Fixed nnx.scan with carry as pytree and sow by @vfdev-5 in #5073
- Fix bound method auto-unbinding for NNX transforms by @mohsinm-dev in #5055
- deprecate Variable.value by @cgarciae in #5052
- Add eq for variables by @samanklesaria in #5084
- Fixed deprecated .value usage failing CI tests by @vfdev-5 in #5097
- update jax minver to 0.8.1 by @cgarciae in #5095
New Contributors
- @samanklesaria made their first contribution in #4948
- @jburnim made their first contribution in #5001
- @rapsealk made their first contribution in #5024
- @pfackeldey made their first contribution in #5007
- @chapman20j made their first contribution in #5042
- @Lucas-Fernandes-Martins made their first contribution in #5061
Full Changelog: v0.12.0...v0.12.1
0.12.0
Flax 0.12.0 includes many updates and some important breaking changes to the NNX API.
Breaking Changes
Pytree Strict Attributes
`nnx.Pytree`, and therefore `nnx.Module`, are now stricter about attributes that contain Arrays and about changing the status of attributes. For example, the code below now fails:
```python
from flax import nnx
import jax
import jax.numpy as jnp

class Foo(nnx.Module):
  def __init__(self, use_bias, rngs):
    self.layers = [  # ERROR
      nnx.Linear(3, 3, rngs=rngs) for _ in range(5)
    ]
    self.bias = None  # status = static
    if use_bias:
      self.bias = nnx.Param(rngs.params.uniform(3,))  # ERROR
```

This happens for two reasons:
- JAX pytree structures that contain Arrays now have to be marked with `nnx.data`. Alternatively, if the container pytree is a `list` or a `dict`, you can use `nnx.List` or `nnx.Dict`, which additionally allow mixed "data" and "static" elements.
- Attributes will no longer automatically change their status; this now has to be done explicitly using `nnx.data` or `nnx.static`. Additionally, assigning Arrays or structures with Arrays to static attributes is now an error, as they will not automatically change to data.
To fix the above, create `layers` as an `nnx.List` (a Module that is automatically recognized as data), and be explicit about `bias` being a data attribute on its first assignment by using `nnx.data`:
```python
class Foo(nnx.Module):
  def __init__(self, use_bias, rngs):
    self.layers = nnx.List([  # nnx.data also works but List is recommended
      nnx.Linear(3, 3, rngs=rngs) for _ in range(5)
    ])
    self.bias = nnx.data(None)
    if use_bias:
      self.bias = nnx.Param(rngs.params.uniform(3,))
```

For more information, check the Module & Pytree guide.
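To illustrate the second rule, changing an attribute's status now has to be spelled out explicitly. A minimal sketch (the `Bar` class and the exact error sites are illustrative assumptions based on the rules above):

```python
import jax.numpy as jnp
from flax import nnx

class Bar(nnx.Module):
  def __init__(self):
    self.tag = 'bar'  # first assignment is a str, so the attribute is static

bar = Bar()
# bar.tag = jnp.zeros((3,))          # ERROR: an Array cannot be assigned to a static attribute
bar.tag = nnx.data(jnp.zeros((3,)))  # OK: explicitly flip the attribute's status to data
```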
Eager Sharding
Variables will now eagerly shard their values when `sharding_names` metadata is provided. A mesh is required; it can be provided either by passing a `mesh` metadata attribute or by setting the global mesh context via `jax.set_mesh`. This simplifies sharding a Variable by moving it to construction time:
```python
jax.config.update('jax_num_cpu_devices', 8)

mesh = jax.make_mesh((2, 4), ('data', 'model'))
with jax.set_mesh(mesh):
  variable = nnx.Param(jnp.ones((16, 32)), sharding_names=(None, 'model'))
  print(variable.value.sharding)
```

Eager sharding will also occur when using the `nnx.with_partitioning` initializer decorator, and will automatically extend to the Optimizer. This means that both model and optimizer will be sharded at construction, without the need for the somewhat cumbersome `nnx.get_partition_spec` + `jax.lax.with_sharding_constraint` + `nnx.update` pattern:
```python
import optax

with jax.set_mesh(mesh):
  linear = nnx.Linear(
    in_features=16, out_features=16, use_bias=False,
    kernel_init=nnx.with_partitioning(
      nnx.initializers.lecun_normal(), (None, 'model')
    ),
    rngs=nnx.Rngs(0),
  )
  optimizer = nnx.Optimizer(linear, optax.adam(1e-3), wrt=nnx.Param)

print(linear.kernel.value.sharding)
print(optimizer.opt_state[0].mu.kernel.value.sharding)
```

For projects that currently rely on other means for sharding, eager sharding can be turned off by passing `eager_sharding=False` to the Variable constructor, either directly or through initializer decorators like `nnx.with_partitioning`:
```python
linear = nnx.Linear(
  in_features=16, out_features=16, use_bias=False,
  kernel_init=nnx.with_partitioning(
    nnx.initializers.lecun_normal(), (None, 'model'), eager_sharding=False
  ),
  rngs=nnx.Rngs(0),
)
optimizer = nnx.Optimizer(linear, optax.adam(1e-3), wrt=nnx.Param)

print(linear.kernel.value.sharding)
print(optimizer.opt_state[0].mu.kernel.value.sharding)
```

Eager sharding can also be turned off globally via the `flax_always_shard_variable` config flag or the `FLAX_ALWAYS_SHARD_VARIABLE` environment variable:

```python
import flax

flax.config.update('flax_always_shard_variable', False)
```

For more information, check out the Variable eager sharding FLIP.
In-Place Operators No Longer Allowed
In-place operators will now raise an error. This is done as part of the push for Variables to be compatible with Tracer semantics:
```python
w = nnx.Variable(jnp.array(0))
w += 1  # ERROR
```

The fix is to simply operate on the `.value` property instead:

```python
w.value += 1
```

All Changes
- Doc fix: remove dead link to pre-Orbax checkpointing. by @copybara-service[bot] in #4914
- Fix typo in unflatten docs by @copybara-service[bot] in #4918
- fix RNN by @copybara-service[bot] in #4917
- Update optimizer.py to support masked variable from optax. by @ywrt in #4904
- Added missing functions to graph.rst by @vfdev-5 in #4922
- Update flax/docs_nnx/guides/performance.md and .ipynb by @hanrach9 in #4919
- Added preferred_element_type arg to nnx.Linear*, nnx.Conv*, nnx.Einsum by @vfdev-5 in #4920
- Update README badges and remove invalid ones by @IvyZX in #4905
- static + pytree guide by @cgarciae in #4897
- fix mypy by @copybara-service[bot] in #4931
- Avoid passing non-boolean mask to `where` argument of `jax.numpy` reductions. Non-boolean mask inputs have been deprecated for several releases, and will result in an error starting in JAX v0.8.0. by @copybara-service[bot] in #4923
- Ported nnx.PReLU from linen by @vfdev-5 in #4934
- Added nnx.scan docs and few minor docs fixes by @vfdev-5 in #4930
- add variables argument to nnx.clone by @cgarciae in #4945
- only copy dicts on State.getitem by @cgarciae in #4946
- always differentiate standalone Variables in nnx.grad by @cgarciae in #4947
- Implement instance norm in NNX by @mattbahr in #4939
- Automatically apply sharding constraints to sharded models by @IvyZX in #4844
- Add reference of flip doc to gspmd guide by @IvyZX in #4949
- Fixed nnx.is_data docstring rendering by @vfdev-5 in #4957
- expose pytree guide by @cgarciae in #4951
- fix toy examples by @cgarciae in #4952
- Explicitly cast attribute names to string before checking for private attributes. by @copybara-service[bot] in #4955
- add flax_hijax_variable flag by @cgarciae in #4953
- mark shard_map as implemented in transforms guide by @cgarciae in #4738
- improve Variable flatten by @cgarciae in #4954
- Minor typo fix in nnx.call docstring by @vfdev-5 in #4959
- allow split tuples in Rngs.fork by @cgarciae in #4958
- Fixed Gemma example using Gemma2 models by @vfdev-5 in #4830
- finish pytree guide by @cgarciae in #4929
- update bridge wrappers from maxtext by @cgarciae in #4937
- fix HashableMapping hash definition for mixed key types by @copybara-service[bot] in #4936
- Flax RNG guide for jax.jit: clarify rng outputs are shared but not inputs. by @copybara-service[bot] in #4956
- fix Variable pytree flatten by @copybara-service[bot] in #4962
- import PathParts from flax.typing by @cgarciae in #4966
- Correctly expose `flax.config.temp_flip_flag` by @IvyZX in #4969
- raise on Variable inplace operators by @cgarciae in #4967
- Copybara import of the project: by @copybara-service[bot] in #4976
- update to version 0.12.0 by @cgarciae in #4982
- Minor typo fixes in flax gspmd guide by @vfdev-5 in #4970
- ignore uv.lock by @copybara-service[bot] in #4974
- [nnx] preserve the function's type information in jit by @cgarciae in #4981
- add Variable.set_metadata by @cgarciae in #4968
- propagate eager sharding by @cgarciae in #4983
New Contributors
Full Changelog: v0.11.2...v0.12.0
0.11.2
What's Changed
`nnx.merge` no longer creates a copy of the Variables in the incoming states by default, meaning that the new merged structure holds references to the incoming Variables. This enables new patterns; for example, it's now possible to create models that share the same state but have different runtime behavior:
```python
model = SomeModel(...)

# create eval model
eval_model = nnx.merge(*nnx.split(model))  # same Variables, different structure
eval_model.eval()
```

model and eval_model share the same Variables and are therefore kept in sync, but have different runtime behavior. This avoids having to constantly mutate a single model back and forth between different runtime modes, which can be error prone and cause unwanted recompilation.
To keep the old behavior, use `nnx.merge(..., copy=True)`.
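As a minimal sketch of the shared-Variable behavior, using `nnx.Linear` as a stand-in for the hypothetical `SomeModel` above:

```python
import jax.numpy as jnp
from flax import nnx

model = nnx.Linear(2, 3, rngs=nnx.Rngs(0))
eval_model = nnx.merge(*nnx.split(model))  # no copy: shares model's Variables

# an update through one view is visible through the other
model.kernel.value = jnp.zeros((2, 3))
assert (eval_model.kernel.value == 0).all()
```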
PRs
- add Rngs random helpers by @cgarciae in #4876
- Fix re-export and docs for identity by @jlperla in #4850
- Fix ToLinen docstring return description by @mohsinm-dev in #4852
- Update doc build instructions and clean up unused packages by @IvyZX in #4885
- Improve docs related with dataclasses by @IvyZX in #4884
- Fix broken contributing documentation link by @mohsinm-dev in #4855
- Internal change by @copybara-service[bot] in #4886
- Fix string key preservation in replace_by_pure_dict by @mohsinm-dev in #4860
- Remove the need for Conv and ConvTranspose to know the precise batch size. by @copybara-service[bot] in #4877
- call jax's source_info_util.register_exclusion in flax's traceback_util.register_exclusion by @copybara-service[bot] in #4887
- Update typo in nnx.Optimizer by @codinfox in #4880
- Exposed split_rngs docstring in the docs_nnx by @vfdev-5 in #4846
- Pin sentencepiece version to 0.2.0 to fix head by @IvyZX in #4892
- Relax duplicate check to exclude non-string values such as PartitionSpec.UNCONSTRAINED, since those can be repeated. by @copybara-service[bot] in #4881
- add find_duplicates by @cgarciae in #4894
- Sharding API improvements (non breaking) by @IvyZX in #4893
- document jax.random shorthand methods by @cgarciae in #4899
- Optimiser was already instantiated using the model - 05_vae.py by @nenuadrian in #4857
- revert is_leaf logic in _check_carry_same_references by @copybara-service[bot] in #4903
- Doc fix: remove outdated advice on flax v0.6.10; it was released two years ago. by @copybara-service[bot] in #4910
- Fix bug when raising ScopeParamNotFoundError. by @copybara-service[bot] in #4898
- fix mypy on main by @cgarciae in #4909
- merge no copy Variables by @cgarciae in #4912
- update version to 0.11.2 by @copybara-service[bot] in #4915
New Contributors
- @mohsinm-dev made their first contribution in #4852
- @codinfox made their first contribution in #4880
- @nenuadrian made their first contribution in #4857
Full Changelog: v0.11.1...v0.11.2
v0.11.1
What's Changed
- Make `Sequential()` be identity by @SobhanMP in #4796
- Add a JAX/Flax key concepts doc by @IvyZX in #4795
- miscellaneous improvements by @cgarciae in #4859
- Replace `jax.sharding.use_mesh` with `jax.set_mesh`. `jax.set_mesh` can act as a global setter or a context manager. by @copybara-service[bot] in #4862
- Pytree and ArrayRef refactor by @cgarciae in #4863
- Add old property attributes for object->pytree rename. by @copybara-service[bot] in #4864
- Add BatchNorm layers to CNN in MNIST tutorial for improved training stability by @sanepunk in #4773
- Description by @copybara-service[bot] in #4866
- update and pop for dict by @cgarciae in #4869
- simplify nnx_basics by @cgarciae in #4868
- updates to version 0.11.1 by @cgarciae in #4878
New Contributors
Full Changelog: v0.11.0...v0.11.1
v0.11.0
v0.11.0 - Pytrees, MutableArrays, and more!
This version of Flax introduces some changes to improve interop with native JAX and adds support for the new jax.experimental.MutableArray. More on this soon! However, some breaking changes were necessary to align with the JAX way of doing things. Most code should remain intact; the following changes deviate from previous behavior:
- `Rngs` in standard layers: all standard layers no longer hold a shared reference to the `rngs` object given in the constructor; instead they now keep a `fork`-ed copy of the `Rngs` or `RngStream` objects. This impacts Using Rngs in NNX Transforms and Loading Checkpoints with RNGs.
- Optimizer Updates: the Optimizer abstraction no longer holds a reference to the `model` to avoid reference sharing; instead, the `model` must be provided as the first argument to `update` (see the sketch after this list).
- Modules as Pytrees: Modules are now pytrees! This avoids unnecessary use of `split` and `merge` when interacting trivially with raw JAX transforms (state must still be manually propagated if not using MutableArrays, and referential transparency is still an issue). This affects code that operates on pytrees containing NNX Objects with `jax.tree.*` APIs.
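For the Optimizer change, a minimal sketch of the new calling convention (the model, loss, and hyperparameters here are illustrative):

```python
import jax.numpy as jnp
import optax
from flax import nnx

model = nnx.Linear(2, 3, rngs=nnx.Rngs(0))
optimizer = nnx.Optimizer(model, optax.adam(1e-3), wrt=nnx.Param)

def loss_fn(model):
  y = model(jnp.ones((1, 2)))
  return (y ** 2).mean()

grads = nnx.grad(loss_fn)(model)
optimizer.update(model, grads)  # the model is now passed explicitly to update
```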
Check out the full NNX 0.10 to NNX 0.11 migration guide.
In the near future we'll share more information about new ways of using NNX with JAX transforms directly by leveraging the new Pytree and MutableArray support. Stay tuned!
What's Changed
- [nnx] mutable array p3 by @cgarciae in #4755
- [nnx] allow method calls in ToLinen by @cgarciae in #4808
- Internal change by @copybara-service[bot] in #4807
- Preserve sharding information in axes_scan by @copybara-service[bot] in #4806
- Deduplicate contributing and philosophy and move to main site by @IvyZX in #4809
- Fixed nnx.remat docstring rendering by @vfdev-5 in #4790
- Added a note to gemma guide about model's license consent on kaggle by @vfdev-5 in #4776
- [nnx] ToLinen add abtract_init flag by @cgarciae in #4813
- Modify NNX to use id(variable) instead of nnx.Variables as dictionary by @divyashreepathihalli in #4814
- Allow using LazyRngs for flax init/apply. by @copybara-service[bot] in #4818
- [nnx] remove VariableState by @cgarciae in #4800
- Fix failing CI jobs: trailing whitespace, deprecated `.type` usage by @vfdev-5 in #4823
- [nnx] fix Rngs dtype check by @cgarciae in #4820
- refactor: move usages of `.value` to `[...]` in modules_test.py by @lukeyeh in #4815
- Added training script for Gemma model by @vfdev-5 in #4822
- [nnx] add flax_pytree_module flag by @cgarciae in #4811
- create ModelAndOptimizer symbol by @copybara-service[bot] in #4849
- [nnx] remove Optimizer.model attribute by @cgarciae in #4842
- [nnx] add mutable array support in update by @cgarciae in #4851
- Migrate `transforms_test.py` from `.value` to `[...]` by @lukeyeh in #4841
- 0.11.0 migration guide by @cgarciae in #4854
New Contributors
- @divyashreepathihalli made their first contribution in #4814
- @lukeyeh made their first contribution in #4815
Full Changelog: v0.10.7...v0.11.0
0.10.7
What's Changed
- Added identity export from JAX by @jlperla in #4652
- Fixes a bug in type annotations for scope.param (unbox=True should accept `Callable[..., T | AxisMetadata[T]]` and return `T`, while unbox=False should always return the same type as the callable returns). by @copybara-service in #4727
- fix merge by @copybara-service in #4731
- [nnx] make Variable a pytree by @cgarciae in #4728
- [nnx] add JitWrapped API by @cgarciae in #4699
- Update JAX nightly index usage by @copybara-service in #4733
- [nnx] mutable array p1 by @cgarciae in #4715
- add dataclass by @copybara-service in #4739
- [flax] unconditionally register nnx.Variable as a pytree by @copybara-service in #4748
- Updated version of pre-commit-hooks in .pre-commit-config.yaml by @vfdev-5 in #4746
- Fixed docstring visibility for nnx.eval_shape by @vfdev-5 in #4747
- Added keep_rngs arg to MHA to optionally store rngs by @vfdev-5 in #4749
- MultiHeadAttention only keeps rngs if dropout_rate is positive by @copybara-service in #4750
- [nnx] mutable array p2 by @cgarciae in #4741
- Add in_kv_features argument to nnx.MultiHeadAttention, addressing #4756. by @copybara-service in #4757
- Fix broken link for Transforms guide by @nireekshak in #4763
- Minor improvements of lm1b_nnx example by @vfdev-5 in #4745
- Fix head CI tests by @IvyZX in #4764
- Fix typos by @nireekshak in #4725
- Check for leaves of type variablelib.Variable when getting sharding specs. by @copybara-service in #4769
- Fixes #1925: non-str dict keys not supported in module state by @muhrin in #4563
- Modified the Functional API link by @nireekshak in #4767
- Fix hardcoded link to filter guide in docs by @hamogu in #4768
- Fix bad doc links by @IvyZX in #4770
- revise axes_scan to flatten argument pytrees only once by @copybara-service in #4772
- Simplify ToNNX access of Linen module methods by @IvyZX in #4766
- Use `.input_formats` and `.output_formats` in place of `.input_layouts` and `.output_layouts` respectively. by @copybara-service in #4784
- Exposed OptState in nnx module by @vfdev-5 in #4788
- Fixes colab link for nnx docs by @vfdev-5 in #4775
- Internal changes by @copybara-service in #4786
- Fix typo in Flax `nnx_basics` doc. by @copybara-service in #4781
- update version to 0.10.7 by @cgarciae in #4798
New Contributors
- @nireekshak made their first contribution in #4763
- @muhrin made their first contribution in #4563
- @hamogu made their first contribution in #4768
Full Changelog: v0.10.6...v0.10.7
0.10.6
What's Changed
- Sow top activations based on absolute value. by @copybara-service in #4670
- Add support for layer-specific rope scale factors. by @copybara-service in #4672
- Automatic model selection for Gemma 3 models. by @copybara-service in #4671
- Make LoRA's dtype arg useful by @IvyZX in #4681
- [NVIDIA] Support FP8 Einsum Op by @kaixih in #4686
- [nnx] remove deprecated APIs by @cgarciae in #4627
- Add `attention_bias` parameter to `MultiHeadDotProductAttention`. by @copybara-service in #4694
- Unit tests for `attention_bias` parameter to `MultiHeadDotProductAttention`. Add parameter to all overloads to make pytype happy. by @copybara-service in #4702
- Rollback of attention_bias parameter, because the change overrides the attention bias for injected attention functions. by @copybara-service in #4703
- Add custom einsum op to Einsum() by @IvyZX in #4705
- [nnx] refactor GraphDef by @cgarciae in #4630
- Make fully replicated array before saving checkpoints for examples that use pmap. by @copybara-service in #4707
- Fix CI by @cgarciae in #4716
- remove "nnx" collection in ToLinen by @copybara-service in #4708
- [nnx] flaxlib types by @cgarciae in #4639
- v0.10.6 by @cgarciae in #4724
Full Changelog: v0.10.5...v0.10.6
0.10.5
What's Changed
- [nnx] fix tabulate by @cgarciae in #4580
- Refactor bridge.Module tests from `wrappers_test.py` to another file. by @copybara-service in #4581
- Avoid calls to jnp.shape for non-array inputs. by @jakevdp in #4592
- remove Embed nan casting by @cgarciae in #4600
- Add QK Norm. by @copybara-service in #4594
- Util to let bridge module work with NNX submodules by @IvyZX in #4584
- Add configurable Query Pre Attention scalar. by @copybara-service in #4595
- Make RoPE Base Frequency configurable. by @copybara-service in #4596
- [nnx] pytrees are graph nodes by @cgarciae in #4547
- Add option to load checkpoints with transposed Gating Einsum. by @copybara-service in #4597
- add top_p sampling in gemma example by @copybara-service in #4591
- Fix position and name of Post Attention Norm. by @copybara-service in #4598
- Add Sow Config to from_params constructor. by @copybara-service in #4599
- bridge module with linen submodule by @IvyZX in #4604
- Dramatically speed up sampling compilation time by @copybara-service in #4574
- [nnx] improve grad docs by @cgarciae in #4588
- [nnx] add support for standalone Variables by @cgarciae in #4606
- add promote_dtype as a config option for multiple layers by @cgarciae in #4613
- Copybara import of the project: by @copybara-service in #4616
- Fixed typo in `beam_search` loop. by @copybara-service in #4615
- support swap model params in gemma sampler by @copybara-service in #4614
- Allow bridge module to have 'name' field by @IvyZX in #4619
- fix performance guide by @cgarciae in #4621
- Copybara import of the project: by @copybara-service in #4618
- Add REFLECT padding to convolution layer by @sarlinpe in #4553
- fix trace-level detection by @cgarciae in #4527
- Add attribute path customization to bridge modules by @IvyZX in #4624
- add reprlib max depth flag by @cgarciae in #4632
- Allow custom axis metadata annotation during transforms by @IvyZX in #4637
- [bridge module] Allow name arg to represent actual submodule path by @IvyZX in #4634
- [nnx] improve Variable proxy for binary operations by @cgarciae in #4641
- Fix module stack typing annotation. by @copybara-service in #4633
- Stop passing reduce_axes to jax.grad, jax.vjp, and jax.value_and_grad. by @copybara-service in #4617
- discord release webhook by @cgarciae in #4646
- [nnx] support Array leaves in graph nodes by @cgarciae in #4612
- Roll up package jax version and uv.lock by @IvyZX in #4648
- Use jax.nn.dot_product_attention when possible by @IvyZX in #4649
- Fix flaky vmap test tolerance. by @copybara-service in #4653
- Test runner ubuntu upgrade 24.04 by @IvyZX in #4659
- Fix lazy_init typo by @IvyZX in #4657
- deflake a test by @copybara-service in #4663
- v0.10.5 by @cgarciae in #4656
New Contributors
Full Changelog: v0.10.4...v0.10.5
Release 0.10.4
What's Changed
- update pypi publish by @cgarciae in #4538
- [nnx] register_variable_name refactor by @copybara-service in #4540
- added support to the accuracy metric for binary classification by @mattbahr in #4536
- [nnx] bridge Module by @cgarciae in #4542
- [nnx] copy _var_metadata by @copybara-service in #4548
- [bridge] fix unbox logic by @copybara-service in #4551
- Add `is_initializing` API by @copybara-service in #4550
- [nnx] Add specific model typing for nnx.Optimizer by @marcelroed in #4470
- Add linen metadata conversion to linx by @IvyZX in #4552
- [bridge] improve Module context by @cgarciae in #4554
- Raise error if user uses 'name' in bridge module setup by @IvyZX in #4555
- Add deprecation warning to all `nnx.State` methods by @IvyZX in #4561
- [nnx] add shard_map by @cgarciae in #4490
- Fix CI breakages from newest jax by @IvyZX in #4576
- [bridge] Set _initializing correctly and avoid return RNG states by @copybara-service in #4569
- v0.10.4 by @cgarciae in #4579
New Contributors
- @mattbahr made their first contribution in #4536
- @marcelroed made their first contribution in #4470
Full Changelog: v0.10.3...v0.10.4