Make deviator() and second_invariant() consistent with plane strain assumption in 2D #6471

YiminJin · 2025-06-17T16:49:48Z

This PR modifies the functions deviator() and second_invariant() to make them consistent with the plane strain assumption in 2D. The modified functions are placed in namespace Utilities::Tensors. For details of the mathematics, please refer to #6434 and #6459. A direct impact of this modification is that it sharpen the shear bands in plastic models with nonzero dilation angle.

This PR is a follow-up of #6373, but it will affect other material models. Also, we need several test problems to show the plane strain functions work well (or to find out if we can do better), which have not been implemented. Please make comments if you have suggestions or questions about this PR.

bangerth · 2025-06-17T17:39:47Z

Others know this area much better than me, but in any case, this deserves a changelog entry.

gassmoeller

Thank you for tackling these questions. I left a few comments. I think this is the right approach, but there are still some places that compute the deviatoric strain rate as strain rate - 1/dim * trace, please fix those as well.

In light of the discussion in #6434 I think this is the right path forward, but like Wolfgang commented, please add a changelog entry. And we will likely have to update a lot of test results.

gassmoeller · 2025-06-17T19:54:13Z

source/material_model/grain_size.cc

        }

      const double strain_rate_dependence = (1.0 - dislocation_creep_exponent[phase_index]) / dislocation_creep_exponent[phase_index];
      const SymmetricTensor<2,dim> shear_strain_rate = strain_rate - 1./dim * trace(strain_rate) * unit_symmetric_tensor<dim>();


here we still compute the shear strain rate as strain_rate -1./dim *trace. Does this need to be changed as well? If so, there are several places in the code base where we compute the shear strain rate in this way. Please search for /dim in ASPECT's directories and check which places need fixes.

Thank you for pointing this out. I found two more 1/dim * trace expressions in material models, and they are all replaced by Utilities::Tensors::deviator(). Also, a changelog is added for these modifications.

gassmoeller · 2025-06-17T19:58:06Z

tests/spiegelman_fail_test.cc

-    compute_second_invariant(const SymmetricTensor<2,dim> strain_rate, const double min_strain_rate) const
+    compute_Utilities::Tensors::deviatoric_tensor_inv2(const SymmetricTensor<2,dim> strain_rate, const double min_strain_rate) const
    {
      const double edot_ii_strict = std::sqrt(strain_rate*strain_rate);


why do we have a different way to compute the second invariant here? This way of computing the invariant also appears in the spiegelman benchmark cases. We should figure out in which way this is identical or different to the previous form.

It's worth mentioning that @cedrict and I looked into this at some point in the past and found that people have all sorts of incompatible definitions of the second invariant. Sometimes it included a factor of 2, sometime it didn't, and similar shenanigans. I would suggest sticking closely to the definition we had previously used, and only change 1/dim to 1/3.

Sorry, this is my miss.

gassmoeller · 2025-06-17T19:59:40Z

tests/spiegelman_fail_test.cc

    {
      public:
-        double compute_second_invariant(const SymmetricTensor<2,dim> strain_rate, const double min_strain_rate) const;
+        double compute_Utilities::Tensors::deviatoric_tensor_inv2(const SymmetricTensor<2,dim> strain_rate, const double min_strain_rate) const;


you probably didnt mean to rename this function, right?

No, it is my mistake. I just used a script to replace the string second_invariant by Utilities::Tensors::deviatoric_tensor_inv2 in all the files.

Do you think the second invariant of strain rate tensor should be changed here? I do not know if Spiegelman use a special function here (the function calculates the norm of the strain rate tensor).

gassmoeller

Just a few more thoughts I had after the review.

gassmoeller · 2025-06-17T20:10:09Z

include/aspect/utilities.h

+       * strain assumption when dim = 2. Specifically, the second invariant of the deviatoric
+       * stress tensor $\tau_{II}$ in 2D is given by $\tau_{II} = -\frac{1}{2}(\tau_{11}^2 +
+       * \tau_{22}^2 + \tau_{33}^2 + 2\tau_{12}^2) = -\frac{1}{2}[\tau_{11}^2 + \tau_{22}^2 +
+       * (\tau_{11} + \tau_{22})^2 + 2\tau_{12}^2]$ under the plane strain assumption.


Add one sentence that in 3D this function simply returns the usual second invariant.

gassmoeller · 2025-06-17T20:10:43Z

include/aspect/utilities.h

+       * deal.II, this function is consistent with the plane strain assumption when dim = 2.
+       * Specifically, the deviator of the stress tensor $\mathbf\tau$ in 2D is given by
+       * $\text{dev}(\mathbf\tau) = \mathbf\tau - \frac{1}{3}\text{trace}(\mathbf\tau)\mathbf 1$
+       * under the plane strain assumption.


Add one sentence that in 3D this returns the usual definition of a deviatoric tensor.

The comments for the two functions are revised accordingly.

gassmoeller · 2025-06-17T20:12:12Z

include/aspect/utilities.h

+       */
+      template <int dim>
+      double
+      deviatoric_tensor_inv2(const SymmetricTensor<2,dim> &input);


I am unsure about this name, maybe second_invariant_plane_strain would explain better what the purpose of this function is?

I do not know. I agree that plane_strain should be added to the name, but deviatoric should also be added, because the function returns the correct result only when the input tensor is deviatoric. Yet, we are not able to check if the input tensor is deviatoric, for the deviator of 2D plane strain tensor is not "deviatoric" in the common sense...

YiminJin · 2025-06-18T04:56:01Z

Completion of this PR is harder than expected. Currently there are two major problems:

The names of the plane-strain-consistent functions. After discussing with @gassmoeller and @bangerth , I decide to use consistent_deviator and consistent_second_invariant_of_deviatoric_tensor temporarily. Apparently, the latter is not a good name. I want to include both "consistent" and "deviatoric tensor" in the name to remind the users that this function only works for deviatoric tensors consistent with plane strain assumption in 2D. Could anyone help me find a better name for this function?
The viscosity derivative w.r.t. strain rate in material models should be modified. Currently, we calculate the derivative w.r.t. deviatoric strain rate $\varepsilon'$. In order to do so, we need to:
(a) calculate the deviatoric strain tensor, $\varepsilon'=dev(\varepsilon)$;
(b) add a small increment, $\tilde\varepsilon' = \varepsilon' + d\varepsilon'$;
(c) feed it to the function that calculate the viscosity, $\eta = \eta(\tilde\varepsilon')$.
However, in most cases, the viscosity is dependent on the deviatoric strain tensor, i.e. the function $\eta(\cdot)$ will call the function $dev(\cdot)$ again. This will cause problem, because the deviator of a deviatoric tensor in the plane-strain sense is no longer itself again:
$dev(dev(\varepsilon)) \neq dev(\varepsilon)$.
Now I have only modified the visco plastic model, and the Newton solver works. There are other material models, such as the drucker prager model, the grain size model, and some models in tests and benchmarks need to be modified.
@MFraters Could you please take a look at this comment and check if it is correct? If it is correct, could you help me modify the Spiegelman benchmark? I think Spiegelman intended to use plane strain assumption in his original paper (Eq. (4)).

gassmoeller

Just a few small comments I saw while reading through the PR.

gassmoeller · 2025-06-18T20:24:27Z

source/material_model/rheology/elasticity.cc


        const SymmetricTensor<2, dim>
-        edot_deviator = deviator(strain_rate) + 0.5 * stress_0_advected / elastic_viscosity
+        edot_deviator = strain_rate + 0.5 * stress_0_advected / elastic_viscosity


was this the place where you removed the deviator on purpose?

Yes. I think we need to be more careful when using consistent_deviator(), since we cannot apply it more than once to a symmetric tensor. I move the deviator here to the place where it is called --- MaterialModel::Rheology::ViscoPlastic::compute_isostrain_viscosities (`source/material_model/rheology/visco_plastic.cc, line 310). But I need to think this over, because under this change the Newton assembler gets the full strain rate instead of deviatoric strain rate.

By the way, is the name of the variable now wrong?

So instead of strain_rate it should be something like consistent_deviator_of_strain_rate or just deviatoric_strain_rate?

I think this comment is not addressed yet. The name of the variable is edot_deviator, but it now uses the full strain rate. Does that name of the variable have to be changed?

gassmoeller · 2025-06-18T20:29:19Z

source/utilities.cc

        return t;
      }
+
+


add another empty line here

gassmoeller · 2025-06-18T20:29:28Z

source/utilities.cc

+        return output;
+      }
+
+


and another empty line here

Sure. I also add another empty line between to_voigt_stiffness_vector and levi_civita to make the format consistent.

gassmoeller · 2025-06-18T21:10:17Z

source/simulator/assemblers/newton_stokes.cc


              if (enable_elasticity)
-                data.local_rhs(i) += ( deviator(elastic_out->elastic_force[q])
+                data.local_rhs(i) += ( elastic_out->elastic_force[q]


why is this no longer the deviatoric force?

Actually I do not know. But the Stokes also uses the full elastic force, and the deviatoric elastic force does not produce sharp shear bands in the strip footing test. I need some time to figure out why.

What did you find?

gassmoeller · 2025-06-18T21:46:31Z

A plan forward for this PR that I just discussed with @YiminJin:

We agree that this change fundamentally is the right path forward.
@YiminJin will upload the changed test results to this PR so we have an overview of the changes caused by this PR at the moment.
Material models should not hand over deviatoric strain rates to subfunctions, because applying the deviator again changes the results as @YiminJin described above. If a strain rate is expected as input of a function, it needs to be the full strain rate. This will affect the viscosity derivative computation in some material models like visco_plastic.
We need to test what the change in 3. will mean for the convergence rate and stability of the Newton solver. Benchmarks to re-run would be the Spiegelman benchmark and the nonlinear channel flow benchmark. We should also check if we have a compressible benchmark with the Newton solver and if that still gives the correct results.

We want to make sure this PR is tested and merged before the next release, since it fixes an important bug in the current implementation.

YiminJin · 2025-06-19T14:59:04Z

@gassmoeller The reasons for removing the deviator when calculating the viscoelastic strain rate and assembling the system rhs are:

When the material model is incompressible, the top-left block of the Stokes system should be
$\int_\Omega2\eta\nabla_S\varphi : \varepsilon d\Omega$, (1)
not
$\int_\Omega\nabla_S\boldsymbol\varphi : dev(\varepsilon) d\Omega$, (2)
because if we use (2), then the matrix becomes
$\int_\Omega2\eta\nabla_S\varphi : \left[\nabla_S\varphi - \frac{1}{3}(\nabla\cdot\varphi)I\right] d\Omega$,
which is not symmetric. Therefore, when we use Newton method, the system Jacobian should be
$\int_\Omega(2\eta\nabla_S\varphi : \varepsilon + 2\nabla_S\varphi : \varepsilon\otimes\frac{\partial\eta}{\partial\varepsilon} : \nabla_S\varphi)d\Omega$,
where $\varepsilon$ is the full strain rate.

I know it contradicts my modifications on the Newton assembler last year, but I cannot remember why I use deviatoric strain rate at that time...

In source/material_model/rheology/elasticity.cc, deviator is never applied to the stress tensor, for it is deviatoric itself. Although there would be numerical errors if we use field method to advect the deviatoric stress fields, I think it is better to keep the consistency between the codes material model and assembler.

The comments above are only simple analyses. I think our final choice should depend on experiments. I am working on it.

YiminJin · 2025-06-19T16:38:26Z

Sorry, I was wrong again:
$\int_\Omega\nabla_S\varphi : (\nabla_S\varphi - \frac{1}{3}\nabla\cdot\varphi I)d\Omega = \int_\Omega[\nabla_S\varphi : \nabla_S\varphi - \frac{1}{3}(\nabla\cdot\varphi)(\nabla\cdot\varphi)]d\Omega.$
The matrix is symmetric.

bangerth · 2025-06-19T16:53:12Z

You already found out that
$\int_\Omega\nabla_S\varphi : (\nabla_S\varphi - \frac{1}{3}\nabla\cdot\varphi I)d\Omega = \int_\Omega[\nabla_S\varphi : \nabla_S\varphi - \frac{1}{3}(\nabla\cdot\varphi)(\nabla\cdot\varphi)]d\Omega$

Moreover, we have
$\int_\Omega(\nabla\cdot\varphi I) : (\nabla_S\varphi - \frac{1}{3}\nabla\cdot\varphi I)d\Omega = \int_\Omega[ (\nabla\cdot\varphi I) (I : \nabla_S\varphi - \frac{1}{3}\nabla\cdot\varphi I : I) ]d\Omega = \int_\Omega[ (\nabla\cdot\varphi I) (\nabla\cdot\varphi - \nabla\cdot\varphi) ]d\Omega = 0$
and as a consequence,
$\int_\Omega\nabla_S\boldsymbol\varphi : dev(\varepsilon) d\Omega = \int_\Omega dev(\varepsilon) : dev(\varepsilon) d\Omega$.

Both lines of the argument indicate that the system matrix is symmetric.

YiminJin · 2025-06-28T04:24:51Z

I am trying out the benchmark in Spiegelman et al. (2016) with the modifications in this PR and #6546 . As a first step, I use the visco plastic model and the same material parameters as in benchmarks/newton_newton_solver_benchmark_set/spiegelman_et_al_2016/input.prm (except for the "regularization" viscosity). The prm file is:
spiegelman_et_al_2016.prm.txt
The results:

where $\eta^{vp}$ is the viscosity of the viscous damper. In all these tests, I cannot make the Newton solver converge to $10^{-4}$ within 100 iterations, and the Newton solver does not converge when Use Newton residual scaling method is off. This is apparently an unacceptable result. However, when I run the original benchmark with the main branch, the result looks like

which is quite different from the figures in Spiegelman et al. (2016). Do we have model settings that can reproduce the results in the paper with a good convergence rate? @gassmoeller @MFraters

MFraters · 2025-07-08T21:48:15Z

hmm, that means they have been broken somewhere along the way. Just checking, have you tried running the original files and material model here: https://github.com/geodynamics/aspect/tree/main/benchmarks/newton_solver_benchmark_set/spiegelman_et_al_2016?

YiminJin · 2025-07-09T22:28:42Z

Just checking, have you tried running the original files and material model here: https://github.com/geodynamics/aspect/tree/main/benchmarks/newton_solver_benchmark_set/spiegelman_et_al_2016

Yes. I only changed the refinement level from 4 to 6.

…ssumption in 2D

YiminJin · 2025-10-24T00:34:37Z

I derived the analytical expression of the system Jacobian under plane-strain assumption for DP rheology, and I think I have found the reason for the different convergence behaviors with and without stabilization shown in #6160 .

The derivation is provided in the following document:
plane_strain_DP.pdf

In one word, the reason for the better performance of using deviatoric strain rate when SPD stabilization is turned on is that: when using the full strain rate, the Jacobian matrix is not semi-positive-definite, and we need a smaller scaling factor to restore the positive-definiteness. However, I think we should use the full strain rate even if it slows down the convergence rate, because it is correct (if we use the deviatoric strain rate, it would be equivalent to modifying the rheological model).

I made some modifications in source/simulator/assemblers/newton_stokes.cc and benchmarks/newton_solver_benchmark_set/spiegelman_et_al_2016/drucker_prager_compositions.cc accordingly. When using 8 levels of refinements (1024 * 256 cells), the result now looks like:

The shear bands are still not as sharp as those in the original paper of Fraters et al. (2019), but are similar to those in Spiegelman et al. (2016). Surprisingly, the nonlinear solver converges to $10^{-14}$ with only 13 iterations.

I also plotted the convergence curves with low-resolution models:

The convergence curves of Newton solver are very close to those plotted in #6160. Furthermore, these curves are produced by models with boundary velocity of 5 mm/y (I forgot to restore the boundary velocity after the high-resolution experiment), so the convergence behavior is actually better than before (in the original paper, the nonlinear solver fails to converge to $10^{-14}$ with Vel = 5 mm/y).

What is your opinion about my modifications @gassmoeller @MFraters ? If they are acceptable, I think it is time to apply the differences in test results and prepare for the final merge.

YiminJin · 2025-10-24T04:04:36Z

Corrections:

the parameters I used to reproduce the convergence curves were not the same as in the original paper. Here are the results using the same parameters ($\eta_{ref}$ = 1e23, Vel = 2.5 mm/y, mLT = 1e-8):

The curves are almost the same as those plotted in #6160 (this time we use full strain rate when assembling the system Jacobian).

The reference viscosity I used to reproduce the high-resolution result was one magnitude smaller than that in the original paper. Here is the result using the same parameters ($\eta_{ref}$ = 1e24, Vel = 5 mm/y, mLT=1e-8):

It converges to about 3e-6. The strain rate field now is very similar to Fig. 3 in the original paper.

MFraters

Thanks for doing all this work! That is great.

Do I understand it correctly that the convergence plot you show should be the same as the bottom left one in figure 5?

The stabilization definitely looks better, even though, looking at the Picard iteration the problems is more difficult, since that one is not converging.

Do you have the same plots for the more difficult cases? It would be interesting to see how it behaves there.

MFraters · 2025-10-24T08:37:07Z

source/utilities.cc

+      double consistent_second_invariant_of_deviatoric_tensor(const SymmetricTensor<2,dim> &t)
+      {
+        if (dim == 2)
+          return -( Utilities::fixed_power<2,double>(t[0][0])             // t11^2


why not simply t[0][0]*t[0][0]?

Same in other places.

I thought fixed_power() saves an addition operation for $(t_{11}+t_{22})^2$, and I apply it to other places just for a neat format...

But I think you are right. I read the source codes of fixed_power() and now I think it may cost more (there are if-else statements, and I am not sure if it will be optimized by the compiler). So I changed to the simpler form, as you suggested.

YiminJin · 2025-10-24T18:27:39Z

@MFraters The convergence behaviors for more difficult cases are shown below:

It can be seen that the convergence behavior of the current code is better indeed: the unstabilized Newton solver converges for the difficult cases with second-order rate. However, without stabilization the linear solver takes more iterations to converge, and in the most difficult case (Vel = 12.5 mm/y) the cheap iterations are even exhausted. So it is still worth using the stabilization in larger models.

YiminJin · 2025-10-24T19:32:30Z

Supplement to my comments on my derivation of the analytical formula of system Jacobian:

The derivation is under plane-strain assumption, but the conclusion also applies to the original 2D formulation. In the original 2D formulation, we have
$(2\Vert\boldsymbol\varepsilon\Vert\Vert\boldsymbol\varepsilon'\Vert)^2 - (\Vert\boldsymbol\varepsilon\Vert^2 + \boldsymbol\varepsilon:\boldsymbol\varepsilon')^2 = -\dfrac{1}{4}(\varepsilon_{xx} + \varepsilon_{yy})^4$.
Hence, $\mathbb{E}$ still has a negative eigenvalue if we use full strain rate in the assembler, and the convergence rate is slowed down by PD stabilization.

YiminJin · 2025-10-27T16:51:00Z

Eq. (9) in my derivation is calculated with the help of Sympy. The python codes are shown below:

from sympy import symbols, pprint, simplify

# Define the components of 2D strain rate as independent variables
e_xx, e_yy, e_xy = symbols('epsilon_11, epsilon_22, epsilon_12')

# Dalculate the deviatoric strain rate under plane-strain assumption
d_xx = 2 * e_xx / 3 - e_yy / 3
d_yy = 2 * e_yy / 3 - e_xx / 3
d_zz = -(e_xx + e_yy) / 3
d_xy = e_xy

# The minimum eigenvalue of the tangent operator is proportional to
# ||e|| ||d|| - ||e||^2 - e : d
# To determine whether it is positive or negative, we can compare
# the values of ||e|| ||d|| and (||e||^2 + e : d)
e_norm2 = e_xx**2 + e_yy**2 + 2 * e_xy**2
d_norm2 = d_xx**2 + d_yy**2 + d_zz**2 + 2 * d_xy**2
ed = e_xx * d_xx + e_yy * d_yy + 2 * e_xy * d_xy

print('With plane-strain:')
pprint(simplify(4 * e_norm2 * d_norm2 - (e_norm2 + ed)**2))

# Now do the same for deviatoric strain rate without plane-strain assumption
d_xx = e_xx / 2 - e_yy / 2
d_yy = e_yy / 2 - e_xx / 2
d_xy = e_xy

e_norm2 = e_xx**2 + e_yy**2 + 2*e_xy**2
d_norm2 = d_xx**2 + d_yy**2 + 2*d_xy**2
ed = e_xx * d_xx + e_yy * d_yy + 2 * e_xy * d_xy

print('Without plane-strain:')
pprint(simplify(4 * e_norm2 * d_norm2 - (e_norm2 + ed)**2))

The outputs are:

With plane-strain:
     4      3           2     2           3      4
  ε₁₁    4⋅ε₁₁ ⋅ε₂₂   2⋅ε₁₁ ⋅ε₂₂    4⋅ε₁₁⋅ε₂₂    ε₂₂ 
- ─── - ────────── - ────────── - ───────── - ───
   9        9             3           9        9  
Without plane-strain:
     4                 2    2                4
  ε₁₁      3        3⋅ε₁₁ ⋅ε₂₂         3   ε₂₂ 
- ──── - ε₁₁ ⋅ε₂₂ - ────────── - ε₁₁⋅ε₂₂ - ───
   4                    2                  4

It is easy to see that the expressions with and without plane-strain are equal to
$-\dfrac{1}{9}(\varepsilon_{11} + \varepsilon_{22})^4$
and
$-\dfrac{1}{4}(\varepsilon_{11} + \varepsilon_{22})^4$
respectively. These values characterize the minimum eigenvalues of the tangent operator $\mathbb D = 2\eta\mathbb I + \mathbb E$ with and without plane-strain assumption. They are both negative, but the absolute value of $\mathbb D$ with plane-strain asumption is smaller than that without plane-strain assumption, so the convergence rate should be less affected by the PD stabilization.

MFraters

Generally looks good to me. A few small comments to clear things up.

I don't see a significant difference in the unstabilized convergence when it converges (maybe I am missing something), but the stabilized one is definitely better.

Can you fix the tests? Than we can also see what difference it makes there.

@bangerth, can you have a look at the derivations?

MFraters · 2025-11-03T16:12:12Z

benchmarks/newton_solver_benchmark_set/spiegelman_et_al_2016/drucker_prager_compositions.cc

                              const double regularization_adjustment = (ref_visc * ref_visc)
-                                                                       / (ref_visc * ref_visc + 2.0 * ref_visc * drucker_prager_viscosity
-                                                                          + drucker_prager_viscosity * drucker_prager_viscosity);
+                                                                       / Utilities::fixed_power<2,double>(ref_visc + drucker_prager_viscosity);


I am personally not really a fan of using functions like this for something so simple. You are basically inlining the following code here: https://github.com/dealii/dealii/blob/93160909dbf3bbfe986ad5320b675737f89d6e00/include/deal.II/base/utilities.h#L944-L961

Since it is an inline constexpr, and with compiler optimizations, it is probably almost as fast, or as fast as writing it out yourself. You can argue both ways about which one is more clear to read, so it is fine to keep it, but I did want to mention it.

I did not read the source code of Utilities::fixed_power() when I made the change. Now I think you are right: using the function may not be faster. I have changed it to explicit form in a new commit.

MFraters · 2025-11-03T16:16:18Z

doc/modules/changes/20250617_yiminjin

@@ -0,0 +1,5 @@
+Changed: The deviator and second invariant of symmetric tensors are modified to
+be consistent with the plane strain assumption in 2D. It changes the outputs of
+compressible material models that are dependent on the deviatoric strain rate.


I don't think I fully understand this. Currently the Newton solver is only stabilized for the incompressible case. Did you mean incompressible or do you say it is now also stabilized for the compressible case?

I forget why I wrote this. I deleted the second sentence in a new commit.

MFraters · 2025-11-03T16:20:26Z

source/material_model/rheology/elasticity.cc


        const SymmetricTensor<2, dim>
-        edot_deviator = deviator(strain_rate) + 0.5 * stress_0_advected / elastic_viscosity
+        edot_deviator = strain_rate + 0.5 * stress_0_advected / elastic_viscosity


So instead of strain_rate it should be something like consistent_deviator_of_strain_rate or just deviatoric_strain_rate?

MFraters · 2025-11-03T16:22:23Z

source/simulator/assemblers/newton_stokes.cc


              if (enable_elasticity)
-                data.local_rhs(i) += ( deviator(elastic_out->elastic_force[q])
+                data.local_rhs(i) += ( elastic_out->elastic_force[q]


What did you find?

YiminJin · 2025-11-04T21:27:42Z

I tried the default model runs (using run.sh) for the nonlinear channel flow benchmark and the Spiegelman et al. (2016) benchmark. Here are the results:

Nonlinear channel flow

a. Always use full strain rate (FSR):

b. Use full strain rate when not stabilized, and use deviatoric strain rate (DSR) when stabilized:

The figures correspond to Fig. 2 in the original paper:

The results can be summarized as follows:

	without stabilization	with stabilization
without line search	Almost the same in both cases	Harder to converge in both cases
with line search	Slightly better in both cases	Faster with FSR, slightly slower with DSR

Spiegelman et al. (2016)

a. Always use FSR:

b. Use FSR when not stabilized, and use DSR when stabilized:

The figures correspond to Fig. 4 in the original paper:

	without stabilization	with stabilization
Vel: 2.5 cm/y, $\eta_{ref}$: 1e23	Slightly faster in both cases	Faster in both cases
Vel: 5 cm/y, $\eta_{ref}$: 1e24	No better, no worse	Much faster in both cases, FSR slightly better
Vel: 12.5 cm/y, $\eta_{ref}$: 5e24	Easier to converge in both cases	No better, no worse

The results show that:

The convergence behavior is not affected by changing from 2D tensor to plane-strain tensor;
The new scaling factor (merged last year) does lead to faster convergence rate when using PD stabilization; meanwhile, it might also reduce the stability a little bit;
Using deviatoric strain rate in the system Jacobian does not improve convergence rate or stability when using PD stabilization; instead, it leads to slower convergence rate in the channel flow benchmark.

I have not figured out why using deviatoric strain rate results in slower convergence in the channel flow benchmark. But, according to the results and the mathematical work, using full strain rate seems to be the right choice. That is also the reason I changed deviatoric strain rate to full strain the in elastic rheology.

There is an issue to be noticed: when using 2D tensor, one can use deviator() as a filter, i.e. apply it repeatedly and obtain the same result; but when using plane-strain tensor, one can only apply consistent_deviator() once, otherwise the resulting tensor would be incorrect. Currently there is no guarantee for this.

What do you think about the results @MFraters @bangerth @gassmoeller ? Do I need to carry out experiments with elastic rheology? (I have tested the full strain rate version with associated-plastic-flow tests (strip-footing, pure shear), but not the deviatoric strain rate version).

YiminJin · 2025-11-05T02:02:24Z

Sorry, I made a big mistake in the last comment: I forgot that the 2D tensors in the channel flow benchmark have not been changed to plane-strain tensors.

When I tried to do the modification, I found something that looks weird: line 183 and line 189 of benchmarks/newton_solver_benchmark_set/nonlinear_channel_flow/simple_nonlinear.cc calculate the viscosity and the viscosity derivatives according to the first two equations in Appendix B1 of the original paper:
$\eta = \eta_0^{-1/n}\varepsilon_{II}^{1/n-1}$,
$\dfrac{\partial\eta}{\partial\boldsymbol\varepsilon} = \eta\bigg(\dfrac{1}{n}-1\bigg)\dfrac{\boldsymbol\varepsilon}{\Vert\boldsymbol\varepsilon\Vert^2}$
However, line 180 multiplies edot_ii by 2, which leads to
$\eta = \eta_0^{-1/n}(2\varepsilon_{II})^{1/n-1}$
Is there something I am missing? @MFraters

YiminJin · 2025-11-05T02:46:24Z

I also made a mistake when computing the deviatoric strain rates in system Jacobian: I forgot to change deviator() to Utilities::Tensors::consistent_deviator(). Here are the correct results for DSR in Spiegelman et al. (2016) benchmark:

It is almost the same as the full strain rate case. I also compared the distribution of SPD factor, and the results are again the same. I think this is because the material model is incompressible, hence $\dfrac{\eta}{\boldsymbol a : \boldsymbol b}$ is very close to -1, which leads to
$\alpha_{SPD} = -c_{safety}\dfrac{\eta}{\boldsymbol a : \boldsymbol b}\approx c_{safety}$
in the yielding regions.

@gassmoeller Do you remember how we came to the conclusion that using deviatoric strain rate when stabilized is faster than using full strain rate?

YiminJin · 2025-11-06T02:52:33Z

I have tried to test the VEP rheology with plastic dilation, and I found some problems in source/simulator/assemblers/newton_stokes.cc:

Function NewtonStokesCompressibleStrainRateTerms::execute() does not multiply phi_p with pressure_scaling;
The function does not symmetrize the top left block when Stabilization::symmetric is set;
The function does not take the spd factor into account.

I also found that the Newton Stokes assembler uses StokesCompressiblePreconditioner, which is inconsistent with the system Jacobian.

After fixing all the problems, and adding a NewtonStokesCompressiblePreconditioner, the solving speed of the strip footing problem improved about 40%.

And, from the compressible case, I confirmed the argument that we should use full strain rate in NewtonStokesIncompressibleTerms. But I cannot be sure, since I keep making mistakes on this issue.

gassmoeller · 2025-11-10T14:40:20Z

@YiminJin Concerning the deviatoric strain rate: I think the relevant PR where I tested this was #6159 (comment), and in particular this code: https://github.com/geodynamics/aspect/pull/6159/files#diff-67a939b674da49b34479b331895052a08d8712da7befddadaba9390cd782ee05R183. As far as I can tell we did not decide to use the deviatoric strain rate for the stabilized version because it is faster, it seems like there was a theoretical argument that the stabilization factor was only safe if applied to a deviatoric strain rate. In other words the guarantee that the stabilization makes the matrix positive definite is only valid if the used strain rate is deviatoric. But I have to say I dont remember the details or if I ever made that derivation myself (I would guess I accepted it from the paper or some earlier PR discussion). From looking at the earlier PR my main point was that we need to use the full strain rate for the unstabilized version in order to guarantee the expected convergence rate of the Newton method. If your version here works with the full strain rate for both versions so much the better.

YiminJin · 2025-11-10T18:17:21Z

I apologize for making so many mistakes during the discussion.

During the morning meeting, the documentation and the conclusion were correct, but I failed to explain it. I added some comments to the derivation to make it clearer:
fsr_vs_dsr.pdf
The point is, we have removed the volumetric components in class NewtonStokesCompressibleStrainRateTerms, so we should use full strain rate in class NewtonStokesIncompressibleTerms.

Yet, I understand that we should use full strain rate does not mean that using full strain rate is faster, because using full strain rate breaks the symmetry and semi-positive-definiteness of the top-left block. I have tested the Spiegelman benchmark and the result show that using full strain rate (without stabilization) leads to a little more linear iterations but much fewer nonlinear iterations (similar to the results shown in #6160 ). But one benchmark is not enough.

@MFraters Could you help me with the nonlinear channel flow benchmark? I do not know if I misunderstand your implementation.

YiminJin · 2025-11-10T22:22:39Z

I plot the number of cheap linear iterations in each Newton iteration for the Spiegelman benchmark:

Use full strain rate in NewtonStokesIncompressibleTerms:

a. residuals:

b. number of cheap iterations:

Use deviatoric strain rate in NewtonStokesIncompressibleTerms:

a. residuals:

b. number of cheap iterations:

Remark:

The cheap iterations are limited by 1000;
Five cases report linear solver errors (The iterative (top left) solver did not converge):
Vel = 12.5 mm/yr, eta_ref = 5e24, mLT = 1e-8, 0 Picard, unstabilized, with full strain rate;
Vel = 12.5 mm/yr, eta_ref = 5e24, mLT = 1e-8, 5 Picard, unstabilized, with full strain rate;
Vel = 12.5 mm/yr, eta_ref = 5e24, mLT = 1e-8, 25 Picard, unstabilized, with full strain rate;
Vel = 12.5 mm/yr, eta_ref = 5e24, mLT = 1e-8, 0 Picard, unstabilized, with deviatoric strain rate;
Vel = 12.5 mm/yr, eta_ref = 5e24, mLT = 1e-8, 5 Picard, unstabilized, with deviatoric strain rate;

It can be seen that:

When not stabilized, using full strain rate generally takes fewer Newton iterations than using deviatoric strain rate, but the convergence is more difficult (takes more cheap linear iterations, and easier to fail);
When stabilized, the convergence behaviors of full strain rate and deviatoric strain rate are similar.

Fact 1 confirms the point that using full strain rate breaks the symmetry and semi-positive-definiteness of the top-left block. In the easiest case (Vel = 2.5 mm/yr, eta_ref = 1e23), the convergence curve of stabilized and unstabilized cases are almost the same when using deviatoric strain rate, which suggests that using deviatoric strain rate is equivalent to symmetrizing the matrix (changing the descending direction, slowing down the convergence, but restoring the symmetry and makes the linear solver stabler).

The reason that using deviatoric strain rate or not affects the convergence behavior when using incompressible model is unclear. I guess it comes from numerical error. The distribution of strain rate and velocity divergence in case {Vel = 5 mm/yr, eta_ref = 1e24, mLT = 1e-8} are:

It can be seen that the numerical error (velocity divergence) is in the same magnitude as the strain rate. So the absolute value of the negative eigenvalue of the tangent operator when using full strain rate is large at some nodes.

MFraters · 2025-11-11T06:11:42Z

I apologize for making so many mistakes during the discussion.

That is normal. I really appreciate all the work you put into this!

During the morning meeting, the documentation and the conclusion were correct, but I failed to explain it. I added some comments to the derivation to make it clearer: fsr_vs_dsr.pdf The point is, we have removed the volumetric components in class NewtonStokesCompressibleStrainRateTerms, so we should use full strain rate in class NewtonStokesIncompressibleTerms.

Yet, I understand that we should use full strain rate does not mean that using full strain rate is faster, because using full strain rate breaks the symmetry and semi-positive-definiteness of the top-left block. I have tested the Spiegelman benchmark and the result show that using full strain rate (without stabilization) leads to a little more linear iterations but much fewer nonlinear iterations (similar to the results shown in #6160 ). But one benchmark is not enough.

This is a difficult one. In the end the Newton derivatives do not decide whether you converge to the correct solution, only how fast. So in the end for us, it doesn't matter too much what the "correct" method of computing the derivative is, but what derivative results in the the fastest and most reliable convergence.

The problem you state is that method which results in the fastest convergence (full strain-rate) is not the same as the method which results in the most reliable convergence (deviatoric strain rate) in the unstabilized case.

When not stabilized, using full strain rate generally takes fewer Newton iterations than using deviatoric strain rate, but the convergence is more difficult (takes more cheap linear iterations, and easier to fail);

The difference in the amount of failure cases (2 vs 3) seems minimal to me. I think there is a third thing we should look at, which is not converging to the tolerance. Based on your figures, it seems to me that using the full strain-rate has a higher chance of actually converging to the required tolerance.

So, based on these results, I would say my preference is on the full strain-rate.

The reason that using deviatoric strain rate or not affects the convergence behavior when using incompressible model is unclear. I guess it comes from numerical error.

Could be, although it seems systematic. Maybe it could have something to do with whether every point is the cell is guaranteed to be divergence free, or just the cell as a whole. @bangerth, do you have an idea?

@MFraters Could you help me with the nonlinear channel flow benchmark? I do not know if I misunderstand your implementation.

Sure, could you be a bit more specific what the issue is? In general, there are two version, one with a prescribed velocity (input_v.prm) and one with a prescribed traction (input_t.prm). You need to run cmake to compile the material model and then you should be able to use the run script.

YiminJin · 2025-11-12T18:29:25Z

@MFraters Thank you for the detailed comment! I prefer the full strain rate too.

The issue with the nonlinear channel flow benchmark (with prescribed velocity) is that:

line 183 and line 189 of benchmarks/newton_solver_benchmark_set/nonlinear_channel_flow/simple_nonlinear.cc calculate the viscosity and the viscosity derivatives according to the first two equations in Appendix B1 of the original paper:
$\eta = \eta_0^{-1/n}\varepsilon_{II}^{1/n-1}$,
$\dfrac{\partial\eta}{\partial\boldsymbol\varepsilon} = \eta\bigg(\dfrac{1}{n}-1\bigg)\dfrac{\boldsymbol\varepsilon}{\Vert\boldsymbol\varepsilon\Vert^2}$
However, line 180 multiplies edot_ii by 2, which leads to
$\eta = \eta_0^{-1/n}(2\varepsilon_{II})^{1/n-1}$
Is there something I am missing?

YiminJin · 2025-11-12T20:14:51Z

I tested the strip footing model:

The Newton solver parameters are set as follows:

Parameter	Value
Max nonlinear iterations	100
Nonlinear solver tolerance	1e-5
Max pre-Newton nonlinear iterations	3
Max Newton line search iterations	3
Maximum linear Stokes solver tolerance	1e-2
Use Newton residual scaling method	false
Stabilization preconditioner	SPD
Stabilization velocity block	SPD

The convergence behaviors of the full strain rate case and the deviatoric strain rate case are as follows:

Clearly, the full strain rate prevails. It can be explained by the fact that in compressible/prescribed-dilation models, the assembler NewtonStokesCompressibleStrainRateTerms has already removed the volumetric part, so if we use deviatoric strain rate in NewtonStokesIncompressibleTerms, then we actually minus the volumetric part twice, which does not matter in 2D or 3D cases, but is incorrect in plane-strain cases.

gassmoeller

Thanks for putting all the effort into this! I will take another look at the main changes again, but here are a few small comments for now.

gassmoeller · 2025-11-13T16:16:56Z

doc/modules/changes/20250617_yiminjin

+Changed: The deviator and second invariant of symmetric tensors are modified to
+be consistent with the plane strain assumption in 2D.


Can you add a sentence about which models will be affected? E.g.:

Suggested change

Changed: The deviator and second invariant of symmetric tensors are modified to

be consistent with the plane strain assumption in 2D.

Fixed: The deviator and second invariant of symmetric tensors were modified to

be consistent with the plane strain assumption in 2D. This fixes a bug

in the strain rate and strain rate invariant computation of many material

models for compressible 2D models. It also results in better

nonlinear solver convergence due to some fixes to the Newton solver.

Thank you for providing a nice example! However, there are some statements I am not quite sure of:

I do not know if the original implementation should be called a ``bug'', since ASPECT has never declared that it applies plane-strain assumption in 2D.

The modifications do not only changes compressible models, but also models including prescribed dilation. Basically, all the material models that calls deviator() are affected.
Based on these points, I modified the statements a little in a new commit. Please help me revise them if the words are inappropriate.

gassmoeller · 2025-11-13T16:18:57Z

source/material_model/rheology/elasticity.cc


        const SymmetricTensor<2, dim>
-        edot_deviator = deviator(strain_rate) + 0.5 * stress_0_advected / elastic_viscosity
+        edot_deviator = strain_rate + 0.5 * stress_0_advected / elastic_viscosity


I think this comment is not addressed yet. The name of the variable is edot_deviator, but it now uses the full strain rate. Does that name of the variable have to be changed?

gassmoeller · 2025-11-13T16:22:52Z

source/simulator/solver/stokes_matrix_free_local_smoothing.cc

                            effective_strain_rate = elastic_out->viscoelastic_strain_rate[q];
                          else if ((this->get_newton_handler().parameters.velocity_block_stabilization & Newton::Parameters::Stabilization::PD) != Newton::Parameters::Stabilization::none)
-                            effective_strain_rate = deviator(effective_strain_rate);
+                            effective_strain_rate = Utilities::Tensors::consistent_deviator(effective_strain_rate);


Since you opened that pull request we also added the file stokes_matrix_free_global_coarsening. Could you fix it there as well?

Thanks for finding the flaws. The variable name has been fixed.

The GMG solver requires more attention. Besides changing deviatoric strain rate to full strain rate, I found two other problems:

The Newton derivatives related to the volumetric part is missing;

When material averaging is set to none, the Newton scheme does not work because the uninitialized viscosity_derivative_averaging_weights are assembled into the system.

I fixed the problems in a new commit. However, I cannot make the strip footing model run with GMG solver. The Kaus benchmark (with $\phi=\psi=30^\circ$) works, but it also requires Newton residual scaling method to achieve convergence. Below are the convergence behaviors with and without the Newton derivatives related to the volumetric part (only nonlinear iterations 10 to 19 are presented):

The prm file is:
kaus_2010_extension.prm.txt

It can be seen that adding the volumetric part does not lead to a big improvement in convergence rate (except for the 18th nonlinear iteration). Nevertheless, I feel it is safer to follow the ``correct'' formula.

Please let me know if there are problems in my codes, or more benchmarks to run.

YiminJin force-pushed the plane-strain branch from 8f38002 to d8b9378 Compare June 17, 2025 17:10

gassmoeller reviewed Jun 17, 2025

View reviewed changes

bobmyhill mentioned this pull request Jun 17, 2025

Make the 2D formulations consistent with plane strain assumption #6434

Open

YiminJin force-pushed the plane-strain branch from 447f09f to 5bb559c Compare June 18, 2025 04:30

bangerth mentioned this pull request Jun 18, 2025

Better document the assumptions underlying the definition of the deviatoric tensor. dealii/dealii#18569

Merged

gassmoeller reviewed Jun 18, 2025

View reviewed changes

gassmoeller changed the title ~~[WIP] Make deviator() and second_invariant() consistent with plane strain assumption in 2D~~ Make deviator() and second_invariant() consistent with plane strain assumption in 2D Jun 18, 2025

YiminJin mentioned this pull request Jun 21, 2025

Modify the plastic dilation terms to reduce noises in effective viscosity #6546

Merged

make deviator() and second_invariant() consistent with plane strain a…

79e2084

…ssumption in 2D

YiminJin force-pushed the plane-strain branch from adbca51 to 79e2084 Compare October 22, 2025 18:35

YiminJin added 3 commits October 23, 2025 16:57

use plane strain consistent operators in Spiegelman et al. benchmark

f6a9696

use full strain rate in Newton Stokes assembler

c64e01b

adjust the format

24a2169

MFraters reviewed Oct 24, 2025

View reviewed changes

YiminJin added 2 commits October 24, 2025 11:04

add header for aspect::Utilities::Tensors

69857a4

make the codes neater

ff0c142

MFraters reviewed Nov 3, 2025

View reviewed changes

YiminJin added 2 commits November 4, 2025 12:16

make the codes simpler

0af008e

modify the changelog

5001113

fix problems in compressible Newton Stokes assembler

fc3c34d

gassmoeller reviewed Nov 13, 2025

View reviewed changes

YiminJin added 3 commits November 13, 2025 16:14

Update the changelog

6a5ed77

Fix inproper variable names

3ef7b3f

Fix Newton derivatives in GMG solver for compressible models

ffe3dfb

		Changed: The deviator and second invariant of symmetric tensors are modified to
		be consistent with the plane strain assumption in 2D.

-Changed: The deviator and second invariant of symmetric tensors are modified to
-be consistent with the plane strain assumption in 2D.
+Fixed: The deviator and second invariant of symmetric tensors were modified to
+be consistent with the plane strain assumption in 2D. This fixes a bug
+in the strain rate and strain rate invariant computation of many material
+models for compressible 2D models. It also results in better
+nonlinear solver convergence due to some fixes to the Newton solver.

Make deviator() and second_invariant() consistent with plane strain assumption in 2D #6471

Are you sure you want to change the base?

Make deviator() and second_invariant() consistent with plane strain assumption in 2D #6471

Uh oh!

Conversation

YiminJin commented Jun 17, 2025

Uh oh!

bangerth commented Jun 17, 2025

Uh oh!

gassmoeller left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gassmoeller left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

YiminJin commented Jun 18, 2025

Uh oh!

gassmoeller left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gassmoeller commented Jun 18, 2025

Uh oh!

YiminJin commented Jun 19, 2025

Uh oh!

YiminJin commented Jun 19, 2025

Uh oh!

bangerth commented Jun 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

YiminJin commented Jun 28, 2025

Uh oh!

MFraters commented Jul 8, 2025

Uh oh!

YiminJin commented Jul 9, 2025

bangerth commented Jun 19, 2025 •

edited

Loading

YiminJin commented Oct 24, 2025 •

edited

Loading

YiminJin commented Oct 24, 2025 •

edited

Loading