Add CI runner building with LLVM Flang #739

mmuetzel · 2025-12-27T17:43:19Z

Disable MPI because Ubuntu does not provide Fortran bindings for LLVM Flang.
Disable quadruple-precision math because Ubuntu distributes a Flang compiler runtime without support for it. (And reportedly [1], the upstream support for quadruple-precision floating-point math in Flang isn't ready yet either if I understood correctly.)

~~Also fix some build errors after configuring with -DHAVE_QP=ON and enable it by default (like until recently).~~ (Addressed in #745.)

mmuetzel · 2025-12-27T18:05:34Z

Some tests seem to be failing with Flang as the Fortran compiler.
E.g., with runtime errors like this:

fatal Fortran runtime error(/home/runner/work/elmerfem/elmerfem/fem/src/modules/MagnetoDynamics/CalcFields.F90:1145): Assign: mismatching element counts in array assignment (to 15, from 6)

I don't know if that is an issue with the compiler, with its runtime or if it actually points at something that might be suspicious in ElmerFEM.

mmuetzel · 2026-01-01T18:27:35Z

Would it make sense to split the PR into two parts so that you could merge that part that fixes building with support for quadruple-precision floating point number, and leave the part with the CI using LLVM Flang for later?

mmuetzel · 2026-01-09T11:21:17Z

Rebased on a current head after #745 was merged.

juharu · 2026-01-09T12:19:16Z

Some tests seem to be failing with Flang as the Fortran compiler. E.g., with runtime errors like this:
fatal Fortran runtime error(/home/runner/work/elmerfem/elmerfem/fem/src/modules/MagnetoDynamics/CalcFields.F90:1145): Assign: mismatching element counts in array assignment (to 15, from 6)
I don't know if that is an issue with the compiler, with its runtime or if it actually points at something that might be suspicious in ElmerFEM.

Yes, this seems definitely a bug, i'll give it a try. Thanks!

raback · 2026-01-09T12:59:27Z

I worked last week a little on this and at least some problems seemed to be from non-explicit range when using GetReal.
https://github.com/ElmerCSC/elmerfem/tree/ExplicitGetRealRange
Merging this can maybe resolve some test.

mmuetzel · 2026-01-12T15:33:25Z

I rebased on a current head of the devel branch. And the number of failing tests reduced from 25 to 11.
Nice. 👍

The remaining test errors seem to be in different categories than the "mismatching element counts" errors.

juharu · 2026-01-13T11:00:15Z

670 - circuits_harmonic_foil (Failed)                   3D circuits harmonic mgdyn serial whitney
671 - circuits_harmonic_foil_anl_rotm (Failed)          3D circuits harmonic mgdyn rotm serial whitney
672 - circuits_harmonic_foil_wvector (Failed)           3D circuits harmonic mgdyn serial whitney wvector
673 - circuits_harmonic_homogenization_coil_solver (Failed) 3D circuits harmonic homogenization mgdyn serial stranded whitney
676 - circuits_harmonic_stranded (Failed)               3D circuits harmonic mgdyn serial whitney
677 - circuits_harmonic_stranded_homogenization (Failed) 3D circuits harmonic homogenization mgdyn serial stranded whitney

These failures seem to be because the stack (8M) is not large enough on my laptop.
(using "ulimit -s unlimited" on my computer fixes the failures). We maybe should look whether we
can do better, but this might also be a compiler thing ...

juharu · 2026-01-13T11:30:45Z

The patch below fixes the stack size problem for me. My first impression is that the compiler
is maybe doing something strange with the original code....

diff --git a/fem/src/ElmerSolver.F90 b/fem/src/ElmerSolver.F90
index afb7b025b..7c5fd73d1 100644
--- a/fem/src/ElmerSolver.F90
+++ b/fem/src/ElmerSolver.F90
@@ -2298,7 +2298,15 @@

            IF(ASSOCIATED(Mesh % Edges)) THEN
              IF ( i<=Mesh % NumberOfBulkElements) THEN

              Gotit = ListCheckPresent( IC, TRIM(Var % Name)//' {e}' )

+#if 1

```
              BLOCK
```

                CHARACTER(LEN(Var % Name)+4) :: s

                s = Var % Name // ' {e}'

                Gotit = ListCheckPresent( IC, Var % Name//' {e}' )

```
              END BLOCK
```

+#else

                Gotit = ListCheckPresent( IC, Var % Name//' {e}' )

+#endif
IF ( Gotit ) THEN
DO k=1,Element % TYPE % NumberOfedges
Edge => Mesh % Edges(Element % EdgeIndexes(k))

juharu · 2026-01-13T11:33:37Z

... and I wasn't even using the introduced new 's' variable for anything there (as i intended). So just
some added no-op piece of code fixed the thing.

juharu · 2026-01-13T11:39:04Z

Just adding the
BLOCK
END BLOCK
around the call to "ListCheckPresent()" seems enough ....

juharu · 2026-01-13T11:52:58Z

154 - EM_port_eigen_2ndorder (Failed)                   complex_eigen eigen emwave serial

2941

this also runs smoothly (on my laptop) with added stack space: "ulimit -s unlimited"
the stack is consumed somewhere else than in the "circuits_harmonic*" tests though

mmuetzel · 2026-01-13T14:53:02Z

The patch is hard to read (using triple backticks for blocks of unformatted text might help).
One other change (apart from the BLOCK) could be that you removed the TRIM. Did that alone make a difference?

juharu · 2026-01-13T17:20:49Z

Yes, sorry 'bout the formatting, should have used something to prevent the default. Anyway, closing the call within the BLOCK-construct was the key. Removing or adding TRIM doesn't do anything (and hasn't mostly been required for a few years now - after compilers really started supporting fortran allocatable character string -construct ...)

juharu · 2026-01-13T19:09:36Z

I committed the BLOCK-END BLOCK thing to devel branch, shouldn't do any harm ...

juharu · 2026-01-14T08:04:52Z

After a recompilation, also tests "Contact3DLevelProj" and "Contact3DNormalProj" exceed the default 8M main stack
on my laptop. The patch below fixes this (after applying this you can reduce the stack size to < 512K), what I don't understand really, is, that this maybe should be the default:
-fno-stack-arrays Allocate array temporaries on the heap (default)
Or maybe the effective word is "temporaries" ?

st.patch

mmuetzel · 2026-01-14T08:09:40Z

Thanks for looking into this. But the diff is unreadable with the default formatting. You need to use triple backticks for blocks of plain text in comment. Single backticks only work inside paragraphs of formatted text.
Could you please edit your comment and change the single backticks to triple backticks around the diff?

juharu · 2026-01-14T08:10:28Z

Ok, thanks. I'll do that next, time, for now I attached the patch...

mmuetzel · 2026-01-14T08:12:55Z

Thanks for attaching the patch file.

Yeah. Explicitly allocating these potentially large arrays on the heap instead of on the stack looks reasonable to me.

juharu · 2026-01-14T08:22:05Z

Yes, I think so too, I'll commit these changes (... and similar changes in the complex version of IDRS implementation) to "devel".

mmuetzel · 2026-01-14T09:17:17Z

I rebased again (on top of b44300d).
Only 4 of the 852 run tests are still failing:

	154 - EM_port_eigen_2ndorder (Failed)                   complex_eigen eigen emwave serial
	216 - FilmFlowPlane4 (Failed)                           quick serial
	217 - FilmFlowPlane5 (Failed)                           quick serial
	593 - SunAngle (Failed)                                 quick serial

Good progress. 👍

juharu · 2026-01-14T09:22:07Z

Yep, thanks! The reason for the last 3 is known (small elmer bugs). I'll try to see where the first (mis)uses its stack space.

juharu · 2026-01-14T11:02:53Z

For reference, gfortran has this (which is why we haven't seen this type of problems in a while)

    -fmax-stack-var-size=n

           This  option  specifies  the  size  in bytes of the largest array that is put on the stack; 
if the size is exceeded static memory is used (except in procedures marked as "RECURSIVE"). 
Use the option -frecursive to allow for recursive procedures that do not have a "RECURSIVE" 
attribute or for  parallel  programs.  Use  -fno-automatic  to  never use the stack.

  This  option currently only affects local arrays declared with constant bounds, and may not 
apply to all character variables.  Future versions of GNU Fortran may improve   this behavior.

           The default value for n is 65536.

mmuetzel · 2026-01-14T12:40:46Z

Apparently, there is no support for -fmax-stack-var-size in LLVM Flang. And I couldn't find an issue where that would be discussed.
According to their documentation:

Flang already allocates all local arrays on the stack

Matching the documentation for gfortran, they follow up with:

But there are some cases where temporary arrays are created on the heap by Flang.

Apparently, they implemented -fstack-array to force allocation of these temporaries on the stack, too. But I couldn't find anything to pivot in the other way (i.e., more allocations in the heap).

If you have a reproducer, it might make sense to open an issue on their tracker: https://github.com/llvm/llvm-project/issues
Obviously, the current behavior (allocating automatic variables on the stack independent of their size) can cause stack overflows in real-life applications.

juharu · 2026-01-14T14:12:36Z

Below is a very simple test case, it reads a number "n", gives it the to subroutine "msum", which uses it to allocate an automatic variable "w". flang compiled image crashes when n=~1100000 (running on my laptop, with 8mb default stack size), gfortran compiled image will accept anything that fits to central memory, f.ex. 1000000000 (1e9), again running my laptop

I think elmer is now mostly good with flang, given the changes I made to source code. Can't explain the "circuit_harmonic*" test cases though, with the BLOCK-END BLOCK around the one subroutine call (somewhat dramatically) reducing stack usage .


program test

   integer(8) :: n
   read(5,*) n
   call msum(n)

contains

  subroutine msum(n)
    integer(8) :: n
    real(8) :: w(n)

    call random_number(w)
    print*,'sum: ', n,sum(w)
  end subroutine msum

end program test```

Disable MPI because Ubuntu does not provide Fortran bindings for LLVM Flang. Disable quadruple precision math because Ubuntu distributes a Flang runtime without support for it.

mmuetzel · 2026-01-14T23:08:03Z

I rebased on a current head (e11e6fb). With that, the following three tests are failing for the runner that builds with LLVM Flang:

	216 - FilmFlowPlane4 (Failed)                           quick serial
	217 - FilmFlowPlane5 (Failed)                           quick serial
	593 - SunAngle (Failed)                                 quick serial

raback · 2026-01-15T16:05:13Z

Nice work! All tests should pass now...

mmuetzel mentioned this pull request Jan 8, 2026

Fix building with quadruple-precision floating-point support #745

Merged

mmuetzel marked this pull request as draft January 8, 2026 14:47

mmuetzel force-pushed the ci-ubuntu branch from 1cf6b34 to cbd1bae Compare January 9, 2026 11:20

mmuetzel marked this pull request as ready for review January 9, 2026 11:21

mmuetzel changed the title ~~Fix building with quadruple-precision fp support and add CI runner building with LLVM Flang~~ Add CI runner building with LLVM Flang Jan 9, 2026

mmuetzel mentioned this pull request Jan 9, 2026

Compilation error with LLVM Flang #609

Closed

mmuetzel force-pushed the ci-ubuntu branch from cbd1bae to c99f395 Compare January 12, 2026 14:59

mmuetzel force-pushed the ci-ubuntu branch from c99f395 to bc060cf Compare January 14, 2026 08:46

Add CI runner building with LLVM Flang.

d5cf41c

Disable MPI because Ubuntu does not provide Fortran bindings for LLVM Flang. Disable quadruple precision math because Ubuntu distributes a Flang runtime without support for it.

mmuetzel force-pushed the ci-ubuntu branch from bc060cf to d5cf41c Compare January 14, 2026 21:40

raback merged commit 71f94d5 into ElmerCSC:devel Jan 15, 2026
11 of 12 checks passed

Add CI runner building with LLVM Flang #739

Add CI runner building with LLVM Flang #739

Conversation

mmuetzel commented Dec 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mmuetzel commented Dec 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mmuetzel commented Jan 1, 2026

Uh oh!

mmuetzel commented Jan 9, 2026

Uh oh!

juharu commented Jan 9, 2026

Uh oh!

raback commented Jan 9, 2026

Uh oh!

mmuetzel commented Jan 12, 2026

Uh oh!

juharu commented Jan 13, 2026

Uh oh!

juharu commented Jan 13, 2026

Uh oh!

juharu commented Jan 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

juharu commented Jan 13, 2026

Uh oh!

juharu commented Jan 13, 2026

Uh oh!

mmuetzel commented Jan 13, 2026

Uh oh!

juharu commented Jan 13, 2026 via email • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

juharu commented Jan 13, 2026

Uh oh!

juharu commented Jan 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mmuetzel commented Jan 14, 2026

Uh oh!

juharu commented Jan 14, 2026

Uh oh!

mmuetzel commented Jan 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

juharu commented Jan 14, 2026

Uh oh!

mmuetzel commented Jan 14, 2026

Uh oh!

juharu commented Jan 14, 2026 via email • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

juharu commented Jan 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mmuetzel commented Jan 14, 2026

Uh oh!

juharu commented Jan 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mmuetzel commented Jan 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

raback commented Jan 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mmuetzel commented Dec 27, 2025 •

edited

Loading

mmuetzel commented Dec 27, 2025 •

edited

Loading

juharu commented Jan 13, 2026 •

edited

Loading

juharu commented Jan 13, 2026 via email •

edited

Loading

juharu commented Jan 14, 2026 •

edited

Loading

mmuetzel commented Jan 14, 2026 •

edited

Loading

juharu commented Jan 14, 2026 via email •

edited

Loading

juharu commented Jan 14, 2026 •

edited

Loading

juharu commented Jan 14, 2026 •

edited

Loading

mmuetzel commented Jan 14, 2026 •

edited

Loading