[otbn] Define SIMD instructions #29231

etterli · 2026-02-02T16:22:22Z

This PR defines the new SIMD (vectorized) bignum instructions for OTBN operating on the WDRs. The new instructions operate on 8 32-bit elements per WDR:

bn.addv(m): Vectorized addition with optional pseudo-modulo reduction.
bn.subv(m): Vectorized subtraction with optional pseudo-modulo reduction.
bn.mulv(l): Vectorized multiplication with optional 'by lane' mode.
bn.mulvm(l): Vectorized Montgomery multiplication (without the conditional subtraction step of Montgomery multiplication), with optional 'by lane' mode.
bn.trn1 / bn.trn2: Instructions to interleave vector elements. These are especially useful for NTT and INTT, where the vector elements must be shuffled once the stride between elements is smaller than what two WDRs provide. These instructions also operate on 64-bit and 128-bit vector elements.
bn.shv: Vectorized logical shift instruction.
bn.pack / bn.unpk: Instructions to pack vectors of 32-bit elements into vectors of 24-bit elements and to unpack them again. This saves DMEM space.
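As a rough illustration of the element-wise behaviour listed above, the following Python sketch models a WDR as a 256-bit integer holding 8 unsigned 32-bit lanes. It is not taken from bignum-insns.yml: the helper names are hypothetical, the lane ordering (lane 0 in the least significant bits) is an assumption, and the pseudo-modulo step is assumed to be a single conditional subtraction of the modulus. The Montgomery helper shows the standard reduction with R = 2^32 and the final conditional subtraction left out.

```python
# Illustrative model only; names, lane order and the exact pseudo-modulo
# semantics are assumptions, not taken from bignum-insns.yml.
LANES = 8
WIDTH = 32
MASK = (1 << WIDTH) - 1


def split(wdr):
    """Split a 256-bit integer into 8 unsigned 32-bit lanes (lane 0 = LSBs)."""
    return [(wdr >> (WIDTH * i)) & MASK for i in range(LANES)]


def join(lanes):
    """Inverse of split(): assemble 8 lanes into one 256-bit integer."""
    out = 0
    for i, v in enumerate(lanes):
        out |= (v & MASK) << (WIDTH * i)
    return out


def addv(a, b):
    """Lane-wise addition, each sum truncated to 32 bits."""
    return join([(x + y) & MASK for x, y in zip(split(a), split(b))])


def addvm(a, b, mod):
    """Lane-wise addition with an assumed pseudo-modulo step:
    subtract mod once if the sum is >= mod (no full reduction)."""
    out = []
    for x, y in zip(split(a), split(b)):
        s = x + y
        if s >= mod:
            s -= mod
        out.append(s & MASK)
    return join(out)


def mont_mul_no_final_sub(x, y, q):
    """Montgomery product x*y*R^-1 mod q with R = 2^32, omitting the final
    conditional subtraction. For x, y < q the result lies in [0, 2q)."""
    r = 1 << WIDTH
    q_neg_inv = (-pow(q, -1, r)) % r  # -q^-1 mod R; q must be odd
    t = x * y
    m = (t * q_neg_inv) & MASK        # m = t * (-q^-1) mod R
    return (t + m * q) >> WIDTH       # t + m*q is divisible by R
```

Because the conditional subtraction is skipped, the Montgomery result is only congruent to the reduced value and may still need one subtraction of q before it is used as a canonical representative.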

The instructions are described in more detail in hw/ip/otbn/data/bignum-insns.yml (first commit). For bn.pack / bn.unpk and bn.trn see also the programmer's manual. Alternatively one can build the documentation locally and view the generated ISA manual (use util/site/build-docs.sh serve).
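For readers without the generated manual at hand, here is a minimal Python sketch of what interleaving and packing could look like. The `trn1`/`trn2` helpers assume semantics modelled on the AArch64 NEON TRN1/TRN2 instructions (even-indexed or odd-indexed elements of both inputs, interleaved), and `pack24`/`unpack24` assume a dense packing in which the upper 8 bits of each 32-bit element are dropped. All function names are hypothetical, and the real operand layout (for example how packed vectors span WDRs) is defined by the ISA manual, not by this sketch.

```python
# Illustrative only: semantics are assumed, see the ISA manual for the
# authoritative definitions of bn.trn1, bn.trn2, bn.pack and bn.unpk.

def trn1(a, b):
    """Interleave the even-indexed elements of a and b."""
    out = []
    for i in range(0, len(a), 2):
        out += [a[i], b[i]]
    return out


def trn2(a, b):
    """Interleave the odd-indexed elements of a and b."""
    out = []
    for i in range(1, len(a), 2):
        out += [a[i], b[i]]
    return out


def pack24(elems):
    """Pack elements densely at 24 bits each; the upper 8 bits are dropped."""
    out = 0
    for i, v in enumerate(elems):
        out |= (v & 0xFFFFFF) << (24 * i)
    return out


def unpack24(packed, n):
    """Recover n 24-bit elements from a densely packed integer."""
    return [(packed >> (24 * i)) & 0xFFFFFF for i in range(n)]
```

Under these assumptions, eight packed elements occupy 192 bits instead of 256, which is where the DMEM saving comes from.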

When this is merged, various OTBN DV tests will fail because the Random Instruction Generator (RIG) parses the bignum-insns.yml file and will generate programs that include the new instructions. These tests will fail with illegal instruction errors because the new instructions are not yet implemented in the simulator and RTL.

This PR will have minor conflicts with #29025 (only in some documentation files), so we should agree on which to merge first (probably #29025).

hw/ip/otbn/data/bignum-insns.yml

hw/ip/otbn/doc/programmers_guide.md

nasahlpa

Thanks Pascal, this looks really good to me. Just a couple of nits/questions.

hw/ip/otbn/data/bignum-insns.yml

nasahlpa

Thanks for addressing the feedback!

h-filali

Thanks @etterli this looks very good. I just have some minor comments and questions.

hw/ip/otbn/data/bignum-insns.yml

h-filali · 2026-02-05T16:37:20Z

hw/ip/otbn/data/bignum-insns.yml

+    The vectors are multiplied elementwise and each product is truncated to the element length.
+    The final vector is stored in `wrd`.
+
+    This instruction stalls OTBN for 4 cycles and writes the final result to the ACC WSR.


Is the final result added to the accumulate WSR or just stored there? Or why are we storing to wrd and ACC?

The instruction operates over 4 cycles, and each cycle produces one chunk of the result vector which must be stored somewhere. It cannot be written to the WDR before the instruction finishes (otherwise the lane version won't work properly), so we store the partial results in ACC. This way we "construct" the result over multiple cycles and use ACC as a buffer. At the end we still must perform a write to the desired WDR.
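The buffering argument can be made concrete with a small Python model. This is only a sketch under stated assumptions: two elements are processed per cycle (8 elements over 4 cycles), 'by lane' mode multiplies every element of the first operand by one selected element of the second, and the register file is a plain dictionary. The function names and the register names `w0`/`w1` are hypothetical. The second function is a deliberately broken variant that writes each chunk straight to the destination.

```python
# Illustrative model; chunk size, by-lane semantics and all names are
# assumptions made for this sketch, not the RTL or simulator behaviour.
LANES, WIDTH = 8, 32
MASK = (1 << WIDTH) - 1
CHUNK = 2  # assumed: 8 elements spread over 4 cycles


def mulvl_buffered(regs, wrd, wrs1, wrs2, lane):
    """By-lane multiply with partial results held in a separate buffer
    (standing in for ACC) and a single write-back at the end."""
    acc = [0] * LANES  # overwritten chunk by chunk, never accumulated
    for cycle in range(LANES // CHUNK):
        idx = range(cycle * CHUNK, (cycle + 1) * CHUNK)
        chunk = [(regs[wrs1][i] * regs[wrs2][lane]) & MASK for i in idx]
        for i, v in zip(idx, chunk):
            acc[i] = v
    regs[wrd] = list(acc)
    return acc


def mulvl_in_place(regs, wrd, wrs1, wrs2, lane):
    """Hypothetical variant that writes every chunk directly to wrd."""
    for cycle in range(LANES // CHUNK):
        idx = range(cycle * CHUNK, (cycle + 1) * CHUNK)
        chunk = [(regs[wrs1][i] * regs[wrs2][lane]) & MASK for i in idx]
        for i, v in zip(idx, chunk):
            regs[wrd][i] = v
    return regs[wrd]


def demo():
    """Run both variants with the destination aliasing the lane source."""
    a = [2, 3, 4, 5, 6, 7, 8, 9]
    b = [3, 0, 0, 0, 0, 0, 0, 0]
    good = mulvl_buffered({"w0": list(a), "w1": list(b)}, "w1", "w0", "w1", 0)
    bad = mulvl_in_place({"w0": list(a), "w1": list(b)}, "w1", "w0", "w1", 0)
    return good, bad
```

In `demo()` the destination is also the register that supplies the selected lane. The in-place variant overwrites that lane after the first cycle, so later chunks are multiplied by the wrong value, whereas the buffered variant reads the original lane throughout.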

I agree this sentence should be extended, similar to the description of the mulvm instructions.

Do you think there should be more remarks on how the partial results are accumulated?

Okay that makes sense. That was also what I had in mind. I think what tripped me up a bit is the fact that we are using the accumulate register even though we are not accumulating. Maybe we can mention explicitly that ACC is overwritten and we do not accumulate.

"writes the final result to the ACC WSR" I think this includes these points?

hw/ip/otbn/data/bignum-insns.yml

hw/ip/otbn/doc/isa.md

hw/ip/otbn/doc/pack_instruction_shifting.svg

vogelpi · 2026-02-06T13:07:53Z

@andreaskurth this PR will break the OTBN regressions as it introduces instruction definitions which at the moment neither the RTL nor the simulator knows. And that's kind of okay as we need to split up the work into multiple PRs.

However, I think we should increase the version number and set back the design and verification stages in otbn.hjson to D1 and V1 to reflect this. In terms of version number increase: OTBN binaries and Ibex code interfacing OTBN should not change as long as the IMEM does not increase to more than 16 KiB. Right now, it's not entirely clear if 16 KiB will be enough. So, I would suggest increasing the version from 1.1.0 to 1.2.0 for now. We can always go to 2.0.0 if breaking changes get introduced. WDYT?

rswarbrick

This looks good to me: thanks!

nasahlpa · 2026-02-10T06:54:31Z

> @andreaskurth this PR will break the OTBN regressions as it introduces instruction definitions which at the moment neither the RTL nor the simulator knows. And that's kind of okay as we need to split up the work into multiple PRs.
>
> However, I think we should increase the version number and set back the design and verification stages in otbn.hjson to D1 and V1 to reflect this. In terms of version number increase: OTBN binaries and Ibex code interfacing OTBN should not change as long as the IMEM does not increase to more than 16 KiB. Right now, it's not entirely clear if 16 KiB will be enough. So, I would suggest increasing the version from 1.1.0 to 1.2.0 for now. We can always go to 2.0.0 if breaking changes get introduced. WDYT?

Do we have an update on this?

I agree that we should move OTBN to D1/V1. I'd suggest doing this in this PR as #29232 follows afterwards.

etterli · 2026-02-10T07:53:20Z

@nasahlpa I discussed this with @vogelpi offline and we decided to increase the version to 1.2.0 and set back the development stages. I added a commit which does this.

vogelpi

The referenced commit_id does not belong to a commit on master, but to a commit in this PR. Once the PR is merged, the ID will change, so this doesn't work.

hw/ip/otbn/data/otbn.hjson

etterli · 2026-02-10T08:24:50Z

@vogelpi Thanks for clarifying the process of the version increase. I have now removed the version change commit and will create a new PR once this one is merged to master.

vogelpi · 2026-02-10T08:48:36Z

> @vogelpi Thanks for clarifying the process of the version increase. I have now removed the version change commit and will create a new PR once this one is merged to master.

Okay, sounds good. Thank you!

This introduces vectorized (SIMD) big number instructions operating on the 256-bit WDRs. These instructions interpret WDRs as vectors of unsigned elements. Most instructions use 32-bit elements; a few also support larger element widths. Signed-off-by: Pascal Etterli <[email protected]>

etterli requested review from andrea-caforio, h-filali, nasahlpa, rswarbrick and vogelpi February 2, 2026 16:24

etterli mentioned this pull request Feb 2, 2026

[otbn] Add OTBN simulator implementation of SIMD instructions #29232

Merged

etterli requested a review from johannheyszl February 2, 2026 16:29

rswarbrick reviewed Feb 2, 2026

etterli force-pushed the otbn-pqc-simd-insn branch from f0c84b5 to 81053bf Compare February 2, 2026 20:19

nasahlpa reviewed Feb 3, 2026

etterli force-pushed the otbn-pqc-simd-insn branch 4 times, most recently from 5997928 to 1c3cca1 Compare February 4, 2026 08:01

etterli added the CI:Rerun Rerun failed CI jobs label Feb 4, 2026

github-actions bot removed the CI:Rerun Rerun failed CI jobs label Feb 4, 2026

nasahlpa approved these changes Feb 5, 2026

h-filali reviewed Feb 5, 2026

etterli force-pushed the otbn-pqc-simd-insn branch from 1c3cca1 to 6505711 Compare February 6, 2026 08:45

etterli added the CI:Rerun Rerun failed CI jobs label Feb 6, 2026

github-actions bot removed the CI:Rerun Rerun failed CI jobs label Feb 6, 2026

etterli force-pushed the otbn-pqc-simd-insn branch from 6505711 to 134ff78 Compare February 6, 2026 14:14

rswarbrick approved these changes Feb 9, 2026

vogelpi requested changes Feb 10, 2026

etterli force-pushed the otbn-pqc-simd-insn branch from 7240de4 to 134ff78 Compare February 10, 2026 08:22

etterli added the CI:Rerun Rerun failed CI jobs label Feb 10, 2026

github-actions bot removed the CI:Rerun Rerun failed CI jobs label Feb 10, 2026

vogelpi self-requested a review February 10, 2026 08:48

vogelpi approved these changes Feb 10, 2026

etterli added the CI:Rerun Rerun failed CI jobs label Feb 10, 2026

github-actions bot removed the CI:Rerun Rerun failed CI jobs label Feb 10, 2026

etterli added 3 commits February 10, 2026 10:38

[otbn,doc] Extend the programmer's guide with bn.pack and bn.unpk exp…

d6e2cf1

…lanations Adds a simple example of how to use the bn.pack and bn.unpk instructions. Signed-off-by: Pascal Etterli <[email protected]>

[otbn,doc] Extend the programmer's guide with bn.trn explanations

22f2027

This illustrates the functionality of the bn.trn1 and bn.trn2 instructions. Signed-off-by: Pascal Etterli <[email protected]>

etterli force-pushed the otbn-pqc-simd-insn branch from 134ff78 to 22f2027 Compare February 10, 2026 09:42

nasahlpa added this pull request to the merge queue Feb 10, 2026

Merged via the queue into lowRISC:master with commit c7f8f68 Feb 10, 2026
75 of 78 checks passed

etterli mentioned this pull request Feb 11, 2026

[otbn] Increase OTBN version to 1.2.0 #29296

Merged
