Add PWPolyF piecewise polynomial activation layer by ollycassidy13 · Pull Request #1573 · Xilinx/finn

ollycassidy13 · 2026-04-24T13:34:34Z

Add PWPolyF, an RTL activation layer that approximates GELU, SiLU, Sigmoid,
and Tanh using piecewise degree-2 polynomials evaluated via Horner's method
on cascaded DSPFP32 FMA units. Two DSPs per PE, no BRAM, single-cycle
throughput (II=1).

Coefficients and per-function clamping config are delivered through a
SystemVerilog package (pwpolyf_pkg.sv) containing a func_cfg_t struct,
regenerated at build time for the node's K value by generate_coeffs_pkg().

InferPWPolyFLayer converts both the explicit PWPolyF custom op and
standard ONNX activations (Gelu, Sigmoid, Tanh, Sigmoid+Mul for
SiLU, Div+Erf+Add+Mul+Mul for GELU with dynamo=True / opset < 20) to
PWPolyF HW layers with default K=3.

PE folding is handled by SetFolding. Resource estimates: 2 DSPs + ~200 LUTs
per PE, 0 BRAM.
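
As a minimal sketch of the evaluation scheme in the description (not the PR's RTL, and not its `_fit_coefficients` routine), a degree-2 polynomial in Horner form needs exactly two chained multiply-adds, which is consistent with the two DSPs per PE quoted above. The segment bounds, the three-point interpolation used to obtain coefficients, and the names `horner_deg2` and `fit_quadratic` are assumptions made for illustration only.

```python
import math


def horner_deg2(x, c2, c1, c0):
    """Evaluate c2*x**2 + c1*x + c0 as two chained multiply-adds."""
    acc = c2 * x + c1      # first multiply-add
    return acc * x + c0    # second multiply-add


def fit_quadratic(func, lo, hi):
    """Hypothetical stand-in for a coefficient fit: interpolate func at
    lo, the midpoint, and hi, and return (c2, c1, c0)."""
    x0, x1, x2 = lo, 0.5 * (lo + hi), hi
    y0, y1, y2 = func(x0), func(x1), func(x2)
    d01 = (y1 - y0) / (x1 - x0)          # first divided differences
    d12 = (y2 - y1) / (x2 - x1)
    c2 = (d12 - d01) / (x2 - x0)         # second divided difference
    c1 = d01 - c2 * (x0 + x1)
    c0 = y0 - d01 * x0 + c2 * x0 * x1
    return c2, c1, c0


# One illustrative segment of tanh on [0, 0.5].
coeffs = fit_quadratic(math.tanh, 0.0, 0.5)
samples = [i * 0.05 for i in range(11)]
max_err = max(abs(horner_deg2(x, *coeffs) - math.tanh(x)) for x in samples)
```

A full layer would hold one coefficient triple per segment and select it from the input before running the two multiply-adds; how the PR partitions the input range is not shown here.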

STFleming

Nice work @ollycassidy13, this is a very useful PR!

My biggest current concern is testing (see comments). It would be great if we could work together on improving the test coverage to verify things more robustly, in particular making sure that the codegen and the RTL are verified. Please ping me if you have any questions.

Thanks again for the great work!

auphelia · 2026-05-07T10:44:41Z

Could you please move this into the components subsection? https://github.com/Xilinx/finn/tree/dev/docs/finn/components

auphelia · 2026-05-14T10:25:03Z

@@ -0,0 +1,196 @@
+# Copyright (C) 2026, Advanced Micro Devices, Inc.


You can use the SPDX-identifier for the copyright header instead and the year is not necessary anymore.

Copyright Advanced Micro Devices, Inc.
SPDX-License-Identifier: BSD-3-Clause

auphelia · 2026-05-14T13:07:24Z

+from finn.util.pwpolyf import CLAMP_CFG, NUM_OCTAVES, SUPPORTED_FUNCS, _fit_coefficients
+
+
+def _float_to_hex(f):


FINN already has float-to-hex packing in finn.util.data_packing.array2hexstring (line 92 handles floats). Please use:

array2hexstring(np.array([f]), DataType["FLOAT32"], 32, prefix="").upper()

This keeps it consistent with other RTL backends. Alternatively, BitArray(float=f, length=32).hex.upper() would also work.
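
For reference, the helper under review packs a Python float as a big-endian IEEE 754 single and prints the 32-bit pattern as eight uppercase hex digits. The stdlib-only sketch below reproduces that behaviour so that any replacement can be checked against known bit patterns; the name `float_to_hex` and the sample values are chosen for illustration.

```python
import struct


def float_to_hex(f):
    """Return the IEEE 754 single-precision bit pattern of f as 8 hex digits."""
    (bits,) = struct.unpack("!I", struct.pack("!f", float(f)))
    return "%08X" % bits


# Well-known single-precision encodings, useful as regression values.
known = {
    1.0: "3F800000",
    0.5: "3F000000",
    -2.5: "C0200000",
}
results = {value: float_to_hex(value) for value in known}
```

Comparing the output of a library-based replacement against a table like `known` is one way to confirm that byte order and case match what the generated package expects.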

auphelia · 2026-05-14T13:10:33Z

+    return "%08X" % struct.unpack("!I", struct.pack("!f", float(f)))[0]
+
+
+def generate_coeffs_pkg(K, degree=2, num_samples=1000):


generate_coeffs_pkg could be an instance method of the PWPolyF_rtl class. Other RTL backends use instance methods like prepare_codegen_rtl_values or generate_params that access node attributes via self. This would be more consistent and avoid passing K and degree explicitly:

    def _generate_coeffs_pkg(self, num_samples=1000):
        K = self.get_nodeattr("K")
        degree = self.get_nodeattr("degree")
        ...
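
To make the suggested shape concrete, here is a small self-contained sketch. `PWPolyFSketch` is a hypothetical stand-in that only mimics `get_nodeattr`; it is not the real PWPolyF_rtl class, and the method body is a placeholder rather than the PR's coefficient generation.

```python
class PWPolyFSketch:
    """Hypothetical stand-in for an RTL backend node with node attributes."""

    def __init__(self, **attrs):
        self._attrs = dict(attrs)

    def get_nodeattr(self, name):
        # Mimics attribute lookup on the node; the real lookup lives in FINN.
        return self._attrs[name]

    def _generate_coeffs_pkg(self, num_samples=1000):
        # K and degree come from the node instead of being passed in.
        K = self.get_nodeattr("K")
        degree = self.get_nodeattr("degree")
        # Placeholder result: a real method would fit coefficients and
        # write the SystemVerilog package.
        return {"K": K, "degree": degree, "num_samples": num_samples}


node = PWPolyFSketch(K=3, degree=2)
params = node._generate_coeffs_pkg()
```

With this layout the caller no longer needs to keep K and degree in sync with the node by hand, which is the consistency benefit the comment points at.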

auphelia · 2026-05-14T13:12:03Z

@@ -0,0 +1,207 @@
+# Copyright (C) 2026, Advanced Micro Devices, Inc.


You can use the SPDX-identifier for the copyright header instead and the year is not necessary anymore.

Copyright Advanced Micro Devices, Inc.
SPDX-License-Identifier: BSD-3-Clause

auphelia · 2026-05-14T13:26:46Z

+        oshape = self.get_normal_output_shape()
+        return super().make_const_shape_op(oshape)
+
+    def infer_node_datatype(self, model):


Since PWPolyF only supports FLOAT32 (as checked in verify_node), we could warn or raise an error if the incoming datatype is not FLOAT32, rather than silently updating the attribute to an unsupported type. Additionally, outputDataType is read but never updated. For a FLOAT32→FLOAT32 op it should be set to match the input.

    def infer_node_datatype(self, model):
        node = self.onnx_node
        idt = model.get_tensor_datatype(node.input[0])
        assert idt == DataType["FLOAT32"], (
            f"{node.name}: PWPolyF requires FLOAT32 input, got {idt}"
        )
        self.set_nodeattr("inputDataType", idt.name)
        self.set_nodeattr("outputDataType", idt.name)
        model.set_tensor_datatype(node.output[0], idt)

Alternatively, since input and output are always FLOAT32, you could simplify to a single dataType node attribute instead of separate inputDataType/outputDataType.

auphelia · 2026-05-14T13:32:29Z

+    def get_normal_output_shape(self, ind=0):
+        return self.get_normal_input_shape()
+
+    def get_number_output_values(self):


get_number_output_values: This is identical to the base HWCustomOp implementation, so it can be removed.

auphelia · 2026-05-14T13:50:42Z

@@ -0,0 +1,237 @@
+# Copyright (C) 2026, Advanced Micro Devices, Inc.


You can use the SPDX-identifier for the copyright header instead and the year is not necessary anymore.

Copyright Advanced Micro Devices, Inc.
SPDX-License-Identifier: BSD-3-Clause

auphelia · 2026-05-14T13:50:51Z

@@ -0,0 +1,753 @@
+# Copyright (C) 2026, Advanced Micro Devices, Inc.


You can use the SPDX-identifier for the copyright header instead and the year is not necessary anymore.

Copyright Advanced Micro Devices, Inc.
SPDX-License-Identifier: BSD-3-Clause

auphelia · 2026-05-14T13:57:30Z

+from finn.transformation.fpgadataflow.specialize_layers import SpecializeLayers
+from finn.util.pwpolyf import PiecewisePolyActivation
+
+test_fpga_part = "xcve2002-sbva484-2MP-e-S"


The FPGA part used in the tests is not supported in Vivado 2022.2. Please use a part that works with both Vivado 2022.2 and 2024.2, such as the VCK190 part (xcvc1902-vsva2197-2MP-e-S).

ollycassidy13 added 4 commits April 22, 2026 11:43

pwpolyf initial integration (missing dynamo and nn.act) (hw stub)

1d13990

nn.act detection and dynamo=True

abecad3

svh -> pkg and all k

752ecd0

pkg changes

1f6c5eb

STFleming requested review from STFleming and auphelia April 24, 2026 13:41

ollycassidy13 added 3 commits April 24, 2026 15:35

merge conflicts resolved to dev

21d804e

linting

6d01b10

linting

adc14f2

STFleming requested changes Apr 27, 2026

Comment thread tests/fpgadataflow/test_fpgadataflow_pwpolyf.py

Comment thread src/finn/custom_op/fpgadataflow/pwpolyf.py Outdated

Comment thread src/finn/custom_op/fpgadataflow/pwpolyf.py Outdated

ollycassidy13 added 3 commits April 27, 2026 17:35

improved testing

dd1e700

versal check

c23097b

linting

7d56e90

STFleming self-requested a review April 30, 2026 14:57

auphelia requested changes May 14, 2026

ollycassidy13 added 3 commits May 18, 2026 15:29

move pwpolyf torch module

1b6692d

Address PWPolyF reviewer comments

59fc398

export to match brevitas

f3156c4

ollycassidy13 wants to merge 13 commits into Xilinx:dev from ollycassidy13:feature/pwpolyf
