[gpu]: ISTFT basic impl. #29030
base: master
Conversation
```cpp
@@ -54,6 +54,7 @@ class OPENVINO_API ISTFT : public Op {
    void set_center(const bool center);

    bool get_normalized() const;
    void set_normalized(const bool normalized);
```
Can be removed. It's not required, right? Shape inference does not depend on this attribute.
This method is needed to properly construct the op on the GPU side (ctor params are not supported in the GPU plugin...). Otherwise you would construct a partially undefined op, which would work at this point only because you and I know that shape inference currently does not depend on that state.
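To illustrate the situation, a minimal hypothetical sketch of rebuilding the op attribute-by-attribute on the plugin side when constructor arguments cannot be forwarded (the `v16` namespace and the default constructor here are my assumptions, not taken from this PR):

```cpp
// Hypothetical sketch: the plugin reconstructs the op via setters, so a
// matching getter is needed for every attribute it must carry over.
auto copy = std::make_shared<ov::op::v16::ISTFT>();  // namespace assumed
copy->set_center(op->get_center());
// Without get_normalized(), this attribute would stay default-initialized
// and the copy would be only partially defined.
copy->set_normalized(op->get_normalized());
```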
OK for the core part.
@wilson-seok please review, or assign someone to review
```cpp
@@ -0,0 +1,107 @@
// Copyright (C) 2018-2025 Intel Corporation
```
We have new (and better) infrastructure for adding kernels: ocl_v2. Could you implement the kernel on that instead of ocl?
Here's an example of a migration: #29901
Well, I can do it in a separate PR if really needed - this one has already passed all the tests (which took a week due to unrelated CI failures...).
I'm voting for this approach
```cpp
stream& stream = instance.get_network().get_stream();
// This is needed to clear the output memory before executing the kernel for static-shape models.
// The ref kernel assumes that output memory is already cleared.
instance.output_memory(0).fill(stream, false);
```
For proper synchronization, this call should either be blocking, or the resulting event should be added to the kernel's dependencies.
So the kernel may not run on the same stream?
No, they are guaranteed to be executed on the same stream, but depending on the queue type there might be a problem:
- In case of an `in_order` queue, all submitted tasks (memory fill and kernel enqueue) are executed in the order they are enqueued.
- In case of an `out_of_order` queue, the driver considers dependencies between tasks when executing them. If there are no proper dependencies (via barriers or events), the driver does not guarantee that the memory will be reset before the kernel launches - in theory, kernel execution might start in parallel with, or even before, the memory reset.

Additionally, considering that memory can be reused between primitives, we might end up resetting the memory of some predecessor primitive right during its execution. Therefore, for an `out_of_order` queue, an additional barrier is required before this `fill()` call as well.
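To make that concrete, a minimal sketch in plain OpenCL (not the cldnn wrappers used in this plugin) of the event-based variant: the fill's completion event is passed in the kernel's wait list, so even an out-of-order queue cannot reorder the two. The names (`queue`, `out_buf`, `kernel`, `out_bytes`, `gws`) are assumed to be set up by the caller.

```cpp
#include <CL/cl.h>

cl_int enqueue_with_cleared_output(cl_command_queue queue,
                                   cl_mem out_buf, size_t out_bytes,
                                   cl_kernel kernel, size_t gws) {
    const cl_uint zero = 0;  // out_bytes assumed to be a multiple of the pattern size
    cl_event fill_done = NULL;

    // Reset the output buffer; the returned event marks its completion.
    cl_int err = clEnqueueFillBuffer(queue, out_buf, &zero, sizeof(zero),
                                     0, out_bytes, 0, NULL, &fill_done);
    if (err != CL_SUCCESS)
        return err;

    // The wait list makes the dependency explicit: on an out-of-order
    // queue the kernel may not start until the fill has finished.
    err = clEnqueueNDRangeKernel(queue, kernel, 1, NULL, &gws, NULL,
                                 1, &fill_done, NULL);
    clReleaseEvent(fill_done);
    return err;
}
```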
```cpp
if (inputs.size() == 4) {
    auto prim = cldnn::ISTFT(layer_type_name_ID(op), inputs[0], inputs[1], inputs[2], inputs[3], op->get_center(), op->get_normalized());
    p.add_primitive(*op, prim);
} else {
    auto prim = cldnn::ISTFT(layer_type_name_ID(op), inputs[0], inputs[1], inputs[2], inputs[3], inputs[4], op->get_center(), op->get_normalized());
    p.add_primitive(*op, prim);
}
```
I think it's not actually needed to enumerate all the inputs; we can just slightly update ISTFT like this:

```cpp
ISTFT(const primitive_id& id,
      const std::vector<input_info>& inputs,
      const bool center,
      const bool normalized)
    : primitive_base(id, inputs),
      center(center),
      normalized(normalized) {}
```
And simplify the primitive creation:

```diff
-if (inputs.size() == 4) {
-    auto prim = cldnn::ISTFT(layer_type_name_ID(op), inputs[0], inputs[1], inputs[2], inputs[3], op->get_center(), op->get_normalized());
-    p.add_primitive(*op, prim);
-} else {
-    auto prim = cldnn::ISTFT(layer_type_name_ID(op), inputs[0], inputs[1], inputs[2], inputs[3], inputs[4], op->get_center(), op->get_normalized());
-    p.add_primitive(*op, prim);
-}
+auto prim = cldnn::ISTFT(layer_type_name_ID(op), inputs, op->get_center(), op->get_normalized());
+p.add_primitive(*op, prim);
```
```cpp
@@ -0,0 +1,27 @@
// Copyright (C) 2018-2025 Intel Corporation
```
nit:

```diff
-// Copyright (C) 2018-2025 Intel Corporation
+// Copyright (C) 2025 Intel Corporation
```
Basic implementation of ISTFT for GPU.
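For background (standard textbook form, not taken from this PR): the inverse STFT reconstructs the time-domain signal by inverse-transforming each frame and overlap-adding with window normalization. With hop size $H$, window $w$, and $x_m$ the inverse DFT of frame $m$:

```latex
% Standard weighted overlap-add reconstruction; notation assumed, not from the PR.
x[n] = \frac{\sum_m w[n - mH]\, x_m[n - mH]}{\sum_m w^2[n - mH]}
```

The accumulation in the numerator is consistent with the fill-before-kernel step discussed above: overlap-add presumably sums frame contributions into pre-cleared output memory.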
Details:
Tickets: