Add transpose operation for F-order data and fix validation for GPU by avolkov-intel · Pull Request #3665 · uxlfoundation/oneDAL

avolkov-intel · 2026-06-18T14:44:33Z

Description

Checklist:

Completeness and readability

I have commented my code, particularly in hard-to-understand areas.
I have updated the documentation to reflect the changes or created a separate PR with updates and provided its number in the description, if necessary.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have resolved any merge conflicts that might occur with the base branch.

Testing

I have run it locally and tested the changes extensively.
All CI jobs are green or I have provided justification why they aren't.
I have extended testing suite if new functionality was introduced in this PR.

Performance

I have measured performance for affected algorithms using scikit-learn_bench and provided at least a summary table with measured data, if performance change is expected.
I have provided justification why performance and/or quality metrics have changed or why changes are not expected.
I have extended the benchmarking suite and provided a corresponding scikit-learn_bench PR if new measurable functionality was introduced in this PR.

david-cortes-intel · 2026-06-19T06:34:40Z

@avolkov-intel What about logistic regression?

david-cortes-intel · 2026-06-22T06:39:29Z

@avolkov-intel @Vika-F Looks like the 'copy' function is not using omatcopy when doing transposes:

oneDAL/cpp/oneapi/dal/backend/primitives/ndarray.hpp, line 601 in 05bd0eb:

inline void copy(ndview<T1, 2, ord1>& dst, const ndview<T2, 2, ord2>& src) {

Perhaps that could be improved in a different PR. Omatcopy should do it in parallel and can have good speedups on CPU when transposing matrices.

For the non-transpose case with strides, it could also use 'lacpy' from MKL instead:
https://www.intel.com/content/www/us/en/docs/onemkl/developer-reference-fortran/2023-2/lacpy.html
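For readers unfamiliar with the two operations mentioned above, here is a minimal C++ sketch of what they compute. It is a plain reference version under stated assumptions, not the oneDAL `copy` implementation and not a call into oneMKL: `transpose_copy`, `strided_copy`, and `f_order_to_c_order` are hypothetical names introduced only for illustration, the tile size is arbitrary, and no parallelism is shown. It relies on the fact that an F-order `rows x cols` array has the same memory layout as a C-order `cols x rows` array, so converting between the two orders is an out-of-place transpose.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not a oneDAL or oneMKL function): out-of-place
// transpose of a row-major `rows x cols` matrix into a row-major
// `cols x rows` matrix. `ld_src` and `ld_dst` are the strides between
// consecutive rows of the source and destination.
template <typename T>
void transpose_copy(const T* src, std::size_t rows, std::size_t cols, std::size_t ld_src,
                    T* dst, std::size_t ld_dst) {
    assert(ld_src >= cols && ld_dst >= rows);
    constexpr std::size_t block = 32; // tile size, an arbitrary illustrative choice
    for (std::size_t i0 = 0; i0 < rows; i0 += block) {
        const std::size_t i1 = std::min(i0 + block, rows);
        for (std::size_t j0 = 0; j0 < cols; j0 += block) {
            const std::size_t j1 = std::min(j0 + block, cols);
            // Copy one tile so reads and writes both stay cache-friendly.
            for (std::size_t i = i0; i < i1; ++i) {
                for (std::size_t j = j0; j < j1; ++j) {
                    dst[j * ld_dst + i] = src[i * ld_src + j];
                }
            }
        }
    }
}

// Hypothetical helper: non-transposed copy of a `rows x cols` block between
// two row-major buffers with different row strides (the strided case that
// the comment suggests handing to a lacpy-style routine).
template <typename T>
void strided_copy(const T* src, std::size_t rows, std::size_t cols, std::size_t ld_src,
                  T* dst, std::size_t ld_dst) {
    assert(ld_src >= cols && ld_dst >= cols);
    for (std::size_t i = 0; i < rows; ++i) {
        std::copy_n(src + i * ld_src, cols, dst + i * ld_dst);
    }
}

// Convert a dense F-order (column-major) `rows x cols` matrix to C-order.
// The F-order buffer is read as a C-order `cols x rows` matrix and transposed.
template <typename T>
std::vector<T> f_order_to_c_order(const std::vector<T>& f_data, std::size_t rows, std::size_t cols) {
    assert(f_data.size() == rows * cols);
    std::vector<T> c_data(rows * cols);
    transpose_copy(f_data.data(), cols, rows, rows, c_data.data(), cols);
    return c_data;
}
```

Because the helpers are templates over `T`, this reference version works for integer as well as floating-point element types; an optimized library routine would typically cover only the floating-point types.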

Initial commit

3bca6de

david-cortes-intel reviewed Jun 19, 2026

Comment thread cpp/oneapi/dal/backend/primitives/utils.hpp Outdated

david-cortes-intel mentioned this pull request Jun 19, 2026

F-order dpnp arrays make Ridge GPU array API extremely slow uxlfoundation/scikit-learn-intelex#3235 (Open)

avolkov-intel added 2 commits June 19, 2026 08:36

Added dispatch between copy and omat to handle integer data types

bc7c5f4
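The commit message above describes dispatching between a generic copy and an omatcopy-style routine depending on the element type. One way such a dispatch could look is sketched below. This is an assumption about the general technique, not the code in commit bc7c5f4: `copy_path`, `select_transpose_path`, and `transpose_dispatch` are hypothetical names, and the floating-point branch uses a plain loop as a stand-in where a BLAS-like routine would be called.

```cpp
#include <cassert>
#include <cstddef>
#include <type_traits>

// Hypothetical tag describing which implementation was selected.
enum class copy_path { blas_like, generic };

// Compile-time choice: omatcopy-style routines are typically available only
// for floating-point types, so every other type takes the generic path.
template <typename T>
constexpr copy_path select_transpose_path() {
    if constexpr (std::is_same_v<T, float> || std::is_same_v<T, double>) {
        return copy_path::blas_like;
    }
    else {
        return copy_path::generic;
    }
}

// Transpose a dense row-major `rows x cols` matrix into a dense row-major
// `cols x rows` matrix and report which path handled it.
template <typename T>
copy_path transpose_dispatch(const T* src, std::size_t rows, std::size_t cols, T* dst) {
    assert(src != dst); // out-of-place only
    constexpr copy_path path = select_transpose_path<T>();
    if constexpr (path == copy_path::blas_like) {
        // Stand-in for a call to an optimized omatcopy-style routine;
        // a plain loop keeps this sketch self-contained.
        for (std::size_t i = 0; i < rows; ++i)
            for (std::size_t j = 0; j < cols; ++j)
                dst[j * rows + i] = src[i * cols + j];
    }
    else {
        // Generic element-wise fallback for integer and other types.
        for (std::size_t i = 0; i < rows; ++i)
            for (std::size_t j = 0; j < cols; ++j)
                dst[j * rows + i] = src[i * cols + j];
    }
    return path;
}
```

Resolving the branch with `if constexpr` means the optimized call is never instantiated for types it does not support, so integer data compiles without needing an overload of the BLAS-like routine.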

remove omatcopy dependency

2b7340e

avolkov-intel wants to merge 3 commits into uxlfoundation:main from avolkov-intel:dev/f-order-optimization
