feat(joblist,sorting): MCOL-3758 new ORDER BY algo #3790

Draft
drrtuy wants to merge 59 commits into mariadb-corporation:stable-23.10 from drrtuy:feat/MCOL-3758-sorting-p2-15

Conversation

@drrtuy
Collaborator

@drrtuy commented Sep 23, 2025

No description provided.

Roman Nozdrin and others added 30 commits September 21, 2025 19:18
@drrtuy changed the title from "feat(joblist,sorting): new ORDER BY algo" to "feat(joblist,sorting): MCOL-3758 new ORDER BY algo" on Sep 23, 2025
@mariadb-LeonidFedorov
Collaborator

/gemini review

1 similar comment
@mariadb-LeonidFedorov
Collaborator

/gemini review

Contributor

@gemini-code-assist (Bot) left a comment


Code Review

This pull request introduces new sorting algorithms, including PDQ sort and a heap-based merge sort, to enhance ORDER BY performance, particularly for large datasets. The changes are substantial and well-structured, with new sorting implementations, a factory method in TupleAnnexStep to select the appropriate algorithm, and supporting data structures. While this is a valuable feature, I've identified a couple of critical issues that need to be addressed before merging. Specifically, there's a memory leak in HeapOrderBy due to incorrect std::unique_ptr usage and a significant bug in the sorting key generation for unsigned integers that will lead to incorrect results. I have also included several medium-severity recommendations to improve code quality and maintainability.

*pos++ = (!isAsc) ? static_cast<uint8_t>(!isNotNull) : static_cast<uint8_t>(isNotNull);
std::memcpy(pos, valueBuf, sizeof(StorageType));
StorageType* valPtr = reinterpret_cast<StorageType*>(pos);
*valPtr ^= 0x80;
Contributor

critical

The logic for creating a sortable key for unsigned integers is incorrect. The code performs a sign-bit flip (*valPtr ^= 0x80;), which is a technique for signed integers to make them comparable as unsigned. For unsigned integers, this will produce an incorrect sort order. For example, for uint8_t, 128 (binary 10000000) would become 0, and 0 would become 128, causing 128 to be sorted before 0 in an ascending sort. This line should be removed. This issue also exists for USMALLINT, UINT, and UBIGINT types in this file.

joblist::OrderByKeysType jobListorderByRGColumnIDs_;
rowgroup::RowGroup rg_;
std::unique_ptr<joblist::MemManager> mm_;
std::unique_ptr<uint8_t> keyBuf_;
Contributor

critical

keyBuf_ is declared as std::unique_ptr<uint8_t>, but it's allocated as an array using new uint8_t[...] in heaporderby.cpp. This will lead to a memory leak because the default deleter for std::unique_ptr<T> calls delete instead of delete[], deallocating only the first element of the array. To fix this, you should declare keyBuf_ as std::unique_ptr<uint8_t[]>.

Suggested change
std::unique_ptr<uint8_t> keyBuf_;
std::unique_ptr<uint8_t[]> keyBuf_;

Comment thread datatypes/mcs_datatype.h
Comment on lines +184 to 192
// CHAR_1BYTE,
// CHAR_2BYTES,
// CHAR_4BYTES,
// CHAR_8BYTES,
// VARCHAR_1BYTE,
// VARCHAR_2BYTES,
// VARCHAR_4BYTES,
// VARCHAR_8BYTES,
};
Contributor

medium

These enum values are commented out. If they are not intended for use in the near future, they should be removed to avoid cluttering the code and confusing future maintainers.

Comment thread datatypes/mcs_int128.h
{
return s128Value == static_cast<int128_t>(x);
}
inline bool operator==(const TSInt128 x) const
Contributor

medium

The operator== takes its TSInt128 parameter by value. For a 128-bit type, this causes an unnecessary copy. It would be more efficient to pass it by const reference.

  inline bool operator==(const TSInt128& x) const

Comment on lines +30 to +32
KeyType::KeyType(rowgroup::RowGroup& rg, const joblist::OrderByKeysType& colsAndDirection,
const sorting::PermutationType p, uint8_t* buf)
{
Contributor

medium

The KeyType constructor is very long and contains a lot of duplicated code for handling different integer types (e.g., TINYINT, SMALLINT, INT, BIGINT and their unsigned versions). This reduces maintainability. Consider refactoring this into a template function that can handle all integer types, parameterized by the type and size, to reduce code duplication.


auto end = std::chrono::steady_clock::now();
std::chrono::duration<double> elapsed_seconds = end - start;
std::cout << "keys " + std::to_string(id) + " elapsed time: " << elapsed_seconds.count() << "s\n";
Contributor

medium

There are several std::cout statements used for performance logging throughout this file (and others in this PR). These should be removed or replaced with a proper logging framework before merging into a main branch to avoid polluting the output.

utils::setThreadName(threadName.c_str());
// std::cout << "enter " << threadName << std::endl;
RGData rgDataOut;
sorting::PermutationVec perm;
Contributor

medium

The perm vector is populated inside a loop using perm.insert(). This can be inefficient as it may lead to multiple reallocations if the final size is large. It would be more performant to first calculate the total number of elements that will be inserted, call perm.reserve() to allocate the required memory at once, and then use perm.insert() or std::copy.

@mariadb-LeonidFedorov
Collaborator

/gemini review

Contributor

@gemini-code-assist (Bot) left a comment


Code Review

This pull request introduces a new parallel sorting mechanism for ORDER BY operations, leveraging PDQOrderBy for initial per-thread sorting and HeapOrderBy for merging results. Key changes include adding cstdint for integral type definitions, refining decimal width constants, and implementing TSInt128 comparison. The core sorting logic in heaporderby.cpp and pdqorderby.cpp handles various data types, null values, and byte-swapping for correct ordering. The TupleAnnexStep is updated to integrate these new sorting strategies, dynamically choosing between single-threaded LimitedOrderBy, single-threaded PDQOrderBy, or a two-phase parallel PDQOrderBy and HeapOrderBy approach based on the number of sorting threads and data size.

Review comments highlight: a critical memory leak in HeapOrderBy due to incorrect unique_ptr usage for arrays; potential complexity and bugs in ranges_ vector population; inconsistent formatting in template specializations; significant code duplication in integer type handling within heaporderby.cpp; inefficient use of std::mutex for a counter that could be atomic; and performance concerns with perm.insert() in a loop.

joblist::OrderByKeysType jobListorderByRGColumnIDs_;
rowgroup::RowGroup rg_;
std::unique_ptr<joblist::MemManager> mm_;
std::unique_ptr<uint8_t> keyBuf_;
Contributor

critical

This unique_ptr is managing a dynamically allocated array created with new[] in heaporderby.cpp. However, it is not using the array form of unique_ptr (std::unique_ptr<uint8_t[]>). This will result in delete being called instead of delete[], leading to a memory leak and undefined behavior. This is a critical issue that must be fixed.

Suggested change
std::unique_ptr<uint8_t> keyBuf_;
std::unique_ptr<uint8_t[]> keyBuf_;

Comment on lines +503 to +508
ranges_.push_back({ImpossiblePermute});
ranges_.back().reserve(rangeSize);
if (rangeSize)
{
ranges_.back().insert(ranges_.back().end(), srcPermBegin, srcPermEnd);
}
Contributor

high

The logic for populating the ranges_ vector seems overly complex and potentially buggy. If rangeSize > 0, the PermutationVec will contain ImpossiblePermute at the beginning, followed by the actual permutations from srcPermBegin. This is likely not the intended behavior. A cleaner and more correct approach would be to create the vector with the correct elements from the start.

    if (rangeSize)
    {
      ranges_.emplace_back(srcPermBegin, srcPermEnd);
    }
    else
    {
      ranges_.push_back({ImpossiblePermute});
    }

Comment thread datatypes/mcs_datatype.h
Comment on lines +184 to +191
// CHAR_1BYTE,
// CHAR_2BYTES,
// CHAR_4BYTES,
// CHAR_8BYTES,
// VARCHAR_1BYTE,
// VARCHAR_2BYTES,
// VARCHAR_4BYTES,
// VARCHAR_8BYTES,
Contributor

medium

These commented-out enum values seem to be for future work. Leaving commented-out code can be confusing for future maintenance. If these are not yet implemented, it would be better to remove them and add them back in a future PR when they are used. If they are intended as placeholders, a comment explaining their purpose would be helpful.

Comment thread datatypes/mcs_datatype.h
Comment on lines +584 to +601
struct ColDataTypeToIntegralType<SystemCatalog::TINYINT>
: _ColDataTypeToIntegralType<SystemCatalog::TINYINT, int8_t>
{
};
template <>
struct ColDataTypeToIntegralType<SystemCatalog::SMALLINT>
: _ColDataTypeToIntegralType<SystemCatalog::SMALLINT, int16_t>
{
};
template <>
struct ColDataTypeToIntegralType<SystemCatalog::MEDINT>
: _ColDataTypeToIntegralType<SystemCatalog::MEDINT, int32_t>
{
};
template <>
struct ColDataTypeToIntegralType<SystemCatalog::INT> : _ColDataTypeToIntegralType<SystemCatalog::INT, int32_t>
{
};
Contributor

medium

The formatting of the template specializations is inconsistent. In some cases, the base class is on a new line (e.g., line 585), while in others it's on the same line (e.g., line 599). Adopting a consistent style would improve readability. For example, placing the base class on a new line for all specializations.

Comment on lines +186 to +304
case execplan::CalpontSystemCatalog::TINYINT:
{
using StorageType =
datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::UTINYINT>::type;
const uint8_t* valueBuf = rg.getColumnValueBuf(columnId, p.rowID);
const uint8_t* nullValuePtr = reinterpret_cast<const uint8_t*>(&joblist::TINYINTNULL);
bool isNotNull = memcmp(nullValuePtr, valueBuf, columnWidth) != 0;
*pos++ = (!isAsc) ? static_cast<uint8_t>(!isNotNull) : static_cast<uint8_t>(isNotNull);
std::memcpy(pos, valueBuf, sizeof(StorageType));
StorageType* valPtr = reinterpret_cast<StorageType*>(pos);
*valPtr ^= 0x80;
*valPtr = (!isAsc) ? ~*valPtr : *valPtr;
pos += columnWidth;
break;
}
case execplan::CalpontSystemCatalog::SMALLINT:
{
using StorageType =
datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::SMALLINT>::type;
const uint8_t* valueBuf = rg.getColumnValueBuf(columnId, p.rowID);
const uint8_t* nullValuePtr = reinterpret_cast<const uint8_t*>(&joblist::SMALLINTNULL);
bool isNotNull = memcmp(nullValuePtr, valueBuf, columnWidth) != 0;
*pos++ = (!isAsc) ? static_cast<uint8_t>(!isNotNull) : static_cast<uint8_t>(isNotNull);
std::memcpy(pos, valueBuf, sizeof(StorageType));
StorageType* valPtr = reinterpret_cast<StorageType*>(pos);
*valPtr ^= 0x8000;
*valPtr = (!isAsc) ? ~htons(*valPtr) : htons(*valPtr);
pos += columnWidth;
break;
}
case execplan::CalpontSystemCatalog::INT:
{
using StorageType = datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::INT>::type;
const uint8_t* valueBuf = rg.getColumnValueBuf(columnId, p.rowID);
const uint8_t* nullValuePtr = reinterpret_cast<const uint8_t*>(&joblist::INTNULL);
bool isNotNull = memcmp(nullValuePtr, valueBuf, columnWidth) != 0;
*pos++ = (!isAsc) ? static_cast<uint8_t>(!isNotNull) : static_cast<uint8_t>(isNotNull);
std::memcpy(pos, valueBuf, sizeof(StorageType));
StorageType* valPtr = reinterpret_cast<StorageType*>(pos);
*valPtr ^= 0x80000000;
*valPtr = (!isAsc) ? ~htonl(*valPtr) : htonl(*valPtr);
pos += columnWidth;
break;
}
case execplan::CalpontSystemCatalog::BIGINT:
{
using StorageType =
datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::BIGINT>::type;
const uint8_t* valueBuf = rg.getColumnValueBuf(columnId, p.rowID);
const uint8_t* nullValuePtr = reinterpret_cast<const uint8_t*>(&joblist::BIGINTNULL);
bool isNotNull = memcmp(nullValuePtr, valueBuf, columnWidth) != 0;
*pos++ = (!isAsc) ? static_cast<uint8_t>(!isNotNull) : static_cast<uint8_t>(isNotNull);
std::memcpy(pos, valueBuf, sizeof(StorageType));
StorageType* valPtr = reinterpret_cast<StorageType*>(pos);
*valPtr ^= 0x8000000000000000;
*valPtr = (!isAsc) ? ~htonll(*valPtr) : htonll(*valPtr);
pos += columnWidth;
break;
}
case execplan::CalpontSystemCatalog::UTINYINT:
{
using StorageType =
datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::UTINYINT>::type;
const uint8_t* valueBuf = rg.getColumnValueBuf(columnId, p.rowID);
const uint8_t* nullValuePtr = reinterpret_cast<const uint8_t*>(&joblist::UTINYINTNULL);
bool isNotNull = memcmp(nullValuePtr, valueBuf, columnWidth) != 0;
*pos++ = (!isAsc) ? static_cast<uint8_t>(!isNotNull) : static_cast<uint8_t>(isNotNull);
std::memcpy(pos, valueBuf, sizeof(StorageType));
StorageType* valPtr = reinterpret_cast<StorageType*>(pos);
*valPtr ^= 0x80;
*valPtr = (!isAsc) ? ~*valPtr : *valPtr;
pos += columnWidth;
break;
}

case execplan::CalpontSystemCatalog::USMALLINT:
{
using StorageType =
datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::USMALLINT>::type;
const uint8_t* valueBuf = rg.getColumnValueBuf(columnId, p.rowID);
const uint8_t* nullValuePtr = reinterpret_cast<const uint8_t*>(&joblist::USMALLINTNULL);
bool isNotNull = memcmp(nullValuePtr, valueBuf, columnWidth) != 0;
*pos++ = (!isAsc) ? static_cast<uint8_t>(!isNotNull) : static_cast<uint8_t>(isNotNull);
std::memcpy(pos, valueBuf, sizeof(StorageType));
StorageType* valPtr = reinterpret_cast<StorageType*>(pos);
*valPtr ^= 0x8000;
*valPtr = (!isAsc) ? ~htons(*valPtr) : htons(*valPtr);
pos += columnWidth;
break;
}
case execplan::CalpontSystemCatalog::UINT:
{
using StorageType = datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::UINT>::type;
const uint8_t* valueBuf = rg.getColumnValueBuf(columnId, p.rowID);
const uint8_t* nullValuePtr = reinterpret_cast<const uint8_t*>(&joblist::UINTNULL);
bool isNotNull = memcmp(nullValuePtr, valueBuf, columnWidth) != 0;
*pos++ = (!isAsc) ? static_cast<uint8_t>(!isNotNull) : static_cast<uint8_t>(isNotNull);
std::memcpy(pos, valueBuf, sizeof(StorageType));
StorageType* valPtr = reinterpret_cast<StorageType*>(pos);
*valPtr ^= 0x80000000;
*valPtr = (!isAsc) ? ~htonl(*valPtr) : htonl(*valPtr);
pos += columnWidth;
break;
}
case execplan::CalpontSystemCatalog::UBIGINT:
{
using StorageType =
datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::UBIGINT>::type;
const uint8_t* valueBuf = rg.getColumnValueBuf(columnId, p.rowID);
const uint8_t* nullValuePtr = reinterpret_cast<const uint8_t*>(&joblist::UBIGINTNULL);
bool isNotNull = memcmp(nullValuePtr, valueBuf, columnWidth) != 0;
*pos++ = (!isAsc) ? static_cast<uint8_t>(!isNotNull) : static_cast<uint8_t>(isNotNull);
std::memcpy(pos, valueBuf, sizeof(StorageType));
StorageType* valPtr = reinterpret_cast<StorageType*>(pos);
*valPtr ^= 0x8000000000000000;
*valPtr = (!isAsc) ? ~htonll(*valPtr) : htonll(*valPtr);
pos += columnWidth;
break;
}
Contributor

medium

There is significant code duplication in the switch statement for handling various integer types (TINYINT, SMALLINT, INT, BIGINT, and their unsigned counterparts). The logic for building the sort key for these types is nearly identical, differing only in type names, null values, and byte-swapping functions. This could be refactored into a template helper function to reduce duplication and improve maintainability.

Comment on lines +934 to +984
parallelOrderByMutex_.lock();
++fFinishedThreads;
if (fFinishedThreads == fMaxThreads)
{
// WIP Replace this vector with vector of RGDatas?
// INV: firstPhaseThreads ref scope is shorter than TAS scope.
const sorting::SortingThreads& firstPhaseThreads = firstPhaseflatOrderBys_;

size_t numberOfSortingWithData = std::accumulate(firstPhaseThreads.begin(), firstPhaseThreads.end(), 0,
[](size_t acc, const sorting::PDQSortingThread& sorting)
{ return acc + !sorting->getRGDatas().empty(); });
if (numberOfSortingWithData <= 1)
{
RGData rgDataOut;
rowgroup::RowGroup rowGroupOut{fRowGroupOut};
for (const sorting::PDQSortingThread& sorting : firstPhaseThreads)
{
if (!sorting->getRGDatas().empty())
{
while (sorting->getData(rgDataOut, firstPhaseflatOrderBys_) && !cancelled())
{
rowGroupOut.setData(&rgDataOut);
auto rows = rowGroupOut.getRowCount();
if (rows > 0)
{
fOutputDL->insert(rgDataOut);
}
}
}
}
fOutputDL->endOfInput();
}
else
{
auto perThreadRangesMatrix = calculatePivots4phase2(firstPhaseThreads);
assert(perThreadRangesMatrix.size() == fMaxThreads);
// push 1st of the 2nd phase threads as HeapOrderBy with the first range
fRunnersList.push_back(jobstepThreadPool.invoke(
[this, perThreadRangesMatrix, &firstPhaseThreads]()
{ this->finalizeHeapOrderBy(1, perThreadRangesMatrix[0], firstPhaseThreads); }));
for (uint32_t i = 2; i <= fMaxThreads; ++i)
{
fRunnersList.push_back(jobstepThreadPool.invoke(
[this, i, perThreadRangesMatrix, &firstPhaseThreads]()
{ this->finalizePDQOrderBy(i, perThreadRangesMatrix, firstPhaseThreads); }));
}
// Merge finalizer threads' rgDatas from separate outputDLs into a common outputDL
fRunnersList.push_back(jobstepThreadPool.invoke([this]() { this->joinOutputDLs(); }));
}
}
parallelOrderByMutex_.unlock();
Contributor

medium

Using std::mutex to protect a simple counter (fFinishedThreads) is less efficient than using an atomic variable. The lock is held while calculatePivots4phase2 and other operations are performed, which could be a significant amount of time. Consider changing fFinishedThreads to a std::atomic<size_t> and using an atomic fetch-and-add operation to avoid the overhead of locking.

// std::cout << threadName << " empty perm range" << std::endl;
continue;
}
perm.insert(perm.end(), srcPermBegin, srcPermEnd);
Contributor

medium

The perm.insert() call is inside a loop, which can lead to multiple reallocations and copies of the vector's contents, impacting performance. It would be more efficient to pre-calculate the total number of elements to be inserted and call perm.reserve() once before the loop to allocate the required memory upfront.

Contributor

@gemini-code-assist (Bot) left a comment


Code Review

This pull request introduces a new two-phase parallel sorting mechanism using a modified quicksort (PDQOrderBy) and a heap-based merge (HeapOrderBy) for handling large datasets and parallel execution. The changes include adding new PDQOrderBy and HeapOrderBy classes, integrating them into the TupleAnnexStep for query execution, and updating JobInfo to support the new sorting types. The TupleAnnexStep now dynamically selects between LimitedOrderBy, PDQOrderBy (single-threaded), or a two-phase parallel PDQOrderBy and HeapOrderBy based on the limit and thread configuration.

Several critical issues were identified in the new sorting logic: incorrect type handling for string columns in HeapOrderBy and PDQOrderBy, potential out-of-bounds access in modified_pdqsort, uninitialized KeyType objects in HeapOrderBy, and a double function call in modified_pdqsort. Concurrency issues with fRowsReturned and fFinishedThreads were also noted, along with several WIP comments indicating incomplete functionality for string sorting, null handling, and LIMIT/OFFSET in the parallel sorting paths. Additionally, the test for PDQOrderBy was found to be incorrectly configured to test LimitedOrderBy instead.

Comment thread tests/pqorderby-test.cpp
);
rowgroup::RowGroup jobInfoRG(inRG);
joblist::TupleAnnexStep tns = joblist::TupleAnnexStep(jobInfo);
tns.addOrderBy(new joblist::LimitedOrderBy());
Contributor

critical

The test file pqorderby-test.cpp is intended to test PDQOrderBy, but tns.addOrderBy(new joblist::LimitedOrderBy()); is being called, which initializes LimitedOrderBy. This means the test is not actually exercising the PDQOrderBy functionality as intended by the file name. This is a critical mismatch and needs to be corrected to test PDQOrderBy.

}
else
mod_sort3(begin + s2, begin, end - 1, perm_begin + s2, perm_begin, perm_end - 1, comp);

Contributor

critical

The line mod_partition_right(begin, end, perm_begin, perm_end, comp); is called twice: once as part of the structured binding assignment and then immediately again as a standalone call. This is a bug; the function should only be called once to avoid redundant work and potential side effects.

while (first < last && --perm_last != perm_last_2 && !comp(*--last, pivot))
;
else
while (--perm_last != perm_last_2 && !comp(*--last, pivot))
Contributor

critical

The condition --perm_last != perm_last_2 allows perm_last to potentially go two elements past perm_end (in the reverse direction). This is an out-of-bounds access and will lead to undefined behavior. The loop condition should ensure perm_last stays within valid bounds, typically perm_last >= perm_begin.

Comment on lines +334 to +380
else if (!sorting::isDictColumn(columnType, columnWidth))
{
switch (columnWidth)
{
case 1:
{
using StorageType =
datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::TINYINT>::type;
value = rg.getColumnValue<execplan::CalpontSystemCatalog::VARCHAR, StorageType, EncodedKeyType>(
columnId, p.rowID);
*pos++ = (!isAsc) ? 0 : 1;
break;
}

case 2:
{
using StorageType =
datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::SMALLINT>::type;
value = rg.getColumnValue<execplan::CalpontSystemCatalog::VARCHAR, StorageType, EncodedKeyType>(
columnId, p.rowID)
.rtrimZero();
*pos++ = (!isAsc) ? 0 : 1;
break;
};

case 4:
{
using StorageType =
datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::INT>::type;
value = rg.getColumnValue<execplan::CalpontSystemCatalog::VARCHAR, StorageType, EncodedKeyType>(
columnId, p.rowID)
.rtrimZero();
*pos++ = (!isAsc) ? 0 : 1;
break;
};

case 8:
{
using StorageType =
datatypes::ColDataTypeToIntegralType<execplan::CalpontSystemCatalog::BIGINT>::type;
value = rg.getColumnValue<execplan::CalpontSystemCatalog::VARCHAR, StorageType, EncodedKeyType>(
columnId, p.rowID)
.rtrimZero();
*pos++ = (!isAsc) ? 0 : 1;
break;
};
default: idbassert(0);
Contributor

critical

This switch (columnWidth) block is within the VARCHAR, CHAR, TEXT case, but it attempts to use datatypes::ColDataTypeToIntegralType for these string types. This is fundamentally incorrect as VARCHAR, CHAR, and TEXT are string types, not integral types. This will lead to type mismatches, undefined behavior, or crashes. String types should be handled using string comparison functions, potentially with collation, not by reinterpreting their bytes as integers.

// Find the first element strictly smaller than the pivot. We have to guard this search if
// there was no element before *first.
if (first - 1 == begin)
while (first < last && --perm_last != perm_last_2 && !comp(*--last, pivot))
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

The condition --perm_last != perm_last_2 allows perm_last to potentially go two elements past perm_end (in the reverse direction). This is an out-of-bounds access and will lead to undefined behavior. The loop condition should ensure perm_last stays within valid bounds, typically perm_last >= perm_begin.

Comment thread utils/rowgroup/rowgroup.h
Comment on lines +1678 to +1682
size_t offset2stringStoreOffset =
RowGroup::getHeaderSize() + rowID * getRowSize() + getOffsetArray()[columnID];
// bool isNull = strings->isNullValue(offset2stringStoreOffset);
// if (isNull)
// return nullValue;
Contributor

medium

The commented-out code for null value handling (// bool isNull = strings->isNullValue(offset2stringStoreOffset); // if (isNull) // return nullValue;) suggests that null values might not be correctly handled for IsVariadicType. If null strings are possible, this logic should be re-enabled or an alternative null handling mechanism should be in place to prevent incorrect behavior.

Comment thread utils/rowgroup/rowgroup.h
// bool isNull = strings->isNullValue(offset2stringStoreOffset);
// if (isNull)
// return nullValue;
// check the out of bounds invariant somehow
Contributor

medium

The comment // check the out of bounds invariant somehow indicates a missing bounds check. Accessing data[valueOffset] without proper bounds checking can lead to buffer overflows or crashes. This is a medium-severity correctness issue.

Comment thread utils/rowgroup/rowgroup.h
Comment on lines +1692 to +1696
size_t offset2stringStoreOffset =
RowGroup::getHeaderSize() + rowID * getRowSize() + getOffsets()[columnID];
// bool isNull = strings->isNullValue(offset2stringStoreOffset);
// if (isNull)
// return nullValue;
Contributor

medium

The commented-out code for null value handling (// bool isNull = strings->isNullValue(offset2stringStoreOffset); // if (isNull) // return nullValue;) suggests that null values might not be correctly handled for IsShortString. If null strings are possible, this logic should be re-enabled or an alternative null handling mechanism should be in place to prevent incorrect behavior.

Comment thread utils/rowgroup/rowgroup.h
// bool isNull = strings->isNullValue(offset2stringStoreOffset);
// if (isNull)
// return nullValue;
// check the out of bounds invariant somehow
Contributor

medium

The comment // check the out of bounds invariant somehow indicates a missing bounds check. Accessing data[valueOffset] without proper bounds checking can lead to buffer overflows or crashes. This is a medium-severity correctness issue.

Comment thread utils/rowgroup/rowgroup.h
assert(data);
size_t valueOffset = RowGroup::getHeaderSize() + getOffsets()[columnID] + rowID * getRowSize();
// check the out of bounds invariant somehow
const char* valuePtr = reinterpret_cast<const char*>(&data[valueOffset]);
Contributor

medium

The comment // check the out of bounds invariant somehow indicates a missing bounds check. Accessing data[valueOffset] without proper bounds checking can lead to buffer overflows or crashes. This is a medium-severity correctness issue.
