Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Fix][Connector-V2][OceanBase] oceanbase vector support simple vector index #9072

Merged
merged 8 commits into from
Apr 2, 2025

Conversation

xxsc0529
Copy link
Contributor

Purpose of this pull request

ConstraintType class add VECTOR_INDEX_KEY enum,The error is reported in the buildConstrainKeySql method of OceanBaseMysqlCreateTableSqlBuilder. Currently, the vector index supported by oceanbase is very limited. It is not a good solution to not add an index under a large amount of data. Here is an appendix to a default index implementation.
close #9056

Does this PR introduce any user-facing change?

none

How was this patch tested?

seatunnel-e2e/seatunnel-connector-v2-e2e/connector-jdbc-e2e/connector-jdbc-e2e-part-2/src/test/java/org/apache/seatunnel/connectors/seatunnel/jdbc/JdbcOceanBaseMilvusIT.java

Check list

Copy link
Member

@Hisoka-X Hisoka-X left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @xxsc0529 , thanks for the pr. Could you explain why test case execute normally before this patch. Does the change not be covered by test case?

@hailin0 hailin0 requested a review from Copilot March 31, 2025 04:22
Copy link

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR introduces support for simple vector indexing for OceanBase by updating constraint types and vector type handling while adding new tests for this functionality.

  • Updated test setup in JdbcOceanBaseMilvusIT to configure OceanBase vector memory and added a new test case for handling non-existent tables.
  • Modified vector type recognition and naming in OceanBaseMySqlTypeConverter.
  • Enhanced the create table SQL builder to support the VECTOR INDEX constraint and adjusted catalog column type mapping.

Reviewed Changes

Copilot reviewed 4 out of 5 changed files in this pull request and generated 1 comment.

File Description
seatunnel-e2e/seatunnel-connector-v2-e2e/connector-jdbc-e2e-part-2/src/test/java/org/apache/seatunnel/connectors/seatunnel/jdbc/JdbcOceanBaseMilvusIT.java Added test methods and configuration changes for OceanBase vector indexing.
seatunnel-connectors-v2/connector-jdbc/src/main/java/org/apache/seatunnel/connectors/seatunnel/jdbc/internal/dialect/oceanbase/OceanBaseMySqlTypeConverter.java Updated vector column type handling for case sensitivity and naming.
seatunnel-connectors-v2/connector-jdbc/src/main/java/org/apache/seatunnel/connectors/seatunnel/jdbc/catalog/oceanbase/OceanBaseMysqlCreateTableSqlBuilder.java Implemented VECTOR_INDEX_KEY SQL generation with specific index options.
seatunnel-connectors-v2/connector-jdbc/src/main/java/org/apache/seatunnel/connectors/seatunnel/jdbc/catalog/oceanbase/OceanBaseMySqlCatalog.java Normalized column type naming to support VECTOR types.
Files not reviewed (1)
  • seatunnel-e2e/seatunnel-connector-v2-e2e/connector-jdbc-e2e/connector-jdbc-e2e-part-2/pom.xml: Language not supported
Comments suppressed due to low confidence (2)

seatunnel-connectors-v2/connector-jdbc/src/main/java/org/apache/seatunnel/connectors/seatunnel/jdbc/internal/dialect/oceanbase/OceanBaseMySqlTypeConverter.java:300

  • [nitpick] Consider applying toUpperCase() consistently for both the start and end checks to ensure robust case-insensitive matching of the entire column type string.
if (columnType.toUpperCase().startsWith("VECTOR(") && columnType.endsWith(")")) {

seatunnel-e2e/seatunnel-connector-v2-e2e/connector-jdbc-e2e/connector-jdbc-e2e-part-2/src/test/java/org/apache/seatunnel/connectors/seatunnel/jdbc/JdbcOceanBaseMilvusIT.java:422

  • [nitpick] Consider adding JavaDoc comments to the setObVectorMemory() method to explain its purpose and usage in setting up OceanBase vector memory for tests.
public void setObVectorMemory() {

@xxsc0529
Copy link
Contributor Author

Hi @xxsc0529 , thanks for the pr. Could you explain why test case execute normally before this patch. Does the change not be covered by test case?

add test case:testMilvusToOceanBaseNotTable
The oceanbase table can be automatically created. This test passes the test, which means that the upstream has an index, and the downstream can be aware and created an oceanabse table when processing it downstream, without an error.

@Hisoka-X
Copy link
Member

Hi @xxsc0529 , thanks for the pr. Could you explain why test case execute normally before this patch. Does the change not be covered by test case?

add test case:testMilvusToOceanBaseNotTable The oceanbase table can be automatically created. This test passes the test, which means that the upstream has an index, and the downstream can be aware and created an oceanabse table when processing it downstream, without an error.

image
I cannot found the create table log... Maybe something wrong in test case.

Copy link
Member

@Hisoka-X Hisoka-X left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @xxsc0529 ! LGTM except one minior problem.

…nnector-jdbc-e2e-part-2/src/test/java/org/apache/seatunnel/connectors/seatunnel/jdbc/JdbcOceanBaseMilvusIT.java

Co-authored-by: Jia Fan <[email protected]>
@xxsc0529
Copy link
Contributor Author

xxsc0529 commented Apr 2, 2025

Thanks @xxsc0529 ! LGTM except one minior problem.

It has been modified

…nnector-jdbc-e2e-part-2/src/test/java/org/apache/seatunnel/connectors/seatunnel/jdbc/JdbcOceanBaseMilvusIT.java
Hisoka-X
Hisoka-X previously approved these changes Apr 2, 2025
@Hisoka-X Hisoka-X merged commit 4140cd1 into apache:dev Apr 2, 2025
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Fix][Connector-V2][OceanBase] oceanbase vector support simple vector index
3 participants