Skip to content

[Improvement] Improve the current SQL of JDBCBackend #4024

Open
@jerryshao

Description

@jerryshao

What would you like to be improved?

In the current implementation of xxxMetaMapper and xxxMetaService, we seldom use JOIN and transactions, for example:

image

The list catalog operation will issue two SQL queries, the first one is to get a metalake id, then using this id to get the catalog list.

Instead of issuing two queries, we can use JOIN to join catalog and metalake table to get catalog list by one query, this will save the IO time and avoid inconsistent problem.

Also for updateCatalog

image

We have several steps:

  1. get metalake id.
  2. get catalog PO.
  3. update the catalog object.
  4. update the catalog in RDBMS.

These 4 steps are not in the transaction, which will potentially meet the inconsistent problem, and we highly leverage the lock to avoid the concurrent problem. A better solution is to put these 4 steps into a transaction.

How should we improve?

So basically, we can improve the current SQL to:

  1. reduce the queries and increase the performance.
  2. Put multiple step actions into one transaction to leverage DB's transaction to keep consistent.

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

No one assigned

    Labels

    help wantedExtra attention is neededimprovementImprovements on everything

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions