[Improvement] Optimize fetching table by name identifier

### What would you like to be improved?

According to the CPU profiler,

<img width="1927" alt="Image" src="https://github.com/user-attachments/assets/6e932e4d-0526-4715-b86a-fe8c75806ed1" />

The following code is very time-consuming. We should merge the table-fetching logic as much as possible:
https://github.com/apache/gravitino/blob/1297713992dfd376fc2a6fba805a6cdee61c4373/core/src/main/java/org/apache/gravitino/storage/relational/service/TableMetaService.java#L80-L92


The SQL corresponding to `getColumnsByTableIdAndVersion` can be optimized:

https://github.com/apache/gravitino/blob/1297713992dfd376fc2a6fba805a6cdee61c4373/core/src/main/java/org/apache/gravitino/storage/relational/mapper/provider/base/TableColumnBaseSQLProvider.java#L28C17-L48


```shell
mysql> select
    ->   *
    -> from
    ->   table_column_version_info t1
    ->   inner join (
    ->     SELECT
    ->       column_id,
    ->       MAX(table_version) AS max_table_version
    ->     from
    ->       table_column_version_info
    ->     where
    ->       table_id = 2716478369449788787
    ->       and table_version <= 10
    ->       and deleted_at = 0
    ->     group by
    ->       column_id
    ->   ) t2 on t1.column_id = t2.column_id
    ->   AND t1.table_version = t2.max_table_version;
8 rows in set (0.28 sec)

mysql> select
    ->   *
    -> from
    ->   table_column_version_info t1
    ->   inner join (
    ->     SELECT
    ->       column_id,
    ->       MAX(table_version) AS max_table_version
    ->     from
    ->       table_column_version_info
    ->     where
    ->       table_id = 2716478369449788787
    ->       and table_version <= 10
    ->       and deleted_at = 0
    ->     group by
    ->       column_id
    ->   ) t2 on t1.column_id = t2.column_id
    ->   AND t1.table_version = t2.max_table_version
    ->   and table_id = 2716478369449788787;
8 rows in set (0.00 sec)
```

If we add a condition like `table_id = xxxx` to the outer query, **it becomes much more efficient: the query takes only a few milliseconds instead of 200+ milliseconds (see the two timings above).**
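As a rough illustration of the change, here is a minimal sketch of how the SQL string could be built with the extra outer predicate. The class name `ColumnQuerySketch`, the method name, and the MyBatis-style placeholders `#{tableId}` and `#{tableVersion}` are assumptions made for this example. They are not taken from `TableColumnBaseSQLProvider`, whose actual signature and column list may differ.

```java
/** Hypothetical sketch; not the actual Gravitino SQL provider. */
public final class ColumnQuerySketch {

  private ColumnQuerySketch() {}

  /**
   * Builds the "latest version of each column" query for one table.
   * The only difference from the slow query in this issue is the trailing
   * "WHERE t1.table_id = #{tableId}" on the outer table.
   *
   * @param tableName name of the column version table (assumed parameter)
   */
  public static String columnsByTableIdAndVersion(String tableName) {
    return "SELECT t1.* FROM " + tableName + " t1"
        + " JOIN ("
        // Inner query: newest version of each column up to the requested table version.
        + "SELECT column_id, MAX(table_version) AS max_table_version"
        + " FROM " + tableName
        + " WHERE table_id = #{tableId}"
        + " AND table_version <= #{tableVersion}"
        + " AND deleted_at = 0"
        + " GROUP BY column_id"
        + ") t2"
        + " ON t1.column_id = t2.column_id"
        + " AND t1.table_version = t2.max_table_version"
        // Extra predicate proposed in this issue: also restrict the outer table by table_id.
        + " WHERE t1.table_id = #{tableId}";
  }

  public static void main(String[] args) {
    System.out.println(columnsByTableIdAndVersion("table_column_version_info"));
  }
}
```

The sketch selects `t1.*` rather than `*` so that the join columns from `t2` are not returned twice. Whether the speedup comes from an index on `table_id` being used for the outer table has not been verified here; running `EXPLAIN` on both queries would confirm it.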


### How should we improve?

_No response_

Referenced code (`TableMetaService.java#L80-L92`):

    public TableEntity getTableByIdentifier(NameIdentifier identifier) {
      NameIdentifierUtil.checkTable(identifier);

      Long schemaId =
          CommonMetaService.getInstance().getParentEntityIdByNamespace(identifier.namespace());

      TablePO tablePO = getTablePOBySchemaIdAndName(schemaId, identifier.name());
      List<ColumnPO> columnPOs =
          TableColumnMetaService.getInstance()
              .getColumnsByTableIdAndVersion(tablePO.getTableId(), tablePO.getCurrentVersion());

      return POConverters.fromTableAndColumnPOs(tablePO, columnPOs, identifier.namespace());
    }


[Improvement] Optimize fetching table by name identifier #6638
