-
Notifications
You must be signed in to change notification settings - Fork 285
Description
Is there an existing issue for the same bug?
- I have checked the existing issues.
Branch Name
3.0-dev
Commit ID
Other Environment Information
- Hardware parameters:
- OS type:
- Others:Actual Behavior
安利表ca_comprehensive_dataset_copy 第一次 diff dump csv到 stage 报错
Step: step5 diff copy against empty table
SQL/operation:
data branch diff anli_test.ca_comprehensive_dataset_copy against anli_test.ca_comprehensive_dataset_empty output file 'stage://stage01';
ERROR 1105 (HY000) at line 1: PUT https://ghs-test-1308875761.cos.ap-guangzhou.myqcloud.com/diff_ca_comprehensive_dataset_copy_ca_comprehensive_dataset_empty_20251208_101823.csv: 400 EntityTooSmall(Message: Your proposed upload is smaller than the minimum allowed object size., RequestId: NjkzNmE1ZWZfOTE0ZTdiMGJfY2MzYl83ZWExYmQy, TraceId: OGVmYzZiMmQzYjA2OWNhODk0NTRkMTBiOWVmMDAxODc0OWRkZjk0ZDM1NmI1M2E2MTRlY2MzZDhmNmI5MWI1OWRlZDk5YzgyOTg0ZTg2ODA1ODFjOGY0MWFhYWFhOTdmZTJkNmNmM2QwNmQ0YmMwODk3ZmVhMzE1ZTQzOGMwMjc=)
PUT https://ghs-test-130887
Expected Behavior
No response
Steps to Reproduce
ddl:
create database anli_test;use anli_test;CREATE TABLE if not exists ca_comprehensive_dataset (md5_id varchar(255) NOT NULL,question text DEFAULT NULL,answer json DEFAULT NULL,source_type varchar(255) DEFAULT NULL,content_type varchar(255) DEFAULT NULL,keyword varchar(255) DEFAULT NULL,question_vector vecf32(1024) DEFAULT NULL COMMENT '摘要的向量集',allow_access varchar(511) DEFAULT NULL,allow_identities varchar(512) DEFAULT NULL,delete_flag int DEFAULT NULL,created_at timestamp DEFAULT CURRENT_TIMESTAMP(),updated_at timestamp DEFAULT CURRENT_TIMESTAMP() ON UPDATE CURRENT_TIMESTAMP(),PRIMARY KEY (md5_id),KEY idx_comprehensive_allow_access (allow_access),KEY idx_comprehensive_allow_identities (allow_identities),KEY idx_comprehensive_content_type (content_type));
load data url s3option {'endpoint'='http://cos.ap-guangzhou.myqcloud.com','access_key_id'='AKIDxxx','secret_access_key'='pXGxxx','bucket'='mo-load-guangzhou-1308875761', 'filepath'='mo-big-data/ca_ai_ca_comprehensive_dataset.csv'} into table ca_comprehensive_dataset FIELDS TERMINATED BY ',' ENCLOSED BY '\"' LINES TERMINATED BY '\n' PARALLEL 'TRUE';
set experimental_ivf_index=1;create index idx_vec_question using ivfflat on anli_test.ca_comprehensive_dataset(question_vector) lists = 256 op_type 'vector_l2_ops';
create fulltext index idx_ft_question on anli_test.`ca_comprehensive_dataset`(`question`) WITH PARSER ngram;
create stage stage01 url = 's3://ghs-test-1308875761/' credentials = {"aws_key_id"='AKIDUxxx',"aws_secret_key"='pXGuxx',"AWS_REGION"='ap-guangzhou','PROVIDER'='Amazon','ENDPOINT'='http://cos.ap-guangzhou.myqcloud.com'};
./branch_self_diff.sh -h 172.16.47.147 -u dump -p 111 -export_dir_path 'stage://stage01' -tbl anli_test.ca_comprehensive_datasetAdditional information
No response