Conversation

@loserwang1024
Owner

No description provided.

JingsongLi and others added 30 commits April 6, 2023 16:33
…y and SinkFormatFactory


We improved the interfaces with the following changes:
1. Introduce a common interface DynamicTableSource.Context, make the Context of ScanTableSource and LookupTableSource extend it, and rename them to ScanContext and LookupContext, respectively
2. Change the parameter of ScanFormat.createScanFormat from ScanTableSource.Context to DynamicTableSource.Context
3. Rename ScanFormat.createScanFormat to DecodingFormat#createRuntimeDecoder()
4. Rename SinkFormat.createSinkFormat to EncodingFormat#createRuntimeEncoder()
5. Rename ScanFormatFactory to DecodingFormatFactory
6. Rename SinkFormatFactory to EncodingFormatFactory

This closes #12320
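
For orientation, a simplified sketch of the renamed interface shape (illustrative only, not the exact Flink signatures; the context type and the Object-typed data type parameter are stand-ins):

```java
// Simplified sketch of the renamed interfaces; not the exact Flink signatures.
public interface FormatInterfacesSketch {

    /** Common parent context; ScanContext and LookupContext extend it in Flink. */
    interface Context {}

    /** Formerly ScanFormat (its factory, ScanFormatFactory, becomes DecodingFormatFactory). */
    interface DecodingFormat<I> {
        // formerly ScanFormat.createScanFormat(ScanTableSource.Context, ...)
        I createRuntimeDecoder(Context context, Object producedDataType);
    }

    /** Formerly SinkFormat (its factory, SinkFormatFactory, becomes EncodingFormatFactory). */
    interface EncodingFormat<O> {
        // formerly SinkFormat.createSinkFormat(...)
        O createRuntimeEncoder(Context context, Object consumedDataType);
    }
}
```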
… format


The JSON format currently parses and generates timestamps using RFC-3339, which is not the SQL standard. This commit changes the default behavior to parse/generate timestamps according to the SQL standard. It also introduces an option "json.timestamp-format.standard" to allow falling back to the ISO-8601 standard.

This closes #12661
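
A minimal usage sketch of the new option, assuming it accepts the values 'SQL' (the new default) and 'ISO-8601' (the fallback); the table name, schema, and connector options are made up:

```java
import org.apache.flink.table.api.EnvironmentSettings;
import org.apache.flink.table.api.TableEnvironment;

public class JsonTimestampStandardExample {
    public static void main(String[] args) {
        TableEnvironment tEnv =
                TableEnvironment.create(EnvironmentSettings.newInstance().inStreamingMode().build());

        // Hypothetical table; the interesting part is the format option that switches
        // timestamp parsing back to ISO-8601 instead of the new SQL default.
        tEnv.executeSql(
                "CREATE TABLE events (\n"
                        + "  id BIGINT,\n"
                        + "  ts TIMESTAMP(3)\n"
                        + ") WITH (\n"
                        + "  'connector' = 'kafka',\n"
                        + "  'topic' = 'events',\n"
                        + "  'properties.bootstrap.servers' = 'localhost:9092',\n"
                        + "  'format' = 'json',\n"
                        + "  'json.timestamp-format.standard' = 'ISO-8601'\n"
                        + ")");
    }
}
```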
…d as TypeInformation

Introduces a WrapperTypeInfo that can replace most (if not all) TypeInformation classes
in the Blink planner. It is backed by logical types and uses internal serializers.

This closes #12852.
TypeInformation is a legacy class for the sole purpose of creating a
TypeSerializer. Instances of TypeInformation are not required in the
table ecosystem but sometimes enforced by interfaces of other modules
(such as org.apache.flink.api.dag.Transformation). Therefore, we
introduce InternalTypeInfo which acts as an adapter whenever type
information is required. Instances of InternalTypeInfo should only
be created when passing them to interfaces that require type information.
The class should not be used as a replacement for a LogicalType.
Information such as the arity of a row type, field types, field names, etc.
should be derived from the LogicalType directly.

This closes #12900.
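
A small usage sketch of the intended pattern (the import paths are assumed from current Flink and may differ from the code at the time of this commit): derive structural information from the LogicalType, and only wrap it when an interface insists on TypeInformation.

```java
import org.apache.flink.api.common.typeinfo.TypeInformation;
import org.apache.flink.table.data.RowData;
import org.apache.flink.table.runtime.typeutils.InternalTypeInfo;
import org.apache.flink.table.types.logical.BigIntType;
import org.apache.flink.table.types.logical.LogicalType;
import org.apache.flink.table.types.logical.RowType;
import org.apache.flink.table.types.logical.VarCharType;

public class InternalTypeInfoExample {
    public static void main(String[] args) {
        // The LogicalType is the source of truth: arity, field names, field types.
        RowType rowType =
                RowType.of(
                        new LogicalType[] {new BigIntType(), new VarCharType(VarCharType.MAX_LENGTH)},
                        new String[] {"id", "name"});

        // Only wrap it when an interface (e.g. a Transformation) demands TypeInformation.
        TypeInformation<RowData> typeInfo = InternalTypeInfo.of(rowType);

        // Derive structural information from the LogicalType, not from the TypeInformation.
        System.out.println(rowType.getFieldCount()); // 2
        System.out.println(typeInfo);
    }
}
```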
…'s IDENTITY config is not FULL

This commit adds documentation for this case and includes a guiding message in the exception.

This closes #13019
…essage is received

Just skip the tombstone messages

This closes #13019
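
The gist of the fix as a simplified sketch (not the actual Flink deserialization class; names are illustrative): a tombstone arrives with a null value and simply produces no output.

```java
import java.nio.charset.StandardCharsets;
import java.util.List;

/** Simplified sketch of skipping Kafka tombstone messages during deserialization. */
public class TombstoneSkippingDeserializer {

    /** A tombstone is a record whose value is null; it carries no change event to parse. */
    public void deserialize(byte[] message, List<String> out) {
        if (message == null || message.length == 0) {
            // Skip tombstone messages instead of failing with a parse error.
            return;
        }
        out.add(new String(message, StandardCharsets.UTF_8));
    }
}
```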
This commit upgrades the default version of Avro that flink-avro will use. It should be possible to downgrade the Avro version in a user job, as the binary format is compatible and we do not expose any Avro dependencies in the API.

Additionally, this commit fixes the handling of the logical types time-micros and timestamp-micros, as well as the interpretation of timestamp-millis, in the AvroRowDataDeserializationSchema.
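
For reference, the micros-based logical types carry a long count of microseconds; a minimal conversion sketch in plain Java, independent of the Flink classes involved:

```java
import java.time.Instant;
import java.time.LocalTime;

public class AvroMicrosConversion {
    public static void main(String[] args) {
        // time-micros: microseconds since midnight.
        long timeMicros = 45_296_123_456L; // 12:34:56.123456
        LocalTime time = LocalTime.ofNanoOfDay(timeMicros * 1_000L);

        // timestamp-micros: microseconds since the epoch.
        long timestampMicros = 1_600_000_000_123_456L;
        Instant microsInstant =
                Instant.ofEpochSecond(
                        Math.floorDiv(timestampMicros, 1_000_000L),
                        Math.floorMod(timestampMicros, 1_000_000L) * 1_000L);

        // timestamp-millis: milliseconds since the epoch.
        long timestampMillis = 1_600_000_000_123L;
        Instant millisInstant = Instant.ofEpochMilli(timestampMillis);

        System.out.println(time + " / " + microsInstant + " / " + millisInstant);
    }
}
```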
…c database and table for canal-json format

This closes #13294
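
A usage sketch, under the assumption that this behavior is exposed through format options named 'canal-json.database.include' and 'canal-json.table.include' (the option names, table schema, and connector options here are illustrative):

```java
import org.apache.flink.table.api.EnvironmentSettings;
import org.apache.flink.table.api.TableEnvironment;

public class CanalJsonIncludeExample {
    public static void main(String[] args) {
        TableEnvironment tEnv =
                TableEnvironment.create(EnvironmentSettings.newInstance().inStreamingMode().build());

        // Hypothetical table; the relevant part is the two include options that restrict
        // parsing to changelog entries of one database and one table.
        tEnv.executeSql(
                "CREATE TABLE orders (\n"
                        + "  order_id BIGINT,\n"
                        + "  amount DECIMAL(10, 2)\n"
                        + ") WITH (\n"
                        + "  'connector' = 'kafka',\n"
                        + "  'topic' = 'mysql-binlog',\n"
                        + "  'properties.bootstrap.servers' = 'localhost:9092',\n"
                        + "  'format' = 'canal-json',\n"
                        + "  'canal-json.database.include' = 'mydb',\n"
                        + "  'canal-json.table.include' = 'orders'\n"
                        + ")");
    }
}
```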
…nformation in sources and sinks

This is not a compatible change. But given that those interfaces are still relatively new and not many people have migrated to the new sources/sinks, we should make this change now rather than never and avoid @SuppressWarnings in almost all implementations.
…ormat deserialization

Never modify or prefix the field name; instead, we now use {rowName}_{fieldName} as the nested row type name, because an Avro schema does not allow row types with the same name but different schemas.
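
A tiny sketch of the naming scheme (hypothetical helper, not the actual converter code): the nested record name is derived from the enclosing row name plus the field name, so two nested rows with different schemas never collide on the same Avro record name.

```java
/** Sketch of deriving unique Avro record names for nested row types. */
public class NestedRowNaming {

    /** e.g. rowName = "order", fieldName = "address" -> "order_address". */
    static String nestedRowTypeName(String rowName, String fieldName) {
        return rowName + "_" + fieldName;
    }

    public static void main(String[] args) {
        // Two different nested rows under the same parent get distinct names, which Avro
        // requires because identical record names must have identical schemas.
        System.out.println(nestedRowTypeName("order", "shipping_address"));
        System.out.println(nestedRowTypeName("order", "billing_address"));
    }
}
```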
… deserialization

* Fix the TIME schema precision to 3
* Fix the nullability of the types TIMESTAMP_WITHOUT_TIME_ZONE, DATE, TIME_WITHOUT_TIME_ZONE,
  DECIMAL, MAP, ARRAY
* The table schema row type should always be non-nullable
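
For context, nullability in Avro is expressed as a union with "null"; a minimal sketch using Avro's SchemaBuilder (illustrative, not the Flink converter code):

```java
import org.apache.avro.Schema;
import org.apache.avro.SchemaBuilder;

public class NullableAvroSchemaExample {
    public static void main(String[] args) {
        // A nullable field becomes a union of "null" and the actual type.
        Schema nullableLong = SchemaBuilder.unionOf().nullType().and().longType().endUnion();

        // A non-nullable field is just the plain type; the top-level record
        // (the table schema row type) stays non-nullable as well.
        Schema record =
                SchemaBuilder.record("row")
                        .fields()
                        .requiredString("id")
                        .name("optional_count").type(nullableLong).withDefault(null)
                        .endRecord();

        System.out.println(record.toString(true));
    }
}
```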
AHeise and others added 28 commits October 9, 2024 11:54
Make AssertJ print full stack traces when encountering unexpected exceptions.
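
One way to do this with AssertJ, assuming the commit relies on the setMaxStackTraceElementsDisplayed setting (treat this as a sketch; the actual wiring may differ):

```java
import org.assertj.core.api.Assertions;

/** Sketch: configure AssertJ once, e.g. in a test base class or JUnit extension. */
public class AssertJStackTraceConfig {
    static {
        // By default AssertJ truncates stack traces of unexpected exceptions in failure
        // messages; raise the limit so the full trace is shown.
        Assertions.setMaxStackTraceElementsDisplayed(Integer.MAX_VALUE);
    }
}
```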
A job may be in various states during shutdown, including race conditions. Ideally, the framework would provide idempotence, but we can work around that by ignoring specific exceptions.
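
A sketch of the workaround idea (a hypothetical helper, not the actual test utility): treat a small allow-list of shutdown-related exceptions as success.

```java
import java.util.concurrent.TimeoutException;

/** Hypothetical helper: run a shutdown-time action and ignore expected race-condition failures. */
public final class IgnoreExpectedExceptions {

    @FunctionalInterface
    interface ThrowingRunnable {
        void run() throws Exception;
    }

    static void runIgnoring(ThrowingRunnable action, Class<?>... ignored) throws Exception {
        try {
            action.run();
        } catch (Exception e) {
            for (Class<?> clazz : ignored) {
                if (clazz.isInstance(e)) {
                    return; // the job may already be gone; that is fine for this test
                }
            }
            throw e;
        }
    }

    public static void main(String[] args) throws Exception {
        // Example: cancelling a job that may already have finished.
        runIgnoring(() -> { throw new IllegalStateException("Job not found"); },
                IllegalStateException.class, TimeoutException.class);
    }
}
```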
For non-transactional producers, a notifyCheckpointCompleted after finishOperator sets the transaction inside the 2PCSinkFunction to null, so that on close the producer is leaked. Since the transactional producer stores its transactions in pendingTransactions before that, we only need to fix the cases where we don't preCommit/commit. The easiest solution is to actually close the producer on finishOperator, since no new record can arrive.
Add leak check in all relevant tests.
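
A sketch of what such a leak check can look like, assuming the producer's IO thread name starts with "kafka-producer-network-thread" (the test wiring is illustrative):

```java
import java.util.List;
import java.util.stream.Collectors;

/** Sketch of a producer-leak check to run after each test. */
public class KafkaProducerLeakCheck {

    /** Returns the names of live Kafka producer network threads. */
    static List<String> findLeakedProducerThreads() {
        return Thread.getAllStackTraces().keySet().stream()
                .filter(Thread::isAlive)
                .map(Thread::getName)
                // Assumption: the producer's IO thread is named "kafka-producer-network-thread | <client.id>".
                .filter(name -> name.startsWith("kafka-producer-network-thread"))
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> leaks = findLeakedProducerThreads();
        if (!leaks.isEmpty()) {
            throw new AssertionError("Leaked Kafka producers: " + leaks);
        }
    }
}
```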
The test tried to assert on byte counts, which are written asynchronously. This commit adds flushing and establishes a baseline so that metadata requests don't interfere with the assertions.
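
The flush-then-baseline pattern, sketched with entirely hypothetical writer and metric accessors (none of these names come from the connector):

```java
/** Sketch of the flush-then-baseline pattern for asserting on async byte counters. */
public class ByteCountAssertionSketch {

    /** Hypothetical stand-ins for the component under test and its metric. */
    interface Writer {
        void write(byte[] record);
        void flush();        // force pending async sends out before reading the counter
        long numBytesOut();  // hypothetical metric accessor
    }

    static void assertBytesWritten(Writer writer, byte[] record) {
        // Flush and record a baseline first, so earlier traffic (e.g. metadata requests)
        // does not leak into the assertion.
        writer.flush();
        long baseline = writer.numBytesOut();

        writer.write(record);
        writer.flush();

        long delta = writer.numBytesOut() - baseline;
        if (delta < record.length) {
            throw new AssertionError("Expected at least " + record.length + " bytes, got " + delta);
        }
    }
}
```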
Also
- adds the migration support tests up to 1.20.
- bumps Kafka-client to 3.6.2
Add more test coverage for unchained cases and separate the behavioral components from data capture and assertions. Also reduces the need to convey information with fields.
To test recovery in future PRs, it's important to decompose the #createWriter methods into common cases and advanced cases that may require some additional setup.
Since Java 20, Thread.stop no longer works (it throws UnsupportedOperationException), so we just remember old leaks to avoid failing subsequent tests.
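
A sketch of the remember-old-leaks idea in plain Java (leaked threads stay alive because they can no longer be stopped, so they must be excluded from later checks):

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.stream.Collectors;

/** Sketch: fail only on threads leaked by the current test, not by earlier ones. */
public class KnownLeakTracker {

    private static final Set<Thread> KNOWN_LEAKS = ConcurrentHashMap.newKeySet();

    static Set<Thread> findNewLeaks(String threadNamePrefix) {
        Set<Thread> leaks = Thread.getAllStackTraces().keySet().stream()
                .filter(t -> t.getName().startsWith(threadNamePrefix))
                .filter(t -> !KNOWN_LEAKS.contains(t))
                .collect(Collectors.toSet());
        // Remember them: they cannot be killed and must not fail subsequent tests.
        KNOWN_LEAKS.addAll(leaks);
        return leaks;
    }

    public static void main(String[] args) {
        Set<Thread> newLeaks = findNewLeaks("kafka-producer-network-thread");
        if (!newLeaks.isEmpty()) {
            throw new AssertionError("Test leaked producer threads: " + newLeaks);
        }
    }
}
```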
Move FlinkKafkaInternalProducer and TransactionalIdFactory to internal. All other classes are potentially leaked through the generics and signatures of the KafkaSink(Builder).
Split the easy case of non-transactional writer from the transactional writer to simplify reasoning about the state (e.g. which fields are used when).
Backchannel provides a way for the committer to communicate to the writer even in (simple) non-chained cases thanks to colocation constraints. It's the same trick that is employed in statefun. A backchannel is stateless, however, because its state can be entirely derived from committer state. Thus, it's much easier to handle than a statefun backchannel.

Backchannel will be used to communicate the committed transactions to the writer in future commits.
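
A minimal sketch of such a backchannel (illustrative only; the class and method names are made up, not the connector's actual API): because colocation puts committer and writer in the same JVM, a shared in-memory queue keyed by an identifier is enough.

```java
import java.util.Map;
import java.util.Queue;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentLinkedQueue;

/** Illustrative in-memory backchannel from committer to writer (names are made up). */
public final class BackchannelSketch {

    // One queue per (transactionalIdPrefix, subtaskId); both sides resolve the same instance
    // because colocation guarantees they live in the same JVM.
    private static final Map<String, Queue<String>> CHANNELS = new ConcurrentHashMap<>();

    private final Queue<String> queue;

    private BackchannelSketch(String key) {
        this.queue = CHANNELS.computeIfAbsent(key, k -> new ConcurrentLinkedQueue<>());
    }

    static BackchannelSketch forKey(String transactionalIdPrefix, int subtaskId) {
        return new BackchannelSketch(transactionalIdPrefix + "-" + subtaskId);
    }

    /** Committer side: announce a committed transactional id. */
    void signalCommitted(String transactionalId) {
        queue.add(transactionalId);
    }

    /** Writer side: drain committed ids, e.g. to recycle their producers. */
    String pollCommitted() {
        return queue.poll();
    }

    public static void main(String[] args) {
        BackchannelSketch committerSide = BackchannelSketch.forKey("sink-prefix", 0);
        BackchannelSketch writerSide = BackchannelSketch.forKey("sink-prefix", 0);

        committerSide.signalCommitted("sink-prefix-0-42");
        System.out.println(writerSide.pollCommitted()); // sink-prefix-0-42
    }
}
```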
Add a first-class producer pool that self-manages all resources and allows recycling producers by transactional id.
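
A simplified sketch of the pooling idea (hypothetical types; the real pool manages FlinkKafkaInternalProducer instances and their lifecycle): producers are handed out per transactional id and returned for reuse once their transaction completes.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

/** Illustrative producer pool that recycles producers by transactional id. */
public class ProducerPoolSketch implements AutoCloseable {

    /** Hypothetical stand-in for the Kafka producer wrapper. */
    static class PooledProducer {
        String transactionalId;
        void initTransactions(String transactionalId) { this.transactionalId = transactionalId; }
        void close() { /* release network resources */ }
    }

    private final Deque<PooledProducer> idle = new ArrayDeque<>();
    private final List<PooledProducer> all = new ArrayList<>();

    /** Reuse an idle producer if possible, otherwise create a new one. */
    PooledProducer getForTransaction(String transactionalId) {
        PooledProducer producer = idle.poll();
        if (producer == null) {
            producer = new PooledProducer();
            all.add(producer);
        }
        producer.initTransactions(transactionalId);
        return producer;
    }

    /** Return a producer whose transaction finished so it can be handed out again. */
    void recycle(PooledProducer producer) {
        idle.push(producer);
    }

    /** The pool owns all producers and closes them, preventing leaks. */
    @Override
    public void close() {
        all.forEach(PooledProducer::close);
        idle.clear();
        all.clear();
    }
}
```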