-
Notifications
You must be signed in to change notification settings - Fork 2.9k
NIFI-15077 : Added conflict resolution strategies for file move operations in FetchFileTransfer #10405
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
…ons in FetchFileTransfer - Introduced MOVE_CONFLICT_RESOLUTION property to handle filename collisions during file move - Implemented logic for conflict resolution strategies (REPLACE, RENAME, IGNORE, etc.) - Refactored filename normalization and path handling utilities - Updated FetchFTP and FetchSFTP processors for enhanced conflict resolution support - Added corresponding unit tests and utility methods for unique filename generation in conflicts
exceptionfactory
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for proposing this improvement @ravinarayansingh. The concept makes sense, and the basic approach looks good. I noted some recommendations on a few implementation and logging details.
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Show resolved
Hide resolved
...fer/src/main/java/org/apache/nifi/processor/util/file/transfer/FileTransferConflictUtil.java
Outdated
Show resolved
Hide resolved
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Outdated
Show resolved
Hide resolved
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Outdated
Show resolved
Hide resolved
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Outdated
Show resolved
Hide resolved
…/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java Co-authored-by: David Handermann <[email protected]>
…lictUtil - Replaced incrementing prefix-based approach with UUID prefix for unique filename generation - Simplified conflict resolution logic in FetchFileTransfer during file move operations - Improved logging consistency and updated descriptions for MOVE_DESTINATION_DIR property
Thanks for review, @exceptionfactory. |
- Adjusted indentation for property description for better readability
- Adjusted indentation for property description for better readability
exceptionfactory
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for updates @ravinarayansingh, there was one previous comment remaining on description formatting, and I recommended some adjustments to the logging messages, then this should be ready to go.
...fer/src/main/java/org/apache/nifi/processor/util/file/transfer/FileTransferConflictUtil.java
Outdated
Show resolved
Hide resolved
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Show resolved
Hide resolved
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Outdated
Show resolved
Hide resolved
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Outdated
Show resolved
Hide resolved
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Outdated
Show resolved
Hide resolved
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Outdated
Show resolved
Hide resolved
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Outdated
Show resolved
Hide resolved
…/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java Co-authored-by: David Handermann <[email protected]>
…/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java Co-authored-by: David Handermann <[email protected]>
…/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java Co-authored-by: David Handermann <[email protected]>
…/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java Co-authored-by: David Handermann <[email protected]>
exceptionfactory
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the incremental adjustments @ravinarayansingh, it looks like just accepting the proposed changes resulted in compilation issues, and a few comments are still unaddressed. If you can work through the remaining items and update the pull request, that would be helpful.
- Moved unique filename generation method from FileTransferConflictUtil to FetchFileTransfer - Simplified conflict resolution logic by removing unused utility class
exceptionfactory
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the updates @ravinarayansingh. This looks better, but handling IGNORE, REJECT, and FAIL with the same case does not seem correct. The IGNORE and NONE options should not log a warning as previously mentioned, and the other two options require routing to a different location.
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Outdated
Show resolved
Hide resolved
Thanks for the feedback, @exceptionfactory. For the other two options — REJECT and FAIL — should these also log a warning, or should they additionally route the FlowFile to a different relationship (for example, failure or reject)? |
Yes, for IGNORE and NONE, and INFO log looks good.
Those options should log a warning, and route to the appropriate relationship, based on the description for each value. |
…TE and adjusted pre-commit handling for MOVE - Centralized MOVE and DELETE completion handling with pre-commit routing for MOVE conflicts and post-commit actions for DELETE. - Introduced detailed failure reasons for enhanced debugging. - Added REL_REJECT and REL_FAILURE relationships for comprehensive failure management. - Updated unit tests to reflect the new process flow and relationships.
|
Hi @exceptionfactory |
exceptionfactory
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the updates @ravinarayansingh.
It looks like the changes resulted in some unit test failures:
TestFTP.testFetchFtpUnicodeFileName:350 expected: <1> but was: <0>
TestFTP.testFetchFtp:257 » IndexOutOfBounds Index 0 out of bounds for length 0
I should have considered this in my previous reply, but the changes surface the fact that the FetchFTP and FetchSFTP Processors do not have failure and reject relationships. They have more specific types of failure relationships, (not found, permission denied, and comms failure), but those do not immediately align with the other failure conditions.
Introducing new relationships creates potential migration problems if not handled programmatically, such as auto-terminating or migrating. That could be done, but on balance, adding new relationships seems to introduce additional complexity for a conditional scenario, which is not ideal.
Taking another look at the options, I think the reject and fail completion strategies should be removed and not supported in this Processor. Instead, limiting support to ignore or rename would avoid introducing new relationships, and still provide more flexibility.
How does that sound?
Hi @exceptionfactory |
- Removed redundant conflict resolution strategies (REJECT, FAIL) and relationships (REL_REJECT, REL_FAILURE). - Refactored MOVE conflict handling to streamline conditions and improve clarity. - Introduced `getSimpleFilename` utility for filename parsing. - Updated logging to prioritize successful processing while handling edge cases gracefully.
|
I’ve updated the code with the following changes to simplify the FetchFileTransfer completion strategy logic:
please have look |
exceptionfactory
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the redirected approach @ravinarayansingh. The general scope now looks good in terms of output relationships. The new methods, however, presents a number of implementation concerns and need some refactoring to avoid deep levels of nesting and numerous return statements.
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Outdated
Show resolved
Hide resolved
...e-transfer/src/main/java/org/apache/nifi/processor/util/file/transfer/FetchFileTransfer.java
Outdated
Show resolved
Hide resolved
- Centralized MOVE pre-commit and DELETE post-commit handling. - Modularized MOVE conflict resolution with `resolveMoveConflict` and `handleMovePreCommit`. - Enhanced logging and failure routing for improved debugging.
exceptionfactory
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the updates @ravinarayansingh.
Unfortunately this change appears to be running into to some fundamental limitations of the current design of FetchFileTransfer. Pushing down transfer relationship handling to separate methods, introducing short-circuit returns, and passing around large numbers of method arguments highlight some of the challenges.
It is noteworthy that the Completion Strategy property description indicates that if the strategy fails, a warning will be logged. With that in mind, it seems like the change should much more localized, avoiding FlowFile transfer and other operations in short-circuit methods.
- Moved DELETE and MOVE strategies to pre-commit for centralized handling. - Introduced detailed failure reason constants for all failure scenarios. - Updated flow file routing to use REL_PERMISSION_DENIED and REL_COMMS_FAILURE based on a failure type. - Enhanced unit tests to validate updated behavior and new failure conditions.
Thanks for the feedback @exceptionfactory I’ve refactored the implementation to make the handling more localized and aligned with the intended design of
This refactoring keeps the behavior consistent with the Completion Strategy description while minimizing side effects and improving maintainability. |
Summary
NIFI-15077
Tracking
Please complete the following tracking steps prior to pull request creation.
Issue Tracking
Pull Request Tracking
NIFI-00000NIFI-00000Pull Request Formatting
mainbranchVerification
Please indicate the verification steps performed prior to pull request creation.
Build
./mvnw clean install -P contrib-checkLicensing
LICENSEandNOTICEfilesDocumentation