-
Notifications
You must be signed in to change notification settings - Fork 635
Revise GEP-1713: Update gateway status and conformance test details for ListenerSets #4205
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Revise GEP-1713: Update gateway status and conformance test details for ListenerSets #4205
Conversation
|
@davidjumani thanks, this is a great write of conformance. Do you think it makes sense to also add the expected conditions on each conformance item, and also the ancestorStatus expected, to make it clear to implementations what we will look for? |
Added the expected status as well as added the |
geps/gep-1713/index.md
Outdated
| - The ListenerSet has the following status : | ||
| | Type | Status | Reason | | ||
| | -------- | ------- | ------- | | ||
| | Accepted | True | ListenersNotValid | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should this accepted here be false?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Based on the GEP
// This can be the reason when "Accepted" is "True" or "False", depending on whether
// the listener being invalid causes the entire Gateway to not be accepted.
I believe this should be true since the ListenerSet is configured but does not adversely affect the parent Gateway.
Additionally, there can be a scenario where only a subset of the listeners is invalid and this would more accurately reflect it
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
but then the reason should be empty or something else, right? It was accepted but not programmed, I guess
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1. If the status is Accepted, there should not be a reason for that.
I wouldn't also mix Accepted: True and Programmed: False in this case. The Programmed condition indicates whether the controller was physically able to apply the requested resources (e.g. allocate an IP or open a port). Listener conflicts are configuration errors, not the controller ability to program them.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think so. A subset of listeners can be accepted, so the status should reflect it accordingly.
If not all listeners are accepted, then the listenerSet Accepted condition should be false.
Additionally, the Programmed condition should also reflect the same, since a subset of listeners was applied
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think there's some confusion here, let me know if the below makes sense
- If I have one listener conflicted:
- This listener should be accepted (means a controller SAW it), no Reason set
- This listener should have a status Conflicted=True, Reason explaining if it is a Protocol conflict, etc (and a good message)
- Programmed will be False, because the proxy won't receive this configuration
As the status conditions is per listener, I think this would make sense, wdyt?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(specifically on this case, we can think about extending the valid Reasons to something like ConflictWithParent to let the user know there's already a conflicting listener on the Gateway, without leaking any kind of information)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the language in this part is a little confusing:
// This can be the reason when "Accepted" is "True" or "False", depending on whether
// the listener being invalid causes the entire Gateway to not be accepted.
I think it should be something like:
// This can be used as a reason when "Accepted" is "True" or "False", because Accepted
// requires that at least one Listener present on the object is valid. If zero Listeners in the
// ListenerSet are valid, then Accepted is False, and this reason is used to explain why.
// If at least one Listener in the ListenerSet is valid, then Accepted is True, and this reason
// is used to indicate that the ListenerSet is only _partially_ valid.
Acceptance is about being locally correct, where Programmed is about full correctness. Conflicted is then used to indicate that at least one Listener in the ListenerSet is in conflict with a Listener on another object (no matter which one it is).
I don't think that saying that the conflicted Listener is on the Gateway gives too much information away, since the ListenerSet already includes a reference to that parent anyway. That said, adding a ConflictWithParent reason sounds good to me. It's okay to have more options for Reasons, they don't have the overhead that adding extra whole Conditions does.
geps/gep-1713/index.md
Outdated
| - The parent gateway has the following status : | ||
| | Type | Status | Reason | | ||
| | -------- | ------- | ------- | | ||
| | AttachedListenerSets | False | ListenerSetsNotAllowed | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do we really need this condition on the parent gateway? I am wondering if in Gateway case we can just not have any count of attached listener, as mostly the user will have its own condition
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the condition should be set since :
- The user has explicitly disallowed ListenerSets
- ListenerSets have been attached (at least tried) to the Gateway
The user should know about this in the status
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am not aware of the whole conditions reasoning here tbh, will defer to @youngnick for some better view on it
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am still not fully convinced we should have the condition if we would have a counter as well, but I also do not have a very strong opinion on it
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In this case, I think it's useful for the reason that @davidjumani said - the Gateway owner has said "no ListenerSets allowed", and some ListenerSets have attempted to attach. Regardless of exactly how that situation has come up, the Gateway owner needs to know. attachedListeners doesn't tell us that, because it's a count of ListenerSets that have successfully attached.
geps/gep-1713/index.md
Outdated
| - The newer ListenerSet has the following status : | ||
| | Type | Status | Reason | | ||
| | -------- | ------- | ------- | | ||
| | Accepted | True | ListenersNotValid | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
again, should we have an accepted here? And a reason? Not sure if this doesn't get more confusing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is a better indication that the listenerSet is correct (the parentref is valid, etc.) but one or more of the listeners is invalid which is indicated by the ListnerStatus above
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
but one or more of the listeners is invalid which is indicated by the ListnerStatus above
If all listeners in the ListenerSet have conflicts, shouldn't we report the ListenerSet as not accepted? This is consistent with other resources (e.g. if an HTTPRoute has no hostname intersection with a Gateway listener, it's reported a not accepted).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, I agree and will update it accordingly. I'll make this clearer with the total number of conflicting listeners
|
@davidjumani overall lgtm! I would like us to keep separated a PR that changes the GEP from a PR that changes the API, this way if we need to track any (or revert) we have proper changes Thanks again for the effort here! |
| | -------- | ------- | ------- | | ||
| | Accepted | False | NotAllowed | | ||
| - The parent gateway has the following status : |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just a headsup, I got a bit confused here to understand that AttachedListenerSets is a condition and AttachedListeners is a .status field, maybe worth just clarifying it
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, I think these might need to be separate tables or something, or maybe something like:
| Type | Status | Reason |
| -------- | ------- | ------- |
| conditions.AttachedListenerSets | Unknown | // any reason // |
| AttachedListeners | 0 | |
geps/gep-1713/index.md
Outdated
| | AttachedListenerSets | Unknown | // any reason // | | ||
| | AttachedListeners | 0 | | | ||
| - The request on the ListenerSet port fails. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
On the port? Or on the tuple? If you have a Listener defined on the same port, using a different host (eg.: some HTTP listener) this shouldn't fail.
I think you can make a call out for now that all the tests here are executed with at least one listener defined on gateway (because we enforce this...) and then have a separate set of tests for when we remove the need of Listeners on Gateway, wdyt?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should probably be "the request to that distinct Listener fails" (that's what the word "distinct" is useful for when talking about Listeners.)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
And yes, we will definitely need a bunch more tests if and when required Listeners is 0 on a Gateway, both when ListenerSets are attached and when they are not.
geps/gep-1713/index.md
Outdated
| 1. A listener on a ListenerSet has a protocol conflict with a listener on another ListenerSet | ||
| - The conflicting listener on the older ListenerSet is accepted based on the [Listener Precedence](#listener-precedence) with the following status : | ||
| | Type | Status | Reason | | ||
| | -------- | ------- | ------- | | ||
| | Conflicted | False | NoConflicts | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hmm, this means that the winning ListenerSet owner will have way to know that there is another ListenerSet that's in conflict with this one.
It feels a little better to me to have this ListenerSet have a Conflicted: True, with a reason of something like ConflictWinner (or some other better name), to indicate that there was a conflict, but it won. Then the losing ListnenerSet would get "ConflictLoser" or something (wow, that's a terrible Reason, but you get what I mean).
That way "Conflicted" means "there is a conflict with something", and there is a Reason to indicate what the result of the conflict was.
I suppose, though, that Accepted is probably also carrying the same information, so maybe those Reasons are redundant. Interested in other's thoughts here.
|
This is really great @davidjumani, I think that we should try and do this exercise more often for conformance tests for other, this will make it easier to ensure we don't miss things in testing. |
geps/gep-1713/index.md
Outdated
| ``` | ||
| 1. A listener on a ListenerSet without a defined port | ||
| - If the implementation supports dynamic port assignment : |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do we want to mark this feature as well on supported features? like a Gateway SHOULD report if it supports dynamic port assignment using the feature name "ListenerSetDynamic" (be more creative than I am with names!)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is there an existing feature name for dynamic port assignment on Gateway? If so we should use that - since ListenerSets are just an extension of the gateway
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is currently no dynamic port assignment on Gateway, port is a required field there. We are trying out dynamic port assignment on ListenerSet.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added the ListenerSetDynamicPortAssignment supported feature flag
| type: NotAllowedByListeners | ||
| ``` | ||
| 1. A listener on a ListenerSet with a missing ReferenceGrant |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
IIRC ReferenceGrant is also a specific supported feature (
| SupportReferenceGrant FeatureName = "ReferenceGrant" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Agreed. Will add this as a requirement on the specific test
| allowedListeners: 1 | ||
| ``` | ||
| - The request to the listener on the ListenerSet succeeds. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
not sure if this is me being too cautious, but I would like for every test if possible that we also make a request and guarantee that it is working as desired.
eg.: on a conflicting ListenerSet we need to be sure the request goes to the right route attached to the right LS on the right namespace.
Also, optionally, do we want a test on a ListenerSet that has 2 different listeners (eg.: http and https) just to be sure that everything is working fine on some more complex scenarios?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, we can do that, and I intend to do so.
I removed it from the doc for now, as I see that not all Gateway conformance tests send out requests but only check the status and I wasn't sure if we wanted to just do it that way.
Co-authored-by: Ricardo Pchevuzinske Katz <[email protected]>
|
/cc |
|
Just did a pair review and update with David, I think this is good to go. /lgtm Will wait @youngnick for a second pass, but we can work on conformance based on this already Thanks! |
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: davidjumani, rikatz The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
| - type: Programmed | ||
| status: True | ||
| reason: Programmed | ||
| - type: Conflicted |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is it useful in this case to record that the Listener is actually Conflicted? (It is, it's just the winner in the conflict stakes).
That would allow Gateway owners to tell that, for example, ListenerSet owners are attempting to configure a Listener that conflicts, and that removing this Listener will allow the ListenerSet Listener to win instead.
This would have to apply to all conflicts between Listeners, including between sibling ListenerSets to work though.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I feel like it probably is worth marking the Listener as conflicted.
More broadly, I think that it's a good idea to start thinking of a Listener as a sort of degenerate LIstenerSet in basically all cases.
| - The listener must have the following status : | ||
| ``` | ||
| name: listener-without-port | ||
| port: <unique dynamically assigned port value> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Doesn't need to be resolved now, but we will need to come back to what constitutes a "unique" port. If the Protocol is HTTPS, does this port need to be unique if the hostname is distinct? (That would substantially complicate implementing this, and may be a bad idea. But if we are ruling it out, we need to be explicit about that).
|
The list of conformance tests looks great to me, as long as we are can answer the two questions I added there. |
What type of PR is this?
Add one of the following kinds:
/kind documentation
/kind gep
Optionally add one or more of the following kinds if applicable:
/area conformance-test
What this PR does / why we need it:
This PR does the following :
AttachedListenersthat is the count of successful ListenerSet attachments to the gateway. Ref: slack theraadThe aim is to get consensus for the set of conformance tests required to validate this feature against implementations, and move towards promoting this to Standard
Does this PR introduce a user-facing change?: