-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Add support to fetch and match worker logs #183
Conversation
ed174a7
to
aee1f33
Compare
@@ -252,6 +264,46 @@ def set_stopped_status(self): | |||
LOG.exception(f"Failed to update worker status: {error}") | |||
raise | |||
|
|||
def get_worker_logs(self) -> Optional[WorkerLogConfig]: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
To make this a little more clear, should we change this method to _get_worker_log_config
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure, we can do that.
I was thinking if other tests may want to get the WorkerLogConfig. Both works.
# Default, no log group yet. | ||
return None | ||
|
||
def get_all_worker_logs(self, *, logs_client: botocore.client.BaseClient) -> WorkerLog: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Perhaps just get_logs
? Would be more consistent with our Job implementation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure, easy
6b44091
to
2e47d5b
Compare
@@ -252,6 +264,46 @@ def set_stopped_status(self): | |||
LOG.exception(f"Failed to update worker status: {error}") | |||
raise | |||
|
|||
def _get_worker_logs(self) -> Optional[WorkerLogConfig]: | |||
"""Get the log group and log stream for the worker. Retain the API structure""" | |||
response = self.deadline_client.get_worker( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: type
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Q: do we have a data class for GetWorker response?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We don't have typing in this library for boto classes :(
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The data class comes from the design and structure of this library. There's no use of boto3 generated types
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could we type it as Dict[str, Any]
?
fleetId=self.configuration.fleet.id, | ||
workerId=self.worker_id, | ||
) | ||
if log_config := response["log"]: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If log
isn't a key in response
, wouldn't this line throw an exception?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good call, I'll change it to .get()
|
||
def get_logs(self, *, logs_client: botocore.client.BaseClient) -> WorkerLog: | ||
# Get the worker log group and stream from the service. | ||
log_config = self._get_worker_logs() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: type
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure
logStreamNames=[log_config.cloudwatch_log_stream], | ||
), | ||
) | ||
log_events = filter_log_events_pages.build_full_result() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: type
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
there's no typing for boto in this package.
) | ||
log_events = filter_log_events_pages.build_full_result() | ||
log_events = [CloudWatchLogEvent.from_api_response(e) for e in log_events["events"]] | ||
# For debugging test cases. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could we remove this comment? Doesn't seem to be adding any value.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We do, otherwise everyone wants to remove LOG.info :)
@@ -252,6 +264,46 @@ def set_stopped_status(self): | |||
LOG.exception(f"Failed to update worker status: {error}") | |||
raise | |||
|
|||
def _get_worker_logs(self) -> Optional[WorkerLogConfig]: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit - what about get_worker_log_config
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Arrrg, I had it as the a non private name before but went for _
so no :) Not changing it
if TYPE_CHECKING: | ||
from botocore.paginate import PageIterator, Paginator |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
curious - why only import those during type checking?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This was done in the rest of the library too, there's some differences in versions of python I think.
workerId=self.worker_id, | ||
) | ||
if log_config := response["log"]: | ||
LOG.info(f"LogGroup structure {log_config}") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit - you meant LogConfig?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure
return WorkerLogConfig( | ||
cloudwatch_log_group=log_group_name, cloudwatch_log_stream=log_stream_name | ||
) | ||
# Default, no log group yet. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit - you meant log config? log group should always exist as it's at fleet level.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
sure
if not log_config: | ||
return WorkerLog(worker_id=self.worker_id, logs=[]) # type: ignore[arg-type] | ||
|
||
filter_log_events_paginator: Paginator = logs_client.get_paginator("filter_log_events") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please make sure caller has the logs:FilterLogEvents
permission to perform this operation. Local test and automated tests use different infrastructure setup.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is the same in the rest of the library and we don't check
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This introduce FilterLogEvents call to fleet logs while the prior code filter on session logs no? The permission for CWL is configured at log group level, that's why I suggested checking.
Took a quick look seems we have have this permission on all deadline log groups log-group:/aws/deadline/*
, should be good then.
Signed-off-by: David Leong <[email protected]>
|
What was the problem/requirement? (What/Why)
What was the solution? (How)
assert_pattern_in_log
andassert_pattern_not_in_log
.What is the impact of this change?
How was this change tested?
hatch build
hatch run fmt
hatch run all:lint
Was this change documented?
Not Applicable.
Is this a breaking change?
Not Applicable.
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.