-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Description
Token Service Discovery Service Accident
Our wechaty puppet service discovery service has been experiencing out-of-service issues from 3 pm Feb 7.
- 10 am Feb 7: notice the disk usage of some instances are abnormal, then clear logs file and make instance keep running right, at the same time the api.chatie.io work well
- 3 pm Feb 7: this problem outbreak in the afternoon then we working on it, and found that the http response status code 503 of api.chatie.io
- 2 am Feb 8: @huan show some detail info from heroku, see: 🔥🔥🔥 api.chatie.io服务异常,HTTP错误码503 #97 (comment)
- 8 am Feb 8: confirm api.chatie.io out-of-service due receive too many requests (init token on api.chatie.io) in few seconds
- 9 am Feb 8: find the bug in
wechaty-puppet-workpro
, one NodeJS Timer function init token on api.chatie.io has not been clear right, and we notice that the only way which could fix this bug temporarily is to restart all containers - 10 am Feb 8: confirm the operation time to restart all containers
- 2 pm Feb 8: restart all containers
- 2:30 pm Feb8`: the server fully restored
- 6 pm Feb 8: create the hotfix PR to fix this problem
- 9 pm Feb 8: PR has been merged, and ready to deploy
- 0 pm Feb 9: start deploy for some instances
Metadata
Metadata
Assignees
Labels
No labels