You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I am trying to deploy a custom neural network on SageMaker. However, when I run the deployment code (which I got from the AWS examples), I get the following error:
An error occurred (ModelError) when calling the InvokeEndpoint operation: Received server error (500) from primary and could not load the entire response body. See https://eu-west-2.console.aws.amazon.com/cloudwatch/home?region=eu-west-2#logEventViewer:group=/aws/sagemaker/Endpoints/pytorch-inference-2024-12-30-09-09-34-368 for more information.
This is also the log - there are no logs tagged as errors.
timestamp,message,logStreamName
1735549918460,"2024-12-30 09:11:57,620 [INFO ] main com.amazonaws.ml.mms.ModelServer - ",AllTraffic/i-08ffdaaab186465db
1735549918460,MMS Home: /opt/conda/lib/python3.6/site-packages,AllTraffic/i-08ffdaaab186465db
1735549918460,Current directory: /,AllTraffic/i-08ffdaaab186465db
1735549918460,Temp directory: /home/model-server/tmp,AllTraffic/i-08ffdaaab186465db
1735549918460,Number of GPUs: 0,AllTraffic/i-08ffdaaab186465db
1735549918460,Number of CPUs: 1,AllTraffic/i-08ffdaaab186465db
1735549918460,Max heap size: 3197 M,AllTraffic/i-08ffdaaab186465db
1735549918460,Python executable: /opt/conda/bin/python3.6,AllTraffic/i-08ffdaaab186465db
1735549918460,Config file: /etc/sagemaker-mms.properties,AllTraffic/i-08ffdaaab186465db
1735549918460,Inference address: http://0.0.0.0:8080,AllTraffic/i-08ffdaaab186465db
1735549918460,Management address: http://0.0.0.0:8080,AllTraffic/i-08ffdaaab186465db
1735549918460,Model Store: /.sagemaker/mms/models,AllTraffic/i-08ffdaaab186465db
1735549918460,Initial Models: ALL,AllTraffic/i-08ffdaaab186465db
1735549918460,Log dir: /logs,AllTraffic/i-08ffdaaab186465db
1735549918460,Metrics dir: /logs,AllTraffic/i-08ffdaaab186465db
1735549918460,Netty threads: 0,AllTraffic/i-08ffdaaab186465db
1735549918460,Netty client threads: 0,AllTraffic/i-08ffdaaab186465db
1735549918460,Default workers per model: 1,AllTraffic/i-08ffdaaab186465db
1735549918460,Blacklist Regex: N/A,AllTraffic/i-08ffdaaab186465db
1735549918460,Maximum Response Size: 6553500,AllTraffic/i-08ffdaaab186465db
1735549918460,Maximum Request Size: 6553500,AllTraffic/i-08ffdaaab186465db
1735549918460,Preload model: false,AllTraffic/i-08ffdaaab186465db
1735549918460,Prefer direct buffer: false,AllTraffic/i-08ffdaaab186465db
1735549918460,"2024-12-30 09:11:57,694 [WARN ] W-9000-model com.amazonaws.ml.mms.wlm.WorkerLifeCycle - attachIOStreams() threadName=W-9000-model",AllTraffic/i-08ffdaaab186465db
1735549918460,"2024-12-30 09:11:57,787 [INFO ] W-9000-model-stdout com.amazonaws.ml.mms.wlm.WorkerLifeCycle - model_service_worker started with args: --sock-type unix --sock-name /home/model-server/tmp/.mms.sock.9000 --handler sagemaker_pytorch_serving_container.handler_service --model-path /.sagemaker/mms/models/model --model-name model --preload-model false --tmp-dir /home/model-server/tmp",AllTraffic/i-08ffdaaab186465db
1735549918460,"2024-12-30 09:11:57,788 [INFO ] W-9000-model-stdout com.amazonaws.ml.mms.wlm.WorkerLifeCycle - Listening on port: /home/model-server/tmp/.mms.sock.9000",AllTraffic/i-08ffdaaab186465db
1735549918460,"2024-12-30 09:11:57,788 [INFO ] W-9000-model-stdout com.amazonaws.ml.mms.wlm.WorkerLifeCycle - [PID] 29",AllTraffic/i-08ffdaaab186465db
1735549918460,"2024-12-30 09:11:57,789 [INFO ] W-9000-model-stdout com.amazonaws.ml.mms.wlm.WorkerLifeCycle - MMS worker started.",AllTraffic/i-08ffdaaab186465db
1735549918460,"2024-12-30 09:11:57,789 [INFO ] W-9000-model-stdout com.amazonaws.ml.mms.wlm.WorkerLifeCycle - Python runtime: 3.6.13",AllTraffic/i-08ffdaaab186465db
1735549918460,"2024-12-30 09:11:57,789 [INFO ] main com.amazonaws.ml.mms.wlm.ModelManager - Model model loaded.",AllTraffic/i-08ffdaaab186465db
1735549918460,"2024-12-30 09:11:57,793 [INFO ] main com.amazonaws.ml.mms.ModelServer - Initialize Inference server with: EpollServerSocketChannel.",AllTraffic/i-08ffdaaab186465db
1735549918460,"2024-12-30 09:11:57,802 [INFO ] W-9000-model com.amazonaws.ml.mms.wlm.WorkerThread - Connecting to: /home/model-server/tmp/.mms.sock.9000",AllTraffic/i-08ffdaaab186465db
1735549918460,"2024-12-30 09:11:57,852 [INFO ] main com.amazonaws.ml.mms.ModelServer - Inference API bind to: http://0.0.0.0:8080",AllTraffic/i-08ffdaaab186465db
1735549918460,Model server started.,AllTraffic/i-08ffdaaab186465db
1735549918460,"2024-12-30 09:11:57,859 [INFO ] W-9000-model-stdout com.amazonaws.ml.mms.wlm.WorkerLifeCycle - Connection accepted: /home/model-server/tmp/.mms.sock.9000.",AllTraffic/i-08ffdaaab186465db
1735549918702,"2024-12-30 09:11:57,860 [WARN ] pool-2-thread-1 com.amazonaws.ml.mms.metrics.MetricCollector - worker pid is not available yet.",AllTraffic/i-08ffdaaab186465db
1735549918702,"2024-12-30 09:11:58,598 [INFO ] W-9000-model-stdout com.amazonaws.ml.mms.wlm.WorkerLifeCycle - Model model loaded io_fd=e218b1fffe837cab-00000011-00000000-4bff44452ed61a70-223c9727",AllTraffic/i-08ffdaaab186465db
1735549918702,"2024-12-30 09:11:58,604 [INFO ] W-9000-model com.amazonaws.ml.mms.wlm.WorkerThread - Backend response time: 705",AllTraffic/i-08ffdaaab186465db
1735549920456,"2024-12-30 09:11:58,605 [WARN ] W-9000-model com.amazonaws.ml.mms.wlm.WorkerLifeCycle - attachIOStreams() threadName=W-model-1",AllTraffic/i-08ffdaaab186465db
1735549924460,"2024-12-30 09:12:00,295 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:49624 ""GET /ping HTTP/1.1"" 200 31",AllTraffic/i-08ffdaaab186465db
1735549930461,"2024-12-30 09:12:05,231 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:49624 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549935241,"2024-12-30 09:12:10,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:49624 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549939461,"2024-12-30 09:12:15,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:49624 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549944505,"2024-12-30 09:12:20,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:49624 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549949461,"2024-12-30 09:12:25,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:49624 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549954461,"2024-12-30 09:12:30,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:49624 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549955791,"2024-12-30 09:12:35,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:49624 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549955791,"2024-12-30 09:12:35,657 [INFO ] W-9000-model com.amazonaws.ml.mms.wlm.WorkerThread - Backend response time: 1",AllTraffic/i-08ffdaaab186465db
1735549960303,"2024-12-30 09:12:35,657 [INFO ] W-9000-model ACCESS_LOG - /169.254.178.2:49624 ""POST /invocations HTTP/1.1"" 500 7",AllTraffic/i-08ffdaaab186465db
1735549964460,"2024-12-30 09:12:40,231 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549969461,"2024-12-30 09:12:45,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549974460,"2024-12-30 09:12:50,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549979460,"2024-12-30 09:12:55,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549984501,"2024-12-30 09:13:00,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549989461,"2024-12-30 09:13:05,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549994460,"2024-12-30 09:13:10,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735549999502,"2024-12-30 09:13:15,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735550004461,"2024-12-30 09:13:20,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735550009460,"2024-12-30 09:13:25,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735550014501,"2024-12-30 09:13:30,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735550019461,"2024-12-30 09:13:35,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735550024461,"2024-12-30 09:13:40,232 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735550030235,"2024-12-30 09:13:45,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735550034460,"2024-12-30 09:13:50,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735550039461,"2024-12-30 09:13:55,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
1735550044461,"2024-12-30 09:14:00,229 [INFO ] pool-1-thread-3 ACCESS_LOG - /169.254.178.2:44552 ""GET /ping HTTP/1.1"" 200 0",AllTraffic/i-08ffdaaab186465db
This is my notebook code:
This is my inference code:
I would appreciate any help - is there anything wrong with the _fn functions?
Beta Was this translation helpful? Give feedback.
All reactions