Differences of v0.3 docker images and connection refused from API call #1255
lililolo0927
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
In docker hub for KTransformers, I've found several kinds of tags for the new version(v0.3) images such as v0.3-AVX2, lastest-AVX2, v0.3-NATIVE, latest-NATIVE, v0.3-FANCY, lastest-FANCY, v0.3-AVX512, latest-AVX512 .
What exactly is the difference between these tags?
I'm not sure which image I should download.
Currently, I downloaded
approachingai/ktransformers:v0.3-AVX2
and tried running the Qwen3-30B-A3B model.There are no errors, but I can't access through the API.
When I run
curl -X 'GET' 'http://localhost:10002/api/tags'
, I get a "connection refused" response.I've confirmed that all the necessary ports inside the container are open, but API access still doesn't work.
The command I used is as follows:
In addition , I tried the option
--optimize_config_path ktransformers/optimize/optimize_rules/Qwen3Moe-serve-amx.yaml
but it returned me an error even though my machine support AMX.I would appreciate your opinions on this.
Thanks in advance.
Beta Was this translation helpful? Give feedback.
All reactions