Open
Description
The app fails while loading models:
root@koboldcpp-cb947d9b7-jqrfp:/koboldcpp# python koboldcpp.py --model /app/models/mythalion-13b.Q8_0.gguf --threads 8 --noavx2 --debugmode
***
Welcome to KoboldCpp - Version 1.43
Attempting to use non-avx2 compatibility library.
Initializing dynamic library: koboldcpp_noavx2.so
==========
Namespace(model='/app/models/mythalion-13b.Q8_0.gguf', model_param='/app/models/mythalion-13b.Q8_0.gguf', port=5001, port_param=5001, host='', launch=False, lora=None, config=None, threads=8, blasthreads=8, psutil_set_threads=False, highpriority=False, contextsize=2048, blasbatchsize=512, ropeconfig=[0.0, 10000.0], stream=False, smartcontext=False, unbantokens=False, bantokens=None, usemirostat=None, forceversion=0, nommap=False, usemlock=False, noavx2=True, debugmode=1, skiplauncher=False, hordeconfig=None, noblas=False, useclblast=None, usecublas=None, gpulayers=0, tensor_split=None)
==========
Loading model: /app/models/mythalion-13b.Q8_0.gguf
[Threads: 8, BlasThreads: 8, SmartContext: False]
Illegal instruction
The build is successful (DONE
without return code):
#6 19.06 I llama.cpp build info:
#6 19.06 I UNAME_S: Linux
#6 19.06 I UNAME_P: unknown
#6 19.06 I UNAME_M: x86_64
#6 19.06 I CFLAGS: -I. -I./include -I./include/CL -I./otherarch -I./otherarch/tools -Ofast -DNDEBUG -std=c11 -fPIC -DGGML_USE_K_QUANTS -DLOG_DISABLE_LOGS -pthread -s -pthread -march=native -mtune=native
#6 19.06 I CXXFLAGS: -I. -I./common -I./include -I./include/CL -I./otherarch -I./otherarch/tools -Ofast -DNDEBUG -std=c++11 -fPIC -DGGML_USE_K_QUANTS -DLOG_DISABLE_LOGS -pthread -s -Wno-multichar -Wno-write-strings -pthread
#6 19.06 I LDFLAGS:
#6 19.06 I CC: cc (Debian 12.2.0-14) 12.2.0
#6 19.06 I CXX: g++ (Debian 12.2.0-14) 12.2.0
#6 19.06
#6 19.06 cc -I. -I./include -I./include/CL -I./otherarch -I./otherarch/tools -Ofast -DNDEBUG -std=c11 -fPIC -DGGML_USE_K_QUANTS -DLOG_DISABLE_LOGS -pthread -s -pthread -march=native -mtune=native -c ggml.c -o ggml.o
#6 33.43 cc -I. -I./include -I./include/CL -I./otherarch -I./otherarch/tools -Ofast -DNDEBUG -std=c11 -fPIC -DGGML_USE_K_QUANTS -DLOG_DISABLE_LOGS -pthread -s -pthread -march=native -mtune=native -c otherarch/ggml_v2.c -o ggml_v2.o
#6 45.55 cc -I. -I./include -I./include/CL -I./otherarch -I./otherarch/tools -Ofast -DNDEBUG -std=c11 -fPIC -DGGML_USE_K_QUANTS -DLOG_DISABLE_LOGS -pthread -s -pthread -march=native -mtune=native -c otherarch/ggml_v1.c -o ggml_v1.o
#6 53.97 g++ -I. -I./common -I./include -I./include/CL -I./otherarch -I./otherarch/tools -Ofast -DNDEBUG -std=c++11 -fPIC -DGGML_USE_K_QUANTS -DLOG_DISABLE_LOGS -pthread -s -Wno-multichar -Wno-write-strings -pthread -c expose.cpp -o expose.o
#6 55.71 g++ -I. -I./common -I./include -I./include/CL -I./otherarch -I./otherarch/tools -Ofast -DNDEBUG -std=c++11 -fPIC -DGGML_USE_K_QUANTS -DLOG_DISABLE_LOGS -pthread -s -Wno-multichar -Wno-write-strings -pthread -c common/common.cpp -o common.o
#6 64.01 g++ -I. -I./common -I./include -I./include/CL -I./otherarch -I./otherarch/tools -Ofast -DNDEBUG -std=c++11 -fPIC -DGGML_USE_K_QUANTS -DLOG_DISABLE_LOGS -pthread -s -Wno-multichar -Wno-write-strings -pthread -c gpttype_adapter.cpp -o gpttype_adapter.o
Environment and Context
$ lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 46 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 36
On-line CPU(s) list: 0-35
Vendor ID: GenuineIntel
BIOS Vendor ID: QEMU
Model name: Intel(R) Xeon(R) CPU E5-2470 v2 @ 2.40GHz
BIOS Model name: pc-i440fx-8.0 CPU @ 2.0GHz
BIOS CPU family: 1
CPU family: 6
Model: 62
Thread(s) per core: 1
Core(s) per socket: 18
Socket(s): 2
Stepping: 4
BogoMIPS: 4799.99
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 cl
flush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc ar
ch_perfmon rep_good nopl xtopology cpuid tsc_known_freq pni pclmulqdq vmx
ssse3 cx16 pdcm pcid sse4_1 sse4_2 x2apic popcnt tsc_deadline_timer aes xs
ave avx f16c rdrand hypervisor lahf_lm cpuid_fault pti ssbd ibrs ibpb stib
p tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust smep erms xsav
eopt arat umip md_clear arch_capabilities
Virtualization features:
Virtualization: VT-x
Hypervisor vendor: KVM
Virtualization type: full
Caches (sum of all):
L1d: 1.1 MiB (36 instances)
L1i: 1.1 MiB (36 instances)
L2: 144 MiB (36 instances)
L3: 32 MiB (2 instances)
NUMA:
NUMA node(s): 2
NUMA node0 CPU(s): 0-17
NUMA node1 CPU(s): 18-35
Vulnerabilities:
Itlb multihit: Not affected
L1tf: Mitigation; PTE Inversion; VMX flush not necessary, SMT disabled
Mds: Mitigation; Clear CPU buffers; SMT Host state unknown
Meltdown: Mitigation; PTI
Mmio stale data: Unknown: No mitigations
Retbleed: Not affected
Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl
Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Spectre v2: Mitigation; Retpolines, IBPB conditional, IBRS_FW, STIBP disabled, RSB fil
ling, PBRSB-eIBRS Not affected
Srbds: Not affected
Tsx async abort: Not affected
- Operating System: Debian Bookworm v12.1
$ uname -a
Linux koboldcpp-cb947d9b7-jqrfp 6.1.42-production+truenas #2 SMP PREEMPT_DYNAMIC Mon Aug 14 23:21:26 UTC 2023 x86_64 GNU/Linux
$ python3 --version
Python 3.10.13
$ make --version
GNU Make 4.3
Built for x86_64-pc-linux-gnu
Copyright (C) 1988-2020 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.
$ g++ --version
g++ (Debian 12.2.0-14) 12.2.0
Copyright (C) 2022 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ git log | head -1
commit 2dc96687eb7c0fcacb2506e2fcf97dc909cc6cae
$ sha256sum /app/models/mythalion-13b.Q8_0.gguf
ed815d6d74783cc45a66beccceaa6e7d2e4642e38e333334e142e08446072a6e /app/models/mythalion-13b.Q8_0.gguf
I've also reported here because I'm not sure if it's KoboldCPP related issue or Docker/Kubernetes related issue: LostRuins/koboldcpp#434
Metadata
Metadata
Assignees
Labels
No labels