Skip to content

Commit 914a397

Browse files
author
Shaw
committed
training, inference, registry
1 parent d916433 commit 914a397

423 files changed

Lines changed: 48476 additions & 4013 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

ELIZA_1_GGUF_READINESS.md

Lines changed: 0 additions & 60 deletions
Original file line numberDiff line numberDiff line change
@@ -47,9 +47,6 @@ Required files:
4747
- `quantization/polarquant_config.json`
4848

4949
Missing files/evidence:
50-
- `asr/eliza-1-asr-mmproj.gguf`
51-
- `asr/eliza-1-asr.gguf`
52-
- `cache/voice-preset-default.bin`
5350
- `checksums/SHA256SUMS`
5451
- `dflash/drafter-0_6b.gguf`
5552
- `dflash/target-meta.json`
@@ -71,20 +68,14 @@ Missing files/evidence:
7168
- `evidence/platform/windows-x64-cpu.json`
7269
- `evidence/platform/windows-x64-vulkan.json`
7370
- `evidence/release.json`
74-
- `licenses/LICENSE.asr`
7571
- `licenses/LICENSE.dflash`
7672
- `licenses/LICENSE.eliza-1`
7773
- `licenses/LICENSE.text`
78-
- `licenses/LICENSE.vad`
79-
- `licenses/LICENSE.voice`
8074
- `quantization/fused_turboquant.json`
8175
- `quantization/polarquant_config.json`
8276
- `quantization/qjl_config.json`
8377
- `quantization/turboquant.json`
8478
- `text/eliza-1-0_6b-32k.gguf`
85-
- `tts/omnivoice-base-Q4_K_M.gguf`
86-
- `tts/omnivoice-tokenizer-Q4_K_M.gguf`
87-
- `vad/silero-vad-int8.onnx`
8879

8980
## 1_7b
9081

@@ -125,45 +116,21 @@ Required files:
125116
- `quantization/polarquant_config.json`
126117

127118
Missing files/evidence:
128-
- `asr/eliza-1-asr-mmproj.gguf`
129-
- `asr/eliza-1-asr.gguf`
130-
- `cache/voice-preset-default.bin`
131-
- `checksums/SHA256SUMS`
132-
- `dflash/drafter-1_7b.gguf`
133-
- `dflash/target-meta.json`
134-
- `evals/aggregate.json`
135-
- `evals/cpu_dispatch.json`
136119
- `evals/cpu_reference.json`
137-
- `evals/metal_dispatch.json`
138-
- `evals/metal_verify.json`
139-
- `evals/vulkan_dispatch.json`
140-
- `evals/vulkan_verify.json`
141120
- `evidence/platform/android-adreno-vulkan.json`
142121
- `evidence/platform/android-mali-vulkan.json`
143-
- `evidence/platform/darwin-arm64-metal.json`
144122
- `evidence/platform/ios-arm64-metal.json`
145123
- `evidence/platform/linux-x64-cpu.json`
146124
- `evidence/platform/linux-x64-vulkan.json`
147125
- `evidence/platform/windows-arm64-cpu.json`
148126
- `evidence/platform/windows-arm64-vulkan.json`
149127
- `evidence/platform/windows-x64-cpu.json`
150128
- `evidence/platform/windows-x64-vulkan.json`
151-
- `evidence/release.json`
152-
- `licenses/LICENSE.asr`
153-
- `licenses/LICENSE.dflash`
154-
- `licenses/LICENSE.eliza-1`
155-
- `licenses/LICENSE.text`
156-
- `licenses/LICENSE.vad`
157-
- `licenses/LICENSE.voice`
158129
- `quantization/fused_turboquant.json`
159130
- `quantization/polarquant_config.json`
160131
- `quantization/qjl_config.json`
161132
- `quantization/turboquant.json`
162133
- `text/eliza-1-1_7b-32k.gguf`
163-
- `text/eliza-1-1_7b-64k.gguf`
164-
- `tts/omnivoice-base-Q4_K_M.gguf`
165-
- `tts/omnivoice-tokenizer-Q4_K_M.gguf`
166-
- `vad/silero-vad-int8.onnx`
167134

168135
## 9b
169136

@@ -210,9 +177,6 @@ Required files:
210177
- `quantization/polarquant_config.json`
211178

212179
Missing files/evidence:
213-
- `asr/eliza-1-asr-mmproj.gguf`
214-
- `asr/eliza-1-asr.gguf`
215-
- `cache/voice-preset-default.bin`
216180
- `checksums/SHA256SUMS`
217181
- `dflash/drafter-9b.gguf`
218182
- `dflash/target-meta.json`
@@ -239,22 +203,16 @@ Missing files/evidence:
239203
- `evidence/platform/windows-x64-cuda.json`
240204
- `evidence/platform/windows-x64-vulkan.json`
241205
- `evidence/release.json`
242-
- `licenses/LICENSE.asr`
243206
- `licenses/LICENSE.dflash`
244207
- `licenses/LICENSE.eliza-1`
245208
- `licenses/LICENSE.text`
246-
- `licenses/LICENSE.vad`
247209
- `licenses/LICENSE.vision`
248-
- `licenses/LICENSE.voice`
249210
- `quantization/fused_turboquant.json`
250211
- `quantization/polarquant_config.json`
251212
- `quantization/qjl_config.json`
252213
- `quantization/turboquant.json`
253214
- `text/eliza-1-9b-128k.gguf`
254215
- `text/eliza-1-9b-64k.gguf`
255-
- `tts/omnivoice-base-Q8_0.gguf`
256-
- `tts/omnivoice-tokenizer-Q8_0.gguf`
257-
- `vad/silero-vad-int8.onnx`
258216
- `vision/mmproj-9b.gguf`
259217

260218
## 27b
@@ -302,9 +260,6 @@ Required files:
302260
- `quantization/polarquant_config.json`
303261

304262
Missing files/evidence:
305-
- `asr/eliza-1-asr-mmproj.gguf`
306-
- `asr/eliza-1-asr.gguf`
307-
- `cache/voice-preset-default.bin`
308263
- `checksums/SHA256SUMS`
309264
- `dflash/drafter-27b.gguf`
310265
- `dflash/target-meta.json`
@@ -327,22 +282,16 @@ Missing files/evidence:
327282
- `evidence/platform/windows-x64-cuda.json`
328283
- `evidence/platform/windows-x64-vulkan.json`
329284
- `evidence/release.json`
330-
- `licenses/LICENSE.asr`
331285
- `licenses/LICENSE.dflash`
332286
- `licenses/LICENSE.eliza-1`
333287
- `licenses/LICENSE.text`
334-
- `licenses/LICENSE.vad`
335288
- `licenses/LICENSE.vision`
336-
- `licenses/LICENSE.voice`
337289
- `quantization/fused_turboquant.json`
338290
- `quantization/polarquant_config.json`
339291
- `quantization/qjl_config.json`
340292
- `quantization/turboquant.json`
341293
- `text/eliza-1-27b-128k.gguf`
342294
- `text/eliza-1-27b-256k.gguf`
343-
- `tts/omnivoice-base-Q8_0.gguf`
344-
- `tts/omnivoice-tokenizer-Q8_0.gguf`
345-
- `vad/silero-vad-int8.onnx`
346295
- `vision/mmproj-27b.gguf`
347296

348297
## 27b-256k
@@ -389,9 +338,6 @@ Required files:
389338
- `quantization/polarquant_config.json`
390339

391340
Missing files/evidence:
392-
- `asr/eliza-1-asr-mmproj.gguf`
393-
- `asr/eliza-1-asr.gguf`
394-
- `cache/voice-preset-default.bin`
395341
- `checksums/SHA256SUMS`
396342
- `dflash/drafter-27b-256k.gguf`
397343
- `dflash/target-meta.json`
@@ -413,19 +359,13 @@ Missing files/evidence:
413359
- `evidence/platform/linux-x64-rocm.json`
414360
- `evidence/platform/linux-x64-vulkan.json`
415361
- `evidence/release.json`
416-
- `licenses/LICENSE.asr`
417362
- `licenses/LICENSE.dflash`
418363
- `licenses/LICENSE.eliza-1`
419364
- `licenses/LICENSE.text`
420-
- `licenses/LICENSE.vad`
421365
- `licenses/LICENSE.vision`
422-
- `licenses/LICENSE.voice`
423366
- `quantization/fused_turboquant.json`
424367
- `quantization/polarquant_config.json`
425368
- `quantization/qjl_config.json`
426369
- `quantization/turboquant.json`
427370
- `text/eliza-1-27b-256k.gguf`
428-
- `tts/omnivoice-base-Q8_0.gguf`
429-
- `tts/omnivoice-tokenizer-Q8_0.gguf`
430-
- `vad/silero-vad-int8.onnx`
431371
- `vision/mmproj-27b-256k.gguf`

ELIZA_1_RELEASE_ASSET_STATUS.md

Lines changed: 100 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,100 @@
1+
# Eliza-1 Release Asset Status
2+
3+
Generated on 2026-05-11 after local iOS smoke + asset acquisition.
4+
5+
This is a release-prep ledger, not a release approval. All upstream source
6+
weights acquired here are staged under `source/` and are not final Eliza-1
7+
weights. The publishable files must still land at the paths listed in
8+
`ELIZA_1_GGUF_READINESS.md`.
9+
10+
## Local Bundle Roots
11+
12+
All local bundles are under:
13+
14+
`/Users/shawwalters/.eliza/local-inference/models/`
15+
16+
Staged bundle directories:
17+
18+
- `eliza-1-0_6b.bundle`
19+
- `eliza-1-1_7b.bundle`
20+
- `eliza-1-9b.bundle`
21+
- `eliza-1-27b.bundle`
22+
- `eliza-1-27b-256k.bundle`
23+
24+
`stage_eliza1_bundle_assets.py --link-mode hardlink` now hardlinks Hub cache
25+
blobs into tier bundles, so repeated ASR/TTS assets do not consume disk once
26+
per tier.
27+
28+
## Acquired Non-Text Assets
29+
30+
Every tier has these runtime side assets staged with SHA-256 evidence in
31+
`evidence/bundle-assets.json`:
32+
33+
- TTS: `Serveurperso/OmniVoice-GGUF`
34+
- ASR: `ggml-org/Qwen3-ASR-0.6B-GGUF` for 0.6B/1.7B/9B tiers
35+
- ASR: `ggml-org/Qwen3-ASR-1.7B-GGUF` for 27B tiers
36+
- VAD: `onnx-community/silero-vad`
37+
- Cache: deterministic `cache/voice-preset-default.bin`
38+
39+
## Acquired Source Weights
40+
41+
Each tier has `evidence/source-weights.json` and source files under `source/`:
42+
43+
| Tier | Text source | DFlash source | Vision source |
44+
| --- | --- | --- | --- |
45+
| `0_6b` | `Qwen/Qwen3-0.6B-GGUF` / `Qwen3-0.6B-Q8_0.gguf` | Missing upstream drafter | n/a |
46+
| `1_7b` | `Qwen/Qwen3-1.7B-GGUF` / `Qwen3-1.7B-Q8_0.gguf` | Missing upstream drafter | n/a |
47+
| `9b` | `unsloth/Qwen3.5-9B-GGUF` / `Qwen3.5-9B-Q4_K_M.gguf` | `lym00/Qwen3.5-9B-DFlash-GGUF-Test` / `Qwen3.5-9B-DFlash-q8_0.gguf` | `unsloth/Qwen3.5-9B-GGUF` / `mmproj-F16.gguf` |
48+
| `27b` | `batiai/Qwen3.6-27B-GGUF` / `Qwen-Qwen3.6-27B-Q4_K_M.gguf` | `spiritbuun/Qwen3.6-27B-DFlash-GGUF` / `dflash-draft-3.6-q8_0.gguf` | `batiai/Qwen3.6-27B-GGUF` / `mmproj-Qwen-Qwen3.6-27B-Q6_K.gguf` |
49+
| `27b-256k` | same as `27b` | same as `27b` | same as `27b` |
50+
51+
These are conversion/training inputs only. Do not rename them into `text/` or
52+
`dflash/` release paths without running the final Eliza-1 train, quantize,
53+
kernel, eval, checksum, license, and upload gates.
54+
55+
## iOS Evidence
56+
57+
XCFramework physical-device smoke now passes:
58+
59+
- Report: `packages/inference/verify/hardware-results/ios-device-smoke-2026-05-11.json`
60+
- Device: iPhone 15 Pro, iOS 26.3.1, UDID `00008130-001955E91EF8001C`
61+
- Result: 3/3 XCTest cases passed
62+
- Voice ABI: required and not skipped
63+
64+
This proves XCFramework structure, Metal availability, runtime symbols, and
65+
fail-closed ABI calls. It does not prove a final weight-backed Capacitor app
66+
route yet.
67+
68+
## Publish Blockers
69+
70+
- Final Eliza-1 text GGUFs are not produced for any tier.
71+
- Final 0.6B and 1.7B DFlash drafter GGUFs are missing.
72+
- The 9B DFlash source comes from a test repo and is not release-grade.
73+
- Final long-context variants (`32k`, `64k`, `128k`, `256k`) are not generated
74+
and evaluated from trained Eliza-1 checkpoints.
75+
- Final `dflash/target-meta.json` files are missing for all tiers except any
76+
local stand-in artifacts already marked non-publishable.
77+
- Final evals are missing: text, ASR WER, VAD latency, TTS RTF, expressive voice,
78+
DFlash acceptance, first token, first audio, barge-in, 30-turn endurance,
79+
mobile RSS, thermal, and backend dispatch.
80+
- Final release checksums and release-reviewed license attestations are missing
81+
for the staged final paths.
82+
- Final platform evidence is incomplete across native Linux Vulkan, Android
83+
graph dispatch, CUDA, ROCm, GH200/H200, native Windows, and weight-backed iOS.
84+
- `elizalabs` Hugging Face upload evidence is still absent. Existing accessible
85+
staging evidence is not a release namespace proof.
86+
87+
## Next Release Actions
88+
89+
1. Train/fine-tune the Eliza-1 text checkpoints for each tier.
90+
2. Produce or train matching DFlash drafters for 0.6B/1.7B/9B/27B tiers.
91+
3. Quantize final text and drafters into the exact `text/` and `dflash/` paths
92+
required by `ELIZA_1_GGUF_READINESS.md`.
93+
4. Generate final quantization sidecars for TurboQuant, fused TurboQuant, QJL,
94+
and PolarQuant from the exact final bytes.
95+
5. Run all local eval gates and write final `evals/*.json`.
96+
6. Run platform hardware dispatch gates and write final `evidence/platform/*.json`.
97+
7. Generate `checksums/SHA256SUMS` and `evidence/release.json` only after every
98+
gate is green.
99+
8. Upload to `elizalabs/eliza-1-*` and preserve upload logs/Hub URLs in
100+
`evidence/release.json`.

ELIZA_1_TESTING_TODO.md

Lines changed: 10 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ Status as of 2026-05-11 on this workspace:
66
- Standalone Vulkan SPIR-V fixtures pass on this Apple Silicon host through MoltenVK, including TurboQuant, QJL, PolarQuant, and Polar+QJL residual.
77
- Built-fork Vulkan graph dispatch source wiring now exists for `GGML_OP_ATTN_SCORE_QJL`, `GGML_OP_ATTN_SCORE_TBQ` (`turbo3`, `turbo4`, `turbo3_tcq`), and `GGML_OP_ATTN_SCORE_POLAR` (`use_qjl=0/1`), but runtime-ready capability bits stay false until native Vulkan graph smoke passes on physical hardware.
88
- `adb devices -l` currently shows only `emulator-5554`; emulator Vulkan is diagnostic only and is not recordable Eliza-1 hardware evidence.
9-
- `xcrun xctrace list devices` currently shows `Shaw's iPhone (26.3.1)` offline even though `xcrun devicectl list devices` sees the iPhone 15 Pro as paired/available; simulator results are not physical iOS evidence.
9+
- Physical iOS XCFramework smoke now passes on `Shaw's iPhone (26.3.1)` / iPhone 15 Pro UDID `00008130-001955E91EF8001C`; simulator results remain non-recordable for release bundle evidence.
1010
- CUDA, ROCm, GH200, and native Windows runners are present and fail closed, but this Mac cannot provide recordable target hardware evidence.
1111
- No final Eliza-1 release bundles exist yet with final weights, hashes, eval outputs, license manifests, and Hugging Face upload evidence.
1212

@@ -201,10 +201,16 @@ node packages/app-core/scripts/ios-xcframework/run-physical-device-smoke.mjs
201201

202202
Current iOS blockers:
203203

204-
- The physical iPhone is visible to CoreDevice as paired/available, but `xctrace` still lists UDID `00008130-001955E91EF8001C` as offline.
205-
- Retrying with the CoreDevice identifier reached an interactive `Password:` prompt before XCTest output; do not enter credentials inside the runner.
204+
- XCFramework physical-device smoke is PASS as of 2026-05-11:
205+
`packages/inference/verify/hardware-results/ios-device-smoke-2026-05-11.json`
206+
reports 3/3 XCTest cases passing without `--skip-voice-abi`.
207+
- The fixed runner treats CoreDevice `connected` state as connected, and
208+
`build-xcframework.mjs` refreshes the runtime ABI shim before packaging so
209+
stale shim archives cannot reintroduce the old TTS ABI crash.
210+
- Remaining iOS blocker: run a real Eliza-1 bundle smoke from the Capacitor app
211+
shell with final text + DFlash + TTS + ASR payloads and record first token,
212+
first audio, peak RSS, thermal state, and at least one local voice route.
206213
- Simulator runs do not count as physical iOS evidence.
207-
- Physical smoke must validate the embedded Metal library, Capacitor bridge load, and at least one real local-inference route from the app shell.
208214

209215
## Release Bundle Evidence
210216

0 commit comments

Comments
 (0)