
Description
I just tried to use the current version of "llama-node" with the "rwkv.cpp" backend and failed.
The link in the docs where I should be able to download RWKV models leads nowhere.
Since I could not find pre-quantized models anywhere, I followed the instructions in the rwkv.cpp repo to download, convert, and quantize the 1.5B and 0.1B models (roughly the steps sketched below); I even uploaded the results to HuggingFace.
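For reproducibility, the commands I ran were along these lines (script names as in the rwkv.cpp repo; the exact .pth file name is inferred from the .bin name in the error below and may differ):

```bash
# convert the PyTorch checkpoint to a ggml file (FP16), per the rwkv.cpp README
python rwkv/convert_pytorch_to_ggml.py \
  RWKV-5-World-0.1B-v1-20230803-ctx4096.pth \
  RWKV-5-World-0.1B-v1-20230803-ctx4096.bin \
  FP16

# quantize the FP16 file to Q4_1
python rwkv/quantize.py \
  RWKV-5-World-0.1B-v1-20230803-ctx4096.bin \
  RWKV-5-World-0.1B-v1-20230803-ctx4096-Q4_1.bin \
  Q4_1
```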
Then I copied the example from your docs, set the path to my quantized model, changed the template, and tried to run the result (see the sketch below).
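A minimal sketch of what my test script looked like, reconstructed from memory based on the rwkv.cpp example in the docs (the backend import path, tokenizer file name, and completion parameter names are as I remember them from that example and may not be exact):

```js
import { LLM } from "llama-node";
// backend import path as shown in the llama-node docs (from memory)
import { RwkvCpp } from "llama-node/dist/llm/rwkv-cpp.js";
import path from "path";

const rwkv = new LLM(RwkvCpp);

const run = async () => {
  await rwkv.load({
    // path to my quantized model
    modelPath:
      "/Users/andreas/rozek/AI/RWKV/RWKV-5-World-0.1B-v1-20230803-ctx4096-Q4_1.bin",
    // tokenizer file as used in the docs example
    tokenizerPath: path.resolve(process.cwd(), "./20B_tokenizer.json"),
    nThreads: 4,
    enableLogging: true,
  });

  await rwkv.createCompletion(
    {
      maxPredictLength: 2048,
      topP: 0.1,
      temp: 0.1,
      prompt: "Who is the president of the United States?",
    },
    (response) => process.stdout.write(response.token)
  );
};

run();
```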
Unfortunately, I got nothing but an error message:
```
llama.cpp: loading model from /Users/andreas/rozek/AI/RWKV/RWKV-5-World-0.1B-v1-20230803-ctx4096-Q4_1.bin
error loading model: unknown (magic, version) combination: 67676d66, 00000065; is this really a GGML file?
llama_init_from_file: failed to load model
node:internal/process/promises:288
          triggerUncaughtException(err, true /* fromPromise */);
          ^
[Error: Failed to initialize LLama context from file: /Users/andreas/rozek/AI/RWKV/RWKV-5-World-0.1B-v1-20230803-ctx4096-Q4_1.bin] {
  code: 'GenericFailure'
}

Node.js v18.17.0
```
Do you have any idea what could be wrong?