Description
Hi!
I've run into an issue when trying to use chat mode. Every time I try to chat, no matter how big or small my code base is, it hangs forever at "Processing". Meanwhile, search mode works fine.
The only thing that consistently accompanies the problem is this log output:
llama_init_from_model: n_batch is less than GGML_KQ_MASK_PAD - increasing to 32
llama_init_from_model: n_ctx_per_seq (512) < n_ctx_train (16384) -- the full capacity of the model will not be utilized
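For reference, the second warning suggests the chat backend is being created with the default context of 512 tokens, far below the model's trained context of 16384. Here is a minimal sketch of how I'd expect a larger context to be requested if the backend goes through llama-cpp-python; the model path, parameter values, and the assumption that this project uses llama-cpp-python at all are mine, since I don't know its actual config entry point:

```python
from llama_cpp import Llama

# Hypothetical sketch: raise n_ctx to the model's trained context instead of
# the 512 default, and keep n_batch above GGML_KQ_MASK_PAD so the first
# warning doesn't trigger either.
llm = Llama(
    model_path="path/to/model.gguf",  # placeholder path
    n_ctx=16384,      # match n_ctx_train from the log
    n_batch=512,      # well above GGML_KQ_MASK_PAD
    n_gpu_layers=-1,  # offload all layers to the RTX 3060 if VRAM allows
)

# A short chat-style completion to check that a response actually comes back.
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize the App class in one sentence."}],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```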
To isolate the problem, I created an empty project containing a single Go file with an empty App class. Search mode works fine and responds quickly and correctly, but in chat mode I never get any response and can't understand what exactly is happening.
Environment: Windows 11 with WSL, all libraries installed correctly (including faiss-gpu), RTX 3060, i5-12400F (works very well with Stable Diffusion).