Ollama

Connect to Ollama™ models from MATLAB® locally or nonlocally.

Setup
Get Started
Manage Chat History
Images
JSON-Formatted and Structured Output
- JSON Mode
- Structured Output
Tool Calling
See Also
Examples

Setup

Connecting to Ollama models using this add-on requires an installed version of Ollama, as well as installed versions of the models you want to use.

Install Ollama. For information on how to install Ollama, see https://ollama.com/download.
Install Model. If you have Ollama installed, then you can install models from the MATLAB Command Window using the "ollama pull" command. For example, to install Mistral, run this code.

>> !ollama pull mistral

Get Started

Connect to Ollama using the ollamaChat function and generate text using the generate function. Optionally specify a system prompt.

model = ollamaChat("mistral","You are a helpful assistant.");
generate(model,"Who would win a footrace, a snail or a blue whale?")

ans = " A blue whale cannot compete in a footrace as it lives and moves primarily in water, not on land. If we were to compare the speed between a snail and an animal that can move on land, like a cheetah for example, a cheetah would win hands down. Cheetahs have been recorded running at speeds up to 70 mph (112 km/h), while the maximum speed achievable by the average garden snail is about 0.03 mph (0.05 km/h)."

By default, the ollamaChat function connects to a local server. To use a remote Ollama server, specify the server name and port number using the Endpoint name-value argument.

>> model = ollamaChat("mistral",Endpoint="myOllamaServer:12345");

For more examples of how to generate text using Ollama from MATLAB, see for instance:

Process Generated Text in Real Time by Using Ollama in Streaming Mode
Retrieval-Augmented Generation Using Ollama and MATLAB (requires Text Analytics Toolbox™)

Manage Chat History

Manage and store messages in a conversation using the messageHistory function. Use this to create a chatbot, use few-shot prompting, or to facilitate workflows that require more than a single LLM call, such as tool calling.

Connect to Ollama using the ollamaChat function.

model = ollamaChat("mistral");

Initialize the message history.

messages = messageHistory;

Add a user message to the message history.

messages = addUserMessage(messages,"What is the precise definition of a treble crochet stitch?");

Generate a response from the message history.

[generatedText,completeOutput] = generate(model,messages)

generatedText = " A Treble Crochet Stitch (abbreviated as tr or trbl in patterns) is one of the basic stitches used in crocheting. It is formed by yarn-overing twice and inserting the hook under two loops on the previous row, then pulling up a loop through all six loops on the hook: one loop from each yarn over and one loop from each of the two adjacent stitches below. This combination creates a taller and looser stitch compared to a double crochet (dc) stitch. The treble crochet stitch is often used for increasing and creating textured patterns in crocheting projects."
completeOutput = struct with fields:
       role: 'assistant'
    content: ' A Treble Crochet Stitch (abbreviated as tr or trbl in patterns) is one of the basic stitches used in crocheting. It is formed by yarn-overing twice and inserting the hook under two loops on the previous row, then pulling up a loop through all six loops on the hook: one loop from each yarn over and one loop from each of the two adjacent stitches below. This combination creates a taller and looser stitch compared to a double crochet (dc) stitch. The treble crochet stitch is often used for increasing and creating textured patterns in crocheting projects.'

Add the response message to the message history.

messages = addResponseMessage(messages,completeOutput);

Ask a follow-up question by adding another user message to the message history.

messages = addUserMessage(messages,"When was it first invented?");

Generate a response from the message history.

generate(model,messages)

ans = " The exact origins of crochet are unclear, but it is believed that it originated around the mid-16th century. Various types of crochet including treble stitches were developed and refined over time by different cultures such as Egyptians, Arabs, Persians, and Europeans. The modern version of crocheting using a hook and yarn became popular in Europe during the 19th century with the spread of the Industrial Revolution, which made it easier to produce thread and hooks on a mass scale. However, it is difficult to pinpoint the exact time when specific stitches like the treble crochet were first invented."

For another example of how to use and manage the message history, see the Create Simple Ollama ChatBot example (requires Text Analytics Toolbox).

Images

You can use Ollama to generate text based on image inputs. For information on whether an Ollama model supports image inputs, check whether the model has the vision tag in ollama.com/library.

Tip

Some models that do not support image inputs allow you to specify images in the prompt, but silently ignore the images from the input.

Load a sample image from Wikipedia. Use the imread function to read images from URLs or filenames.

image_url = 'https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg';
im = imread(image_url);
figure
imshow(im)

Set up the interface to Ollama using the model Moondream.

chat = ollamaChat("moondream");

Initialize the message history. Add a user prompt, along with the image, to the message history.

messages = messageHistory;
messages = addUserMessageWithImages(messages,"Please describe the image.", string(image_url));

Generate a response.

generate(chat,messages)

ans = 
    "
     The image shows a long walkway or boardwalk made of wood, situated between two grass fields, likely in North America as it is close to the border between the United States and Mexico. The boards for the pathway are positioned at an angle towards the left side on both parts of the walkway. This path can provide easy access to nature, offering a relaxing stroll through the lush green field."

JSON-Formatted and Structured Output

For some workflows, it is useful to generate text in a specific format. For example, a predictable output format allows you to more easily analyze the generated output.

You can specify the format either by using JSON mode, or by using structured outputs, depending on what the model supports. Both generate text containing JSON code. For more information on structured output in Ollama, see https://ollama.com/blog/structured-outputs.

JSON Mode

To run an LLM in JSON mode, set the ResponseFormat name-value argument of ollamaChat or generate to "json". To configure the format of the generated JSON code, describe the format using natural language and provide it to the model either in the system prompt or as a user message. The prompt or message describing the format must contain the word "json" or "JSON".

Structured Output

To use structured outputs, rather than describing the required format using natural language, provide the model with a valid JSON schema.

In LLMs with MATLAB, you can specify the structure of the output in two different ways.

Specify a valid JSON Schema directly.
Specify an example structure array that adheres to the required output format. The software automatically generates the corresponding JSON Schema and provides this to the LLM. Then, the software automatically converts the output of the LLM back into a structure array.

To do this, set the ResponseFormat name-value argument of ollamaChat or generate to:

A string scalar containing a valid JSON Schema.
A structure array containing an example that adheres to the required format, for example: ResponseFormat=struct("Name","Rudolph","NoseColor",[255 0 0])

For an example of how to use structured output with LLMs with MATLAB, see Analyze Sentiment in Text Using ChatGPT and Structured Output.

Tool Calling

Some large language models can suggest calls to a tool that you have, such as a MATLAB function, in their generated output. An LLM does not execute the tool itself. Instead, the model encodes the name of the tool and the name and value of any input arguments. You can then write scripts that automate the tool calls suggested by the LLM.

To use tool calling, specify the ToolChoice name-value argument of the ollamaChat function.

For information on whether an Ollama model supports tool calling, check whether the model has the tools tag in ollama.com/library.

For an example of how to use tool calling with Ollama in LLMs with MATLAB, see Analyze Text Data Using Parallel Function Calls with Ollama.

Examples

Process Generated Text in Real Time by Using Ollama in Streaming Mode
Create Simple Ollama ChatBot (requires Text Analytics Toolbox)
Analyze Sentiment in Text Using ChatGPT and Structured Output
Analyze Text Data Using Parallel Function Calls with Ollama
Retrieval-Augmented Generation Using Ollama and MATLAB (requires Text Analytics Toolbox)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!