Background: I am using/testing your framework exclusively in a private network, using Ollama with a number of LLMs on various servers.
Documentation Update: When you update your documentation in the next release, could you please include your recommendations, knowledge, and best practices on how and what to use with Ollama? Both your framework and Ollama are rapidly evolving and having the associated growing pains. The comment I just read about Agent replacing Assistant, … would be helpful (and easy to do) in an Ollama usage recommendation note.
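For anyone else making the same migration, the change looks roughly like this (a sketch based on my reading of the current docs; verify the module paths against your installed version):

```python
# Rough before/after for the Assistant -> Agent rename (my understanding
# of the docs; paths may differ in your version).

# Before (older phidata):
# from phi.assistant import Assistant
# from phi.llm.ollama import Ollama
# assistant = Assistant(llm=Ollama(model="llama3"))

# After:
from phi.agent import Agent
from phi.model.ollama import Ollama

agent = Agent(model=Ollama(id="llama3"))
agent.print_response("Hello from a local model")
```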
I’ve been doing testing across your Cookbook examples, converting them to exclusively use Ollama. Things I’ve ‘discovered’ from my testing include (see the sketch after this list):

- In most cases, if an agent uses tools, it is better to use OllamaTools (but not necessarily in all cases).
- Control agents in a multi-agent environment should only use Ollama.
- Different LLMs run under Ollama can generate similar results, but the differences can be large enough to adversely affect agent execution.
- Prompts that are suitable for OpenAI models are, in (some?) cases, unsuitable for Ollama LLMs.
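A minimal sketch of the split I’ve settled on; the model names and the tool choice are placeholders for whatever you run locally:

```python
# Illustrative sketch of the Ollama vs OllamaTools split described above.
# Model IDs and tools are placeholders; adjust to your local setup.
from phi.agent import Agent
from phi.model.ollama import Ollama, OllamaTools
from phi.tools.duckduckgo import DuckDuckGo

# Tool-using agent: OllamaTools formats tool calls more strictly
researcher = Agent(
    model=OllamaTools(id="llama3"),
    tools=[DuckDuckGo()],
)

# Control/coordinator agent: plain Ollama has been more reliable for me
coordinator = Agent(
    model=Ollama(id="llama3"),
    team=[researcher],
)
```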
My biggest request, after more overview documentation, is to fix all modes of MemoryAgent usage to work with Ollama. Without conversation and session memory, it’s difficult to create a robust virtual assistant.
Finally, it would be helpful to know your development priorities. The fastest, best, and most financially lucrative focus is supporting the OpenAI models and ecosystem. But there is the issue of privacy, which will inhibit major corporations from fully embracing the current environment. So, what is your next focus?
Thanks, and you are on course to develop a truly great product.
Hi @bills
Thank you for reaching out and using Phidata! I’ve tagged the relevant engineers to assist you with your query. We aim to respond within 48 hours.
If this is urgent, please feel free to let us know, and we’ll do our best to prioritize it.
Thanks for your patience!
Hi @bills
Thanks for the insight! We work directly with several customers who have many of the same requirements around using open-source models via Ollama. I agree, we need to give you (the user) much better guidance on using Ollama. I have added it to my list of documentation updates to make.
We have focused on OpenAI since they are generally ahead of the curve in terms of features and are well known, but that doesn’t mean Ollama is of lesser concern to us. At the moment, “Memory Summarization” doesn’t work with Ollama, but other than that, memory should work fine with Ollama.
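In the meantime, a sketch of the workaround (flag names follow the documented AgentMemory interface; treat them as assumptions and adjust if your version differs):

```python
# Sketch: conversation/session memory with a local Ollama model, with the
# session-summary step (the part that currently breaks) turned off.
from phi.agent import Agent, AgentMemory
from phi.model.ollama import Ollama

agent = Agent(
    model=Ollama(id="llama3"),
    memory=AgentMemory(create_session_summary=False),
    add_history_to_messages=True,   # include prior turns in the context
    num_history_responses=5,        # how many previous exchanges to include
)
```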
The truth is also that we are a small team and can’t get to everything. Feedback from the community greatly influences where we put our focus next, so Ollama will move up the list.
I just now noticed the information about the use of OllamaTools that @bills raised in his post - I’ll try this technique.
Overall, I also think that documenting and verifying the use of Ollama instead of OpenAI would make the library more robust and allow a better development experience, as well as support more production deployment options.
As you can see from my post above, I also tried changing the examples to use Ollama, and I’m interested to know whether the issues you encountered were also related to Pydantic validation of what appears to be the expected Tool schema, for example:
```
pydantic_core._pydantic_core.ValidationError: 1 validation error for Tool
function.parameters.properties.additional_information.type
  Input should be a valid string [type=string_type, input_value=['string', 'null'], input_type=list]
```
Do you mind sharing an example (code) of how you solved it using OllamaTools, e.g. for an Agent Team?
Unfortunately, I am not a Pydantic expert… and am encountering similar issues to the ones you are identifying. In some cases, due to time constraints, I actually removed the Pydantic dependencies… I too am encountering problems/challenges with multi-agent usage. Depending on the LLM used, you can get significantly different results and more/fewer Pydantic issues. I’m in the process of unwinding agent results that create Pydantic issues and generate wrong/unexpected results. The challenge is working with multiple black boxes and trying to ‘fence in’ their output. I’m currently wishing for better LLMs… that I’m not forced to train. Let me know if you solve some of your issues. We Ollama users will be very appreciative.
Hi @Leonid
This was an issue with how we formatted tool definitions for Ollama. Ollama expects a slightly stricter JSON schema for tool definitions. We have since fixed this in version 1.1.5 and above.
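To illustrate (a simplified sketch, not our exact internal representation): the validation error above comes from a type union in the parameter schema, which the stricter format rejects:

```python
# Simplified illustration of the schema difference behind the error above.

# Rejected: `type` given as a union list for an optional parameter
loose_parameters = {
    "type": "object",
    "properties": {
        "additional_information": {"type": ["string", "null"]},
    },
}

# Accepted: `type` must be a single string; optionality is expressed by
# leaving the field out of `required` instead
strict_parameters = {
    "type": "object",
    "properties": {
        "additional_information": {"type": "string"},
    },
    "required": [],
}
```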
@bills I am currently working on a revamp of teams that should be a lot more robust. The current implementation of teams does not handle structured output well, and it does not properly enable multimodal use cases. We are building a better version of teams.
Can’t agree more with @Dirk. I really like the Agno interface and would like to use models locally with Ollama, as we are working on some health-data use cases. Happy to assist in whatever shape or form to push this up the queue.