Inference Using Local GPU Resources

Dear Agno Team,

I would like to run a model uploaded to Hugging Face using my local GPU resources.

However, I couldn’t find any relevant documentation in the official docs.

I’m wondering if I might have missed something or if this feature is planned for future updates.

Hi @coolwin20
Thanks for reaching out and for using Agno! I’ve looped in the right engineers to help with your question. We usually respond within 48 hours, but if this is urgent, just let us know, and we’ll do our best to prioritize it.
Appreciate your patience—we’ll get back to you soon! :smile:


Hey @coolwin20, here is the documentation for running Hugging Face with Agno: HuggingFace - Agno :raising_hands:

@ayush I know about that module, but doesn’t it only perform inference through the Hugging Face API? I want to perform inference using the GPU on my local computer.
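
For reference, this is the kind of local inference I mean. This is just a minimal sketch using the `transformers` library directly (not an Agno API); the tiny checkpoint `sshleifer/tiny-gpt2` is only an example model name:

```python
# Minimal sketch of local GPU inference with a Hugging Face model,
# using the transformers library directly (not an Agno API).
import torch
from transformers import pipeline

# Use GPU 0 if CUDA is available, otherwise fall back to CPU (-1).
device = 0 if torch.cuda.is_available() else -1

# "sshleifer/tiny-gpt2" is just a tiny example checkpoint for illustration.
generator = pipeline("text-generation", model="sshleifer/tiny-gpt2", device=device)

result = generator("Hello, smart farm", max_new_tokens=8)
print(type(result[0]["generated_text"]) is str)  # → True
```

The model weights are downloaded from the Hub once and then run entirely on my machine; this is what I’d like to do through Agno’s agent interface.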

Hey @coolwin20, we actually don’t support this feature yet. I have added it to the community wishlist.

@ayush Thank you! I am preparing a production service in the smart-farm sector using Agno. The current API-based agent framework is already excellent, and I look forward to future on-premise support, which will make it even more useful.