After conducting multiple conversations in the same session, my question tokens keep exceeding the limit

komiya · May 7, 2025, 2:34am

After conducting multiple conversations in the same session, my question tokens keep exceeding the limit. Is there something in the Agent that is increasing the token count of my questions? I am using the knowledge base and memory.

MyAgent：

 self.agent = Agent(
            #模型
            model= model,
            #用户id
            user_id= role.roleid,
            #介绍
            introduction=dedent("""
            You are a professional TRPG game player.
            You excel at analyzing and using tool functions, knowledge bases, past memories, mastering game rules and game situations.
            You are skilled at role-playing specified characters, and planning character actions based on known information to make reasonable decisions.
            """),
            #重复对话
            session_id=self.session_id,

            #记忆引用
            memory=self.memory,
            enable_agentic_memory=self.is_memory,
            enable_user_memories=self.is_memory,
            add_memory_references=self.is_memory,
            add_session_summary_references=self.is_memory,
            enable_session_summaries=self.is_memory,
            #工具
            tools = tools,
            show_tool_calls=True,
            tool_choice=tool_call,
            #知识
            knowledge=knowledge,
            search_knowledge=self.is_knowledge
        )

Monali · May 7, 2025, 5:30am

Hi @komiya , thanks for reaching out and supporting Agno. I’ve shared this with the team, we’re working through all requests one by one and will get back to you soon.
In the meantime please refer the FAQ: Tokens-per-minute rate limiting - Agno

komiya · May 8, 2025, 2:07am

I want to know whether adding memory will increase the token for each query.

ansub · May 16, 2025, 4:20am

Hey @komiya

Our memory features (like chat history, user memories, and session summaries) are designed to help agents feel more intelligent and personal. But as you noticed, this richer context does mean more tokens are used as conversations get longer. It’s a balance: the more your agent “remembers,” the more resources it needs.

You can have a look at our Memory docs: Memory - Agno

Topic		Replies	Views
Issue when add_history_to_messages=True General agent , memory	5	94	February 18, 2025
Why is Agent.run() calling the model four times, and why does num_history_responses=2 still include more history? General agent , tool-call , bug	2	52	March 20, 2025
Initialize agent with custom chat history General agent , storage	3	26	May 30, 2025
Unable to clear in context memory for an agent General agent	7	67	January 20, 2025
Tool call limit parameter for agent and team General tool-call , feature-requests	3	45	April 30, 2025

After conducting multiple conversations in the same session, my question tokens keep exceeding the limit

Related topics