Why is my agent's output never long enough, even with 'max_tokens=5000'?

Why is my agent's output never long enough, even with `max_tokens=5000`? The setting doesn't seem to take effect.

Hey @Ray, thank you for your question.

Your agent’s output isn’t long enough because max_tokens=5000 is only an upper bound, not a target. The actual length depends on the model’s context window, the size of your input, and how forcefully you prompt the model to be verbose.
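One quick way to see this in practice is to check *why* the model stopped. Here's a minimal sketch, assuming an OpenAI-style chat completions response where `finish_reason` is `"length"` when the cap was hit and `"stop"` when the model decided it was done (adapt the field names to whatever SDK your agent uses):

```python
def hit_token_limit(response: dict) -> bool:
    """Return True if generation stopped because max_tokens was reached,
    rather than the model finishing naturally (OpenAI-style finish_reason)."""
    return response["choices"][0]["finish_reason"] == "length"

# A response that stopped well before max_tokens, by the model's own choice:
short_reply = {
    "choices": [{
        "finish_reason": "stop",
        "message": {"content": "Short answer."},
    }]
}
print(hit_token_limit(short_reply))  # False: the limit was never the problem
```

If this returns False on your short outputs, the model simply chose to stop early, and raising max_tokens will never help; the prompt is what needs to change.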

How to Fix / Get Longer Outputs

  • Be explicit in the prompt

    “Write at least 3000 tokens of detailed explanation covering X, Y, Z.”

  • Chunk outputs
    If you need something like a long article or report, instruct the agent to generate in sections (part 1, part 2).

  • Check context usage
    Log how many tokens your inputs are using. If you’re already close to the model’s window, responses will be cut short no matter what max_tokens says.

  • Remove unwanted stop sequences
    Look at your config to ensure stop=[] unless you intentionally want cutoffs.

  • Streaming + stitch
    Some users stream the output and concatenate the partial chunks to avoid early truncation.
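On the context-usage point: the real output budget is whatever is left of the context window after your input, no matter what max_tokens says. A rough sketch (the ~4-characters-per-token estimate is a crude heuristic for English; use your provider's tokenizer for real numbers, and the window size here is just an example):

```python
def rough_token_estimate(text: str) -> int:
    # Very rough heuristic: ~4 characters per token for English text.
    return max(1, len(text) // 4)

def remaining_output_budget(prompt: str, context_window: int, max_tokens: int) -> int:
    """The output tokens the model can actually produce: the smaller of
    max_tokens and what's left of the context window after the prompt."""
    return min(max_tokens, context_window - rough_token_estimate(prompt))

# A ~4000-character prompt in a 4096-token window leaves far less than 5000:
budget = remaining_output_budget("x" * 4000, context_window=4096, max_tokens=5000)
print(budget)  # 3096 -- the window, not max_tokens, is the binding limit
```

Logging this number before each call makes it obvious when the window, not max_tokens, is what's cutting you off.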
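The chunking advice can be sketched like this; `generate` here is a hypothetical stand-in for whatever model call your agent actually makes:

```python
def generate_in_sections(outline, generate):
    """Ask for one section at a time and stitch the parts together.
    `generate` is any callable that takes a prompt and returns text."""
    parts = []
    for i, section in enumerate(outline, start=1):
        prompt = (f"Write part {i} of a long report. "
                  f"Cover only this section in detail: {section}")
        parts.append(generate(prompt))
    return "\n\n".join(parts)

# Usage with a stub in place of a real model call:
fake_model = lambda prompt: f"[text for: {prompt}]"
report = generate_in_sections(["Intro", "Methods", "Results"], fake_model)
```

Each call stays comfortably inside the context window, so the stitched result can be far longer than any single response could be.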

I hope this helps!

Thanks for your advice! I solved the problem by chunking outputs, adjusting the structure of the agent's outputs, and choosing a different model. Now I can get outputs beyond 10,000 tokens.

Hey @Ray, that’s awesome to hear! :tada:
Glad you got it working. Chunking and structuring outputs is exactly the right approach for long-form generations, especially when models start hitting context limits. Also, choosing the right model with higher context capacity makes a big difference.

Thanks for sharing your solution — it’ll definitely help others facing the same issue!