Hello everyone, when users engage in long conversations, what is the best practice for efficiently retaining the dialogue history?
For example, if a user typically has 20 turns in a single conversation, keeping the full history can overflow the context window, consume a large number of tokens, and require frequent database queries to retrieve context…
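
To make the token concern concrete, here is a rough back-of-the-envelope sketch. It assumes the full history is resent with every request and that each turn adds a fixed number of tokens; the 200-tokens-per-turn figure, the function names, and the optional `window` parameter (keep only the last few turns, shown purely for comparison) are all illustrative assumptions, not measurements or a recommended design.

```python
from typing import List, Optional


def prompt_tokens_per_turn(
    turns: int, tokens_per_turn: int, window: Optional[int] = None
) -> List[int]:
    """Prompt size at each turn, assuming every turn adds `tokens_per_turn`.

    window=None means the whole history is resent each time;
    window=k means only the most recent k turns are resent.
    """
    sizes = []
    for t in range(1, turns + 1):
        kept = t if window is None else min(t, window)
        sizes.append(kept * tokens_per_turn)
    return sizes


def total_prompt_tokens(
    turns: int, tokens_per_turn: int, window: Optional[int] = None
) -> int:
    """Total prompt tokens sent over the whole conversation."""
    return sum(prompt_tokens_per_turn(turns, tokens_per_turn, window))


if __name__ == "__main__":
    # Hypothetical numbers: 20 turns, 200 tokens added per turn.
    print(total_prompt_tokens(20, 200))            # full history every turn
    print(total_prompt_tokens(20, 200, window=5))  # last 5 turns only
```

Under these assumptions the total grows roughly with the square of the number of turns when the full history is resent, which is why 20-turn conversations get expensive quickly.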