Will it be possible to extract/save and retrieve KV cache for different chat sessions?
What I mean is, for example, if you created a regular chat application like Llamatik, and the user created multiple chats at different times, when they open a chat, its KV cache could then be retrieved and saved after the response is generated, right?
Will it be possible to extract/save and retrieve KV cache for different chat sessions?
What I mean is, for example, if you created a regular chat application like Llamatik, and the user created multiple chats at different times, when they open a chat, its KV cache could then be retrieved and saved after the response is generated, right?