I was not getting good cache utilization when including dynamic context in agent threads. After a lot of experimentation, I found a good pattern that minimizes how often long lived conversation history gets modified while still supporting dynamic context. It has flexible hooks for doing things ...

Source: [Hacker News](https://github.com/scottpurdy/llmbuffer)

Sponsored