Tag: java
All the articles with the tag "java".
-
What is an AI agent? Moving beyond single LLM calls
A single LLM call answers a question. An AI agent reasons, decides which tools to use, calls them, observes results, and loops until the task is complete. This post explains the concept clearly and when you actually need an agent vs a simpler approach.
-
Managing context window efficiently — windowed memory and summarization
Sending full conversation history on every request is expensive and eventually hits the context window limit. Windowed memory keeps only recent turns. Summarization condenses older history into a compact summary. This post shows both techniques and when to use each.
-
Persistent chat memory in Spring AI — survive restarts and scale horizontally
InMemoryChatMemory loses all conversations on restart and doesn't work across multiple application instances. JdbcChatMemory stores conversation history in PostgreSQL — persistent, durable, and load-balancer friendly.
-
Chat memory in Spring AI — building a chatbot that remembers
Spring AI's MessageChatMemoryAdvisor automatically injects conversation history into every LLM call and saves each turn back to a ChatMemory store. This post wires InMemoryChatMemory into the support assistant with per-session isolation.
-
Why LLMs forget everything — and what you must do about it
Every LLM call is stateless. The model has no memory of previous turns unless you explicitly provide them. This post explains why, what the context window limit means for conversations, and the three strategies for managing memory in AI applications.