Tag: springboot
All the articles with the tag "springboot".
-
Multimodal AI in Spring AI — adding image understanding to your Java app
Multimodal models process images alongside text. A customer can send a photo of a damaged product, and the LLM reads both image and question together to answer. Spring AI's UserMessage API handles images via URL or base64 — this post shows both.
-
Running local AI models with Ollama and Spring AI — private, free, offline
Ollama runs open-weight LLMs and embedding models on your own machine. No API key, no data leaving your network, no per-token cost. This post shows how to swap Spring AI from OpenAI to Ollama with a profile switch — and where local models fall short.
-
Deployment and configuration best practices for AI-powered Spring Boot apps
Shipping an AI feature involves more than the code: API key management, environment-specific model routing, database schema for vector storage, feature flags for safe rollouts, and a pre-deploy checklist. This post covers the production-readiness concerns specific to AI applications.
-
Error handling for AI apps — rate limits, timeouts, and fallback strategies
LLM API calls fail in ways that normal service calls don't — rate limits, content policy rejections, context window overflows, and intermittent 503s. This post covers the error types, retry strategies, timeout configuration, and graceful fallbacks for production resilience.
-
Safety and guardrails for AI apps — protecting users and your system
AI applications face threats that traditional APIs do not — prompt injection, jailbreaks, off-topic responses, and toxic content generation. This post covers the practical guardrails every production AI application needs on both input and output.