Edge of Context
Practical AI engineering by Slava Dubrov. I write about the parts of AI systems that have to survive production: agent runtimes, memory, security, retrieval, evaluation, LLM infrastructure, and the developer tooling around them.
Start here
- Agent architecture: reasoning loops, memory, tool use, security, and long-running runtimes.
- Retrieval and evaluation: RAG evaluation, search ranking, and context engineering.
- LLM infrastructure: fine-tuning, vLLM structured outputs, LoRAX serving, and LLM engineering concepts.
- Developer tooling: uv on macOS, pyproject.toml, and local LLMs on macOS.
Browse by topic
The blog index lists every post and can be filtered by topic. The Agents 101 series walks through the agentic stack part by part. For author background, conference talks, and topic coverage, see About Me.