AI Systems Engineer
Arunkumar
Building agent-based applications for startups. Writing about systems architecture, agent execution, memory systems, and observability.
Remote, India
Recent Articles
All articles ·
Implementation
Observability for AI Agent Systems
Why traditional APM tools fall short for agent workloads, and how to build observability that captures token usage, decision traces, and tool call latencies.
observability agents operations
·
Architectural
Memory Systems for Long-Running Agents
Exploring working memory, episodic memory, and semantic memory architectures for agents that need to maintain context across hundreds of interactions.
agents memory architecture
·
Implementation
Building Reliable Agent Loops: Retry, Backoff, and Circuit Breakers
How to design agent execution loops that handle failures gracefully — covering retry strategies, exponential backoff, circuit breakers, and graceful degradation patterns for production AI systems.
agents reliability systems-design