LangChain Unveils Agent Middleware for Custom AI Harness Development

Felix Pinkston
Mar 26, 2026 15:31

LangChain introduces AgentMiddleware system enabling developers to customize AI agent behavior with hooks for PII detection, dynamic tool selection, and production-ready features.





LangChain has released a comprehensive middleware system that lets developers customize AI agent behavior without rebuilding core infrastructure from scratch. The framework, detailed in a March 26, 2026 blog post, introduces six distinct hooks that intercept and modify agent execution at every stage of the loop.

The middleware architecture addresses a persistent pain point in agent development: the gap between demo-ready prototypes and production systems. While basic customization like swapping system prompts or adding tools has always been straightforward, modifying the fundamental agent loop—what happens before model calls, how tools execute, when humans intervene—required extensive custom code.

How the Hook System Works

AgentMiddleware exposes six intervention points:

- before_agent fires once at invocation, for loading memory or validating inputs.
- before_model runs before each LLM call, useful for trimming history or catching sensitive data.
- wrap_model_call handles caching, retries, and dynamic tool binding.
- wrap_tool_call does the same for tool execution.
- after_model slots in human-in-the-loop workflows.
- after_agent handles cleanup and notifications.

These middlewares stack: developers can layer multiple modifications without conflicts.
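The layering idea can be pictured with a toy sketch in plain Python. This is not LangChain's actual implementation; the ToyMiddleware class and compose helper are illustrative stand-ins for how wrap-style hooks nest around the model call:

```python
from functools import reduce

class ToyMiddleware:
    """Toy stand-in for a middleware with a wrap_model_call-style hook."""
    def __init__(self, name):
        self.name = name

    def wrap_model_call(self, call_next):
        # Return a new callable that runs around the inner call.
        def wrapped(prompt):
            return f"{self.name}({call_next(prompt)})"
        return wrapped

def compose(middlewares, base_call):
    # The first middleware in the list ends up outermost, mirroring
    # how stacked middleware layer around the underlying model call.
    return reduce(lambda inner, mw: mw.wrap_model_call(inner),
                  reversed(middlewares), base_call)

model = lambda prompt: f"model[{prompt}]"
stack = compose([ToyMiddleware("retry"), ToyMiddleware("cache")], model)
print(stack("hi"))  # retry(cache(model[hi]))
```

Each middleware only sees the callable beneath it, which is why layers can be added or removed independently.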


Built-in Solutions for Common Problems

LangChain ships prebuilt middleware for patterns that keep appearing across production deployments. PIIMiddleware implements before_model and after_model hooks to mask, redact, or hash personally identifiable information—critical for HIPAA compliance where you can’t simply prompt your way to legal safety.
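A minimal sketch of the masking idea behind a before_model hook, assuming simple regex detection (the patterns and the redact_pii helper are illustrative, not PIIMiddleware's internals):

```python
import re

# Illustrative patterns only; production PII detection needs far broader coverage.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+")
SSN = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")

def redact_pii(text):
    """before_model-style pass: mask PII before it ever reaches the LLM."""
    text = EMAIL.sub("[EMAIL]", text)
    return SSN.sub("[SSN]", text)

print(redact_pii("Contact jane@example.com, SSN 123-45-6789"))
# Contact [EMAIL], SSN [SSN]
```

Running the same pass in an after_model hook would catch PII the model echoes back in its output.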

LLMToolSelectorMiddleware tackles context bloat by running a fast model to identify relevant tools from a registry before the main call, binding only what’s needed. SummarizationMiddleware prevents context overflow by compressing message history when token counts climb too high.
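The tool-selection pre-pass can be sketched with a toy scorer. Here keyword overlap stands in for the fast model's relevance judgment; the select_tools function and registry are hypothetical, not LLMToolSelectorMiddleware's API:

```python
def select_tools(query, registry, top_k=2):
    """Toy pre-pass: score each tool's description against the query and
    bind only the best matches (a real version would ask a small LLM)."""
    q = set(query.lower().split())
    scored = sorted(registry.items(),
                    key=lambda kv: -len(q & set(kv[1].lower().split())))
    return [name for name, _ in scored[:top_k]]

registry = {
    "search_web": "search the web for pages",
    "get_weather": "fetch the current weather forecast",
    "send_email": "send an email message",
}
print(select_tools("what is the weather forecast", registry))
# ['get_weather', 'search_web']
```

The main model then sees only the selected tools, keeping its context small even when the registry holds dozens of entries.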

ModelRetryMiddleware wraps API calls with configurable retry logic—retry counts, backoff factors, initial delays for rate limiting. ShellToolMiddleware handles resource initialization and teardown around agent loops.
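The retry pattern is straightforward to sketch. This is a toy version of the behavior described for ModelRetryMiddleware, with the same knobs (retry count, backoff factor, initial delay); the with_retry helper is illustrative, not LangChain's API:

```python
import time

def with_retry(call, retries=3, backoff_factor=2.0, initial_delay=0.1):
    """Retry a flaky call with exponential backoff between attempts."""
    delay = initial_delay
    for attempt in range(retries + 1):
        try:
            return call()
        except Exception:
            if attempt == retries:
                raise  # out of retries: surface the last error
            time.sleep(delay)
            delay *= backoff_factor

# Flaky stand-in for an API call that fails twice, then succeeds.
attempts = {"n": 0}
def flaky():
    attempts["n"] += 1
    if attempts["n"] < 3:
        raise RuntimeError("rate limited")
    return "ok"

print(with_retry(flaky, initial_delay=0.01))  # ok
```

Wrapping this logic in a wrap_model_call-style hook keeps rate-limit handling out of application code entirely.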

Deep Agents as Proof of Concept

LangChain built Deep Agents entirely on this middleware stack to validate the architecture. The agent harness runs on create_agent, LangChain’s standard entry point, with specialized middleware layered on top: FilesystemMiddleware for file-based context management, SubagentMiddleware for context-isolated subagents, SkillsMiddleware for progressive capability disclosure.

The approach lets teams own different concerns independently. Business logic stays decoupled from core agent code. Reusable middleware can spread across organizations without tight coupling.

What This Means for Agent Development

LangChain acknowledges that improving model capabilities will eventually absorb some current middleware functions—summarization, tool selection, output trimming might move into models themselves. But deterministic policy enforcement, production guardrails, and business-specific logic won’t migrate. Those stay in the harness layer.

Developers can start with create_agent for minimal setups or create_deep_agent for more robust foundations. Custom middleware contributions are accepted through LangChain’s integration documentation.

Image source: Shutterstock


