Long-running agents are agents that operate across extended timeframes rather than a single request-response turn. They may monitor systems, perform research, execute workflows, wait for events, or coordinate multi-step tasks over hours or days.
Long-running agents need stronger state management, observability, budget controls, policy checks, and interruption handling than short-lived agents. Evaluation also changes: success is often measured at the task or session level, not a single response.