Tag
agent evaluation
2 articles

Research/May 12
How Memory Shapes Autonomous LLM Agents
A survey of how memory is built, measured, and used in autonomous LLM agents, with a focus on design choices and open problems.

AI Agent/Apr 3
Hermes Agent: The Agent Harness Framework to Watch
Hermes Agent aims to make agent testing and orchestration easier, with tool use, evals, and workflow control in one stack.