Tag
autonomous agents
3 articles

Research/May 12
How Memory Shapes Autonomous LLM Agents
A survey of how memory is built, measured, and used in autonomous LLM agents, with a focus on design choices and open problems.

Research/Apr 16
LongCoT Benchmark: 2,500-Probl. Long-Horizon Reasoning
LongCoT is a 2,500-problem benchmark for measuring whether frontier models can sustain long, interdependent reasoning chains.

AI Agent/Apr 1
OpenClaw and the New Solo Builder Stack
One builder runs 8 orchestrators and 35 personas on a homelab, using OpenClaw to ship writing, research, and ops in parallel.