Only three AI models finished above starting capital in a 500-day startup survival test
Researchers at Princeton University developed a benchmark called CEO-Bench, designed to evaluate the performance of AI agents in managing a fictional software company over a span of 500 simulated days....