Hedge-Bench: Benchmarking Agents on Hard, Realistic Tasks Pertaining to Financial Reasoning
Problem: Current AI benchmarks for financial reasoning focus primarily on mechanical tasks, such as document retrieval and formula calculation, and neglect the more complex, open-ended reasoning required in expert analyst roles....