Metrics on Yuan's Blog

Metrics on Yuan's Bloghttps://liyuan.org/tags/metrics/Recent content in Metrics on Yuan's BlogHugoen-usTue, 05 May 2026 00:00:00 +0000Picking Evaluation Metrics for a RAG Agent — Notes from the Trencheshttps://liyuan.org/posts/ai/rag-eval-metrics-selection/Tue, 05 May 2026 00:00:00 +0000https://liyuan.org/posts/ai/rag-eval-metrics-selection/This article outlines a pragmatic, tiered approach to evaluating Retrieval-Augmented Generation (RAG) agents, specifically within the context of complex financial document analysis (FinanceBench). The author argues that effective evaluation is not about maximizing the number of metrics, but about selecting signals that provide clear, actionable insights at different stages of the development lifecycle.