准确性与给定答案

On this page

代码

在此示例中，代理不会被执行，但给定的结果将根据预期输出来评估正确性。

代码

from typing import Optional

from agno.eval.accuracy import AccuracyEval, AccuracyResult
from agno.models.openai import OpenAIChat

evaluation = AccuracyEval(
    model=OpenAIChat(id="o4-mini"),
    input="What is 10*5 then to the power of 2? do it step by step",
    expected_output="2500",
    num_iterations=1,
)
result_with_given_answer: Optional[AccuracyResult] = evaluation.run_with_output(
    output="2500", print_results=True
)
assert result_with_given_answer is not None and result_with_given_answer.avg_score >= 8

简单准确性使用工具的准确性

示例

代理概念

模型

准确性与给定答案

代码

示例

代理概念

模型

​代码

代码