menu
dart_agent_core package
documentation
eval.dart
SaturationThresholds
brokenTaskPassRate property
brokenTaskPassRate property
dark_mode
light_mode
brokenTaskPassRate
property
double
brokenTaskPassRate
final
pass@1 ≤ 该阈值视为"几乎不能解"——往往是任务定义有 bug。
Implementation
final double brokenTaskPassRate;
dart_agent_core package
documentation
eval
SaturationThresholds
brokenTaskPassRate property
SaturationThresholds class