Files
siemens_ragas/.superpowers/brainstorm/1625-1781595805/content/trigger-mode.html

69 lines
2.6 KiB
HTML
Raw Normal View History

<h2>优化顾问在什么情况下运行?</h2>
<p class="subtitle">这决定了模块与现有评测流程的集成方式</p>
<div class="options">
<div class="option" data-choice="a" onclick="toggleSelect(this)">
<div class="letter">A</div>
<div class="content">
<h3>每次评测自动运行</h3>
<p>run_scenario() 结束后自动调用,无需任何额外配置。</p>
<div class="pros-cons">
<div class="pros"><h4>优点</h4><ul>
<li>零感知,开箱即用</li>
<li>每次跑完都有建议报告</li>
</ul></div>
<div class="cons"><h4>缺点</h4><ul>
<li>每次都多一次 LLM 调用,不管是否需要</li>
<li>无法关闭</li>
</ul></div>
</div>
</div>
</div>
<div class="option" data-choice="b" onclick="toggleSelect(this)">
<div class="letter">B</div>
<div class="content">
<h3>YAML 场景中显式开启(推荐)</h3>
<p>在 scenario YAML 里加一行 <code>optimization_advisor: true</code>,默认关闭。</p>
<div class="mockup">
<div class="mockup-header">siemens-pdf-question-bank-online.yaml</div>
<div class="mockup-body" style="font-family:monospace;font-size:13px;line-height:1.8">
metrics:<br>
&nbsp;&nbsp;- faithfulness<br>
&nbsp;&nbsp;- noise_sensitivity<br>
&nbsp;&nbsp;...<br>
<span style="color:#4ade80;font-weight:bold">optimization_advisor: true # 新增</span>
</div>
</div>
<div class="pros-cons">
<div class="pros"><h4>优点</h4><ul>
<li>显式可见,按需开启</li>
<li>与现有 YAML 驱动风格一致</li>
<li>可为不同场景独立配置</li>
</ul></div>
<div class="cons"><h4>缺点</h4><ul>
<li>需要手动在 YAML 里加一行</li>
</ul></div>
</div>
</div>
</div>
<div class="option" data-choice="c" onclick="toggleSelect(this)">
<div class="letter">C</div>
<div class="content">
<h3>阈值触发(任一指标低于警戒线时自动激活)</h3>
<p>规则引擎先算,若发现有指标低于阈值则自动启动 LLM 分析;一切正常则跳过。</p>
<div class="pros-cons">
<div class="pros"><h4>优点</h4><ul>
<li>"有问题才报警",符合直觉</li>
<li>高分场景无额外成本</li>
</ul></div>
<div class="cons"><h4>缺点</h4><ul>
<li>阈值需要维护,不同场景可能不同</li>
<li>正常分数时无建议,但用户可能仍想看优化空间</li>
</ul></div>
</div>
</div>
</div>
</div>