
RL Driven

Overview

rl_driven is a selection algorithm for online exploration and personalization.

It corresponds to config/algorithm/selection/rl-driven.yaml.

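If the router should keep learning from traffic rather than freeze the current winner, a static selector becomes a bottleneck. rl_driven exposes an exploration-based policy for these scenarios.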

Use it in routing.decisions[].algorithm:

algorithm:
  type: rl_driven
  rl_driven:
    exploration_rate: 0.15
    use_thompson_sampling: true
    enable_personalization: true
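
To give an intuition for what the three options above might control, here is a minimal Python sketch of an exploration-based selector. It is not the router's implementation: the class `BanditSelector`, its methods, and the way the options interact are all assumptions made for illustration. In this sketch, `exploration_rate` is the probability of picking a uniformly random model, `use_thompson_sampling` switches between sampling from a Beta posterior and picking the best observed mean, and `enable_personalization` keeps separate statistics per user.

```python
import random
from collections import defaultdict


class BanditSelector:
    """Hypothetical sketch of an exploration-based model selector."""

    def __init__(self, models, exploration_rate=0.15,
                 use_thompson_sampling=True, enable_personalization=True,
                 seed=None):
        self.models = list(models)
        self.exploration_rate = exploration_rate
        self.use_thompson_sampling = use_thompson_sampling
        self.enable_personalization = enable_personalization
        self.rng = random.Random(seed)
        # (scope, model) -> [alpha, beta], starting from a uniform Beta(1, 1) prior.
        self.stats = defaultdict(lambda: [1.0, 1.0])

    def _scope(self, user_id):
        # Assumption: personalization means one set of statistics per user.
        return user_id if self.enable_personalization else "__global__"

    def select(self, user_id=None):
        scope = self._scope(user_id)
        # Assumption: with probability exploration_rate, pick a random model.
        if self.rng.random() < self.exploration_rate:
            return self.rng.choice(self.models)
        if self.use_thompson_sampling:
            # Draw a plausible success rate per model and take the highest draw.
            return max(
                self.models,
                key=lambda m: self.rng.betavariate(*self.stats[(scope, m)]),
            )
        # Otherwise exploit the best posterior mean observed so far.
        return max(
            self.models,
            key=lambda m: self.stats[(scope, m)][0] / sum(self.stats[(scope, m)]),
        )

    def update(self, model, success, user_id=None):
        # Record feedback: a success increments alpha, a failure increments beta.
        scope = self._scope(user_id)
        self.stats[(scope, model)][0 if success else 1] += 1.0


selector = BanditSelector(["model-a", "model-b"], seed=0)
choice = selector.select(user_id="alice")
selector.update(choice, success=True, user_id="alice")
```

Unlike a static selector, each call to `update` shifts future choices, so a model that starts winning for a given user is chosen more often for that user while the others still receive occasional traffic.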