1. 首页
  2. 人工智能
  3. 深度学习
  4. Contextual Bandit Learning with Predictable Rewards

Contextual Bandit Learning with Predictable Rewards

上传者: 2020-07-19 08:37:56上传 PDF文件 154.37KB 热度 21次
Contextual bandit learning is a reinforcement learning problemwhere the learner repeatedly receives a set of features (context), takes an action and receives a reward based on the action and context
下载地址
用户评论