  • From the column arXiv Daily Academic Digest (arXiv每日学术速递)

    Artificial Intelligence Academic Digest [6.22]

    directly estimates the stationary distribution corrections of the optimal policy and does not rely on policy-gradients

    1.7K · 10 · Published 2021-07-02
  • From the column arXiv Daily Academic Digest (arXiv每日学术速递)

    Machine Learning Academic Digest [6.22]

    directly estimates the stationary distribution corrections of the optimal policy and does not rely on policy-gradients

    2.2K · 30 · Published 2021-07-02