  • From column: 数据魔术师

    Reinforcement Learning Reading Notes (9) | On-policy Prediction with Approximation (Part 1)

    Stochastic-gradient Semi-gradient Methods ... Linear Methods ...

    1.1K21 Published 2019-10-09
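  • From column: 专知

    [Paper Recommendations] Six Recent Papers on Topic Models: Convergence Rates, Large Scale, Deep Topic Modeling, Optimization, Emotion Intensity, Generalized Dynamic Topic Models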

    inference (WHAI) for deep latent Dirichlet allocation, which infers posterior samples via a hybrid of stochastic-gradient

    1.2K40 Published 2018-04-08
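  • From column: CreateAMind

    Conjugate-Computation Variational Inference: Converting Variational Inference in Non-conjugate Models to Inference in Conjugate Models 1703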

    This will fix the issues of stochastic-gradient methods but maintain the computational efficiency of

    56010 Edited 2023-12-05
  • From column: 机器之心

    Resource | Second Edition of Richard Sutton's Classic Textbook "Reinforcement Learning" Released (with PDF Download)

    Prediction with Approximation 9.1 Value-function Approximation 9.2 The Prediction Objective (VE) 9.3 Stochastic-gradient

    10.7K90 Published 2018-05-10
  • From column: arXiv每日学术速递

    Statistics Academic Digest [7.12]

    regression, Newton's method, and Kalman filter, as well as modern deep-learning algorithms such as stochastic-gradient

    64460 Published 2021-07-27
  • From column: arXiv每日学术速递

    Statistics Academic Digest [12.10]

    for strongly log-concave distributions with non-i.i.d data and study how the injected noise and the stochastic-gradient

    96940 Edited 2021-12-10
  • From column: arXiv每日学术速递

    Machine Learning Academic Digest [8.17]

    range is defined as the Pareto front of the SMOO problem, which can then be efficiently computed using stochastic-gradient

    1.9K20 Published 2021-08-24