首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >pyMC2贝叶斯A/B测试实例在pyMC3中的移植

pyMC2贝叶斯A/B测试实例在pyMC3中的移植
EN

Stack Overflow用户
提问于 2014-03-28 09:15:54
回答 3查看 1.9K关注 0票数 6

我正在努力学习pyMC 3,并且遇到了一些麻烦。由于pyMC3的教程有限,所以我在针对黑客的贝叶斯方法工作。我正在尝试将pyMC 2代码移植到贝叶斯A/B检验示例中的pyMC 3,但没有成功。据我所见,这个模型根本没有考虑到观测结果。

我不得不从这个示例中做一些更改,因为pyMC 3非常不同,所以应该是这样的:将pymc导入为pm

代码语言:javascript
复制
# The parameters are the bounds of the Uniform.
p = pm.Uniform('p', lower=0, upper=1)

# set constants
p_true = 0.05  # remember, this is unknown.
N = 1500

# sample N Bernoulli random variables from Ber(0.05).
# each random variable has a 0.05 chance of being a 1.
# this is the data-generation step
occurrences = pm.rbernoulli(p_true, N)

print occurrences  # Remember: Python treats True == 1, and False == 0
print occurrences.sum()

# Occurrences.mean is equal to n/N.
print "What is the observed frequency in Group A? %.4f" % occurrences.mean()
print "Does this equal the true frequency? %s" % (occurrences.mean() == p_true)

# include the observations, which are Bernoulli
obs = pm.Bernoulli("obs", p, value=occurrences, observed=True)

# To be explained in chapter 3
mcmc = pm.MCMC([p, obs])
mcmc.sample(18000, 1000)

figsize(12.5, 4)
plt.title("Posterior distribution of $p_A$, the true effectiveness of site A")
plt.vlines(p_true, 0, 90, linestyle="--", label="true $p_A$ (unknown)")
plt.hist(mcmc.trace("p")[:], bins=25, histtype="stepfilled", normed=True)
plt.legend()

相反,如下所示:

代码语言:javascript
复制
import pymc as pm

import random
import numpy as np
import matplotlib.pyplot as plt

with pm.Model() as model:
    # Prior is uniform: all cases are equally likely
    p = pm.Uniform('p', lower=0, upper=1)

    # set constants
    p_true = 0.05  # remember, this is unknown.
    N = 1500

    # sample N Bernoulli random variables from Ber(0.05).
    # each random variable has a 0.05 chance of being a 1.
    # this is the data-generation step
    occurrences = []  # pm.rbernoulli(p_true, N)
    for i in xrange(N):
        occurrences.append((random.uniform(0.0, 1.0) <= p_true))
    occurrences = np.array(occurrences)
    obs = pm.Bernoulli('obs', p_true, observed=occurrences)

    start = pm.find_MAP()
    step = pm.Metropolis()
    trace = pm.sample(18000, step, start)
    pm.traceplot(trace);
    plt.show()

对于冗长的文章表示歉意,但在我的改编中,出现了一些小的变化,比如手动生成观察结果,因为pm.rbernoulli已经不存在了。我也不确定是否应该在运行跟踪之前找到开始。如何将实现更改为正确运行?

EN

回答 3

Stack Overflow用户

回答已采纳

发布于 2014-03-30 09:25:51

你们真的很亲密。然而,这一行:

代码语言:javascript
复制
obs = pm.Bernoulli('obs', p_true, observed=occurrences)

是错误的,因为您只是为p设置一个常量值(p_true == 0.05)。因此,上面定义为具有一致先验的随机变量p不受可能性的限制,并且您的图显示您只是从先验抽样。如果您在代码中将p_true替换为p,那么它应该可以工作。以下是固定版本:

代码语言:javascript
复制
import pymc as pm

import random
import numpy as np
import matplotlib.pyplot as plt

with pm.Model() as model:
    # Prior is uniform: all cases are equally likely
    p = pm.Uniform('p', lower=0, upper=1)

    # set constants
    p_true = 0.05  # remember, this is unknown.
    N = 1500

    # sample N Bernoulli random variables from Ber(0.05).
    # each random variable has a 0.05 chance of being a 1.
    # this is the data-generation step
    occurrences = []  # pm.rbernoulli(p_true, N)
    for i in xrange(N):
        occurrences.append((random.uniform(0.0, 1.0) <= p_true))
    occurrences = np.array(occurrences)
    obs = pm.Bernoulli('obs', p, observed=occurrences)

    start = pm.find_MAP()
    step = pm.Metropolis()
    trace = pm.sample(18000, step, start)

pm.traceplot(trace);
票数 4
EN

Stack Overflow用户

发布于 2016-08-14 15:31:35

这对我有用。在开始建立模型之前,我生成了观察结果。

代码语言:javascript
复制
true_p_A = 0.05
true_p_B = 0.04
N_A = 1500
N_B = 750

obs_A = np.random.binomial(1, true_p_A, size=N_A)
obs_B = np.random.binomial(1, true_p_B, size=N_B)

with pm.Model() as ab_model:
    p_A = pm.Uniform('p_A', 0, 1)
    p_B = pm.Uniform('p_B', 0, 1)
    delta = pm.Deterministic('delta',p_A - p_B)
    obs_A = pm.Bernoulli('obs_A', p_A, observed=obs_A)
    osb_B = pm.Bernoulli('obs_B', p_B, observed=obs_B)

with ab_model:
    trace = pm.sample(2000)

pm.traceplot(trace)
票数 1
EN

Stack Overflow用户

发布于 2014-03-29 02:04:55

你非常接近--你只需要解开最后两行,这两行就产生了追踪图。您可以将绘制跟踪图看作是在完成采样后应该发生的诊断。以下几点对我来说是可行的:

代码语言:javascript
复制
import pymc as pm

import random
import numpy as np
import matplotlib.pyplot as plt

with pm.Model() as model:
    # Prior is uniform: all cases are equally likely
    p = pm.Uniform('p', lower=0, upper=1)

    # set constants
    p_true = 0.05  # remember, this is unknown.
    N = 1500

    # sample N Bernoulli random variables from Ber(0.05).
    # each random variable has a 0.05 chance of being a 1.
    # this is the data-generation step
    occurrences = []  # pm.rbernoulli(p_true, N)
    for i in xrange(N):
        occurrences.append((random.uniform(0.0, 1.0) <= p_true))
    occurrences = np.array(occurrences)
    obs = pm.Bernoulli('obs', p_true, observed=occurrences)

    start = pm.find_MAP()
    step = pm.Metropolis()
    trace = pm.sample(18000, step, start)

#Now plot
pm.traceplot(trace)
plt.show()
票数 0
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/22708513

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档