问梯度下降算法
EN

Stack Overflow用户

提问于 2022-10-31 01:42:35

回答 1查看 19关注 0票数 0

我正在尝试Boyd的“凸优化”一书中的问题9.30。但出于某种原因我不能让回溯线搜索起作用。这是我的代码：

import numpy as np

n, m = 100, 200
A = np.random.randn(m, n)
a, b = 0.01, 0.5
gtol = 1e-3


def f(x):
    # return - np.sum(np.log(1-x*x)) - np.sum(np.log(1-A @ x))
    return -np.sum(np.log(1-A@x)) - np.sum(np.log(1+x)) - np.sum(np.log(1-x))


def G(x):
    # return 2*x/(1-x*x) + np.sum(A, 0)/np.sum(1-A@x)
    return A.T @ (1/(1-A@x)) - 1/(1-x) + 1/(1+x)


def feasible(x):
    return np.all(x*x<1) and np.all(A@x<1)


def step_size(x, g, a, b):
    # backtracking line search
    fx = f(x)
    dx = -g
    t = 1
    while True:
        if not feasible(x+t*dx):
            t *= b
        else:
            if f(x+t*dx) <= fx+a*t*g.T@dx:
                break
            t *= b
    return t


def stopping_condition(g):
    return np.linalg.norm(g, 2) < gtol


def gradient_descent(x, a, b):
    flist, xlist, tlist = [f(x)], [x],[np.nan]
    while True:
        g = G(x)
        if stopping_condition(g):
            break
        t = step_size(x, -g, a, b)
        x -= t * g
        print(f(x), t, np.linalg.norm(g, 2))
        flist.append(f(x)), xlist.append(x), tlist.append(t)
    return flist, xlist, tlist


fx, x, t = gradient_descent(np.zeros(n), a, b)

我看到f(x)和G(x)正确地计算了这个值。然而，step_size函数似乎并不收敛，而理论认为它应该收敛。我似乎搞不懂为什么这不管用。

gradient-descent

convex-optimization

回答 1

Stack Overflow用户

发布于 2022-10-31 02:18:10

好吧，我发现问题了！

我正在传递-g，然后再乘以一个-ive！

谢谢。

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/74258007

复制

相似问题

问梯度下降算法
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问梯度下降算法EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问梯度下降算法
EN