文章/答案/技术大牛

发布

社区首页 >问答首页 >pytorch中3D语义分割的示例CrossEntropyLoss

问pytorch中3D语义分割的示例CrossEntropyLoss
EN

Stack Overflow用户

提问于 2017-12-08 21:48:56

回答 2查看 2.4K关注 0票数 2

我有一个在5D输入张量上执行3D卷积的网络。我的网络if的输出大小为(1，12，60，36，60)，对应于( BatchSize，NumClasses，x-dim，y-dim，z-dim)。我需要计算体素的交叉熵损失。然而，我一直收到错误。

当尝试使用torch.nn.CrossEntropyLoss()计算交叉熵损失时，我一直收到以下错误消息：

RuntimeError: multi-target not supported at .../src/THCUNN/generic/ClassNLLCriterion.cu:16

下面是我的代码摘录：

import torch
import torch.nn as nn
from torch.autograd import Variable
criterion = torch.nn.CrossEntropyLoss()
images = Variable(torch.randn(1, 12, 60, 36, 60)).cuda()
labels = Variable(torch.zeros(1, 12, 60, 36, 60).random_(2)).long().cuda()
loss = criterion(images.view(1,-1), labels.view(1,-1))

当我为标签创建一个热张量时，也会发生同样的情况：

nclasses = 12
labels = (np.random.randint(0,12,(1,60,36,60))) # Random labels with values between [0..11]
labels = (np.arange(nclasses) == labels[..., None] - 1).astype(int) # Converts labels to one_hot_tensor
a = np.transpose(labels,(0,4,3,2,1)) #  Reorder dimensions to match shape of "images" ([1, 12, 60, 36, 60])
b = Variable(torch.from_numpy(a)).cuda()
loss = criterion(images.view(1,-1), b.view(1,-1))

知道我做错了什么吗？有人能提供一个在5D输出张量上计算交叉熵的例子吗？

pytorch

回答 2

Stack Overflow用户

发布于 2017-12-09 19:00:41

我刚刚检查了一些2D语义分割的实现(fcn)，并尝试将其应用于3D语义分割。不能保证这是正确的，我将不得不再次检查...

 import torch
 import torch.nn.functional as F
 def cross_entropy3d(input, target, weight=None, size_average=True):
    # input: (n, c, h, w, z), target: (n, h, w, z)
    n, c, h, w , z = input.size()
    # log_p: (n, c, h, w, z)
    log_p = F.log_softmax(input, dim=1)
    # log_p: (n*h*w*z, c)
    log_p = log_p.permute(0, 4, 3, 2, 1).contiguous().view(-1, c) # make class dimension last dimension
    log_p = log_p[target.view(n, h, w, z, 1).repeat(1, 1, 1, 1, c) >= 0] # this looks wrong -> Should rather be a one-hot vector
    log_p = log_p.view(-1, c)
    # target: (n*h*w*z,)
    mask = target >= 0
    target = target[mask]
    loss = F.nll_loss(log_p, target.view(-1), weight=weight, size_average=False)
    if size_average:
        loss /= mask.data.sum()
    return loss
images = Variable(torch.randn(5, 3, 16, 16, 16))
labels = Variable(torch.LongTensor(5, 16, 16, 16).random_(3))
cross_entropy3d(images, labels, weight=None, size_average=True)

票数 1

Stack Overflow用户

发布于 2017-12-09 07:31:48

docs解释了这种行为(底线是，它看起来实际上是在计算稀疏交叉熵损失，因此不需要输出的所有维度的目标，而只需要所需维度的索引)……他们特别声明：

Input: (N,C), where C = number of classes
Target: (N), where each value is 0 <= targets[i] <= C-1
Output: scalar. If reduce is False, then (N) instead.

我不确定您的用例，但您可能希望使用KL Divergence或Binary Cross Entropy Loss。两者都是在相同大小的输入和目标上定义的。

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/47715696

复制

相似问题

问pytorch中3D语义分割的示例CrossEntropyLoss
EN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问pytorch中3D语义分割的示例CrossEntropyLossEN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问pytorch中3D语义分割的示例CrossEntropyLoss
EN