首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >用PyTorch进行深度学习:对4维权重的期望四维输入[64,3,7,7],但却得到了大小为[32,1728]的二维输入。

用PyTorch进行深度学习:对4维权重的期望四维输入[64,3,7,7],但却得到了大小为[32,1728]的二维输入。
EN

Stack Overflow用户
提问于 2021-10-13 03:08:56
回答 1查看 1.1K关注 0票数 0

我正在使用深入学习与PyTorch做一些图像分类。每当我试图训练我的模型时,函数转发就失败了。有人能解释一下为什么输入的大小是错误的,以及如何解决这个问题吗?

这是我的模型的代码,以及我的训练损失和优化器:

代码语言:javascript
复制
model.fc = nn.Sequential(
        nn.Conv2d(1, 6, 9, padding=0),  # 64040
        nn.ReLU(),  #
        nn.AvgPool2d(2, stride=2),  # max 62020
        nn.Conv2d(6, 16, 11, padding=0),  # 161010
        nn.ReLU(),  # 161010
        nn.AvgPool2d(2, stride=2),  # 1655 = 400
        nn.Flatten(),
        nn.Linear(400, 200),
        nn.ReLU(),
        nn.Linear(200, 100),
        nn.ReLU(),
        nn.Linear(100, 3), 
        nn.LogSoftmax(dim=1))

criterion = nn.NLLLoss()
optimizer = optim.Adam(model.fc.parameters(), lr=0.003)
model.to(device)

这是验证功能:

代码语言:javascript
复制
def validation(model, val_dataloader, criterion):

val_loss = 0
accuracy = 0

for images, labels in iter(val_dataloader):

    images, labels = images.to('cuda'), labels.to('cuda')

    output = model.forward(images)
    val_loss += criterion(output, labels).item()

    probabilities = torch.exp(output)
    
    equality = (labels.data == probabilities.max(dim=1)[1])
    accuracy += equality.type(torch.FloatTensor).mean()

return val_loss, accuracy

最后,这是我的培训职能:

代码语言:javascript
复制
def train_classifier():
  epochs = 10
  steps = 0
  print_every = 40

  model.to('cuda')

  for e in range(epochs):
  
      model.train()

      running_loss = 0

      for images, labels in iter(train_dataloader):
        images = images.view(images.shape[0], -1)  #this flattens it?
        
        steps += 1

        images, labels = images.to('cuda'), labels.to('cuda')

        optimizer.zero_grad()

        # training 
        output = model.forward(images)
        loss = criterion(output, labels)
        loss.backward()
        optimizer.step()

        running_loss += loss.item()
  
        if steps % print_every == 0:
        
            model.eval()
        
            # Turn off gradients for validation, saves memory and computations
            with torch.no_grad():
                validation_loss, accuracy = validation(model, validate_loader, criterion)
    
            print("Epoch: {}/{}.. ".format(e+1, epochs),
                  "Training Loss: {:.3f}.. ".format(running_loss/print_every),
                  "Validation Loss: {:.3f}.. ".format(validation_loss/len(validate_loader)),
                  "Validation Accuracy: {:.3f}".format(accuracy/len(validate_loader)))
    
            running_loss = 0
            model.train()
                    
train_classifier()

错误码:

代码语言:javascript
复制
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
<ipython-input-32-60a435d940e1> in <module>()
     49             model.train()
     50 
---> 51 train_classifier()

5 frames
/usr/local/lib/python3.7/dist-packages/torch/nn/modules/conv.py in _conv_forward(self, input, weight, bias)
    438                             _pair(0), self.dilation, self.groups)
    439         return F.conv2d(input, weight, bias, self.stride,
--> 440                         self.padding, self.dilation, self.groups)
    441 
    442     def forward(self, input: Tensor) -> Tensor:

RuntimeError: Expected 4-dimensional input for 4-dimensional weight [64, 3, 7, 7], but got 2-dimensional input of size [32, 1728] instead

任何帮助都是非常感谢的!谢谢!

EN

回答 1

Stack Overflow用户

回答已采纳

发布于 2021-10-13 05:07:21

您的网络既有Conv2d层,也有完全连接的Linear层--这就是问题的根源:Conv2d期望其输入为4D:批处理通道高度宽度。另一方面,nn.Linear作品的“扁平”特性:批处理通道。因此,您需要“平整”您的数据,但不是在应用网络之前,而是在它的处理过程中,在其中您有一个Flattening层。

票数 0
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/69549282

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档