我正在使用深入学习与PyTorch做一些图像分类。每当我试图训练我的模型时,函数转发就失败了。有人能解释一下为什么输入的大小是错误的,以及如何解决这个问题吗?
这是我的模型的代码,以及我的训练损失和优化器:
model.fc = nn.Sequential(
nn.Conv2d(1, 6, 9, padding=0), # 64040
nn.ReLU(), #
nn.AvgPool2d(2, stride=2), # max 62020
nn.Conv2d(6, 16, 11, padding=0), # 161010
nn.ReLU(), # 161010
nn.AvgPool2d(2, stride=2), # 1655 = 400
nn.Flatten(),
nn.Linear(400, 200),
nn.ReLU(),
nn.Linear(200, 100),
nn.ReLU(),
nn.Linear(100, 3),
nn.LogSoftmax(dim=1))
criterion = nn.NLLLoss()
optimizer = optim.Adam(model.fc.parameters(), lr=0.003)
model.to(device)这是验证功能:
def validation(model, val_dataloader, criterion):
val_loss = 0
accuracy = 0
for images, labels in iter(val_dataloader):
images, labels = images.to('cuda'), labels.to('cuda')
output = model.forward(images)
val_loss += criterion(output, labels).item()
probabilities = torch.exp(output)
equality = (labels.data == probabilities.max(dim=1)[1])
accuracy += equality.type(torch.FloatTensor).mean()
return val_loss, accuracy最后,这是我的培训职能:
def train_classifier():
epochs = 10
steps = 0
print_every = 40
model.to('cuda')
for e in range(epochs):
model.train()
running_loss = 0
for images, labels in iter(train_dataloader):
images = images.view(images.shape[0], -1) #this flattens it?
steps += 1
images, labels = images.to('cuda'), labels.to('cuda')
optimizer.zero_grad()
# training
output = model.forward(images)
loss = criterion(output, labels)
loss.backward()
optimizer.step()
running_loss += loss.item()
if steps % print_every == 0:
model.eval()
# Turn off gradients for validation, saves memory and computations
with torch.no_grad():
validation_loss, accuracy = validation(model, validate_loader, criterion)
print("Epoch: {}/{}.. ".format(e+1, epochs),
"Training Loss: {:.3f}.. ".format(running_loss/print_every),
"Validation Loss: {:.3f}.. ".format(validation_loss/len(validate_loader)),
"Validation Accuracy: {:.3f}".format(accuracy/len(validate_loader)))
running_loss = 0
model.train()
train_classifier()错误码:
---------------------------------------------------------------------------
RuntimeError Traceback (most recent call last)
<ipython-input-32-60a435d940e1> in <module>()
49 model.train()
50
---> 51 train_classifier()
5 frames
/usr/local/lib/python3.7/dist-packages/torch/nn/modules/conv.py in _conv_forward(self, input, weight, bias)
438 _pair(0), self.dilation, self.groups)
439 return F.conv2d(input, weight, bias, self.stride,
--> 440 self.padding, self.dilation, self.groups)
441
442 def forward(self, input: Tensor) -> Tensor:
RuntimeError: Expected 4-dimensional input for 4-dimensional weight [64, 3, 7, 7], but got 2-dimensional input of size [32, 1728] instead任何帮助都是非常感谢的!谢谢!
发布于 2021-10-13 05:07:21
您的网络既有Conv2d层,也有完全连接的Linear层--这就是问题的根源:Conv2d期望其输入为4D:批处理通道高度宽度。另一方面,nn.Linear作品的“扁平”特性:批处理通道。因此,您需要“平整”您的数据,但不是在应用网络之前,而是在它的处理过程中,在其中您有一个Flattening层。
https://stackoverflow.com/questions/69549282
复制相似问题