如何确保每个批次都有带有所有标签的样品?例如,考虑情绪分析问题的标签正面和负面。
tokens = tokenizer.batch_encode_plus(text.tolist(),max_length = max_seq_len,pad_to_max_length=True,truncation=True, return_token_type_ids=False)
seq = torch.tensor(tokens['input_ids'])
mask = torch.tensor(tokens['attention_mask'])
y = torch.tensor(labels.tolist())
data = TensorDataset(seq, mask,y)
data_sampler = RandomSampler(data)
data_dataloader = DataLoader(data, sampler=data_sampler, batch_size=batch_size)我想要一批像
Batch-1 ['positive','positive','positive','negative']
Batch-2 ['negative','negative','positive','negative']每个批次都包含所有标签。
https://datascience.stackexchange.com/questions/103605
复制相似问题