我有1500个RGB文件(.jpg)和1500个特征地图值(.npy)。我想把它们用作我的深度学习项目的数据集。我使用的是tensorflow 1.12。
我使用tf.Example将它们写入到.tfrecords文件中。下面是我用tf.data导入这个文件的代码(感谢乌代的评论)。
import tensorflow as tf
import numpy as np
import pdb
IMAGE_HEIGHT = 228
IMAGE_WIDTH = 304
def tfdata_generator(tfrname, is_training, batch_size):
'''Construct a data generator using tf.Dataset'''
## You can write your own parse function
def parse_function(example):
features = tf.parse_single_example(example, features={
'image_raw': tf.FixedLenFeature([], tf.string, default_value=""),
'hint_raw': tf.FixedLenFeature([], tf.string, default_value="")
})
image = features['image_raw']
hint = features['hint_raw']
image = tf.decode_raw(image, tf.uint8)
image = tf.cast(image, tf.float32)
image = tf.reshape(image, [IMAGE_HEIGHT, IMAGE_WIDTH, 3])
hint = tf.decode_raw(hint, tf.uint8)
hint = tf.cast(hint, tf.float32)
hint = tf.reshape(hint, [8, 10, 1024])
return image, hint
dataset = tf.data.TFRecordDataset(tfrname)
#pdb.set_trace()
if is_training:
dataset = dataset.shuffle(100) # depends on sample size
#pdb.set_trace()
# Transform and batch data at the same time
dataset = dataset.apply(tf.data.experimental.map_and_batch(parse_function,
8, num_parallel_batches=4)) # cpu cores
dataset = dataset.repeat(-1)
dataset = dataset.prefetch(2)
return dataset我将batch_size设置为8。但是当我进行调试时,数据集的形状是
((?, 228, 304, 3), (?, 8, 10, 1024)), types: (tf.float32, tf.float32)这是正确的吗?这段代码是错误的吗?或者我在写tfrecords的时候出错了?
发布于 2019-01-11 18:31:10
您可以使用如下代码,
def tfdata_generator(images, labels, is_training, batch_size=32):
'''Construct a data generator using tf.Dataset'''
## You can write your own parse function
def parse_function(filename, label):
image_string = tf.read_file(filename)
image = tf.image.decode_jpeg(image_string)
image = tf.image.convert_image_dtype(image, tf.float32)
y = tf.one_hot(tf.cast(label, tf.uint8), 16)
return image, y
dataset = tf.data.Dataset.from_tensor_slices((images, labels))
if is_training:
dataset = dataset.shuffle(1000) # depends on sample size
# Transform and batch data at the same time
dataset = dataset.apply(tf.data.experimental.map_and_batch( parse_function,
batch_size,num_parallel_batches=6, # cpu cores
drop_remainder=True if is_training else False))
dataset = dataset.repeat()
dataset = dataset.prefetch(no_of_prefetch_needed)
return datasethttps://stackoverflow.com/questions/54143365
复制相似问题