首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >如何在tensorflow模型中打印转换后的特征值

如何在tensorflow模型中打印转换后的特征值
EN

Stack Overflow用户
提问于 2019-02-07 17:56:02
回答 1查看 154关注 0票数 1

我如何才能看到在tensorflow模型中训练的最终特征的价值。就像在下面的例子中,我试图对我的列'x‘进行多热处理,我想看看这些特性如何应用到我的模型中。

这在sklearn中很容易做到,但作为Tensorflow的新手,我不明白这是怎么可能的。

代码语言:javascript
复制
import tensorflow as tf
import pandas as pd

data = {'x':['a c', 'a b', 'b c'], 'y': [1, 1, 0]}

df = pd.DataFrame(data)
Y = df['y']
X = df.drop('y', axis=1)
indicator_features = [tf.feature_column.indicator_column(categorical_column=
      tf.feature_column.categorical_column_with_vocabulary_list(key = 'x', 
                                                 vocabulary_list = ['a','b','c']))]
model = tf.estimator.LinearClassifier(feature_columns=indicator_features,
                                                              model_dir = "/tmp/samplemodel")
training_input_fn = tf.estimator.inputs.pandas_input_fn(x = X,
                                                    y=Y,
                                                    batch_size=64,
                                                    shuffle= True,
                                                    num_epochs = None)

model.train(input_fn=training_input_fn,steps=1000)
EN

回答 1

Stack Overflow用户

发布于 2019-02-08 13:50:52

我已经能够通过在tensorflow中启用急切执行来打印值。在下面发布我的解决方案。也欢迎任何其他的想法。

代码语言:javascript
复制
import tensorflow as tf
import tensorflow.feature_column as fc 
import pandas as pd

PATH = "/tmp/sample.csv"

tf.enable_eager_execution()

COLUMNS = ['education','label']
train_df = pd.read_csv(PATH, header=None, names = COLUMNS)

#train_df['education'] = train_df['education'].str.split(" ")
def easy_input_function(df, label_key, num_epochs, shuffle, batch_size):
  label = df[label_key]
  ed = tf.string_split(df['education']," ")
  df['education'] = ed
  ds = tf.data.Dataset.from_tensor_slices((dict(df),label))
  if shuffle:
    ds = ds.shuffle(10000)
  ds = ds.batch(batch_size).repeat(num_epochs)
  return ds

ds = easy_input_function(train_df, label_key='label', num_epochs=5, shuffle=False, batch_size=5)


for feature_batch, label_batch in ds.take(1):
  print('Some feature keys:', list(feature_batch.keys())[:5])
  print()
  print('A batch of education  :', feature_batch['education'])
  print()
  print('A batch of Labels:', label_batch )
  print(feature_batch)

education_vocabulary_list = [
    'Bachelors', 'HS-grad', '11th', 'Masters', '9th', 'Some-college',
    'Assoc-acdm', 'Assoc-voc', '7th-8th', 'Doctorate', 'Prof-school',
    '5th-6th', '10th', '1st-4th', 'Preschool', '12th']  
education = tf.feature_column.categorical_column_with_vocabulary_list('education', vocabulary_list=education_vocabulary_list)

fc.input_layer(feature_batch, [fc.indicator_column(education)])
票数 0
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/54570626

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档