首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >tensorflow:在fn中创建tf.map_fn返回值错误的变量

tensorflow:在fn中创建tf.map_fn返回值错误的变量
EN

Stack Overflow用户
提问于 2017-08-21 05:25:22
回答 1查看 2.4K关注 0票数 0

我对map_fn中的变量初始化有疑问。

我试图在张量中的每个单独元素上分别应用一些高速公路层,所以我认为map_fn可能是最好的方法。

代码语言:javascript
复制
segment_list = tf.reshape(raw_segment_embedding,[batch_size*seqlen,embed_dim])
segment_embedding = tf.map_fn(lambda x: stack_highways(x, hparams), segment_list)

现在的问题是我的fn,即stack_highways,创建变量,由于某种原因,tensorflow无法初始化这些变量并给出错误。

代码语言:javascript
复制
W = tf.Variable(tf.truncated_normal(W_shape, stddev=0.1), name='weight')

ValueError: Initializer for variable body/model/parallel_0/body/map/while/highway_layer0/weight/ is from inside a control-flow construct, such as a loop or conditional. When creating a variable inside a loop or conditional, use a lambda as the initializer. 

我现在非常无知,基于错误,我想它不是关于范围的,但我不知道如何使用lambda作为初始化器(我甚至不知道这到底是什么意思)。以下是stack_highways的实现,如有任何建议将不胜感激。

代码语言:javascript
复制
def weight_bias(W_shape, b_shape, bias_init=0.1):
  """Fully connected highway layer adopted from 
     https://github.com/fomorians/highway-fcn/blob/master/main.py
  """
  W = tf.Variable(tf.truncated_normal(W_shape, stddev=0.1), name='weight')
  b = tf.Variable(tf.constant(bias_init, shape=b_shape), name='bias')
  return W, b




def highway_layer(x, size, activation, carry_bias=-1.0):
  """Fully connected highway layer adopted from 
     https://github.com/fomorians/highway-fcn/blob/master/main.py
  """
  W, b = weight_bias([size, size], [size])
  with tf.name_scope('transform_gate'):
    W_T, b_T = weight_bias([size, size], bias_init=carry_bias)


    H = activation(tf.matmul(x, W) + b, name='activation')
    T = tf.sigmoid(tf.matmul(x, W_T) + b_T, name='transform_gate')
    C = tf.sub(1.0, T, name="carry_gate")


    y = tf.add(tf.mul(H, T), tf.mul(x, C), name='y') # y = (H * T) + (x * C)
    return y




def stack_highways(x, hparams):
  """Create highway networks, this would not create
  a padding layer in the bottom and the top, it would 
  just be layers of highways.


  Args:
    x: a raw_segment_embedding
    hparams: run hyperparameters


  Returns:
    y: a segment_embedding
  """
  highway_size = hparams.highway_size
  activation = hparams.highway_activation #tf.nn.relu
  carry_bias_init = hparams.highway_carry_bias
  prev_y = None
  y = None
  for i in range(highway_size):
    with tf.name_scope("highway_layer{}".format(i)) as scope:
      if i == 0: # first, input layer
        prev_y = highway_layer(x, highway_size, activation, carry_bias=carry_bias_init)
      elif i == highways - 1: # last, output layer
        y = highway_layer(prev_y, highway_size, activation, carry_bias=carry_bias_init)
      else: # hidden layers
        prev_y = highway_layer(prev_y, highway_size, activation, carry_bias=carry_bias_init)
  return y

最热烈的问候,科尔曼

EN

回答 1

Stack Overflow用户

回答已采纳

发布于 2017-08-24 17:28:52

TensorFlow提供了两种初始化变量的主要方法:

  1. "lambda“初始化器:返回初始化值的可调用项。TF提供了许多打包良好的几个
  2. 按张量值初始化:这是您目前正在使用的。

错误消息是,在使用while_loop ( map_fn内部调用的)变量时,需要使用第一种类型的初始化器。(一般来说,lambda初始化器对我来说似乎更健壮。)

另外,在过去,当在控制流中使用变量时,变量似乎比tf.Variable更可取。

因此,我怀疑您可以通过将weight_bias函数修复为如下所示来解决您的问题:

代码语言:javascript
复制
def weight_bias(W_shape, b_shape, bias_init=0.1):
  """Fully connected highway layer adopted from 
     https://github.com/fomorians/highway-fcn/blob/master/main.py
  """
  W = tf.get_variable("weight", shape=W_shape,
          initializer=tf.truncated_normal_initializer(stddev=0.1))
  b = tf.get_variable("bias", shape=b_shape,
          initializer=tf.constant_inititializer(bias_init))
  return W, b

希望这能帮上忙!

票数 5
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/45789822

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档