文章/答案/技术大牛

发布

社区首页 >问答首页 >使用tensorflow集线器模型和TensorFlow2.0作为后端创建keras自定义层时出现Variable_scope运行时错误

问使用tensorflow集线器模型和TensorFlow2.0作为后端创建keras自定义层时出现Variable_scope运行时错误
EN

Stack Overflow用户

提问于 2019-08-01 04:16:59

回答 2查看 3.2K关注 0票数 1

我试图通过将预先训练好的tf-hub elmo model集成到keras层中来使用它。

Keras层：

class ElmoEmbeddingLayer(tf.keras.layers.Layer):

    def __init__(self, **kwargs):
        super(ElmoEmbeddingLayer, self).__init__(**kwargs)
        self.dimensions = 1024
        self.trainable = True
        self.elmo = None

    def build(self, input_shape):
        url = 'https://tfhub.dev/google/elmo/2'

        self.elmo = hub.Module(url)
        self._trainable_weights += trainable_variables(
            scope="^{}_module/.*".format(self.name))
        super(ElmoEmbeddingLayer, self).build(input_shape)

    def call(self, x, mask=None):
        result = self.elmo(
            x,
            signature="default",
            as_dict=True)["elmo"]
        return result

    def compute_output_shape(self, input_shape):
        return input_shape[0], self.dimensions

当我运行代码时，我得到以下错误：

Traceback (most recent call last):
  File "D:/Google Drive/Licenta/Gemini/Emotion Analysis/nn/trainer/model.py", line 170, in <module>
    validation_steps=validation_dataset.size())
  File "D:/Google Drive/Licenta/Gemini/Emotion Analysis/nn/trainer/model.py", line 79, in train_gpu
    model = build_model(self.config, self.embeddings, self.sequence_len, self.out_classes, summary=True)
  File "D:\Google Drive\Licenta\Gemini\Emotion Analysis\nn\architectures\models.py", line 8, in build_model
    return my_model(embeddings, config, sequence_length, out_classes, summary)
  File "D:\Google Drive\Licenta\Gemini\Emotion Analysis\nn\architectures\models.py", line 66, in my_model
    inputs, embedding = resolve_inputs(embeddings, sequence_length, model_config, input_type)
  File "D:\Google Drive\Licenta\Gemini\Emotion Analysis\nn\architectures\models.py", line 19, in resolve_inputs
    return elmo_input(model_conf)
  File "D:\Google Drive\Licenta\Gemini\Emotion Analysis\nn\architectures\models.py", line 58, in elmo_input
    embedding = ElmoEmbeddingLayer()(input_text)
  File "D:\Apps\Anaconda\envs\tf2.0\lib\site-packages\tensorflow\python\keras\engine\base_layer.py", line 616, in __call__
    self._maybe_build(inputs)
  File "D:\Apps\Anaconda\envs\tf2.0\lib\site-packages\tensorflow\python\keras\engine\base_layer.py", line 1966, in _maybe_build
    self.build(input_shapes)
  File "D:\Google Drive\Licenta\Gemini\Emotion Analysis\nn\architectures\custom_layers.py", line 21, in build
    self.elmo = hub.Module(url)
  File "D:\Apps\Anaconda\envs\tf2.0\lib\site-packages\tensorflow_hub\module.py", line 156, in __init__
    abs_state_scope = _try_get_state_scope(name, mark_name_scope_used=False)
  File "D:\Apps\Anaconda\envs\tf2.0\lib\site-packages\tensorflow_hub\module.py", line 389, in _try_get_state_scope
    "name_scope was already taken." % abs_state_scope)
RuntimeError: variable_scope module/ was unused but the corresponding name_scope was already taken.

这似乎是由于急切的执行行为造成的。如果我禁用了急切执行，我必须在tensorflow会话中包围model.fit函数，并使用sess.run(global_variables_initializer())初始化变量，以避免下一个错误：

Traceback (most recent call last):
  File "D:/Google Drive/Licenta/Gemini/Emotion Analysis/nn/trainer/model.py", line 168, in <module>
    validation_steps=validation_dataset.size().eval(session=Session()))
  File "D:/Google Drive/Licenta/Gemini/Emotion Analysis/nn/trainer/model.py", line 90, in train_gpu
    class_weight=weighted)
  File "D:\Apps\Anaconda\envs\tf2.0\lib\site-packages\tensorflow\python\keras\engine\training.py", line 643, in fit
    use_multiprocessing=use_multiprocessing)
  File "D:\Apps\Anaconda\envs\tf2.0\lib\site-packages\tensorflow\python\keras\engine\training_arrays.py", line 664, in fit
    steps_name='steps_per_epoch')
  File "D:\Apps\Anaconda\envs\tf2.0\lib\site-packages\tensorflow\python\keras\engine\training_arrays.py", line 294, in model_iteration
    batch_outs = f(actual_inputs)
  File "D:\Apps\Anaconda\envs\tf2.0\lib\site-packages\tensorflow\python\keras\backend.py", line 3353, in __call__
    run_metadata=self.run_metadata)
  File "D:\Apps\Anaconda\envs\tf2.0\lib\site-packages\tensorflow\python\client\session.py", line 1458, in __call__
    run_metadata_ptr)
tensorflow.python.framework.errors_impl.FailedPreconditionError: 2 root error(s) found.
  (0) Failed precondition: Error while reading resource variable module/bilm/RNN_0/RNN/MultiRNNCell/Cell1/rnn/lstm_cell/bias from Container: localhost. This could mean that the variable was uninitialized. Not found: Resource localhost/module/bilm/RNN_0/RNN/MultiRNNCell/Cell1/rnn/lstm_cell/bias/class tensorflow::Var does not exist.
     [[{{node elmo_embedding_layer/module_apply_default/bilm/RNN_0/RNN/MultiRNNCell/Cell1/rnn/lstm_cell/bias/Read/ReadVariableOp}}]]
  (1) Failed precondition: Error while reading resource variable module/bilm/RNN_0/RNN/MultiRNNCell/Cell1/rnn/lstm_cell/bias from Container: localhost. This could mean that the variable was uninitialized. Not found: Resource localhost/module/bilm/RNN_0/RNN/MultiRNNCell/Cell1/rnn/lstm_cell/bias/class tensorflow::Var does not exist.
     [[{{node elmo_embedding_layer/module_apply_default/bilm/RNN_0/RNN/MultiRNNCell/Cell1/rnn/lstm_cell/bias/Read/ReadVariableOp}}]]
     [[metrics/f1_micro/Identity/_223]]
0 successful operations.
0 derived errors ignored.

我的解决方案是：

with Session() as sess:
    sess.run(global_variables_initializer())
    history = model.fit(self.train_data.repeat(),
                        epochs=self.config['epochs'],
                        validation_data=self.validation_data.repeat(),
                        steps_per_epoch=steps_per_epoch,
                        validation_steps=validation_steps,
                        callbacks=self.__callbacks(monitor_metric),
                        class_weight=weighted)

主要的问题是，是否有其他方法可以在keras自定义层中使用elmo tf-hub模块并训练我的模型。另一个问题是，我当前的解决方案是否没有影响训练性能或给出OOM GPU错误(我在具有较高批处理大小的几个时期后出现OOM错误，我发现这与会话未关闭或内存泄漏有关)。

keras

scope

keras-layer

tensorflow2.0

回答 2

Stack Overflow用户

发布于 2019-10-31 23:09:45

如果将模型包装在Session()字段中，则还必须将使用该模型的所有其他代码包装在Session()字段中。这需要花费大量的时间和精力。我有另一种处理方法:首先，创建一个elmo模块，向keras添加一个会话：

   elmo_model = hub.Module("https://tfhub.dev/google/elmo/3", trainable=True, 
   name='elmo_module')
   sess = tf.Session()
   sess.run(tf.global_variables_initializer())
   sess.run(tf.tables_initializer())
   K.set_session(sess)

而不是直接在ElmoEmbeddinglayer中创建elmo模块

  self.elmo = hub.Module(url)
  self._trainable_weights += trainable_variables(
            scope="^{}_module/.*".format(self.name))

您可以执行以下操作，我认为它可以正常工作！

  self.elmo = elmo_model
  self._trainable_weights += trainable_variables(
            scope="^elmo_module/.*")

票数 0

Stack Overflow用户

发布于 2019-11-27 19:19:33

以下是我在我的案例中使用的一个简单解决方案：

当我使用单独的python脚本创建模块时，这件事发生在我身上。

为了解决这个问题，我将主脚本中的tf.Session()传递给另一个脚本中的tf.keras.backend，方法是在调用层之前创建一个入口点来传递它。初始化

示例：

主文件：

import tensorflow.compat.v1 as tf
from ModuleFile import ModuleLayer

def __main__():
  init_args = [...]
  input = ...
  sess= tf.keras.backend.get_session()
  Module_layer.__init_session___(sess)
  module_layer = ModuleLayer(init_args)(input)

模块文件：

import tensorflow.compat.v1 as tf

class ModuleLayer(tf.keras.layers.Layer):

  @staticmethod
  def __init_session__(session):
    tf.keras.backend.set_session(session)

  def __init__(*args):
  ...

希望这能有所帮助:)

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/57298252

复制

相似问题

问使用tensorflow集线器模型和TensorFlow2.0作为后端创建keras自定义层时出现Variable_scope运行时错误
EN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问使用tensorflow集线器模型和TensorFlow2.0作为后端创建keras自定义层时出现Variable_scope运行时错误EN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问使用tensorflow集线器模型和TensorFlow2.0作为后端创建keras自定义层时出现Variable_scope运行时错误
EN