下面是我尝试运行的一段代码:
import tensorflow as tf
a = tf.constant([[1, 2], [2, 3]], dtype=tf.float32)
b = tf.constant([[1, 2], [2, 3]], dtype=tf.float32)
with tf.GradientTape() as tape1, tf.GradientTape() as tape2:
tape1.watch(a)
tape2.watch(a)
c = a * b
grad1 = tape1.gradient(c, a)
grad2 = tape2.gradient(c[:, 0], a)
print(grad1)
print(grad2)这是输出:
tf.Tensor(
[[1. 2.]
[2. 3.]], shape=(2, 2), dtype=float32)
None正如您可以观察到的,tf.GradientTape()不能处理分片输出。有什么办法可以解决这个问题吗?
发布于 2020-08-03 16:38:37
是的,您对张量所做的一切都需要在磁带上下文中进行。你可以像这样相对容易地修复它:
import tensorflow as tf
a = tf.constant([[1, 2], [2, 3]], dtype=tf.float32)
b = tf.constant([[1, 2], [2, 3]], dtype=tf.float32)
with tf.GradientTape() as tape1, tf.GradientTape() as tape2:
tape1.watch(a)
tape2.watch(a)
c = a * b
c_sliced = c[:, 0]
grad1 = tape1.gradient(c, a)
grad2 = tape2.gradient(c_sliced, a)
print(grad1)
print(grad2)https://stackoverflow.com/questions/63225910
复制相似问题