文章/答案/技术大牛

发布

社区首页 >问答首页 >开放式AI健身房观测空间形状问题

问开放式AI健身房观测空间形状问题
EN

Stack Overflow用户

提问于 2022-10-07 16:25:45

回答 1查看 54关注 0票数 0

我试图解决一次和永远使用强化学习的Yatzee游戏。可悲的是，当我检查健身房是否符合稳定的基线时，我的观察空间的形状就会受到影响。所以，我在构造函数中放置了一个print语句，它告诉我我的观察空间的形状，只要我创建一个对象。

class YatzeeEnv{
    game_state = np.zeros(19, np.int32)

    def __init__(self):
        self.action_space = gym.spaces.Discrete(19)
        self.observation_space = gym.spaces.MultiDiscrete(19)

        for x in self.game_state_adresses:
            self.game_state[x] = -1
        self.reroll()
        self.game_state[self.reroll_state] = 0
        print(self.game_state.shape)
        print(self.observation_space.shape)
}

a = YatzeeEnv()

可悲的是，它的输出是

np array shape: (19,)
Observation space shape: ()

为什么会这样呢？我认为gym.spaces.MultiDiscrete(19)将观察空间定义为19个值的int数组。

python

numpy

reinforcement-learning

openai

回答 1

Stack Overflow用户

发布于 2022-10-07 20:29:27

从医生那里..。

This represents the cartesian product of arbitrary :class:`Discrete` spaces.
    It is useful to represent game controllers or keyboards where each key can be represented as a discrete action space.
    Note:
        Some environment wrappers assume a value of 0 always represents the NOOP action.
    e.g. Nintendo Game Controller - Can be conceptualized as 3 discrete action spaces:
    1. Arrow Keys: Discrete 5  - NOOP[0], UP[1], RIGHT[2], DOWN[3], LEFT[4]  - params: min: 0, max: 4
    2. Button A:   Discrete 2  - NOOP[0], Pressed[1] - params: min: 0, max: 1
    3. Button B:   Discrete 2  - NOOP[0], Pressed[1] - params: min: 0, max: 1
    It can be initialized as ``MultiDiscrete([ 5, 2, 2 ])`` such that a sample might be ``array([3, 1, 0])``.
    Although this feature is rarely used, :class:`MultiDiscrete` spaces may also have several axes
    if ``nvec`` has several axes:
    Example::
        >> d = MultiDiscrete(np.array([[1, 2], [3, 4]]))
        >> d.sample()
        array([[0, 0],
               [2, 3]])

如果您只有一个操作空间，则不必使用MultiDiscrete。或者使用MultiDiscrete(19)。

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/73990048

复制

相似问题

问开放式AI健身房观测空间形状问题
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问开放式AI健身房观测空间形状问题EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问开放式AI健身房观测空间形状问题
EN