HIERARCHICAL VAE BASED SEMANTIC COMMUNICATIONS FOR POMDP TASKS

DOI:
10.60864/crxv-8t61
Citation Author(s):
Wenhui Hua
Submitted by:
Dezhao Chen
Last updated:
11 April 2024 - 4:48am
Document Type:
Poster
Document Year:
2024
Event:
Presenters:
Dezhao Chen
Paper Code:
MLSP-P31.4
 

The Partially Observable Markov Decision Process (POMDP) is a general framework for a wide range of control tasks, which can benefit from enabling semantic communications among different agents. Semantic communications aim to exchange compact messages that convey task-relevant information between agents. A critical problem in semantic communication is source representation learning, which is governed by a fundamental tradeoff between compactness and sufficiency. This tradeoff remains underinvestigated in the context of POMDPs. In this paper, we propose HVRL (Hierarchical Variational Autoencoders for Reinforcement Learning). Experiments show that our method effectively balances compactness and sufficiency, learning enough information for decision-making while mitigating the risk of over-abstraction in the observation space. The approach encodes the endogenous semantic information of the observation itself and achieves good sample efficiency and control performance.
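To make the compactness-sufficiency tradeoff concrete, below is a minimal sketch (not the authors' code) of a two-level hierarchical VAE that compresses an agent observation into a compact latent message. The two-level structure, layer sizes, latent dimensions, and beta weights are illustrative assumptions; the actual HVRL architecture and training objective in the poster may differ.

```python
# Minimal sketch, assuming a two-level hierarchical VAE over flat observations.
# All dimensions and beta weights are hypothetical choices for illustration.
import torch
import torch.nn as nn
import torch.nn.functional as F


class HierarchicalVAE(nn.Module):
    def __init__(self, obs_dim=64, z1_dim=16, z2_dim=4):
        super().__init__()
        # Bottom level: observation -> z1 (detailed latent)
        self.enc1 = nn.Sequential(nn.Linear(obs_dim, 128), nn.ReLU())
        self.mu1, self.logvar1 = nn.Linear(128, z1_dim), nn.Linear(128, z1_dim)
        # Top level: z1 -> z2 (compact latent that would be transmitted)
        self.enc2 = nn.Sequential(nn.Linear(z1_dim, 64), nn.ReLU())
        self.mu2, self.logvar2 = nn.Linear(64, z2_dim), nn.Linear(64, z2_dim)
        # Decoders reconstruct the observation back through the hierarchy
        self.dec2 = nn.Sequential(nn.Linear(z2_dim, 64), nn.ReLU(), nn.Linear(64, z1_dim))
        self.dec1 = nn.Sequential(nn.Linear(z1_dim, 128), nn.ReLU(), nn.Linear(128, obs_dim))

    @staticmethod
    def reparameterize(mu, logvar):
        return mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)

    def forward(self, obs):
        h1 = self.enc1(obs)
        mu1, logvar1 = self.mu1(h1), self.logvar1(h1)
        z1 = self.reparameterize(mu1, logvar1)
        h2 = self.enc2(z1)
        mu2, logvar2 = self.mu2(h2), self.logvar2(h2)
        z2 = self.reparameterize(mu2, logvar2)   # compact message
        z1_hat = self.dec2(z2)                   # reconstruct detailed latent
        obs_hat = self.dec1(z1_hat)              # reconstruct observation
        return obs_hat, (mu1, logvar1), (mu2, logvar2), z2


def loss_fn(obs, obs_hat, stats1, stats2, beta1=1.0, beta2=4.0):
    # Reconstruction keeps the latent "sufficient"; the KL terms (with a heavier
    # weight on the top level) push toward "compactness".
    recon = F.mse_loss(obs_hat, obs, reduction="mean")
    kl = lambda mu, lv: -0.5 * torch.mean(1 + lv - mu.pow(2) - lv.exp())
    return recon + beta1 * kl(*stats1) + beta2 * kl(*stats2)
```

In such a setup, only the top-level latent z2 would be exchanged between agents, while the reconstruction and KL terms regulate how much observation detail survives the compression; this is one plausible reading of how a hierarchical latent can guard against over-abstraction, not a statement of the paper's exact design.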
