LazyAct: Lazy actor with dynamic state skip based on constrained MDP.

Deep reinforcement learning has achieved significant success in complex decision-making tasks. However, the high computational cost of policies based on deep neural networks restricts their practical application. Specifically, each decision made by an agent requires a complete neural network computa...

Full description

Saved in:
Bibliographic Details
Main Authors: Hongjie Zhang, Zhenyu Chen, Hourui Deng, Chaosheng Feng
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0318778
Tags: Add Tag
No Tags, Be the first to tag this record!