Controlling Partially Observed Industrial System Based on Offline Reinforcement Learning—A Case Study of Paste Thickener
Published in IEEE Transactions on Industrial Informatics, 2025
This article proposes an offline-data-driven control strategy for paste thickening systems, addressing the challenges of high process complexity, partial observability, and environmental noise in mineral processing operations. By leveraging offline reinforcement learning, the approach eliminates the risks associated with unsafe online exploration while enabling data-efficient policy training. Operational trajectories are collected using a proportional–integral–derivative controller informed by domain knowledge, forming a reliable offline dataset. To handle constrained observation spaces, a novel algorithm—temporal batch-constrained Q-learning (TBCQ)—is developed for partially observed decision processes. Evaluations in both simulation and a real industrial paste thickener in a copper mine demonstrate improved control performance and significant tracking error reduction.
Use Google Scholar for full citation
Recommended citation: Zhaolin Yuan, ZiXuan Zhang, Xiaorui Li, Yunduan Cui, Ming Li, Xiaojuan Ban, "Controlling Partially Observed Industrial System Based on Offline Reinforcement Learning—A Case Study of Paste Thickener." IEEE Transactions on Industrial Informatics, 2025.
