Special Session 16: Recent Development of Stochastic Optimal Control and Differential Games

Discrete-Time Mean-Variance Strategy Based on Reinforcement Learning

Xun Li
HK PolyU
Hong Kong
Co-Author(s):    Xiangyu Cui, Yun Shi, Si Zhao
Abstract:
This talk studies a discrete-time mean-variance model based on reinforcement learning. Compared with its continuous-time counterpart in Wang and Zhou (2020), the discrete-time model makes more general assumptions about the asset`s return distribution. Using entropy to measure the cost of exploration, we derive the optimal investment strategy, whose density function is also Gaussian type. Additionally, we design the corresponding reinforcement learning algorithm. Both simulation experiments and empirical analysis indicate that our discrete-time model exhibits better applicability when analyzing real-world data than the continuous-time model.