Asymptotic Theory for IV-Based Reinforcement Learning with Potential Endogeneity

Authors: Jin Li, Ye Luo, Zigan Wang, and Xiaowei Zhang

We identify a new type of bias in data analysis, termed reinforcement bias, and develop instrumental-variable (IV) based reinforcement learning algorithms to correct it. We establish their theoretical properties by integrating them into a stochastic approximation framework. Our analysis accommodates iterate-dependent Markovian structures and can therefore be used to study RL algorithms with policy improvement.
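
To make the role of an instrument in a stochastic approximation update concrete, here is a minimal Python sketch. It is not the paper's algorithm: it is a generic Robbins-Monro iteration on a scalar linear model, and the data-generating process, the function name `simulate`, the step-size rule, and all parameter values are assumptions chosen only for illustration. The regressor `x` is correlated with the error `eps`, so the naive update settles on a biased value, while the update that multiplies the residual by the instrument `z` (correlated with `x`, independent of `eps`) targets the true coefficient.

```python
import numpy as np


def simulate(n_steps=20000, theta_true=2.0, seed=0):
    """Compare a naive and an IV-based stochastic approximation update.

    Hypothetical model (an assumption, not taken from the paper):
        x   = z + u                 regressor, endogenous through u
        eps = 0.8 * u + noise       error, correlated with x
        y   = theta_true * x + eps
    """
    rng = np.random.default_rng(seed)
    theta_naive = 0.0
    theta_iv = 0.0
    for t in range(n_steps):
        z = rng.normal()              # instrument: independent of eps
        u = rng.normal()              # shared shock that creates endogeneity
        x = z + u
        eps = 0.8 * u + rng.normal()
        y = theta_true * x + eps

        step = 1.0 / (t + 50)         # decreasing Robbins-Monro step size

        # Naive update: solves E[x * (y - theta * x)] = 0, whose root is
        # theta_true + Cov(x, eps) / E[x^2] = theta_true + 0.8 / 2.
        theta_naive += step * x * (y - theta_naive * x)

        # IV update: solves E[z * (y - theta * x)] = 0, whose root is
        # theta_true because E[z * eps] = 0 and E[z * x] = 1 > 0.
        theta_iv += step * z * (y - theta_iv * x)
    return theta_naive, theta_iv


if __name__ == "__main__":
    naive, iv = simulate()
    print(f"naive estimate: {naive:.3f}")
    print(f"IV estimate:    {iv:.3f}")
```

In this toy setting the draws are i.i.d. The setting described in the abstract is harder, since the data follow a Markovian structure that depends on the current iterate, which is what the stochastic approximation analysis is meant to handle.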
