Multimedia >>

Self-learning Control for Wavefront Sensorless Adaptive Optics System Through Deep Reinforcement Learning

Author(s): Hu, K (Hu, Ke); Xu, B (Xu, Bing); Xu, ZX (Xu, Zhenxing); Wen, LH (Wen, Lianghua); Yang, P (Yang, Ping); Wang, S (Wang, Shuai); Dong, LZ (Dong, Lizhi) 

Source: OPTIK  Volume: 178  Pages: 785-793  DOI: 10.1016/j.ijleo.2018.09.160  Published: 2019   

Abstract: An aberration correction algorithm for wavefront sensorless adaptive optics (WFSless AO) systems based on deep reinforcement learning is presented. An actor critic structure is designed to evaluate a control policy through the deep deterministic policy gradient (DDPG) algorithm. The algorithm performance is verified with a set-up simulation environment. According to the correction results, the aberration correction process can be expressed as a Markov decision process (MDP). The method exhibits excellent performances in correction capacity and speed. Similar correction effects are obtained with the stochastic parallel-gradient descent (SPGD) algorithm and WFSless AO based on the general-modes (AOG) algorithm. Moreover, the correction speed is improved by approximately 9 and 2.5 times, respectively. 

ISSN: 0030-4026 

  Copyright © The Institute of Optics And Electronics, The chinese Academy of Sciences
Address: Box 350, Shuangliu, Chengdu, Sichuan, China
Email:dangban@ioe.ac.cn Post Code: 610 209 备案号:蜀ICP备05022581号