You are on page 1of 1

Z. Xiang, et al.

Reliability Engineering and System Safety 199 (2020) 106901

[41] Sutton RS. Barto A G. Reinforcement learning: an introduction[J]. IEEE Trans really differ?[J]. Probab Eng Mech 2009;24(4):577–84.
Neural Netw 1998;9(5). 1054-1054. [46] He K., Zhang X., Ren S., et al. Delving deep into rectifiers: surpassing human-level
[42] Watkins CJCH, Dayan P. Technical note: Q-learning[J]. Mach Learn 1992;8(3- performance on imagenet classification[J]. 2015.
4):279–92. [47] Zheng H, Yang Z, Liu W, et al. Improving deep neural networks using softplus units
[43] Konda V. Actor-critic algorithms[J]. SIAM J Control Optim 2003;42(4):1143–66. [C]//2015. IEEE 2015:1–4.
[44] Human-level control through deep reinforcement learning[J]. Nature [48] Tieleman Tijmen, Hinton Geoffrey. Lecture 6.5-rmsprop: divide the gradient by a
2015;518(7540):529–33. running average of its recent magnitude. COURSERA 2012;4(2):26–31.
[45] Lebrun Régis, Dutfoy A. Do Rosenblatt and Nataf isoprobabilistic transformations

10

You might also like