References
- Sungpill Kim, Deep Learning First Step, pp.17-33, Hanbit Media, 2016.
- Taewoo Lee, Jinhoo Ryu, Heemin Park, "Hovering Control of 1-Axial Drone with Reinforcement Learning", Journal of Korea Multimedia Society, Vol.21, No.2, pp.250-260, 2018. https://doi.org/10.9717/kmms.2018.21.2.250
- Daniel R. Jiang, Emmanuel Ekwedike, Han Liu, "Feedback-Based Tree Search for Reinforcement Learning", arXiv:1805.05935, 2018.
- Jeongsoo Han, "A Study of Adaptive QoS Routing scheme using Policy-gradient Reinforcement Learning", Journal of the Korea Society of Computer and Information, Vol.16, No.2, pp.93-99, 2011. https://doi.org/10.9708/jksci.2011.16.2.093
- Jongho Kim, Daesung Kang, Jooyoung Park, "Robot Locomotion via RLS-based Actor-Critic Learning", Journal of Korean Institute of Intelligent Systems, Vol.15, No.7, pp.893-898, 2005. https://doi.org/10.5391/JKIIS.2005.15.7.893
- Arthur Juliani, "Introducing: Unity Machine Learning Agents Toolkit", Unity Blog, https://blogs.unity3d.com/2017/09/19/introducing-unity-machine-learning-agents/, 2017.
- Wooil Shim, Taehwa Park, Kyungjoong Kim, "Comparison of Policy Optimization Reinforcement Learning for Simulated Autonomous Car Environment", Korea Information Science Society, pp.833-835, 2018.
- Adrian Gonzalez Ramirez, "Neural networks applied to a tower defense video game", Universitat Jaume I, Grau en Disseny i Desenvolupament de Videojocs [94], 2018.
- Arthur Juliani, Vincent-Pierre Berges, Esh Vckay, Yuan Gao, Hunter Henry, Marwan Mattar, Danny Lange, "ML-Agents Toolkit Overview", https://github.com/Unity-Technologies/ml-agents/blob/master/docs/ML-Agents-Overview.md, 2017.
- Jaehoon Lee, Taerim Kim, Jonggyu Song, Hyunjae Im, "Flight Trajectory Simulation via Reinforcement Learning in Virtual Environment", Journal of the Korea Society for Simulation, Vol.27, No.4, pp.1-8, 2018. https://doi.org/10.9709/JKSS.2018.27.4.001
- Sonic, "PPO (Proximal Policy Optimization Algorithms) | Machine Learning & QA", Naver Blog, https://cafe.naver.com/soynature/2400, 2017.
- Saemaro Moon, Yonglak Choi, "A Study on Application of Reinforcement Learning Algorithm Using Pixel Data", Journal of Information Technology Services, Vol.15, No.4, pp.85-95, 2016. https://doi.org/10.9716/KITS.2016.15.4.085
- John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov, "Proximal Policy Optimization Algorithms", OpenAI, arxiv.org/pdf/1707.06347, 2017.
- RL Korea, "PG Travel Guide", RLKoreaBlog, https://reinforcement-learning-kr.github.io/2018/06/29/0_pg-travel-guide/#, 2018.
- Kyeongnam Kim, "ML-Agents Project Organization Unity ML / Unity", Naver Blog, https://blog.naver.com/kkyy0126/221448746477, 2019.
- Arthur Juliani, Vincent-Pierre Berges, Esh Vckay, Yuan Gao, Hunter Henry, Marwan Mattar, Danny Lange, "Training with Proximal Policy Optimization", https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Training-PPO.md, 2017.
- Arthur Juliani, Vincent-Pierre Berges, Esh Vckay, Yuan Gao, Hunter Henry, Marwan Mattar, Danny Lange, "Getting Started with the 3D Balance Ball Environment", https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Getting-Started-with-Balance-Ball.md#observing-training-progress, 2017.