9/4/2023

Yang Yu

Analysis of noisy evolutionary optimization when sampling fails. Chao Qian, Chao Bian, Yang Yu, Ke Tang, and Xin Yao.
Reinforcement learning to rank in e-commerce search engine: Formalization, analysis, and application. Yujing Hu, Qing Da, Anxiang Zeng, Yang Yu, and Yinghui Xu. In: Proceedings of the 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'18) (Applied Track), London, UK, 2018.
Stabilizing reinforcement learning in dynamic environment with application to online recommendation. Shi-Yong Chen, Yang Yu, Qing Da, Jun Tan, Hai-Kuan Huang, and Hai-Hong Tang. In: Proceedings of the 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'18) (Research Track), London, UK, 2018.
Towards AutoML in the presence of drift: First results. Morales, Wei-Wei Tu, Yang Yu, Lisheng Sun-Hosoya, Isabelle Guyon, and Michele Sebag. In: ICML 2018 Workshop on AutoML, Stockholm, Sweden, 2018.
Towards sample efficient reinforcement learning. In: Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI'18) (Early Career), Stockholm, Sweden, 2018, pp. 5739-5743.
Multi-layered gradient boosting decision trees. In: Advances in Neural Information Processing Systems 31 (NIPS'18), Montreal, Canada, 2018.
On reinforcement learning for full-length game of StarCraft. Zhen-Jia Pang, Ruo-Ze Liu, Zhou-Yu Meng, Yi Zhang, Yang Yu, and Tong Lu. In: Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI'19), Honolulu, HI, 2019.
Multi-fidelity automatic hyper-parameter tuning via transfer series expansion. Yi-Qi Hu, Yang Yu, Wei-Wei Tu, Qiang Yang, Yuqiang Chen, and Wenyuan Dai. In: Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI'19), Honolulu, HI, 2019.
Virtual-Taobao: Virtualizing real-world online retail environment for reinforcement learning. Jing-Cheng Shi, Yang Yu, Qing Da, Shi-Yong Chen, and An-Xiang Zeng. In: Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI'19), Honolulu, HI, 2019.
Environment reconstruction with hidden confounders for reinforcement learning based recommendation. Wenjie Shang, Yang Yu, Qingyang Li, Zhiwei Qin, Yiping Meng, and Jieping Ye. In: Proceedings of the 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'19) (Research Track), Anchorage, AK, 2019.
Reinforcement learning experience reuse with policy residual representation. Wen-Ji Zhou, Yang Yu, Yingfeng Chen, Kai Guan, Tangjie Lv, Changjie Fan, and Zhi-Hua Zhou. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI'19), Macao, China, 2019.
Cascaded algorithm-selection and hyper-parameter optimization with extreme-region upper confidence bound bandit. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI'19), Macao, China, 2019.
Reinforcement Learning with Derivative-Free Exploration. In: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS'19), Montreal, Canada, 2019, pp. 1880-1882.
Asynchronous Classification-Based Optimization. Yu-Ren Liu, Yi-Qi Hu, Hong Qian, and Yang Yu. In: Proceedings of the 1st International Conference on Distributed Artificial Intelligence (DAI'19), Beijing, China, 2019.
Bridging machine learning and logical reasoning by abductive learning. In: Advances in Neural Information Processing Systems 32 (NeurIPS'19), Vancouver, Canada, 2019.