中国科大学位与研究生教育
课程名称: 教师:
当前位置:
 >> 
 >> 
无人机结合多智能体增强学习
无人机结合多智能体增强学习
教师介绍

本讲教师:Arumugam Nallanathan
所属学科:工科
人  气:1890

课程介绍
报告人简介:Arumugam Nallanathan is Professor of Wireless Communications and Head of the Communication Systems Research (CSR) group in the School of Electronic Engineering and Computer Science at Queen Mary University of London since September 2017. His research interests include Artificial Intelligence for wireless systems, 5G and beyond Wireless Networks, Internet of Things (IoT) and Molecular Communications. He is an Editor for IEEE Transactions on Communications. He was an Editor for IEEE Transactions on Wireless Communications (2006-2011), IEEE Transactions on Vehicular Technology (2006-2017), IEEE Wireless Communications Letters and IEEE Signal Processing Letters. He is an IEEE Fellow and IEEE Distinguished Lecturer. Unmanned aerial vehicles (UAVs) can be served as aerial base stations (BSs) to provide cost-effective and on-demand wireless communications. UAVs are rapidly deployable for complementing the terrestrial communication based on a 3GPP LTE-A. Machine learning as a promising tool provides an autonomous and effective solution in an intelligent manner to enhance the UAVs enabled communication networks. However, most of the proposed machine learning algorithms focus on single UAV scenarios or multi-UAV scenarios by assuming the availability of complete network information for each UAV. In practice, it is difficult to have perfect knowledge of dynamic environments due to the high movement speed of UAVs, which imposes formidable challenges on the design of reliable UAV enabled wireless communications. In this talk, a novel framework based on stochastic game theory will be provided to model the dynamic resource allocation problem of multi-UAV networks and a multi-agent reinforcement learning (MARL) based resource allocation approach will be presented for solving the formulated stochastic game of multi-UAV networks. With the help of stochastic modelling, reinforcement learning based automated trajectory optimization approach will also be presented.
致谢:本课件的制作和发布均为公益目的,免费提供给公众学习和研究。对于本课件制作传播过程中可能涉及的作品或作品部分内容的著作权人以及相关权利人谨致谢意!
课件总访问人次:24127150
中国科学技术大学研究生网络课堂试运行版,版权属于中国科学技术大学研究生院。
本网站所有内容属于中国科学技术大学,未经允许不得下载传播。
地址:安徽省合肥市金寨路96号;邮编:230026。TEL:+86-551-63602929;E-mail:wlkt@ustc.edu.cn。

扫一扫,手机版