Shangdong Yang

Cited by

	All	Since 2019
Citations	88	79
h-index	5	5
i10-index	3	3

Citations per year: 2017: 3, 2018: 6, 2019: 6, 2020: 17, 2021: 13, 2022: 11, 2023: 19, 2024: 13

Public access

4 articles

8 articles

available

not available

Based on funding mandates

Co-authors

Yang Gao, Nanjing University, China (verified email at nju.edu.cn)
Hao Wang, Wuhan University (verified email at whu.edu.cn)
Xingguo Chen (陈兴国), Nanjing University of Posts and Telecommunications (verified email at njupt.edu.cn)

Shangdong Yang

Nanjing University of Posts and Telecommunications

Verified email at njupt.edu.cn

Reinforcement Learning, Multi-agent Systems, Multi-armed Bandits


Title. Authors. Venue	Cited by	Year
Efficient Average Reward Reinforcement Learning Using Constant Shifting Values. S Yang, Y Gao, B An, H Wang, X Chen. Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016	30	2016
An Optimal Algorithm for the Stochastic Bandits While Knowing the Near-Optimal Mean Reward. S Yang, Y Gao. IEEE Transactions on Neural Networks and Learning Systems 32 (5), 2285-2291, 2021	11	2021
A Contextual Bandit Approach to Personalized Online Recommendation via Sparse Interactions. C Zhang, H Wang, S Yang, Y Gao. Advances in Knowledge Discovery and Data Mining: 23rd Pacific-Asia …, 2019	11	2019
Contextual Bandits With Hidden Features to Online Recommendation via Sparse Interactions. S Yang, H Wang, C Zhang, Y Gao. IEEE Intelligent Systems 35 (5), 62-72, 2020	8	2020
New Galois Hulls Of Generalized Reed-Solomon Codes. Y Wu, C Li, S Yang. Finite Fields and Their Applications 83, 102084, 2022	7	2022
Incremental Nonnegative Matrix Factorization Based on Matrix Sketching and k-means Clustering. C Zhang, H Wang, S Yang, Y Gao. Intelligent Data Engineering and Automated Learning–IDEAL 2016: 17th …, 2016	5	2016
Effective Interpretable Policy Distillation via Critical Experiences Identification. X Liu, S Liu, B An, Y Gao, S Yang, W Li. IEEE Intelligent Systems, 2023	4	2023
Modified Retrace for Off-Policy Temporal Difference Learning. X Chen, X Ma, Y Li, G Yang, S Yang, Y Gao. 39th Conference On Uncertainty in Artificial Intelligence, 2023	2	2023
Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient. W Chen, W Li, X Liu, S Yang, Y Gao. Proceedings of the AAAI Conference on Artificial Intelligence, 2023	2	2023
Keeping Minimal Experience to Achieve Efficient Interpretable Policy Distillation. X Liu, S Liu, W Li, S Yang, Y Gao. arXiv preprint arXiv:2203.00822, 2022	2	2022
An Optimal Algorithm for the Stochastic Bandits with Knowing Near-optimal Mean Reward. S Yang, H Wang, Y Gao, X Chen. Proceedings of the 17th International Conference on Autonomous Agents and …, 2018	2	2018
WToE: Learning When to Explore in Multi-Agent Reinforcement Learning. S Dong, H Mao, S Yang, Z Shengyu, L Wenbin, H Jianye, Y Gao. IEEE Transactions on Cybernetics, 2023	1	2023
Modeling rationality: Toward better performance against unknown agents in sequential games. Z Ge, S Yang, P Tian, Z Chen, Y Gao. IEEE Transactions on Cybernetics, 2022	1	2022
Online attentive kernel-based temporal difference learning. G Yang, X Chen, S Yang, H Wang, S Dong, Y Gao. arXiv preprint arXiv:2201.09065, 2022	1	2022
Learning Credit Assignment for Cooperative Reinforcement Learning. W Chen, W Li, X Liu, S Yang. arXiv preprint arXiv:2210.05367, 2022	1	2022
Multi-Agent Sparse Interaction Modeling is an Anomaly Detection Problem. C Li, S Dong, S Yang, H Cao, W Li, Y Gao. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Online attentive kernel-based temporal difference learning. X Chen, G Yang, S Yang, H Wang, S Dong, Y Gao. Knowledge-Based Systems 278, 110902, 2023		2023
Enhancing OOD Generalization in Offline Reinforcement Learning with Energy-Based Policy Optimization. H Cao, S Yang, J Huo, X Chen, Y Gao. 26th European Conference on Artificial Intelligence ECAI 2023, 2023		2023
Convergence Analysis of Graphical Game-based Nash Q-learning Using the Interaction Detection Signal of N-step Return. Y Zhuang, S Yang, W Li, Y Gao. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023		2023
Leveraging transition exploratory bonus for efficient exploration in Hard-Transiting reinforcement learning problems. S Yang, H Wang, S Dong, X Chen. Future Generation Computer Systems, 2023		2023
