Showing results 1 to 20 of 40
next >
Issue Date | Title | Author(s) |
2023 | ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency | Li, Chuming; Liu, Jie; Zhang, Yinmin; Wei, Yuhong; Niu, Yazhe; Yang, Yaodong; Liu, Yu; Ouyang, Wanli |
Dec-2024 | Adaptive pessimism via target Q-value for offline reinforcement learning | Liu, Jie; Zhang, Yinmin; Li, Chuming; Yang, Yaodong; Liu, Yu; Ouyang, Wanli |
2024 | AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents | Cui, Jieming; Liu, Tengyu; Liu, Nian; Yang, Yaodong; Zhu, Yixin; Huang, Siyuan |
May-2024 | The application of large language models in medicine: A scoping review | Meng, Xiangbin; Yan, Xiangyu; Zhang, Kuo; Liu, Da; Cui, Xiaojuan; Yang, Yaodong; Zhang, Muhan; Cao, Chunxia; Wang, Jingjia; Wang, Xuliang; Gao, Jun; Wang, Yuan-Geng-Shuo; Ji, Jia-ming; Qiu, Zifeng; Li, Muzi; Qian, Cheng; Guo, Tianze; Ma, Shuangquan; Wang, Zeying; Guo, Zexuan; Lei, Youlan; Shao, Chunli; Wang, Wenyao; Fan, Haojun; Tang, Yi-Da |
Jun-2024 | ASP: Learn a Universal Neural Solver! | Wang, Chenguang; Yu, Zhouliang; McAleer, Stephen; Yu, Tianshu; Yang, Yaodong |
Mar-2025 | <bold>JARVIS</bold>-1: Open-World Multi-Task Agents With Memory-Augmented Multimodal Language Models | Wang, Zihao; Cai, Shaofei; Liu, Anji; Jin, Yonggang; Hou, Jinbing; Zhang, Bowei; Lin, Haowei; He, Zhaofeng; Zheng, Zilong; Yang, Yaodong; Ma, Xiaojian; Liang, Yitao |
2022 | Debias the Black-Box: A Fair Ranking Framework via Knowledge Distillation | Zhu, Zhitao; Si, Shijing; Wang, Jianzong; Yang, Yaodong; Xiao, Jing |
Feb-2025 | Discrete Information Acquisition in Financial Markets | Pan, Jingrui; Liu, Shancun; Zhang, Qiang; Yang, Yaodong |
2023 | Dynamic Handover: Throw and Catch with Bimanual Hands | Huang, Binghao; Chen, Yuanpei; Wang, Tianyu; Qin, Yuzhe; Yang, Yaodong; Atanasov, Nikolay; Wang, Xiaolong |
Jun-2023 | Editorial Special Issue on Simulation and AI | Peng, Yijie; Yang, Yaodong |
3-Sep-2024 | Efficient and scalable reinforcement learning for large-scale network control | Ma, Chengdong; Li, Aming; Du, Yali; Dong, Hao; Yang, Yaodong |
2023 | GenDexGrasp: Generalizable Dexterous Grasping | Li, Puhao; Liu, Tengyu; Li, Yuyang; Geng, Yiran; Zhu, Yixin; Yang, Yaodong; Huang, Siyuan |
May-2024 | Grasp Multiple Objects With One Hand | Li, Yuyang; Liu, Bo; Geng, Yiran; Li, Puhao; Yang, Yaodong; Zhu, Yixin; Liu, Tengyu; Huang, Siyuan |
2024 | Heterogeneous-Agent Reinforcement Learning | Zhong, Yifan; Kuba, Jakub Grudzien; Feng, Xidong; Hu, Siyi; Ji, Jiaming; Yang, Yaodong |
2023 | Hierarchical Multi-Agent Skill Discovery | Yang, Mingyu; Yang, Yaodong; Lu, Zhenbo; Zhou, Wengang; Li, Houqiang |
Dec-2023 | Large sequence models for sequential decision-making: a survey | Wen, Muning; Lin, Runji; Wang, Hanjing; Yang, Yaodong; Wen, Ying; Mai, Luo; Wang, Jun; Zhang, Haifeng; Zhang, Weinan |
2023 | Learning to Shape Rewards Using a Game of Two Partners | Mguni, David; Jafferjee, Taher; Wang, Jianhong; Perez-Nieves, Nicolas; Song, Wenbin; Tong, Feifei; Taylor, Matthew E.; Yang, Tianpei; Dai, Zipeng; Chen, Hui; Zhu, Jiangcheng; Shao, Kun; Wang, Jun; Yang, Yaodong |
May-2022 | Measuring the Non-Transitivity in Chess | Sanjaya, Ricky; Wang, Jun; Yang, Yaodong |
2023 | MSRL: Distributed Reinforcement Learning with Dataflow Fragments | Zhu, Huanzhou; Zhao, Bo; Chen, Gang; Chen, Weifeng; Chen, Yijie; Shi, Liang; Yang, Yaodong; Pietzuch, Peter; Chen, Lei |
29-Dec-2024 | Multi-Agent Deep Reinforcement Learning for Multi-Echelon Inventory Management | Liu, Xiaotian; Hu, Ming; Peng, Yijie; Yang, Yaodong |