Issue Date | Title | Author(s) |
2021 | ACMo: Angle-Calibrated Moment Methods for Stochastic Optimization | Huang, Xunpeng; Xu, Runxin; Zhou, Hao; Wang, Zhe; Liu, Zhengyang; Li, Lei |
2024 | DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models | Dai, Damai; Deng, Chengqi; Zhao, Chenggang; Xu, Runxin; Gao, Huazuo; Chen, Deli; Li, Jiashi; Zeng, Wangding; Yu, Xingkai; Wu, Y.; Xie, Zhenda; Li, Y. K.; Huang, Panpan; Luo, Fuli; Ruan, Chong; Su, Zhifang; Liang, Wenfeng |
2021 | Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker | Xu, Runxin; Liu, Tianyu; Li, Lei; Chang, Baobao |
2022 | A Double-Graph Based Framework for Frame Semantic Parsing | Zheng, Ce; Chen, Xudong; Xu, Runxin; Chang, Baobao |
2022 | An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling | Wang, Peiyi; Xu, Runxin; Liu, Tianyu; Zhou, Qingyu; Cao, Yunbo; Chang, Baobao; Sui, Zhifang |
2022 | Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation | Chen, Liang; Xu, Runxin; Chang, Baobao |
2022 | From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression | Xu, Runxin; Luo, Fuli; Wang, Chengyu; Chang, Baobao; Huang, Jun; Huang, Songfang; Huang, Fei |
2024 | Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations | Wang, Peiyi; Li, Lei; Shao, Zhihong; Xu, Runxin; Dai, Damai; Li, Yifei; Chen, Deli; Wu, Yu; Sui, Zhifang |
2024 | Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models | Li, Lei; Wang, Yuqi; Xu, Runxin; Wang, Peiyi; Feng, Xiachong; Kong, Lingpeng; Liu, Qi |
2022 | Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency | Li, Yanyang; Luo, Fuli; Xu, Runxin; Huang, Songfang; Huang, Fei; Wang, Liwei |
2021 | Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning | Xu, Runxin; Luo, Fuli; Zhang, Zhiyuan; Tan, Chuanqi; Chang, Baobao; Huang, Songfang; Huang, Fei |
2022 | Tuning: A Simple Cross-lingual Sub-network Tuning Method | Xu, Runxin; Luo, Fuli; Chang, Baobao; Huang, Songfang; Huang, Fei |
2022 | A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction | Xu, Runxin; Wang, Peiyi; Liu, Tianyu; Zeng, Shuang; Chang, Baobao; Sui, Zhifang |