Northeastern University Information Retrieval Group

Research

Conference

Advancing LLM Reasoning Generalists with Preference Trees
Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Boji Shan, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, Maosong Sun
Proceedings of ICLR  ·  2025  ·  CCF-Expanded
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
Shi Yu, Chaoyue Tang, Bokai Xu, Junbo Cui, Junhao Ran, Yukun Yan, Zhenghao Liu, Shuo Wang, Xu Han, Zhiyuan Liu, Maosong Sun
Proceedings of ICLR  ·  2025  ·  CCF-Expanded
RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
Xinze Li, Sen Mei, Zhenghao Liu, Yukun Yan, Shuo Wang, Shi Yu, Zheni Zeng, Hao Chen, Ge Yu, Zhiyuan Liu, Maosong Sun, Chenyan Xiong
Proceedings of ICLR  ·  2025  ·  CCF-Expanded
COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis
Weiqing Yang, Hanbin Wang, Zhenghao Liu, Xinze Li, Yukun Yan, Shuo Wang, Yu Gu, Minghe Yu, Zhiyuan Liu, Ge Yu
Proceedings of NAACL (Findings)  ·  2025  ·  CCF-B
Exploring the Potential of Dimension Reduction in Building Efficient Dense Retrieval Systems
Zhipeng Xu, Zhenghao Liu, Yu Gu, Ge Yu
Proceedings of CCIR  ·  2024
Chameleon: Towards Update-Efficient Learned Indexing for Locally Skewed Data
Na Guo, Yaqi Wang, Wenli Sun, Yu Gu, Jianzhong Qi, Zhenghao Liu, Xiufeng Xia, Ge Yu
Proceedings of ICDE  ·  2024  ·  CCF-A
MCTS: A Multi-Reference Chinese Text Simplification Dataset
Ruining Chong, Luming Lu, Liner Yang, Jinran Nie, Zhenghao Liu, Shuo Wang, Shuhan Zhou, Yaoxin Li, Erhong Yang
Proceedings of COLING  ·  2024  ·  CCF-B
Fusion-in-T5: Unifying Document Ranking Signals for Improved Information Retrieval
Shi Yu, Chenghao Fan, Chenyan Xiong, David Jin, Zhiyuan Liu, Zhenghao Liu
Proceedings of COLING  ·  2024  ·  CCF-B
Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model
Cheng Qian, Chenyan Xiong, Zhenghao Liu, Zhiyuan Liu
Proceedings of NAACL  ·  2024  ·  CCF-B
Modeling User Viewing Flow Using Large Language Models for Article Recommendation
Zhenghao Liu, Zulong Chen, Moufeng Zhang, Shaoyang Duan, Hong Wen, Liangyue Li, Nan Li, Yu Gu, Ge Yu
Proceedings of WWW  ·  2024  ·  CCF-A
Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs
Cheng Gao, Chaojun Xiao, Zhenghao Liu, Huimin Chen, Zhiyuan Liu, Maosong Sun
Proceedings of EMNLP  ·  2024  ·  CCF-B
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization
Zhiyu Yang, Zihan Zhou, Shuo Wang, Xin Cong, Xu Han, Yukun Yan, Zhenghao Liu, Zhixing Tan, Pengyuan Liu, Dong Yu, Zhiyuan Liu, Xiaodong Shi, Maosong Sun
Proceedings of ACL (Findings)  ·  2024  ·  CCF-A
INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair
Hanbin Wang, Zhenghao Liu, Shuo Wang, Ganqu Cui, Ning Ding, Zhiyuan Liu, Ge Yu
Proceedings of ACL (Findings)  ·  2024  ·  CCF-A
MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin
Tianshuo Zhou, Sen Mei, Xinze Li, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu, Yu Gu, Ge Yu
Proceedings of ACL  ·  2024  ·  CCF-A
UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
Haoyu Wang, Shuo Wang, Yukun Yan, Xujia Wang, Zhiyu Yang, Yuzhuang Xu, Zhenghao Liu, Ning Ding, Xu Han, Zhiyuan Liu, Maosong Sun
Proceedings of ACL  ·  2024  ·  CCF-A
Cleaner Pretraining Corpus Curation with Neural Web Scraping
Zhipeng Xu, Zhenghao Liu, Yukun Yan, Zhiyuan Liu, Chenyan Xiong, Ge Yu
Proceedings of ACL  ·  2024  ·  CCF-A
Text Matching Improves Sequential Recommendation by Reducing Popularity Biases
Zhenghao Liu, Sen Mei, Chenyan Xiong, Xiaohua Li, Shi Yu, Zhiyuan Liu, Yu Gu, Ge Yu
Proceedings of CIKM  ·  2023  ·  CCF-B
Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval
Zhenghao Liu, Chenyan Xiong, Yuanhuiyi Lv, Zhiyuan Liu, Ge Yu
Proceedings of ICLR  ·  2023  ·  CCF-Expanded
OpenMatch-v2: An All-in-one Multi-Modality PLM-based Information Retrieval Toolkit
Shi Yu, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu
Proceedings of SIGIR  ·  2023  ·  CCF-A
Structure-Aware Language Model Pretraining Improves Dense Retrieval on Structured Data
Xinze Li, Zhenghao Liu, Chenyan Xiong, Shi Yu, Yu Gu, Zhiyuan Liu, Ge Yu
Proceedings of ACL (Findings)  ·  2023  ·  CCF-A
Leveraging Prefix Transfer for Multi-Intent Text Revision
Ruining Chong, Cunliang Kong, Liu Wu, Zhenghao Liu, Ziye Jin, Liner Yang, Yange Fan, Hanghang Fan, Erhong Yang
Proceedings of ACL  ·  2023  ·  CCF-A
P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning
Xiaomeng Hu, Shi Yu, Chenyan Xiong, Zhenghao Liu, Zhiyuan Liu, Ge Yu
Proceedings of SIGIR  ·  2022  ·  CCF-A
Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder
Zhenghao Liu, Han Zhang, Chenyan Xiong, Zhiyuan Liu, Yu Gu, Xiaohua Li
Proceedings of EMNLP  ·  2022  ·  CCF-B
Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction
Zhenghao Liu, Xiaoyuan Yi, Maosong Sun, Liner Yang, Tat-Seng Chua
Proceedings of NAACL  ·  2021  ·  CCF-B
More Robust Dense Retrieval with Contrastive Dual Learning
Yizhi Li, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu
Proceedings of ICTIR  ·  2021
OpenMatch: An Open Source Library for Neu-IR Research
Zhenghao Liu, Kaitao Zhang, Chenyan Xiong, Zhiyuan Liu, Maosong Sun
Proceedings of SIGIR  ·  2021  ·  CCF-A
Few-Shot Conversational Dense Retrieval
Shi Yu, Zhenghao Liu, Chenyan Xiong, Tao Feng, Zhiyuan Liu
Proceedings of SIGIR  ·  2021  ·  CCF-A
Capturing Global Informativeness in Open Domain Keyphrase Extraction
Si Sun, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu, Jie Bao
Proceedings of NLPCC  ·  2021  ·  CCF-C
TIAGE: A Benchmark for Topic-Shift Aware Dialog Modeling
Huiyuan Xie, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu, Ann Copestake
Proceedings of EMNLP (Findings)  ·  2021  ·  CCF-B
Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision
Si Sun, Yingzhuo Qian, Zhenghao Liu, Chenyan Xiong, Kaitao Zhang, Jie Bao, Zhiyuan Liu, Paul Bennett
Proceedings of ACL-IJCNLP  ·  2021  ·  CCF-A
Text Style Transfer via Learning Style Instance Supported Latent Space
Kaitao Zhang, Chenyan Xiong, Zhenghao Liu, Zhiyuan Liu
Proceedings of IJCAI  ·  2020  ·  CCF-A
Selective Weak Supervision for Neural Information Retrieval
Kaitao Zhang, Chenyan Xiong, Zhenghao Liu, Zhiyuan Liu
Proceedings of WWW  ·  2020  ·  CCF-A
Adapting Open Domain Fact Extraction and Verification to COVID-FACT through In-Domain Language Modeling
Zhenghao Liu, Chenyan Xiong, Zhuyun Dai, Si Sun, Maosong Sun, Zhiyuan Liu
Proceedings of EMNLP (Findings)  ·  2020  ·  CCF-B
Coreferential Reasoning Learning for Language Representation
Deming Ye, Yankai Lin, Jiaju Du, Zhenghao Liu, Peng Li, Maosong Sun, Zhiyuan Liu
Proceedings of EMNLP  ·  2020  ·  CCF-B
Fine-grained Fact Verification with Kernel Graph Attention Network
Zhenghao Liu, Chenyan Xiong, Maosong Sun, Zhiyuan Liu
Proceedings of ACL  ·  2020  ·  CCF-A
Grounded Conversation Generation as Guided Traverses in Commonsense Knowledge Graphs
Houyu Zhang, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu
Proceedings of ACL  ·  2020  ·  CCF-A
Explore Entity Embedding Effectiveness in Entity Retrieval
Zhenghao Liu, Chenyan Xiong, Maosong Sun, Zhiyuan Liu
Proceedings of CCL  ·  2019
DocRED: A Large-Scale Document-Level Relation Extraction Dataset
Yuan Yao, Deming Ye, Peng Li, Xu Han, Yankai Lin, Zhenghao Liu, Zhiyuan Liu, Lixin Huang, Jie Zhou, Maosong Sun
Proceedings of ACL  ·  2019  ·  CCF-A
Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval
Zhenghao Liu, Chenyan Xiong, Maosong Sun, Zhiyuan Liu
Proceedings of ACL  ·  2018  ·  CCF-A

Journal

Building A Coding Assistant via the Retrieval-Augmented Language Model
Xinze Li, Hanbin Wang, Zhenghao Liu, Shi Yu, Shuo Wang, Yukun Yan, Yukai Fu, Yu Gu, Ge Yu
Journal of ACM Transactions on Information Systems (TOIS)  ·  2024  ·  CCF-A
CHGNN: A Semi-Supervised Contrastive Hypergraph Learning Network
Yumeng Song, Yu Gu, Tianyi Li, Jianzhong Qi, Zhenghao Liu, Christian S Jensen, Ge Yu
Journal of IEEE Transactions on Knowledge and Data Engineering (TKDE)  ·  2024  ·  CCF-A
Tailored Definitions With Easy Reach: Complexity-Controllable Definition Generation
Liner Yang, Jiaxin Yuan, Cunliang Kong, Jingsi Yu, Ruining Chong, Zhenghao Liu, Erhong Yang
Journal of IEEE Transactions on Big Data (TBD)  ·  2024  ·  CCF-A
Multi-Evidence based Fact Verification via A Confidential Graph Neural Network
Yuqing Lan, Zhenghao Liu, Yu Gu, Xiaoyuan Yi, Xiaohua Li, Liner Yang, Ge Yu
Journal of IEEE Transactions on Big Data (TBD)  ·  2024  ·  CCF-A
Neural Parse Combination
Liner Yang, Maosong Sun, Jiacheng Zhang, Zhenghao Liu, Huanbo Luan, Yang Liu
Journal of Computer Science and Technology (JCST)  ·  2017  ·  CCF-B