Research
Conference

Advancing LLM Reasoning Generalists with Preference Trees
Proceedings of ICLR
·
2025
· CCF-Expanded

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
Proceedings of ICLR
·
2025
· CCF-Expanded

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
Proceedings of ICLR
·
2025
· CCF-Expanded

COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis
Proceedings of NAACL (Findings)
·
2025
· CCF-B

Exploring the Potential of Dimension Reduction in Building Efficient Dense Retrieval Systems
Proceedings of CCIR
·
2024

Chameleon: Towards Update-Efficient Learned Indexing for Locally Skewed Data
Proceedings of ICDE
·
2024
· CCF-A

MCTS: A Multi-Reference Chinese Text Simplification Dataset
Proceedings of COLING
·
2024
· CCF-B

Fusion-in-T5: Unifying Document Ranking Signals for Improved Information Retrieval
Proceedings of COLING
·
2024
· CCF-B

Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model
Proceedings of NAACL
·
2024
· CCF-B

Modeling User Viewing Flow Using Large Language Models for Article Recommendation
Proceedings of WWW
·
2024
· CCF-A

Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs
Proceedings of EMNLP
·
2024
· CCF-B

MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization
Proceedings of ACL (Findings)
·
2024
· CCF-A

INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair
Proceedings of ACL (Findings)
·
2024
· CCF-A

MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin
Proceedings of ACL
·
2024
· CCF-A

UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
Proceedings of ACL
·
2024
· CCF-A

Cleaner Pretraining Corpus Curation with Neural Web Scraping
Proceedings of ACL
·
2024
· CCF-A

Text Matching Improves Sequential Recommendation by Reducing Popularity Biases
Proceedings of CIKM
·
2023
· CCF-B

Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval
Proceedings of ICLR
·
2023
· CCF-Expanded

OpenMatch-v2: An All-in-one Multi-Modality PLM-based Information Retrieval Toolkit
Proceedings of SIGIR
·
2023
· CCF-A

Structure-Aware Language Model Pretraining Improves Dense Retrieval on Structured Data
Proceedings of ACL (Findings)
·
2023
· CCF-A

Leveraging Prefix Transfer for Multi-Intent Text Revision
Proceedings of ACL
·
2023
· CCF-A

P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning
Proceedings of SIGIR
·
2022
· CCF-A

Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder
Proceedings of EMNLP
·
2022
· CCF-B

Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction
Proceedings of NAACL
·
2021
· CCF-B

More Robust Dense Retrieval with Contrastive Dual Learning
Proceedings of ICTIR
·
2021

OpenMatch: An Open Source Library for Neu-IR Research
Proceedings of SIGIR
·
2021
· CCF-A

Few-Shot Conversational Dense Retrieval
Proceedings of SIGIR
·
2021
· CCF-A

Capturing Global Informativeness in Open Domain Keyphrase Extraction
Proceedings of NLPCC
·
2021
· CCF-C

TIAGE: A Benchmark for Topic-Shift Aware Dialog Modeling
Proceedings of EMNLP (Findings)
·
2021
· CCF-B

Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision
Proceedings of ACL-IJCNLP
·
2021
· CCF-A

Text Style Transfer via Learning Style Instance Supported Latent Space
Proceedings of IJCAI
·
2020
· CCF-A

Selective Weak Supervision for Neural Information Retrieval
Proceedings of WWW
·
2020
· CCF-A

Adapting Open Domain Fact Extraction and Verification to COVID-FACT through In-Domain Language Modeling
Proceedings of EMNLP (Findings)
·
2020
· CCF-B

Coreferential Reasoning Learning for Language Representation
Proceedings of EMNLP
·
2020
· CCF-B

Fine-grained Fact Verification with Kernel Graph Attention Network
Proceedings of ACL
·
2020
· CCF-A

Grounded Conversation Generation as Guided Traverses in Commonsense Knowledge Graphs
Proceedings of ACL
·
2020
· CCF-A

Explore Entity Embedding Effectiveness in Entity Retrieval
Proceedings of CCL
·
2019

DocRED: A Large-Scale Document-Level Relation Extraction Dataset
Proceedings of ACL
·
2019
· CCF-A

Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval
Proceedings of ACL
·
2018
· CCF-A

Journal

Building A Coding Assistant via the Retrieval-Augmented Language Model
ACM Transactions on Information Systems (TOIS)
·
2024
· CCF-A

CHGNN: A Semi-Supervised Contrastive Hypergraph Learning Network
IEEE Transactions on Knowledge and Data Engineering (TKDE)
·
2024
· CCF-A

Tailored Definitions With Easy Reach: Complexity-Controllable Definition Generation
IEEE Transactions on Big Data (TBD)
·
2024
· CCF-A

Multi-Evidence based Fact Verification via A Confidential Graph Neural Network
IEEE Transactions on Big Data (TBD)
·
2024
· CCF-A

Neural Parse Combination
Journal of Computer Science and Technology (JCST)
·
2017
· CCF-B