From Distillation to Hard Negative Sampling: Making Sparse Neural IR Models More Effective

Formal, Thibault; Lassance, Carlos; Piwowarski, Benjamin; Clinchant, Stéphane

Computer Science > Information Retrieval

arXiv:2205.04733 (cs)

[Submitted on 10 May 2022 (v1), last revised 12 May 2022 (this version, v2)]

Title:From Distillation to Hard Negative Sampling: Making Sparse Neural IR Models More Effective

Authors:Thibault Formal, Carlos Lassance, Benjamin Piwowarski, Stéphane Clinchant

View PDF

Abstract:Neural retrievers based on dense representations combined with Approximate Nearest Neighbors search have recently received a lot of attention, owing their success to distillation and/or better sampling of examples for training -- while still relying on the same backbone architecture. In the meantime, sparse representation learning fueled by traditional inverted indexing techniques has seen a growing interest, inheriting from desirable IR priors such as explicit lexical matching. While some architectural variants have been proposed, a lesser effort has been put in the training of such models. In this work, we build on SPLADE -- a sparse expansion-based retriever -- and show to which extent it is able to benefit from the same training improvements as dense models, by studying the effect of distillation, hard-negative mining as well as the Pre-trained Language Model initialization. We furthermore study the link between effectiveness and efficiency, on in-domain and zero-shot settings, leading to state-of-the-art results in both scenarios for sufficiently expressive models.

Comments:	Accepted at SIGIR22 as a short paper (this work is the extension of SPLADE v2)
Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL)
Cite as:	arXiv:2205.04733 [cs.IR]
	(or arXiv:2205.04733v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2205.04733

Submission history

From: Thibault Formal [view email]
[v1] Tue, 10 May 2022 08:08:43 UTC (263 KB)
[v2] Thu, 12 May 2022 14:58:24 UTC (609 KB)

Computer Science > Information Retrieval

Title:From Distillation to Hard Negative Sampling: Making Sparse Neural IR Models More Effective

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Computer Science > Information Retrieval

Title:From Distillation to Hard Negative Sampling: Making Sparse Neural IR Models More Effective

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.