29 - Khattab - CS224U IR Part 4
NEURAL IR (II)
Omar Khattab
Spring 2021
Neural Ranking: Functional View
▪ All we need is a score for every query–document pair
– We’ll sort the results by decreasing score
Example document D99 (from https://stackoverflow.com/questions/23998026):
“Noticed a line in our codebase today which I thought surely would have failed the build with syntax error. […] Whitespace is sometimes not required in the conditional expression `1if True else 0`”
[Figure: the Ranker maps each query–document pair to a score]
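A functional sketch of this view, where `score` is a hypothetical stand-in for any query–document scoring model discussed in this lecture:

```python
def rank(query, documents, score):
    # Score every query–document pair, then sort the results by decreasing score.
    return sorted(documents, key=lambda doc: score(query, doc), reverse=True)
```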
Query–Document Interaction Models
1. Tokenize the query and the document
2. Embed all the tokens of each
3. Build a query–document interaction matrix
– Most commonly: store the cos similarity of each pair of words
4. Reduce this dense matrix to a score
– Learn neural layers (e.g., convolution, linear layers); see the sketch below
[Figure: query and document embeddings → interaction matrix → Convolution → AvgPool → MLP → score s]
Bhaskar Mitra and Nick Craswell. An Updated Duet Model for Passage Re-ranking. arXiv:1903.07666 (2019)
Sebastian Hofstätter, et al. On the effect of low-frequency terms on neural-IR models. SIGIR’19
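A minimal sketch of steps 1–4 above, assuming PyTorch; the embedding table, convolution size, pooling, and output layer dimensions here are illustrative assumptions, not the exact architectures of the cited models.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class InteractionRanker(nn.Module):
    """Illustrative query–document interaction model (not a specific published system)."""
    def __init__(self, vocab_size=30000, dim=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)              # step 2: embed all tokens
        self.conv = nn.Conv2d(1, 8, kernel_size=3, padding=1)   # step 4: convolution over the matrix
        self.mlp = nn.Linear(8, 1)                               # step 4: reduce pooled features to a score

    def forward(self, query_ids, doc_ids):
        # query_ids: (|q|,) token ids; doc_ids: (|d|,) token ids (step 1, tokenization, assumed done)
        q = F.normalize(self.embed(query_ids), dim=-1)           # (|q|, dim)
        d = F.normalize(self.embed(doc_ids), dim=-1)             # (|d|, dim)
        sim = q @ d.T                                            # step 3: cosine-similarity interaction matrix
        feats = self.conv(sim[None, None])                       # (1, 8, |q|, |d|)
        pooled = feats.mean(dim=(2, 3))                          # average pooling over the matrix
        return self.mlp(pooled).squeeze()                        # score s
```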
All-to-all Interaction with BERT
1. Feed BERT “[CLS] Query [SEP] Document [SEP]”
2. Run this through all the BERT layers, producing the score s (see the sketch below)
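A minimal sketch of this all-to-all setup, assuming the Hugging Face transformers library and a generic bert-base-uncased checkpoint with a one-output classification head; a real re-ranker would first be fine-tuned on relevance labels (e.g., MS MARCO).

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=1)

query = "is whitespace required in a python conditional expression"
document = "Whitespace is sometimes not required in the conditional expression, e.g. 1if True else 0"

# 1. Build "[CLS] Query [SEP] Document [SEP]" (the tokenizer inserts the special tokens).
inputs = tokenizer(query, document, return_tensors="pt", truncation=True)

# 2. Run the pair through all BERT layers; the head on the [CLS] output gives the score s.
with torch.no_grad():
    s = model(**inputs).logits.squeeze()
print(float(s))
```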
MS MARCO Ranking screenshot as of Jan 2019. From Rodrigo Nogueira’s Brief History of DL applied to IR (UoG talk).
https://blog.google/products/search/search-language-understanding-bert/
https://azure.microsoft.com/en-us/blog/bing-delivers-its-largest-improvementin-search-experience-using-azure-gpus/
BERT Rankers: Efficiency–Effectiveness Tradeoff
Rodrigo Nogueira and Kyunghyun Cho. Passage Re-ranking with BERT. arXiv:1901.04085 (2019)
Toward Faster Ranking: Pre-computation
▪ BERT rankers are slow because their computations can be redundant (see the sketch below):
– Represent the query (1000 times for 1000 documents)
– Represent the document (once for every query!)
– Conduct matching between the query and the document
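A rough sketch of what pre-computation buys, under the assumption of a simple single-vector design; `encode` is a hypothetical encoder stand-in, not a model from the lecture.

```python
import torch

def build_document_index(documents, encode):
    # Offline: represent every document once and store the vectors,
    # instead of re-encoding each document for every query.
    return torch.stack([encode(doc) for doc in documents])

def search(query, doc_vectors, encode, k=10):
    # Online: represent the query once (not once per candidate document),
    # then score all documents with cheap dot products.
    q = encode(query)
    scores = doc_vectors @ q
    return scores.topk(min(k, len(scores)))
```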
Neural IR Paradigms: Learning term weights
▪ BM25 decomposed a document’s score into a summation over
term–document weights. Can we learn term weights with BERT?
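For reference, the standard BM25 form of this decomposition (with the usual k_1 and b hyperparameters) is:

```latex
% Document score as a sum of per-term weights; w_{t,d} is the weight of term t in document d.
\[
\mathrm{score}(q, d) \;=\; \sum_{t \in q} w_{t,d},
\qquad
w_{t,d} \;=\; \mathrm{IDF}(t)\cdot
\frac{\mathrm{tf}_{t,d}\,(k_1 + 1)}{\mathrm{tf}_{t,d} + k_1\left(1 - b + b\,\frac{|d|}{\mathrm{avgdl}}\right)}
\]
```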
Zhuyun Dai and Jamie Callan. Context-aware term weighting for first stage passage retrieval. SIGIR’20
Rodrigo Nogueira and Jimmy Lin. From doc2query to docTTTTTquery. Online preprint (2019)
Antonio Mallia, et al. Learning Passage Impacts for Inverted Indexes. SIGIR’21
Learning term weights
▪ We get to learn the term weights with BERT and to re-use them!
▪ But our query is back to being a “bag of words”.
Can we do better?
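As a rough sketch of this re-use: the learned term–document weights can be stored in an inverted-index-style structure and simply summed over the (bag-of-words) query at search time. The toy weights and layout below are illustrative assumptions, not the exact format of the cited systems.

```python
# Offline: a model such as BERT assigns a weight to each term in each document,
# and the weights are stored in an inverted index: term -> {doc_id: weight}.
inverted_index = {
    "whitespace": {"D99": 2.3, "D7": 0.4},
    "python":     {"D99": 1.1, "D12": 1.8},
}

def score(query_terms, index):
    # Online: the query is a bag of words; the score is a simple sum of the stored
    # term–document weights, so no BERT call is needed per query–document pair.
    scores = {}
    for term in query_terms:
        for doc_id, weight in index.get(term, {}).items():
            scores[doc_id] = scores.get(doc_id, 0.0) + weight
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

print(score(["python", "whitespace"], inverted_index))
```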
Next: Can we achieve high MRR and low latency?
– Representation Similarity
– Late Interaction
References
Omar Khattab and Matei Zaharia. ColBERT: Efficient and effective passage search via contextualized late interaction over BERT. SIGIR’20
Chenyan Xiong, et al. End-to-end neural ad-hoc ranking with kernel pooling. SIGIR’17
Zhuyun Dai, et al. Convolutional neural networks for soft-matching n-grams in ad-hoc search. WSDM’18
Bhaskar Mitra, et al. Learning to match using local and distributed representations of text for web search. WWW’17
Bhaskar Mitra and Nick Craswell. An Updated Duet Model for Passage Re-ranking. arXiv:1903.07666 (2019)
Sebastian Hofstätter, et al. On the effect of low-frequency terms on neural-IR models. SIGIR’19
Zhuyun Dai and Jamie Callan. Deeper Text Understanding for IR with Contextual Neural Language Modeling. SIGIR’19
Rodrigo Nogueira. A Brief History of Deep Learning applied to Information Retrieval (UoG talk). Retrieved from
https://docs.google.com/presentation/d/1_mlvmyev0pjdG0OcfbEWManRREC0jCdjD3b1tPPvcbk
Zhuyun Dai and Jamie Callan. Context-aware term weighting for first stage passage retrieval. SIGIR’20
Rodrigo Nogueira and Jimmy Lin. From doc2query to docTTTTTquery. Online preprint (2019)
Antonio Mallia, et al. Learning Passage Impacts for Inverted Indexes. SIGIR’21