Part of Speech Tagging and Named Entity Recognition
Parts of Speech
From the earliest linguistic traditions (Yaska and Panini, 5th c. BCE; Aristotle, 4th c. BCE) comes the idea that words can be classified into grammatical categories:
• parts of speech, word classes, POS, POS tags
Eight parts of speech are attributed to Dionysius Thrax of Alexandria (c. 1st c. BCE):
• noun, verb, pronoun, preposition, adverb, conjunction, participle, article
These categories are still relevant for NLP today. Words fall into two broad classes: open vs. closed.
Closed class words
• Relatively fixed membership
• Usually function words: short, frequent words with grammatical function
• determiners: a, an, the
• pronouns: she, he, I
• prepositions: on, under, over, near, by, …

Open class words
• Usually content words: nouns, verbs, adjectives, adverbs
• Plus interjections: oh, ouch, uh-huh, yes, hello
• New nouns and verbs enter constantly: iPhone, to fax

Examples by class:

Open class ("content") words
• Nouns – proper: Janet, Italy; common: cat, cats, mango
• Verbs – main: eat, went
• Adjectives: old, green, tasty
• Adverbs: slowly, yesterday
• Interjections: Ow, hello
• Numbers: 122,312, one
• … and more

Closed class ("function") words
• Auxiliaries: can, had
• Determiners: the, some
• Prepositions: to, with
• Conjunctions: and, or
• Particles: off, up
• Pronouns: they, its
• … and more
Part-of-Speech Tagging
Assigning a part of speech to each word in a text. Words often have more than one POS. For example, book:
• VERB: Book that flight
• NOUN: Hand me that book.
Formally, POS tagging maps a sequence x1, …, xn of words to a sequence y1, …, yn of POS tags. Tags here follow the "Universal Dependencies" tagset (Nivre et al. 2016).

Sample "tagged" English sentences:
There/PRO were/VERB 70/NUM children/NOUN there/ADV ./PUNC
Preliminary/ADJ findings/NOUN were/AUX reported/VERB in/ADP today/NOUN ’s/PART New/PROPN England/PROPN Journal/PROPN of/ADP Medicine/PROPN
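As a concrete illustration of this mapping, the sketch below tags a short sentence with NLTK's pretrained English tagger. This is a minimal sketch, not the method of these slides: it assumes nltk is installed with its tagger and universal-tagset resources downloaded, and NLTK's "universal" tags are an older scheme that differs in places from UD, so the output shown is only illustrative.

```python
# A minimal sketch of POS tagging as a sequence mapping x1..xn -> y1..yn,
# using NLTK's pretrained tagger (assumes the tagger and universal_tagset
# resources have already been fetched via nltk.download).
import nltk

words = ["Preliminary", "findings", "were", "reported", "today", "."]
tagged = nltk.pos_tag(words, tagset="universal")
print(tagged)
# Illustrative output (may vary by NLTK version/model):
# [('Preliminary', 'ADJ'), ('findings', 'NOUN'), ('were', 'VERB'),
#  ('reported', 'VERB'), ('today', 'NOUN'), ('.', '.')]
```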
Why Part of Speech Tagging?
◦ Can be useful for other NLP tasks:
◦ Parsing: POS tagging can improve syntactic parsing
◦ MT: reordering of adjectives and nouns (say, from Spanish to English)
◦ Sentiment or affective tasks: may want to distinguish adjectives or other POS
◦ Text-to-speech (how do we pronounce "lead" or "object"?)
◦ Or for linguistic or language-analytic computational tasks:
◦ Need to control for POS when studying linguistic change, like the creation of new words or meaning shift
◦ Or control for POS in measuring meaning similarity or difference

How difficult is POS tagging in English?
Roughly 15% of word types are ambiguous
• Hence 85% of word types are unambiguous
• Janet is always PROPN, hesitantly is always ADV
But those 15% tend to be very common, so ~60% of word tokens are ambiguous. E.g., back:
• earnings growth took a back/ADJ seat
• a small building in the back/NOUN
• a clear majority of senators back/VERB the bill
• enable the country to buy back/PART debt
• I was twenty-one back/ADV then

POS tagging performance in English
How many tags are correct? (Tag accuracy)
◦ About 97%
◦ Hasn't changed in the last 10+ years
◦ HMMs, CRFs, and BERT perform similarly
◦ Human accuracy is about the same
But the baseline is already 92%!
◦ A baseline is the performance of the stupidest possible method
◦ The "most frequent class" baseline is an important baseline for many tasks:
◦ Tag every word with its most frequent tag
◦ (and tag unknown words as nouns)
◦ (a code sketch of this baseline appears below, after the NER overview)
◦ Partly easy because many words are unambiguous

Sources of information for POS tagging
Janet will back the bill (will: AUX, NOUN, or VERB? back: NOUN or VERB?)
• Prior probabilities of word/tag: "will" is usually an AUX
• Identity of neighboring words: "the" means the next word is probably not a verb
• Morphology and wordshape:
◦ Prefixes: unable: un- → ADJ
◦ Suffixes: importantly: -ly → ADV
◦ Capitalization: Janet: CAP → PROPN

Standard algorithms for POS tagging
Supervised machine learning algorithms:
• Hidden Markov Models
• Conditional Random Fields (CRFs) / Maximum Entropy Markov Models (MEMMs)
• Neural sequence models (RNNs or Transformers)
• Large Language Models (like BERT), finetuned
All require a hand-labeled training set, and all reach about equal performance (97% on English). All make use of the information sources we discussed:
• via human-created features: HMMs and CRFs
• via representation learning: neural LMs

Named Entity Recognition (NER)
◦ A named entity, in its core usage, is anything that can be referred to with a proper name. The most common four tags:
◦ PER (Person): "Marie Curie"
◦ LOC (Location): "New York City"
◦ ORG (Organization): "Stanford University"
◦ GPE (Geo-Political Entity): "India", "Colorado"
◦ Often multi-word phrases
◦ But the term is also extended to things that aren't entities: dates, times, prices

Named entity tagging
The task of named entity recognition (NER):
• find the spans of text that constitute proper names
• tag the type of each entity

Why NER?
• Sentiment analysis: what is a consumer's sentiment toward a particular company or person?
• Question answering: answer questions about an entity
• Information extraction: extract facts about entities from text

Why NER is hard
1) Segmentation
• In POS tagging there is no segmentation problem, since each word gets one tag.
• In NER we have to find and segment the entities!
2) Type ambiguity
• The same name can refer to entities of different types (e.g., "Washington" can name a person, a location, or an organization).
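Returning to the most-frequent-class baseline promised above: here is a minimal sketch, assuming a toy list of (word, tag) pairs stands in for a real hand-labeled training corpus.

```python
# Most-frequent-class baseline: tag each word with the tag it received most
# often in training; tag unknown words as NOUN, as described above.
# The training data here is a toy assumption, not a real corpus.
from collections import Counter, defaultdict

train = [("the", "DET"), ("back", "VERB"), ("back", "NOUN"),
         ("back", "NOUN"), ("bill", "NOUN"), ("will", "AUX")]

counts = defaultdict(Counter)
for word, tag in train:
    counts[word][tag] += 1

most_frequent = {w: c.most_common(1)[0][0] for w, c in counts.items()}

def baseline_tag(words):
    # Unknown words default to NOUN, per the slide.
    return [(w, most_frequent.get(w, "NOUN")) for w in words]

print(baseline_tag(["the", "back", "seat"]))
# [('the', 'DET'), ('back', 'NOUN'), ('seat', 'NOUN')]
```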
BIO Tagging
How can we turn this structured problem into a sequence problem like POS tagging, with one label per word?

[PER Jane Villanueva] of [ORG United], a unit of [ORG United Airlines Holding], said the fare applies to the [LOC Chicago] route.
Now we have one tag per token!!!
BIO Tagging
• B: token that begins a span
• I: tokens inside a span
• O: tokens outside of any span
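A minimal sketch of the BIO conversion for the sentence above, assuming a hypothetical input format of a token list plus (start, end, type) spans with exclusive end indices; real corpora encode spans in various ways.

```python
# Convert entity spans to per-token BIO tags: B- at a span's first token,
# I- inside it, O everywhere else. The (start, end_exclusive, type) span
# format is an assumed convention for this sketch.
def to_bio(tokens, spans):
    tags = ["O"] * len(tokens)
    for start, end, etype in spans:
        tags[start] = "B-" + etype
        for i in range(start + 1, end):
            tags[i] = "I-" + etype
    return list(zip(tokens, tags))

tokens = ["Jane", "Villanueva", "of", "United", ",", "a", "unit", "of",
          "United", "Airlines", "Holding", ",", "said", "the", "fare",
          "applies", "to", "the", "Chicago", "route", "."]
spans = [(0, 2, "PER"), (3, 4, "ORG"), (8, 11, "ORG"), (18, 19, "LOC")]

for token, tag in to_bio(tokens, spans):
    print(token, tag)
# Jane B-PER / Villanueva I-PER / of O / United B-ORG / ... / Chicago B-LOC
```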
Number of tags (where n is the number of entity types):
• 1 O tag, n B tags, and n I tags: 2n + 1 tags in total (e.g., with the four types above, n = 4 gives 9 tags)

BIO tagging variants: IO and BIOES
[PER Jane Villanueva] of [ORG United], a unit of [ORG United Airlines Holding], said the fare applies to the [LOC Chicago] route.

Standard algorithms for NER
Supervised machine learning, given a human-labeled training set of text annotated with tags:
• Hidden Markov Models
• Conditional Random Fields (CRFs) / Maximum Entropy Markov Models (MEMMs)
• Neural sequence models (RNNs or Transformers)
• Large Language Models (like BERT), finetuned

Part of Speech Tagging Techniques

How hard is the tagging problem? Consider the word that:
• as a determiner (followed by a noun): Give me that hammer.
• as a demonstrative pronoun (without a following noun): Who gave you that?
• as a conjunction (connecting two clauses): I didn't know that she was married.
• as a relative pronoun (forming the subject, object, or complement of a relative clause): It's a song that my mother taught me.
• as an adverb (before an adjective or adverb): Three years? I can't wait that long.
(The original slide also shows a table of the number of word types in the Brown corpus by degree of ambiguity.)
• Many of the 40% ambiguous tokens are easy to disambiguate, because:
– the various tags associated with a word are not equally likely
– e.g., "a" can be a determiner or a letter (perhaps as part of an acronym), but the determiner sense is much more likely

Many tagging algorithms fall into two classes:
◦ Rule-based taggers: involve a large database of hand-written disambiguation rules specifying, for example, that an ambiguous word is a noun rather than a verb if it follows a determiner.
◦ Stochastic taggers: resolve tagging ambiguities by using a training corpus to count the probability of a given word having a given tag in a given context.
The Brill tagger, also called the transformation-based tagger, shares features of both architectures.

Rule-Based Part-of-Speech Tagging
The earliest algorithms for automatically assigning POS were based on a two-stage architecture:
◦ First, use a dictionary to assign each word a list of potential POS tags.
◦ Second, use large lists of hand-written disambiguation rules to winnow this list down to a single POS tag for each word.
The ENGTWOL tagger (1995) is based on the same two-stage architecture, with a much more sophisticated lexicon and disambiguation rules than earlier systems.
◦ Lexicon: about 56,000 entries
◦ A word with multiple POS is counted as separate entries
In the first stage of the tagger:
◦ each word is run through the two-level lexicon transducer, and
◦ the entries for all possible POS are returned.
A set of about 1,100 constraints is then applied to the input sentences to rule out incorrect POS.
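Before looking at a sample constraint, here is a minimal sketch of the first (dictionary) stage. The three-entry lexicon, the tag names, and the fallback guess for unknown words are all toy assumptions standing in for ENGTWOL's roughly 56,000-entry lexicon.

```python
# Stage 1 of a rule-based tagger: look up each word's set of candidate POS
# tags in a lexicon (toy stand-in for ENGTWOL's ~56,000 entries).
LEXICON = {
    "that": {"DET", "PRON", "SCONJ", "ADV"},
    "wait": {"VERB", "NOUN"},
    "long": {"ADJ", "ADV", "VERB"},
}

def candidate_tags(word):
    # Unknown words get an open-class fallback (here simply NOUN).
    return LEXICON.get(word.lower(), {"NOUN"})

sentence = ["I", "can't", "wait", "that", "long", "."]
print([sorted(candidate_tags(w)) for w in sentence])
```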
A simplified version of one such constraint:

ADVERBIAL-THAT RULE
Given input: "that"
  if (+1 A/ADV/QUANT);   /* if the next word is an adjective, adverb, or quantifier */
     (+2 SENT-LIM);      /* and the word after that is a sentence boundary, */
     (NOT -1 SVOC/A);    /* and the previous word is not a verb like 'consider' */
                         /* which allows adjectives as object complements */
  then eliminate non-ADV tags
  else eliminate ADV tags
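A minimal Python sketch of this constraint, operating on the per-word candidate tag sets from the stage-1 sketch above. The tag names, the punctuation-based SENT-LIM test, and the tiny SVOC/A verb list are all simplified assumptions, not ENGTWOL's actual formalism.

```python
# Apply the ADVERBIAL-THAT rule: if "that" is followed by an adjective,
# adverb, or quantifier and then a sentence boundary, and is not preceded
# by a verb like "consider", keep only ADV; otherwise eliminate ADV.
ADJ_ADV_QUANT = {"ADJ", "ADV", "QUANT"}
SVOC_A_VERBS = {"consider", "deem", "find"}   # assumed tiny SVOC/A list
SENT_LIM = {".", "!", "?"}                    # sentence-boundary proxy

def adverbial_that_rule(words, candidates, i):
    next_adjish = i + 1 < len(words) and candidates[i + 1] & ADJ_ADV_QUANT
    then_boundary = i + 2 >= len(words) or words[i + 2] in SENT_LIM
    prev_svoc = i > 0 and words[i - 1].lower() in SVOC_A_VERBS
    if next_adjish and then_boundary and not prev_svoc:
        return candidates[i] & {"ADV"}        # eliminate non-ADV tags
    return candidates[i] - {"ADV"}            # eliminate ADV tags

words = ["I", "can't", "wait", "that", "long", "."]
cands = [{"PRON"}, {"AUX"}, {"VERB"},
         {"DET", "PRON", "SCONJ", "ADV"}, {"ADJ", "ADV"}, {"."}]
print(adverbial_that_rule(words, cands, 3))   # -> {'ADV'}
```

Here "that" sits before the adjective "long", which is followed by a sentence boundary, so the rule keeps only the ADV reading, matching the I can't wait that long example above.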