0% found this document useful (0 votes)

81 views5 pages

Entity Recognition in Assamese Text: Abstract - Entity Recognition Detects All The Entities Present

This document discusses entity recognition in Assamese text. It introduces entity recognition using conditional random fields for the Assamese language. It describes some challenges of entity recognition in Assamese, including its free word order, lack of resources, ambiguity of names, agglutinative nature, spelling variations, lack of capitalization, nested entities, and unique features of Assamese grammar. The proposed system combines preprocessing of Assamese text with entity recognition using conditional random fields and natural language toolkits.

Uploaded by

suy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

81 views5 pages

Entity Recognition in Assamese Text: Abstract - Entity Recognition Detects All The Entities Present

Uploaded by

suy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

ENTITY RECOGNITION IN ASSAMESE TEXT

Nandana Mahanta, Sourish Dhar, Sudipta Roy

Department of CSE,
Assam University, Silchar,
Assam, India
Email: {1nandana.mahanta, dharsourish, sudipta.it}@gmai1.com

Abstract— Entity Recognition detects all the entities present Assamese is an Indic or Indo-Aryan language (branch of
in a document to improve the performance of some high level Indo European language family) spoken mainly in the state of
Natural Language Processing (NLP) tasks like Question Assam, where it is an official language. Assamese is spoken
Answering, Auto Summarization, Machine Translation, by over 30 million people in North East India. Assamese is a
Information Extraction. The task is subdivided into two parts:
national language of India but with a limited computational
Parts of Speech Tagging (POS) and Entity Recognition. Each
sentence is annotated with part-of-speech tags and then the linguistic work [3, 4].
proper nouns are again classified with our own entity tag set. We have used Conditional Random Field (CRF), a
This paper introduces Entity Recognition in Assamese Text using machine learning approach for our Entity Recognition task.
Conditional Random Fields (CRF). Results are measured with F- Although a lot of work for IE and its subtasks has been done
measure metric for each different entity class. in English and other foreign languages like Spanish, German
and Chinese with high accuracy but for Indian languages not
Keywords—POS tagging; Entity Recognition; CRF; Assamese much work have been done yet. Ours is the first work on
Language Entity Recognition for Assamese.

I. INTRODUCTION II. ISSUES REGARDING ENTITY RECOGNITION IN

Natural Language Processing (NLP)is a field of computer ASSAMESE
science, artificial intelligence, and computational linguistics Assamese is written using the Assamese script, similar to
ZKLFK GHDOV ZLWK DQDO\]LQJ XQGHUVWDQGLQJ DQG that of Bengali except the symbols for ৰ/ra/ and ৱ/wa/, and
JHQHUDWLQJ WKH KXPDQ QDWXUDO ODQJXDJHV LQ RUGHU highly resembles the Devanagiri script of Hindi, Sanskrit and
WRLQWHUIDFHLWZLWKFRPSXWHUVLQERWKZULWWHQDQG other related Indic languages [11]. There are various issues
VSRNHQ FRQWH[WV LQVWHDG RI FRPSXWHU ODQJXDJHV related to Assamese entity Recognition. Many of these issues
Parts Of Speech tagging (POS tagging) and Named Entity are general to other NLP tasks and not specific to Assamese.
Recognition (NER) both are individual tasks in the field of A. Sentence structure
1/3 >@ Assamese is a relatively free word order language. The
Entity Recognition is a subtask of Information Extraction basic structure of an Assamese sentence is Subject + Object +
(IE) [1, 2] which is associated with the problem of text Verb (SOV). But SVO, OSV, OVS, VOS and VSO can also
simplification in order to form an organized view of the result the same meaning. Thus sentence formation in
information present in free text. The aim of Entity Recognition
Assamese is diverse in nature [5, 7].
is to create a more easily machine-readable text to process the
sentence. B. Scarcity of resources
Let’s look into an Assamese sentence to understand the Although Assamese is a language spoken by about 15
difference between Entity Recognition and NER. million people in the Indian state of Assam as a first language,
the development of electronic resources for the language has
ডাঃ ৰােম এ.িব.আই.ৰ ১০০টা য়াৰ িকিনিছল । been lagging behind compared to many Indian languages [5].
In the above sentence NER will identify only ৰােম[PER] Entity Recognition requires a large data set. But very few
corpora of Assamese are publicly available and most of them
এ.িব.আই.[ORG]. So if we have a question like “What did Ram are driven by specific agenda. Not much NLP work has been
buy?”, then we can’t answer it only with the help of NER. done for Assamese and other north eastern Indic languages.
On the other hand with the help of POS tagging C. Ambiguity
(supervised) [9] Entity Recognition will tag the above
sentence as There exist an ambiguity in the names of peoples since
names of people are usually dictionary words, unlike Western
ডাঃ[P_NOM] ৰােম[PER] এ.িব.আই.[ORG]ৰ[SFX] ১০০[NUM] names. For example AKAAX, JON means sky and moon
টা[SFX] য়াৰ[CN] িকিনিছল[VB] ।[PUN] respectively in Assamese, but also can indicate person names
thus creating ambiguities between common noun and proper
Thus Entity Recognition will improve the performance of
noun [6].
high level NLP tasks.
D. Agglutinative nature Plural numbers can be formed by adding suffixes to the
Assamese language suffers from agglutination and singular forms of nouns [ৰামহঁ ত, /ramhot/ ram (ram) -hɔt(-PL)
complex words are created by adding additional features to ‘Ram and others’]. This feature has not been seen in Hindi or
change the meaning of the word. For example, অসম (Assam) is Bengali languages.
the name of a place which is a location named entity but অসমীয়া
(Assamiya) is produced by adding suffix ইয়া (IYA) to অসম Pronouns are also made plural with suffixation [িসহঁ ত,
/xihɔt/ xi (he) -hɔt(- PL) ‘They’]. They can also be expressed
(Assam) which signifies people residing in Assam which is
by adding qualifying words, in which case no suffix is added
not a location named entity [8].
[ব ত মানুহ /bɔhut manuh/ bɔhut (many) manuh (man) ‘Many
E. Spelling Variation men’].
Changes in the spelling of proper names are another
problem in Assamese Named Entity Recognition. For C. Inflection of Adjectives
example, in চা (Shree Shreesanth) there is a confusion
whether (Shree) in চা (Shreesanth) is a Pre-nominal word Assamese adjective is basically not inflected, but
or person named entity [6]. sometimes when adjectives are inflected, then they take the
noun form [5, 7]. For example
F. Lack of captalization
In English capitalization plays an important role in ধুনীয়াজনী হ আিহছা ।
identifying the named entities. But there is no concept of Here ধুনীয়া (dhunia: beautiful) is an adjective, but after adding
capitalization in Assamese language thus making it difficult to feminine definitive জনী (zɒni) the whole constituent becomes
identify the proper noun [6].
a noun word.
G. Nested Entities
III. PROPOSED WORK
When two or more proper nouns are present then it
becomes difficult to assign the proper named entity class. For Our proposed system is combination of preprocessing of
example, in গৗহা িব িবদ ালয় (Gauhati bishabidyaly) গৗহা Assamese text and Entity Recognition.
The system is developed in Python language and Natural
(Gauhati) is a location named entity and িব িবদ ালয়
Language Toolkit (NLTK).
(bishabidyalay) refers to organization thus creating problem in
assigning the proper class.
Raw
Some issues which we have seen in Assamese language (Unstructured)
are not present in Hindi, Bengali or other Indic languages. Assamese Text
They are:

A. Negation of verbs
Preprocessing Tokenization
The procedure of negation of verbs in Assamese language Of
is a unique feature which clearly distinguishes it from the rest Raw text
of the Indo-Aryan and other Dravidian languages. In POS Tagging
Assamese ন/n/ is pre-fixed to the verb followed by a vowel
which is the exact copy of the vowel of the first syllable of the
Entity Recognition
verb, as in নালােগ/nalage/ meaning ‘do not want’ (1st, 2nd, 3rd using CRF
person). The various negative markers in Assamese are ন/n/,
না/no/, না/na/, ন/ne/ and িন/ni/ etc. [5].
Tagged
B. Use of plural suffixation (Structured)
Assamese Text
The use of the plural suffixes is another feature of
Assamese. In Assamese nouns, pronouns are generally Figure 1: System Model
inflected for number, gender and case. For instance, all the
bound forms such as হতঁ /hɔt/, বাৰ/bur/, িবলাক/bilak/, Preprocessing: Preprocessing task consists of loading text in
মখা/mokha/, জাক /zak/, সকল/xɔkɔl/ etc denote plurality and the system, Tokenization and POS tagging.
Loading text means loading the Assamese text file from
are suffixed to a noun or a pronoun [5,7].
the system. After that the text file is tokenized i.e. each word,
symbol and punctuation mark are separated. Then the system
will tag each token of the tokenized file with the most We have used the following features while training the
appropriate POS tag. model.
To train the model for POS tagging we have used the
A. POS tag
partially annotated corpora from Research Centre for Indian
Language Technology Solution (RCILTS). The remaining POS tag is the most important feature in our system. It
unannotated corpora is again manually tagged (30,000 words). gives the parts-of-speech information about a token which is
The training dataset consists of 90k annotated words. We have very much helpful in our Entity Recognition system. With
used Stanford POS Tagger and the POS tag set described by help of parts-of-speech information the relation between 2
Pallav Kumar Dutta [5]. The considered tag set along with its entities can be found out.
meaning is described in Table-1. B. Word Prefix and Suffix
TABLE 1: POS TAG SET The starting and ending characters of a token play
important role in tagging. Suffixation of nouns is very
Tag Description
extensive in Assamese. There are more than 100 suffixes for
NN Noun
NNPC Compound Proper Noun the Assamese noun [7].
NNP Proper Noun
C. N-gram
NLOC Noun Location
NVB Noun in Kriya (verb) Mula N-grams have been widely investigated for a number of
PRP Pronoun text processing and retrieval applications. An n-gram is a
CC Conjunction contiguous sequence of n-items from a given sequence of text
INTF Intensifier
JJ Adjective
or speech. The items can be phonemes, syllables, letters,
JVB Adjective in Kriya (verb) Mula words or base pairs according to the application. The n-grams
NEG Negative typically are collected from a text or speech corpus [12]. We
PSP Post-position have considered letters of a word as n-gram (n=6) feature for
PUNC Punctuation the system.
QF Quantifier
QFNUM Number Quantifier D. Context Words
QW Question Word
RB Adverb
We have considered the previous and next tag of current
RBVB Adverb in Kriya (verb) Mula word. In Assamese although sentence structure is not similar
RP Particle like English and other European language still we can get
SYM Symbol some information about the type of a current token by
UH Interjection Word observing its surrounding tokens.
VAUX Auxiliary Verb
VAUXN Negative Verb Auxiliary
Using our own entity classes and with the help of POS
VFM Verb Finite Main tagging each entity will be tagged with some categories based
VFMN Negative Verb Finite Main on the appropriate meaning in the text. Our considered tag set
VJJ Verb Non-Finite Adjectival for Entity recognition is given below-
VJJN Negative Verb Non-Finite Adjectival
VNN Verb Non-Finite Nominal TABLE 2: ENTITY TAG SET
VNNN Negative Verb Non-Finite Nominal
VRB Verb Non-Finite Adverbial Tag Description Example
VRBN Negative Verb Non-Finite Adverbial PER Single word person name ৰাজীৱ/PER
VNF Non-Finite Verb LOC Single word location name কিলকতা/LOC
VNFN Negative Non-Finite Verb ORG Single word organization name কংে ছ/ORG
Source: POS tag set of Pallav Kumar Dutta [5] B-PER Beginning, Internal or End of a মাহনদাস/B-PER
I-PER multiword person name কৰমচা /I-PER
Entity Recognition: RCILTS corpora is not rich with proper E-PER
গা ী/E-PER
nouns since it is developed for POS tagging. To bridge this B-LOC Beginning, Internal or End of a মহা া/B-LOC গা ী/I-
gap we have manually collected around 30 articles (around I-LOC multiword location name LOC পথ/E-LOC
3000 sentences) of Assamese text from online Assamese E-LOC
B-ORG Beginning, Internal or End of a মহাকাশ /I-ORG
Wikipedia1 for our Entity Recognition task. These articles are I-ORG multiword organization name গেৱষণা /I-ORG
again manually tagged to train the system and 20% of this E-ORG
সং া/E-ORG
annotated data is kept aside for testing.
NUM Number ১০০
DATE Date ১২ বহাগ/DATE
With the remaining 80% data we trained our Entity
ABV Abbreviation ড°/ABV
Recognition system using CRF. CRF combines the advantages
of discriminative classification and graphical modeling and
results more accurate conditional model which has much IV. RESULT ANALYSIS
simpler structure than a joint model [10].
After finishing classification task, we tested our system
with the 20% data (600 sentences) which we kept aside earlier.
So far as we know Entity Recognition task has not been done
till date in case of Assamese language. So, we could not PERSON 112 0.9035
compare our result with any existing system.
LOCATION 61 0.6244
Result Evaluation Parameters:
ORGANIZATION 20 .8669
Precision = True Positive/ (True Negative + False Positive)
Recall = True Positive/ (True Positive + False Negative)
F-Score = 2* Precision* Recall/ (Precision + Recall) NUMBER 79 0.6218

For each entity class the F-score is listed in Table 4. ABBRVIATION 43 0.8656

DATE 67 0.6577
TABLE 3: CLASSIFIER RESULT ON TEST DATA SET

ENTITY PRECISION RECALL

V. CONCLUSION AND FUTURE WORK

PERSON 0.9577 0.8551
In this paper we briefly discussed about our proposed
Entity Recognition system for Assamese text and different
LOCATION 0.8821 0.4074
components to develop this system. We think with better
resources and varied dataset in Assamese language this result
ORGANIZATION 0.9218 0.8182 can be optimized.
We got maximum of 0.9577 and minimum of 0.7561
NUMBER 0.8219 0.5000 precision values for PERSON and DATE entity class
respectively and maximum 0.8551 and minimum 0.5820 recall
values for these two entity classes. Since ours is the first
ABBREVIATION 0.9122 0.8235
system for Entity Recognition in Assamese language we are
not able to compare our results. We hope this concept of
DATE 0.7561 0.5820 Entity recognition will be make a huge difference in high level
NLP task for Assamese and other Indian languages.
REFERENCES
[1] Tang, J., Hong, M., Zhang, D., Liang, B. and Li, J., 2007, Information
Extraction: Methodologies and Applications, in Prado, H. A. D. and
Ferneda, E., eds., Emerging Technologies of Text Minig: Techniques and
Applications, IGI Global, New York, p. 1-33.
[2] Gupta, V. and Lehaal, G. S., 2009, A survey of Text Minig: Techniques
and Applications, Journal of Emerging Technologies in Web
Intelligence, vol. 1, No. 1, p. 60-76.
[3] Rahman, M., Das, S. and Sharma, U., 2009, Parsing of part-of-speech
tagged Assamese Texts, ,-&6, ,QWHUQDWLRQDO -RXUQDO RI
&RPSXWHU6FLHQFH,VVXHV9RO1RS
[4] Assamese Website, http://www.iitg.ernet.in/rcilts/pdf/assamese.pdf,
(August 4, 2016).
[5] Dutta, P. K., An Online Semi Automated Part of Speech Tagging
Technique Applied to Assamese, PhD Thesis, Dept. of CSE, Indian
Institute of Technology Guwahati, Guwahati – 781039, Assam, India,
December 2013.
[6] Sharma, P., Sharma, U. and kalita, J., Named Entity Recognition: A
Survey for the Indian Languages, 2010, National Seminar on Lexical
Resources and Computational Techniques on Indian Langusges,
Pondicherry.
FIGURE 2: PRECISION VS RECALL GRAPH ON ENTITY RESULT
[7] Saharia, N., Computational Morphology and Syntax for a Resource-Poor
Inflectional Language, PhD Thesis, Dept. of CSE, School of
TABLE 4: ENTITY EXTRACTED AND F-SCORE VALUE Engineering, Tezpur University, Tezpur, Assam, India – 784028,
January 2014.
ENTITY NUMBER OF F-SCORE [8] Talukdar, G., Borah, P. P. and boruah, A., 2014, Supervised Named
EXTRACTION Entity Recognition in Assamese language, IC3I International
Conference on Contemporary Computing and Informatics, Mysore.
[9] Guilder, L. V., Automated Part of Speech Tagging: A Brief Overview,
Handout for LlNG361, Fall 1995, Georgetown University.
[10] Sutton, C. and McCallum, A., 2007, An Introduction to Conditional A Hybrid Approach, International Journal of Computer Applications,
Random Fields for Relational Learning, in Getoor, L., ed., Introdcution vol. 84, No. 9, p. 31-35
to Statistical Relational Learning, MIT Press, p. 93-123
[11] Assamese design Guide,
http://www.iitg.ernet.in/rcilts/phaseI/newassamesedesign.pdf, (August 4,
2016).
[12] Dey, A. and Purkayastha, B. S., 2013, Named Entity Recognition using
Gazetteer Method and N-gram Technique for an Inflectional Language:

Affixes Activity
67% (3)
Affixes Activity
2 pages
Toeic Part5 Integrale Key1
67% (3)
Toeic Part5 Integrale Key1
23 pages
Corpus Linguistics 1
No ratings yet
Corpus Linguistics 1
48 pages
Parts of Speech Activities
100% (4)
Parts of Speech Activities
67 pages
Learning Basic English Grammar
100% (1)
Learning Basic English Grammar
156 pages
Positive Form Negative Form Yes - No Questions Wh-Questions Question Tags
100% (1)
Positive Form Negative Form Yes - No Questions Wh-Questions Question Tags
25 pages
Chapter 3: Syntactic Forms, Grammatical Functions, and Semantic Roles
No ratings yet
Chapter 3: Syntactic Forms, Grammatical Functions, and Semantic Roles
30 pages
A Phonetic Descrition of Assameses
No ratings yet
A Phonetic Descrition of Assameses
8 pages
Sanskrit-Verbless Sentences in Sanskrit
100% (1)
Sanskrit-Verbless Sentences in Sanskrit
30 pages
A Suffix Based Morphological Analysis of Assamese Word Formation
100% (2)
A Suffix Based Morphological Analysis of Assamese Word Formation
5 pages
Alr 2012
No ratings yet
Alr 2012
145 pages
Vibhakti Identification Techniques For Sanskrit IJERTCONV3IS01045
No ratings yet
Vibhakti Identification Techniques For Sanskrit IJERTCONV3IS01045
6 pages
Can, Can't, Could, Couldn't
No ratings yet
Can, Can't, Could, Couldn't
6 pages
05 Introduction To NLP
No ratings yet
05 Introduction To NLP
63 pages
M1 Lesson 2 Slides For Students
No ratings yet
M1 Lesson 2 Slides For Students
88 pages
Article On Sanskrit Language PDF
No ratings yet
Article On Sanskrit Language PDF
52 pages
Ijst 2021 1163
No ratings yet
Ijst 2021 1163
9 pages
Artical On Geeta Linguistics
No ratings yet
Artical On Geeta Linguistics
8 pages
Ответы к теорвопросам
No ratings yet
Ответы к теорвопросам
28 pages
English For Nursing Vocational Book1 2012 PDF
No ratings yet
English For Nursing Vocational Book1 2012 PDF
2 pages
Optical Character Recognition of Handwri PDF
No ratings yet
Optical Character Recognition of Handwri PDF
6 pages
Group Members:: Ayesha Azhar Bareera Akbar Irum Masood Maryam Ahmed Tahira Jabeen
No ratings yet
Group Members:: Ayesha Azhar Bareera Akbar Irum Masood Maryam Ahmed Tahira Jabeen
58 pages
DBT Om For Revision of Emoluments
No ratings yet
DBT Om For Revision of Emoluments
20 pages
D2 Advanced Structure
No ratings yet
D2 Advanced Structure
17 pages
Thamizhi: Morph: A Morphological Parser For The Tamil Language
No ratings yet
Thamizhi: Morph: A Morphological Parser For The Tamil Language
34 pages
Corpora in Indian Languages
No ratings yet
Corpora in Indian Languages
18 pages
Bege 102 Solved Assignment 2018 19
No ratings yet
Bege 102 Solved Assignment 2018 19
19 pages
Structured and Logical Representations of Assamese Text For Question-Answering System
No ratings yet
Structured and Logical Representations of Assamese Text For Question-Answering System
12 pages
NER Overview PPT Final
No ratings yet
NER Overview PPT Final
20 pages
23 - Verb Tree Chart
No ratings yet
23 - Verb Tree Chart
1 page
Pastel Lined English Language Features Analysis Close Reading Activity Pre - 20240206 - 183710 - 0000
No ratings yet
Pastel Lined English Language Features Analysis Close Reading Activity Pre - 20240206 - 183710 - 0000
11 pages
Ocr Progress4
No ratings yet
Ocr Progress4
22 pages
Simple Past Tense (Statements) : Exercise 1
No ratings yet
Simple Past Tense (Statements) : Exercise 1
3 pages
2023 Icon-1 38
No ratings yet
2023 Icon-1 38
10 pages
Information Theoretical Complexities in Developing A Bilingual Corpus: Critical Comparison Hindi and Marathi
No ratings yet
Information Theoretical Complexities in Developing A Bilingual Corpus: Critical Comparison Hindi and Marathi
18 pages
Performance Analysis of Artificial Neural Network Algorithms For Automatic Handwritten Devanagari Text Generation in Marathi Styles
No ratings yet
Performance Analysis of Artificial Neural Network Algorithms For Automatic Handwritten Devanagari Text Generation in Marathi Styles
8 pages
N Gram and Gazetteer List Based Named Entity Recognition For Urdu: A Scarce Resourced Language
No ratings yet
N Gram and Gazetteer List Based Named Entity Recognition For Urdu: A Scarce Resourced Language
10 pages
Normalizing The Hindi Text
No ratings yet
Normalizing The Hindi Text
8 pages
Ijst 2023 765
No ratings yet
Ijst 2023 765
12 pages
Eapp Lesson 1 - Developing Your Vocabulary
No ratings yet
Eapp Lesson 1 - Developing Your Vocabulary
22 pages
Automatic Pronunciation Assessment For Language Learners With Acoustic-Phonetic Features
No ratings yet
Automatic Pronunciation Assessment For Language Learners With Acoustic-Phonetic Features
8 pages
A Character N-Gram Based Approach For Improved Recall in Indian Language NER
No ratings yet
A Character N-Gram Based Approach For Improved Recall in Indian Language NER
7 pages
Preparation of A Dataset and Issues Related With Recognition of Optical Character in Assamese Script
No ratings yet
Preparation of A Dataset and Issues Related With Recognition of Optical Character in Assamese Script
7 pages
9 Present Perfect Tense
No ratings yet
9 Present Perfect Tense
6 pages
Researchpaper UNER
No ratings yet
Researchpaper UNER
6 pages
Handwritten Marathi Character Recognition Using R
No ratings yet
Handwritten Marathi Character Recognition Using R
10 pages
Handwritten Assamese - Character
No ratings yet
Handwritten Assamese - Character
9 pages
Handwritten Assamese Character
No ratings yet
Handwritten Assamese Character
9 pages
(IJCST-V10I1P17) :kapadia Utkarsh N, Deasi Apurva A
No ratings yet
(IJCST-V10I1P17) :kapadia Utkarsh N, Deasi Apurva A
9 pages
Project Report
No ratings yet
Project Report
6 pages
Language Structure
No ratings yet
Language Structure
10 pages
LRL 13
No ratings yet
LRL 13
6 pages
Research Paper
No ratings yet
Research Paper
8 pages
Tirasaroj 2009
No ratings yet
Tirasaroj 2009
5 pages
Tesseract Ocr Engine
No ratings yet
Tesseract Ocr Engine
5 pages
Noun Group and Verb Group Identification For Hindi: Smriti Singh, Om P. Damani, Vaijayanthi M. Sarma
No ratings yet
Noun Group and Verb Group Identification For Hindi: Smriti Singh, Om P. Damani, Vaijayanthi M. Sarma
16 pages
Saskrit Parser Presentation CSE
No ratings yet
Saskrit Parser Presentation CSE
30 pages
GEOD202 Course Outline - 2023-2024
No ratings yet
GEOD202 Course Outline - 2023-2024
4 pages
A Survey On Recognition of Devnagari Script: Ratnashil N Khobragade1 Dr. Nitin A. Koli Mahendra S Makesar
No ratings yet
A Survey On Recognition of Devnagari Script: Ratnashil N Khobragade1 Dr. Nitin A. Koli Mahendra S Makesar
5 pages
Word Class Prediction of Ambiguous and Unknown Words of Punjabi Language Using Bi-Gram Methods
No ratings yet
Word Class Prediction of Ambiguous and Unknown Words of Punjabi Language Using Bi-Gram Methods
5 pages
Tamil
No ratings yet
Tamil
4 pages
Review Paper
No ratings yet
Review Paper
4 pages
Development of Part of Speech Tagger For Assamese Using HMM
No ratings yet
Development of Part of Speech Tagger For Assamese Using HMM
10 pages
Linguistic Area-ENG
No ratings yet
Linguistic Area-ENG
4 pages
Speech Recognition Architecture
No ratings yet
Speech Recognition Architecture
13 pages
English Tenses, Czech
No ratings yet
English Tenses, Czech
9 pages
Adjectives Fill The Jar
No ratings yet
Adjectives Fill The Jar
2 pages
Newspaper Headlines
No ratings yet
Newspaper Headlines
3 pages
Key: Subject Yellow, Bold Verb Green, Underline
No ratings yet
Key: Subject Yellow, Bold Verb Green, Underline
4 pages
Aya
No ratings yet
Aya
4 pages
Informal Letter - Past, Present & Future
No ratings yet
Informal Letter - Past, Present & Future
23 pages
Article On Sanskrit Language
No ratings yet
Article On Sanskrit Language
51 pages
A Structured Approach For Building Assamese Corpus: Insights, Applications and Challenges
No ratings yet
A Structured Approach For Building Assamese Corpus: Insights, Applications and Challenges
8 pages
The Seven Ages of Man
No ratings yet
The Seven Ages of Man
10 pages
Grammar Tules
No ratings yet
Grammar Tules
5 pages
Regras de Plural - Básico - Respostas
No ratings yet
Regras de Plural - Básico - Respostas
1 page
Textual Characteristics For Language Engineering: Mathias Bank, Robert Remus, Martin Schierle
No ratings yet
Textual Characteristics For Language Engineering: Mathias Bank, Robert Remus, Martin Schierle
5 pages
Natural Language Processing Tools For Tamil Grammar
No ratings yet
Natural Language Processing Tools For Tamil Grammar
5 pages
Cpms Long Iwlc 06
No ratings yet
Cpms Long Iwlc 06
19 pages
DLP-Day-5 (PARMISANA - ACTIVE PASSIVE)
No ratings yet
DLP-Day-5 (PARMISANA - ACTIVE PASSIVE)
6 pages
Implementation of Marathi Language Speech Databases For Large Dictionary
No ratings yet
Implementation of Marathi Language Speech Databases For Large Dictionary
6 pages
(IJCST-V11I4P14) :DR Arzoo
No ratings yet
(IJCST-V11I4P14) :DR Arzoo
4 pages
Use of Metadata To Improve Recognition of Spontaneous Speech and Named Entities
No ratings yet
Use of Metadata To Improve Recognition of Spontaneous Speech and Named Entities
4 pages
Paper Use of English
No ratings yet
Paper Use of English
9 pages
Prepositions 1
No ratings yet
Prepositions 1
1 page
Parts of Speech Tagging For Afaan Oromo
No ratings yet
Parts of Speech Tagging For Afaan Oromo
5 pages
Direct and Indirect Speech
No ratings yet
Direct and Indirect Speech
5 pages
Sanskrit Project Abstract
No ratings yet
Sanskrit Project Abstract
3 pages
Learning Hindi: Speak, Read and Write Hindi with Manga Comics! A Language Guide for Self-Study (Free Online Audio & Flash Cards)
From Everand
Learning Hindi: Speak, Read and Write Hindi with Manga Comics! A Language Guide for Self-Study (Free Online Audio & Flash Cards)
Brajesh Samarth
5/5 (1)
Let Us Learn Tamil
From Everand
Let Us Learn Tamil
S. Raman
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Entity Recognition in Assamese Text: Abstract - Entity Recognition Detects All The Entities Present

Uploaded by

Entity Recognition in Assamese Text: Abstract - Entity Recognition Detects All The Entities Present

Uploaded by

ENTITY RECOGNITION IN ASSAMESE TEXT

Nandana Mahanta, Sourish Dhar, Sudipta Roy

I. INTRODUCTION II. ISSUES REGARDING ENTITY RECOGNITION IN

ENTITY PRECISION RECALL

V. CONCLUSION AND FUTURE WORK

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.