N. G. ACHARYA & D. K. MARATHE COLLEGE OF
ARTS, SCIENCE & COMMERCE
(Affiliated to University of Mumbai)
PRACTICAL JOURNAL
PSCSP514
Natural Language Processing
SUBMITTED BY
Vinay Vijay Gupta
2023-2024
MUMBAI-400 071
N. G. ACHARYA & D. K. MARATHE COLLEGE
OF ARTS, SCIENCE & COMMERCE
(Affiliated to University of Mumbai)
CERTIFICATE
This is to certify that Mr. Vinay Vijay Gupta, Seat No. ______, studying in
Master of Science in Computer Science Part I Semester II, has satisfactorily
completed the practicals of PSCSP514 Natural Language Processing as prescribed
by the University of Mumbai during the academic year 2023-24.
Practical 1
Write a program to implement tokenization and sentence segmentation.
What is Tokenization?
Tokenization is the process of breaking a piece of text into smaller units called tokens, such as sentences or words.
Program:
a. Tokenization:
import nltk
nltk.download('punkt')
from nltk.tokenize import sent_tokenize
# Sentence tokenization
sentence = "Hi, My name is Aman, I hope you like my work. You can follow me on Instagram for more resources. My username is 'the.clever.programmer'."
print(sent_tokenize(sentence))
# Word tokenization using the Treebank tokenizer
from nltk.tokenize import TreebankWordTokenizer
word_token = TreebankWordTokenizer()
print(word_token.tokenize(sentence))
Output:
b. Segmentation:
Sentence segmentation is the process of deciding where sentences start and end, i.e. dividing a paragraph into its individual sentences.
The name of a spaCy pipeline indicates the type of text it was trained on, e.g. web or news. For example, en_core_web_sm is a small English pipeline trained on written web text (blogs, news, comments) that includes vocabulary, vectors, syntax and entities.
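A minimal sketch of sentence segmentation with spaCy, assuming the en_core_web_sm pipeline mentioned above is installed (python -m spacy download en_core_web_sm); the sample text simply reuses the sentence from the tokenization program:
import spacy
# Load the small English web pipeline
nlp = spacy.load('en_core_web_sm')
text = "Hi, My name is Aman, I hope you like my work. You can follow me on Instagram for more resources. My username is 'the.clever.programmer'."
doc = nlp(text)
# doc.sents yields one Span per detected sentence
for sent in doc.sents:
    print(sent.text)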
Output:
Practical 2
Porter Stemmer: This is the Porter stemming algorithm. It follows the algorithm presented in Porter, M., "An algorithm for suffix stripping", Program, 1980. For example:
CONNECT     -----> CONNECT
CONNECTED   -----> CONNECT
CONNECTING  -----> CONNECT
CONNECTION  -----> CONNECT
CONNECTIONS -----> CONNECT
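As a quick check, the mapping above can be reproduced with NLTK's PorterStemmer (the word list here is only illustrative):
from nltk.stem.porter import PorterStemmer
stemmer = PorterStemmer()
# Each related form reduces to the same stem, "connect"
for word in ["connect", "connected", "connecting", "connection", "connections"]:
    print(word, "----->", stemmer.stem(word))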
What is Stemming?
Stemming is the process of reducing inflected words to their root form (stem), usually by stripping suffixes, so that related forms such as "connection" and "connected" map to the same stem.
The program below uses the Porter stemming algorithm.
import nltk
nltk.download('punkt')
from nltk.stem.porter import PorterStemmer
porter_stemmer = PorterStemmer()
word_data = "It originated from the idea that there are readers who prefer learning new skills from the comforts of their drawing rooms"
# First, word tokenization
nltk_tokens = nltk.word_tokenize(word_data)
# Next, find the root (stem) of each word
for w in nltk_tokens:
    print("Actual: %s  Stem: %s" % (w, porter_stemmer.stem(w)))
Output:
Lemmatization:
from nltk.stem import WordNetLemmatizer
a = WordNetLemmatizer()
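A minimal usage sketch for the lemmatizer, assuming the WordNet data has been downloaded; the sample words are illustrative only:
import nltk
nltk.download('wordnet')
from nltk.stem import WordNetLemmatizer
lemmatizer = WordNetLemmatizer()
# Without a POS hint, lemmatize() treats the word as a noun
print(lemmatizer.lemmatize("defeats"))            # defeat
# Passing pos='v' lemmatizes the word as a verb
print(lemmatizer.lemmatize("defeated", pos="v"))  # defeat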
Unigram:
from nltk.util import ngrams
n = 1
sentence = 'You will face many defeats in life, but never let yourself be defeated.'
unigrams = ngrams(sentence.split(), n)
for item in unigrams:
    print(item)
Output:
Bi-gram:
n = 2
bigrams = ngrams(sentence.split(), n)
for item in bigrams:
    print(item)
Output:
Tri-gram:
n = 3
trigrams = ngrams(sentence.split(), n)
for item in trigrams:
    print(item)
Output:
Practical 4
Write a program to remove stop words and perform part-of-speech (POS) tagging.
Program:
import nltk
nltk.download('stopwords')
nltk.download('punkt')
nltk.download('averaged_perceptron_tagger')
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize, sent_tokenize
stop_words = set(stopwords.words('english'))
# Sample text (assumed here, since the original value of txt was not shown)
txt = "Natural language processing makes it possible for computers to understand human language. It is widely used in search engines and chatbots."
tokenized = sent_tokenize(txt)
for i in tokenized:
    # Word-tokenize each sentence and remove stop words
    words = [w for w in word_tokenize(i) if w.lower() not in stop_words]
    # Tag each remaining word with its part of speech
    tagged = nltk.pos_tag(words)
    print(tagged)
Output:
Or, using spaCy:
import spacy
nlp = spacy.load('en_core_web_sm')
doc = nlp(txt)
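To complete the spaCy version, a small sketch that filters stop words and prints part-of-speech tags (token.is_stop, token.is_punct and token.pos_ are standard spaCy token attributes):
# Print each non-stop-word token together with its POS tag
for token in doc:
    if not token.is_stop and not token.is_punct:
        print(token.text, '=>', token.pos_)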
Output:
Practical 5
import spacy
# Loading the model
nlp = spacy.load('en_core_web_sm')
text = "Reliance Retail acquires majority stake in designer brand Abraham & Thakore."
# Creating Doc object
doc = nlp(text)
print(doc)
# Getting dependency tags
for token in doc:
    print(token.text, '=>', token.dep_)
# Importing visualizer
from spacy import displacy
# Visualizing dependency tree
displacy.render(doc, jupyter=True)
Or:
import spacy
from spacy import displacy
nlp = spacy.load('en_core_web_sm')
doc = nlp(u'This is a sentence.')
displacy.serve(doc, style='dep')
Practical 6
import spacy
from spacy import displacy
nlp = spacy.load('en_core_web_sm')
text = 'It took me more than two hours to translate a few pages of English.'
doc = nlp(text)
# Visualizing the dependency tree
displacy.render(doc, style='dep')
or
import spacy
from spacy import displacy
py_nlp = spacy.load("en_core_web_sm")
# py_doc is assumed to be built from the same sample text (its definition was not shown)
py_doc = py_nlp("It took me more than two hours to translate a few pages of English.")
displacy.render(py_doc, style='dep')
Practical 7
Write a program to implement Named Entity Recognition (NER).
import spacy
nlp = spacy.load('en_core_web_sm')
# Sample text (assumed; reused from Practical 5, since the original value was not shown)
text = "Reliance Retail acquires majority stake in designer brand Abraham & Thakore."
doc = nlp(text)
# Sentence segmentation
sentences = list(doc.sents)
print(sentences)
# Tokenization
for token in doc:
    print(token.text)
# Print the named entities with their labels
ents = [(ent.text, ent.label_) for ent in doc.ents]
print(ents)
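The recognized entities can also be highlighted with spaCy's displaCy visualizer; a minimal sketch reusing the doc built above (style='ent' is displaCy's entity-visualization mode):
from spacy import displacy
# Render the named entities inline (inside a Jupyter notebook)
displacy.render(doc, style='ent', jupyter=True)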
Text summarization using gensim:
import gensim
from gensim.summarization import summarize   # the summarization module exists only in gensim versions before 4.0
# original_text should hold the passage to summarize (several sentences long); its value was not shown here
short_summary = summarize(original_text)
print(short_summary)
Output: