Foundation (Week 4) - DeepTech - Ready Upskilling Program

This document outlines the Week 4 curriculum for a course on Natural Language Processing (NLP), focusing on fundamentals, text preprocessing, and representation techniques. It includes learning objectives, required resources, and a series of applied learning assignments that involve practical tasks such as text cleaning, tokenization, stemming, and lemmatization. Additionally, it provides links to course materials and specific coding tasks using Python and various libraries.

Uploaded by

gurjibecha88


Week 4: Fundamentals of Natural Language Processing

This week, you will work through the following course:

● Fundamentals of NLP
Course 1: Fundamentals of Natural Language Processing

Learning objectives for course

At the end of this course, you should be able to:

● Understand NLP Fundamentals.

● Preprocess and analyze text.
● Apply Text Representation Techniques.
Learning Requirements

To support your learning this week, you will require the following resources:

● Jupyter Notebook
● Google Colab (Recommended)

A guide on how to use Google Colab for your assignments is provided: Google Colab Guide

Link(s) to the course:

• Introduction to Natural Language Processing (NLP)

• Text Preprocessing, Tokenization, Stemming & Lemmatization
• Text Representations in NLP

Learning Resources
Course:

1. Slide 1 - Introduction to Natural Language Processing (NLP)
2. Slide 2 - Text Preprocessing, Tokenization, Stemming & Lemmatization
3. Slide 3 - Text Representations in NLP
4. Notebook 1 - Regex Colab Notebook
5. Notebook 2 - String Processing Colab
6. Notebook 3 - Text Tokenization Colab
7. Notebook 4 - Text Representation Colab
Applied Learning Assignments 1:
1. Define Natural Language Processing (NLP) in your own words.
2. List at least three real-world applications of NLP and explain their significance.
3. Identify and explain two challenges that make NLP complex.
4. Extract the following patterns using regex:
a) All email addresses from the text below:
“Contact us at support@company.com or sales@business.org.
For more, email info@service.net.”

b) All words that end with "ing" from this sentence:

“NLP is amazing for cleaning and processing text while learning new
techniques.”

5. Write a Python program to clean the following text:

“NLP makes AI smarter! But, sometimes, it’s challenging… Don’t you agree?”

The program should clean the text by:
a) Removing all punctuation.
b) Converting it to lowercase.
c) Splitting it into words.
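
Tasks 4 and 5 above can be approached with Python's built-in `re` and `string` modules. The following is a minimal sketch, not a model answer: the email pattern is a deliberately simplified assumption (it does not cover every valid address), and the names `EMAIL_RE`, `ING_RE` and `clean_text` are illustrative. Note that the sample text in task 5 contains typographic characters (’ and …) that are not part of `string.punctuation`, so they are added explicitly.

```python
import re
import string

# Sample texts copied from the assignment.
EMAIL_TEXT = (
    "Contact us at support@company.com or sales@business.org. "
    "For more, email info@service.net."
)
ING_TEXT = (
    "NLP is amazing for cleaning and processing text while learning new "
    "techniques."
)
RAW_TEXT = "NLP makes AI smarter! But, sometimes, it’s challenging… Don’t you agree?"

# Simplified email pattern (an assumption, not a complete address matcher):
# local part, "@", domain labels, then a top-level domain of 2+ letters.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")

# Whole words that end in "ing".
ING_RE = re.compile(r"\b\w+ing\b")


def clean_text(text: str) -> list[str]:
    """Remove punctuation, lowercase the text, and split it into words."""
    # ASCII punctuation plus the typographic marks that occur in the sample.
    punctuation = string.punctuation + "’‘“”…"
    no_punct = text.translate(str.maketrans("", "", punctuation))
    return no_punct.lower().split()


emails = EMAIL_RE.findall(EMAIL_TEXT)
ing_words = ING_RE.findall(ING_TEXT)
words = clean_text(RAW_TEXT)

print(emails)
print(ing_words)
print(words)
```

Removing the apostrophe turns "it’s" into "its" and "Don’t" into "dont"; whether that is acceptable depends on the downstream task, so treat it as a design choice rather than a rule.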
Applied Learning Assignments 2:
1. Text Cleaning Task
Apply text cleaning techniques to preprocess the following text:
"OMG!! NLP is soooo coool 🤩...!!! It costs $1000. Learn it now at https://3mtt.com 😎."

Refer to the course slides for more information.

2. Tokenization Task
Perform both word-level and sentence-level tokenization on the given text.

"Tokenization is the first step in NLP. It splits text into smaller pieces for
analysis."
o Use NLTK to perform word tokenization.
o Use NLTK to perform sentence tokenization.
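
For the text cleaning task (item 1 above), the sketch below shows one possible pipeline using only the standard library. The choice and order of steps are assumptions, since the course slides define the expected techniques: lowercase, drop URLs, drop non-ASCII characters (which removes the emojis), shorten letters repeated three or more times to two, strip the remaining punctuation, and normalize whitespace. The function name `clean_noisy_text` is illustrative.

```python
import re

NOISY_TEXT = (
    "OMG!! NLP is soooo coool 🤩...!!! It costs $1000. "
    "Learn it now at https://3mtt.com 😎."
)


def clean_noisy_text(text: str) -> str:
    """Apply a simple, rule-based cleaning pipeline to informal text."""
    text = text.lower()
    # Drop URLs before punctuation removal would break them apart.
    text = re.sub(r"https?://\S+", "", text)
    # Drop every non-ASCII character; in this sample that is only the emojis.
    text = re.sub(r"[^\x00-\x7f]", "", text)
    # Shorten any letter repeated 3+ times to exactly two ("coool" -> "cool").
    text = re.sub(r"([a-z])\1{2,}", r"\1\1", text)
    # Remove everything that is not a letter, digit, or whitespace.
    text = re.sub(r"[^a-z0-9\s]", "", text)
    # Collapse the runs of spaces left behind by the removals.
    return " ".join(text.split())


cleaned = clean_noisy_text(NOISY_TEXT)
print(cleaned)
```

The repeated-letter rule is a heuristic: it repairs "coool" but leaves "soooo" as "soo", and the sentence now ends in a dangling "at" because the URL was removed. Dropping all non-ASCII characters would also delete accented letters in other texts, so a real pipeline may need a narrower emoji filter.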
3. Stemming and Lemmatization Task

Apply stemming and lemmatization techniques to a list of words:

["running", "flies", "studies", "easily", "studying", "better"]

o Use Porter Stemmer to perform stemming on the words.

o Use spaCy to perform lemmatization on the same words.

Applied Learning Assignments 3:
1. Define a vocabulary of at least 5 unique words. Write Python code to generate one-hot encoded vectors for your vocabulary.

2. Use the following sentences as your dataset:

● “The quick brown fox jumps over the lazy dog.”

● “The dog sleeps in the kernel”

– Write Python code to generate a Bag of Words representation for the dataset using CountVectorizer.

– Write Python code to compute the TF-IDF representation using TfidfVectorizer.
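
Before reaching for CountVectorizer and TfidfVectorizer, it can help to see what these representations compute. The sketch below builds one-hot, Bag of Words and TF-IDF vectors by hand with the standard library only; it is a from-scratch illustration, not the scikit-learn API the assignment asks for. Two assumptions are made: the tokenization rule (lowercase, tokens of two or more word characters) and the smoothed IDF formula with L2 normalization are intended to follow scikit-learn's documented defaults, so compare the results with the library's output rather than relying on them. The vocabulary here is taken from the two dataset sentences; for item 1 you would substitute your own words.

```python
import math
import re
from collections import Counter

# The two sentences from the assignment, used as the dataset.
docs = [
    "The quick brown fox jumps over the lazy dog.",
    "The dog sleeps in the kernel",
]


def tokenize(text):
    # Lowercase, then keep runs of two or more word characters.
    return re.findall(r"\b\w\w+\b", text.lower())


tokenized = [tokenize(doc) for doc in docs]
vocab = sorted({token for tokens in tokenized for token in tokens})
index = {word: i for i, word in enumerate(vocab)}


def one_hot(word):
    # A vector of zeros with a single 1 at the word's vocabulary index.
    vector = [0] * len(vocab)
    vector[index[word]] = 1
    return vector


def bag_of_words(tokens):
    # Count how often each vocabulary word occurs in one document.
    counts = Counter(tokens)
    return [counts[word] for word in vocab]


bow = [bag_of_words(tokens) for tokens in tokenized]

# Smoothed inverse document frequency: ln((1 + n) / (1 + df)) + 1.
n_docs = len(docs)
doc_freq = [sum(1 for row in bow if row[i] > 0) for i in range(len(vocab))]
idf = [math.log((1 + n_docs) / (1 + df)) + 1 for df in doc_freq]


def tfidf(row):
    # Weight raw counts by IDF, then scale the row to unit L2 length.
    weighted = [count * weight for count, weight in zip(row, idf)]
    norm = math.sqrt(sum(value * value for value in weighted))
    return [value / norm for value in weighted]


tfidf_rows = [tfidf(row) for row in bow]

print(vocab)
print(one_hot("dog"))
print(bow)
print(tfidf_rows)
```

Words that occur in both sentences ("the", "dog") receive the lowest IDF, so within a sentence a word such as "fox" ends up with a higher TF-IDF weight than "dog" even though both occur once.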
3. Create a small dataset of at least 3 sentences related to animals.

Example: "The cat meows. The dog barks. The bird sings."
– Write Python code to:
• Train a Word2Vec model using gensim.
• Retrieve the embedding for the word "dog".

4. Load the pretrained GloVe model (glove-wiki-gigaword-50) using gensim.

– Write Python code to:
• Retrieve the embedding for the word "king".
• Find the 5 most similar words to "king".
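
Finding the "most similar" words in an embedding model usually means ranking the other words by cosine similarity to the query vector. The sketch below illustrates that ranking with the standard library only. The 3-dimensional vectors are made up purely for illustration and are not taken from Word2Vec or GloVe; the function `most_similar` is a hypothetical helper, not the gensim method of the same name.

```python
import math

# Hypothetical toy vectors, invented for illustration only.
# Real GloVe vectors from glove-wiki-gigaword-50 have 50 dimensions.
toy_vectors = {
    "king": [0.9, 0.8, 0.1],
    "queen": [0.85, 0.75, 0.2],
    "prince": [0.8, 0.6, 0.3],
    "dog": [0.1, 0.2, 0.9],
    "cat": [0.15, 0.1, 0.95],
}


def cosine(u, v):
    # Cosine similarity: dot product divided by the product of the norms.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def most_similar(word, vectors, topn=5):
    # Score every other word against the query, highest similarity first.
    target = vectors[word]
    scores = [
        (other, cosine(target, vec))
        for other, vec in vectors.items()
        if other != word
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:topn]


ranking = most_similar("king", toy_vectors, topn=5)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

With these toy values, "queen" and "prince" rank above "dog" and "cat" only because the vectors were chosen that way; with a pretrained model the ranking reflects patterns learned from the training corpus.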
