Getting Started With Natural Language Processing
Getting Started With Natural Language Processing
Language Processing
• Natural Language Processing is a sub field of
artificial-intelligence that helps in processing
and analyzing natural language like text,
speech and so on. In this article we will try to
explain various techniques.
Text Preprocessing
Text is mostly in unstructured form. Lot of noises will be
present in it. In data preprocessing we will remove
the noises associated with it. It is not possible to
analyze the data without properly preprocessing it.
Stemming
• Stemming is a rule based approach which
strips suffixes(ing,ly,s etc). some of the
examples are:
Word standardisation:
Text may contains words that are not in
dictionary.for example in tweets or
comments , it can contain words like ‘re’
representing are,’s’ for is,’awsm’ for awesome
and so on.such words will not recognized by
our model.so we have to fix it
Noise entity removal: