Natural Language
Processing on Bengali
NLP is a branch of artificial intelligence concerned with the interaction
between computers and humans using natural language. It focuses on
enabling computers to understand, interpret, and respond to human language.
by :-
Tirumal M Naik 1SU21CS136
Vignesh V J 1SU21CS145
Overview of Bengali language
1 Rich History 2 Distinctive Script
Bengali is one of the most spoken The beautiful Bengali script has its
languages in the world, with a rich origins in the Brahmic family of scripts
cultural and literary heritage. and is known for its flowing, artistic
style.
3 Phonetic Complexity
Bengali has a phonetic complexity with various sounds and intonations that contribute to
its unique linguistic characteristics.
Challenges in NLP for Bengali
Morphological Variation Limited Annotated Data
Bengali exhibits intricate morphological The availability of annotated data in Bengali is
variations that pose challenges for accurate limited, hindering the development of effective
analysis and processing. NLP models.
Preprocessing techniques for Bengali
text
1 Text Normalization 2 Stopword Removal
The process of standardizing and Eliminating common words that do not
organizing text to make it uniform and contribute much to the overall NLP
consistent for analysis. analysis.
3 Morphological Analysis
Breaking down words into their constituent parts to understand their meaning and
grammatical structure.
Bengali word tokenization
Tokenization Linguistic Analysis
Process of breaking down a text into units like Examining the structure and meaning of words,
words or sentences for analysis. ensuring accurate tokenization.
Bengali part-of-speech tagging
5 3
Identification Accuracy
Assignment of grammatical categories to words in Precision in identifying the different parts of
Bengali sentences. speech within Bengali texts.
Bengali named entity recognition
Named Entities Identification of specific entities such as
names, dates, and locations within Bengali
text.
Entity Classification Classifying named entities into predefined
categories for analysis.
Applications of NLP in Bengali
language processing
Language Translation
Utilizing NLP for accurate translation between Bengali and other languages.
Information Extraction
Extracting structured information from unstructured Bengali text for analysis.
Sentiment Analysis
Understanding public sentiment by analyzing social media and news content in
Bengali.
Thank You