Papers by Dr. Apurba Senapati

SN Computer Science
Neologisms refer to newly coined words or phrases adopted by a language, and it is a slow but ong... more Neologisms refer to newly coined words or phrases adopted by a language, and it is a slow but ongoing process that occurs in all languages. Sometimes, rarely used or obsolete words are also considered neologisms. Certain events, such as wars, the emergence of new diseases, or advancements like computers and the internet, can trigger the creation of new words or neologisms. The COVID-19 pandemic is one such event that has rapidly led to an explosion of neologisms in the context of the disease and several other social contexts. Even the term COVID-19 itself is a newly coined term. Studying such adaptation or change and quantifying it is essential from a linguistic perspective. However, identifying newly coined terms or extracting neologisms computationally is a challenging task. The standard tools and techniques for finding newly coined terms in English-like languages may not be suitable for Bengali and other Indic languages. This study aims to use a semi-automated approach to investigate the emergence or modification of new words in the Bengali language amidst the COVID-19 pandemic. To conduct this study, a Bengali web corpus was compiled consisting of COVID-19 related articles sourced from various web sources in Bengali. The current experiment focuses solely on COVID-19-related neologisms, but the method can be adapted for general purposes and extended to other languages as well.

Scalable Computing: Practice and Experience
Hate speech detection research is a recent sizzling topic in natural language processing (NLP). U... more Hate speech detection research is a recent sizzling topic in natural language processing (NLP). Unburdened uses of social media platforms make people over-opinionative, which crosses the limit of leaving comments and posts toxic. A toxic outlook increases violence towards the neighbour, state, country, and continent. Several laws have been introduced in different countries to end the emergency problem. Now, all the media platforms have started working on restricting hate posts or comments. Hate speech detection is generally a text classification problem if considered a supervised observation. To tackle text in terms of computation perspective is challenging because of its semantic and complex grammatical nature. Resource-rich languages leverage their richness, whereas resource scarce language suffers significantly from a lack of dataset. This paper makes a multifaceted contribution encompassing resource generation, experimentation with Machine Learning (ML), Deep Learning (DL) and s...
Smart innovation, systems and technologies, 2022
Uploads
Papers by Dr. Apurba Senapati