Papers by Pratik Ratadiya

2021 International Conference on Data Mining Workshops (ICDMW), Dec 1, 2021
Document Classification has a wide range of applications in various domains like Ontology Mapping, Sentiment Analysis, Topic Categorization and Document Clustering, to mention a few. Unlike Text Classification, Document Classification works with longer sequences that typically contain multiple paragraphs. Previous approaches for this task have achieved promising results, but have often relied on complex recurrence mechanisms that are expensive and time-consuming in nature. Recently, self-attention based models like Transformers and BERT have achieved state-of-the-art performance on several Natural Language Understanding (NLU) tasks, but owing to the quadratic computational complexity of the self-attention mechanism with respect to the input sequence length, these approaches are generally applied to shorter text sequences. In this paper, we address this issue by proposing a new Transformer-based Hierarchical Encoder approach for the Document Classification task. The hierarchical framework we adopt helps us extend the self-attention mechanism to long-form text modelling, thereby reducing the complexity considerably. We use the Bidirectional Transformer Encoder (BTE) at the sentence level to generate a fixed-size sentence embedding for each sentence in the document. A document-level Transformer Encoder is then used to model the global document context and learn the inter-sentence dependencies. We also carry out experiments with the BTE in a feature-extraction and a fine-tuning setup, allowing us to evaluate the trade-off between computation power and accuracy. Furthermore, we conduct ablation experiments and evaluate the impact of different pre-training strategies on the overall performance. Experimental results demonstrate that our proposed model achieves state-of-the-art performance on two standard benchmark datasets.
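A minimal PyTorch sketch of the hierarchical scheme this abstract describes, assuming a BERT-style checkpoint as the sentence-level encoder; the class name, layer counts and mean-pooling choice are illustrative assumptions, not the authors' exact configuration.

```python
import torch
import torch.nn as nn
from transformers import AutoModel

class HierarchicalDocClassifier(nn.Module):
    def __init__(self, num_classes, sent_model="bert-base-uncased",
                 doc_layers=2, doc_heads=8):
        super().__init__()
        # Sentence-level bidirectional encoder (stand-in for the paper's BTE).
        self.sentence_encoder = AutoModel.from_pretrained(sent_model)
        d_model = self.sentence_encoder.config.hidden_size
        # Document-level Transformer over sentence embeddings.
        doc_layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=doc_heads,
                                               batch_first=True)
        self.document_encoder = nn.TransformerEncoder(doc_layer, num_layers=doc_layers)
        self.classifier = nn.Linear(d_model, num_classes)

    def forward(self, input_ids, attention_mask):
        # input_ids / attention_mask: (num_sentences, max_sentence_len) for one document.
        # Each sentence is encoded independently; its [CLS]-position vector
        # serves as a fixed-size sentence embedding.
        sent_out = self.sentence_encoder(input_ids=input_ids,
                                         attention_mask=attention_mask)
        sent_emb = sent_out.last_hidden_state[:, 0, :]          # (num_sentences, d_model)
        # Self-attention here is quadratic in the number of sentences rather
        # than the number of tokens, which is what keeps long documents tractable.
        doc_ctx = self.document_encoder(sent_emb.unsqueeze(0))  # (1, num_sentences, d_model)
        return self.classifier(doc_ctx.mean(dim=1))             # (1, num_classes)
```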

2021 International Conference on Data Mining Workshops (ICDMW), Dec 1, 2021
Social media-specific Sentiment Analysis has a wide range of applications in various domains like Business Intelligence, Marketing, Politics and Psychology, to mention a few. Irony Detection and Emotion Recognition, two of Sentiment Analysis' significant pillars, have become increasingly important as a result of the continued growth of social media. Previous approaches for the two tasks have yielded promising results, but have often relied on recurrence and pre-trained word-embedding ensembles. In this paper, we propose two novel contextual embedding-based approaches for Irony Detection and Emotion Recognition. We leverage social media-specific pre-training in the form of BERTweet, a language model pre-trained on English Tweets, along with either a Convolutional Neural Network or a Transformer Encoder. We empirically show that the addition of Convolutional Neural Networks or a Transformer Encoder results in improved performance when compared to a vanilla BERTweet model. Furthermore, we compare CNNs and the Transformer Encoder as feature extractors, assessing the trade-off between the number of learnable parameters and performance. Finally, we also investigate the impact of partial and complete fine-tuning and analyze the trade-off between computational power and accuracy in the process. Experimental results demonstrate that our proposed methods achieve state-of-the-art performance on two standard benchmark datasets.
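A hedged sketch of the BERTweet-plus-CNN variant named in the abstract (the Transformer Encoder variant is analogous); the filter count and kernel sizes are assumptions for demonstration, not the paper's tuned values.

```python
import torch
import torch.nn as nn
from transformers import AutoModel

class BERTweetCNN(nn.Module):
    def __init__(self, num_classes, n_filters=128, kernel_sizes=(3, 4, 5)):
        super().__init__()
        # vinai/bertweet-base is the public BERTweet checkpoint.
        self.bertweet = AutoModel.from_pretrained("vinai/bertweet-base")
        hidden = self.bertweet.config.hidden_size
        # One Conv1d per kernel size over the token dimension, TextCNN-style.
        self.convs = nn.ModuleList(
            nn.Conv1d(hidden, n_filters, k) for k in kernel_sizes)
        self.classifier = nn.Linear(n_filters * len(kernel_sizes), num_classes)

    def forward(self, input_ids, attention_mask):
        tokens = self.bertweet(input_ids=input_ids,
                               attention_mask=attention_mask).last_hidden_state
        x = tokens.transpose(1, 2)                              # (batch, hidden, seq_len)
        # ReLU then max-over-time pooling for each convolutional branch.
        pooled = [torch.relu(conv(x)).amax(dim=2) for conv in self.convs]
        return self.classifier(torch.cat(pooled, dim=1))
```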

medRxiv (Cold Spring Harbor Laboratory), Apr 11, 2023
Sepsis is a major cause of morbidity and mortality worldwide, and is caused by bacterial infection in a majority of cases. However, fungal sepsis often carries a higher mortality rate, due both to its prevalence in immunocompromised patients and to delayed recognition. Using chest x-rays, associated radiology reports, and structured patient data from the MIMIC-IV clinical dataset, the authors present a machine learning methodology to differentiate between bacterial, fungal, and viral sepsis. Model performance shows AUCs of 0.81, 0.83, and 0.79 for detecting bacterial, fungal, and viral sepsis, respectively, with the best performance achieved using embeddings from image reports and structured clinical data. By improving early detection of an often missed causative septic agent, predictive models could facilitate earlier treatment of non-bacterial sepsis with a resultant associated mortality reduction.
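A minimal late-fusion sketch of the best-performing setup described above: report-text embeddings concatenated with structured clinical features, feeding a simple classifier. The embedding source, feature layout and classifier are placeholders, not the authors' pipeline.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def build_features(report_embeddings, structured):
    # report_embeddings: (n_patients, d) array from any text encoder over the
    # radiology reports; structured: (n_patients, k) vitals/labs/demographics.
    return np.concatenate([report_embeddings, structured], axis=1)

# Multinomial logistic regression over the three septic-agent classes
# (bacterial / fungal / viral), as a stand-in classifier.
clf = LogisticRegression(max_iter=1000)
# clf.fit(build_features(train_emb, train_struct), train_labels)
# probs = clf.predict_proba(build_features(test_emb, test_struct))
```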

ArXiv, 2019
Forums play an important role in providing a platform for community interaction. The introduction of irrelevant content or spam by individuals for commercial and social gains tends to degrade the professional experience presented to the forum users. Automated moderation of the relevancy of posted content is desired. Machine learning is used for text classification and finds applications in spam email detection, fraudulent transaction detection, etc. The balance of classes in training data is essential in the case of classification algorithms to make the learning efficient and accurate. However, in the case of forums, the spam content is sparse compared to the relevant content, giving rise to a bias towards the latter while training. A model trained on such biased data will fail to classify a spam sample. An approach based on the Synthetic Minority Over-sampling Technique (SMOTE) is presented in this paper to tackle imbalanced training data. It involves synthetically creating new minority class samples.
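A small, self-contained illustration of SMOTE-based rebalancing using the imbalanced-learn library; the toy dataset stands in for vectorized forum posts, and the paper's actual preprocessing and classifier are not shown.

```python
from collections import Counter
from imblearn.over_sampling import SMOTE
from sklearn.datasets import make_classification

# Toy imbalanced dataset: 95% relevant content, 5% spam.
X, y = make_classification(n_samples=2000, weights=[0.95, 0.05], random_state=0)
print("before:", Counter(y))

# SMOTE interpolates between a minority sample and its nearest minority-class
# neighbours to synthesize new minority-class points.
X_res, y_res = SMOTE(random_state=0).fit_resample(X, y)
print("after: ", Counter(y_res))
```
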
This paper describes our approach for Task 9 of SemEval 2021: Statement Verification and Evidence Finding with Tables. We participated in both subtasks, namely statement verification and evidence finding. For the subtask of statement verification, we extend the TAPAS model to adapt to the ‘unknown’ class of statements by fine-tuning it on an augmented version of the task data. For the subtask of evidence finding, we fine-tune the DistilBERT model in a Siamese setting.
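A hedged sketch of a Siamese DistilBERT setup for evidence finding: the statement and each table cell's text are embedded by one shared encoder and compared by cosine similarity. The model checkpoint matches the abstract; the pooling and scoring choices are illustrative assumptions.

```python
import torch
import torch.nn.functional as F
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
encoder = AutoModel.from_pretrained("distilbert-base-uncased")

def embed(texts):
    # Shared ("Siamese") encoder: the same weights embed both sides.
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    out = encoder(**batch).last_hidden_state
    return out[:, 0, :]  # first-token vector as the sentence embedding

with torch.no_grad():
    statement = embed(["Revenue grew in 2020."])
    cells = embed(["Revenue: 1.2M (2020)", "Employees: 40"])
    scores = F.cosine_similarity(statement, cells)  # higher = stronger evidence
```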

Data privacy and sharing have always been critical issues when trying to build complex deep learning-based systems to model data. Facilitating a decentralized approach that can benefit from data across multiple nodes without needing to physically merge their data contents has been an area of active research. In this paper, we present a solution to benefit from a distributed data setup when training deep learning architectures by making use of a smart contract system. Specifically, we propose a mechanism that aggregates the intermediate representations obtained from local ANN models over a blockchain. Training of local models takes place on their respective data. The intermediate representations derived from them, when combined and trained together on the host node, help to obtain a more accurate system. While federated learning primarily deals with settings in which data with the same features has its samples distributed across multiple nodes, here we deal with data whose features are split across the nodes.
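A toy sketch of the aggregation idea only, with the blockchain transport omitted: each node trains a local network on its own feature slice and shares just an intermediate representation, which the host concatenates and feeds to a trainable head. All dimensions and the two-node split are illustrative assumptions.

```python
import torch
import torch.nn as nn

class LocalEncoder(nn.Module):
    """Per-node network; only its output representation ever leaves the node."""
    def __init__(self, in_dim, rep_dim=16):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(in_dim, 32), nn.ReLU(),
                                 nn.Linear(32, rep_dim))

    def forward(self, x):
        return self.net(x)

node_a, node_b = LocalEncoder(in_dim=10), LocalEncoder(in_dim=6)
head = nn.Linear(16 * 2, 2)  # host-side classifier over combined representations

# The same 8 samples, with their features split between the two nodes.
x_a, x_b = torch.randn(8, 10), torch.randn(8, 6)
combined = torch.cat([node_a(x_a), node_b(x_b)], dim=1)
logits = head(combined)  # in practice the representations would arrive via the chain
```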

Question Paraphrase Identification (QPI) is a critical task for large-scale Question-Answering forums. The purpose of QPI is to determine whether a given pair of questions is semantically identical or not. Previous approaches for this task have yielded promising results, but have often relied on complex recurrence mechanisms that are expensive and time-consuming in nature. In this paper, we propose a novel architecture combining a Bidirectional Transformer Encoder with Convolutional Neural Networks for the QPI task. We produce the predictions from the proposed architecture using two different inference setups: Siamese and Matched Aggregation. Experimental results demonstrate that our model achieves state-of-the-art performance on the Quora Question Pairs dataset. We empirically prove that the addition of convolution layers to the model architecture improves the results in both inference setups. We also investigate the impact of partial and complete fine-tuning and analyze the trade-off between computational power and accuracy in the process.
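A sketch contrasting the two inference setups named above, using a generic BERT encoder; the paper's added convolutional layers are omitted for brevity, and the similarity scoring in the Siamese branch is an illustrative assumption.

```python
import torch.nn.functional as F
from transformers import AutoModel, AutoTokenizer

tok = AutoTokenizer.from_pretrained("bert-base-uncased")
enc = AutoModel.from_pretrained("bert-base-uncased")

q1, q2 = "How do I learn Python?", "What is the best way to study Python?"

# Matched Aggregation: both questions in one sequence, so cross-attention
# between them happens inside the encoder; a classifier head would read [CLS].
joint = tok(q1, q2, return_tensors="pt")
cls_joint = enc(**joint).last_hidden_state[:, 0, :]

# Siamese: each question encoded independently by the shared encoder, then a
# similarity function (or a small classifier) compares the two embeddings.
e1 = enc(**tok(q1, return_tensors="pt")).last_hidden_state[:, 0, :]
e2 = enc(**tok(q2, return_tensors="pt")).last_hidden_state[:, 0, :]
similarity = F.cosine_similarity(e1, e2)
```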

In this paper, we present a study of the recent advancements which have helped bring Transfer Learning to NLP through the use of semi-supervised training. We discuss cutting-edge methods and architectures such as BERT, GPT, ELMo and ULMFiT, among others. Classically, tasks in natural language processing have been performed through rule-based and statistical methodologies. However, owing to the vast nature of natural languages, these methods do not generalise well and fail to learn the nuances of language. Thus machine learning algorithms such as Naive Bayes and decision trees, coupled with traditional models such as Bag-of-Words and N-grams, were used to overcome this problem. Eventually, with the advent of advanced recurrent neural network architectures such as the LSTM, we were able to achieve state-of-the-art performance in several natural language processing tasks such as text classification and machine translation. We talk about how Transfer Learning has brought about the well-known ImageNet moment to NLP.

TENCON 2019 - 2019 IEEE Region 10 Conference (TENCON)
The rise in the number of active online users has subsequently increased the number of cyber abuse incidents being reported as well. Such events pose a threat to the privacy and liberty of users in the digital space. Conventionally, manual moderation and reporting mechanisms have been used to ensure that no such text is present online. However, there have been some flaws in this method, including dependency on humans, increased delays and reduced data privacy. Previous approaches to automate this process have involved using supervised machine learning and traditional recurrent sequence models, which tend to perform poorly on non-English text. Given the rising diversity of users in the cyberspace, a flexible solution able to accommodate multilingual text is the need of the hour. Furthermore, text in colloquial languages often holds pertinent context and emotion that is lost after translation. In this paper, we propose a deep learning-based approach which involves the use of the bidirectional transformer-based BERT architecture for cyber abuse detection across English, Hindi and code-mixed Hindi-English (Hinglish) text. The proposed architecture achieves state-of-the-art results on the code-mixed Hindi dataset in the TRAC-1 standard aggression identification task, while also achieving very good results on the English task leaderboard. These results are achieved without using any ensemble-based methods or multiple models, and thus prove to be a better alternative to the existing approaches. Deep learning-based models which perform well on multilingual text can handle a broader range of inputs and thus can prove crucial in cracking down on such social evils.
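A generic fine-tuning sketch for multilingual aggression classification with a BERT encoder. The multilingual checkpoint is an illustrative stand-in (the paper does not name its exact checkpoint here); the three labels correspond to TRAC-1's aggression classes, and the example tweet is hypothetical.

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tok = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-multilingual-cased", num_labels=3)  # TRAC-1: OAG / CAG / NAG

# One code-mixed Hinglish example; the same pipeline handles English and Hindi.
batch = tok(["tum bahut bure ho yaar"], truncation=True, padding=True,
            return_tensors="pt")
labels = torch.tensor([1])

out = model(**batch, labels=labels)
out.loss.backward()  # out.loss drives fine-tuning; out.logits gives predictions
```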

2019 International Conference on Data Mining Workshops (ICDMW)
The amount of user-generated content in the cyberspace keeps increasing in the 21st century. However, it has also meant an increase in the number of cyber abuse and bullying incidents being reported. Use of profane text by individuals threatens the liberty and integrity of the digital space. Manual moderation and reporting mechanisms have traditionally been used to keep a check on such profane text. Dependency on human interpretation and delay in results have been the biggest obstacles in this system. Previous deep learning-based approaches to automate the process have involved the use of traditional convolution and recurrence based sequential models. However, these models tend to be computationally expensive and have higher memory requirements. Further, they tend to produce state-of-the-art results in binary classification but perform relatively poorly on multilabel tasks, owing to less flexibility in architecture. In today's world, classifying text in a binary way is no longer sufficient, and thus a flexible solution able to generalize well on multilabel text is the need of the hour. In this paper, we propose a multi-head attention-based approach for the detection of profane text. We couple our model with power-weighted average ensembling techniques to further improve the performance. The proposed approach has no additional memory requirement and is less complex compared to previous approaches. The improved results obtained by our model on publicly available real-world data further validate the same. Flexible, lightweight models which can handle multilabel text well can prove crucial in cracking down on social evils in the digital space.
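A small sketch of power-weighted average ensembling as named above: each member's probabilities are raised to a power before weighted averaging, which sharpens confident predictions. The exponent, weights and toy probabilities are illustrative assumptions, not the paper's tuned values.

```python
import numpy as np

def power_weighted_average(prob_list, weights, power=2.0):
    """prob_list: list of (n_samples, n_labels) probability arrays, one per model.
    Returns the power-weighted average of the members' label probabilities."""
    stacked = np.stack([w * (p ** power) for p, w in zip(prob_list, weights)])
    return stacked.sum(axis=0) / sum(weights)

# Two toy member models over two labels; for multilabel output, compare the
# ensembled scores against a per-label decision threshold.
p1 = np.array([[0.9, 0.2], [0.4, 0.7]])
p2 = np.array([[0.8, 0.1], [0.5, 0.6]])
ensembled = power_weighted_average([p1, p2], weights=[0.6, 0.4])
```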