Mohammad M. Masud

Followers

Following

Co-authors

Public Views

Abeer Al-Attar

Univ. of Technology

Ayse Kalemtas

Muğla Üniversitesi

Ajay Bhardwaj

SSBT, COET,North Maharashtra University, India

Mirosław J Kruszewski

Warsaw University of Technology

Haider Zaman

Taibah University, Madinah, Saudi Arabia

Mahmoud Elwaheidi

King Saud University

Evandro Nohara

Universidade de Taubaté, SP, Brazil

Pamies Teixeira

Universidade Nova de Lisboa

Nurfaizey Bin Abdul Hamid

Universiti Teknikal Malaysia Melaka

Gerhard Welsch

Case Western Reserve University

Interests

Uploads

Papers by Mohammad M. Masud

Classification and Adaptive Novel Class Detection of Feature-Evolving Data Streams

ABSTRACT Data stream classification poses many challenges to the data mining community. In this p... more ABSTRACT Data stream classification poses many challenges to the data mining community. In this paper, we address four such major challenges, namely, infinite length, concept-drift, concept-evolution, and feature-evolution. Since a data stream is theoretically infinite in length, it is impractical to store and use all the historical data for training. Concept-drift is a common phenomenon in data streams, which occurs as a result of changes in the underlying concepts. Concept-evolution occurs as a result of new classes evolving in the stream. Feature-evolution is a frequently occurring process in many streams, such as text streams, in which new features (i.e., words or phrases) appear as the stream progresses. Most existing data stream classification techniques address only the first two challenges, and ignore the latter two. In this paper, we propose an ensemble classification framework, where each classifier is equipped with a novel class detector, to address concept-drift and concept-evolution. To address feature-evolution, we propose a feature set homogenization technique. We also enhance the novel class detection module by making it more adaptive to the evolving stream, and enabling it to detect more than one novel class at a time. Comparison with state-of-the-art data stream classification techniques establishes the effectiveness of the proposed approach.

Network Packet Filtering and Deep Packet Inspection Hybrid Mechanism for IDS Early Packet Matching

2016 IEEE 30th International Conference on Advanced Information Networking and Applications (AINA), 2016

Examining The Effect of Feature Selection on Improving Patient Deterioration Prediction

International Journal of Data Mining & Knowledge Management Process, 2015

Large amount of heterogeneous medical data is generated every day in various healthcare organizat... more Large amount of heterogeneous medical data is generated every day in various healthcare organizations. Those data could derive insights for improving monitoring and care delivery in the Intensive Care Unit. Conversely, these data presents a challenge in reducing this amount of data without information loss. Dimension reduction is considered the most popular approach for reducing data size and also to reduce noise and redundancies in data. In this paper, we are investigate the effect of the average laboratory test value and number of total laboratory in predicting patient deterioration in the Intensive Care Unit, where we consider laboratory tests as features. Choosing a subset of features would mean choosing the most important lab tests to perform. Thus, our approach uses state-of-the-art feature selection to identify the most discriminative attributes, where we would have a better understanding of patient deterioration problem. If the number of tests can be reduced by identifying the most important tests, then we could also identify the redundant tests. By omitting the redundant tests, observation time could be reduced and early treatment could be provided to avoid the risk. Additionally, unnecessary monetary cost would be avoided. We apply our technique on the publicly available MIMIC-II database and show the effectiveness of the feature selection. We also provide a detailed analysis of the best features identified by our approach.

ICU Patient Deterioration Prediction : A Data-Mining Approach

Computer Science & Information Technology ( CS & IT ), 2015

A huge amount of medical data is generated every day, which presents a challenge in analysing the... more A huge amount of medical data is generated every day, which presents a challenge in analysing these data. The obvious solution to this challenge is to reduce the amount of data without information loss. Dimension reduction is considered the most popular approach for reducing data size and also to reduce noise and redundancies in data. In this paper, we investigate the effect of feature selection in improving the prediction of patient deterioration in ICUs. We consider lab tests as features. Thus, choosing a subset of features would mean choosing the most important lab tests to perform. If the number of tests can be reduced by identifying the most important tests, then we could also identify the redundant tests. By omitting the redundant tests, observation time could be reduced and early treatment could be provided to avoid the risk. Additionally, unnecessary monetary cost would be avoided. Our approach uses state-of-the-art feature selection for predicting ICU patient deterioration using the medical lab results. We apply our technique on the publicly available MIMIC-II database and show the effectiveness of the feature selection. We also provide a detailed analysis of the best features identified by our approach.

A Hybrid Model to Detect Malicious Executables

2007 Ieee International Conference on Communications, 2007

Mining concept-drifting data stream to detect peer to peer botnet traffic

We propose a novel stream data classification technique to detect Peer to Peer botnet. Botnet tra... more We propose a novel stream data classification technique to detect Peer to Peer botnet. Botnet traffic can be considered as stream data having two important properties: infinite length and drifting concept. Thus, stream data classification technique is more appealing to botnet detection than simple classification technique. However, no other botnet detection approaches so far have applied stream data classification technique. We propose a multi-chunk, multi-level ensemble classifier based data mining technique to classify concept-drifting stream data. Previous ensemble techniques in classifying concept-drifting stream data use a single data chunk to train a classifier. In our approach, we train an ensemble of v classifiers from r consecutive data chunks. K of these v-classifier ensembles are used to build another level of ensemble. By introducing this multi-chunk, multi-level ensemble, we significantly reduce error compared to the singlechunk, single level ensemble. We have established the justification of using our algorithm theoretically. We have also tested our technique on both botnet traffic and simulated data, and obtained better detection accuracies compared to other published works.

Mohammad M. Masud

Related Authors

Uploads

Papers by Mohammad M. Masud

Log In