A Comparison of Demographic Attributes Detection from Handwriting based on Traditional and Deep Learning Methods
Document Analysis and Recognition – ICDAR 2023 Workshops, 2023
Analyzing handwritten documents and detecting demographic attributes of writers from handwritten ... more Analyzing handwritten documents and detecting demographic attributes of writers from handwritten samples has received enormous attention from various fields of research, including psychology, computer science and artificial intelligence. Auto-matic detection of age, gender, handedness, nationality, and qualification of writers based on handwritten documents has several real-world applications, such as forensics and psychology. This paper proposes two simple but effective methods to detect the demographic information of writers from offline handwritten document images. The proposed methods are based on traditional and deep learning approaches. In the tradi-tional machine learning method, the Rank Transform feature extraction method is used for measuring the intensity in handwriting images. The extracted handcrafted features are then fed into Support Vector Machine based classifiers to predict the demographical attributes of writers. In the deep learning method, a Convolutional Neural Network model based on the ResNet architecture with a fully connected layer, followed by a softmax layer is used to provide probability scores to facilitate demographic infor-mation detection. To evaluate the proposed methods and compare the results with the results in the literature, a comprehensive set of experiments was conducted on a fre-quently used benchmark database, KHATT. Both methods performed relatively well in predicting different demographic attributes. However, considering the settings in our experiments, the results obtained from the traditional model indicated better demo-graphic detection compared to the deep learning models in all the tasks.
Uploads
Papers by Fahimeh Alaei
to obtain accurate recognition/classification is a crucial need in
document classification domain. The one-class classification is
chosen since only positive samples are available for the
training. In this paper, a new one-class classification method
based on symbolic representation method is proposed. Initially
a set of features is extracted from the training set. A set of
intervals valued symbolic feature vector is then used to
represent the class. Each interval value (symbolic data) is
computed using mean and standard deviation of the
corresponding feature values. To evaluate the proposed oneclass
classification method a dataset composed of 544
document images was used. Experiment results reveal that the
proposed one-class classification method works well even when
the number of training samples is small (≤10). Moreover, we
noted that the proposed one-class classification method is
suitable for document classification and provides better result
compared to one-class k-nearest neighbor (k-NN) classifier.
to obtain accurate recognition/classification is a crucial need in
document classification domain. The one-class classification is
chosen since only positive samples are available for the
training. In this paper, a new one-class classification method
based on symbolic representation method is proposed. Initially
a set of features is extracted from the training set. A set of
intervals valued symbolic feature vector is then used to
represent the class. Each interval value (symbolic data) is
computed using mean and standard deviation of the
corresponding feature values. To evaluate the proposed oneclass
classification method a dataset composed of 544
document images was used. Experiment results reveal that the
proposed one-class classification method works well even when
the number of training samples is small (≤10). Moreover, we
noted that the proposed one-class classification method is
suitable for document classification and provides better result
compared to one-class k-nearest neighbor (k-NN) classifier.