Applied Sciences: Gender Classification Using Sentiment Analysis and Deep Learning in A Health Web Forum
Applied Sciences: Gender Classification Using Sentiment Analysis and Deep Learning in A Health Web Forum
sciences
Article
Gender Classification Using Sentiment Analysis and
Deep Learning in a Health Web Forum
Sunghee Park and Jiyoung Woo *
Department of Future Convergence Technology, Soonchunhyang University, Asan-si, Korea; [email protected]
* Correspondence: [email protected]
Received: 15 February 2019; Accepted: 19 March 2019; Published: 25 March 2019
Featured Application: This work can be applied to detect the gender of online users who do not
disclose this information.
Abstract: Sentiment analysis is the most common text classification tool that analyzes incoming
messages and tells whether the underlying sentiment is positive, negative, or neutral. We can use
this technique to understand people by gender, especially people who are suffering from a sensitive
disease. People use health-related web forums to easily access health information written by and
for non-experts and also to get comfort from people who are in a similar situation. The government
operates medical web forums to provide medical information, manage patients’ needs and feelings,
and boost information-sharing among patients. If we can classify people’s emotional or information
needs by gender, age, or location, it is possible to establish a detailed health policy specialized
into patient segments. However, people with sensitive illness such as AIDS tend to hide their
information. Especially, in the case of sexually transmitted AIDS, we can detect problems and needs
according to gender. In this work, we present a gender detection model using sentiment analysis and
machine learning including deep learning. Through the experiment, we found that sentiment features
generate low accuracy. However, senti-words give better results with SVM. Overall, traditional
machine learning algorithms have a high misclassification rate for the female category. The deep
learning algorithm overcomes this drawback with over 90% accuracy.
Keywords: sentiment analysis; gender classification; machine learning; deep learning; medical
web forum
1. Introduction
Sentiment analysis is contextual mining of text that identifies and extracts subjective information
in source material, helping a business to understand the social sentiment of their brand, product,
or service while monitoring the online conversation. With recent advances in machine learning,
text mining techniques have improved considerably. Creative use of advanced artificial intelligence
techniques can be an effective tool for doing in-depth research.
Sentiment analysis of web content is becoming increasingly important due to augmented
communication through Internet sources such as e-mail, websites, forums, and chat rooms.
By collecting these articles and analyzing people’s emotions expressed in them, we can figure out
people’s feelings and opinions about polices, products, brands, and so on. Compared to traditional
surveys and other research techniques, this rich information can be obtained with less cost and
effort. People can search for particular information based on their individual needs. Patients,
patients’ significant others, or caregivers also use health-related web forums to get health and medical
information and to get comfort from people who are similar to themselves. They also ask questions
about the disease and find information that is easy to understand [1]. The medical forum also reflects
their feelings [2]. The government relies on web forums to act as a helpdesk to promote people’s
well-being. Some governments directly operate medical web forums to provide medical information,
get to know patients’ needs, manage their emotions, and help people share information [3]. Users
divulge some pieces of personal information when they sign up, and this information can be used to
establish national health care policies. However, for sensitive disease such as AIDS, patients tend not
to expose their personal information. In terms of health policy, it is important to know who is looking
for more information. If we can guess the information from the non-disclosed data, we can understand
who is suffering from a specific disease and what types of worries they might have. Especially, in
sensitive diseases, by understanding patients we can help them deal with their situation properly and
adapt to society. The analysis of AIDS patients’ communication could be useful from a government
perspective, too.
Categorizing and analyzing according to demographic information such as gender, age, and
region is essential to obtain information such as consumers’ emotions, values, and attitudes in all areas
of marketing. In particular, it is important to distinguish gender for detailed policy establishment
because the situation and necessity can be different by gender. It can also help users to see what topics
are most talked about by males and females, and what services are liked or disliked by men and
women [4]. Knowing this information is crucial for market intelligence because the information can be
used in targeted advertising and service improvement.
Our study is based on a real-life application of a web forum. In this study, we collect data from
the AIDS-related bulletin board at Healthboards.com [5], which is one of the top 20 health websites
according to Consumer Reports Health Web Watch. Under the premise that “Emotions expressed by
men and women will be different,” we propose a model that extracts words and emotions from text
messages and distinguishes the gender. By establishing a learning model using gender information,
we figure out unclassified gender information and identify the gender awareness of AIDS patients
by gender.
2. Related Works
In most previous studies on gender classification, the various features fed to machine learning
algorithms. At the Islamic Women’s political forum, Zhang et al. built a machine learning model that
classifies genders by properly combining the characteristics of vocabulary, syntax, structure, uni-gram,
and bi-grams [6]. In the study of Ryu et al. [7], they used logistic regression and SVM (a support
vector machine) to estimate the gender, age, and location of Twitter users. Wang et al. [8] suggested
research based on the Latent Dirichlet Allocation (LDA) model using text data from Facebook or
Twitter, in which women have been shown to use a lot of personal themes while men tend to post a lot
of philosophical or informative text. In the study of Na and Cho [9], they surveyed the emotions of
male and female college students and showed that gender can be distinguished by analyzing emotions
using Fastcluster.
In the study of Yan et al. [10], they presented a naïve Bayes classification approach to identify the
gender of weblog authors. They used weblog-specific features such as webpage background colors
and emoticons. They report an F-measure of around 64% using their features. Mukherjee and Liu. [11]
proposed two new techniques. They used their own POS Sequence mining algorithm and an Ensemble
feature selection technique, and achieved an accuracy of 88%.
Pennacchiotti and Popescu [12] proposed a user classification model applying machine learning
algorithm to the feature set including user profile, user tweeting behavior, linguistic content of user
messages, and user social network features. They explored the feature importance and found that
linguistic features are consistently effective in user classification.
Dwivedi et al. [13] present two systems, a manual feature extraction system and a deep learning
method, to automatically classify the gender of blog authors. For the deep-learning-based model,
they apply a Bidirectional Long Short-Term Memory Network. Barlte and Zheng [14] report an
Appl. Sci. 2019, 9, 1249 3 of 12
accuracy of 86% in gender classification on blog datasets by applying deep learning models based on
the Windowed Recurrent Convolutional Neural Network (WRCNN).
Filho et2019,
Appl. Sci. al. [15] proposed
9, x FOR textual meta-attributes, taking into account the characters, syntax,
PEER REVIEW words,
3 of 12
structure, and morphology of short, multi-genre, content-free text posted to Twitter to classify an
Filho et al. [15] proposed textual meta-attributes, taking into account the characters, syntax,
author’s gender via three different machine learning algorithms. The novel contribution of this work is
words, structure, and morphology of short, multi-genre, content-free text posted to Twitter to classify
to employ a word-based meta-attribute such as Ratio between hapax dislegomena (a word that appears
an author’s gender via three different machine learning algorithms. The novel contribution of this
only work
twiceisintoaemploy
whole atext) and themeta-attribute
word-based total numbersuch of words.
as RatioFurthermore,
between hapaxthey developed
dislegomena a textual
(a word
morphology
that appearsbased only ontwice
the meta-attributes from
in a whole text) and thetextual structures
total number such Furthermore,
of words. as the ratio between the number
they developed
of pronouns
a textual and the totalbased
morphology number of meta-attributes
on the words. This work from achieved 81% accuracy,
textual structures butratio
such as the the performance
between
by each
the class
number is not presented.
of pronouns andGaribo-Orts [16] of
the total number proposed
words. This a statistical approach
work achieved to the task
81% accuracy, butofthe
gender
performance by each class is not presented. Garibo-Orts [16] proposed a statistical
classification in tweets. The statistical features include skewness, kurtosis, and central moments; approach to the
thesetask of gender
statistical classification
features make thein tweets. Thealgorithm
learning statistical features include
context-free skewness,
and kurtosis, and central
language-independent.
moments; these statistical features make the learning algorithm context-free and language-
Recently, Convolutional Neural Networks (CNNs) have also been successful in various text
independent.
classification tasks. Kim [17] showed that a simple CNN with little hyper-parameter tuning and static
Recently, Convolutional Neural Networks (CNNs) have also been successful in various text
vectors achieves excellent results. Severyn et al. [18] proposed a deep learning model for Twitter sentiment
classification tasks. Kim [17] showed that a simple CNN with little hyper-parameter tuning and static
classification. They advanced
vectors achieves a CNNSeveryn
excellent results. model to et adjust
al. [18] word
proposedembedding using unsupervised
a deep learning model for Twitter learning.
Based
sentiment classification. They advanced a CNN model to adjust word embedding usingof the
on previous works, females and males are found to have differences in terms
vocabulary used and
unsupervised the emotional expression. During a literature review, we found some limitations
learning.
Based literature.
in the current on previousFirst, works, females
most andworks
recent males focus
are found to have or
on Twitter differences
weblogs.in Eventerms though
of the the
gendervocabulary
information used and the emotional
disclosed expression.
in the medical webDuring
forum,a literature review,
especially we foundtransmitted
on sexually some limitations
diseases,
in the current literature. First, most recent works focus on Twitter
is important, we hardly found any work dealing with medical web forums. Secondly, studies or weblogs. Even though theusing
gender information disclosed in the medical web forum, especially on sexually transmitted diseases,
sentiment features derived from text are lacking, while word features are explored a lot. Thirdly, deep
is important, we hardly found any work dealing with medical web forums. Secondly, studies using
learning is applied a lot to text classification, but its application to gender classification from textual
sentiment features derived from text are lacking, while word features are explored a lot. Thirdly,
information is rare.
deep learning is applied a lot to text classification, but its application to gender classification from
In this work,
textual information we aim to figure out how the sentiment features derived from emotions expressed
is rare.
in articlesInworkthis work, we aimclassification.
in gender to figure out howWethewill do this features
sentiment by using both from
derived machine learning
emotions and deep
expressed
learning methods.
in articles work in gender classification. We will do this by using both machine learning and deep
learning methods.
3. Experiment
3. Experiment
We retrieved the users’ gender information from an AIDS-related medical web forum and
presentedWe retrieved
a gender the users’
detection gender
model that information from based
classifies gender an AIDS-related medical
on the emotions web forum
expressed and and
in posts
presented a gender detection model that classifies gender based on the emotions expressed
comments. We developed the sentiment feature set that expresses how often the posts contain emotions,in posts
and comments. We developed the sentiment feature set that expresses how often the posts contain
and assessed the emotional complexity, which is the number of emotion categories shown in a post.
emotions, and assessed the emotional complexity, which is the number of emotion categories shown
Using vocabulary characteristics and emotions from the disclosed data, we built machine learning
in a post. Using vocabulary characteristics and emotions from the disclosed data, we built machine
models and measured
learning models andthe accuracy,
measured thevarying
accuracy,the feature
varying thesets to select
feature sets tothe bestthe
select model. The proposed
best model. The
framework
proposedis presented
framework in Figure 1. in Figure 1.
is presented
Figure Gender
1. 1.
Figure Genderclassification modelfor
classification model formedical
medical web
web forum.
forum.
Appl. Sci. 2019, 9, 1249 4 of 12
3.1. Dataset
From 14 November 2000 to 25 February 2010, we collected 3451 posts and 18,944 comments—in
total, 22,395 messages—from an AIDS-related bulletin board from HealthBoard.com. It is one
of the first independent health community websites and has more than 280 bulletin boards for
communication about diseases. As shown in Table 1, this dataset has 8954 male-written messages
and 6078 female-written messages. A total of 3282 users participated in the AIDS bulletin board,
with 849 female users and 1054 male users. This implies that 1379 users have not disclosed their
gender. The average post length is 113 words for men and 106 words for women. The percentage of
users who disclosed gender information is 59% for males and 41% for females; 33% of the total has no
gender information. As for the Alzheimer’s-related board, only 1708 (6.2%) out of 27,874 (posts: 2486,
comments: 25,134) have not disclosed their gender. The number of users in the Alzheimer’s bulletin
board is 1498. The number of female users is 924 and male users 185. In total, 1109 users have gender
information. Compared to the users in the Alzheimer’s-related board, users in the AIDS-related board
have a low ratio of disclosed gender information.
3.2.1. Unigram
We extracted the word features from the text to construct a learning model. The first step
was to derive words after preprocessing. We used the Bag-of-Words technique to tokenize each
word into a DocumentTermMatrix that indicates how many times the word appears in each article.
The Bag-of-Words technique transforms the sentence into a numeric vector as follows. First, the word
set is built by collecting words from all sentences in all documents. Then, a sentence is expressed as
the word counts in word order. For example, in “symptoms appear after at least two” and “it was two
years of unnecessary stress,” these two sentences generate the word set, {symptoms, appear, after, at,
least, two, it, was, years, of, unnecessary, stress}. Two sentences are expressed as the counts of each
word as {1,1,1,1,1,1,0,0,0,0,0,0} and {0,0,0,0,0,1,1,1,1,1,1,1}, respectively. The numeric vector is derived
for a document by aggregating the word count for all sentences in a document. The document term
matrix (DTM) is represented in Figure 2.
Appl. Sci. 2019, 9, 1249 5 of 12
Appl. Sci. 2019, 9, x FOR PEER REVIEW 5 of 12
Figure
Figure 2.
2. Representation
Representation of
of DocumentTermMatrix.
DocumentTermMatrix.
We extracted the
We extracted document term
the document term matrix
matrix in
in two
two ways,
ways, the
the word occurrence and
word occurrence and the
the occurrence
occurrence
frequency, to test which feature extraction method is efficient.
frequency, to test which feature extraction method is efficient.
To
To reduce the feature
reduce the feature set,
set, we
we set
set the
the threshold
threshold value
value to
to the
the number
number of
of appearances
appearances for
for extracting
extracting
word
word features.
features. WeWe incrementally
incrementally increased
increased the
the threshold
threshold value
value from
from 55 to
to 15
15 by
by 5s
5s and
and checked
checked the
the
classification performance.
classification performance.
3.2.2. Sentiment
Sentiment Features
Features
To derive
derive the
the sentiment
sentimentfeatures,
features,we weused
usedtidytext,
tidytext,which
which is is
a dictionary
a dictionary package
packagebuilt in R,
built in to
R,
measure
to measure thethe
raterate
of emotions.
of emotions. There are several
There dictionaries,
are several but we
dictionaries, butused
we usedNRCNRC [19] and
[19] BING.
and BING. The
NRC
The NRCdictionary has 10
dictionary hascategories: trust,trust,
10 categories: fear, fear,
negative, sadness,
negative, anger,
sadness, surprise,
anger, positive,
surprise, disgust,
positive, joy,
disgust,
and anticipation.
joy, and Based
anticipation. on on
Based this, wewe
this, developed
developed sentiment-related
sentiment-relatedfeatures:
features:thethenumber
numberof of emotion
emotion
types expressed in a message, objectivity (1-number of used used emotion
emotion types/total
types/total emotion
emotion (10)),
(10)), and
emotional complexity (the number of emotion types/10). types/10).
The Bing dictionary
dictionary classifies
classifieswords
wordsonly onlyinto
intopositive
positiveoror negative,
negative, andandhashas 6788
6788 words.
words. Based
Based on
on
thethe Bing
Bing dictionary,
dictionary, we we calculated
calculated thethe positive
positive raterate
(the(the number
number of positive
of positive words/the
words/the total
total number
number of
of words), negative rate (the number of negative words/the total number
words), negative rate (the number of negative words/the total number of words that contain emotions),of words that contain
emotions),
and the totaland the total
number number of words.
of words.
As shown in in Tables
Table 2 andand 3,Table
women 3, women
use the usewords the‘shine,’
words ‘thank,’
‘shine,’ ‘bless,’
‘thank,’and‘bless,’
‘glad’ and ‘glad’
(positive
(positive
words) and words) and ‘problem,’
‘problem,’ ‘scary,’ and‘scary,’
‘illness’and ‘illness’words)
(negative (negative
aboutwords)
twice about
as oftentwice as often
as men. On theas men.
other
On themen
hand, otheruse
hand,
the men
words use the words‘important,’
‘accurate,’ ‘accurate,’ ‘important,’ ‘receptive’
‘receptive’ (positive (positive
words) and words)
‘issue,’ and ‘issue,’
‘fever,’ and
‘fever,’ (negative
‘aches’ and ‘aches’ (negative
words) morewords) moreasthan
than twice often twice as often as woman.
as woman.
Word
Word Men
Men Women
Women
Shine
Shine 11 419
419
Thank
Thank 655
655 714
714
Glad
Glad 154
154 233
233
Bless 168 267
Bless 168 267
Accurate 350 166
Accurate
Important 350
196 16676
Important
Correct 196
149 7654
Correct 149 54
Table 3. Differences in negative words between men and women.
Table 3. Differences in negative words between men and women.
Word Men Women
Word Men Women
Problem 246 337
Problem
Scary 246
80 337147
Scary
Illness 80
144 147211
Issue
Illness 565
144 211179
Fever 358
Issue 565 179140
Aches 184 39
Fever 358 140
Aches 184 39
Appl. Sci. 2019, 9, 1249 6 of 12
P(c) P( F |c)
P(c| F ) = . (1)
P( F )
Based on the naïve Bayes assumption, the probability of belonging to each class is as follows:
n
P̂( F |c) = argmaxP(C = c|F) = argmaxP(C|c) ∏ P( f = f i |C = c). (3)
c c
i =1
The basic assumption is that all features are independent. This assumption is too strong, but it
works well in reality.
Figure3.3.Model
Figure Modelarchitecture
architecturewith
withananexample
examplesentence.
sentence.
To derive abstract features from input data and make the algorithm derive useful features
To derive abstract features from input data and make the algorithm derive useful features
automatically, we employ a hidden layer and a convolutional layer to the network.
automatically, we employ a hidden layer and a convolutional layer to the network.
In a convolution layer, we feed the input matrix to the filter of k*h size, then the input of h*k
In a convolution layer, we feed the input matrix to the filter of k*h size, then the input of h*k size
size is mapped to a cell in the convolutional layer. In Figure 2, the rectangle with a red line is a 3*k
is mapped to a cell in the convolutional layer. In Figure 2, the rectangle with a red line is a 3*k filter.
filter. The filter works as an aggregator of h words. The filter needs to keep the word meaning when
The filter works as an aggregator of h words. The filter needs to keep the word meaning when
aggregating words, so the horizontal size of the filter is set to k. As a result, the convolutional layer
is one-dimensional. The number of nodes in a convolution layer is calculated as n‒h + 1.
The vertical size of the filter can be varied. In this work, we test two sizes of filter, 3 (in red) and
8 (in yellow). The number of filters depends on the user’s decision. More filters are needed to enrich
features. The max pooling layer is also employed to avoid overfitting a result of too much features.
Appl. Sci. 2019, 9, 1249 8 of 12
aggregating words, so the horizontal size of the filter is set to k. As a result, the convolutional layer is
one-dimensional. The number of nodes in a convolution layer is calculated as n-h + 1.
The vertical size of the filter can be varied. In this work, we test two sizes of filter, 3 (in red) and 8
(in yellow). The number of filters depends on the user’s decision. More filters are needed to enrich
features. The max pooling layer is also employed to avoid overfitting a result of too much features.
The max pooling layer adopts a filter with the size of (n-h + 1)*1 and finds the maximum value for all
values in a precedent convolutional layer. The max pooling layer generates one-dimensional m rows.
Finally, m rows are fully connected to the output. The output layer has two nodes to express bi-class,
gender in our case. Two nodes are filled with 1 or 0, it indicates male or female respectively.
4. Results
Algorithm NB SVM RF
Real
Female Male Female Male Female Male
Female 174 (True positive: TP) 200 (False negative: FN) 81 42 402 413
Male 1679 (False positive: FP) 2456 (True negative: TN) 1723 2663 1451 2243
Precision 9.39% 92.47% 4.49% 98.45% 21.69% 84.45%
Recall 46.52% 59.40% 65.85% 60.72% 49.33% 60.72%
Accuracy 58.33% 60.86% 58.66%
False positive rate 40.60% 39.28% 39.28%
False negative rate 53.48% 34.15% 50.67%
The second experiment was performed using senti-word features. Instead of calculating sentiment
value from senti-word, we checked the senti-word occurrence. Among the 7954 words that appeared
more than five times each, we selected only 1594 senti-words as features that are in the NRC
dictionary. The accuracy increased a little bit with the SVM and RF algorithms, as shown in Table 5.
The precision for the female class improved. The senti-words used to express the author’s emotion
were comparatively effective for gender classification. Among the three algorithms, SVM had the
highest performance for the sentiment feature set and the senti-word feature set.
Appl. Sci. 2019, 9, 1249 9 of 12
NB SVM RF
Female Male Female Male Female Male
Female 1706 2434 701 236 514 210
Male 117 252 1122 2450 1309 2470
Precision 93.58% 9.38% 38.45% 91.21% 28.20% 92.16%
Recall 41.21% 68.29% 74.81% 68.59% 70.99% 65.36%
Accuracy 43.42% 69.88% 66.27%
False positive rate 31.71% 31.41% 34.64%
False negative rate 58.79% 25.19% 29.01%
The final experiment was performed using the entire feature set, combining word features and
sentiment features. When we assessed all 7954 words that occurred more than five times along with
the sentiment features, the accuracy improved. However, the performance of SVM rather decreased.
In this case, RF has the highest performance and the precision for the female class somewhat improved.
However, the female class still has a poor performance as shown in Table 6.
Table 6. The accuracy of gender detection using entire feature set (word features + sentiment features).
NB SVM RF
Female Male Female Male Female Male
Female 1838 2623 1 0 754 135
Male 14 34 1822 2686 1069 2551
Precision 99.24% 1.28% 0.05% 100.00% 41.36% 94.97%
Recall 41.20% 70.83% 100.00% 59.58% 84.81% 70.47%
Accuracy 41.52% 59.55% 73.33%
False positive rate 29.17% 40.42% 29.53%
False negative rate 58.80% 0.00% 15.19%
Based on the three experiments varying the feature set, we obtained two findings, one is
the algorithm perspective and the other is the feature perspective. We can conclude that SVM
underperforms other algorithms in the case of a few features. On the other hand, RF works well with
more features. In addition, we can conclude that NB is not appropriate for text classification task.
For feature perspective, words are good features for gender classification in the web forum. Types of
emotion and their density are not enough, although these sentiment features reduce the computational
complexity with a small number of features. However, when using with SVM, the senti-word feature
set generates the comparable performance to the word feature set.
Even though words are good features, we cannot employ all words because we need to consider
the computational complexity and cost to make the algorithm work. Machine learning algorithms
support a limited number of features, unlike the deep learning algorithm, and we need to reduce the
number of words included in the feature set. For feature reduction, we explored the threshold value to
extract words from 5 to 100, as shown in Table 7. The number of words involved in learning is displayed
at the end of Table 7. We also tested which feature extraction method among the occurrence with
boolean and the term frequency is useful. From the experiments results, the occurrence with boolean
outperforms the term frequency. In naïve Bayes, which assumes the independence of the variables,
the number of words decreases when setting a higher threshold value, and the performance improves.
SVM has the opposite behavior with NB. The best performance is achieved when applying the RF
model to the term frequency features with a threshold value of 10, resulting in 4822 words. However,
the performance of RF begins to decline beyond a certain number of features from the first two rows of
Table 7. From all the experiments, we could conclude that traditional machine learning algorithms are
Appl. Sci. 2019, 9, 1249 10 of 12
not good at gender classification, resulting in a large performance gap between two classes. In the next
section, we will see how much the deep learning algorithm improved the performance.
Female Male
CNN Structure Accuracy
Precision Recall Precision Recall
Conv + Pool + Dropout + FC 0.906 0.93 0.91 0.87 0.90
Conv + Pool + Conv + Pool + Dropout + FC 0.910 0.93 0.92 0.88 0.89
Conv + Pool + Conv + FC + Dropout + FC 0.905 0.94 0.89 0.86 0.92
Conv + Conv + Pool + FC + Dropout +FC 0.887 0.91 0.90 0.85 0.87
Appl. Sci. 2019, 9, 1249 11 of 12
Author Contributions: S.P. and performed the experiments and writing. J.W. collected the dataset and participated
in writing.
Funding: This work was supported by the research fund of Soonchunhyang University (project number 20180118)
and a National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIP; Ministry of
Science, ICT & Future Planning (NRF-2017R1D1A3B03036050)).
Acknowledgments: The authors gratefully acknowledge the support by the research fund of Soonchunhyang
University (project number 20180118) and a National Research Foundation of Korea (NRF) grant funded by the
Korean government (MSIP; Ministry of Science, ICT & Future Planning (NRF-2017R1D1A3B03036050)).
Conflicts of Interest: The authors have no conflicts of interests.
References
1. Weaver, J.B., III; Mays, D.; Weaver, S.S.; Hopkins, G.L.; Eroğlu, D.; Bernhardt, J.M. Health information–seeking
behaviors, health indicators, and health risks. Am. J. Public health 2010, 100, 1520–1525. [CrossRef] [PubMed]
2. Woo, J.; Lee, M.J.; Ku, Y.; Chen, H. Modeling the dynamics of medical information through web forums in
medical industry. Technol. Forecast. Soc. Chang. 2015, 97, 77–90. [CrossRef]
3. Denecke, K.; Nejdl, W. How valuable is medical social media data? Content analysis of the medical web.
Inf. Sci. 2009, 179, 1870–1880. [CrossRef]
4. Sullivan, C.F. Gendered cybersupport: A thematic analysis of two online cancer support groups. J. Health
Psychol. 2003, 8, 83–104. [CrossRef] [PubMed]
5. Healthboard. Available online: https://www.healthboards.com/ (accessed on 25 March 2019).
Appl. Sci. 2019, 9, 1249 12 of 12
6. Zhang, Y.; Dang, Y.; Chen, H. Gender classification for web forums. IEEE Trans. Syst. Man Cybern. Part A
Syst. Hum. 2011, 41, 668–677. [CrossRef]
7. Ryu, K.; Jeong, J.; Moon, S. Inferring Sex, Age, Location of Twitter Users. J. KIISE 2014, 32, 46–53.
8. Wang, Y.-C.; Burke, M.; Kraut, R.E. Gender, topic, and audience response: An analysis of user-generated
content on facebook. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems,
Paris, France, 27 April–2 May 2013; pp. 31–34.
9. Na, Y.; Cho, G. Grouping preferred sensations of college students using sementic differential methods of
sensation words. Korean J. Sci. Emot. Sensib. 2002, 5, 9–16.
10. Yan, X.; Yan, L. Gender Classification of Weblog Authors. In Proceedings of the AAAI Spring Symposium:
Computational Approaches to Analyzing Weblogs, Palo Alto, CA, USA, 27–29 March 2006; pp. 228–230.
11. Mukherjee, A.; Liu, B. Improving gender classification of blog authors. In Proceedings of the 2010
Conference on Empirical Methods in Natural Language Processing, Cambridge, MA, USA, 9–11 October
2010; pp. 207–217.
12. Pennacchiotti, M.; Popescu, A.-M. A machine learning approach to twitter user classification. In Proceedings
of the Fifth International AAAI Conference on Weblogs and Social Media, Barcelona, Spain, 17–21 July 2011.
13. Dwivedi, V.P.; Singh, D.K.; Jha, S. Gender Classification of Blog Authors: With Feature Engineering and Deep
Learning using LSTM Networks. In Proceedings of the 2017 Ninth International Conference on Advanced
Computing (ICoAC), Chennai, India, 14–16 December 2017; pp. 142–148.
14. Bartle, A.; Zheng, J. Gender Classification with Deep Learning; Stanford cs224d Course Project Report;
The Stanford NLP Group: Stanford, CA, USA, 2015.
15. Lopes Filho, J.A.B.; Pasti, R.; de Castro, L.N. Gender classification of twitter data based on textual meta-attributes
extraction. In New Advances in Information Systems and Technologies; Springer: Berlin/Heidelberg, Germany,
2016; pp. 1025–1034.
16. Garibo-Orts, O. A big data approach to gender classification in twitter. In Proceedings of the Ninth
International Conference of the CLEF Association (CLEF 2018), Avignon, France, 10–14 September 2018.
17. Kim, Y. Convolutional neural networks for sentence classification. arXiv 2014, arXiv:1408.5882.
18. Severyn, A.; Moschitti, A. Unitn: Training deep convolutional neural network for twitter sentiment
classification. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015),
Denver, CO, USA, 4–5 June 2015; pp. 464–469.
19. Mohammad, S.M. Challenges in sentiment analysis. In A Practical Guide to Sentiment Analysis; Springer:
Berlin/Heidelberg, Germany, 2017; pp. 61–83.
20. Nayak, J.; Naik, B.; Behera, H. A comprehensive survey on support vector machine in data mining tasks:
Applications & challenges. Int. J. Database Theory Appl. 2015, 8, 169–186.
21. Zhang, Y.D.; Dong, Z.; Chen, X.; Jia, W.; Du, S.; Muhammad, K.; Wang, S.H. Image based fruit category
classification by 13-layer deep convolutional neural network and data augmentation. Multimedia Tools Appl.
2019, 78, 3613. [CrossRef]
22. Wang, S.H.; Sun, J.; Phillips, P.; Zhao, G.; Zhang, Y.D. Polarimetric synthetic aperture radar image
segmentation by convolutional neural network using graphical processing units. J. Real-Time Image Process.
2018, 15, 631. [CrossRef]
23. word2Vec. Available online: https://code.google.com/archive/p/word2vec/ (accessed on 25 March 2019).
24. Kingma, D.P.; Ba, J.L. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980v9.
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).