Professional Documents
Culture Documents
Especially when recurrent and with today’s world live with their smartphones as
may become a serious health condition. It electronic social media networks that alert
can cause the affected person to suffer users to updates on friends, favourite
greatly and function poorly at work, at celebrities, and global events. Social media
school and in the family. At its worst, has become firmly integrated into a lot of
depression can lead to suicide. Over 700 000 people’s daily lives.Billions of people
people die due to suicide every year. Suicide around the world use social media to share
is the fourth leading cause of death in 15-29- information and make connections. On a
Social media is an internet-based form of new things, develop your interests, and be
Social networks have been offer a potential new way to uncover and get
developed as a great point for its users to help for those suffering from depression.
Variations for the Detection of represented close to each other and far from
the representations of terms that are
Substance Abuse and Mental
predictive for the remaining classes. These
Health Issues on Social Media
vectors are obtained by extending
Writings Word2vec’s objective function, which takes
Author:Diana Ramírez-Cifuentes; Christine those predictive terms as inputs.
Largeron Dataset
Year:2021 The data collected for experimental
Doi:10.1109/ACCESS.2021.3112102 purposes was gathered from a group of
Problem Identified selected subreddits, which are forum
Substance abuse and mental health issues communities of users on Reddit that are
are severe conditions that affect millions. often focused on a specific subject of
Signs of certain conditions have been traced discussion. This is a suitable data source
on social media through the analysis of posts. because it is likely to contain posts of users
Objective living with a given mental disorder. For
instance, the depression subreddit contains
posts of users with Depression, and from detection models are not powerful enough to
people that give advice and support to others. capture critical sentiment information from
Findings the large volume of posts published by each
Results show that variations of our enhanced user, which makes the performance of these
representations outperform in Recall, models not satisfying.
Accuracy, and F1-Score the embedding’s Objective
learned with Word2vec, DistilBERT, To address this problem, The author
GloVe ’s fine-tuned pre-learned embedding proposes a hierarchical posts representations
and other methods based on domain adapted model named Multi-Gated LeakyReLU
embedding. The approach presented has the CNN (MGL-CNN) for identifying depressed
potential to be used on similar binary or individuals in online forums. The model
multi-class classification tasks that deal with consists of two parts: the first one is a post-
small domain-specific textual corpora. level operation, which is used to learn the
representation of each post of the user, and
4. MGL-CNN: A Hierarchical Posts the second one is a user-level operation,
Representations Model for which is used to obtain the overall
Identifying Depressed Individuals in representation of the user's emotional state.
Online Forums Besides, propose another depression
Author: Guozheng Rao; Yue Zhang detection model by changing the number of
Year:2020 gated units in the MGL-CNN, which is
Doi: 10.1109/ACCESS.2020.2973737 named Single-Gated LeakyReLU CNN
Problem Identified (SGL-CNN).
More users suffering from depression turn to Methodology
online forums to express their problems and The author introduced two hierarchical
seek help. In such forums, there is often a neural network models with gated units and
large volume of posts with sensitive content, convolutional networks for fulfilling
indicating that the user has a risk of suicide depression detection task, which are named
and self-harm. Early detection of depression Multi-Gated LeakyReLU CNN (MGL-CNN)
using appropriate deep learning models and and Single-Gated LeakyReLU CNN (SGL-
social media data can prevent potential self- CNN). The user’s dataset is divided into a
harm. However, existing depression certain number of posts, and we can use our
models to identify the genuinely crucial Detection of Depression
sentiment features of each user’s posts and Indications in Text Sequences
suppress other unimportant information as
Author: Marcel Trotzek; Sven Koitka
possible. The proposed models can encode
Year:2020
the relations between posts in user
Doi: 10.1109/TKDE.2018.2885515
representation. It consists of two parts: the
Problem Identified
first one is a post-level operation, which is
Depression is ranked as the largest
used to learn the representation of the user’s
contributor to global disability and is also a
every post, and the second one is a user-
major reason for suicide. Still, many
level operation, which can obtain the overall
individuals suffering from forms of
representation of the user’s emotional state.
depression are not treated for various
The traditional convolutional neural network
reasons. Previous studies have shown that
is weak in identifying crucial depression
depression also has an effect on language
features. According to this situation, we add
usage and that many depressed individuals
gated units to improve the performance of
use social media platforms or the internet in
this task dramatically.
general to get information or discuss their
Dataset
problems.
Experiments are conducted based on the
Objective
Reddit Self-Reported Depression Diagnosis
This paper addresses the early detection of
(RSDD) dataset and the Early Detection of
depression using machine learning models
Depression dataset (eRisk 2017).
based on messages on a social platform. In
Findings
particular, a convolutional neural network
Experimental results showed that our
based on different word embedding is
models performed better than the previous
evaluated and compared to a classification
state-of-the-art models on the Reddit Self-
based on user-level linguistic metadata. An
Reported Depression Diagnosis dataset, and
ensemble of both approaches is shown to
also performed well on the Early Detection
achieve state-of-the-art results in a current
of Depression dataset.
early detection task.
Methodology
5. Utilizing Neural Networks and Neural word embeddings have become a
Linguistic Metadata for Early popular and efficient way to model words
and interactions between them for purposes performance on the eRisk 2017 task for
like text classification tasks. They date back comparison to previously published results
to the concept of distributed word and among these models, future work will
representations that, in contrast to local have to show how these models perform on
representations, do not handle each word yet unseen data.
separately with a single neuron, but use
several neurons to represent a word and let 6. A Novel Co-Training-Based
each neuron be part of the description of Approach for the Classification
several words. This enables distributed
of Mental Illnesses Using Social
representations to learn general concepts of
Media Posts
language instead of just independent word
Author:Subhan Tariq; Nadeem Akhtar
representations.
Year:2019
Dataset
Doi:10.1109/ACCESS.2019.2953087
The dataset utilized in all experiments for
Problem Identified
this paper was first described in 2016 for
Traditional methods either need enough
research on depression and language use and
historic data or to keep the regular
then finally published as part of the CLEF
monitoring on patient activities for
2017 conference eRisk pilot task on early
identification of a patient associated with a
detection of depression. It contains
mental illness disease.
chronological sequences of posts and
Objective
comments from reddit.com, collected for a
In order to address this issue, we propose a
total of 135 depressed users and a random
methodology to classify the patients
control group of 752 users.
associated with chronic mental illness
Findings
diseases (i.e. Anxiety, Depression, Bipolar,
The analysis of the resulting word vectors
and ADHD (Attention Deficit Hyperactivity
has shown that the model has learnt some
Disorder) based on the data extracted from
features specific to this domain and is viable
the Reddit, a well-known network
for general syntactic questions in the English
community platform.
language as shown based on the standard
Methodology
word analogy task. As the results presented
in this paper are optimized to obtain the best
The proposed method is employed through reason for suicide. It has an impact on the
Co-training (type of semi-supervised language usage reflected in the written text.
learning approach) technique by The key objective of our study is to examine
incorporating the discriminative power of Reddit users' posts to detect any factors that
widely used classifiers namely Random may reveal the depression attitudes of
Forrest (RF), Support Vector Machine relevant online users.
(SVM), and Naïve Bayes (NB).
Dataset
We used Reddit API to download posts and
top five associated comments for Objective
construction of a feature space. For such purpose, we employ the Natural
Findings Language Processing (NLP) techniques and
The experimental results indicate the machine learning approaches to train the
effectiveness of Co-training based data and evaluate the efficiency of our
classification rather than the state of the art proposed method. We identify a lexicon of
classifiers by a margin of 3% on average in terms that are more common among
par with every state of art technique. In depressed accounts.
future, the proposed method could be Methodology
employed to investigate any classification The author uses the NLP tools to pre-
problem of any domain by extracting date process the dataset before it is proceeded to
from the social media. the feature selection and training stage. First,
use tokenization to divide the posts into
7. Detection of Depression-Related individual tokens. Next, remove all the
Posts in Reddit Social Media URLs, punctuations and stop words which
could lead into erratic results if stay
Forum
ignored.For n-gram modelling we use the
Author:Michael M. Tadesse; Hongfei Lin
Term frequency inverse document frequency
Year:2019
(TF-IDF) as a numeric statistic where the
Doi:10.1109/ACCESS.2019.2909180
importance of a word with respect to each
Problem Identified
document in corpora is highlighted. The
Depression is viewed as the largest
proposed framework is developed by using
contributor to global disability and a major
Logistic Regression, Support Vector Authors: Ryan S. McGinnis; Ellen W.
Machine, Random Forest, Adaptive McGinnis; Jessica Hruschak; Nestor L.
Boosting and Multilayer Perceptron Lopez-Duran
classifier. Year:2018
Dataset Doi:
The data corpus contains depression- https://ieeexplore.ieee.org/document/851332
indicative posts (1293) and standard posts 7
(548). Depression-indicative posts are Objective
collected from relatively large subedits This paper presents a new approach for
devoted to depression, where depressed diagnosing anxiety and depression in young
users seek support from an online children. The author proposes the use of a
community. Standard posts written by non- 90-second fear induction task during which
depressed users are collected from subedits time participant motion is monitoring using
related to a family or friends. a commercially available wearable sensor.
Findings Methodology
The results show that our proposed method Machine learning and data extracted from
can significantly improve performance the most clinically feasible 20-second phase
accuracy. The best single feature is bigram of the task are used to predict diagnosis in a
with the Support Vector Machine (SVM) sample of children with and without an
classifier to detect depression with 80% internalizing diagnosis. A supervised
accuracy and 0.80 F1 scores.According to learning approach was used to create binary
our study, better performance improvement classification models that relate features
can be achieved by proper feature selections from the inertial measurement unit (IMU),
and their multiple feature combinations. derived signals to internalizing diagnosis.
Dataset
8. Rapid Anxiety and Depression Data were collected from 63 children (57%