Professional Documents
Culture Documents
Machine Learning Compared To Deep Learning - Alexandrescu - Tudor - Filote - Cosmin PDF
Machine Learning Compared To Deep Learning - Alexandrescu - Tudor - Filote - Cosmin PDF
compared to Deep
Learning - algorithms
Filote Cosmin
Tudor Alexandrescu
The Dataset
A wonderful little
It has 50K movie reviews for natural 1 production. <br /><br positive
/>The...
language processing or Text analytics. I thought this was a
2 wonderful way to spend positive
This is a dataset for binary sentiment ti...
Basically there's a family
classification containing substantially 3
where a little boy ...
negative
Encouraged by the
8 positive comments about negative
this...
If you like original gut
9 positive
wrenching laughter yo...
The Dataset
The problem is that we need to do some cleanup before testing our models. The text is
full of html tags. So we will eliminate them. Eliminate the numbers, bring everything to
lower case and lemmatize the text. After that we can see some interesting graphs.
Graphs – most used words
The problem is : how to classify if a review is positive or negative ? But beforewe need
to prepare our data for training. For the sentiment we will convert positive to 1 and
negative to 0. But for the text ( the review ) we will use TF-IDF .
Logistic Regression – we
obtained a high accuracy from
the start. We only chose the
random state parameter.
ML – Models
In general all ML Algorithms that we used performed well (only Random Forest
Classifier was poor). The average accuracy was over 80%. In our case it is ok but there
might be other cases (like medical ones) where an accuracy of 80% is a very poor one.
DL - Models
The used shallow algorithms (machine learning algorithms) generated (most of the
time) very good results related to the global accuracy (on each class). This is an
interesting fact given the complexity of the used algorithms.
Related to the deep learning algorithms, the obtained results are very similar.
Moreover, in each case there is a tendency to overfitting. While we used different
embedding styles and network architectures, we were not able to obtain a global
accuracy above 90% (on the test sample), as happened when using machine learning
algorithms.