You are on page 1of 5

Journal of Engineering Sciences Vol 13 Issue 12,2022

DRUG Recommendation System Based on Sentiment Analysis of


DRUG Reviews Using Machine Learning

B. LOKESWARA NAYAK1, N. LAKSHMI TULASI 2


1
M.Tech PG Scholar, Department of CSE, P.N.C. & Vijai Institute of Engineering & Technology,
Repudi(v), Phirangipuram(m), Guntur, Ap, India.
2
M.Tech, Assoc Professor, Department of CSE , P.N.C. & Vijai Institute of Engineering &
Technology, Repudi(v), Phirangipuram(m), Guntur, Ap, India.

ABSTRACT: In today’s digital era healthcare is one among the major core areas of the medical domain.
People trying to find suitable health-related information that they are concerned with. The Internet could
be a great resource for this kind of data, however you need to take care to avoid getting harmful
information. Nowadays, a colossal quantity of clinical information dispersed totally across different
websites on the Internet prevents users from finding useful information for their well-being improvement.
Errors in medication are one of the foremost severe medical faults that would be a threat to patients’ lives.
These problems increases the requirement to use recommendation systems within the domain of
healthcare to assist users creates additional economical and correct health-related decisions. In this
research, we build a medicine recommendation system that uses patient reviews to predict the sentiment
using various victimization processes like Bow, TF-IDF, Word2Vec, and Manual Feature Analysis,
which can help recommend the top drug for a given disease by different classification algorithms. The
predicted sentiments were evaluated by precision, recall, f1score, accuracy, and AUC score. The results
show that classifier Linear SVC using TF-IDF victimization outperforms all other models with
93%accuracy.

1. INTRODUCTION study comes up with accompanying more drugs,


With the number of coronavirus cases growing tests, accessible for clinical staff every day.
exponentially, the nations are facing a shortage Accordingly, it turns out to be progressively
of doctors, particularly in rural areas where the challenging for doctors to choose which
quantity of specialists is less compared to urban treatment or medications to give to a patient
areas. A doctor takes roughly 6 to 12 years to based on indications, past clinical history. With
procure the necessary qualifications. Thus, the the exponential development of the web and the
number of doctors can’t be expanded quickly in web-based business industry, item reviews have
a short time frame. A Telemedicine framework become an imperative and integral factor for
ought to be energized as far as possible in this acquiring items worldwide. Individuals
difficult time [1]. Clinical blunders are very worldwide become adjusted to analyze reviews
regular nowadays. Over 200 thousand and websites first before settling on a choice to
individuals in China and 100 thousand in the buy a thing. While most of past exploration
USA are affected every year because of zeroed in on rating expectation and proposals on
prescription mistakes. Over 40% medicine, the E-Commerce field, the territory of medical
specialists make mistakes while prescribing care or clinical therapies has been infrequently
since specialists compose the solution as taken care of. There has been an expansion in
referenced by their knowledge, which is very the number of individuals worried about their
restricted [2][3]. Choosing the toplevel well-being and finding a diagnosis online. As
medication is significant for patients who need demonstrated in a Pew American Research
specialists that know wide-based information center survey directed in 2013 [5], roughly 60%
about microscopic organisms, antibacterial of grown-ups searched online for health-related
medications, and patients [6]. Every day a new subjects, and around 35% of users looked for

ISSN:0377-9254 www.jespublication.com Page 218


Journal of Engineering Sciences Vol 13 Issue 12,2022

diagnosing health conditions on the web. A assess the adequacy of the suggested treatment.
medication recommender framework is truly This structure can prescribe the best treatment
vital with the goal that it can assist specialists regimens to new patients as per their
and help patients to build their knowledge of demographic locations and medical
drugs on specific health conditions. A complications. An Electronic Medical Record
recommender framework is a customary system (EMR) of patients gathered from numerous
that proposes an item to the user, dependent on clinics for testing. The result shows that this
their advantage and necessity. These framework improves the cure rate.
frameworks employ the customers’ surveys to In this research [11], multilingual sentiment
break down their sentiment and suggest a analysis was performed using Naive Bayes and
recommendation for their exact need. In the drug Recurrent Neural Network (RNN). Google
recommender system, medicine is offered on a translator API was used to convert multilingual
specific condition dependent on patient reviews tweets into the English language. The results
using sentiment analysis and feature exhibit that RNN with 95.34% outperformed
engineering. Sentiment analysis is a progression Naive Bayes, 77.21%.
of strategies, methods, and tools for The study [12] is based on the fact that the
distinguishing and extracting emotional data, recommended drug should depend upon the
such as opinion and attitudes, from language [7]. patient’s capacity. For example, if the patient’s
On the other hand, Featuring engineering is the immunity is low, at that point, reliable medicines
process of making more features from the ought to be recommended. Proposed a risk level
existing ones; it improves the performance of classification method to identify the patient’s
models. immunity. For example, in excess of 60 risk
factors, hypertension, liquor addiction, and so
2. LITERATURE SURVEY forth have been adopted, which decide the
These days, recommender frameworks are very patient’s capacity to shield himself from
regular in the travel industry, e-commerce, infection. A web-based prototype system was
restaurant, and so forth. Unfortunately, there are also created, which uses a decision support
a limited number of studies available in the field system that helps doctors select first-line drugs.
of drug proposal framework utilizing sentiment Xiaohong Jiang et al. [13] examined three
analysis on the grounds that the medication distinct algorithms, decision tree algorithm,
reviews are substantially more intricate to support vector machine (SVM), and
analyze as it incorporates clinical wordings like backpropagation neural network on treatment
infection names, reactions, a synthetic names data. SVM was picked for the medication
that used in the production of the drug [8]. proposal module as it performed truly well in
The study [9] presents GalenOWL, a semantic- each of the three unique boundaries - model
empowered online framework, to help exactness, model proficiency, model versatility.
specialists discover details on the medications. Additionally, proposed the mistake check
The paper depicts a framework that suggests system to ensure analysis, precision and
drugs for a patient based on the patient’s administration quality.
infection, sensitivities, and drug interactions. For Mohammad Mehedi Hassan et al. [14]
empowering GalenOWL, clinical data and developed a cloudassisted drug proposal
terminology first converted to ontological terms (CADRE). As per patients’ side effects, CADRE
utilizing worldwide standards, such as ICD-10 can suggest drugs with top-N related
and UNII, and then correctly combined with the prescriptions. This proposed framework was
clinical information. initially founded on collaborative filtering
Leilei Sun [10] examined large scale treatment techniques in which the medications are initially
records to locate the best treatment prescription bunched into clusters as indicated by the
for patients. The idea was to use an efficient functional description data. However, after
semantic clustering algorithm estimating the considering its weaknesses like computationally
similarities between treatment records. costly, cold start, and information sparsity, the
Likewise, the author created a framework to model is shifted to a cloud-helped approach

ISSN:0377-9254 www.jespublication.com Page 219


Journal of Engineering Sciences Vol 13 Issue 12,2022

using tensor decomposition for advancing the surveys to break down their sentiment and
quality of experience of medication suggestion. suggest a recommendation for their exact need.
Considering the significance of hashtags in In the drug recommender system, medicine is
sentiment analysis, offered on a specific condition dependent on
Jiugang Li et al. [15] constructed a hashtag patient reviews using sentiment analysis and
recommender framework that utilizes the skip- feature engineering. Sentiment analysis is a
gram model and applied convolutional neural progression of strategies, methods, and tools for
networks (CNN) to learn semantic sentence distinguishing and extracting emotional data,
vectors. These vectors use the features to such as opinion and attitudes. On the other hand,
classify hashtags using LSTM RNN. Results Featuring engineering is the process of making
depict that this model beats the conventional more features from the existing ones; it
models like SVM, Standard RNN. This improves the performance of models.
exploration depends on the fact that it was .ADVANTAGES OF PROPOSED SYSTEM
undergoing regular AI methods like SVM and The system is more effective since it presents
collaborative filtering techniques; the semantic the proposed algorithm used in natural language
features get lost, which has a vital influence in processing responsible for counting the number
getting a decent expectation. of times of all the tokens in review or document.
3. CURRENT SYSTEM The system has exact sentiment analysis
Recommender frameworks point to supply prediction techniques for Data Cleaning and
clients with personalized stock and repair to Visualization
alter the expanding online information over-
burden drawback. Various recommender frame 5. SYSTEM ARCHITECTURE:
work methods are anticipated since the
mid1990s, and numerous shapes of
recommender framework code were created as
of late for a spread of applications. The health-
related substance shared through on-line
feedbacks or surveys contains covered up
assumption designs that emerges through totally
distinctive sources from medical world which
offer benefits to the pharmaceutical industry.
Amid this, the on-line component is fantastically
standard of late for online looking, diverse stock
through distinctive websites like on-line buying
of drugs at entryway step. Numerous websites
and blogs offers clients to rate their stock with
their fulfillment and quality of stock, logistics,
Fig 1 Architecture Diagram
administrations and criticism etc., which the
clients examines for a particular medicine or on
6. IMPLEMENTATION
quality of administration.
DATA OWNER:
In the existing work, the system did not
In this module, initially the data owner has to
implement an exact sentiment analysis for large
register to the cloud server and get authorized.
data sets.
After the authorization from cloud data owner
This system is less performance due to lack Data
will encrypt and add file to the cloud server
Classification and Data Fragmentation technique.
where in after the addition of file data owner
View All Uploaded Files, View All
4. PROPOSED SYSTEM
Transactions.
A recommender framework is a customary
REMOTE SERVER
system that proposes an item to the user,
The remote server manages a cloud to provide
dependent on their advantage and necessity.
data storage service. Data owners encrypt their
These frameworks employ the customers’

ISSN:0377-9254 www.jespublication.com Page 220


Journal of Engineering Sciences Vol 13 Issue 12,2022

data files and store them in the cloud for sharing Future work involves comparison of different
with cloud End users and performs the following oversampling techniques, using different values
operations such as View All Owners and of n-grams, and optimization of algorithms to
Authorize improve the performance of the recommender
,View All Users and Authorize ,View All Cloud system.
Files ,View All Transactions
,View All Attackers ,View File Score Results BIBILOGRAPHY
,View Time Delay Results ,View Throughput [1] Telemedicine,
Results https://www.mohfw.gov.in/pdf/Telemedicine.pd
AUTHENTICATE SERVER f
CA generates the content key and the secret key [2] Wittich CM, Burkle CM, Lanier WL.
requested by the end user and also Medication errors: an overviewfor clinicians.
View All Attackers. Mayo Clin Proc. 2014 Aug;89(8):1116-25.
CLEINT [3] CHEN, M. R., & WANG, H. F. (2013). The
User has to register and login for accessing the reason and prevention ofhospital medication
files in the cloud. User is authorized by the errors. Practical Journal of Clinical Medicine.
cloud to verify the registration. User has to View [4] Drug Review Dataset,
All Files Download. https://archive.ics.uci.edu/ml/datasets/Drug%2B
Review%2BDataset%2B%2528Drugs.com%252
7. CONCLUSION 9#
Whether go for shopping, purchase something [5] Fox, Susannah, and Maeve Duggan. ”Health
online or go to some restaurant, we first check online 2013. 2013.”
the reviews to make the right decisions. URL:http://pewinternet.org/Reports/2013/Health
Motivated by this, in this research sentiment -online.aspx
analysis of drug reviews was studied to build a [6] Bartlett JG, Dowell SF, Mandell LA, File
recommender system using different types of TM Jr, Musher DM, FineMJ. Practice guidelines
machine learning classifiers, such as Logistic for the management of community-
Regression, Perceptron, Multinomial Naive acquiredpneumonia in adults. Infectious
Bayes, Ridge classifier, Stochastic gradient Diseases Society of America. Clin InfectDis.
descent, LinearSVC, applied on Bow, TF-IDF, 2000 Aug;31(2):347-82. doi: 10.1086/313954.
and classifiers such as Decision Tree, Random Epub 2000 Sep 7.PMID: 10987697; PMCID:
Forest, Lgbm, and Catboost were applied on PMC7109923.
Word2Vec and Manual features method. We [7] Fox, Susannah & Duggan, Maeve. (2012).
evaluated them using five different metrics, Health Online 2013. PewResearch Internet
precision, recall, f1score, accuracy, and AUC Project Report.
score, which reveal that the Linear SVC on TF- [8] T. N. Tekade and M. Emmanuel,
IDF outperforms all other models with 93% ”Probabilistic aspect mining approachfor
accuracy. On the other hand, the Decision tree interpretation and evaluation of drug reviews,”
classifier on Word2Vec showed the worst 2016 InternationalConference on Signal
performance by achieving only 78% accuracy. Processing, Communication, Power and
We added best-predicted emotion values from Embedded System (SCOPES), Paralakhemundi,
each method, Perceptron on Bow (91%), 2016, pp. 1471-1476,
LinearSVC on TF-IDF (93%), LGBM on doi:10.1109/SCOPES.2016.7955684.
Word2Vec (91%), Random Forest on manual [9] Doulaverakis, C., Nikolaidis, G., Kleontas,
features (88%), and multiply them by the A. et al. GalenOWL:Ontology-based drug
normalized usefulCount to get the overall score recommendations discovery. J Biomed Semant
of the drug by condition to build a recommender 3,14 (2012). https://doi.org/10.1186/2041-1480-
system. 3-14
[10] Leilei Sun, Chuanren Liu, Chonghui Guo,
8. FUTURE ENHANCEMENT Hui Xiong, and YanmingXie. 2016. Data-driven
Automatic Treatment Regimen Developmentand

ISSN:0377-9254 www.jespublication.com Page 221


Journal of Engineering Sciences Vol 13 Issue 12,2022

Recommendation. In Proceedings of the 22nd


ACM SIGKDDInternational Conference on
Knowledge Discovery and Data Mining(KDD
’16). Association for Computing Machinery,
New York, NY,USA, 1865–1874.
DOI:https://doi.org/10.1145/2939672.2939866.

[11] V. Goel, A. K. Gupta and N. Kumar,


”Sentiment Analysis of Multilingual Twitter
Data using Natural Language Processing,”
20188th International Conference on
Communication Systems and Network
Technologies (CSNT), Bhopal, India, 2018, pp.
208-212, doi:10.1109/CSNT.2018.8820254.
[12] Shimada K, Takada H, Mitsuyama S, et al.
Drug-recommendationsystem for patients with
infectious diseases. AMIA Annu Symp
Proc.2005;2005:1112.
[13] Y. Bao and X. Jiang, ”An intelligent
medicine recommender system framework,”
2016 IEEE 11th Conference on Industrial
Electronics and Applications (ICIEA), Hefei,
2016, pp. 1383-1388,
doi:10.1109/ICIEA.2016.7603801.
[14] Zhang, Yin & Zhang, Dafang & Hassan,
Mohammad & Alamri, Atif &Peng, Limei.
(2014). CADRE: Cloud-Assisted Drug
RecommendationService for Online Pharmacies.
Mobile Networks and Applications. 20.348-355.
10.1007/s11036-014-0537-4.
[15] J. Li, H. Xu, X. He, J. Deng and X. Sun,
”Tweet modeling withLSTM recurrent neural
networks for hashtag recommendation,”
2016International Joint Conference on Neural
Networks (IJCNN), Vancouver,BC, 2016, pp.
1570-1577, doi: 10.1109/IJCNN.2016.7727385.
[16] Zhang, Yin & Jin, Rong & Zhou, Zhi-Hua.
(2010). Understandingbag-of-words model: A
statistical framework. International Journal
ofMachine Learning and Cybernetics. 1. 43-52.
10.1007/s13042-010-0001-0.
[17] J. Ramos et al., “Using tf-idf to determine
word relevance in document queries,” in
Proceedings of the first instructional conference
onmachinelearning, vol. 242, pp. 133–142,
Piscataway, NJ, 2003.

ISSN:0377-9254 www.jespublication.com Page 222

You might also like