
S.No  Track           IBM Cloud (Yes/No)  Project Name
1     Data Analytics  Yes                 Hotel Booking Analysis
2     Data Analytics  Yes                 Play Store App Review Analysis
3     Data Analytics  Yes                 Telecom Churn Analysis
4     AI              Yes                 Cardiovascular Risk Prediction
5     AI              Yes                 Credit Card Default Prediction
6     AI              Yes                 Coronavirus Tweet Sentiment Analysis
7     AI              Yes                 Bitcoin Price Prediction
8     AI              Yes                 IMDB Movie Reviews
9     AI              Yes                 E-Commerce Website Logs
10    NLP             Yes                 College Chatbot

Problem Statement

Have you ever wondered when the best time of year to book a hotel room is? Or the optimal length of stay in order to get the
you wanted to predict whether or not a hotel was likely to receive a disproportionately high number of special requests? This
can help you explore those questions! This data set contains booking information for a city hotel and a resort hotel, and includ
when the booking was made, length of stay, the number of adults, children, and/or babies, and the number of available parkin
things. All personally identifying information has been removed from the data. Explore and analyse the data to discover impor
the bookings.
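
As a starting point for the "best time of year" question, a minimal sketch in plain Python is given here. It groups bookings by arrival month and compares the mean nightly rate. The records are invented toy data, and the field names `arrival_month`, `nights`, and `daily_rate` are assumptions for illustration; the actual data set may use different columns.

```python
from collections import defaultdict

# Hypothetical toy records; the real data set's column names may differ.
bookings = [
    {"hotel": "City Hotel", "arrival_month": "January", "nights": 2, "daily_rate": 80.0},
    {"hotel": "City Hotel", "arrival_month": "January", "nights": 3, "daily_rate": 90.0},
    {"hotel": "Resort Hotel", "arrival_month": "August", "nights": 7, "daily_rate": 200.0},
    {"hotel": "Resort Hotel", "arrival_month": "August", "nights": 5, "daily_rate": 180.0},
    {"hotel": "City Hotel", "arrival_month": "May", "nights": 1, "daily_rate": 120.0},
]


def mean_rate_by_month(rows):
    """Return {month: mean daily rate} for the given booking rows."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for row in rows:
        totals[row["arrival_month"]] += row["daily_rate"]
        counts[row["arrival_month"]] += 1
    return {month: totals[month] / counts[month] for month in totals}


rates = mean_rate_by_month(bookings)
# The month with the lowest mean rate in this toy sample.
cheapest_month = min(rates, key=rates.get)
print(rates)
print("Cheapest month in the toy data:", cheapest_month)
```

The same grouping can be repeated per hotel type or per length of stay to explore the other questions in the problem statement.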

The Play Store apps data has enormous potential to drive app-making businesses to success. Actionable insights can be drawn
on and capture the Android market. Each app (row) has values for category, rating, size, and more. Another dataset contains c
Android apps. Explore and analyse the data to discover key factors responsible for app engagement and success.

Orange S.A., formerly France Télécom S.A., is a French multinational telecommunications corporation. The Orange Telecom's C
of cleaned customer activity data (features), along with a churn label specifying whether a customer cancelled the subscriptio
the data to discover key factors responsible for customer churn and come up with ways/recommendations to ensure custome

The dataset is from an ongoing cardiovascular study on residents of the town of Framingham, Massachusetts. The classificatio
whether the patient has a 10-year risk of future coronary heart disease (CHD). The dataset provides the patients’ information.
records and 15 attributes. Each attribute is a potential risk factor. There are demographic, behavioral, and medical risk fa

This project is aimed at predicting customers' default payments in Taiwan. From the perspective of risk managemen

This challenge asks you to build a classification model to predict the sentiment of COVID-19 tweets. The tweets have been pull
manual tagging has been done then. The names and usernames have been given codes to avoid any privacy concerns.

Bitcoin uses the blockchain concept, a peer-to-peer technology, to operate with no central authority or banks; managing tra
issuing of bitcoins is carried out collectively by the network. Bitcoin is open-source; its design is public, nobody owns or contro
can take part.

Movie dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We p
highly polar movie reviews for training and 25,000 for testing. So, predict the number of positive and negative reviews using e
deep learning algorithms.
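
To make the task concrete, a minimal baseline sketch is given here. It is not the deep learning approach the statement asks for; it swaps in a bag-of-words Naive Bayes classifier with Laplace smoothing, written in plain Python, and tallies the predicted positive and negative reviews. The four training reviews and two test reviews are invented toy data, and the function names `fit` and `predict` are illustrative only.

```python
import math
from collections import Counter

# Toy stand-ins for the labelled reviews; the real data set is far larger.
train = [
    ("a wonderful moving film", "pos"),
    ("great acting and a great story", "pos"),
    ("a dull boring mess", "neg"),
    ("terrible plot and awful acting", "neg"),
]
test = ["a great film", "boring and awful"]


def fit(samples):
    """Count words per class and documents per class."""
    word_counts = {"pos": Counter(), "neg": Counter()}
    doc_counts = Counter()
    for text, label in samples:
        word_counts[label].update(text.split())
        doc_counts[label] += 1
    vocab = set(word_counts["pos"]) | set(word_counts["neg"])
    return word_counts, doc_counts, vocab


def predict(text, model):
    """Return the class with the higher log-probability (Laplace smoothing)."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in word_counts:
        total_words = sum(word_counts[label].values())
        score = math.log(doc_counts[label] / total_docs)
        for word in text.split():
            if word in vocab:  # ignore words never seen in training
                smoothed = (word_counts[label][word] + 1) / (total_words + len(vocab))
                score += math.log(smoothed)
        scores[label] = score
    return max(scores, key=scores.get)


model = fit(train)
predictions = [predict(review, model) for review in test]
tally = Counter(predictions)  # number of predicted positive and negative reviews
print(predictions, dict(tally))
```

A baseline like this gives a reference accuracy against which a deep learning model trained on the full data set can be compared.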

E-commerce website logs data created for helping the data analysts to practice exploratory data analysis and data visualizatio
on when the website was accessed, IP address of the source, Country, language in which website was accessed, amount of sal

The current communication and information retrieval systems in our college are outdated and do not effectively cater to the d
academic community. Students, faculty, and staff often face difficulties in accessing essential information, obtaining quick resp
navigating through the students' perspective
Data Set Link









