Professional Documents
Culture Documents
Data Analyst
Contact: +91- 9810560493
Arun4june@gmail.com
https://github.com/hardworkrit
https://www.linkedin.com/in/arun-kumar-521126117
Delhi (NCR)
PROFESSIONAL SUMMARY
As a data science practitioner, I am passionate about using data to solve complex business problems. I possess strong
technical skills in programming languages such as Python, R, SQL, and experience in data visualization tools such as
Tableau and Power BI. Through my academic projects and internship experience, I have developed expertise in
statistical modeling, machine learning, and data mining techniques. I am a quick learner with excellent problem-
solving skills and the ability to communicate complex data insights to stakeholders. As a career objective, I aim to
utilize my technical skills and knowledge to contribute to the growth of a dynamic data science team while
continuously learning and expanding my skill set.
PROFESSIONAL EXPERINCE
SKILLS
Python (Numpy, Pandas, Matplotlib, Seaborn), SQL, Microsoft Excel, Tableau, Power BI, Machine Learning
Communication, Problem Solving, Teamwork and Collaboration, Adaptability, Time Management, Business Acumen
EDUCATION
Professional Data Science and Machine Learning with Python, from “Data is Good”
Data Science Certification program to learn data, Analysis, Exploratory Data Analysis, Data Visualization.
Data Cleaning by different methods and the Machine Learning for model building.
B.TECH from AKTU(U.P)
Calculus and Linear Algebra
Probability and Statistics
Data Structures and Algorithms
Database Systems and SQL
Machine Learning Techniques
CERTIFICATION
PROJECTS
Hotel Booking Cancellation Analysis | Tech Stack – Python, Data Cleaning, EDA and Machine
Learning
Problem Statement: Predicted whether a booking made in a hotel can be cancelled in future or not.
For this, we have developed models that will identify and flag bookings with high cancellation probability
by understanding the trends and features associated with it.
Working: Made predictions using machine learning with the Supervised Learning method for Binary
Classification. There is already a ground truth, or a marker, whether someone cancels their order or not.
Outcomes: Machine learning and matrix help to find out the cancellation possibilities which help to
make better decision.
Medical Expenses Prediction Analysis | Tech Stack – Python, Data Cleaning, EDA and Machine Learning
Problem Statement: Everyone’s life revolves around their health. Good health is essential to all aspects
of our lives. Because of the quick speed of our lives, we are adopting many habits that are harming our
health. When webecome ill, we tend to spend a lot of money, resulting in a lot of medical expenses.
Working: Predict the future medical expenses of subjects based on certain features building a robust
machine learning model.
Outcomes: We came to know that the most important factor to Predict the Medical Expenses of a
subject is smoking behavior and age.
Diabetes Prediction Analysis | Tech Stack – Python, Data Cleaning, EDA and Machine Learning
Problem Statement: Predicting that whether the patient has diabetes or not on the basis of the
features.
Working: Analyze the Diabetes dataset, design, and implement a Diabetes prediction and
recommendation system utilizing machine learning classification techniques.
Outcomes: After using all these patient records, we are able to build a machine learning model random
forest is the best one to accurately predict whether or not the patients in the dataset have diabetes or not