Understanding The Data: Objective

The document discusses predicting medical insurance charges using variables like age, sex, BMI, children, smoker status, and region using a dataset of 1338 observations and 7 variables. The goal is to build an accurate predictive model for insurance companies to determine premium costs. Regression analysis will be used to predict an outcome (charges) using multiple predictor variables from the data. The source of the data is a Kaggle dataset on medical costs.

Original Title

191262_1.docx

Uploaded by

tushar

Understanding the Data

Objective:
The variable ‘charges’ is the outcome we have to predict from the following predictors: age, sex, BMI, children,
smoker and region. Age and BMI are continuous variables; sex, smoker and region are categorical variables.
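
As a rough illustration of how these predictor types might be prepared for a regression model, the sketch below dummy-codes the categorical variables and passes the numeric ones through unchanged. The three records are invented for illustration, the helper name `one_hot_encode` is hypothetical, and treating children as a numeric count is an assumption rather than something stated above.

```python
# Hypothetical records: the values are invented for illustration only.
rows = [
    {"age": 19, "sex": "female", "bmi": 27.9, "children": 0,
     "smoker": "yes", "region": "southwest"},
    {"age": 33, "sex": "male", "bmi": 22.7, "children": 1,
     "smoker": "no", "region": "northwest"},
    {"age": 46, "sex": "female", "bmi": 33.4, "children": 2,
     "smoker": "no", "region": "southeast"},
]

NUMERIC = ["age", "bmi", "children"]      # children as numeric is an assumption
CATEGORICAL = ["sex", "smoker", "region"]


def one_hot_encode(records, numeric, categorical):
    """Return (column_names, matrix) with categorical columns dummy-coded.

    The first level (in alphabetical order) of each categorical variable is
    dropped so the dummy columns are not collinear with an intercept term.
    """
    levels = {c: sorted({r[c] for r in records}) for c in categorical}
    names = list(numeric)
    for c in categorical:
        names += [f"{c}_{lvl}" for lvl in levels[c][1:]]

    matrix = []
    for r in records:
        row = [float(r[c]) for c in numeric]
        for c in categorical:
            row += [1.0 if r[c] == lvl else 0.0 for lvl in levels[c][1:]]
        matrix.append(row)
    return names, matrix


names, X = one_hot_encode(rows, NUMERIC, CATEGORICAL)
print(names)
for row in X:
    print(row)
```

Dropping one level per categorical variable is one common convention; keeping all levels and omitting the intercept would be another.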

Information about the data:

We are working with a dataset of only 1338 observations and 7 variables. The variable of most interest
is charges, which is what we will try to predict.

The data set concerns “Medical cost for Insurance”, and from it we are required to predict the
cost of the premium/charges.

As background: to make a profit, insurance companies must collect more in premiums than they pay out to
insured persons. For this reason, insurers invest a lot of time, effort, and money in building models that
accurately predict health care costs. In this kernel, we will try to build the most accurate model possible
while keeping everything simple.

In regression analysis, a predictive model is fitted to the data and then used to predict an outcome variable
from one or more independent predictor variables. Simple regression predicts an outcome variable from a
single predictor variable, while multiple regression predicts it from several predictor variables.

This predictive model uses a straight line to summarize the data, and the method of least squares is used to
find the line that best fits the data.
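
To make the least-squares idea concrete, here is a minimal sketch of simple regression with a single predictor, using the closed-form slope and intercept that minimise the sum of squared residuals. The ages and charges are invented for illustration and are not taken from the data set; `fit_line` is a hypothetical helper name.

```python
def fit_line(x, y):
    """Ordinary least squares for one predictor: y ~ intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Slope = covariance of x and y divided by the variance of x.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    # The fitted line always passes through the point of means.
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Invented example values (not from the data set).
ages = [20, 30, 40, 50, 60]
charges = [2500.0, 4400.0, 6600.0, 8500.0, 10500.0]

intercept, slope = fit_line(ages, charges)
predicted = intercept + slope * 45  # predicted charge for a 45-year-old

print(f"intercept={intercept:.1f}, slope={slope:.1f}, prediction={predicted:.1f}")
```

Multiple regression applies the same least-squares criterion with several predictors at once, which in practice is usually solved with a linear algebra routine rather than by hand.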

Here we explore a data set on the cost of treatment of different patients. The cost of treatment
depends on many factors: diagnosis, type of clinic, city of residence, age and so on. We have no data on the
patients' diagnoses, but we have other information that can help us draw conclusions about the health
of patients and practice regression analysis.

Source of data:
Our main source of data is ‘Kaggle’, an online community of data scientists and machine learning
practitioners and a subsidiary of Google LLC. Kaggle allows users to find and publish data sets, explore and
build models in a web-based data-science environment, work with other data scientists and machine learning
engineers, and enter competitions to solve data science challenges.
