Professional Documents
Culture Documents
Assignment 1: Case Discussion: Group-Wise Allocation
Assignment 1: Case Discussion: Group-Wise Allocation
This is a group work and all the groups have to solve all the cases and submit the soft copies of
their presentation slides along with R codes by 17th January 2022 (11.00 a.m). The cases have to
be presented on 19th January during class hours.
Group-wise allocation
Groups 2,4,7,5,9 : case 2, case 3
Groups 1,3,6,8 : case 1, case 4
Estimate and interpret a regression equation for Amount Spent using the given data.
In order for a health insurance company to make money, it needs to collect more in yearly
premiums than it spends on medical care to its beneficiaries. Consequently, insurers invest a
great deal of time and money to develop models that accurately forecast medical expenses for
the insured population. The medical_expenses.csv file includes 1,338 examples of beneficiaries
currently enrolled in the insurance plan, with features indicating characteristics of the patient
as well as the total medical expenses charged to the plan for the calendar year. The features
are:
• age: An integer indicating the age of the primary beneficiary (excluding those above 64 years,
as they are generally covered by the government).
• sex: The policy holder's gender: either male or female.
• bmi: The body mass index (BMI), which provides a sense of how over or underweight a person
is relative to their height. BMI is equal to weight (in kilograms) divided by height (in meters)
squared. An ideal BMI is within the range of 18.5 to 24.9.
• children: An integer indicating the number of children/dependents covered by the insurance
plan.
• smoker: A yes or no categorical variable that indicates whether the insured regularly smokes
tobacco.
• region: the beneficiary’s residential area in the US, northeast, southeast, southwest,
northwest.
• charges: Individual medical costs billed by health insurance
A real estate company wants to know the determinants of house sell. They have collected data
from the last few years, with a Sale Status variable, that indicates whether a house was sold slow
or fast. You are asked to conduct an analysis and help the company in predicting Sale Status of a
house.
Dataset: HouseSell.xlsx
Variable Description
• SaleStatus
o SoldFast – House was sold within 6 months
o SoldSlow – House was sold after 6 months