You are on page 1of 147

ICICI Ba

GD Topics If you are going to let a person open a savings acc. who already has a home loan, and he invests in share ma
Group 1
Group 2
Group 3
Group 4 Govt has made a scheme where they would deposit money directly into people's bank account like PMJDY - p

Questions A
Rajib Gupta
1 What is your academic background?
2 why jumping from physics to data science?
3 What do you think of a bank?
4 How are you going to give a loan to a person? , what should you check? using which statisti
5 Are you ready to move anywhere in the country ?
6 Did you have any ICICI interviews before?
7 Are you ready to take a role in sales dept.?

1 Brief about yourself and your family and academic background


2 Tell us about one of your projects/ Some questions related to your projects
3 What sorts of algorithms you've studied
4 What's a knn algo
5 How do you calculate the distance in a knn algo
6 In data manuplination, I was asked to write a sql query for creating a table where we have m
7 Are you ready to move anywhere in the country ?
8 Did you have any ICICI interviews before?

1 Tell about yourself and family background


2 Assumptions of Linear Regression
3
4 difference between kmeans and hierarchical clustering
5 Supervised learning algorithms of my choice, explained about bagging and boosting
6 Sql, Given two employees sales data(monthly) create a new variable to find the difference o

Rushikesh Badgujar
Questions on project
1 *How you will identify default customer for giving loan which statistical model/algorithm you will use
2 * Ready to work in profile of data science with sales
3 * Ready to relocate in India anywhere
4 If u have a data with 100 variable and u have to decide whether the customer will b default or not. How will u d
Shivam Babb

1 Tell about yourself , academic background and your family


2 During your graduation which 2 topics did you find were most interesting
3 If a person observes a bike passing every 15 mins on an average , find the probability of a bike passing every
4 What is the difference between z test and t test
5 Have your ever applied ML models supervised or unsupervised ?
6 What is the difference between Linear and Logistic Regression ?
7 What is the difference between SRSWR AND SRSWOR
8 What all computer languages have you used , which one do you like the most and why ?
9 Ready to move anywhere in India
10 Any of your family members works in icici bank ?
11 Have you ever been interviewed by icici bank ?
12 What all you do as a part of EDA

Avinash Yada

1 Tell about yourself,academic background?


2 How would go about doing interpolation ( if there's a data with date and rate of interest column, with Rate give
3 what are the assumptions of linear regression
4 what is the metric for multicollinearity
5 Why do you want to join the bank
6 what all programming languages have you studied and what is your favourite so far

Hrisav Bhowm
About yourself.
In your past organisation, suppose some huge data is given. What you used to do before starting to work on t
What is accuracy?
Conceptual Questions on Gini
If a dataset is given, which have training and test data, and gini value is given for each data point. Which are u
If in your Logistic Regression model, model has wrongly classified some data, how will you handle them
Are you ready to move anywhere in the country ?
Did you have any ICICI interviews before?
Any questions? Asked regarding analytics depts in ICICI and job position role.

1 Brief about family background?


2 Why do you want to join banking?
3 Assumptions of logistic regression
4 Build model for loan eligibilty for customer
5 Syntax for join query
6 difference between logistic and linear regression
7 how you handle data with errors
8 Code for logisticregression ?
9 how you choose best classfication algorithm for a model ?
10 Imputation for nominal and ordinal values ?
1 Introduction, educational background and family background
2 SQL Join code - gave two tables and asked how will you join them. What will be the primary
3 Added values to above tables (Table 1 - 3 rows of distinct data ; Table 2 - 2 rows with same
4 How will you scale values 55, 65, 75 to a range of 100-120 )original range being 1 - 100)

Chithra Nair
Introduction, educational background and family background
How will you scale values 55, 65, 75 to a range of 20 to 40
software that you are using for python an packages numpy,matplotlib,pandas,seborn-overa
histogram
subsetting a dataset
project details

Deepak Sharma

Introduction, Background, Past Working Experinence


Given Data Set wtih 2 variable t = 1,2,3,_,5_,7 and Sales = 100,200,300,400, 350,_,400Pre
A score 65 on the scael of 0-100 and B score 28 on the scale 20-40.who scores better.how
What are assumption in Linear Regression
What is normal Distribution?

Kaviya U C

Self Introduction - Background, Past Experiences


Proficient programming languages
When to use Linear regression and Logistic regression
How to evaluate a model
Gini Impurity
AUC
SQL queries - join based on sales volume
Project Details

Ananda Chatterjee
SHREYANSH
algo of Naive Bayes classifier in detail
Introduction , Family details , Education details
Why I choose data science

if willing to relocate or not


if any of family members work in ICICI

Aparup
Introduce yourself
Project
Busness case
Numericals
hypertuning of random forest
if any of family members work in ICICI

SHREYANSH

Introduction ,Family Details , Education details


Why I chose data science
scaling of data 70 % , scored 42 in range 30-50
Similarity between names shreyansh , shreya , Kauntiya (I used Euclidean distance after de
Python data set needed to take max of sales from data frame
Business model needed to find out campaign was effective or not
I am ready to work in sales or not
Ready to Relocate
family member workin in ICICI
I want to ask some questions to interviewers or not
What is apriori algorithm where it is used
Abhirup
Tell us about yourself
Type of model for anomaly detection
Why do we use Test train split
Need of scaling and how it is done
Python to Find average salary of each employees in an employee salary dataset
Types of regression errors
CICI Bank
nd he invests in share market who is gonna be unavailable for 2 months in country.
Rajib gupta

nk account like PMJDY - pros and cons

Questions Asked
Rajib Gupta

heck? using which statistical model you can make an automated system to check if a person is eligible to take loan?

Tanya Mangath

g a table where we have max sales of both the employees for all the months

Raj Kumar T

ging and boosting


ble to find the difference of previous and current month

hikesh Badgujar
m you will use

default or not. How will u do it? how will you manupulate the data?
Shivam Babbar

ty of a bike passing every 30 mins

Avinash Yadav

st column, with Rate given as 8,6, missing , 4 : how would you interpolate the missing value)

Hrisav Bhowmick

efore starting to work on the data?

ch data point. Which are undergoing underfit and which are overfit?
will you handle them

Jayesh Belsare

Rajarshi Chakraborty
. What will be the primary key and the join condition (Table 1 - S.No, EmpID, Name ; Table 2 - Name, Marks)
able 2 - 2 rows with same name but different marks [name matching with 1 rowof table 1], 2 rows with names not matching with Table 1). As
al range being 1 - 100)

Chithra Nair

tlib,pandas,seborn-overall idea

eepak Sharma

0,300,400, 350,_,400Predict the missing value of sales


40.who scores better.how much should B score to surpass score of A

Kaviya U C

a Chatterjee
HREYANSH
SH

uclidean distance after decoding into alphabet order)

Abhirup

salary dataset
1
2
3
4
5
6
7
8

1
2
3
4
5
6
7
8
9
10
11

1
2
3
4
5
6
7
8
9
10

1
2
3
4
5
6
7
8
9
10

1
2
3
4
5
6
7
8
9
9
10

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21

1
2
3
4
5
6
7
8
9
10
1
2
3
4
5
6
7
8
9
10
11
12

1
2
3
4
5
6
7
8
9
10
11
Tharanitha
Questio
Tharanitha
Self intro
Some questions on my previous experience
Toics learned till date on our course
what is Linear Regression?
What are the assumptions of Linear Regression?
Difference Between Linear Regression and Decision tree Regression
what is supervised and unsupervised learning?
Finally interviewer mentioned that will get an case study to solve for which will get a week time to solve

Introduction and work experience


Projects related to machine learnng?
Types of clustering I'm aware of
XGBOOST, RFM,Random Forest
what other companies are you looking forward to getting placed throught this coourse
Any other course or training done related to data analytics
why did you decide to take up this course?
used R in any project?
what visuslization tools you know?
have you used power BI?
Logistic regression

RAMANDEEP SHARMA

Introduction and work experience


Project related questions
Do u know about knn regression
how will you explain knn in layman language
How will you find the value of k in kmeans
Why did u left ur job and took this course?
difference between Supervised and unsupervised learning
When will you use knn regression over linear regression?
have u worked on any project apart from these mentioned projects
do u know about deeplearning framework?

SHIVANI NEHRA

Introduction and work experience


Project related questions
What models have you used in your projects?
Do you know about forecasting, XGboost, Random Forests
Explain previous job related projects and your role in that
have you used power BI?
You have used R in any project?
Why did you take up this course?
What are ways to reduce overfitting in a model?
Ideally a cleaner dataset is given to you, what if the data is not so clean. What would be your approach.

SHREYA ARORA

Intro and Work Experience


Asked about previous company, projects done in previous company
What are your hobbies?
Why did you choose Data Science?
Project related questions
How will you proceed if you get unclean data from your client?
If you've promised the cient that your model will give 90% accuracy but you're unable to achieve that, why is accuracy not apt an
Have you done any project in R?
What have you done in visualization (Related to Tableau project)
What is Game Theory?
Whay do you want to get into consulting?

ABHILASH JASH
General Introduction - background and work ex.
Why did you choose data science.
Tell us about subjects you have already studied.
Questions from general data wrangling
Questions deviated to data wrangling in projects.
Explain the dataset used.
Explain what attributes you used in model building.
What model did you use in predicting values.
Which model gave the best result.
How to optimize the same model.
What accuracy score you used and why?
Explain categorical variables and it's types
Difference between frequency plot and histogram
How did you identify and remove outliers.
Have you worked in any kind of classification problem?
Told about one not mentioned in CV, questions from that.
Asked for Feedback and everyday experience of the profile of I get selected.
Any kind of forecasting model used? Tell us something about it if any.
Questions on model interpretation, skipped - Not confident.
R2 and Adjusted R2 when to use what, which is better in what situation.
Assumptions of Linear Regression

SIDDHARTH DWIVEDI
General background and introduction
Explain your work experiene,type of activities performed in your project
Now you have enrolled into data science so how can you relate your past experience with this
Confusion matrix
Ask quetion related to clustering
Data science Use case which I can derive from one of my last project in which I have worked
What is Linear regression &given me some equation and asked to find correct linear regression equation
What is SCD type 2
Tell me what subjects you have studied and will study in near future
Why chosen Data Science
SALONEE MAHAPATRA
General background and introduction
Explain your work experiene, type of activities performed in your project
If you client comes to you with a problem regarding O2C department then how would you improve their system, what all analytic
What are the subjects covered in your course? What are the electives that you have chosen/ would choose?
What are you planning for your capstone project?
How comfortable are you with Python, SQL?
You are an analytics team member of an ecommerce web and they run out of bestselling black sweater in the inventory then wh
Which model would you use for the Q7?
Which tool would you use for the Q7?
What all analytical tools do you know other than SQL?
Will you be okay to work in a Power Plant in case the client demands that?
Are you okay to travel for business?

ARPIT MALHOTRA
General background and introduction
Questions related to my project
Precession Recall
ROC curve Interpretation
FPR TPR, interpretation
F1 score inter pretation
Clustering Metrics
Metrics in Artificial Neural Networks
Hyperparameters in Neural Network
My skill set
How will I do Canabelisation modeing of a Retail Store
Tharanitharan Ramajayam
Questions Asked
Tharanitharan Ramajayam

Aditi Bhardwaj
Written Round

some basic python questions on pandas and numpy functions and what will be the output of a code snippet

Interview
First Round:
Questions on projects made
What algorithms I have used in projects ( Random forest, Decision Tree, Linear Regression etc.)
Algorithms working
Why Random forest is better than Decision Tree
Detailed working of Random forest & XGBoost
Then asked me to write codes for following:
1. make a dataframe of 3 columns -> 1 numerical, 2 categorical
2. make on of the columns Index column
3. groupby and find mean of one of the columns
4. make a new column and put 0 incase of value in other numerical column is greater than 10 otherwise 1: Do this using lambda
5. Make a user defined function that accepts a dataframe and find moving average of 10 days and return the value in the datafra

Second Round:
Asked what are the factors in a Time series data
If there is a coffee shop who's not profiting much , how would I go about analysing it,
What's the range of R2 value
Can it be Negative
What value of RMSE is considered good
Overfitting and Underfitting
und

herwise 1: Do this using lambda and user defined function


nd return the value in the dataframe
SHUBHAM
1 You have so many certifications in Data Visualisation , why do you think data visualisation is important.
2 Have you used any procurement tools
3 Previous experience based questions
4 Case Study based questions
5 If given a chance would you pick a start up that works in analytics or an old school company like vedanta
6 Do you have any constraints in working from remote locations

1 Previous experience based questions


2 Case Study based questions
3 Questions related to Project
4 Questions related to Digitalization
5 Do you have any constraints in working from remote locations
SHUBHAM PANWAR
isualisation is important.

hool company like vedanta

MANSHA DAS
1
2

1
2
3
4
5
6
7
8
9

1
2
3
4
5
6
7
8
9
10
11
12
14
15

1
2
3
4
5
6
7
8
9
10
11
12
Round 1 1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19

Round 1 1
Panel 3 2
3
4
5
6
7
8
9
10
11
12
13
14
15
16

Round 1 1
2 Panelist 2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
Round 1 1
2 Panelists 2
3
4
5
6
7
8
9
10
11
12
13
14

round 1 1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
round 2 1
2
3
4
5
8
9
10
round 3 1
2
3
4

Round 1 1
2
3
4
5
6
6
7

Round 2 1
2
3
4
5
6

HR 1
2
3
4
5

Round 1 1
2
3
4
5
6
6

Round 2 1
2
3
4
5
6

HR 1
2
3
4
5
6
7
8

Tech Round (Combined Round 1 &1 2 Panelists)


Panel 4 2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28

HR Round 1
2
3
4
5
6
7
8
Introduction
Purely non technical interview, focusing just on your communication skills

Rushabh Rumde
Brief Introduction
Walk us through your CV
What was the reason you chose Data Science?
What all Subjects have you learnt?
Python - Dictionary, Tuples, Arrays, OOP
SQL-Where,having,Joins
Tableau - Data Blending, Data Wrangling, Hierarchy
Statistics - DIstributions, Variance.
Univariate, Bivariate and Multivariate Analysis

Different data structure in Python and explainVinayashree K


each one(Dictionary,tuple,arrays,sets,list),difference
between each
Can you use a list inside a list in python
Can you save a data frame in a dictionary?
Can we add a list as key in dictionary?
Syntax for finding correlation in pandas and syntax for drawing histogram in Matplotlib
What is a residual in linear regression and how do you calculate
What is quantile-quantile
Walk us through your work plot in linear regression
experience and kind of work you did. Have you worked with sql in your
previous organization
Can you plot a correlation between a categorical variable and a continuous variable
What is ROC curve?
What is a density plot?
What does are under the curve tells in ROC curve
Will you be open to learn new technology?
Is 1 KG Iron heavier or 1 Kg cotton ?

SHUBHAM PANWAR
Introduction and past experience
what all you have studied till now
why did you join praxis
project related stuff
NLP vs NLU
How would you rate yourself in SQL
What is indexing
foreign key
how would you rate yourself in python
what are decorators in python
What is the docstring command in python
What is TF IDF
SHREYA ARORA
Introduction and previous work experience
There are 2 tables, how will you fetch details of both tables?
What's the difference between Left and Inner Join?
Fetch average salary of all employees segregated by names
How does Partition By work in Window Function?
What's Dictionary in Python?
Can there be similar keys in a dictionary?
Can we add a list as key in dictionary?
What is the difference between a List and Dataframe?
Difference between Series and Dataframe
What's a Normal Distribution?
What's z-score? How to interpret that?
Where does 99.7% of the data lies in a normal distribution?
Correlation v/s Covariance
What happens to mean, mode, median in case of Left and Right skewness?
How do you choose optimum k value in k-Means Clustering?
What do you mean by Accuracy and Error rate? Formulas for both?
Which libraries have you used in ML?
Explain PCA

HARIPRIYA S
Introduction and previous work experience
How would you rate yourself in SQL?
Have you worked on Query Optimization?
Saw Sql in my experience so asked questions on like What is OLTP?
What is Star and Snowflake model
What is Cross Join?
12 rows in left table, 3 rows in right table. how many rows in Left Join?
Another question similar to Q6.
What is a cross join?
What is an Index?
Which ML algorithms are you comfortable with?
Which supervised Learning algorithms do you know?
A situation when you had to deal with conflict in a team and how did u handle it
Suppose you have all the click data of Flipkart, how will you identify potential customers of iPhone
Would you call yourself a team player?
How would you manage people in a team?

ABHILASH JASH
General Introduction
What all subjects been taught.
Classification and Regression difference, explain with examples.
Linear Regression and it's assumptions
Limitations of Linear Regression
Multiple Regression.
Goodness of fit
Project - on Regression
Explain the dataset
What all data cleaning steps did you apply, Explain the EDA you did.
The regressors you used for model, which gave best accuracy and why.
How can you improve performance of multiple regressor
What are decorators in Python
What is a tuple, how is it different from a list.
What is a User defined function, illustrate with an example.
What are the key functions of a function.
Open your Covid19 visualisation project and run it.
Explain the User defined functions you have used step by step.
Model Evaluation in Regression
R-squared and Adjusted R-squared?
How to select beta values for Multiple Regression?

Tell us about yourself


Types of Joins in SQL, Explain each. What is inner join.
Write a sql querrry to give separate results for odd and even intergers in a column
What is skewness and kurtosis
what does a flat distribution indicates
Explain the process of EDA
What % of Null values in a dataset can be dropped without affecting the dataset.
What are measures and dimensions in Tableau
Sahil Nanda
Introduction and work experience related
What all factors we will consider while importing raw material for steel manufacturing plant
For the above business problem asked to do forecasting using macroeconomic variables
Questions on Python: Tuples, set, functions, loops
Python Libraries
Importing files in Python and R
SQL Joins
ML project and different types of algos
PCA
Difference between gaussian and uniform distribution
Density curve
Correlation and correlation coefficient range
What is skewness
Largest data that i have worked on and if i used any database for that

Chithra Nair
general intro and work experience and projects done in the company brief explaination
asked about DB2 , what i exactly learned?-regarding my experience
explain
explain about EY --
one thing brief
you haveoverview
done inofyour
the company
company which was assigned to you only and what were the
challenges you faced there?
do you know pl sql?have you worked in the field?
sql question on general between where and having and asked to program based on that
how to add 2 1-d arrays
explain decision tree--explained gini model , deviance, cart algo
asked difference between gini and entropy in decision tree
what are the different libraries you have used for machine learning
what is matplotlib and seaborn
sql joins what all are there
difference betweenwhatever
clustering-explain left outeryou
andknow?
inner explained kmeans and agglomorative--again asked how you
assign k and concept of centroid? what are the limitations? and explain scree plot.
apllication level question on clustering-where can we use this?
bayesian theorem
association rules mining- how do you do this, explain apriori algorithm and application of association
rules
linear regression and r square, adjusted r square difference
how do you analyze a data? what are the challenges you have faced while analyzing data?
feature selecion--how do you do that?
project details on automation of eda
knn algorithm from scratch -explain the program
how do you work with team, how do you manage pressure?
family details and experience
marks obtained in praxis
how are you planning to settle down?---family details
why ey? Didn't you try PWC?

Aparup Chakraborty

Introduce yourself
What are the projects you have done describe one
Questions related to project,data preparation and models (classfication project)
What evaluation techniques have used in projects and why
Tell top 5 features of you project
How you will use data science to improve your previous job
asked question from SQL joins
Basic python , what is global and what is local variables

Introduce yourself
They asked questions from sales prediction project which was in my CV (big mart sales prediction)
ask about time series componets and algorithm
how logistic regression works
Diffrence between decision tree and random forest
ROC AUC curve

Introduce yourself
family details
previous work experience
what was your contribution to your previous company
extracurricular activities

Ankit Shrivastava
Introduce yourself
Personal Background
Professional Background
Proficiency in SQL, Excel (Conceptual Qs)
Achievements in work ex
Why Data Science
Python Basics

Introduce yourself
Brief about work exp
what subjects studied at praxis
questions on clustering, with details
use case of clustering, application oriented
discussion about kind of role and responsiblity offered by EY

Introduce yourself
family details
previous work experience
what was your contribution to your previous company
extracurricular activities
Hobbies
Have you ever taken responsiblity of any failure in professional experience
Last drawn salary

Rajarshi Chakraborty
Explain your project on K-means Clustering
No. of rows, what were the variables
Why did you use K-means?
How did you use the algorithm? Explain all the steps.
How did you arrive at the ideal value of K and what was it?
What error metric did you use?
What are the business application of this project that you did?
Brief explanation of other projects
Packages you have used in Python
CHAID vs CART (Said haven't been taught CHAID yet)
Explain Random Forest.
Which ML algos do you know?
Explain Apriori algo
What are the business applications of this algo?
Relationship between Support and Lift
Which R packages have you used?
What have you learnt in SQL?
What is inner join?
Difference between inner and left join
Query to find 'Till Date Salary' of employees and employee joining date from two tables (Groupby and inner join)
How would you make this query more efficient? (Explained to me how it can be done using indexing)
What does confusion matrix tell you?
Relationship between Precision and True Positive Rate
How will you increase the accuracy of a model?
What could be the possible reasons for the low accuracy?
Explain how would you approach a problem of data cleaning and then modelling given a dataset. explain all the steps.
Have you done anything on Tableau?
Asked about my scholarship

Introduce yourself, background, family


Why EY? Why consultancy?
Hobbies
Any social work you do?
Do you like Kolkata more or Jaipur?
Are you a team player or do you like working alone?
Have you find out about EY's values, its work culture from any seniors, alumni?
Questions you want tko ask us
Anirudh Sharma

HR Round
HR Round 1
2
3
4
5
6
7

Round 2 1
Panel 4 0
3
4
5
6
7
8
9
10
11
12
13
14

Round 2 1
3 Panelist 2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
What are the different methods to calculate distance?
Explain Manhattan distance.
Any project where you had to do something other than the things you are comfor
Anirudh Sharma

Introduction
About family
About previous companies which I worked for

What are your hobbies


Do you like working in teams or alone?
Are you part of any social work?
Any questions to me?
Did you ever take blame for failure and what measures did you take to make sure it did not happen again?
Last drawn CTC
Introduction
Family background
Why do you want to get into EY?
What feedback have you received about EY?
What are your hobbies?
What will keep you motivated to work for a company?
If given a chance, will you go back to your previous company? Why/Why not?

Walk through the CV


How would you rate yourself in SQL?
Given three tables, Region, Sales, Product what are the required features to aggregate and find region wise sales for a product
What other variable is required to validate the data?
In SQL is Time a fact or a dimension?
If selected what would you want to proceed with in EY Data Wrangling or Analytics/ML?
Details about ML Project on unsupervised learning
What does Kmeans clustering mean?
How will u decide the k value in Kmeans clustering?
What are the assumptions of llinear regression?
What is the least squares method?
How will u visualise the x1 x2 x3 in a linear regression. What does it mean?
Do you know a regressor where one variable is considered while others are masked/hidden?
What is the difference between linear regression and logistic regression?

General Introduction about yourself


Why EY from a Mechanical Engineering domain
What inspired you towards data science, briefly say about your journey.
What all subjects have you studied.
If you were a teacher then what subjects would you like to teach?
Asked to reframe question - Which all subjects would you like to teach as a teacher?
Said - Engineering Mechanics, Machine Learning, Statistics, Mathematics and Associated Programming
What is a random variable
Random variable and Probability distribution for coin tossing and Cube throwing
What is Normal Distribution?
What is Poisson Distribution?
What is Uniform Distribution?
What makes normal distribution normal?
Where do we use Poisson Distribution and how is it different from normal distribution
What is Binomial Distribution, illustrate with an example of the same.
What are Eigen Values?
Physical significance of Eigen Values and Eigen Vectors.
What is Eigen Value Decomposition?
Where do we use the above decomposition?
Business problem on Selling of an I-Phone for an E-commerce Platform - How to go about the problem.
What kind of data to collect and what attributes to collect.
What model to use to forecast demands of I-Phone.
If you have a client who's project requires one unknown skill of yours and time is 2 weeks how will you manage?
Is it easier to say not possible or trying out alternative. And what alternative?
Are you a team player? State instances of team playing in your life.
If you are given to manage a team, how will you segment them based on skills? (Can use assumptions as per convenience)
Do you know something about NLP, state briefly. (Gave a very genealised answer)
What is Regularization?
What is Ridge and Lasso regression. How are they different. What are there properties?
Explain Bias and Variance of Decision Tree and Linear Regression. How to tweak them?
Name some classification algorithms.
What is K value of KNR ?
Questions on Projects were on same line with round 1 - almost all requestioned.
Do you have something else to say us?

Tell us about yourself - brief intro


Have you done any group projects in college? Did u face any clashes in the groups.
What is the one thing that people would say as your weakness
what is linear regression
Bayes Theorem
Activities other than studies.
What are your hobbies
What is kNN . How tha value of k is imputed?
How is distance calculated?
What are the different methods to calculate distance?
Explain Manhattan distance.
Any project where you had to do something other than the things you are comfortable in? how did you manage?
Round 3 1
HR Round 2
3
4
5
6
7
8
9

HR Round 1
2
3
4
5
6
7
8
9
10
11
12
Tell me about yourself, professional and personal details
About Family
Why Consulting role?
Have you heard of EY before or hearing it for the first time today?
Why havn't you applied to EY before?
Explain a situation where you had to take the blame for failure
What are your hobbies?
Any social work?
Any questions?

Introduce yourself.
Say about your family, what do they do?
What are your hobbies?
Have you demonstrated any leadership characteristics in your career. Give examples if any.
Will you push your colleague into danger for a failed project for your mistake?
If you are in trouble do you run away or face it upfront?
Do you contribute towards social good? Give examples.
Why EY and Consulting?
What do you know about EY? Any feedback from glassdoor or quora?
What was your last drawn CTC?
Any question that you want to ask me?
Any additional remarks you want to make?
1
2
3
4
5
6
7
8
9
10
11
12

1
2
3
4
5
6
8
9

1
2
3
4
5
6
7
8
9
10
11
12

1
2
3
4
5
6
7
8
9
10
11
12

Aditi Bhardwaj
1
2
3
4
5
6
7
8
9

1
2
3
4
5
6
7
8
9

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16

1
2
3
4
5
6
7
8
9
10
11
12
13
14

1
2
3
4
5
6
7
8
9
10
11
12
13
14

1
2
3
4
5
6
7
8
9
10

1
2
3
4
5
6
7
8
9
10
11
Round 1 1
2
3
4
5
8
9
10
11

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17

1
2
3
4
5
6
7
8
9
10
11
SUBEX
Questions Aske
Rehan Raza
Introduction & walk us through your cv
Projects(questions about libraries used)
Probabibility of coming 4 evrytime if dice rolled 3 times
Permuattion and combination(Apti questions)
Gradient Descent
supervised and unsupervised learning
underfitting and overfitting
Given a dictionary save it as a dataframe (on jupyter )
What is precision?
What is Confusion Matrix?
Categorical and Numerical features
Linear Regression, equation ,explanation, why and how??

Ramandeep Sharma

Intorduction and work experience


explain your projects
KNN algo and run it by using dummy variable
types of distances with example
Python program to find sum of elements in list
python program for flattening of dictionary
python program for implement queue
code the put and get functions of queue

ARPIT MALHOTRA
Intorduction and work experience
explain your projects
Precsion Recall
ROC curve
Kappa Score
Optimizers
Hyperparameter Tuning in ANN
Activation Fucntions and their adavantages and Disadvantages
Gradient Decent and Stochastic Gradient Decent
Data Structures, queues, decks, cache,
Python program to find sum of elements in list
python program for flattening of dictionary

Chithra Nair
Intorduction and work experience
explain your projects
k means and kNN
t distribution
hypothesis testing in detail and asked application level
outlier
situation based questions on chi square testing
modelling
feature scaling,regression
situation based questions based on project , what will you do?
mean,median which to use
skewness

Aditi Bhardwaj
Work Experience
Poject related questions
types of distributuions
Hyperparamters in Decision classification
ways to handle categorical variables
relationship bet ween mean and median in normal distribution
underfiting and over fitiing
F1 score
decsion tree regression how does it make a prediction

Rushabh Rumde
Brief introduction
Sampling techniques
Project related Questions
Entropy and Information Gain
T-Distribution
Law of Large Numbers
Central Limit Theorem
How does KNN work?
EDA

Haripriya S
Introduce Yourself
Questions on Probability, Permutations
What projects have you worked on? In Praxis and otherwise.
Deeper into ML aspects of the project you worked on.
Types of distributions
Relationship between mean and median in normal distribution
Ways to handle categorical variables
What is underfiting and over fitiing?
What is a confusion matrix?
Why not accuracy instead of the other Scores?
What is Linear Regression?
How does KMeans Clustering work?
How does KNN work and futher questions
How does Decision Tree work?
What is PCA?
Coding: using Dataframes on Jupyter notebook (share our screen)

SHUBHAM PANWAR
General Background and Introduction
Questions related to my project
Precision , Recall
Stemming , Lemmatization, Bag of words
In a list there are multiple elements and only one element does not have duplicate , how will you find it.
Time complexity in the answer to the previous question
Explain any ML algo to us
What are the assumptions of Regression
What is Regularization
what are Hash Maps
Hypothetical situation - Identify if the driver of a car is male / female
How will you identify which features to take and which not in the previous case
How will you identify Conditional depent variables

VIVASWAN JINTURKAR
central limit theorem
deterministic and non deterministic algorithm
parametric and non parametretric
probability of getting addition as 18 if 3 dices are thrown
explain one algorithm you know the best
precision recall f1 score
regularization
seperate elements of nested list
univariate and bivariate analysis of iris dataset
best statistic to use in case of outliers
when does KNN classification fail
is linear regression high bias or high variance model
different distance metrics
how will you choose k in knn

Atul Pandey
General background and introduction
Questions related to my project
Classification Algorithm
Accuracy and Error
FPR TPR, interpretation
F1 score inter pretation
Clustering Metrics
Correlation and its effectiveness for considering relation between Variable
Python coding
My skill set

Anirudh Sharma
Tell me about your self
What has been taught?
Probability of choosing two balls of same colour ball in a bag of 4 balls with 2 each
Probability of choosing king and queen from a deck of cards
CLT
Set theory
Given a list A find uniques values of A that are not in list B
Find number of pairs in a list
K means clustering
Bagging
Difference between t test and z test
Anirudha Nayak
Background Info and Reason for pursuing a course in Analytics
Explain the Projects mentioned in the CV : Objective , Results and Intent
Questions on Basic Probability
Statistics : CLT, Ztest, Ttest
Confusion Matrix, AUC-ROC Curves interpretation
Find number of Pairs in a list ( Python Coding)
Bagging, Boosting, Stacking
Random Forest vs Logistic Regression
Difference between t test and z test

Vinayashree K
About previous work experience
Why Data sciencee?Why praxis and why did you not go for mtech in Data science?
Situation based question. If there are 10,000 people and you have to get the size of tshirt for them what method would you follow
About bootstrap technique
Data science workflow from begining to end for a classification project
Why is knn called lazy learner
beyond what range is a point considered outlier in normal distribution
Probabilistic and determininstic method
Parametric and non parametric
F1 score,accuracy,precsion,recall. How to interpret and when to use which one
What is list comprehension in Python
Which data types are mutable and which are not in Python
How would you choose a classification algorithm ? when will you go for decision tree,when will you go for knn and when logistic
Why do you need validation data? isnt test set data enough ?
Why is knn not used very frequently? when is it not a good idea to use knn?
Different types of null value imputations and which one to use when
What is adjusted R square
Aparup Chakraborty
Introduce yourself
show one of your project (it was a classification project)
evaluation metrics (F1 score,precision recall)
confusion matrix
ROC AUC
how logistic regression work
traing dataset assumptions for linear and logistic regression
Random forest mechanism , bagging concept
Linear regression cost function ,how we optimize it
diffrence between stochastic gradient descent and batch gradient descent
list comprehension in python , he shared a document where I have to write codes
SUBEX
stions Asked
Round 2: 1 Explain one alogorithm
2 Difference between KMeans and HC
3 KNN and how to choose best value of K in KNN
4 How to implement Queue
5 TIme Complexity of Queue
6 Binary Classification
7 Doubling rate question
8 Permutation and Combination
9
Round 2 : 1 Pick one of the algorithm which you mentioned in the Resume and explain everything you k
2 Decision Trees : Pros and Cons , Selection Criteria, Stopping Criteria
3 What other Tree Based Models do you know about : Random Forest, XGBoost , brief follow
4 Quantitaive Puzzles ( CAT level ) : Derangement Problem, Weighing Scale Puzzle, Permuta
5 Experience with Programming and Previous Work Exp
6 Questions on what else have you studied apart from the subjects taught in Praxis
7 Basic Question on Time Complexity, Sorting Algorithm

t method would you follow?

or knn and when logistic regression ?How will you decide?


GD Topic: Will AI overtake the human creativity
1
2
3
4
5
6
7
8
9
10
11
12

GD Topic: Will AI overtake the human creativity


Round 1 1
Technical 2

3
4
5
6

7
8

10

GD Topic: Will AI overtake the human creativity


Round 1 1
2
3
4
5
6
7
8
9
SHREYA ARORA
Will AI overtake the human creativity
Introduction and previous work experience
Asked to write SQL queries related to Joins, Sub queries
Difference between Rank, Dense Rank and Row Number
Fetch output of Inner Join using Left Join
Fetch sum of salary of employees partitioned by Year
Explain any one project
If you had null values how will you deal with that?
How did you analyze each variable? Explain in detail
For any variable, if there are any values which are not in the expected range then what do we call it?
How do you treat the Outliers?
Why do you think mean imputation should be done for missing value treatment?
Guesstimate: Guess the number of 4 wheelers at this time in Mumbai

RAMANDEEP SHARMA
Will AI overtake the human creativity
Introduction
Questions on work experience

Types of joins.
SQL query to find Third highest salary
Question on Cartesian join
Explain your projects and question on this only

How will you do a EDA ?some basic steps which you will apply on every dataset
Situation based questions related to your work ex
have joined table A with 10 records and table B with 20 records without any condition then what is the
name of this join and how many records will be there after join.

diff between tuple and list.

Aparup Chakraborty
Will AI overtake the human creativity
Introduce yourself
Describe a project you have done (It was a classification project)
why Logistic regression, evaluation metrics (F1,precison,recall)
why you have used multiple prediction models
diffrence between logistic and linear regression
what is log odds
How you handle imbalanced data
how random oversampling works
explain random forest , why it is preferable
Round 2 1 introduction
Technical 2 questions on work experience
Nine identical stones are given. One stone is slightly heavier than the
by vertical head 3 other nine. How can you find the heavier one?(puzzle)
4 Minimum iterations to find out the heaviest stone
5 WAP on this puzzle
6 use case of this puzzle/program in real life(hint: correlation)

7 question on cross join


8 location based any constraints???

9 situation based questions related to your work ex


some he questions: if assigned to some technology will you be
10 able to do work on that technology.
How well you handle the pressure situation. give some example
11 from your past work experience.

Round 2 1 SQl join , cross join


2 list comprehesion
3 gave some puzzels to solve
4 what you did in your previous job
5 how you can apply your data science skills to improve your prevoius job
6 if I tell you to write a random forest classifier model in python without use scikit learn
Round 3 1
(business aspect) 2

by cofounder of LV 3
4
5
6

7
8

10

11
12

Round 3 1
business skill 2
by cofounder of LV 3
4
5
python without use scikit learn ,how you will approach 6
7
Introduction
Any achievements in school and clg

Many questions on my interest


Situation based question related to finance
Two envelopes having x and 2x value, will u swap after opening your choice?
Explain reason for the above question
Business skills related questions.Examples on you being a leader,team
player,some out of box ideas which helped your team and company.
Anything we want to improve in last organisation based on the current study

questions on stock market, mutual funds, policies


Asked about the latest project given by finance sir and if you studied this company
at the time of IPO would u have invested in the company?
How do you analyse your portfolio. what are the main factors which you see when
analysing it.
Is your strategy different for different sector while selecting a stock .

Introduce yourself
tell me about your achievements in school ,college or office
previou experinec and type of work
how you handle a typical client
tell me the ways to manage a team well
in this covid situation how a retail can maintain their BAU
think you are a territory manager of a retail in this covid situation how will you manage resources and customer footfall for stable
Round 4 1 introduction
HR round 2 family information

3 why praxis
4 why LV
5 what were the objectives of the certifications
6 explain your projects

7 avg hrs worked in a week in last organisation


8 who are the clients of LV

ources and customer footfall for stable ROI


Anirudh Sharma

Sr No. Round Questions


1 Round 1 CLT Statistic
2 Type 1 Type 2 error
3 Expaln Random Forrest
4 What algo have you written from scratch
5 What is Normal Distribution
6 Library used for Logistic Regression
7 SQL query based on like operator
8 Given the chance which part of Data Science Life Cycle would you work in?
9 bagging Vs boosting
10 Explain quartiles

Round 2 Questions
1 Questions based on Experience
2 Why did you choose Data science
3 If given a change to go back to the company, how would the newly acquired knowledge will
4 How would you use ML to measure the KPI of the business
5 Would you use Regression or Time series forecast for that company and why
6 Would you use the same model for all customers
7 Name Customer segmentation algorithms
8 Explain K means clustering
9 How many segments you'd have for that business
10 Code to get Summary of data
11 Binomial vs Poisoon Distribution with example
12 Estimate the sales of an outlet of KFC for a given month
13 If you are working under me, given the data and time of 2-3 days, how would derive insights
14 Which evaluation metric you would use for Linear regression to check if a model is good
15 Explain Rsq value in Layman's term
16 What is adjusted Rsq value
17 How are both different and which one would you use
18 Model used in your project
19 Whether the project is done in a group
20
For a linear regression model we got Rsq value less than 0.05 but the p value suggested it
variable, what could make this happen? (He later explained what they actually did to solve t

Round 3 HR
ld you work in?

newly acquired knowledge will help

ompany and why

days, how would derive insights from it


n to check if a model is good

05 but the p value suggested it was significant


what they actually did to solve this)
sr no. round 1 questions
1 tell me about yourself
Why datascience
what are the algorithms you have learnt till now ?
how decision tree splits, what is the importance of entropy
case study based on ML , a cse study was given and which algorithm i will use
what is OLS
how will you find out which features are significant or not
how knn regresssion works , full algorithm explanation
what are the projets you did and why have you used those algorithms
python list comprehension questions , lambda questions
sql groupby question
what is the difference of decisoion tree and regression tree and how the algothms are differ
what re the assumtions of linear regression and logistic regression
how will you check differet assumptions of LR
chi square test and where it been used
Annova questio based on case study
why scaling is important and in which algorithms scaling is necessary
how error normalisation of LR is done ? , which test you will do to check error normalisation
which is your favourite ML algo, describe and how will you use it in future any projects
why entropy is used and where you will use gini index
if your independent variables are continuous how many splits will happen or will take place
how to prune decision tree
a lil brief of Random forest as decsion trees are being used in RF
total 3 case studies were being asked on different ML algo, and further questions based on

2 what are the assumption s of linear regression


properties of a normal distribution
difference between list and tuple
what is the max entropy value in decision tree with 2 classes and 3 classes
explain knn clustering
how do you select the optimal number of clusters
explain left join with syntax
what do you know about normalisation? what are the different types of normalization
what is star and snowflake schema
and how the algothms are different , how splitting is been done in regression tree

and further questions based on your approach towards the case studies
Goutham Kuma
Sr.no. Rounds
1 Round1
2
3
4
5
6
1 Round2
2
3
4
5
6

Roopini Mohancha
1 Round 1
2
3
4
5
6
7
1 Round 2
Goutham Kumar R
Questions
Tell me about yourself
Agile methodology
Then direct coding/eda for 45 mins
The noteboook contained Japanese dataset, with ginza library imported
It had five questions 1. Isolate what kind of errors exist. (which column is the source of errors?)2.What sort of errors occur in ea
He was checking the intution on how we approached the problem statement
Projects realted to ML
PCA in-depth discussionWhat if we don't mean center the datawaht happens if you do x centering alone(centering in 1d alone)W
Collabration with Research Scientist at Nigerian Institute for Oceanography(Gourab nath sir, introduced me to a client and we w
Where do you see yourself in the next 3 or 5 years
In startups like goalist, You will be playing multiple roles, Will you be able to manage these?
What keeps you motivating?

Roopini Mohanchander
Tell me about yourself
About six sigma process
Did basic eda on japanese dataset
These were the questions asked1. Isolate what kind of errors exist. (which column is the source of errors?)2.What sort of errors
What is Word2Vec?
Bag of words?
How will you encode categ. to numerical?
Where do you see yourself after 5 years?
What do you think about working for a Japanese company?
Feedback about first round of interview?
Discussions on first round of interview problem statement?
Questions on project (Automation of EDA)1. How will you handle missing values in your automation library?2. How will you hand
KNN classification from scratch1. Why KNN? Why not any other ML models?2. Why standardization?3. Standardization vs Norm
Suggest another non parametric ML algorithm for the same dataset
ort of errors occur in each column? What features in input address correlate with errors?3.What features would be appropriate for a model to

(centering in 1d alone)Why co-variance matrixwhy eigen-values/vectors


me to a client and we were working on his requirements)NASA was collecting the data (.NC file)What we didwhat if we had missing values fo

s?)2.What sort of errors occur in each column? What features in input address correlate with errors?3.What features would be appropriate fo

ry?2. How will you handle outliers?3. Any imputation methods in your library?4. Is it feasible to covert numerical to categorical variable in the
Standardization vs Normalization?4. What is the difference bewteen Train , Test and validation set?5. Purpose of validation set?6. What all V
what if we had missing values for month-wise data(we just had a sample of few months, luckily we didn't have missing values)How will you in
missing values)How will you interploate if you had missing valuesWhat if there are more centers colloecting the data, which sattion would yo
he data, which sattion would you choose! for data collectionIf you are not allowed to use timeseries, can you think of anyother ML algos to in
think of anyother ML algos to interploate the missing dataFew more questions on how I was able to complete the requirements, and the SOP
the requirements, and the SOP doc which I shared with the client.

You might also like