Professional Documents
Culture Documents
GD Topics If you are going to let a person open a savings acc. who already has a home loan, and he invests in share ma
Group 1
Group 2
Group 3
Group 4 Govt has made a scheme where they would deposit money directly into people's bank account like PMJDY - p
Questions A
Rajib Gupta
1 What is your academic background?
2 why jumping from physics to data science?
3 What do you think of a bank?
4 How are you going to give a loan to a person? , what should you check? using which statisti
5 Are you ready to move anywhere in the country ?
6 Did you have any ICICI interviews before?
7 Are you ready to take a role in sales dept.?
Rushikesh Badgujar
Questions on project
1 *How you will identify default customer for giving loan which statistical model/algorithm you will use
2 * Ready to work in profile of data science with sales
3 * Ready to relocate in India anywhere
4 If u have a data with 100 variable and u have to decide whether the customer will b default or not. How will u d
Shivam Babb
Avinash Yada
Hrisav Bhowm
About yourself.
In your past organisation, suppose some huge data is given. What you used to do before starting to work on t
What is accuracy?
Conceptual Questions on Gini
If a dataset is given, which have training and test data, and gini value is given for each data point. Which are u
If in your Logistic Regression model, model has wrongly classified some data, how will you handle them
Are you ready to move anywhere in the country ?
Did you have any ICICI interviews before?
Any questions? Asked regarding analytics depts in ICICI and job position role.
Chithra Nair
Introduction, educational background and family background
How will you scale values 55, 65, 75 to a range of 20 to 40
software that you are using for python an packages numpy,matplotlib,pandas,seborn-overa
histogram
subsetting a dataset
project details
Deepak Sharma
Kaviya U C
Ananda Chatterjee
SHREYANSH
algo of Naive Bayes classifier in detail
Introduction , Family details , Education details
Why I choose data science
Aparup
Introduce yourself
Project
Busness case
Numericals
hypertuning of random forest
if any of family members work in ICICI
SHREYANSH
Questions Asked
Rajib Gupta
heck? using which statistical model you can make an automated system to check if a person is eligible to take loan?
Tanya Mangath
g a table where we have max sales of both the employees for all the months
Raj Kumar T
hikesh Badgujar
m you will use
default or not. How will u do it? how will you manupulate the data?
Shivam Babbar
Avinash Yadav
st column, with Rate given as 8,6, missing , 4 : how would you interpolate the missing value)
Hrisav Bhowmick
ch data point. Which are undergoing underfit and which are overfit?
will you handle them
Jayesh Belsare
Rajarshi Chakraborty
. What will be the primary key and the join condition (Table 1 - S.No, EmpID, Name ; Table 2 - Name, Marks)
able 2 - 2 rows with same name but different marks [name matching with 1 rowof table 1], 2 rows with names not matching with Table 1). As
al range being 1 - 100)
Chithra Nair
tlib,pandas,seborn-overall idea
eepak Sharma
Kaviya U C
a Chatterjee
HREYANSH
SH
Abhirup
salary dataset
1
2
3
4
5
6
7
8
1
2
3
4
5
6
7
8
9
10
11
1
2
3
4
5
6
7
8
9
10
1
2
3
4
5
6
7
8
9
10
1
2
3
4
5
6
7
8
9
9
10
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
1
2
3
4
5
6
7
8
9
10
1
2
3
4
5
6
7
8
9
10
11
12
1
2
3
4
5
6
7
8
9
10
11
Tharanitha
Questio
Tharanitha
Self intro
Some questions on my previous experience
Toics learned till date on our course
what is Linear Regression?
What are the assumptions of Linear Regression?
Difference Between Linear Regression and Decision tree Regression
what is supervised and unsupervised learning?
Finally interviewer mentioned that will get an case study to solve for which will get a week time to solve
RAMANDEEP SHARMA
SHIVANI NEHRA
SHREYA ARORA
ABHILASH JASH
General Introduction - background and work ex.
Why did you choose data science.
Tell us about subjects you have already studied.
Questions from general data wrangling
Questions deviated to data wrangling in projects.
Explain the dataset used.
Explain what attributes you used in model building.
What model did you use in predicting values.
Which model gave the best result.
How to optimize the same model.
What accuracy score you used and why?
Explain categorical variables and it's types
Difference between frequency plot and histogram
How did you identify and remove outliers.
Have you worked in any kind of classification problem?
Told about one not mentioned in CV, questions from that.
Asked for Feedback and everyday experience of the profile of I get selected.
Any kind of forecasting model used? Tell us something about it if any.
Questions on model interpretation, skipped - Not confident.
R2 and Adjusted R2 when to use what, which is better in what situation.
Assumptions of Linear Regression
SIDDHARTH DWIVEDI
General background and introduction
Explain your work experiene,type of activities performed in your project
Now you have enrolled into data science so how can you relate your past experience with this
Confusion matrix
Ask quetion related to clustering
Data science Use case which I can derive from one of my last project in which I have worked
What is Linear regression &given me some equation and asked to find correct linear regression equation
What is SCD type 2
Tell me what subjects you have studied and will study in near future
Why chosen Data Science
SALONEE MAHAPATRA
General background and introduction
Explain your work experiene, type of activities performed in your project
If you client comes to you with a problem regarding O2C department then how would you improve their system, what all analytic
What are the subjects covered in your course? What are the electives that you have chosen/ would choose?
What are you planning for your capstone project?
How comfortable are you with Python, SQL?
You are an analytics team member of an ecommerce web and they run out of bestselling black sweater in the inventory then wh
Which model would you use for the Q7?
Which tool would you use for the Q7?
What all analytical tools do you know other than SQL?
Will you be okay to work in a Power Plant in case the client demands that?
Are you okay to travel for business?
ARPIT MALHOTRA
General background and introduction
Questions related to my project
Precession Recall
ROC curve Interpretation
FPR TPR, interpretation
F1 score inter pretation
Clustering Metrics
Metrics in Artificial Neural Networks
Hyperparameters in Neural Network
My skill set
How will I do Canabelisation modeing of a Retail Store
Tharanitharan Ramajayam
Questions Asked
Tharanitharan Ramajayam
Aditi Bhardwaj
Written Round
some basic python questions on pandas and numpy functions and what will be the output of a code snippet
Interview
First Round:
Questions on projects made
What algorithms I have used in projects ( Random forest, Decision Tree, Linear Regression etc.)
Algorithms working
Why Random forest is better than Decision Tree
Detailed working of Random forest & XGBoost
Then asked me to write codes for following:
1. make a dataframe of 3 columns -> 1 numerical, 2 categorical
2. make on of the columns Index column
3. groupby and find mean of one of the columns
4. make a new column and put 0 incase of value in other numerical column is greater than 10 otherwise 1: Do this using lambda
5. Make a user defined function that accepts a dataframe and find moving average of 10 days and return the value in the datafra
Second Round:
Asked what are the factors in a Time series data
If there is a coffee shop who's not profiting much , how would I go about analysing it,
What's the range of R2 value
Can it be Negative
What value of RMSE is considered good
Overfitting and Underfitting
und
MANSHA DAS
1
2
1
2
3
4
5
6
7
8
9
1
2
3
4
5
6
7
8
9
10
11
12
14
15
1
2
3
4
5
6
7
8
9
10
11
12
Round 1 1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
Round 1 1
Panel 3 2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
Round 1 1
2 Panelist 2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
Round 1 1
2 Panelists 2
3
4
5
6
7
8
9
10
11
12
13
14
round 1 1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
round 2 1
2
3
4
5
8
9
10
round 3 1
2
3
4
Round 1 1
2
3
4
5
6
6
7
Round 2 1
2
3
4
5
6
HR 1
2
3
4
5
Round 1 1
2
3
4
5
6
6
Round 2 1
2
3
4
5
6
HR 1
2
3
4
5
6
7
8
HR Round 1
2
3
4
5
6
7
8
Introduction
Purely non technical interview, focusing just on your communication skills
Rushabh Rumde
Brief Introduction
Walk us through your CV
What was the reason you chose Data Science?
What all Subjects have you learnt?
Python - Dictionary, Tuples, Arrays, OOP
SQL-Where,having,Joins
Tableau - Data Blending, Data Wrangling, Hierarchy
Statistics - DIstributions, Variance.
Univariate, Bivariate and Multivariate Analysis
SHUBHAM PANWAR
Introduction and past experience
what all you have studied till now
why did you join praxis
project related stuff
NLP vs NLU
How would you rate yourself in SQL
What is indexing
foreign key
how would you rate yourself in python
what are decorators in python
What is the docstring command in python
What is TF IDF
SHREYA ARORA
Introduction and previous work experience
There are 2 tables, how will you fetch details of both tables?
What's the difference between Left and Inner Join?
Fetch average salary of all employees segregated by names
How does Partition By work in Window Function?
What's Dictionary in Python?
Can there be similar keys in a dictionary?
Can we add a list as key in dictionary?
What is the difference between a List and Dataframe?
Difference between Series and Dataframe
What's a Normal Distribution?
What's z-score? How to interpret that?
Where does 99.7% of the data lies in a normal distribution?
Correlation v/s Covariance
What happens to mean, mode, median in case of Left and Right skewness?
How do you choose optimum k value in k-Means Clustering?
What do you mean by Accuracy and Error rate? Formulas for both?
Which libraries have you used in ML?
Explain PCA
HARIPRIYA S
Introduction and previous work experience
How would you rate yourself in SQL?
Have you worked on Query Optimization?
Saw Sql in my experience so asked questions on like What is OLTP?
What is Star and Snowflake model
What is Cross Join?
12 rows in left table, 3 rows in right table. how many rows in Left Join?
Another question similar to Q6.
What is a cross join?
What is an Index?
Which ML algorithms are you comfortable with?
Which supervised Learning algorithms do you know?
A situation when you had to deal with conflict in a team and how did u handle it
Suppose you have all the click data of Flipkart, how will you identify potential customers of iPhone
Would you call yourself a team player?
How would you manage people in a team?
ABHILASH JASH
General Introduction
What all subjects been taught.
Classification and Regression difference, explain with examples.
Linear Regression and it's assumptions
Limitations of Linear Regression
Multiple Regression.
Goodness of fit
Project - on Regression
Explain the dataset
What all data cleaning steps did you apply, Explain the EDA you did.
The regressors you used for model, which gave best accuracy and why.
How can you improve performance of multiple regressor
What are decorators in Python
What is a tuple, how is it different from a list.
What is a User defined function, illustrate with an example.
What are the key functions of a function.
Open your Covid19 visualisation project and run it.
Explain the User defined functions you have used step by step.
Model Evaluation in Regression
R-squared and Adjusted R-squared?
How to select beta values for Multiple Regression?
Chithra Nair
general intro and work experience and projects done in the company brief explaination
asked about DB2 , what i exactly learned?-regarding my experience
explain
explain about EY --
one thing brief
you haveoverview
done inofyour
the company
company which was assigned to you only and what were the
challenges you faced there?
do you know pl sql?have you worked in the field?
sql question on general between where and having and asked to program based on that
how to add 2 1-d arrays
explain decision tree--explained gini model , deviance, cart algo
asked difference between gini and entropy in decision tree
what are the different libraries you have used for machine learning
what is matplotlib and seaborn
sql joins what all are there
difference betweenwhatever
clustering-explain left outeryou
andknow?
inner explained kmeans and agglomorative--again asked how you
assign k and concept of centroid? what are the limitations? and explain scree plot.
apllication level question on clustering-where can we use this?
bayesian theorem
association rules mining- how do you do this, explain apriori algorithm and application of association
rules
linear regression and r square, adjusted r square difference
how do you analyze a data? what are the challenges you have faced while analyzing data?
feature selecion--how do you do that?
project details on automation of eda
knn algorithm from scratch -explain the program
how do you work with team, how do you manage pressure?
family details and experience
marks obtained in praxis
how are you planning to settle down?---family details
why ey? Didn't you try PWC?
Aparup Chakraborty
Introduce yourself
What are the projects you have done describe one
Questions related to project,data preparation and models (classfication project)
What evaluation techniques have used in projects and why
Tell top 5 features of you project
How you will use data science to improve your previous job
asked question from SQL joins
Basic python , what is global and what is local variables
Introduce yourself
They asked questions from sales prediction project which was in my CV (big mart sales prediction)
ask about time series componets and algorithm
how logistic regression works
Diffrence between decision tree and random forest
ROC AUC curve
Introduce yourself
family details
previous work experience
what was your contribution to your previous company
extracurricular activities
Ankit Shrivastava
Introduce yourself
Personal Background
Professional Background
Proficiency in SQL, Excel (Conceptual Qs)
Achievements in work ex
Why Data Science
Python Basics
Introduce yourself
Brief about work exp
what subjects studied at praxis
questions on clustering, with details
use case of clustering, application oriented
discussion about kind of role and responsiblity offered by EY
Introduce yourself
family details
previous work experience
what was your contribution to your previous company
extracurricular activities
Hobbies
Have you ever taken responsiblity of any failure in professional experience
Last drawn salary
Rajarshi Chakraborty
Explain your project on K-means Clustering
No. of rows, what were the variables
Why did you use K-means?
How did you use the algorithm? Explain all the steps.
How did you arrive at the ideal value of K and what was it?
What error metric did you use?
What are the business application of this project that you did?
Brief explanation of other projects
Packages you have used in Python
CHAID vs CART (Said haven't been taught CHAID yet)
Explain Random Forest.
Which ML algos do you know?
Explain Apriori algo
What are the business applications of this algo?
Relationship between Support and Lift
Which R packages have you used?
What have you learnt in SQL?
What is inner join?
Difference between inner and left join
Query to find 'Till Date Salary' of employees and employee joining date from two tables (Groupby and inner join)
How would you make this query more efficient? (Explained to me how it can be done using indexing)
What does confusion matrix tell you?
Relationship between Precision and True Positive Rate
How will you increase the accuracy of a model?
What could be the possible reasons for the low accuracy?
Explain how would you approach a problem of data cleaning and then modelling given a dataset. explain all the steps.
Have you done anything on Tableau?
Asked about my scholarship
HR Round
HR Round 1
2
3
4
5
6
7
Round 2 1
Panel 4 0
3
4
5
6
7
8
9
10
11
12
13
14
Round 2 1
3 Panelist 2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
What are the different methods to calculate distance?
Explain Manhattan distance.
Any project where you had to do something other than the things you are comfor
Anirudh Sharma
Introduction
About family
About previous companies which I worked for
HR Round 1
2
3
4
5
6
7
8
9
10
11
12
Tell me about yourself, professional and personal details
About Family
Why Consulting role?
Have you heard of EY before or hearing it for the first time today?
Why havn't you applied to EY before?
Explain a situation where you had to take the blame for failure
What are your hobbies?
Any social work?
Any questions?
Introduce yourself.
Say about your family, what do they do?
What are your hobbies?
Have you demonstrated any leadership characteristics in your career. Give examples if any.
Will you push your colleague into danger for a failed project for your mistake?
If you are in trouble do you run away or face it upfront?
Do you contribute towards social good? Give examples.
Why EY and Consulting?
What do you know about EY? Any feedback from glassdoor or quora?
What was your last drawn CTC?
Any question that you want to ask me?
Any additional remarks you want to make?
1
2
3
4
5
6
7
8
9
10
11
12
1
2
3
4
5
6
8
9
1
2
3
4
5
6
7
8
9
10
11
12
1
2
3
4
5
6
7
8
9
10
11
12
Aditi Bhardwaj
1
2
3
4
5
6
7
8
9
1
2
3
4
5
6
7
8
9
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
1
2
3
4
5
6
7
8
9
10
11
12
13
14
1
2
3
4
5
6
7
8
9
10
11
12
13
14
1
2
3
4
5
6
7
8
9
10
1
2
3
4
5
6
7
8
9
10
11
Round 1 1
2
3
4
5
8
9
10
11
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
1
2
3
4
5
6
7
8
9
10
11
SUBEX
Questions Aske
Rehan Raza
Introduction & walk us through your cv
Projects(questions about libraries used)
Probabibility of coming 4 evrytime if dice rolled 3 times
Permuattion and combination(Apti questions)
Gradient Descent
supervised and unsupervised learning
underfitting and overfitting
Given a dictionary save it as a dataframe (on jupyter )
What is precision?
What is Confusion Matrix?
Categorical and Numerical features
Linear Regression, equation ,explanation, why and how??
Ramandeep Sharma
ARPIT MALHOTRA
Intorduction and work experience
explain your projects
Precsion Recall
ROC curve
Kappa Score
Optimizers
Hyperparameter Tuning in ANN
Activation Fucntions and their adavantages and Disadvantages
Gradient Decent and Stochastic Gradient Decent
Data Structures, queues, decks, cache,
Python program to find sum of elements in list
python program for flattening of dictionary
Chithra Nair
Intorduction and work experience
explain your projects
k means and kNN
t distribution
hypothesis testing in detail and asked application level
outlier
situation based questions on chi square testing
modelling
feature scaling,regression
situation based questions based on project , what will you do?
mean,median which to use
skewness
Aditi Bhardwaj
Work Experience
Poject related questions
types of distributuions
Hyperparamters in Decision classification
ways to handle categorical variables
relationship bet ween mean and median in normal distribution
underfiting and over fitiing
F1 score
decsion tree regression how does it make a prediction
Rushabh Rumde
Brief introduction
Sampling techniques
Project related Questions
Entropy and Information Gain
T-Distribution
Law of Large Numbers
Central Limit Theorem
How does KNN work?
EDA
Haripriya S
Introduce Yourself
Questions on Probability, Permutations
What projects have you worked on? In Praxis and otherwise.
Deeper into ML aspects of the project you worked on.
Types of distributions
Relationship between mean and median in normal distribution
Ways to handle categorical variables
What is underfiting and over fitiing?
What is a confusion matrix?
Why not accuracy instead of the other Scores?
What is Linear Regression?
How does KMeans Clustering work?
How does KNN work and futher questions
How does Decision Tree work?
What is PCA?
Coding: using Dataframes on Jupyter notebook (share our screen)
SHUBHAM PANWAR
General Background and Introduction
Questions related to my project
Precision , Recall
Stemming , Lemmatization, Bag of words
In a list there are multiple elements and only one element does not have duplicate , how will you find it.
Time complexity in the answer to the previous question
Explain any ML algo to us
What are the assumptions of Regression
What is Regularization
what are Hash Maps
Hypothetical situation - Identify if the driver of a car is male / female
How will you identify which features to take and which not in the previous case
How will you identify Conditional depent variables
VIVASWAN JINTURKAR
central limit theorem
deterministic and non deterministic algorithm
parametric and non parametretric
probability of getting addition as 18 if 3 dices are thrown
explain one algorithm you know the best
precision recall f1 score
regularization
seperate elements of nested list
univariate and bivariate analysis of iris dataset
best statistic to use in case of outliers
when does KNN classification fail
is linear regression high bias or high variance model
different distance metrics
how will you choose k in knn
Atul Pandey
General background and introduction
Questions related to my project
Classification Algorithm
Accuracy and Error
FPR TPR, interpretation
F1 score inter pretation
Clustering Metrics
Correlation and its effectiveness for considering relation between Variable
Python coding
My skill set
Anirudh Sharma
Tell me about your self
What has been taught?
Probability of choosing two balls of same colour ball in a bag of 4 balls with 2 each
Probability of choosing king and queen from a deck of cards
CLT
Set theory
Given a list A find uniques values of A that are not in list B
Find number of pairs in a list
K means clustering
Bagging
Difference between t test and z test
Anirudha Nayak
Background Info and Reason for pursuing a course in Analytics
Explain the Projects mentioned in the CV : Objective , Results and Intent
Questions on Basic Probability
Statistics : CLT, Ztest, Ttest
Confusion Matrix, AUC-ROC Curves interpretation
Find number of Pairs in a list ( Python Coding)
Bagging, Boosting, Stacking
Random Forest vs Logistic Regression
Difference between t test and z test
Vinayashree K
About previous work experience
Why Data sciencee?Why praxis and why did you not go for mtech in Data science?
Situation based question. If there are 10,000 people and you have to get the size of tshirt for them what method would you follow
About bootstrap technique
Data science workflow from begining to end for a classification project
Why is knn called lazy learner
beyond what range is a point considered outlier in normal distribution
Probabilistic and determininstic method
Parametric and non parametric
F1 score,accuracy,precsion,recall. How to interpret and when to use which one
What is list comprehension in Python
Which data types are mutable and which are not in Python
How would you choose a classification algorithm ? when will you go for decision tree,when will you go for knn and when logistic
Why do you need validation data? isnt test set data enough ?
Why is knn not used very frequently? when is it not a good idea to use knn?
Different types of null value imputations and which one to use when
What is adjusted R square
Aparup Chakraborty
Introduce yourself
show one of your project (it was a classification project)
evaluation metrics (F1 score,precision recall)
confusion matrix
ROC AUC
how logistic regression work
traing dataset assumptions for linear and logistic regression
Random forest mechanism , bagging concept
Linear regression cost function ,how we optimize it
diffrence between stochastic gradient descent and batch gradient descent
list comprehension in python , he shared a document where I have to write codes
SUBEX
stions Asked
Round 2: 1 Explain one alogorithm
2 Difference between KMeans and HC
3 KNN and how to choose best value of K in KNN
4 How to implement Queue
5 TIme Complexity of Queue
6 Binary Classification
7 Doubling rate question
8 Permutation and Combination
9
Round 2 : 1 Pick one of the algorithm which you mentioned in the Resume and explain everything you k
2 Decision Trees : Pros and Cons , Selection Criteria, Stopping Criteria
3 What other Tree Based Models do you know about : Random Forest, XGBoost , brief follow
4 Quantitaive Puzzles ( CAT level ) : Derangement Problem, Weighing Scale Puzzle, Permuta
5 Experience with Programming and Previous Work Exp
6 Questions on what else have you studied apart from the subjects taught in Praxis
7 Basic Question on Time Complexity, Sorting Algorithm
3
4
5
6
7
8
10
RAMANDEEP SHARMA
Will AI overtake the human creativity
Introduction
Questions on work experience
Types of joins.
SQL query to find Third highest salary
Question on Cartesian join
Explain your projects and question on this only
How will you do a EDA ?some basic steps which you will apply on every dataset
Situation based questions related to your work ex
have joined table A with 10 records and table B with 20 records without any condition then what is the
name of this join and how many records will be there after join.
Aparup Chakraborty
Will AI overtake the human creativity
Introduce yourself
Describe a project you have done (It was a classification project)
why Logistic regression, evaluation metrics (F1,precison,recall)
why you have used multiple prediction models
diffrence between logistic and linear regression
what is log odds
How you handle imbalanced data
how random oversampling works
explain random forest , why it is preferable
Round 2 1 introduction
Technical 2 questions on work experience
Nine identical stones are given. One stone is slightly heavier than the
by vertical head 3 other nine. How can you find the heavier one?(puzzle)
4 Minimum iterations to find out the heaviest stone
5 WAP on this puzzle
6 use case of this puzzle/program in real life(hint: correlation)
by cofounder of LV 3
4
5
6
7
8
10
11
12
Round 3 1
business skill 2
by cofounder of LV 3
4
5
python without use scikit learn ,how you will approach 6
7
Introduction
Any achievements in school and clg
Introduce yourself
tell me about your achievements in school ,college or office
previou experinec and type of work
how you handle a typical client
tell me the ways to manage a team well
in this covid situation how a retail can maintain their BAU
think you are a territory manager of a retail in this covid situation how will you manage resources and customer footfall for stable
Round 4 1 introduction
HR round 2 family information
3 why praxis
4 why LV
5 what were the objectives of the certifications
6 explain your projects
Round 2 Questions
1 Questions based on Experience
2 Why did you choose Data science
3 If given a change to go back to the company, how would the newly acquired knowledge will
4 How would you use ML to measure the KPI of the business
5 Would you use Regression or Time series forecast for that company and why
6 Would you use the same model for all customers
7 Name Customer segmentation algorithms
8 Explain K means clustering
9 How many segments you'd have for that business
10 Code to get Summary of data
11 Binomial vs Poisoon Distribution with example
12 Estimate the sales of an outlet of KFC for a given month
13 If you are working under me, given the data and time of 2-3 days, how would derive insights
14 Which evaluation metric you would use for Linear regression to check if a model is good
15 Explain Rsq value in Layman's term
16 What is adjusted Rsq value
17 How are both different and which one would you use
18 Model used in your project
19 Whether the project is done in a group
20
For a linear regression model we got Rsq value less than 0.05 but the p value suggested it
variable, what could make this happen? (He later explained what they actually did to solve t
Round 3 HR
ld you work in?
and further questions based on your approach towards the case studies
Goutham Kuma
Sr.no. Rounds
1 Round1
2
3
4
5
6
1 Round2
2
3
4
5
6
Roopini Mohancha
1 Round 1
2
3
4
5
6
7
1 Round 2
Goutham Kumar R
Questions
Tell me about yourself
Agile methodology
Then direct coding/eda for 45 mins
The noteboook contained Japanese dataset, with ginza library imported
It had five questions 1. Isolate what kind of errors exist. (which column is the source of errors?)2.What sort of errors occur in ea
He was checking the intution on how we approached the problem statement
Projects realted to ML
PCA in-depth discussionWhat if we don't mean center the datawaht happens if you do x centering alone(centering in 1d alone)W
Collabration with Research Scientist at Nigerian Institute for Oceanography(Gourab nath sir, introduced me to a client and we w
Where do you see yourself in the next 3 or 5 years
In startups like goalist, You will be playing multiple roles, Will you be able to manage these?
What keeps you motivating?
Roopini Mohanchander
Tell me about yourself
About six sigma process
Did basic eda on japanese dataset
These were the questions asked1. Isolate what kind of errors exist. (which column is the source of errors?)2.What sort of errors
What is Word2Vec?
Bag of words?
How will you encode categ. to numerical?
Where do you see yourself after 5 years?
What do you think about working for a Japanese company?
Feedback about first round of interview?
Discussions on first round of interview problem statement?
Questions on project (Automation of EDA)1. How will you handle missing values in your automation library?2. How will you hand
KNN classification from scratch1. Why KNN? Why not any other ML models?2. Why standardization?3. Standardization vs Norm
Suggest another non parametric ML algorithm for the same dataset
ort of errors occur in each column? What features in input address correlate with errors?3.What features would be appropriate for a model to
s?)2.What sort of errors occur in each column? What features in input address correlate with errors?3.What features would be appropriate fo
ry?2. How will you handle outliers?3. Any imputation methods in your library?4. Is it feasible to covert numerical to categorical variable in the
Standardization vs Normalization?4. What is the difference bewteen Train , Test and validation set?5. Purpose of validation set?6. What all V
what if we had missing values for month-wise data(we just had a sample of few months, luckily we didn't have missing values)How will you in
missing values)How will you interploate if you had missing valuesWhat if there are more centers colloecting the data, which sattion would yo
he data, which sattion would you choose! for data collectionIf you are not allowed to use timeseries, can you think of anyother ML algos to in
think of anyother ML algos to interploate the missing dataFew more questions on how I was able to complete the requirements, and the SOP
the requirements, and the SOP doc which I shared with the client.