Genpact
1. What makes you feel that you would be suitable for this role, since you come from a different
background?
2. What is an imbalanced data set??
3. What are the factors you will consider in order to predict the population of a city in the future?
4. Basic statistics questions?
5. What are the approaches for treating the missing values?
6. Evaluation metrics for Classification?
7. Bagging vs Boosting with examples
8. Handling of imbalanced datasets
9. What are your career aspirations?
10.What's the graph of y = |x|-2
11. esstimate on no. Of petrol cars in Delhi
12.Case study on opening a retail store
13.Order of execution of SQL
Company: Accenture
1. What is difference between K-NN and K-Means clustering?
2. How to handle missing data? What imputation techniques can be used?
3. Explain topic modelling in NLP and various methods in performing topic modeling.
4. Explain how you would find and tackle an outlier in the dataset.
5. Follow up: What about inlier?
6. Explain back propagation in few words and its variants?
7. Is interpretability important for machine learning model? If so, ways to achieve interpretability for
a machine learning models?
8. Is interpretability important for machine learning model? If so, ways to achieve interpretability for
a machine learning models?
9. How would you design a data science pipeline?
10. Explain bias - variance trade off. How does this affect the model?
11. What does a statistical test do?
12. How to determine if a coin is biased? Hint: Hypothesis testing
Company: Ericsson
1. How to reverse a linked list
2. Give a logistic regression model in production, how would you find out the coefficients of different
input features.
3. What is the p- value in OLS regression
4. What's the reason for high bias or variance
5. Which models are generally high biased or high variance
6. Write code to find the 8 highest value in the DataFrame
7. What's difference between array and list
8. Whats the difference between Gradient boosting and Xgboost
9. Is XOR data linearly separable
10. How do we classify XOR data using logistic regression
11. Some questions from my previous projects
12. Given a sand timer of 4 and 7 mins how would you calculate 10 mins duration.
13. What's the angle between hour and minute hand in clock as 3:15
Company: FISERVE
1. How would you check if the model is suffering from multi Collinearity?
2. What is transfer learning? Steps you would take to perform transfer learning.
3. Why is CNN architecture suitable for image classification? Not an RNN?
4. What are the approaches for solving class imbalance problem?
5. When sampling what types of biases can be inflected? How to control the biases?
6. Explain concepts of epoch, batch, iteration in machine learning.
7. What type of performance metrics would you choose to evaluate the different classification
models and why?
8. What are some of the types of activation functions and specifically when to use them?
9. What is the difference between Batch and Stochastic Gradient Descent?
10. What is difference between K-NN and K-Means clustering?
11. How to handle missing data? What imputation techniques can be used?
Company: Landmark group
1. Use Case - Consider you are working for pen manufacturing company. How would you help sales
team with leads using Data analysis?
2. Interviewers ask about scenarios or use-case based questions to know interviewee thought
process and problem-solving skills.
3. Assume you were given access to a website google analytics data.
4. In order to increase conversions, how do you perform A/B testing to identify best page design.
5. How is random forest different from Gradient boosting algorithm, given both are tree-based
algorithm?
6. Describe steps involved in creating a neural network?
7. LSTM solves the vanishing gradient problem, that RNN primarily have. How?
8. In brief, how would you perform the task of sentiment analysis?