0% found this document useful (0 votes)

4 views7 pages

100+ Python Questions

The document contains 100 Python interview questions and answers tailored for data analysts, covering topics such as Python basics, NumPy, Pandas, data cleaning, visualization, SQL integration, and statistical analysis. It provides practical code snippets and explanations for each question, making it a comprehensive resource for preparing for data analyst interviews. Key areas include handling missing values, data manipulation, and creating visualizations.

Uploaded by

Chetan Priyanka

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views7 pages

100+ Python Questions

Uploaded by

Chetan Priyanka

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Python Interview Q&A for Data Analysts

( 100 Questions )
Based on Pandas, NumPy, Data Cleaning, Data Visualization, SQL integration, and problem-solving — the areas
where Data Analysts usually get Python interview questions.

Python Interview Q&A for Data Analysts (100 Questions)

1. Python Basics for Data Analysis (Q1–Q15)

Q1. Why is Python popular in Data Analysis?

A: Easy to learn, rich libraries (Pandas, NumPy, Matplotlib), strong community support.

Q2. What are key Python libraries for Data Analysis?

A: Pandas, NumPy, Matplotlib, Seaborn, SciPy, Statsmodels, Scikit-learn.

Q3. What is Jupyter Notebook?

A: An interactive environment for writing, testing, and visualizing Python code.

Q4. What are Python data types important for analysis?

A: int, float, str, list, tuple, dict, set.

Q5. Difference between Python list and NumPy array?

A: List is flexible but slower; NumPy array is faster and optimized for numerical operations.

Q6. What are Python’s mutable vs immutable objects?

A: Mutable → list, dict. Immutable → int, float, str, tuple.

Q7. How do you install Python libraries?

A: Using pip install package_name.

Q8. What is a virtual environment?

A: Isolated workspace to manage project-specific dependencies.

Q9. Difference between Python script and Jupyter Notebook?

A: Script is sequential code, Notebook supports interactive coding + visualization.

Q10. What is the use of Python’s with statement?

A: Resource management (e.g., auto-closing files).

Q11. What is Python’s id() function?

A: Returns memory address of an object.

Q12. What is difference between Python 2 and Python 3 in data analysis?

A: Python 3 supports Unicode, better libraries; Python 2 is outdated.

Q13. What is the difference between is and == in Python?

A: is → identity, == → value equality.

Q14. How do you handle missing values in Python?

A: Using Pandas dropna() or fillna().

Q15. What is type casting in Python?

A: Converting data types (e.g., int("10")).
2. NumPy (Q16–Q25)

Q16. What is NumPy?

A: Numerical Python library for fast numerical computations.

Q17. How to create a NumPy array?

import numpy as np

arr = np.array([1, 2, 3])

Q18. Difference between list and NumPy array?

A: List is general-purpose; NumPy array supports vectorized operations.

Q19. How to create a 2D array in NumPy?

np.array([[1,2,3],[4,5,6]])

Q20. What is broadcasting in NumPy?

A: Performing operations on arrays of different shapes.

Q21. How to calculate mean, median, std in NumPy?

np.mean(arr), np.median(arr), np.std(arr)

Q22. What is difference between np.zeros() and np.ones()?

A: Creates arrays filled with zeros or ones.

Q23. How to generate random numbers in NumPy?

np.random.rand(3,2)

Q24. What is slicing in NumPy arrays?

A: Accessing subsets: arr[1:4].

Q25. How to find unique values in a NumPy array?

np.unique(arr)

3. Pandas (Q26–Q45)

Q26. What is Pandas?

A: A library for data manipulation and analysis.

Q27. Difference between Pandas Series and DataFrame?

A: Series = 1D, DataFrame = 2D table.

Q28. How to create a Pandas DataFrame?

import pandas as pd

df = pd.DataFrame({"Name":["A","B"],"Age":[20,25]})

Q29. How to read CSV file in Pandas?

pd.read_csv("file.csv")

Q30. How to check DataFrame shape?

df.shape

Q31. How to get column names in Pandas?

df.columns

Q32. How to filter rows in Pandas?

df[df["Age"] > 25]

Q33. How to handle missing values?

• df.dropna() → remove

• df.fillna(value) → replace

Q34. How to group data in Pandas?

df.groupby("Department")["Salary"].mean()

Q35. How to merge two DataFrames?

pd.merge(df1, df2, on="ID")

Q36. Difference between loc and iloc?

• loc → label-based

• iloc → index-based

Q37. How to sort DataFrame?

df.sort_values(by="Salary", ascending=False)

Q38. How to rename columns in Pandas?

df.rename(columns={"old":"new"})

Q39. How to apply custom function on DataFrame?

df["Col"].apply(lambda x: x*2)

Q40. How to check data types of columns?

df.dtypes

Q41. How to convert column to datetime in Pandas?

pd.to_datetime(df["Date"])

Q42. How to pivot table in Pandas?

df.pivot_table(values="Sales", index="Region", columns="Year")

Q43. How to reset index in Pandas?

df.reset_index(drop=True)

Q44. How to check duplicate rows?

df.duplicated().sum()

Q45. How to remove duplicates in Pandas?

df.drop_duplicates()
4. Data Cleaning & Transformation (Q46–Q60)

Q46. How do you handle outliers?

A: Using IQR, Z-score, or capping.

Q47. How to replace values in DataFrame?

df["Gender"].replace({"M":"Male","F":"Female"})

Q48. How to check null values in DataFrame?

df.isnull().sum()

Q49. How to combine text columns?

df["FullName"] = df["First"] + " " + df["Last"]

Q50. How to change column data type?

df["Age"] = df["Age"].astype(int)

Q51. How to extract year from datetime column?

df["Year"] = df["Date"].dt.year

Q52. What is difference between applymap, map, apply?

• map → Series

• apply → row/col in DataFrame

• applymap → entire DataFrame

Q53. How to normalize column values?

(df["col"] - df["col"].min()) / (df["col"].max()-df["col"].min())

Q54. How to concatenate DataFrames?

pd.concat([df1, df2])

Q55. How to one-hot encode categorical variables?

pd.get_dummies(df, columns=["Category"])

Q56. How to bin continuous values?

pd.cut(df["Age"], bins=[0,18,30,50,100])

Q57. How to change column order in Pandas?

df = df[["Col2","Col1","Col3"]]

Q58. How to melt a DataFrame?

pd.melt(df, id_vars=["ID"], value_vars=["Math","Science"])

Q59. How to check correlation in Pandas?

df.corr()

Q60. How to detect skewness?

df["col"].skew()

5. Visualization (Q61–Q70)

Q61. How to plot bar chart in Matplotlib?

import matplotlib.pyplot as plt

df["Col"].value_counts().plot(kind="bar")

plt.show()

Q62. How to plot histogram?

df["Col"].hist()

Q63. How to plot line chart?

df.plot(x="Date", y="Sales")

Q64. How to plot scatter plot?

df.plot.scatter(x="Age", y="Salary")

Q65. What is Seaborn?

A: Advanced data visualization library built on Matplotlib.

Q66. How to plot heatmap in Seaborn?

import seaborn as sns

sns.heatmap(df.corr(), annot=True)

Q67. How to plot boxplot in Seaborn?

sns.boxplot(x="Category", y="Sales", data=df)

Q68. How to show multiple plots in one figure?

plt.subplot(2,2,1); plt.plot(df["Sales"])

Q69. How to set figure size in Matplotlib?

plt.figure(figsize=(10,5))

Q70. How to save plot as image?

plt.savefig("chart.png")

6. SQL & Python (Q71–Q75)

Q71. How do you connect Python to SQL?

A: Using libraries like pyodbc, sqlalchemy, sqlite3.

Q72. How to read SQL data into Pandas?

pd.read_sql("SELECT * FROM table", conn)

Q73. How to insert Pandas DataFrame into SQL?

df.to_sql("table", conn, if_exists="replace", index=False)

Q74. What is difference between read_sql_query and read_sql_table?

• query → runs SQL query

• table → fetches full table

Q75. How to handle large datasets from SQL in Python?

A: Use chunks (chunksize), optimize queries, use indexes.

7. Statistics & Analysis (Q76–Q85)

Q76. How to calculate correlation in Python?

df.corr()

Q77. How to calculate standard deviation?

df["col"].std()

Q78. How to calculate variance?

df["col"].var()

Q79. How to calculate skewness and kurtosis?

df["col"].skew(), df["col"].kurt()

Q80. How to detect outliers using IQR?

Q1, Q3 = df["col"].quantile([0.25,0.75])

IQR = Q3 - Q1

outliers = df[(df["col"] < Q1-1.5IQR) | (df["col"] > Q3+1.5IQR)]

Q81. How to calculate moving average?

df["col"].rolling(3).mean()

Q82. How to calculate correlation heatmap?

sns.heatmap(df.corr(), annot=True)

Q83. How to calculate percentile?

np.percentile(df["col"], 90)

Q84. How to calculate z-score?

from scipy.stats import zscore

df["zscore"] = zscore(df["col"])

Q85. How to calculate covariance?

df.cov()

8. Case Studies & Coding (Q86–Q100)

Q86. Find top 5 highest salaries from DataFrame.

df.nlargest(5,"Salary")

Q87. Find employees with salary above average.

df[df["Salary"] > df["Salary"].mean()]

Q88. Count number of employees per department.

df["Dept"].value_counts()

Q89. Find customers who purchased more than 5 times.

df["Customer"].value_counts()[df["Customer"].value_counts() > 5]

Q90. Find 2nd highest salary.

df["Salary"].nlargest(2).iloc[-1]

Q91. Calculate total sales by region.

df.groupby("Region")["Sales"].sum()

Q92. Show top 10 products by revenue.

df.groupby("Product")["Revenue"].sum().nlargest(10)

Q93. Find percentage contribution of each category.

(df.groupby("Category")["Sales"].sum() / df["Sales"].sum())*100

Q94. Find duplicate customers.

df[df.duplicated("CustomerID")]

Q95. Get month with highest sales.

df.groupby(df["Date"].dt.month)["Sales"].sum().idxmax()

Q96. Find average order value (AOV).

df["Sales"].sum()/df["OrderID"].nunique()

Q97. Calculate churn rate.

A: (Lost Customers / Total Customers at Start) * 100.

Q98. Detect null percentage in each column.

df.isnull().mean()*100

Q99. Calculate profit margin % per order.

df["ProfitMargin"] = (df["Profit"]/df["Sales"])*100

Q100. Build customer segmentation using Pandas.

df.groupby("Segment")["Sales"].agg(["mean","sum","count"])

60 Python Interview Qs Every Data Analyst Must Know
No ratings yet
60 Python Interview Qs Every Data Analyst Must Know
11 pages
Python Interview Questions
No ratings yet
Python Interview Questions
6 pages
Python Libraries for Data Science
No ratings yet
Python Libraries for Data Science
96 pages
Python Data Analysis Interview Notes Real World Scenarios
No ratings yet
Python Data Analysis Interview Notes Real World Scenarios
5 pages
CSE445 NSU Week - 3
No ratings yet
CSE445 NSU Week - 3
48 pages
Python Libraries for Statistical Analysis
No ratings yet
Python Libraries for Statistical Analysis
40 pages
Python For Data Science
No ratings yet
Python For Data Science
45 pages
Python Data Analysis Libraries Guide
100% (1)
Python Data Analysis Libraries Guide
43 pages
Python For Data Analysis Edgar
No ratings yet
Python For Data Analysis Edgar
49 pages
100 Python Interview Questions
100% (1)
100 Python Interview Questions
68 pages
Top Python Questions 1735201448
No ratings yet
Top Python Questions 1735201448
25 pages
Python For Data Analysis
No ratings yet
Python For Data Analysis
47 pages
Python For ML
No ratings yet
Python For ML
41 pages
Python Unit 2 Question Bank
No ratings yet
Python Unit 2 Question Bank
5 pages
Python Libraries 2
No ratings yet
Python Libraries 2
80 pages
Murali Internship
No ratings yet
Murali Internship
34 pages
Python MCQs Test Papers Expanded
No ratings yet
Python MCQs Test Papers Expanded
7 pages
Analystics Data Cleaning Questions Interview
No ratings yet
Analystics Data Cleaning Questions Interview
8 pages
Python Data Science Guide
100% (2)
Python Data Science Guide
47 pages
DevOps Session 3 Pandas
No ratings yet
DevOps Session 3 Pandas
33 pages
Viva Questions
No ratings yet
Viva Questions
7 pages
20ca2204 Data Science QB With Answers
No ratings yet
20ca2204 Data Science QB With Answers
48 pages
Python Data Analysis Tutorial
No ratings yet
Python Data Analysis Tutorial
47 pages
Introduction to Pandas in Python
No ratings yet
Introduction to Pandas in Python
5 pages
Python Pandas: Data Manipulation Guide
No ratings yet
Python Pandas: Data Manipulation Guide
84 pages
Every Data Analyst Should Know !
No ratings yet
Every Data Analyst Should Know !
4 pages
Homework File 1434
No ratings yet
Homework File 1434
9 pages
Pandas FAQ: Week 3 Guide
No ratings yet
Pandas FAQ: Week 3 Guide
3 pages
Wa0005.
No ratings yet
Wa0005.
29 pages
Usage of NumPy For Numerical Data in Detail
No ratings yet
Usage of NumPy For Numerical Data in Detail
52 pages
Python For Data Analysis Jan 28
No ratings yet
Python For Data Analysis Jan 28
105 pages
Worksheet Class 12 Ai
No ratings yet
Worksheet Class 12 Ai
38 pages
Unit - 4 - Part 2
No ratings yet
Unit - 4 - Part 2
36 pages
Day 2 Python Interview QnA
No ratings yet
Day 2 Python Interview QnA
15 pages
Data Analyst
No ratings yet
Data Analyst
14 pages
2A - Python+Data Analysis For Pyhton2 v2
No ratings yet
2A - Python+Data Analysis For Pyhton2 v2
38 pages
CH 1 Type A Exercise
No ratings yet
CH 1 Type A Exercise
7 pages
Unit 3 (FODS)
No ratings yet
Unit 3 (FODS)
34 pages
More On Pandas
No ratings yet
More On Pandas
51 pages
Python Data Science: Pandas & ML Basics
100% (1)
Python Data Science: Pandas & ML Basics
41 pages
Essential Pandas Operations Guide
No ratings yet
Essential Pandas Operations Guide
8 pages
Pandas
No ratings yet
Pandas
5 pages
Python Data Exploration Guide
100% (1)
Python Data Exploration Guide
12 pages
Common Python Data Science Interview Questions1
No ratings yet
Common Python Data Science Interview Questions1
5 pages
8th of 10 Python Resources PANDAS Interview Q A ? 1737825285
No ratings yet
8th of 10 Python Resources PANDAS Interview Q A ? 1737825285
19 pages
Python NumPy and Pandas MCQs
No ratings yet
Python NumPy and Pandas MCQs
8 pages
40 NumPy and Pandas Interview Questions With Answers 1740141557
No ratings yet
40 NumPy and Pandas Interview Questions With Answers 1740141557
6 pages
CO3 - 1 - Pandas Series and Data Frame
No ratings yet
CO3 - 1 - Pandas Series and Data Frame
37 pages
Unit III - Notes
No ratings yet
Unit III - Notes
12 pages
Pandas Interview Questions
No ratings yet
Pandas Interview Questions
21 pages
41 DS PL MF
No ratings yet
41 DS PL MF
20 pages
Python Unit Iv - Pandas
No ratings yet
Python Unit Iv - Pandas
36 pages
Lecture Week2
No ratings yet
Lecture Week2
72 pages
Q-Step WS 06112019 Data Analysis and Visualisation With Python
No ratings yet
Q-Step WS 06112019 Data Analysis and Visualisation With Python
76 pages
Govt Ibm Rough Notes
No ratings yet
Govt Ibm Rough Notes
3 pages
Govt Ibm Assign
No ratings yet
Govt Ibm Assign
3 pages
Mentorship Program
No ratings yet
Mentorship Program
15 pages
Software Testing Quality Assurance Notes
No ratings yet
Software Testing Quality Assurance Notes
17 pages
Offer Letter - Founding Team Member
No ratings yet
Offer Letter - Founding Team Member
6 pages
Software Testing Quality Assurance Notes
No ratings yet
Software Testing Quality Assurance Notes
17 pages
Business Plan Analytics Career Connect
No ratings yet
Business Plan Analytics Career Connect
3 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
7 pages
Combined Use of HArmonyCa and Hyaluronic Acid Fillers. A Holistic Approach To Facial Rejuvenation - Sadiye Kus
No ratings yet
Combined Use of HArmonyCa and Hyaluronic Acid Fillers. A Holistic Approach To Facial Rejuvenation - Sadiye Kus
11 pages
Understanding Exploratory Data Analysis
0% (1)
Understanding Exploratory Data Analysis
17 pages
1 s2.0 S0956053X16307449 Main
No ratings yet
1 s2.0 S0956053X16307449 Main
10 pages
ASDM Exam Probability and Data Analysis
No ratings yet
ASDM Exam Probability and Data Analysis
15 pages
Angle 2011, Self-Reported Pain Associated With The Use of Intermaxillary Elastics Compared To Pain Experienced After Initial Archwire Placement
No ratings yet
Angle 2011, Self-Reported Pain Associated With The Use of Intermaxillary Elastics Compared To Pain Experienced After Initial Archwire Placement
5 pages
AP Stats: Data Distribution Basics
No ratings yet
AP Stats: Data Distribution Basics
3 pages
Quantiles for Math Students
No ratings yet
Quantiles for Math Students
30 pages
Statistics Assignment
No ratings yet
Statistics Assignment
3 pages
Case Study 219302405
No ratings yet
Case Study 219302405
14 pages
1 s2.0 S0266352X24000910 Main
No ratings yet
1 s2.0 S0266352X24000910 Main
17 pages
Business Statistics
No ratings yet
Business Statistics
69 pages
Batu Pahat Sem 3 (A)
No ratings yet
Batu Pahat Sem 3 (A)
9 pages
Descriptive Statistics Questions
No ratings yet
Descriptive Statistics Questions
33 pages
Beyond The Rule of 5: Lessons Learned From AbbVie's Drugs and Compound Collection
No ratings yet
Beyond The Rule of 5: Lessons Learned From AbbVie's Drugs and Compound Collection
56 pages
Stats
No ratings yet
Stats
24 pages
Zela.k, Mema.m, Zela.d Koferenc
No ratings yet
Zela.k, Mema.m, Zela.d Koferenc
21 pages
Decision Science & Data Analysis
No ratings yet
Decision Science & Data Analysis
26 pages
SYBSCIT Practical Course Guide
No ratings yet
SYBSCIT Practical Course Guide
14 pages
Statistics Scenarios
No ratings yet
Statistics Scenarios
3 pages
STA201 Lecture 1
No ratings yet
STA201 Lecture 1
21 pages
IB Math SL Statistics Review
No ratings yet
IB Math SL Statistics Review
11 pages
Chapter 6 (Philoid-In)
No ratings yet
Chapter 6 (Philoid-In)
17 pages
Basics of Assessment
No ratings yet
Basics of Assessment
17 pages
Describing Data:: Displaying and Exploring Data
No ratings yet
Describing Data:: Displaying and Exploring Data
28 pages
Data Science
No ratings yet
Data Science
23 pages
Understanding Data: Class 11 Guide
No ratings yet
Understanding Data: Class 11 Guide
11 pages
8438 Ecap792 Data Science Toolbox
No ratings yet
8438 Ecap792 Data Science Toolbox
317 pages
Maths IA
No ratings yet
Maths IA
16 pages
Measures of Variability: - Range - Interquartile Range - Variance - Standard Deviation - Coefficient of Variation
No ratings yet
Measures of Variability: - Range - Interquartile Range - Variance - Standard Deviation - Coefficient of Variation
37 pages