1. What tools and libraries have you used for visualization?
I’ve used Power BI and Tableau for creating interactive dashboards, and in
Python, I’ve worked with libraries like Matplotlib, Seaborn, and Plotly for
custom visualizations. For web-based visual storytelling, I’ve explored Plotly
Dash and Streamlit.
2. When would you use a bar chart vs. a histogram?
A bar chart is used for categorical data to compare counts or values
across discrete categories.
A histogram is used for continuous numerical data to show the
frequency distribution across intervals or bins.
3. How do you create a heatmap in Python?
I typically use Seaborn’s heatmap() function. For example, to visualize
correlation:
4. How would you visualize the distribution of a continuous
variable?
Depending on the purpose:
Histogram for frequency
KDE Plot (Kernel Density Estimate) for smoothed distribution
Boxplot to show distribution and outliers
Violin Plot to combine KDE and boxplot
5. What’s your approach to building dashboards?
My approach includes:
1. Understanding user needs – I align KPIs with stakeholder goals.
2. Data preparation – Clean and aggregate relevant data.
3. Design – Use clear layout, color consistency, and interactivity.
4. Iteration – Share with stakeholders, get feedback, and improve.
Tools: Power BI/Tableau for business users; Streamlit/Dash for tailored
solutions.
6. What are the key elements of a good data visualization?
Clarity: Avoid clutter, label axes, use readable fonts
Relevance: Visuals should answer business questions
Accuracy: Avoid misleading scales or chart types
Interactivity (for dashboards): Drill-downs and filters help explore
data
Design: Use color and layout effectively
7. What is a boxplot and what insights can you get from it?
A boxplot summarizes the distribution of a variable:
Median (central tendency)
Quartiles (spread)
Outliers
Symmetry/skewness
It’s helpful for comparing distributions across groups and spotting
anomalies.
8. How do you handle visualizing time series data?
Use line charts to show trends over time
Add rolling averages to smooth noise
Use seasonal decomposition for deeper insights
For multiple series, I use interactive dashboards to filter and
compare timeframes.
9. What’s the difference between line plots and area charts?
Line plots emphasize the trend or value change over time.
Area charts emphasize both trend and the magnitude (volume) by
filling space under the line — useful for cumulative metrics or
comparing parts of a whole over time.
10. Which chart type would you use to show correlation between
two variables?
A scatter plot is best to show correlation. Adding a regression line or
coloring by a third variable (via hue or size) adds more depth.
To quantify the strength of the correlation, I use the correlation coefficient
along with the visual.