Interview Questions

The document outlines various SQL concepts, including the use of WHERE and HAVING clauses, CTEs, and joins, along with practical examples using sample data. It also discusses how to calculate pass percentages, create views, and rank test results using SQL and Python with pandas. Additionally, it touches on Power BI topics such as data sources, report development, and the differences between measures and calculated columns.

Uploaded by

gupta.ayushi2425

Introduction

Technical skillset and projects worked on

Role and Responsibilities in current project

SQL
-----------------------

1. Difference between the WHERE and HAVING clauses

2. Use of CTE
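
A minimal sketch of both questions, run through Python's built-in sqlite3 module. The table name SampleData and its columns come from the sample data used later in these notes; the CTE name device_avg and the thresholds are made up for illustration. WHERE filters rows before grouping, HAVING filters groups after aggregation, and a CTE names an intermediate result that the outer query can reuse.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE SampleData (Sno INTEGER, Measure_value INTEGER)")
conn.executemany(
    "INSERT INTO SampleData VALUES (?, ?)",
    [(1, 10), (1, 15), (1, 18), (2, 12), (3, 16), (4, 15), (5, 10), (5, 9), (5, 7)],
)

# WHERE filters individual rows before any grouping happens.
where_rows = conn.execute(
    "SELECT Sno, Measure_value FROM SampleData "
    "WHERE Measure_value > 15 ORDER BY Sno"
).fetchall()

# HAVING filters whole groups after the aggregate has been computed.
having_rows = conn.execute(
    "SELECT Sno, COUNT(*) AS tests FROM SampleData "
    "GROUP BY Sno HAVING COUNT(*) > 1 ORDER BY Sno"
).fetchall()

# A CTE (hypothetical name: device_avg) names an intermediate result.
cte_rows = conn.execute(
    "WITH device_avg AS ("
    "  SELECT Sno, AVG(Measure_value) AS avg_value FROM SampleData GROUP BY Sno"
    ") SELECT Sno FROM device_avg WHERE avg_value > 14 ORDER BY Sno"
).fetchall()

print(where_rows)   # rows with a value above 15
print(having_rows)  # devices tested more than once
print(cte_rows)     # devices whose average value exceeds 14
```

An aggregate such as COUNT(*) cannot be used in WHERE, because WHERE is evaluated before the groups exist; that is the usual reason to reach for HAVING.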

Sample Data
---------------

Sno   Test Station
1     A
2     B
3     C
4     D
5     A

Test Station   Status
A              Green
B              Green
C              Red

1. Give output of left join, inner join on above data

2. Which one will take less runtime
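
One way to check the expected join output is to load the two sample tables into an in-memory SQLite database. The table names Devices and StationStatus are assumptions, since the notes do not name these tables.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table names; the rows are the sample data from the notes.
conn.executescript("""
CREATE TABLE Devices (Sno INTEGER, Test_Station TEXT);
CREATE TABLE StationStatus (Test_Station TEXT, Status TEXT);
INSERT INTO Devices VALUES (1,'A'),(2,'B'),(3,'C'),(4,'D'),(5,'A');
INSERT INTO StationStatus VALUES ('A','Green'),('B','Green'),('C','Red');
""")

query = """
SELECT d.Sno, d.Test_Station, s.Status
FROM Devices d {join} JOIN StationStatus s ON d.Test_Station = s.Test_Station
ORDER BY d.Sno
"""

# LEFT JOIN keeps every device; station D has no status, so Status is NULL.
left_rows = conn.execute(query.format(join="LEFT")).fetchall()
# INNER JOIN keeps only devices whose station appears in both tables.
inner_rows = conn.execute(query.format(join="INNER")).fetchall()

print(left_rows)
print(inner_rows)
```

The left join returns all five devices, with None (NULL) as the status for station D, while the inner join drops that row. As for runtime, the answer depends on the database engine, the available indexes and the data volume; an inner join often returns fewer rows, but neither join type is guaranteed to be faster.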

Sample Data - Sno is the unique number of each device

Sno   Measurement_Name   Measure_value   time_stamp
1     Left Speaker       10              10:00 AM
1     Left Speaker       15              11:00 AM
1     Left Speaker       18              12:30 PM
2     Left Speaker       12              10:00 AM
3     Left Speaker       16              11:00 AM
4     Left Speaker       15              12:30 PM
5     Left Speaker       10              10:00 AM
5     Left Speaker       9               11:00 AM
5     Left Speaker       7               12:30 PM

1. Add a column "Test_result" that contains "Pass" if Measure_value > 15 and "Fail" otherwise.
Select s.Sno, s.Measurement_Name, s.Measure_value, s.time_stamp,
Case when s.Measure_value>15 then 'Pass' else 'Fail' end as Test_Result
from SampleData s

2. Fetch the last test result for each device based on time_stamp.
With CTE as (
Select Sno, max(time_stamp) as last_timestamp
from SampleData
group by Sno
)
Select s.Sno, s.Measurement_Name, s.Measure_value, s.time_stamp,
Case when s.Measure_value>15 then 'Pass' else 'Fail' end as Test_Result
from SampleData s join CTE c on s.Sno = c.Sno and s.time_stamp = c.last_timestamp

3. Create a view with the pass percentage of each device.
CREATE VIEW PassPercentageView AS
SELECT
Sno,
round((SUM(CASE WHEN Measure_value > 15 THEN 1 ELSE 0 END) * 100.0 /
COUNT(*)),2) AS pass_percentage
FROM
SampleData
GROUP BY
Sno;
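
The view can be tried out against the sample data with Python's built-in sqlite3 module. This is only a sketch under that assumption; other engines may round or divide differently. It also shows why the query multiplies by 100.0 rather than 100.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE SampleData (Sno INTEGER, Measure_value INTEGER)")
conn.executemany(
    "INSERT INTO SampleData VALUES (?, ?)",
    [(1, 10), (1, 15), (1, 18), (2, 12), (3, 16), (4, 15), (5, 10), (5, 9), (5, 7)],
)

conn.execute("""
CREATE VIEW PassPercentageView AS
SELECT Sno,
       ROUND(SUM(CASE WHEN Measure_value > 15 THEN 1 ELSE 0 END) * 100.0 / COUNT(*), 2)
           AS pass_percentage
FROM SampleData
GROUP BY Sno
""")

view_rows = conn.execute(
    "SELECT Sno, pass_percentage FROM PassPercentageView ORDER BY Sno"
).fetchall()

# With 100 instead of 100.0, SQLite divides integers and drops the fraction.
truncated = conn.execute(
    "SELECT SUM(CASE WHEN Measure_value > 15 THEN 1 ELSE 0 END) * 100 / COUNT(*) "
    "FROM SampleData WHERE Sno = 1"
).fetchone()[0]

print(view_rows)
print(truncated)
```

Device 1 passes one of its three tests, so the view reports roughly 33.33 for it, whereas the integer version truncates the same ratio to 33.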

4. Use the RANK function to create a view that ranks each device's test results in
descending order of time_stamp
CREATE VIEW RankedTestResults AS
SELECT
Sno,
Measurement_Name,
Measure_value,
time_stamp,
case when Measure_value>15 then 'Pass'
else 'Fail'
end as Test_Result,
Rank() Over (partition by Sno Order by time_stamp desc) as Ranks
from SampleData
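
A sketch of the ranked view in SQLite, assuming a build with window-function support (SQLite 3.25 or later). The times are stored here as 24-hour 'HH:MM' text, which is an assumption made so that text ordering matches chronological ordering; the notes' own table uses AM/PM strings.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE SampleData (Sno INTEGER, Measure_value INTEGER, time_stamp TEXT)"
)
# Assumption: 24-hour 'HH:MM' text, so sorting the text sorts the times.
rows = [
    (1, 10, "10:00"), (1, 15, "11:00"), (1, 18, "12:30"),
    (2, 12, "10:00"), (3, 16, "11:00"), (4, 15, "12:30"),
    (5, 10, "10:00"), (5, 9, "11:00"), (5, 7, "12:30"),
]
conn.executemany("INSERT INTO SampleData VALUES (?, ?, ?)", rows)

conn.execute("""
CREATE VIEW RankedTestResults AS
SELECT Sno, Measure_value, time_stamp,
       RANK() OVER (PARTITION BY Sno ORDER BY time_stamp DESC) AS Ranks
FROM SampleData
""")

# Rank 1 is the most recent test of each device.
latest = conn.execute(
    "SELECT Sno, Measure_value FROM RankedTestResults WHERE Ranks = 1 ORDER BY Sno"
).fetchall()
print(latest)
```

Filtering the view on Ranks = 1 is another way to answer question 2 (last test result per device) without the separate max(time_stamp) CTE.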

import pandas as pd

# Sample data as a dictionary of column lists

data = {
    'Sno': [1, 1, 1, 2, 3, 4, 5, 5, 5],
    'Measurement_Name': ['Left Speaker'] * 9,
    'Measure_value': [10, 15, 18, 12, 16, 15, 10, 9, 7],
    # Times follow the sample table above (10:00 AM, 11:00 AM, 12:30 PM)
    'time_stamp': ['2024-10-11 10:00:00', '2024-10-11 11:00:00', '2024-10-11 12:30:00',
                   '2024-10-11 10:00:00', '2024-10-11 11:00:00', '2024-10-11 12:30:00',
                   '2024-10-11 10:00:00', '2024-10-11 11:00:00', '2024-10-11 12:30:00']
}

# Convert the data to a pandas DataFrame

df = pd.DataFrame(data)

# Convert time_stamp to datetime format

df['time_stamp'] = pd.to_datetime(df['time_stamp'])

# -----------------------------------------------------------------------------
# 1. Add a "Test_result" column
# -----------------------------------------------------------------------------
df['Test_result'] = df['Measure_value'].apply(lambda x: 'Pass' if x > 15 else 'Fail')
print("Data with Test_result column:")
print(df)

# -----------------------------------------------------------------------------
# 2. Fetch the last test result data for each device based on time_stamp
# -----------------------------------------------------------------------------
# Sort by time so that last() picks the most recent row of each device
df_last_test = df.sort_values(by='time_stamp').groupby('Sno').last().reset_index()
print("\nLast test result for each device:")
print(df_last_test)

# -----------------------------------------------------------------------------
# 3. Calculate the pass percentage for each device
# -----------------------------------------------------------------------------
# Requires the Test_result column created in step 1
pass_percentage_df = df.groupby('Sno').apply(
    lambda x: round((x['Test_result'] == 'Pass').sum() * 100.0 / len(x), 2)
).reset_index(name='pass_percentage')
print("\nPass percentage for each device:")
print(pass_percentage_df)

# -----------------------------------------------------------------------------
# 4. Rank the test results for each device based on time_stamp
# -----------------------------------------------------------------------------
# method='min' mirrors SQL RANK(); method='dense' would mirror DENSE_RANK()
df['rank'] = df.groupby('Sno')['time_stamp'].rank(ascending=False, method='min')
print("\nRanked test results based on time_stamp:")
print(df)

-----------------------------------------------------------------------------------
-------------------------------------

Sample Data

Sno   Measurement Name   Measure value   time stamp   Date    Test Station
11    Left Speaker       10              10:00 AM     1-Jan   A
11    Left Speaker       15              11:00 AM     1-Jan   B
11    Left Speaker       18              12:30 PM     1-Jan   A
12    Left Speaker       12              10:00 AM     1-Jan   B
13    Left Speaker       16              11:00 AM     1-Jan   A
14    Left Speaker       15              12:30 PM     1-Jan   A
15    Left Speaker       10              10:00 AM     1-Jan   A
15    Left Speaker       9               11:00 AM     1-Jan   B
15    Left Speaker       7               12:30 PM     1-Jan   B
1     Left Speaker       10              10:00 AM     2-Jan   A
1     Left Speaker       15              11:00 AM     2-Jan   C
1     Left Speaker       18              12:30 PM     2-Jan   G
2     Left Speaker       12              10:00 AM     2-Jan   A
3     Left Speaker       16              11:00 AM     2-Jan   A
4     Left Speaker       15              12:30 PM     2-Jan   D
5     Left Speaker       10              10:00 AM     2-Jan   H
5     Left Speaker       9               11:00 AM     2-Jan   I
5     Left Speaker       7               12:30 PM     2-Jan   B

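1. Fetch the test pass percentage of each test station

-- 100.0 forces decimal division; with 100, engines that use integer division truncate the result
Select Test_Station,
sum(Case when Measure_value>15 then 1 else 0 end) * 100.0 / count(*) as Test_Result_Percentage
from measurements
group by Test_Station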

2. Test station with the highest pass percentage

SELECT
Test_Station,
ROUND(
(SUM(CASE WHEN Measure_value > 15 THEN 1 ELSE 0 END) * 100.0) / COUNT(*),
2
) AS pass_percentage
FROM
measurements
GROUP BY
Test_Station
ORDER BY
pass_percentage DESC
LIMIT 1; -- For MySQL

SELECT TOP 1
Test_Station,
ROUND(
(SUM(CASE WHEN Measure_value > 15 THEN 1 ELSE 0 END) * 100.0) /
NULLIF(COUNT(*), 0),
2
) AS pass_percentage
FROM
measurements
GROUP BY
Test_Station
ORDER BY
pass_percentage DESC; -- For SQL Server

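-- Same result with a CTE: keep only the station(s) whose percentage equals the maximum.
-- (Grouping by Test_Station again and taking max() would return every station.)
With CTE as
(Select Test_Station,
sum(Case when Measure_value>15 then 1 else 0 end) * 100.0 / count(*) as Test_Result_Percentage
from measurements
group by Test_Station)

Select Test_Station, Test_Result_Percentage
from CTE
where Test_Result_Percentage = (Select max(Test_Result_Percentage) from CTE)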
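
The "highest pass percentage" question can be checked against the sample data with an in-memory SQLite database. This is a sketch only: the CTE name station_pct is made up for illustration, and the table holds just the two columns the query needs.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE measurements (Test_Station TEXT, Measure_value INTEGER)")

# Test Station and Measure value columns of the sample data, in row order.
stations = ["A", "B", "A", "B", "A", "A", "A", "B", "B",
            "A", "C", "G", "A", "A", "D", "H", "I", "B"]
values = [10, 15, 18, 12, 16, 15, 10, 9, 7,
          10, 15, 18, 12, 16, 15, 10, 9, 7]
conn.executemany("INSERT INTO measurements VALUES (?, ?)", zip(stations, values))

best = conn.execute("""
WITH station_pct AS (
    SELECT Test_Station,
           ROUND(SUM(CASE WHEN Measure_value > 15 THEN 1 ELSE 0 END) * 100.0 / COUNT(*), 2)
               AS pass_percentage
    FROM measurements
    GROUP BY Test_Station
)
SELECT Test_Station, pass_percentage
FROM station_pct
WHERE pass_percentage = (SELECT MAX(pass_percentage) FROM station_pct)
""").fetchall()
print(best)
```

Comparing against MAX in a subquery returns every station tied for the top value, whereas LIMIT 1 or TOP 1 keeps an arbitrary one of them. On this data the winner is station G, which ran a single test, so a minimum test count may be worth adding before drawing conclusions.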

-----------------------------------------------------------------------------------
----------------

import pandas as pd

# Sample data
data = {
    'Sno': [11, 11, 11, 12, 13, 14, 15, 15, 15, 1, 1, 1, 2, 3, 4, 5, 5, 5],
    'Measurement Name': ['Left Speaker'] * 18,
    'Measure value': [10, 15, 18, 12, 16, 15, 10, 9, 7, 10, 15, 18, 12, 16, 15, 10, 9, 7],
    'time stamp': ['10:00AM', '11:00 AM', '12:30 PM', '10:00AM', '11:00 AM', '12:30 PM',
                   '10:00AM', '11:00 AM', '12:30 PM', '10:00AM', '11:00 AM', '12:30 PM',
                   '10:00AM', '11:00 AM', '12:30 PM', '10:00AM', '11:00 AM', '12:30 PM'],
    'Date': ['1-Jan'] * 9 + ['2-Jan'] * 9,
    'Test Station': ['A', 'B', 'A', 'B', 'A', 'A', 'A', 'B', 'B',
                     'A', 'C', 'G', 'A', 'A', 'D', 'H', 'I', 'B']
}

# Creating the DataFrame

df = pd.DataFrame(data)

# Label each test as "Pass" (Measure value > 15) or "Fail"
df["test_pass_fail"] = df["Measure value"].apply(lambda x: "Pass" if x > 15 else "Fail")
df.head(5)

# -----------------------------------------------------------------------------
# Total tests, passed tests and pass percentage per test station
pass_percent_stat = df.groupby('Test Station').agg(
    total_tests=('test_pass_fail', 'size'),
    pass_tests=('test_pass_fail', lambda x: (x == 'Pass').sum()))
pass_percent = pass_percent_stat['pass_tests'] / pass_percent_stat['total_tests'] * 100
pass_percent_stat['pass_percentage'] = pass_percent
pass_percent_stat
# -----------------------------------------------------------------------------

# Same calculation in one step, rounded to two decimals
pass_percentage_df = df.groupby('Test Station').apply(
    lambda x: round((x['test_pass_fail'] == 'Pass').sum() * 100.0 / len(x), 2)
).reset_index(name='pass_percentage')
print("\nPass percentage for each test station:")
print(pass_percentage_df)

-----------------------------------------------------------------------------------
-----------------------------

pass_highest_percent = pass_percentage_df.sort_values(by='pass_percentage',
ascending=False)
pass_highest_percent.head(1)

-----------------------------------------------------------------------------------
---------------------------------
PYTHON
-------------
1. Implement the above scenarios in Python using pandas
2. Some basic theoretical questions on Python

POWER BI
------------------
1. Various data sources and how to connect to them
2. How to develop a report from scratch (steps)
3. Difference between a measure and a calculated column
4. Which takes less time to load: a measure or a calculated column?
