The document contains a series of questions related to Microsoft Fabric, covering topics such as T-SQL queries, data modeling, and Power BI functionalities. Each question requires selecting the correct options or completing code snippets to demonstrate understanding of data handling and analytics within the Fabric environment. The questions are designed to assess knowledge on various features and best practices in managing data warehouses and semantic models.


DP-600 Questions By AnmolPowerBICorner

Question-1
You have a Fabric warehouse that contains a table named Staging.Sales.
Staging.Sales contains the following columns.

You need to write a T-SQL query that will return data for the year 2023 that displays
ProductID and ProductName and has a summarized Amount that is higher than
10,000.
Which query should you use?

• A.

• B.
• C.

• D.

Question-2
You have a data warehouse that contains a table named Stage.Customers.
Stage.Customers contains all the customer record updates from a customer
relationship management (CRM) system. There can be multiple updates per
customer.
You need to write a T-SQL query that will return the customer ID, name, postal code,
and the last updated time of the most recent row for each customer ID.
How should you complete the code? To answer, select the appropriate options in the
answer area.
NOTE: Each correct selection is worth one point.

ROW_NUMBER(), where x = 1


Question-3
HOTSPOT -
You have a Fabric tenant.
You plan to create a Fabric notebook that will use Spark DataFrames to generate
Microsoft Power BI visuals.
You run the following code.

For each of the following statements, select Yes if the statement is true. Otherwise,
select No.
NOTE: Each correct selection is worth one point.

No, Yes, No

Question-4
You are the administrator of a Fabric workspace that contains a lakehouse named
Lakehouse1. Lakehouse1 contains the following tables:
Table1: A Delta table created by using a shortcut
Table2: An external table created by using Spark

Table3: A managed table -


You plan to connect to Lakehouse1 by using its SQL endpoint.
What will you be able to do after connecting to Lakehouse1?

• A. Read Table3.
• B. Update the data in Table3.
• C. Read Table2.
• D. Update the data in Table1.

Question-5
You have a Fabric tenant that contains a warehouse.
You use a dataflow to load a new dataset from OneLake to the warehouse.
You need to add a PowerQuery step to identify the maximum values for the numeric
columns.
Which function should you include in the step?

• A. Table.MaxN
• B. Table.Max
• C. Table.Range
• D. Table.Profile

Question-6
You have a Fabric tenant that contains a machine learning model registered in a
Fabric workspace.
You need to use the model to generate predictions by using the PREDICT function in
a Fabric notebook.
Which two languages can you use to perform model scoring? Each correct answer
presents a complete solution.
NOTE: Each correct answer is worth one point.

• A. T-SQL
• B. DAX
• C. Spark SQL
• D. PySpark

Question-7
You are analyzing the data in a Fabric notebook.
You have a Spark DataFrame assigned to a variable named df.
You need to use the Chart view in the notebook to explore the data manually.
Which function should you run to make the data available in the Chart view?

• A. displayHTML
• B. show
• C. write
• D. display
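
For reference, a minimal sketch of option D in a Fabric notebook cell. The sample DataFrame is illustrative only; spark is the session that Fabric provides automatically.

# Sketch: display() renders a Spark DataFrame with the interactive Chart view.
df = spark.createDataFrame(
    [("North", 120), ("South", 95), ("West", 143)],
    ["region", "sales"],
)
display(df)  # switch the rendered output from Table view to Chart view
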
Question-8
You have a Fabric tenant that contains a Microsoft Power BI report named Report1.
Report1 includes a Python visual.
Data displayed by the visual is grouped automatically and duplicate rows are NOT
displayed.
You need all rows to appear in the visual.
What should you do?

• A. Reference the columns in the Python code by index.


• B. Modify the Sort Column By property for all columns.
• C. Add a unique field to each row.
• D. Modify the Summarize By property for all columns.

Question-9
You have a Fabric tenant that contains a semantic model. The model contains data
about retail stores.
You need to write a DAX query that will be executed by using the XMLA endpoint.
The query must return a table of stores that have opened since December 1, 2023.
How should you complete the DAX expression? To answer, drag the appropriate
values to the correct targets. Each value may be used once, more than once, or not at
all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.

DEFINE, Evaluate, Summarize


Question-10
You have a Fabric workspace named Workspace1 that contains a dataflow named
Dataflow1. Dataflow1 has a query that returns 2,000 rows.
You view the query in Power Query as shown in the following exhibit.

What can you identify about the pickupLongitude column?

• A. The column has duplicate values.


• B. All the table rows are profiled.
• C. The column has missing values.
• D. There are 935 values that occur only once.

Question-11
You have a Fabric tenant named Tenant1 that contains a workspace named WS1.
WS1 uses a capacity named C1 and contains a dataset named DS1.
You need to ensure that read-write access to DS1 is available by using the XMLA endpoint.
What should be modified first?

• A. the DS1 settings


• B. the WS1 settings
• C. the C1 settings
• D. the Tenant1 settings

Question-12
You have a Fabric tenant that contains a workspace named Workspace1. Workspace1
is assigned to a Fabric capacity.
You need to recommend a solution to provide users with the ability to create and
publish custom Direct Lake semantic models by using external tools. The solution
must follow the principle of least privilege.
Which three actions in the Fabric Admin portal should you include in the
recommendation? Each correct answer presents part of the solution.
NOTE: Each correct answer is worth one point.

• A. From the Tenant settings, set Allow XMLA Endpoints and Analyze in Excel
with on-premises datasets to Enabled.
• B. From the Tenant settings, set Allow Azure Active Directory guest users to
access Microsoft Fabric to Enabled.
• C. From the Tenant settings, select Users can edit data model in the Power BI
service.
• D. From the Capacity settings, set XMLA Endpoint to Read Write.
• E. From the Tenant settings, set Users can create Fabric items to Enabled.
• F. From the Tenant settings, enable Publish to Web

Question-13
You are creating a semantic model in Microsoft Power BI Desktop.
You plan to make bulk changes to the model by using the Tabular Model Definition
Language (TMDL) extension for Microsoft Visual Studio Code.
You need to save the semantic model to a file.
Which file format should you use?

• A. PBIP
• B. PBIX
• C. PBIT
• D. PBIDS

Question-14
HOTSPOT -
You have a Fabric tenant that contains a warehouse named Warehouse1. Warehouse1
contains three schemas named schemaA, schemaB, and schemaC.
You need to ensure that a user named User1 can truncate tables in schemaA only.
How should you complete the T-SQL statement? To answer, select the appropriate
options in the answer area.
NOTE: Each correct selection is worth one point.
ALTER, SCHEMA

Question-15
You plan to deploy Microsoft Power BI items by using Fabric deployment pipelines.
You have a deployment pipeline that contains three stages named Development,
Test, and Production. A workspace is assigned to each stage.
You need to provide Power BI developers with access to the pipeline. The solution
must meet the following requirements:
Ensure that the developers can deploy items to the workspaces for Development and
Test.
Prevent the developers from deploying items to the workspace for Production.
Follow the principle of least privilege.
Which three levels of access should you assign to the developers? Each correct
answer presents part of the solution.
NOTE: Each correct answer is worth one point.

• A. Build permission to the production semantic models


• B. Admin access to the deployment pipeline
• C. Viewer access to the Development and Test workspaces
• D. Viewer access to the Production workspace
• E. Contributor access to the Development and Test workspaces
• F. Contributor access to the Production workspace

Question-16
You have a Fabric tenant that contains a lakehouse named Lakehouse1. Lakehouse1
contains a Delta table named Customer. When you query Customer, you discover
that the query is slow to execute. You suspect that maintenance was NOT performed
on the table. You need to identify whether maintenance tasks were performed on
Customer.
Solution: You run the following Spark SQL statement:
REFRESH TABLE customer
Does this meet the goal?

A. Yes

B. No

Correct answer: No. The statement to use is DESCRIBE HISTORY.
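
A minimal sketch of checking the table history from a Fabric notebook, assuming the lakehouse that holds the Customer table is attached as the default lakehouse:

# Sketch: list the operations (OPTIMIZE, VACUUM, MERGE, and so on) recorded
# in the Delta transaction log for the Customer table.
history = spark.sql("DESCRIBE HISTORY customer")
display(history.select("version", "timestamp", "operation", "operationParameters"))
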

Question-17
You have a Fabric workspace that contains a DirectQuery semantic model. The
model queries a data source that has 500 million rows. You have a Microsoft Power
BI report named Report1 that uses the model. Report1 contains visuals on multiple
pages. You need to reduce the query execution time for the visuals on all the pages.
What are two features that you can use? Each correct answer presents a complete
solution. NOTE: Each correct answer is worth one point.

A. user-defined aggregations

B. automatic aggregation

C. query caching

D. OneLake integration

Question-18
You have a Fabric tenant that uses a Microsoft Power BI Premium capacity. You need
to enable scale-out for a semantic model. What should you do first?

A. At the semantic model level, set Large dataset storage format to Off.

B. At the tenant level, set Create and use Metrics to Enabled.

C. At the semantic model level, set Large dataset storage format to On.

D. At the tenant level, set Data Activator to Enabled.

Question-19
You have a Fabric tenant that contains a semantic model. The model uses Direct
Lake mode. You suspect that some DAX queries load unnecessary columns into
memory. You need to identify the frequently used columns that are loaded into
memory. What are two ways to achieve the goal? Each correct answer presents a
complete solution. NOTE: Each correct answer is worth one point.

A. Use the Analyze in Excel feature.

B. Use the Vertipaq Analyzer tool.

C. Query the $SYSTEM.DISCOVER_STORAGE_TABLE_COLUMN_SEGMENTS dynamic management view (DMV).

D. Query the DISCOVER_MEMORYGRANT dynamic management view (DMV).

Question-20
You have a Fabric tenant that contains a semantic model. You need to prevent
report creators from populating visuals by using implicit measures. What are two
tools that you can use to achieve the goal? Each correct answer presents a
complete solution. NOTE: Each correct answer is worth one point.

A. Microsoft Power BI Desktop

B. Tabular Editor

C. Microsoft SQL Server Management Studio (SSMS)

D. DAX Studio

Question-21
You have a Fabric tenant that contains a lakehouse named Lakehouse1. Lakehouse1
contains a subfolder named Subfolder1 that contains CSV files. You need to convert
the CSV files into the delta format that has V-Order optimization enabled. What
should you do from Lakehouse explorer?

A. Use the Load to Tables feature.

B. Create a new shortcut in the Files section.

C. Create a new shortcut in the Tables section.

D. Use the Optimize feature.

Question-22
You have a Fabric tenant that contains a lakehouse named lakehouse1. Lakehouse1
contains an unpartitioned table named Table1. You plan to copy data to Table1 and
partition the table based on a date column in the source data. You create a Copy
activity to copy the data to Table1. You need to specify the partition column in the
Destination settings of the Copy activity. What should you do first?

A. From the Destination tab, set Mode to Append.

B. From the Destination tab, select the partition column,

C. From the Source tab, select Enable partition discovery

D. From the Destination tab, set Mode to Overwrite.

Question-23
You have a Fabric tenant that contains a warehouse. Several times a day, the
performance of all warehouse queries degrades. You suspect that Fabric is
throttling the compute used by the warehouse. What should you use to identify
whether throttling is occurring?
A. the Capacity settings

B. the Monitoring hub

C. dynamic management views (DMVs)

D. the Microsoft Fabric Capacity Metrics app

Question-24
You need to create a data loading pattern for a Type 1 slowly changing dimension
(SCD). Which two actions should you include in the process? Each correct answer
presents part of the solution. NOTE: Each correct answer is worth one point.

A. Update rows when the non-key attributes have changed.

B. Insert new rows when the natural key exists in the dimension table, and the non-
key attribute values have changed.

C. Update the effective end date of rows when the non-key attribute values have
changed.

D. Insert new records when the natural key is a new value in the table.

Question-25
You have a Fabric tenant that contains a complex semantic model. The model is
based on a star schema and contains many tables, including a fact table named Sales.
You need to create a diagram of the model. The diagram must contain only the Sales
table and related tables.
What should you use from Microsoft Power BI Desktop?

A. data categories

B. Data view

C. Model view

D. DAX query view

Question-26
You are the administrator of a Fabric workspace that contains a lakehouse named Lakehouse1.
Lakehouse1 contains the following tables:
Table1: A Delta table created by using a shortcut
Table2: An external table created by using Spark
Table3: A managed table -
You plan to connect to Lakehouse1 by using its SQL endpoint.
What will you be able to do after connecting to Lakehouse1?

• A. Read Table3.
• B. Update the data in Table3.
• C. Read Table2.
• D. Update the data in Table1.

Question-27
HOTSPOT (Drag and Drop is not supported)
You have the source data model shown in the following exhibit.

The primary keys of the tables are indicated by a key symbol beside the columns
involved in each key.
You need to create a dimensional data model that will enable the analysis of order
items by date, product, and customer.
What should you include in the solution? To answer, select the appropriate options in
the answer area.
NOTE: Each correct selection is worth one point.

The product ID column, Denormalized into the customer and product entities

Question-28
You have a Fabric tenant that contains a semantic model named Model1. Model1
uses Import mode. Model1 contains a table named Orders. Orders has 100 million
rows and the following fields.

You need to reduce the memory used by Model1 and the time it takes to refresh the
model.
Which two actions should you perform? Each correct answer presents part of the
solution.

NOTE: Each correct answer is worth one point.

A. Split OrderDateTime into separate date and time columns.

B. Replace TotalQuantity with a calculated column.


C. Convert Quantity into the Text data type.

D. Replace TotalSalesAmount with a measure.

Question-29
HOTSPOT -
You have a Fabric workspace named Workspace1 and an Azure Data Lake Storage Gen2
account named storage1. Workspace1 contains a lakehouse named Lakehouse1.
You need to create a shortcut to storage1 in Lakehouse1.
Which connection and endpoint should you specify? To answer, select the appropriate options
in the answer area.
NOTE: Each correct selection is worth one point.

https,dfs

Question-30
HOTSPOT (Drag and Drop is not supported)
You have a Fabric tenant that contains two lakehouses.
You are building a dataflow that will combine data from the lakehouses. The applied
steps from one of the queries in the dataflow is shown in the following exhibit.
Use the drop-down menus to select the answer choice that completes each
statement based on the information presented in the graphic.

NOTE: Each correct selection is worth one point.


Some, Microsoft Power Query engine

Question-31
You have a Fabric tenant that contains a lakehouse named Lakehouse1. Lakehouse1
contains a table named Table1.
You are creating a new data pipeline.
You plan to copy external data to Table1. The schema of the external data changes
regularly.
You need the copy operation to meet the following requirements:

- Replace Table1 with the schema of the external data.
- Replace all the data in Table1 with the rows in the external data.

You add a Copy data activity to the pipeline.

What should you do for the Copy data activity?

A. From the Source tab, add additional columns.

B. From the Destination tab, set Table action to Overwrite.

C. From the Settings tab, select Enable staging.

D. From the Source tab, select Enable partition discovery.

E. From the Source tab, select Recursively.

Question-32
You have a Fabric tenant that contains a lakehouse.
You plan to query sales data files by using the SQL endpoint. The files will be in an
Amazon Simple Storage Service (Amazon S3) storage bucket.
You need to recommend which file format to use and where to create a shortcut.
Which two actions should you include in the recommendation? Each correct answer
presents part of the solution.

NOTE: Each correct answer is worth one point.

A. Create a shortcut in the Files section.

B. Use the Parquet format

C. Use the CSV format.

D. Create a shortcut in the Tables section.

E. Use the delta format.

Question-33
HOTSPOT (Drag and Drop is not supported)
You have a Fabric tenant that contains a lakehouse.
You are using a Fabric notebook to save a large DataFrame by using the following
code.

df.write.partitionBy("year", "month", "day").mode("overwrite").parquet("Files/SalesOrder")

For each of the following statements, select Yes if the statement is true. Otherwise,
select No.
NOTE: Each correct selection is worth one point.

Yes, Yes, No/yes
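
For context, the write above produces a Files/SalesOrder/year=.../month=.../day=... folder hierarchy. A hedged sketch of reading it back with partition pruning (assuming the year and month values are numeric):

# Sketch: filtering on the partition columns lets Spark skip whole folders
# instead of scanning every Parquet file under Files/SalesOrder.
sales = spark.read.parquet("Files/SalesOrder")
january = sales.filter((sales.year == 2024) & (sales.month == 1))
display(january)
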


Question-34
You have a Fabric tenant that contains 30 CSV files in OneLake. The files are updated
daily.
You create a Microsoft Power BI semantic model named Model1 that uses the CSV
files as a data source. You configure incremental refresh for Model1 and publish the
model to a Premium capacity in the Fabric tenant.
When you initiate a refresh of Model1, the refresh fails after running out of resources.
What is a possible cause of the failure?

A. Query folding is occurring.

B. Only refresh complete days is selected.

C. XMLA Endpoint is set to Read Only.

D. Query folding is NOT occurring.

E. The data type of the column used to partition the data has changed.

Question-35
You have a Fabric tenant that contains a warehouse. The warehouse uses row-level
security (RLS).
You create a Direct Lake semantic model that uses the Delta tables and RLS of the
warehouse.
When users interact with a report built from the model, which mode will be used by
the DAX queries?

A. DirectQuery

B. Dual

C. Direct Lake

D. Import

Question-36
HOTSPOT (Drag and Drop is not supported)
You have a Fabric tenant that contains a warehouse named Warehouse1. Warehouse1
contains a fact table named FactSales that has one billion rows.
You run the following T-SQL statement.
CREATE TABLE test.FactSales AS CLONE OF Dbo.FactSales;
For each of the following statements, select Yes if the statement is true. Otherwise,
select No.
NOTE: Each correct selection is worth one point.

Yes,No,Yes

Question-37
You have source data in a folder on a local computer.
You need to create a solution that will use Fabric to populate a data store. The
solution must meet the following requirements:

-Support the use of dataflows to load and append data to the data store.
-Ensure that Delta tables are V-Order optimized and compacted automatically.

Which type of data store should you use?

A. a lakehouse

B. an Azure SQL database

C. a warehouse

D. a KQL database

Question-38
You have a Fabric tenant that contains a data pipeline.
You need to ensure that the pipeline runs every four hours on Mondays and Fridays.
To what should you set Repeat for the schedule?

A. Daily

B. By the minute

C. Weekly

D. Hourly
Question-39
You have a Fabric workspace named Workspace1 that contains a dataflow named
Dataflow1. Dataflow1 contains a query that returns the data shown in the following exhibit.

You need to transform the data columns into attribute-value pairs, where columns
become rows.
You select the VendorID column.
Which transformation should you select from the context menu of the VendorID
column?

A. Group by

B. Unpivot columns

C. Unpivot other columns

D. Split column

E. Remove other columns

Question-40
You are analyzing customer purchases in a Fabric notebook by using PySpark.
You have the following DataFrames:
transactions: Contains five columns named transaction_id, customer_id, product_id, amount,
and date and has 10 million rows, with each row representing a transaction. customers:
Contains customer details in 1,000 rows and three columns named customer_id, name, and
country.
You need to join the DataFrames on the customer_id column. The solution must minimize data
shuffling.
You write the following code.
from pyspark.sql import functions as F
results =
Which code should you run to populate the results DataFrame?

• A. transactions.join(F.broadcast(customers), transactions.customer_id ==
customers.customer_id)
• B. transactions.join(customers, transactions.customer_id ==
customers.customer_id).distinct()
• C. transactions.join(customers, transactions.customer_id == customers.customer_id)
• D. transactions.crossJoin(customers).where(transactions.customer_id ==
customers.customer_id)
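
For reference, option A's broadcast pattern in a runnable form. The small customers DataFrame is shipped to every executor, so the large transactions DataFrame is never shuffled. The sample rows below are illustrative only.

from pyspark.sql import functions as F

# Illustrative stand-ins for the DataFrames described in the question.
transactions = spark.createDataFrame(
    [(1, 101, "p1", 25.0, "2024-01-01"), (2, 102, "p2", 40.0, "2024-01-02")],
    ["transaction_id", "customer_id", "product_id", "amount", "date"],
)
customers = spark.createDataFrame(
    [(101, "Alice", "US"), (102, "Bob", "DE")],
    ["customer_id", "name", "country"],
)

# Broadcast the small DataFrame so the join avoids shuffling the large one.
results = transactions.join(
    F.broadcast(customers),
    transactions.customer_id == customers.customer_id,
)
display(results)
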

Question-41
HOTSPOT (Drag and Drop is not supported)
You have a Fabric workspace that uses the default Spark starter pool and runtime
version 1.2.
You plan to read a CSV file named Sales_raw.csv in a lakehouse, select columns, and
save the data as a Delta table to the managed area of the lakehouse. Sales_raw.csv
contains 12 columns.
You have the following code.

For each of the following statements, select Yes if the statement is true. Otherwise,
select No.
NOTE: Each correct selection is worth one point.
Yes,No,Yes
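
The exhibit is not reproduced here. As a generic illustration of the pattern the question describes (read a CSV from the lakehouse, select a subset of its 12 columns, save a managed Delta table), the code might look like the sketch below; the file path, column names, and table name are assumptions, not the exam's code.

# Generic sketch only: read Sales_raw.csv, keep a few columns, save as a
# managed Delta table in the default lakehouse.
raw = (
    spark.read.option("header", "true")
    .option("inferSchema", "true")
    .csv("Files/Sales_raw.csv")
)
subset = raw.select("SalesOrderNumber", "OrderDate", "CustomerName", "UnitPrice")  # assumed column names
subset.write.mode("overwrite").format("delta").saveAsTable("Sales")
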

Question-42
You have a Fabric tenant that contains a warehouse.
A user discovers that a report that usually takes two minutes to render has been
running for 45 minutes and has still not rendered.
You need to identify what is preventing the report query from completing.
Which dynamic management view (DMV) should you use?

A. sys.dm_exec_requests

B. sys.dm_exec_sessions

C. sys.dm_exec_connections

D. sys.dm_pdw_exec_requests

Question-43
DRAG DROP (Drag and Drop is not supported)
You are creating a data flow in Fabric to ingest data from an Azure SQL database by
using a T-SQL statement.
You need to ensure that any foldable Power Query transformation steps are
processed by the Microsoft SQL Server engine.
How should you complete the code? To answer, drag the appropriate values to the
correct targets. Each value may be used once, more than once, or not at all. You may
need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Value.NativeQuery, EnableFolding = true

Question-44
DRAG DROP (Drag and Drop is not supported)
You have a Fabric tenant that contains a lakehouse named Lakehouse1.
Readings from 100 IoT devices are appended to a Delta table in Lakehouse1. Each set
of readings is approximately 25 KB. Approximately 10 GB of data is received daily.
All the table and SparkSession settings are set to the default.
You discover that queries are slow to execute. In addition, the lakehouse storage
contains data and log files that are no longer used.
You need to remove the files that are no longer used and combine small files into
larger files with a target size of 1 GB per file.
What should you do? To answer, drag the appropriate actions to the correct
requirements. Each action may be used once, more than once, or not at all. You may
need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Run the vacuum command, run the optimize command
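
A minimal sketch of both commands issued from a Fabric notebook. The table name IoTReadings is an assumption; the question does not name the Delta table.

# Sketch: compact small files (OPTIMIZE targets roughly 1 GB files by default),
# then remove data and log files that are no longer referenced.
spark.sql("OPTIMIZE IoTReadings")
spark.sql("VACUUM IoTReadings RETAIN 168 HOURS")  # 168 hours = the default 7-day retention
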

Question-45
You have a Microsoft Power BI semantic model.

You need to identify any surrogate key columns in the model that have the Summarize By
property set to a value other than None. The solution must minimize effort.

What should you use?

• A. DAX Formatter in DAX Studio


• B. Model explorer in Microsoft Power BI Desktop
• C. Model view in Microsoft Power BI Desktop
• D. Best Practice Analyzer in Tabular Editor

Question-46
You have a Fabric warehouse that contains a table named Sales.Products. Sales.Products
contains the following columns.
You need to write a T-SQL query that will return the following columns.

How should you complete the code? To answer, select the appropriate options in the answer
area.

NOTE: Each correct answer is worth one point.

Greatest, Coalesce

Question-47
You have a semantic model named Model1. Model1 contains five tables that all use Import
mode. Model1 contains a dynamic row-level security (RLS) role named HR. The HR role filters
employee data so that HR managers only see the data of the department to which they are
assigned.

You publish Model1 to a Fabric tenant and configure RLS role membership. You share the
model and related reports to users.

An HR manager reports that the data they see in a report is incomplete.

What should you do to validate the data seen by the HR Manager?

• A. Select Test as role to view the data as the HR role.


• B. Filter the data in the report to match the intended logic of the filter for the HR
department.
• C. Select Test as role to view the report as the HR manager.
• D. Ask the HR manager to open the report in Microsoft Power BI Desktop.

Question-49
You are implementing two dimension tables named Customers and Products in a Fabric
warehouse.

You need to use slowly changing dimension (SCD) to manage the versioning of data. The
solution must meet the requirements shown in the following table.

Which type of SCD should you use for each table? To answer, drag the appropriate SCD types to
the correct tables. Each SCD type may be used once, more than once, or not at all. You may
need to drag the split bar between panes or scroll to view content.

NOTE: Each correct selection is worth one point.


Type-2, Type-1

Question-50
You have a Microsoft Power BI semantic model.

You need to identify any surrogate key columns in the model that have the Summarize By
property set to a value other than None. The solution must minimize effort.

What should you use?

• A. DAX Formatter in DAX Studio


• B. Model explorer in Microsoft Power BI Desktop
• C. Model view in Microsoft Power BI Desktop
• D. Best Practice Analyzer in Tabular Editor

Question-51
You have a Microsoft Power BI report named Report1 that uses a Fabric semantic model.

Users discover that Report1 renders slowly.

You open Performance analyzer and identify that a visual named Orders By Date is the slowest
to render. The duration breakdown for Orders By Date is shown in the following table.

What will provide the greatest reduction in the rendering duration of Report1?

• A. Enable automatic page refresh.


• B. Optimize the DAX query of Orders By Date by using DAX Studio.
• C. Change the visual type of Orders By Date.
• D. Reduce the number of visuals in Report1.

Question-52
HOTSPOT -
You have a Microsoft Power BI report and a semantic model that uses Direct Lake mode.
From Power BI Desktop, you open Performance analyzer as shown in the following exhibit.
Use the drop-down menus to select the answer choice that completes each statement based on
the information presented in the graphic.
NOTE: Each correct selection is worth one point.

Automatic, DirectLake
Question-53
HOTSPOT -
You have a Fabric tenant that contains a lakehouse named Lakehouse1. Lakehouse1 contains a
table named Nyctaxi_raw. Nyctaxi_raw contains the following columns:

You create a Fabric notebook and attach it to Lakehouse1.


You need to use PySpark code to transform the data. The solution must meet the following
requirements:
Add a column named pickupDate that will contain only the date portion of pickupDateTime.
Filter the DataFrame to include only rows where fareAmount is a positive number that is less
than 100.
How should you complete the code? To answer, select the appropriate options in the answer
area.
NOTE: Each correct selection is worth one point.

df.withColumn, .cast("date"), .filter("fareAmount > 0 AND fareAmount < 100")
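
Putting the selected options together, a runnable sketch of the transformation (assuming the notebook's default lakehouse exposes the Nyctaxi_raw table):

from pyspark.sql.functions import col

df = spark.read.table("Nyctaxi_raw")
transformed = (
    df.withColumn("pickupDate", col("pickupDateTime").cast("date"))  # date portion only
      .filter("fareAmount > 0 AND fareAmount < 100")                 # positive fares under 100
)
display(transformed)
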


Question-54
Note: This question is part of a series of questions that present the same scenario. Each
question in the series contains a unique solution that might meet the stated goals. Some
question sets might have more than one correct solution, while others might not have a correct
solution.
After you answer a question in this section, you will NOT be able to return to it. As a result,
these questions will not appear in the review screen.
You have a Fabric tenant that contains a new semantic model in OneLake.
You use a Fabric notebook to read the data into a Spark DataFrame.
You need to evaluate the data to calculate the min, max, mean, and standard deviation values
for all the string and numeric columns.
Solution: You use the following PySpark expression:
df.explain()
Does this meet the goal?

• A. Yes
• B. No

Correct answer: df.summary()
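
For reference, a minimal sketch of the working expression, where df is the DataFrame from the scenario. summary() returns count, mean, stddev, min, the 25/50/75% percentiles, and max for the columns.

stats = df.summary()  # count, mean, stddev, min, 25%, 50%, 75%, max
display(stats)
# A narrower call is also possible, for example df.summary("min", "max", "mean", "stddev")
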

Question-55
Note: This question is part of a series of questions that present the same scenario. Each
question in the series contains a unique solution that might meet the stated goals. Some
question sets might have more than one correct solution, while others might not have a correct
solution.
After you answer a question in this section, you will NOT be able to return to it. As a result,
these questions will not appear in the review screen.
You have a Fabric tenant that contains a new semantic model in OneLake.
You use a Fabric notebook to read the data into a Spark DataFrame.
You need to evaluate the data to calculate the min, max, mean, and standard deviation values
for all the string and numeric columns.
Solution: You use the following PySpark expression:
df.show()
Does this meet the goal?

• A. Yes
• B. No

Question-56
Note: This question is part of a series of questions that present the same scenario. Each
question in the series contains a unique solution that might meet the stated goals. Some
question sets might have more than one correct solution, while others might not have a correct
solution.
After you answer a question in this section, you will NOT be able to return to it. As a result,
these questions will not appear in the review screen.
You have a Fabric tenant that contains a new semantic model in OneLake.
You use a Fabric notebook to read the data into a Spark DataFrame.
You need to evaluate the data to calculate the min, max, mean, and standard deviation values
for all the string and numeric columns.
Solution: You use the following PySpark expression:
df.summary()
Does this meet the goal?

• A. Yes
• B. No

Question-57
Note: This question is part of a series of questions that present the same scenario. Each
question in the series contains a unique solution that might meet the stated goals. Some
question sets might have more than one correct solution, while others might not have a correct
solution.
After you answer a question in this section, you will NOT be able to return to it. As a result,
these questions will not appear in the review screen.
You have a Fabric tenant that contains a lakehouse named Lakehouse1. Lakehouse1 contains a
Delta table named Customer.
When you query Customer, you discover that the query is slow to execute. You suspect that
maintenance was NOT performed on the table.
You need to identify whether maintenance tasks were performed on Customer.
Solution: You run the following Spark SQL statement:

DESCRIBE HISTORY customer


Does this meet the goal?

• A. Yes
• B. No

Question-58
Note: This question is part of a series of questions that present the same scenario. Each
question in the series contains a unique solution that might meet the stated goals. Some
question sets might have more than one correct solution, while others might not have a correct
solution.
After you answer a question in this section, you will NOT be able to return to it. As a result,
these questions will not appear in the review screen.
You have a Fabric tenant that contains a lakehouse named Lakehouse1. Lakehouse1 contains a
Delta table named Customer.
When you query Customer, you discover that the query is slow to execute. You suspect that
maintenance was NOT performed on the table.
You need to identify whether maintenance tasks were performed on Customer.
Solution: You run the following Spark SQL statement:
REFRESH TABLE customer
Does this meet the goal?

• A. Yes
• B. No

Question-59
Note: This question is part of a series of questions that present the same scenario. Each
question in the series contains a unique solution that might meet the stated goals. Some
question sets might have more than one correct solution, while others might not have a correct
solution.
After you answer a question in this section, you will NOT be able to return to it. As a result,
these questions will not appear in the review screen.
You have a Fabric tenant that contains a lakehouse named Lakehouse1. Lakehouse1 contains a
Delta table named Customer.
When you query Customer, you discover that the query is slow to execute. You suspect that
maintenance was NOT performed on the table.
You need to identify whether maintenance tasks were performed on Customer.
Solution: You run the following Spark SQL statement:

EXPLAIN TABLE customer


Does this meet the goal?

• A. Yes
• B. No

Question-60
HOTSPOT
-

You have a Fabric tenant.

You need to configure OneLake security for users shown in the following table.

The solution must follow the principle of least privilege.

Which permission should you assign to each user? To answer, select the appropriate options in
the answer area.

NOTE: Each correct selection is worth one point.

ReadAll and ReadData

Question-61

You have an Azure Repos repository named Repo1 and a Fabric-enabled Microsoft Power BI
Premium capacity. The capacity contains two workspaces named Workspace1 and
Workspace2. Git integration is enabled at the workspace level.

You plan to use Microsoft Power BI Desktop and Workspace1 to make version-controlled
changes to a semantic model stored in Repo1. The changes will be built and deployed to
Workspace2 by using Azure Pipelines.

You need to ensure that report and semantic model definitions are saved as individual text files
in a folder hierarchy. The solution must minimize development and maintenance effort.

In which file format should you save the changes?

• A. PBIP
• B. PBIDS
• C. PBIT
• D. PBIX

Question-62
DRAG DROP
-
You are implementing a medallion architecture in a single Fabric workspace.

You have a lakehouse that contains the Bronze and Silver layers and a warehouse that contains
the Gold layer.

You create the items required to populate the layers as shown in the following table.

You need to ensure that the layers are populated daily in sequential order such that Silver is
populated only after Bronze is complete, and Gold is populated only after Silver is complete.
The solution must minimize development effort and complexity.

What should you use to execute each set of items? To answer, drag the appropriate options to
the correct items. Each option may be used once, more than once, or not at all. You may need
to drag the split bar between panes or scroll to view content.

NOTE: Each correct selection is worth one point.

A schedule, A pipeline Copy activity, A pipeline Dataflow activity, A pipeline stored procedure
activity

Question-63
DRAG DROP
-

You are building a solution by using a Fabric notebook.

You have a Spark DataFrame assigned to a variable named df. The DataFrame returns four
columns.

You need to change the data type of a string column named Age to integer. The solution must
return a DataFrame that includes all the columns.

How should you complete the code? To answer, drag the appropriate values to the correct
targets. Each value may be used once, more than once, or not at all. You may need to drag the
split bar between panes or scroll to view content.

NOTE: Each correct selection is worth one point.

withColumn, col, cast
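
Assembled from those three values, a minimal sketch; only the Age column is touched, and the other columns of df are returned unchanged:

from pyspark.sql.functions import col

df = df.withColumn("Age", col("Age").cast("int"))
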

Question-64
HOTSPOT
-

You have an Azure Data Lake Storage Gen2 account named storage1 that contains a Parquet
file named sales.parquet.

You have a Fabric tenant that contains a workspace named Workspace1.

Using a notebook in Workspace1, you need to load the content of the file to the default
lakehouse. The solution must ensure that the content will display automatically as a table
named Sales in Lakehouse explorer.

How should you complete the code? To answer, select the appropriate options in the answer
area.

NOTE: Each correct selection is worth one point.


delta, Sales
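
A hedged sketch of the pattern the answer implies. The ABFS path to storage1 is a placeholder, and saving with saveAsTable registers the data as a managed Delta table named Sales in the default lakehouse.

# Sketch: load sales.parquet from ADLS Gen2 and register it as the Sales table.
path = "abfss://<container>@storage1.dfs.core.windows.net/sales.parquet"  # placeholder path
df = spark.read.parquet(path)
df.write.mode("overwrite").format("delta").saveAsTable("Sales")
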

Question-65
You have a Fabric workspace named Workspace1 that contains a lakehouse named
Lakehouse1.

In Workspace1, you create a data pipeline named Pipeline1.

You have CSV files stored in an Azure Storage account.

You need to add an activity to Pipeline1 that will copy data from the CSV files to Lakehouse1.
The activity must support Power Query M formula language expressions.

Which type of activity should you add?

• A. Dataflow
• B. Notebook
• C. Script
• D. Copy data

Question-66
You have a Fabric tenant that contains a lakehouse named Lakehouse1. Lakehouse1 contains a
Delta table with eight columns.

You receive new data that contains the same eight columns and two additional columns.

You create a Spark DataFrame and assign the DataFrame to a variable named df. The
DataFrame contains the new data.

You need to add the new data to the Delta table to meet the following requirements:

• Keep all the existing rows.


• Ensure that all the new data is added to the table.

How should you complete the code? To answer, select the appropriate options in the answer
area.
NOTE: Each correct selection is worth one point.

append, mergeSchema = true
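
A minimal sketch of the completed write; the table name is a placeholder because the question does not name the Delta table.

# Sketch: append the new rows and let Delta evolve the schema to include
# the two additional columns.
(
    df.write.format("delta")
      .mode("append")
      .option("mergeSchema", "true")
      .saveAsTable("sales")  # placeholder table name
)
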

Question-67
HOTSPOT
-

You have a Fabric warehouse that contains a table named Sales.Orders. Sales.Orders contains
the following columns.

You need to write a T-SQL query that will return the following columns.
How should you complete the code? To answer, select the appropriate options in the answer
area.

NOTE: Each correct selection is worth one point.

COALESCE, LEAST

Question-68
You have a Fabric tenant that contains a lakehouse.

You plan to use a visual query to merge two tables.

You need to ensure that the query returns all the rows in both tables.

Which type of join should you use?


• A. inner
• B. full outer
• C. left outer
• D. right anti
• E. right outer
• F. left anti

Question-69
You have a Fabric tenant that contains a lakehouse named Lakehouse1. Lakehouse1 contains a
Delta table that has one million Parquet files.

You need to remove files that were NOT referenced by the table during the past 30 days. The
solution must ensure that the transaction log remains consistent, and the ACID properties of
the table are maintained.

What should you do?

• A. From OneLake file explorer, delete the files.


• B. Run the OPTIMIZE command and specify the Z-order parameter.
• C. Run the OPTIMIZE command and specify the V-order parameter.
• D. Run the VACUUM command.
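
A sketch of option D with the 30-day window expressed as a retention interval of 720 hours; the table name is a placeholder.

# Sketch: delete files that are no longer referenced by the table and are older
# than the 30-day retention window, keeping the transaction log consistent.
spark.sql("VACUUM sales RETAIN 720 HOURS")
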

Question-70
You have a Fabric workspace named Workspace1 and an Azure SQL database.

You plan to create a dataflow that will read data from the database, and then transform the
data by performing an inner join.

You need to ignore spaces in the values when performing the inner join. The solution must
minimize development effort.

What should you do?

• A. Append the queries by using fuzzy matching.


• B. Merge the queries by using fuzzy matching.
• C. Append the queries by using a lookup table.
• D. Merge the queries by using a lookup table.

Question-71
You have a Fabric tenant that contains a warehouse named Warehouse1. Warehouse1 contains
two schemas named schema1 and schema2 and a table named schema1.city.

You need to make a copy of schema1.city in schema2. The solution must minimize the copying
of data.
Which T-SQL statement should you run?

• A. INSERT INTO schema2.city SELECT * FROM schema1.city;


• B. SELECT * INTO schema2.city FROM schema1.city;
• C. CREATE TABLE schema2.city AS CLONE OF schema1.city;
• D. CREATE TABLE schema2.city AS SELECT * FROM schema1.city;

Question-72
You have a Fabric tenant that contains a lakehouse named Lakehouse1.

You need to prevent new tables added to Lakehouse1 from being added automatically to the
default semantic model of the lakehouse.

What should you configure?

• A. the SQL analytics endpoint settings


• B. the semantic model settings
• C. the workspace settings
• D. the Lakehouse1 settings

Question-73
HOTSPOT
-

You have a Fabric tenant that contains a workspace named Workspace1. Workspace1 contains
a lakehouse named Lakehouse1 and a warehouse named Warehouse1.

You need to create a new table in Warehouse1 named POSCustomers by querying the customer
table in Lakehouse1.

How should you complete the T-SQL statement? To answer, select the appropriate options in
the answer area.

NOTE: Each correct selection is worth one point.


3,3

Question-74
You have a Fabric tenant.

You are creating an Azure Data Factory pipeline.

You have a stored procedure that returns the number of active customers and their average
sales for the current month.

You need to add an activity that will execute the stored procedure in a warehouse. The returned
values must be available to the downstream activities of the pipeline.

Which type of activity should you add?

• A. Switch
• B. Copy data
• C. Append variable
• D. Lookup

Question-75
You have a Fabric tenant that contains two workspaces named Workspace1 and Workspace2.
Workspace1 contains a lakehouse named Lakehouse1. Workspace2 contains a lakehouse
named Lakehouse2. Lakehouse1 contains a table named dbo.Sales. Lakehouse2 contains a
table named dbo.Customers.

You need to ensure that you can write queries that reference both dbo.Sales and
dbo.Customers in the same SQL query without making additional copies of the tables.

What should you use?

• A. a shortcut
• B. a dataflow
• C. a view
• D. a managed table

Question-76
You have a Fabric tenant that contains a warehouse.

You are designing a star schema model that will contain a customer dimension. The customer
dimension table will be a Type 2 slowly changing dimension (SCD).

You need to recommend which columns to add to the table. The columns must NOT already
exist in the source.

Which three types of columns should you recommend? Each correct answer presents part of
the solution.

NOTE: Each correct answer is worth one point.

• A. a foreign key
• B. a natural key
• C. an effective end date and time
• D. a surrogate key
• E. an effective start date and time

Question-77
You have a Fabric tenant.

You plan to create a data pipeline named Pipeline1. Pipeline1 will include two activities that will
execute in sequence.

You need to ensure that a failure of the first activity will NOT block the second activity.

Which conditional path should you configure between the first activity and the second activity?

• A. Upon Failure
• B. Upon Completion
• C. Upon Skip
• D. Upon Skip

Question-78
DRAG DROP
-

You have a Fabric tenant that contains a Microsoft Power BI report named Report1.

Report1 is slow to render. You suspect that an inefficient DAX query is being executed.

You need to identify the slowest DAX query, and then review how long the query spends in the
formula engine as compared to the storage engine.

Which five actions should you perform in sequence? To answer, move the appropriate actions
from the list of actions to the answer area and arrange them in the correct order.

Question-79
You have a Fabric tenant that contains a semantic model. The model contains 15 tables.

You need to programmatically change each column that ends in the word Key to meet the
following requirements:

• Hide the column.


• Set Nullable to False
• Set Summarize By to None.
• Set Available in MDX to False.
• Mark the column as a key column.

What should you use?


• A. Microsoft Power BI Desktop
• B. ALM Toolkit
• C. Tabular Editor
• D. DAX Studio

Question-80
HOTSPOT
-

You have a Microsoft Power BI semantic model.

You plan to implement calculation groups.

You need to create a calculation item that will change the context from the selected date to
month-to-date (MTD).

How should you complete the DAX expression? To answer, select the appropriate options in the
answer area.

NOTE: Each correct selection is worth one point.

CALCULATE, SELECTEDMEASURE

Question-81
You have a custom Direct Lake semantic model named Model1 that has one billion rows of
data.
You use Tabular Editor to connect to Model1 by using the XMLA endpoint.
You need to ensure that when users interact with reports based on Model1, their queries always
use Direct Lake mode.

What should you do?

• A. From Model, configure the Default Mode option.


• B. From Partitions, configure the Mode option.
• C. From Model, configure the Storage Location option.
• D. From Model, configure the Direct Lake Behavior option.

Question-82
DRAG DROP
-

You create a semantic model by using Microsoft Power BI Desktop. The model contains one
security role named SalesRegionManager and the following tables:

• Sales
• SalesRegion
• SalesAddress

You need to modify the model to ensure that users assigned the SalesRegionManager role
cannot see a column named Address in SalesAddress.

Which three actions should you perform in sequence? To answer, move the appropriate actions
from the list of actions to the answer area and arrange them in the correct order.

2,3,6

Question-83
You have a Microsoft Power BI semantic model that contains measures. The measures use
multiple CALCULATE functions and a FILTER function.

You are evaluating the performance of the measures.


In which use case will replacing the FILTER function with the KEEPFILTERS function reduce
execution time?

• A. when the FILTER function uses a nested calculate function


• B. when the FILTER function references a measure
• C. when the FILTER function references columns from multiple tables
• D. when the FILTER function references a column from a single table that uses Import
mode

Question-84
You have a Microsoft Fabric tenant that contains a dataflow.

You are exploring a new semantic model.

From Power Query, you need to view column information as shown in the following exhibit.

Which three Data view options should you select? Each correct answer presents part of the
solution.

• A. Show column value distribution


• B. Enable details pane
• C. Enable column profile
• D. Show column quality details
• E. Show column profile in details pane

ACD

Question-85
You have a Fabric notebook that has the Python code and output shown in the following
exhibit.
Which type of analytics are you performing?

• A. descriptive
• B. diagnostic
• C. prescriptive
• D. predictive

Question-86
HOTSPOT
-

You have a Fabric warehouse that contains a table named Sales.Orders. Sales.Orders contains
the following columns.
You need to write a T-SQL query that will return the following columns.

How should you complete the code? To answer, select the appropriate options in the answer
area.

NOTE: Each correct answer is worth one point.

DATETRUNC, weekday
Question-87
You have a Fabric tenant that contains JSON files in OneLake. The files have one billion items.

You plan to perform time series analysis of the items.

You need to transform the data, visualize the data to find insights, perform anomaly detection,
and share the insights with other business users. The solution must meet the following
requirements:

• Use parallel processing.


• Minimize the duplication of data.
• Minimize how long it takes to load the data.

What should you use to transform and visualize the data?

• A. the PySpark library in a Fabric notebook


• B. the pandas library in a Fabric notebook
• C. a Microsoft Power BI report that uses core visuals
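
For context, the parallel PySpark route described in option A would begin along these lines; the path and column name are placeholders for illustration.

# Sketch: read the JSON items in parallel with Spark and chart the result
# directly in the notebook.
items = spark.read.json("Files/items/*.json")      # placeholder path
daily = items.groupBy("eventDate").count()         # assumed column name
display(daily)                                     # built-in chart view
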

Question-88
You have a Fabric tenant that contains customer churn data stored as Parquet files in OneLake.
The data contains details about customer demographics and product usage.

You create a Fabric notebook to read the data into a Spark DataFrame. You then create column
charts in the notebook that show the distribution of retained customers as compared to lost
customers based on geography, the number of products purchased, age, and customer tenure.

Which type of analytics are you performing?

• A. diagnostic
• B. descriptive
• C. prescriptive
• D. predictive

Question-89
HOTSPOT
-

You have a Fabric tenant that contains a semantic model. The model contains data about retail
stores.

You need to write a DAX query that will be executed by using the XMLA endpoint. The query
must return the total amount of sales from the same period last year.

How should you complete the DAX expression? To answer, select the appropriate options in the
answer area.

NOTE: Each correct selection is worth one point.

CALCULATE, _LYSales

Question-90
You have a Fabric workspace named Workspace1 that contains a dataflow named Dataflow1.
Dataflow1 returns 500 rows of data.

You need to identify the min and max values for each column in the query results.

Which three Data view options should you select? Each correct answer presents part of the
solution.

NOTE: Each correct answer is worth one point.

• A. Show column value distribution


• B. Enable column profile
• C. Show column profile in details pane
• D. Show column quality details
• E. Enable details pane

Question-91
Note: This question is part of a series of questions that present the same scenario. Each
question in the series contains a unique solution that might meet the stated goals. Some
question sets might have more than one correct solution, while others might not have a correct
solution.
After you answer a question in this section, you will NOT be able to return to it. As a result,
these questions will not appear in the review screen.

You have a Fabric tenant that contains a semantic model named Model1.

You discover that the following query performs slowly against Model1.

You need to reduce the execution time of the query.

Solution: You replace line 4 by using the following code:

NOT ISEMPTY ( CALCULATETABLE ( 'Order Item ' ) )

Does this meet the goal?

• A. Yes
• B. No

Question-92
Note: This question is part of a series of questions that present the same scenario. Each
question in the series contains a unique solution that might meet the stated goals. Some
question sets might have more than one correct solution, while others might not have a correct
solution.

After you answer a question in this section, you will NOT be able to return to it. As a result,
these questions will not appear in the review screen.

You have a Fabric tenant that contains a semantic model named Model1.

You discover that the following query performs slowly against Model1.

You need to reduce the execution time of the query.


Solution: You replace line 4 by using the following code:

ISEMPTY ( RELATEDTABLE ( 'Order Item' ) )

Does this meet the goal?

• A. Yes
• B. No

Question-93
Note: This question is part of a series of questions that present the same scenario. Each
question in the series contains a unique solution that might meet the stated goals. Some
question sets might have more than one correct solution, while others might not have a correct
solution.

After you answer a question in this section, you will NOT be able to return to it. As a result,
these questions will not appear in the review screen.

You have a Fabric tenant that contains a new semantic model in OneLake.

You use a Fabric notebook to read the data into a Spark DataFrame.

You need to evaluate the data to calculate the min, max, mean, and standard deviation values
for all the string and numeric columns.

Solution: You use the following PySpark expression:

df.explain().show()

Does this meet the goal?

• A. Yes
• B. No

Question-94
Note: This question is part of a series of questions that present the same scenario. Each
question in the series contains a unique solution that might meet the stated goals. Some
question sets might have more than one correct solution, while others might not have a correct
solution.

After you answer a question in this section, you will NOT be able to return to it. As a result,
these questions will not appear in the review screen.

You have a Fabric tenant that contains a lakehouse named Lakehouse1. Lakehouse1 contains a
Delta table named Customer.
When you query Customer, you discover that the query is slow to execute. You suspect that
maintenance was NOT performed on the table.

You need to identify whether maintenance tasks were performed on Customer.

Solution: You run the following Spark SQL statement:

DESCRIBE DETAIL customer

Does this meet the goal?

• A. Yes
• B. No

Question-95
You have a Fabric tenant that contains a Microsoft Power BI report.

You are exploring a new semantic model.

You need to display the following column statistics:

• Count
• Average
• Null count
• Distinct count
• Standard deviation

Which Power Query function should you run?

• A. Table.schema
• B. Table.view
• C. Table.FuzzyGroup
• D. Table.Profile

Question-96
