KPMG Recent Power BI Interview Q&A
KPMG Recent Power BI Interview Q&A
Direct Query impacts report performance as it depends on the speed of the underlying data source; every user interaction triggers a query to the database. This can lead to slower performance if the data source is slow. Additionally, not all DAX functions are supported in Direct Query mode, and there are limits on model size and complexity compared to Import mode, affecting functionality and performance .
A surrogate key is an artificial key used to uniquely identify each row in a dimension table, crucial for managing Slowly Changing Dimensions (SCD). It allows for seamless tracking of changes and maintenance of historical data without relying on natural keys, which may not be stable over time. This helps in efficiently managing updates and ensuring data integrity in historical records .
The Related function is used to retrieve data from a related table in a one-to-many relationship, relying on existing relationships within the data model. In contrast, Lookupvalue allows data retrieval without the need for pre-existing relationships by specifying search conditions. The use of Related is typically more efficient, but Lookupvalue offers flexibility in scenarios without formal relationships, which can be more demanding in terms of performance .
Incremental Refresh is particularly beneficial for large datasets where refreshing the entire data set would be time-consuming and resource-intensive. It allows only new or updated records to be refreshed, saving processing time and enhancing performance. This approach is crucial for environments with large historical data that frequently updates .
The Decomposition Tree visual in Power BI is used for root cause analysis by breaking down a measure into contributing factors, such as region, product, or time. Its hierarchical approach allows users to visually explore how different factors contribute to a particular outcome, offering a clear understanding of contributing elements and making it easier to identify trends and issues within the data .
A Factless Fact table is a fact table that does not contain any metric or measure. It is primarily used to track events or record transactions that do not have any numerical data to aggregate, such as student attendance. This type of table is useful for recording occurrences of events over time without needing quantitative attributes .
Many-to-many relationships in Power BI models can lead to ambiguity errors and inaccuracies in data analysis because they introduce complex filtering scenarios. Power BI uses bidirectional cross-filtering to manage these relationships, but this requires careful handling to prevent performance bottlenecks and logical errors in the model. Proper model design and control of cross-filtering directions are essential to avoid such issues .
The necessary condition for appending queries in Power BI is that the tables must have the same structure, meaning they should possess the same number of columns and corresponding data types. Failing to meet these conditions can result in mismatched data and incomplete data integration, leading to inaccurate reporting outcomes .
To optimize Power BI performance with large datasets, strategies include using Import mode, applying Incremental Refresh, reducing data at the source via SQL queries, and employing filters at the data load level. Additionally, optimizing DAX queries, implementing aggregated views, and minimizing the use of calculated columns can enhance performance. Key considerations involve maintaining a balance between data granularity and performance efficiency .
Slowly Changing Dimensions (SCD) can be managed in Power BI using Type 2 SCD, which involves adding a new row to the table whenever changes occur in a dimension, allowing users to maintain historical data. This is useful for tracking changes like customer addresses over time, but it can lead to increased complexity in the data model and require more storage space .