Home
Snowflake
DEA-C01 Exam Info
DEA-C01 Exam Questions

Curious about Actual Snowflake SnowPro Certification (DEA-C01) Exam Questions?

Here are sample Snowflake SnowPro Advanced: Data Engineer Certification (DEA-C01) Exam questions from real exam. You can get more Snowflake SnowPro Certification (DEA-C01) Exam premium practice questions at TestInsights.

Page: 1 /
Total 65 questions

Want more questions? Get Premium Access.

Question 1

A company built a sales reporting system with Python, connecting to Snowflake using the Python Connector. Based on the user's selections, the system generates the SQL queries needed to fetch the data for the report First it gets the customers that meet the given query parameters (on average 1000 customer records for each report run) and then it loops the customer records sequentially Inside that loop it runs the generated SQL clause for the current customer to get the detailed data for that customer number from the sales data table

When the Data Engineer tested the individual SQL clauses they were fast enough (1 second to get the customers 0 5 second to get the sales data for one customer) but the total runtime of the report is too long

How can this situation be improved?

AIncrease the size of the virtual warehouse

BIncrease the number of maximum clusters of the virtual warehouse

CDefine a clustering key for the sales data table

DRewrite the report to eliminate the use of the loop construct

Correct : D

This option is the best way to improve the situation, as using a loop construct to run SQL queries for each customer is very inefficient and slow. Instead, the report should be rewritten to use a single SQL query that joins the customer and sales data tables and applies the query parameters as filters. This way, the report can leverage Snowflake's parallel processing and optimization capabilities and reduce the network overhead and latency.

Options Selected by Other Users:

Mark Question:

Start a Discussions

Submit Your Answer:

AIncrease the size of the virtual warehouse

BIncrease the number of maximum clusters of the virtual warehouse

CDefine a clustering key for the sales data table

DRewrite the report to eliminate the use of the loop construct

0 / 1500

Question 2

A Data Engineer is evaluating the performance of a query in a development environment.

DEA-C01 Exam Question 2 Exhibit 1

Based on the Query Profile what are some performance tuning options the Engineer can use? (Select TWO)

AAdd a LIMIT to the ORDER BY If possible

BUse a multi-cluster virtual warehouse with the scaling policy set to standard

CMove the query to a larger virtual warehouse

DCreate indexes to ensure sorted access to data

EIncrease the max cluster count

Correct : A, C

The performance tuning options that the Engineer can use based on the Query Profile are:

Add a LIMIT to the ORDER BY If possible: This option will improve performance by reducing the amount of data that needs to be sorted and returned by the query. The ORDER BY clause requires sorting all rows in the input before returning them, which can be expensive and time-consuming. By adding a LIMIT clause, the query can return only a subset of rows that satisfy the order criteria, which can reduce sorting time and network transfer time.

Create indexes to ensure sorted access to data: This option will improve performance by reducing the amount of data that needs to be scanned and filtered by the query. The query contains several predicates on different columns, such as o_orderdate, o_orderpriority, l_shipmode, etc. By creating indexes on these columns, the query can leverage sorted access to data and prune unnecessary micro-partitions or rows that do not match the predicates. This can reduce IO time and processing time.

The other options are not optimal because:

Use a multi-cluster virtual warehouse with the scaling policy set to standard: This option will not improve performance, as the query is already using a multi-cluster virtual warehouse with the scaling policy set to standard. The Query Profile shows that the query is using a 2XL warehouse with 4 clusters and a standard scaling policy, which means that the warehouse can automatically scale up or down based on the load. Changing the warehouse size or the number of clusters will not affect the performance of this query, as it is already using the optimal resources.

Increase the max cluster count: This option will not improve performance, as the query is not limited by the max cluster count. The max cluster count is a parameter that specifies the maximum number of clusters that a multi-cluster virtual warehouse can scale up to. The Query Profile shows that the query is using a 2XL warehouse with 4 clusters and a standard scaling policy, which means that the warehouse can automatically scale up or down based on the load. The default max cluster count for a 2XL warehouse is 10, which means that the warehouse can scale up to 10 clusters if needed. However, the query does not need more than 4 clusters, as it is not CPU-bound or memory-bound. Increasing the max cluster count will not affect the performance of this query, as it will not use more clusters than necessary.

Options Selected by Other Users:

Mark Question:

Start a Discussions

Submit Your Answer:

AAdd a LIMIT to the ORDER BY If possible

BUse a multi-cluster virtual warehouse with the scaling policy set to standard

CMove the query to a larger virtual warehouse

DCreate indexes to ensure sorted access to data

EIncrease the max cluster count

0 / 1500

Question 3

When would a Data engineer use table with the flatten function instead of the lateral flatten combination?

AWhen TABLE with FLATTEN requires another source in the from clause to refer to

BWhen TABLE with FLATTEN requires no additional source m the from clause to refer to

CWhen the LATERAL FLATTEN combination requires no other source m the from clause to refer to

DWhen table with FLATTEN is acting like a sub-query executed for each returned row

Correct : A

The TABLE function with the FLATTEN function is used to flatten semi-structured data, such as JSON or XML, into a relational format. The TABLE function returns a table expression that can be used in the FROM clause of a query. The TABLE function with the FLATTEN function requires another source in the FROM clause to refer to, such as a table, view, or subquery that contains the semi-structured data. For example:

SELECT t.value:city::string AS city, f.value AS population FROM cities t, TABLE(FLATTEN(input => t.value:population)) f;

In this example, the TABLE function with the FLATTEN function refers to the cities table in the FROM clause, which contains JSON data in a variant column named value. The FLATTEN function flattens the population array within each JSON object and returns a table expression with two columns: key and value. The query then selects the city and population values from the table expression.

Options Selected by Other Users:

Mark Question:

Start a Discussions

Submit Your Answer:

AWhen TABLE with FLATTEN requires another source in the from clause to refer to

BWhen TABLE with FLATTEN requires no additional source m the from clause to refer to

CWhen the LATERAL FLATTEN combination requires no other source m the from clause to refer to

DWhen table with FLATTEN is acting like a sub-query executed for each returned row

0 / 1500

Question 4

Within a Snowflake account permissions have been defined with custom roles and role hierarchies.

To set up column-level masking using a role in the hierarchy of the current user, what command would be used?

ACORRECT_ROLE

BIKVOKER_ROLE

CIS_RCLE_IN_SESSION

DIS_GRANTED_TO_INVOKER_ROLE

Correct : C

The IS_ROLE_IN_SESSION function is used to set up column-level masking using a role in the hierarchy of the current user. Column-level masking is a feature in Snowflake that allows users to apply dynamic data masking policies to specific columns based on the roles of the users who access them. The IS_ROLE_IN_SESSION function takes a role name as an argument and returns true if the role is in the current user's session, or false otherwise. The function can be used in a masking policy expression to determine whether to mask or unmask a column value based on the role of the user. For example:

CREATE OR REPLACE MASKING POLICY email_mask AS (val string) RETURNS string -> CASE WHEN IS_ROLE_IN_SESSION('HR') THEN val ELSE REGEXP_REPLACE(val, '(.).(.@.)', '\1****\2') END;

In this example, the IS_ROLE_IN_SESSION function is used to create a masking policy for an email column. The masking policy returns the original email value if the user has the HR role in their session, or returns a masked email value with asterisks if not.

Options Selected by Other Users:

Mark Question:

Start a Discussions

Submit Your Answer:

ACORRECT_ROLE

BIKVOKER_ROLE

CIS_RCLE_IN_SESSION

DIS_GRANTED_TO_INVOKER_ROLE

0 / 1500

Question 5

A Data Engineer wants to create a new development database (DEV) as a clone of the permanent production database (PROD) There is a requirement to disable Fail-safe for all tables.

Which command will meet these requirements?

ACREATE DATABASE DEV
CLONE PROD
FAIL_SAFE=FALSE;

BCREATE DATABASE DEV
CLONE PROD;

CCREATE TRANSIENT DATABASE DEV
CLONE RPOD

DCREATE DATABASE DEV
CLOSE PROD
DATA_RETENTION_TIME_IN_DAYS =0L

Correct : C

This option will meet the requirements of creating a new development database (DEV) as a clone of the permanent production database (PROD) and disabling Fail-safe for all tables. By using the CREATE TRANSIENT DATABASE command, the Data Engineer can create a transient database that does not have Fail-safe enabled by default. Fail-safe is a feature in Snowflake that provides additional protection against data loss by retaining historical data for seven days beyond the time travel retention period. Transient databases do not have Fail-safe enabled, which means that they do not incur additional storage costs for historical data beyond their time travel retention period. By using the CLONE option, the Data Engineer can create an exact copy of the PROD database, including its schemas, tables, views, and other objects.

Options Selected by Other Users:

Mark Question:

Start a Discussions

Submit Your Answer:

ACREATE DATABASE DEV
CLONE PROD
FAIL_SAFE=FALSE;

BCREATE DATABASE DEV
CLONE PROD;

CCREATE TRANSIENT DATABASE DEV
CLONE RPOD

DCREATE DATABASE DEV
CLOSE PROD
DATA_RETENTION_TIME_IN_DAYS =0L

0 / 1500

Page: 1 / 13
Total 65 questions

Unlock Full
DEA-C01 Exam Features

In Just $49 You can Access

All Official Question Types
Interactive Web-Based Practice Test Software
No Installation or 3rd Party Software Required
Customize your practice sessions (Free Demo)
24/7 Customer Support

Get Full Access Now

Marked Questions
DEA-C01 Exam

DEA-C01 Exam Question 1
DEA-C01 Exam Question 2
DEA-C01 Exam Question 3
DEA-C01 Exam Question 4
DEA-C01 Exam Question 5

Download PDF File Demo

Try Web-Based Exam Practice Software Demo

Commenting

In order to participate in the comments you need to be logged-in.
You can sign-up or login