Which are the following additional Metadata columns Stream contains that could be used for creating Efficient Data science Pipelines and helps in transforming only the New/Modified data only? Choose 3.
A. METADATA$ACTION
B. METADATA$FILE_ID
C. METADATA$ISUPDATE
D. METADATA$DELETE
E. METADATA$ROW_ID
Which ones are the known limitations of using External function? Choose all apply.
A. Currently, external functions cannot be shared with data consumers via Secure Data Sharing.
B. Currently, external functions must be scalar functions. A scalar external function re-turns a single value for each input row.
C. External functions have more overhead than internal functions (both built-in functions and internal UDFs) and usually execute more slowly
D. An external function accessed through an AWS API Gateway private endpoint can be accessed only from a Snowflake VPC (Virtual Private Cloud) on AWS and in the same AWS region.
Which of the following Snowflake parameter can be used to Automatically Suspend Tasks which are running Data science pipelines after specified Failed Runs?
A. SUSPEND_TASK
B. SUSPEND_TASK_AUTO_NUM_FAILURES
C. SUSPEND_TASK_AFTER_NUM_FAILURES
D. There is none as such available.
Mark the incorrect statement regarding Python UDF?
A. Python UDFs can contain both new code and calls to existing packages
B. For each row passed to a UDF, the UDF returns either a scalar (i.e. single) value or, if defined as a table function, a set of rows.
C. A UDF also gives you a way to encapsulate functionality so that you can call it repeatedly from multiple places in code
D. A scalar function (UDF) returns a tabular value for each input row
Which of the following metrics are used to evaluate classification models?
A. Area under the ROC curve
B. F1 score
C. Confusion matrix
D. All of the above
A Data Scientist as data providers require to allow consumers to access all databases and database objects in a share by granting a single privilege on shared databases. Which one is incorrect SnowSQL command used by her while doing this task?
Assuming:
A database named product_db exists with a schema named product_agg and a table named Item_agg.
The database, schema, and table will be shared with two accounts named xy12345 and yz23456.
1.USE ROLE accountadmin;
2.CREATE DIRECT SHARE product_s;
3.GRANT USAGE ON DATABASE product_db TO SHARE product_s;
4.GRANT USAGE ON SCHEMA product_db. product_agg TO SHARE product_s;
5.GRANT SELECT ON TABLE sales_db. product_agg.Item_agg TO SHARE product_s; 6.SHOW GRANTS TO SHARE product_s;
7.ALTER SHARE product_s ADD ACCOUNTS=xy12345, yz23456;
8.SHOW GRANTS OF SHARE product_s;
A. GRANT USAGE ON DATABASE product_db TO SHARE product_s;
B. CREATE DIRECT SHARE product_s;
C. GRANT SELECT ON TABLE sales_db. product_agg.Item_agg TO SHARE product_s;
D. ALTER SHARE product_s ADD ACCOUNTS=xy12345, yz23456;
All Snowpark ML modeling and preprocessing classes are in the ________ namespace?
A. snowpark.ml.modeling
B. snowflake.sklearn.modeling
C. snowflake.scikit.modeling
D. snowflake.ml.modeling
Which type of Python UDFs let you define Python functions that receive batches of input rows as Pandas DataFrames and return batches of results as Pandas arrays or Series?
A. MPP Python UDFs
B. Scaler Python UDFs
C. Vectorized Python UDFs
D. Hybrid Python UDFs
There are a couple of different types of classification tasks in machine learning, Choose the Correct Classification which best categorized the below Application Tasks in Machine learning?
To detect whether email is spam or not
To determine whether or not a patient has a certain disease in medicine.
To determine whether or not quality specifications were met when it comes to QA (Quality Assurance).
A. Multi-Label Classification
B. Multi-Class Classification
C. Binary Classification
D. Logistic Regression
Consider a data frame df with 10 rows and index [ 'r1', 'r2', 'r3', 'row4', 'row5', 'row6', 'r7', 'r8', 'r9', 'row10']. What does the aggregate method shown in below code do?
g = df.groupby(df.index.str.len())
A. aggregate({'A':len, 'B':np.sum})
B. Computes Sum of column A values
C. Computes length of column A
D. Computes length of column A and Sum of Column B values of each group
E. Computes length of column A and Sum of Column B values
Nowadays, the certification exams become more and more important and required by more and more enterprises when applying for a job. But how to prepare for the exam effectively? How to prepare for the exam in a short time with less efforts? How to get a ideal result and how to find the most reliable resources? Here on Vcedump.com, you will find all the answers. Vcedump.com provide not only Snowflake exam questions, answers and explanations but also complete assistance on your exam preparation and certification application. If you are confused on your DSA-C02 exam preparations and Snowflake certification application, do not hesitate to visit our Vcedump.com to find your solutions here.