Curious about Actual Databricks Certified Machine Learning Associate Exam Questions?

Here are sample Databricks Certified Machine Learning Associate (Databricks-Machine-Learning-Associate) exam questions from the real exam. More Databricks Machine Learning Associate practice questions are available at TestInsights.

Total 74 questions

Question 1

A machine learning engineer has been notified that a new Staging version of a model registered to the MLflow Model Registry has passed all tests. As a result, the machine learning engineer wants to put this model into production by transitioning it to the Production stage in the Model Registry.

From which of the following pages in Databricks Machine Learning can the machine learning engineer accomplish this task?

A. The home page of the MLflow Model Registry

B. The experiment page in the Experiments observatory

C. The model version page in the MLflow Model Registry

D. The model page in the MLflow Model Registry

Correct: C

The machine learning engineer can transition a model version to the Production stage in the Model Registry from the model version page. This page provides detailed information about a specific version of a model, including its metrics, parameters, and current stage. From here, the engineer can perform stage transitions, moving the model from Staging to Production after it has passed all necessary tests.
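
The same transition can also be made programmatically. As a minimal sketch (not part of the exam question), the helper below assumes a client object that behaves like `mlflow.tracking.MlflowClient` and calls its `transition_model_version_stage` method. The helper name `promote_to_production` is hypothetical. Newer MLflow releases deprecate registry stages in favor of model version aliases, so treat this as an illustration of the stage-based workflow described above.

```python
def promote_to_production(client, model_name, version):
    """Move one registered model version to the Production stage.

    `client` is assumed to behave like mlflow.tracking.MlflowClient;
    `model_name` and `version` identify the registered model version.
    """
    return client.transition_model_version_stage(
        name=model_name,
        version=version,
        stage="Production",
    )
```

With MLflow installed and a registry configured, this might be called as `promote_to_production(MlflowClient(), "my_model", 2)` after `from mlflow.tracking import MlflowClient`. The model name and version number here are placeholders.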

Reference

Databricks documentation on MLflow Model Registry: https://docs.databricks.com/applications/mlflow/model-registry.html#model-version

Question 2

A machine learning engineer has identified the best run from an MLflow Experiment. They have stored the run ID in the run_id variable and identified the logged model name as "model". They now want to register that model in the MLflow Model Registry with the name "best_model".

Which lines of code can they use to register the model associated with run_id to the MLflow Model Registry?

A. mlflow.register_model(run_id, 'best_model')

B. mlflow.register_model(f'runs:/{run_id}/model', 'best_model')

C. mlflow.register_model(f'runs:/{run_id}/model')

D. mlflow.register_model(f'runs:/{run_id}/best_model', 'model')

Correct: B

To register a model that has been identified by a specific run_id in the MLflow Model Registry, the appropriate line of code is:

mlflow.register_model(f'runs:/{run_id}/model', 'best_model')

This code correctly specifies the path to the model within the run (runs:/{run_id}/model) and registers it under the name 'best_model' in the Model Registry. This allows the model to be tracked, managed, and transitioned through different stages (e.g., Staging, Production) within the MLflow ecosystem.
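
To make the two arguments easier to tell apart, here is a small sketch that builds the `runs:/` URI separately from the registered name. The helper names `model_uri_for_run` and `register_best_model` are hypothetical. `mlflow` is imported inside the function so the URI helper can be tried without MLflow installed, and the registration call itself assumes a reachable tracking server and registry.

```python
def model_uri_for_run(run_id, artifact_path="model"):
    """Build the URI of a model logged under `artifact_path` in a run."""
    return f"runs:/{run_id}/{artifact_path}"


def register_best_model(run_id, registered_name="best_model"):
    """Register the run's logged model under `registered_name`."""
    import mlflow  # imported lazily; requires MLflow to be installed

    # First argument: where the model lives. Second: its registry name.
    return mlflow.register_model(model_uri_for_run(run_id), registered_name)


# "abc123" is a placeholder run ID used only to show the URI shape.
print(model_uri_for_run("abc123"))  # prints runs:/abc123/model
```

Option D fails because it swaps these roles: it points at an artifact path named "best_model" and registers the model under the name "model".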

Reference

MLflow documentation on model registry: https://www.mlflow.org/docs/latest/model-registry.html#registering-a-model

Question 3

A new data scientist has started working on an existing machine learning project. The project is a scheduled Job that retrains every day. The project currently exists in a Repo in Databricks. The data scientist has been tasked with improving the feature engineering of the pipeline's preprocessing stage. The data scientist wants to make necessary updates to the code that can be easily adopted into the project without changing what is being run each day.

Which approach should the data scientist take to complete this task?

A. They can create a new branch in Databricks, commit their changes, and push those changes to the Git provider.

B. They can clone the notebooks in the repository into a Databricks Workspace folder and make the necessary changes.

C. They can create a new Git repository, import it into Databricks, and copy and paste the existing code from the original repository before making changes.

D. They can clone the notebooks in the repository into a new Databricks Repo and make the necessary changes.

Correct: A

The best approach for the data scientist to take in this scenario is to create a new branch in Databricks, commit their changes, and push those changes to the Git provider. This approach allows the data scientist to make updates and improvements to the feature engineering part of the preprocessing pipeline without affecting the main codebase that runs daily. By creating a new branch, they can work on their changes in isolation. Once the changes are ready and tested, they can be merged back into the main branch through a pull request, ensuring a smooth integration process and allowing for code review and collaboration with other team members.

Reference

Databricks documentation on Git integration: Databricks Repos

Question 4

A machine learning engineering team has a Job with three successive tasks. Each task runs a single notebook. The team has been alerted that the Job has failed in its latest run.

Which of the following approaches can the team use to identify which task is the cause of the failure?

A. Run each notebook interactively

B. Review the matrix view in the Job's runs

C. Migrate the Job to a Delta Live Tables pipeline

D. Change each Task's setting to use a dedicated cluster

Correct: B

To identify which task is causing the failure in the job, the team should review the matrix view in the Job's runs. The matrix view provides a clear and detailed overview of each task's status, allowing the team to quickly identify which task failed. This approach is more efficient than running each notebook interactively, as it provides immediate insights into the job's execution flow and any issues that occurred during the run.

Reference

Databricks documentation on Jobs: Jobs in Databricks

Question 5

Which of the following machine learning algorithms typically uses bagging?

A. Gradient boosted trees

B. K-means

C. Random forest

D. Decision tree

Correct: C

Random forest is a machine learning algorithm that typically uses bagging (bootstrap aggregating). Bagging trains multiple base models (such as decision trees) on different bootstrap samples of the data, each created by randomly sampling with replacement from the original dataset, and then combines their predictions to improve overall model performance. A random forest builds many decision trees this way and aggregates their outputs (a majority vote for classification, an average for regression) to get a more accurate and stable prediction.
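
The bagging idea can be sketched in plain Python. This is an illustrative toy, not how Spark ML or any library implements random forests: the base learner is a one-feature threshold classifier (a depth-1 "tree"), the dataset is made up, and all function names are hypothetical. A real random forest also samples a random subset of features at each split, which this sketch omits.

```python
import random
from collections import Counter


def bootstrap_sample(rows, rng):
    """Draw len(rows) items uniformly at random, with replacement."""
    return [rng.choice(rows) for _ in rows]


def fit_stump(rows):
    """Fit a threshold classifier on (x, label) pairs with labels 0 or 1."""
    best = None
    for threshold, _ in rows:
        # Try both orientations: low side -> 0 or low side -> 1.
        for low_label, high_label in ((0, 1), (1, 0)):
            errors = sum(
                (low_label if x <= threshold else high_label) != y
                for x, y in rows
            )
            if best is None or errors < best[0]:
                best = (errors, threshold, low_label, high_label)
    _, threshold, low_label, high_label = best
    return lambda x: low_label if x <= threshold else high_label


def fit_bagged_ensemble(rows, n_models, seed=0):
    """Train each base model on its own bootstrap sample of the data."""
    rng = random.Random(seed)
    return [fit_stump(bootstrap_sample(rows, rng)) for _ in range(n_models)]


def predict(ensemble, x):
    """Aggregate the base models' predictions by majority vote."""
    votes = Counter(model(x) for model in ensemble)
    return votes.most_common(1)[0][0]


# Made-up 1-D dataset: small x values are class 0, large ones class 1.
data = [(0.5, 0), (1.0, 0), (1.5, 0), (2.0, 0),
        (3.0, 1), (3.5, 1), (4.0, 1), (4.5, 1)]
ensemble = fit_bagged_ensemble(data, n_models=25)
print(predict(ensemble, 0.2), predict(ensemble, 5.0))
```

Using an odd number of base models avoids tied votes in this two-class setting.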

Reference

Databricks documentation on Random Forest: Random Forest in Spark ML
