CertGrid
Microsoft Study Guide

DP-100: Azure Data Scientist Associate Study Guide

DP-100 (Azure Data Scientist Associate) validates your ability to design machine learning solutions, prepare data, train and evaluate models, and operationalize them with Azure Machine Learning. It targets data scientists and ML engineers who use the Azure ML SDK v2, CLI v2, and studio to run experiments and deploy models. Expect heavy emphasis on the Azure ML workspace, compute, jobs, MLflow tracking, endpoints, and MLOps retraining patterns.

Domain 1: Design and Prepare a Machine Learning Solution

Key concepts you must know · 167 practice questions

Domain 2: Explore Data and Train Models

Key concepts you must know · 165 practice questions

Domain 3: Prepare a Model for Deployment

Key concepts you must know · 161 practice questions

Domain 4: Deploy and Retrain a Model

Key concepts you must know · 164 practice questions

DP-100 exam tips

Study guide FAQ

Does DP-100 use the Azure ML SDK v1 or v2?

The current exam centers on the v2 experience: the Python SDK v2, CLI v2 (az ml command group), and Azure ML studio. Learn v2 concepts like data assets, environments, command/sweep/pipeline jobs, and managed online/batch endpoints rather than the deprecated v1 Estimator and inference-config patterns.

How much coding is on the exam?

You should be able to read and reason about Python (SDK v2) and az ml CLI commands and YAML job specs, but you write little to no code from scratch. Most items are scenario-based multiple choice asking you to pick the right service, configuration, metric, or command.

What is the passing score and format?

You need 700 out of 1000 to pass. The exam runs about 100 minutes with roughly 40 to 60 questions in mixed formats: multiple choice, multiple-select, drag-and-drop ordering, and case studies, including the 'does this solution meet the goal' yes/no series.

How do I choose between AutoML and a hyperparameter sweep job?

Use AutoML when you want Azure ML to automatically try many algorithms and preprocessing/hyperparameter combinations and surface the best model with minimal code. Use a sweep job when you already have a chosen algorithm/training script and want to tune its hyperparameters using sampling (grid, random, Bayesian) with an early-termination policy such as bandit.