CertGrid
Cloud Native Study Guide

Prometheus Certified Associate (PCA) Study Guide

The Prometheus Certified Associate (PCA) is a vendor-neutral CNCF exam that validates foundational observability and Prometheus skills, including metrics fundamentals, PromQL, exporters, instrumentation, alerting, and dashboards. It is a 90-minute, multiple-choice exam aimed at developers, SREs, and platform engineers who deploy or query Prometheus and want to prove practical, hands-on monitoring knowledge.

Domain 1: Observability Concepts

Key concepts you must know · 85 practice questions

Domain 2: Prometheus Fundamentals

Key concepts you must know · 168 practice questions

Domain 3: PromQL

Key concepts you must know · 156 practice questions

Domain 4: Instrumentation and Exporters

Key concepts you must know · 153 practice questions

Domain 5: Alerting and Dashboards

Key concepts you must know · 136 practice questions

Prometheus Certified Associate (PCA) exam tips

Study guide FAQ

What format and passing requirements does the PCA exam have?

The PCA is a multiple-choice, online proctored exam lasting about 90 minutes. It is scored on a scaled basis and the passing score is 750. There is no hands-on lab component, but many questions present config snippets or PromQL expressions to interpret.

How much PromQL do I need to know?

PromQL is roughly a quarter of the exam and is the single highest-value area to study. You should be comfortable with label matchers, rate/irate/increase, aggregation with by/without, histogram_quantile, vector matching with on/ignoring and group_left/group_right, and functions like predict_linear, absent, and label_replace.

Do I need to know Grafana and Alertmanager for the exam?

Yes, at a foundational level. Expect questions on Alertmanager routing, grouping, deduplication, inhibition, silences, and notification timing, and on the roles of recording vs alerting rules. Grafana appears as the standard dashboarding layer that queries Prometheus via PromQL.

What are the most common conceptual traps to watch for?

Common traps include confusing SLI/SLO/error budget, assuming summary quantiles can be aggregated across instances (they cannot), misusing the Pushgateway for long-running services, embedding high-cardinality or PII values in labels, and forgetting that the for clause resets if any evaluation returns no data.