home/glossary/Counterfactual Fairness

Counterfactual Fairness

nounid 4802·updated May 13, 2026
candidate

No definition recorded.

MWE

Classifications

Entity Type

Unknownauthoritativeglossary_import_default_pending_classifier

Sensitivity

unclassified

Information Class

unclassified

Variants

plural
Counterfactual Fairnesses
possessive
Counterfactual Fairness's
pluralpossessive
Counterfactual Fairnesses'

Framework definitions

Notes on Measurement1 senseview framework →
§1
A fairness metric that checks whether a classifier produces the same result for one individual as it does for another individual who is identical to the first, except with respect to one or more sensitive attributes. Evaluating a classifier for counterfactual fairness is one method for surfacing potential sources of bias in a model
Counterfactual Fairness1 senseview framework →
§1
Given a predictive problem with fairness considerations, where A, X and Y represent the protected attributes, remaining attributes, and output of interest respectively, let us assume that we are given a causal model (U; V; F), where V = A \cup X. We postulate the following criterion for predictors of Y . Definition 5 (Counterfactual fairness). Predictor ^Y is counterfactually fair if under any context X = x and A = a, P( ^Y_{A - a} (U) = y | X = x; A = a) = P( ^Y_{A - a')(U) = y | X = x;A = a); (1) for all y and for any value a' attainable by A.

Outgoing relationships

No outgoing triples
This term is not the subject of any RDF-style relationship yet.

Incoming relationships

No incoming triples
No other term currently asserts a relationship to this one.