A company wants to improve the auditability of its AI training data. What is the concept of being able to record and track the origin and transformation history of data, that is, where it came from and what transformations it went through to reach its current form, called?

1 / 1
Select an answer
CorrectB

Explanation

A question about choosing the concept of tracking the origin and transformation history of data.

  • 1where it came from and what transformations it went through to reach its current formRepresents the provenance of the data
  • 2record and track the origin and transformation historyTracking provenance = data lineage
AIncorrect

Data cataloging

Data cataloging is the effort to organize metadata such as the location and schema of data assets to make them easier to find.

It is about organizing 'where things are,' not tracking the history (lineage) of 'where it came from and how it was transformed,' so this is incorrect.

BCorrect

Data lineage

Correct. Data lineage is the concept of recording and tracking the provenance of data, from its source through the transformations and processing it went through to reach its current form. It helps with tracing the cause of issues and ensuring auditability and reproducibility.

CIncorrect

Data classification

Data classification is the effort to categorize data according to its sensitivity and label it.

It is about dividing handling rules, not tracking the origin and transformation history, so this is incorrect.

DIncorrect

Data residency

Data residency is the requirement about which country or location data is stored in.

It is about the storage location, not tracking the origin and transformation history, so this is incorrect.

Key Takeaway

Note the correct answer, data lineage (provenance).
- The concept of recording and tracking, from the origin of the data, the transformations and processing it went through to reach its current form.
- It helps with tracing the cause when an issue occurs, supporting audits, and ensuring reproducibility.