Classification
Correct. Classification is a supervised learning task that assigns an input to a predefined category (discrete label). When the answer is a category, such as lesion or no lesion, the task is classification.
A hospital wants to build a model that uses labeled X-ray images previously diagnosed by radiologists to determine whether a new chest X-ray image shows a lesion or no lesion. There was also a proposal to estimate the size of the lesion as a number, but the only requirement this time is the lesion or no-lesion determination. Which type of task is this?
Choosing the type of task that predicts a category.
Classification
Correct. Classification is a supervised learning task that assigns an input to a predefined category (discrete label). When the answer is a category, such as lesion or no lesion, the task is classification.
Regression
Regression is a task that predicts a continuous number.
This question is about predicting a category, lesion or no lesion, so it is not regression, which predicts a number, and is incorrect.
Clustering
Clustering is unsupervised learning that groups unlabeled data into similar groups.
This question is supervised learning that predicts the correct label, lesion or no lesion, and is not grouping, so it is incorrect.
Dimensionality reduction
Dimensionality reduction is unsupervised learning preprocessing that compresses the features of data to make them easier to handle.
This question is a task that predicts a category, not feature compression, so it is incorrect.
Remember the differences among the four representative machine learning tasks.
・Classification: supervised learning that trains on labeled data and assigns an input to a category (discrete label) (for example, lesion / no lesion, dog / cat).
・Regression: also supervised learning, but it predicts a continuous number such as sales or temperature.
・Clustering: unsupervised learning that groups similar data without using labels (for example, customer segmentation).
・Dimensionality reduction: unsupervised learning that compresses features without using labels to make them easier to handle (used for visualization and preprocessing).
Identify them by output: category → classification, number → regression, unlabeled grouping → clustering, compression → dimensionality reduction.