Tokenization
This is incorrect. Tokenization is the process of splitting text into the smallest units (tokens) that a model processes. While both involve 'splitting,' the granularity is different — splitting at the word or sub-word level is too fine-grained for search units. The splitting into units for search is chunking.