原英文版地址: https://www.elastic.co/guide/en/elasticsearch/reference/7.7/analysis-lowercase-tokenizer.html, 原文档版权归 www.elastic.co 所有
本地英文版地址: ../en/analysis-lowercase-tokenizer.html
本地英文版地址: ../en/analysis-lowercase-tokenizer.html
重要: 此版本不会发布额外的bug修复或文档更新。最新信息请参考 当前版本文档。
Lowercase Tokenizeredit
The lowercase
tokenizer, like the
letter
tokenizer breaks text into terms
whenever it encounters a character which is not a letter, but it also
lowercases all terms. It is functionally equivalent to the
letter
tokenizer combined with the
lowercase
token filter, but is more
efficient as it performs both steps in a single pass.
Example outputedit
POST _analyze { "tokenizer": "lowercase", "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog's bone." }
The above sentence would produce the following terms:
[ the, quick, brown, foxes, jumped, over, the, lazy, dog, s, bone ]
Configurationedit
The lowercase
tokenizer is not configurable.