Classic token filter | Elasticsearch Guide [7.7]

原文地址: https://www.elastic.co/guide/en/elasticsearch/reference/7.7/analysis-classic-tokenfilter.html, 原文档版权归 www.elastic.co 所有

IMPORTANT: No additional bug fixes or documentation updates will be released for this version. For the latest information, see the current release documentation.

» » »

« CJK width token filter Common grams token filter »

Classic token filteredit

Performs optional post-processing of terms generated by the classic tokenizer.

This filter removes the english possessive ('s) from the end of words and removes dots from acronyms. It uses Lucene’s ClassicFilter.

Exampleedit

The following analyze API request demonstrates how the classic token filter works.

GET /_analyze
{
  "tokenizer" : "classic",
  "filter" : ["classic"],
  "text" : "The 2 Q.U.I.C.K. Brown-Foxes jumped over the lazy dog's bone."
}

The filter produces the following tokens:

[ The, 2, QUICK, Brown, Foxes, jumped, over, the, lazy, dog, bone ]

Add to an analyzeredit

The following create index API request uses the classic token filter to configure a new custom analyzer.

PUT /classic_example
{
    "settings" : {
        "analysis" : {
            "analyzer" : {
                "classic_analyzer" : {
                    "tokenizer" : "classic",
                    "filter" : ["classic"]
                }
            }
        }
    }
}

« CJK width token filter Common grams token filter »