看来,word_delimiter的设计只适用于单个术语。如果我有像下面这样的一段文字呢:
"Contra-indications of paracetamol can be of certain sorts"在这种情况下,word_delimiter获取整个句子并将其连接起来,而我只需要它连接"Contra-indications",以便在文本块内搜索contra indications、contra-indications和contraindications。
发布于 2016-05-11 01:30:11
你需要一个像这样的分析仪:
{
"settings": {
"analysis": {
"filter": {
"delimiter_filter": {
"type": "word_delimiter",
"catenate_words": true,
"preserve_original": true
}
},
"analyzer": {
"delimiter_analyzer": {
"type": "custom",
"tokenizer": "whitespace",
"filter": [
"lowercase",
"delimiter_filter"
]
}
}
}
},
"mappings": {
"assets": {
"properties": {
"domain": {
"type": "string",
"analyzer": "delimiter_analyzer"
}
}
}
}
}对于示例文本-- Contra-indications of paracetamol can be of certain sorts --这些都是它生成的术语:
"domain": [
"be",
"can",
"certain",
"contra",
"contra-indications",
"contraindications",
"indications",
"of",
"paracetamol",
"sorts"
]https://stackoverflow.com/questions/37148057
复制相似问题