我正在尝试理解elastic search如何在内部支持Geo Spatial搜索。
对于基本搜索,它使用倒排索引;但它如何与其他搜索条件相结合,如搜索特定半径内的特定文本。
我想了解如何存储和查询索引以支持这些查询的内部原理
发布于 2020-05-18 05:37:34
文本和地理查询相互独立地工作。让我们举一个具体的例子:
PUT restaurants
{
"mappings": {
"properties": {
"location": {
"type": "geo_point"
},
"menu": {
"type": "text",
"fields": {
"keyword": {
"type": "keyword"
}
}
}
}
}
}
POST restaurants/_doc
{
"name": "rest1",
"location": {
"lat": 40.739812,
"lon": -74.006201
},
"menu": [
"european",
"french",
"pizza"
]
}
POST restaurants/_doc
{
"name": "rest2",
"location": {
"lat": 40.7403963,
"lon": -73.9950026
},
"menu": [
"pizza",
"kebab"
]
}然后使用geo_distance过滤器对文本字段执行match操作:
GET restaurants/_search
{
"query": {
"bool": {
"must": [
{
"match": {
"menu": "pizza"
}
},
{
"geo_distance": {
"distance": "0.5mi",
"location": {
"lat": 40.7388,
"lon": -73.9982
}
}
},
{
"function_score": {
"query": {
"match_all": {}
},
"boost_mode": "avg",
"functions": [
{
"gauss": {
"location": {
"origin": {
"lat": 40.7388,
"lon": -73.9982
},
"scale": "0.5mi"
}
}
}
]
}
}
]
}
}
}由于geo_distance查询只分配true/false值(--> score=1;只检查位置是否在给定半径内),因此可能需要应用高斯function_score来提高位置,使其更接近给定的原点。
最后,这些分数可以通过使用_geo_distance排序来覆盖,这样您就可以只按邻近度排序(当然,同时保持match查询的完整性):
...
"query: {...},
"sort": [
{
"_geo_distance": {
"location": {
"lat": 40.7388,
"lon": -73.9982
},
"order": "asc"
}
}
]
}https://stackoverflow.com/questions/61853684
复制相似问题