Django haystack LocationField 创建为字符串而不是 elasticsearch 中的 geo_point

2024-01-09

我正在使用 django 1.8.9、django-rest-framework、django-haystack 和 Elasticsearch,并尝试让 LocationField 工作,索引已创建,但类型始终为string代替geo_point,所以显然地理搜索不起作用。

设置.py:

INSTALLED_APPS = (
    'django.contrib.auth',
    'django.contrib.contenttypes',
    'django.contrib.gis',
    'django.contrib.sessions',
    'django.contrib.sites',
    'django.contrib.messages',
    'django.contrib.staticfiles',
    'django.contrib.admin',
    'django_extensions',
    'elasticsearch',
    'rest_framework',
    'haystack',
)

要求.txt:

Django==1.8.9
django-appconf==1.0.1
django-compressor==1.6
django-extensions==1.6.1
django-filter==0.11.0
django-haystack==2.4.1
djangorestframework==3.3.1
djangorestframework-jwt==1.7.2
ecdsa==0.13
elasticsearch==2.2.0
Fabric==1.10.2
future==0.15.2
geopy==1.11.0
gunicorn==19.4.1
Markdown==2.6.5
paramiko==1.16.0
psycopg2==2.6.1
pycrypto==2.6.1
PyJWT==1.4.0
python-dateutil==2.4.2
python-memcached==1.57
setproctitle==1.1.9
six==1.10.0
urllib3==1.14

搜索索引.py:

from haystack import indexes
from blah.api.models import MyModel


class MyIndex(indexes.SearchIndex, indexes.Indexable):
    text = indexes.CharField(document=True, use_template=True)
    description = indexes.CharField(model_attr='description')
    location = indexes.LocationField(model_attr='get_location')
    created = indexes.DateTimeField(model_attr='created')

    def get_model(self):
        return MyModel

MyModel 上的 get_location 属性:

from haystack.utils.geo import Point
def get_location(self):
    return Point(self.lng, self.lat)

创建的elasticsearch索引(对格式感到抱歉!):

{  
   "myindex":{  
      "mappings":{  
         "modelresult":{  
            "properties":{  
               "created":{  
                  "type":"date",
                  "format":"strict_date_optional_time||epoch_millis"
               },
               "description":{  
                  "type":"string"
               },
               "django_ct":{  
                  "type":"string"
               },
               "django_id":{  
                  "type":"string"
               },
               "id":{  
                  "type":"string"
               },
               "location":{  
                  "type":"string"
               },
               "text":{  
                  "type":"string"
               }
            }
         }
      }
   }
}

有人有什么想法吗?感觉它是 django、django-haystack 和 elasticsearch 之间版本的组合,玩得不好,但我似乎无法让任何组合工作。


好吧,我已经弄清楚问题是什么了: 在Elasticsearch 2.0中,元数据发生了变化,其中之一是boost已被删除:https://www.elastic.co/guide/en/elasticsearch/reference/current/writing_20_mapping_changes.html#migration-meta-fields https://www.elastic.co/guide/en/elasticsearch/reference/current/breaking_20_mapping_changes.html#migration-meta-fields

追溯过去elasticsearch/transport.py,PUT 请求http://127.0.0.1:9200/myindex/_mapping/modelresult http://127.0.0.1:9200/myindex/_mapping/modelresult在正文中包含 "_boost": {"name": "boost", "null_value": 1.0} 。

因此,跟踪调用并将它们重新表示为 CURL:

创建索引

curl -X PUT -d '{"settings": {"analysis": {"filter": {"haystack_edgengram": {"max_gram": 15, "type": "edgeNGram", "min_gram": 2}, "haystack_ngram": {"max_gram": 15, "type": "nGram", "min_gram": 3}}, "tokenizer": {"haystack_ngram_tokenizer": {"max_gram": 15, "type": "nGram", "min_gram": 3}, "haystack_edgengram_tokenizer": {"max_gram": 15, "type": "edgeNGram", "side": "front", "min_gram": 2}}, "analyzer": {"edgengram_analyzer": {"filter": ["haystack_edgengram", "lowercase"], "type": "custom", "tokenizer": "standard"}, "ngram_analyzer": {"filter": ["haystack_ngram", "lowercase"], "type": "custom", "tokenizer": "standard"}}}}}' http://127.0.0.1:9200/myindex

失败的请求

curl -X PUT -d '{"modelresult": {"_boost": {"name": "boost", "null_value": 1.0}, "properties": {"django_id": {"include_in_all": false, "index": "not_analyzed", "type": "string"}, "description": {"type": "string", "analyzer": "snowball"}, "created": {"type": "date"}, "text": {"type": "string", "analyzer": "snowball"}, "django_ct": {"include_in_all": false, "index": "not_analyzed", "type": "string"}, "location": {"type": "geo_point"}}}}' http://127.0.0.1:9200/myindex/_mapping/modelresult

更改为这个作品

curl -X PUT -d '{"modelresult": {"properties": {"django_id": {"include_in_all": false, "index": "not_analyzed", "type": "string"}, "description": {"type": "string", "analyzer": "snowball"}, "created": {"type": "date"}, "text": {"type": "string", "analyzer": "snowball"}, "django_ct": {"include_in_all": false, "index": "not_analyzed", "type": "string"}, "location": {"type": "geo_point"}}}}' http://127.0.0.1:9200/myindex/_mapping/modelresult

所以,Python修复在 haystack/backends/elasticsearch_backend.py 中,注释掉第 137-140 行 current_mapping 中的 boost 部分

本文内容由网友自发贡献,版权归原作者所有,本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容,请联系:hwhale#tublm.com(使用前将#替换为@)

Django haystack LocationField 创建为字符串而不是 elasticsearch 中的 geo_point 的相关文章

随机推荐