Elasticsearch：在分面时排除过滤器可能吗？（就像在 Solr 中一样）

2023-12-28

我正在考虑从 Solr 更改为 ES。我找不到相关信息的一件事是 ES 是否允许我在分面时定义排除过滤器。

例如考虑producttype具有值：A,B,C我想关注这一点（即：显示计数）。还要考虑查询被限制为producttype: A.

在这种情况下，Solr 允许我指定我想要排除约束producttype: A影响刻面producttype。 IOW，它显示计数producttype就好像约束producttype: A尚未应用。

如何在 Solr 中执行此操作请参阅：http://wiki.apache.org/solr/SimpleFacetParameters http://wiki.apache.org/solr/SimpleFacetParameters> 标记和排除过滤器

在 ElasticSearch 中有什么办法可以做到这一点吗？

是的你可以。

虽然您可以在查询 DSL 中使用过滤器，但搜索 API 还接受顶级filter参数，用于在计算facet之后过滤搜索结果。

例如：

1）首先，创建你的索引，因为你想要product_type要被视为枚举，请将其设置为not_analyzed:

curl -XPUT 'http://127.0.0.1:9200/my_index/?pretty=1'  -d '
{
   "mappings" : {
      "product" : {
         "properties" : {
            "product_type" : {
               "index" : "not_analyzed",
               "type" : "string"
            },
            "product_name" : {
               "type" : "string"
            }
         }
      }
   }
}
'

2）索引一些文档（注意，文档3有不同的product_name):

curl -XPUT 'http://127.0.0.1:9200/my_index/product/1?pretty=1'  -d '
{
   "product_type" : "A",
   "product_name" : "foo bar"
}
'
curl -XPUT 'http://127.0.0.1:9200/my_index/product/2?pretty=1'  -d '
{
   "product_type" : "B",
   "product_name" : "foo bar"
}
'
curl -XPUT 'http://127.0.0.1:9200/my_index/product/3?pretty=1'  -d '
{
   "product_type" : "C",
   "product_name" : "bar"
}
'

3) 搜索名称包含以下内容的产品foo（不包括 doc 3，因此product_type C），计算面product_type对于所有具有foo in the product_name，然后按以下条件过滤搜索结果product_type == A:

curl -XGET 'http://127.0.0.1:9200/my_index/product/_search?pretty=1'  -d '
{
   "query" : {
      "text" : {
         "product_name" : "foo"
      }
   },
   "filter" : {
      "term" : {
         "product_type" : "A"
      }
   },
   "facets" : {
      "product_type" : {
         "terms" : {
            "field" : "product_type"
         }
      }
   }
}
'

# {
#    "hits" : {
#       "hits" : [
#          {
#             "_source" : {
#                "product_type" : "A",
#                "product_name" : "foo bar"
#             },
#             "_score" : 0.19178301,
#             "_index" : "my_index",
#             "_id" : "1",
#             "_type" : "product"
#          }
#       ],
#       "max_score" : 0.19178301,
#       "total" : 1
#    },
#    "timed_out" : false,
#    "_shards" : {
#       "failed" : 0,
#       "successful" : 5,
#       "total" : 5
#    },
#    "facets" : {
#       "product_type" : {
#          "other" : 0,
#          "terms" : [
#             {
#                "count" : 1,
#                "term" : "B"
#             },
#             {
#                "count" : 1,
#                "term" : "A"
#             }
#          ],
#          "missing" : 0,
#          "_type" : "terms",
#          "total" : 2
#       }
#    },
#    "took" : 3
# }

4) 执行搜索foo in the product_name，但通过指定来计算索引中所有产品的构面global范围：

# [Wed Jan 18 17:15:09 2012] Protocol: http, Server: 192.168.5.10:9200
curl -XGET 'http://127.0.0.1:9200/my_index/product/_search?pretty=1'  -d '
{
   "query" : {
      "text" : {
         "product_name" : "foo"
      }
   },
   "filter" : {
      "term" : {
         "product_type" : "A"
      }
   },
   "facets" : {
      "product_type" : {
         "global" : 1,
         "terms" : {
            "field" : "product_type"
         }
      }
   }
}
'

# [Wed Jan 18 17:15:09 2012] Response:
# {
#    "hits" : {
#       "hits" : [
#          {
#             "_source" : {
#                "product_type" : "A",
#                "product_name" : "foo bar"
#             },
#             "_score" : 0.19178301,
#             "_index" : "my_index",
#             "_id" : "1",
#             "_type" : "product"
#          }
#       ],
#       "max_score" : 0.19178301,
#       "total" : 1
#    },
#    "timed_out" : false,
#    "_shards" : {
#       "failed" : 0,
#       "successful" : 5,
#       "total" : 5
#    },
#    "facets" : {
#       "product_type" : {
#          "other" : 0,
#          "terms" : [
#             {
#                "count" : 1,
#                "term" : "C"
#             },
#             {
#                "count" : 1,
#                "term" : "B"
#             },
#             {
#                "count" : 1,
#                "term" : "A"
#             }
#          ],
#          "missing" : 0,
#          "_type" : "terms",
#          "total" : 3
#       }
#    },
#    "took" : 4
# }

更新以回答来自OP的扩展问题：

您还可以将过滤器直接应用于每个方面 - 这些称为facet_filters.

与之前类似的例子：

1）创建索引：

curl -XPUT 'http://127.0.0.1:9200/my_index/?pretty=1'  -d '
{
   "mappings" : {
      "product" : {
         "properties" : {
            "color" : {
               "index" : "not_analyzed",
               "type" : "string"
            },
            "name" : {
               "type" : "string"
            },
            "type" : {
               "index" : "not_analyzed",
               "type" : "string"
            }
         }
      }
   }
}
'

2）索引一些数据：

curl -XPUT 'http://127.0.0.1:9200/my_index/product/1?pretty=1'  -d '
{
   "color" : "red",
   "name" : "foo bar",
   "type" : "A"
}
'

curl -XPUT 'http://127.0.0.1:9200/my_index/product/2?pretty=1'  -d '
{
   "color" : [
      "red",
      "blue"
   ],
   "name" : "foo bar",
   "type" : "B"
}
'

curl -XPUT 'http://127.0.0.1:9200/my_index/product/3?pretty=1'  -d '
{
   "color" : [
      "green",
      "blue"
   ],
   "name" : "bar",
   "type" : "C"
}
'

3) 搜索、过滤同时具备这两种功能的产品type==Aand color == blue，然后对除“其他”过滤器之外的每个属性运行构面：

curl -XGET 'http://127.0.0.1:9200/my_index/product/_search?pretty=1'  -d '
{
   "filter" : {
      "and" : [
         {
            "term" : {
               "color" : "blue"
            }
         },
         {
            "term" : {
               "type" : "A"
            }
         }
      ]
   },
   "facets" : {
      "color" : {
         "terms" : {
            "field" : "color"
         },
         "facet_filter" : {
            "term" : {
               "type" : "A"
            }
         }
      },
      "type" : {
         "terms" : {
            "field" : "type"
         },
         "facet_filter" : {
            "term" : {
               "color" : "blue"
            }
         }
      }
   }
}
'

# [Wed Jan 18 19:58:25 2012] Response:
# {
#    "hits" : {
#       "hits" : [],
#       "max_score" : null,
#       "total" : 0
#    },
#    "timed_out" : false,
#    "_shards" : {
#       "failed" : 0,
#       "successful" : 5,
#       "total" : 5
#    },
#    "facets" : {
#       "color" : {
#          "other" : 0,
#          "terms" : [
#             {
#                "count" : 1,
#                "term" : "red"
#             }
#          ],
#          "missing" : 0,
#          "_type" : "terms",
#          "total" : 1
#       },
#       "type" : {
#          "other" : 0,
#          "terms" : [
#             {
#                "count" : 1,
#                "term" : "C"
#             },
#             {
#                "count" : 1,
#                "term" : "B"
#             }
#          ],
#          "missing" : 0,
#          "_type" : "terms",
#          "total" : 2
#       }
#    },
#    "took" : 3
# }

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

Elasticsearch：在分面时排除过滤器可能吗？（就像在 Solr 中一样）的相关文章

solr 中的文本字段排序

我正在使用 solr 3 4 并希望 solr 搜索结果在文本字段上排序如何实现像 int 自然排序一样对文本字段进行排序有没有办法在查询时将文本字段转换为int 我的排序字段是字符串类型我希望它在排序时表现得像 int 字段我无法
yii2 作曲家更新致命错误

当我更新我的作曲家以添加yii2 solr扩展我的项目时我遇到如下错误 The yiisoft yii2 composer plugin requires composer plugin api 1 0 0 this WILL break
Elasticsearch 对字符串排序未返回预期结果

当对包含多个单词的字符串字段进行排序时 Elasticsearch 会拆分字符串值并使用最小值或最大值作为排序值即当对值为老虎之眼的字段进行升序排序时排序值为 Eye 当按降序排序时排序值为 Tiger 假设我的索引中有老虎之
使elasticsearch中的所有对象嵌套对象

是否可以让elasticsearch中的所有嵌套对象自动映射到默认嵌套的类型而不是对象是的您可以使用以下方法来做到这一点动态模板 https www elastic co guide en elasticsearch referenc
在ElasticSearch中搜索没有时间的日期字段值

我的数据中有一个日期字段为 type date format dateOptionalTime 现在我的日期字段和值是 INITIAL EXTRACT DATE 2015 04 02T06 47 57 78 05 30 在搜索时我仅根据
如何在弹性搜索（aws）中存储日期范围数据并搜索范围？

我正在尝试在弹性搜索中存储酒店房间可用性然后我需要搜索从某个日期到另一个日期可用的房间我想出了存储数据以确保可用性的两种方式如下这里可用性字典存储了所有日期每个日期键的值是 true 或 false 代表其可用那天与否 id
SolrNet：过滤查询时保留 Facet 计数

当我查询时我收到以下方面 Field1 Key Best Facet 1 Value 999 Key Best Facet 2 Value 999 Field2 Key Second Best Facet 1 Value 421 Key
Elasticsearch 崩溃后无法恢复

磁盘空间不足导致 Elasticsearch 分片崩溃三个节点现在为红色两个节点已恢复它们的状态为黄色 ES 的 CPU 利用率为 150 内存利用率很高正在尝试恢复它们但似乎存在一些版本匹配冲突我清理了磁盘空间并删除了分片的
在 ElasticSearch 7+ 中，如何搜索所有文本字段？

我想在 Elasticsearch 7 3 中存储的文档中搜索单词我希望在以前版本的 Elasticsearch 上运行的一个示例是 query bool must match all oliver must not should fro
Solr 4.0 中的 BaseTokenFilterFactory 去哪儿了？

用于创建您自己的标记和字符过滤器的 Solr 文档说明如下 http wiki apache org solr AnalyzersTokenizersTokenFilters Specifying an Analyzer in the sc
ckan本地安装，solr JSP支持未配置500错误

我正在尝试使用 Ubuntu 14 04 LTS 在本地计算机上安装 CKAN 我按照从找到的源安装的说明进行操作here http docs ckan org en latest maintaining installing instal
随着索引和文档数量恒定，elasticsearch 批量索引会随着时间的推移而变慢

我遇到了使用 NET NEST 客户端和 ElasticSearch 进行批量索引的性能随着时间的推移索引数量和文档数量恒定而降低的情况我们正在奔跑ElasticSearch Version 0 19 11 JVM 23 5 b02在具
局部敏感哈希 - Elasticsearch

有没有允许在 Elasticsearch 上使用 LSH 的插件如果是的话您能否指出该位置并告诉我如何使用它谢谢编辑我发现ES使用了MinHash插件我怎样才能用这个来比较文件呢查找重复项的最佳设置是什么有一个Elastic
Solr MoreLikeThis 不适用于多个分片？

我在 SolrCloud 中有 5 个节点集群每个节点有 2 个分片 Solr版本 6 3 0 现在当我运行 mlt 查询时它仅返回每个节点的结果并且不会将它们分布在所有分片节点上即没有给出任何结果给出结果我什至尝试将其指
从 App Engine 连接到 Kubernetes 引擎

我们希望使用应用程序引擎灵活的流程来更新位于 Google Kubernetes Engine 上的 ElasticSearch 索引我们需要通过 http s 地址连接到 ElasticSearch 推荐的方法是什么我们不想将集群暴露
需要在 java api 中的 Solr 搜索中搜索文本及其周围的几行

我正在使用 solr 7 7 2 并且我使用 solrj 在 Solr 中编写了一个 Java 程序该程序在一个巨大的文本文件中搜索单词我使用以下代码来显示代表整个文本的搜索结果 SolrQuery params new SolrQue
ElasticSearch 定义自定义映射与默认“_doc”映射冲突

尝试创建自定义映射类型时会发生此问题为第一个插入弹性创建自定义映射后想要创建 doc映射类型和冲突就发生在这里第一步我创建一个映射 mappings properties field1 type keyword field2 type
在弹性搜索中使用 GET/POST 时的不同结果

我正在通过 Elastic Search Head 插件尝试弹性搜索当我通过 POST 提交查询时结果符合预期但是当我使用 GET 尝试相同的查询时我总是会返回索引中的所有值那么如何通过 GET 将查询传递到弹性搜索服务器以
在Windows Xampp上安装和使用elasticsearch php客户端

我下载的是elasticsearch 5 1 1 zip来自https www elastic co downloads elasticsearch https www elastic co downloads elasticsearch
ElasticSearch - 仅获取与搜索响应中所有顶级字段匹配的嵌套对象

假设我有以下文档 id 1 name xyz users name abc surname def name xyz surname wef name defg surname pqr 我只想获取与搜索响应中的所有顶级字段匹配的嵌套对象我

随机推荐

在哪里可以找到 Andrew Richards 为 WinDBG 编写的 pde 扩展？

我在网上的一些资源中看到提到它但我找不到它它似乎没有包含在 WinDBG 发行版中有一个公共 OneDrive 其中包含它的 ZIP 文件
如何让 Wireshark 显示我的本地 HTTP 流量？

当我输入此 URI 以在正在运行的 Web API 应用程序上调用 REST 方法时 http SHANNON2 21608 api inventory sendXML duckbill platypus someFileName usin
从数据集到数据表获取过滤后的数据

如何过滤数据集到数据表中的数据就像代码 gt DataRow dr DS Tables 0 Select STAGENAME Develop AND DEVLAPSEDAYS IS NOT NULL 我如何在这里使用数据表以下代码不反映
Gdk pixbuf 从内存加载图像

使用 GTK 3 6 我想显示内存缓冲区中的图像而不是磁盘上的文件我有一个const char data使用图像数据我正在尝试从中创建 GTK 图像到目前为止我已经尝试了两种我认为可行的方法两者都使用GdkPixbuf 因此需要
xsl自动显示xml数据，无需硬编码

这是我的 xml 数据
将电子应用程序发布到 Windows 商店时如何解决“可用的应用程序图标包含默认图标”？

我在这里开源了我的电子反应项目 windows 终端调整器 https github com nateshmbhat windows terminal tweaker 运行后npm run release来自renderer文件夹它在中构
JavaScript 正则表达式性能。

我有一个函数可以纠正一系列异常大写单词的大小写 var line some long string of text AppleScript Bluetooth DivX FireWire GarageBand iPhone iTunes i
C++ 互递归变体类型

我正在尝试使用变体在 C 中表示 PDF 对象类型 PDF 对象是以下对象之一 Boolean Integer Real String Name Stream Array Map
如何以编程方式加载配置文件

假设我有一个自定义配置文件它对应于自定义定义的 ConfigurationSection 和 Config 元素这些配置类存储在库中配置文件看起来像这样
PHP JSON BigInt 编码

我有这样的数组 array id gt 76561198165327575 我需要它在客户端的 JavaScript 中工作所以我试图用它来编码JSON NUMERIC CHECK json encode array JSON NUMER
React 如何决定重新渲染组件

我知道 React 有一个生命周期方法叫做shouldComponentUpdate 默认情况下返回 true 这就是组件决定更新的方式但是当该组件的状态或属性发生变化时如何调用该生命周期方法当我们收到新的 props 或 state
类型错误：this.state.患者.map 不是函数

我是 React js 新手我正在学习创建 React 应用程序但我遇到了映射函数的问题这是我的请求以及我尝试呈现数据的方式 class Patients extends Component constructor props sup
在 php 中第二次发送不同的邮件时，如何删除 phpmailer 中的附件

在 php 文件中我需要向 2 个不同的 ID 发送 2 封不同的电子邮件当我使用如下所示的两个变量时它不起作用 require PHPmailer class phpmailer php First Email email new
使用 Spark 作业服务器进行 Spark SQL 作业时出现“此上下文的作业类型无效”错误

我使用 Spark 作业服务器创建 Spark SQL 作业并按照以下示例使用 HiveContext https github com spark jobserver spark jobserver blob master job se
在 javascript 代码中使用 Url.Action 的更好解决方案

在我当前使用asp net MVC 3 使用razor 的项目中当我进行Ajax调用时我必须将JS保留在视图页面上因为我想使用Url Action来生成URL 这意味着我无法将 js 代码拆分为 JS 文件是否有比我目前正在做的更好
如何将 Three/js 相机控制从第一人称切换到轨道并返回

您可以毫无问题地从 Three js Orbit Controls 切换到 FirstPerson 控件但是当您从第一人称切换到轨道时显示屏会陷入鼠标按下模式您需要做什么才能在第一人称和轨道控制之间无缝地来回切换带有演
什么是常量数组？

什么是常量数组如果我们定义 const char hex char 0 1 2 3 4 5 6 7 8 9 A B C D E F 那么它不应该被程序修改这是什么意思这意味着您无法修改其内容例如您不可以这样做hex char i
Clang Static Analyzer没有发现最基本的问题

我想尝试一下 clang 静态分析器我在 Windows 上使用 Visual Studio 构建 clang 它似乎有效但同时又似乎极其无用我做了一个示例文件示例 c int main void int h 0 return 1
编译器强制的语义类型

假设我有一个代表自动机的类其状态已编号 using state t unsigned 并且其跃迁也编号为 using transition t unsigned 当然在某些时候我最终会弄乱一些电话因为transition t and
Elasticsearch：在分面时排除过滤器可能吗？（就像在 Solr 中一样）

我正在考虑从 Solr 更改为 ES 我找不到相关信息的一件事是 ES 是否允许我在分面时定义排除过滤器例如考虑producttype具有值 A B C我想关注这一点即显示计数还要考虑查询被限制为producttype A 在这种情

Elasticsearch：在分面时排除过滤器可能吗？ （就像在 Solr 中一样）

Elasticsearch：在分面时排除过滤器可能吗？ （就像在 Solr 中一样） 的相关文章

随机推荐

热门标签

Elasticsearch：在分面时排除过滤器可能吗？（就像在 Solr 中一样）

Elasticsearch：在分面时排除过滤器可能吗？（就像在 Solr 中一样）的相关文章