ElasticSearch - function_score 实例说明

2023-11-19

官方说明

function_score

通过实例说明

  • 先准备数据和索引,在ES插入三笔数据,其中language是keywork类型,like是integer类型(代表点赞量)

    { "language": "java", "like": 5 }
    { "language": "python", "like": 5 }
    { "language": "go", "like": 10 }
    
  • functions是一个数组,裡面放著的是将要被使用的加强函数列表,我们在裡面使用了3个filter去过滤数据,并且每个filter都设置了一个加强函数,并且还使用了一个会应用到所有文档的field_value_factor加强函数

    • 可以为列表裡的每个加强函数都指定一个filter,这样做的话,只有在文档满足此filter的要求,此filter的加强函数才会应用到文挡上,也可以不指定filter,这样的话此加强函数就会应用到全部的文挡上

    • 一个文档可以一次满足多条加强函数和多个filter,如果一次满足多个,那麽就会产生多个加强score

    • 因此ES会先使用score_mode定义的方式来合併这些加强score们,得到一个总加强score,得到总加强score之后,才会再使用boost_mode定义的方式去和old_score做合併

    GET 127.0.0.1/mytest/doc/_search
    {
        "query": {
            "function_score": {
                "query": {
                    "match_all": {}  //match_all查出来的所有文档的_score都是1
                },
                "functions": [
                    //第一个filter(使用weight加强函数),如果language是java,加强score就是2
                    {
                        "filter": {
                            "term": {
                                "language": "java"
                            }
                        },
                        "weight": 2
                    },
                    //第二个filter(使用weight加强函数),如果language是go,加强score就是3
                    {
                        "filter": {
                            "term": {
                                "language": "go"
                            }
                        },
                        "weight": 3
                    },
                    //第三个filter(使用weight加强函数),如果like数大于等于10,加强score就是5
                    {
                        "filter": {
                            "range": {
                                "like": {
                                    "gte": 10
                                }
                            }
                        },
                        "weight": 5
                    },
                    //field_value_factor加强函数,会应用到所有文档上,加强score就是like值
                    {
                        "field_value_factor": {
                            "field": "like"
                        }
                    }
                ],
                "score_mode": "multiply", //设置functions裡面的加强score们怎麽合併成一个总加强score
                "boost_mode": "multiply" //设置old_score怎麽和总加强score合併
            }
        }
    }
    
    "hits": [
        {
            "_score": 150, //go同时满足filter2、filter3,且还有一个加强函数field_value_factor产生的加强,因此加强score为3, 5, 10(like的值是10),总加强score为3*5*10=150
            "_source": { "language": "go", "like": 10 }
        },
        {
            "_score": 10, //java只满足filter1,但是因为还有field_value_facotr产生的加强score,因此加强score为2, 5,总加强score为2*5=10
            "_source": { "language": "java", "like": 5 }
        },
        {
            "_score": 5, //python不满足任何filter,因此加强score只有field_value_factor的like值,就是5
            "_source": { "language": "python", "like": 5 }
        }
    ]
    
  • 其实weight加强函数也是可以不和filter搭配,自己单独使用的,只是这样做没啥意义,因为只是会给全部的文档都增加一个固定值而已

    • 不过就DSL语法上来说,他也像其他加强函数一样,是可以直接使用而不用加filter的
    GET 127.0.0.1/mytest/doc/_search
    {
        "query": {
            "function_score": {
                "query": {
                    "match_all": {}
                }
            },
            functions: [
                {
                    "weight": 3
                }
            ]
        }
    }
    
    "hits": [
        {
            "_score": 3,
            "_source": { "language": "go", "like": 10 }
        },
        {
            "_score": 3,
            "_source": { "language": "python", "like": 5 }
        },
        {
            "_score": 3,
            "_source": { "language": "java", "like": 5 }
        }
    ]
    
  • weight加强函数也可以用来调整每个语句的贡献度,权重weight的默认值是1.0,当设置了weight,这个weight值会先和自己那个{}裡的每个句子的评分相乘,之后再通过score_mode和其他加强函数合併

    • 下面的查询,公式为new_score = old_score * [ (like值 * weight1) + weight2 ]
    • 公式解析 : weight1先加强like值(只能使用乘法),接著再透过score_mode定义的方法(sum)和另一个加强函数weight2合併,得到一个总加强score,最后再使用boost_mode定义的方法(默认是multiply)和old_score做合併,得到new_score
    GET 127.0.0.1/mytest/doc/_search
    {
        "query": {
            "function_score": {
                "query": {
                    "match_all": {}
                }
            },
            functions: [
                {
                    "field_value_factor": {
                        "field": "like"
                    },
                    "weight": 3  //weight1, 加强field_value_factor,只能使用乘法,无法改变
                },
                {
                    "weight": 20 //weight2
                }
            ],
            "score_mode": "sum"
        }
    }
    
    "hits": [
        {
            "_score": 50,
            "_source": { "language": "go", "like": 10 }
        },
        {
            "_score": 35,
            "_source": { "language": "python", "like": 5 }
        },
        {
            "_score": 35,
            "_source": { "language": "java", "like": 5 }
        }
    ]
    
    

其他题目样例

题目1

现有如下电影数据。查询title包含lord或者ring,如果tags字段(数组)包含death of a friend短语,那么提高权重

PUT movie_data/_bulk
{"index":{}}
{"revenue":0,"release_date":"2008-07-16T06:00:00.000Z","minutes":130,"movie_id":"15067","title":"The Good, The Bad, The Weird","tags":["gunslinger","gun"],"budget":10000000,"avg_score":7.1}
{"index":{}}
{"revenue":871368364,"release_date":"2001-12-18T07:00:00.000Z","minutes":178,"movie_id":"120","title":"The Lord of the Rings: The Fellowship of the Ring","tags":["elves","dwarves","orcs","middle-earth (tolkien)","hobbit","based on novel","mountains","fireworks","castle","volcano","password","death of a friend","uncle","mirror","wizard","sword and sorcery"],"budget":93000000,"avg_score":8}
{"index":{}}
{"revenue":1118888979,"release_date":"2003-12-01T07:00:00.000Z","minutes":201,"movie_id":"122","title":"The Lord of the Rings: The Return of the King","tags":["elves","orcs","middle-earth (tolkien)","based on novel","suspicion","bravery","war","honor","troll","brutality","violence","ghost","end of trilogy","quest","sword and sorcery"],"budget":94000000,"avg_score":8.1}
{"index":{}}
{"revenue":15304890,"release_date":"2006-09-06T06:00:00.000Z","minutes":96,"movie_id":"1381","title":"The Fountain","tags":["brain tumor","operation","queen","love of one's life","surgeon","tree","dying and death","transience","immortality","maya civilization","monkey","conquest"],"budget":35000000,"avg_score":6.8}
{"index":{}}
{"revenue":0,"release_date":"2000-09-08T06:00:00.000Z","minutes":96,"movie_id":"10685","title":"The Watcher","tags":["chicago","fbi","menace","covered investigation","state of emergency","investigation","suspense","serial killer","little girl","psychiatrist","fbi agent","psychotherapy"],"budget":33000000,"avg_score":4.9}
{"index":{}}
{"revenue":54700105,"release_date":"2004-04-15T06:00:00.000Z","minutes":124,"movie_id":"7220","title":"The Punisher","tags":["chain","submachine gun","undercover","smuggling","twin brother","marvel comic","one man army","massacre","extreme violence","family reunion","pier"],"budget":33000000,"avg_score":6.1}
{"index":{}}
{"revenue":78636257,"release_date":"2016-04-07T06:00:00.000Z","minutes":91,"movie_id":"323676","title":"The Boss","tags":["business woman","ex-con","duringcreditsstinger","girl scouts"],"budget":29000000,"avg_score":5.5}
{"index":{}}
{"revenue":115922175,"release_date":"2013-08-16T06:00:00.000Z","minutes":132,"movie_id":"132363","title":"The Butler","tags":["white house","butler","biography","civil rights"],"budget":25000000,"avg_score":7.2}
{"index":{}}
{"revenue":24902723,"release_date":"2015-08-06T06:00:00.000Z","minutes":120,"movie_id":"293646","title":"The 33","tags":["mining","chile","based on true story","survival","woman director","mining accident"],"budget":25000000,"avg_score":6}
{"index":{}}
{"revenue":85446075,"release_date":"2012-08-30T06:00:00.000Z","minutes":92,"movie_id":"77883","title":"The Possession","tags":[],"budget":14000000,"avg_score":5.8}
{"index":{}}
{"revenue":71009334,"release_date":"2013-10-25T06:00:00.000Z","minutes":117,"movie_id":"109091","title":"The Counselor","tags":["poetry","lawyer","drug smuggling","drug trafficking","red dress"],"budget":25000000,"avg_score":5}
{"index":{}}
{"revenue":43928932,"release_date":"2002-10-02T06:00:00.000Z","minutes":92,"movie_id":"4108","title":"The Transporter","tags":["car journey","transportation","auto","human trafficking"],"budget":21000000,"avg_score":6.6}
{"index":{}}
{"revenue":0,"release_date":"2009-10-16T06:00:00.000Z","minutes":101,"movie_id":"19904","title":"The Stepfather","tags":["step father","remake"],"budget":0,"avg_score":5.4}
{"index":{}}
{"revenue":75597042,"release_date":"2002-03-25T07:00:00.000Z","minutes":127,"movie_id":"14635","title":"The Rookie","tags":["father son relationship","baseball","sports team","sport","life's dream","growing up"],"budget":20000000,"avg_score":6.5}
{"index":{}}
{"revenue":0,"release_date":"2003-12-12T07:00:00.000Z","minutes":120,"movie_id":"41488","title":"The Statement","tags":[],"budget":0,"avg_score":5.9}
{"index":{}}
{"revenue":29657751,"release_date":"2012-03-09T07:00:00.000Z","minutes":111,"movie_id":"70436","title":"The Raven","tags":["poison","blackmail","masked ball","historical figure","buried alive","serial killer","deadline","edgar allan poe","newspaper review","baltimore maryland","newspaper office","hard times","life imitates art","pendulum"],"budget":26000000,"avg_score":6.1}
{"index":{}}
{"revenue":124272124,"release_date":"2011-08-09T06:00:00.000Z","minutes":146,"movie_id":"50014","title":"The Help","tags":["mississippi","based on novel","exploitation","racial segregation","racism","writer","maid","moral courage","ressentiment","southern belle","racial issues","1960s","newspaper columnist"],"budget":25000000,"avg_score":7.8}
{"index":{}}
{"revenue":0,"release_date":"2002-09-27T06:00:00.000Z","minutes":98,"movie_id":"10771","title":"The Tuxedo","tags":["bomb","intelligence","chauffeur","wound","secret agent","head injury"],"budget":0,"avg_score":5.3}
{"index":{}}
{"revenue":69700000,"release_date":"2000-07-07T06:00:00.000Z","minutes":104,"movie_id":"4244","title":"The Kid","tags":["age difference","midlife crisis","suppressed past","self-awareness","childhood memory","humor","changing the past or future"],"budget":65000000,"avg_score":6}
{"index":{}}
{"revenue":179379533,"release_date":"2012-05-15T06:00:00.000Z","minutes":83,"movie_id":"76493","title":"The Dictator","tags":["kurdish"],"budget":65000000,"avg_score":5.9}

答案

GET movie_data/_search
{
  "query": {
    "function_score": {
      "query": {
        "terms": {
          "title": [
            "lord",
            "ring"
          ]
        }
      },
      "functions": [
        {
          "filter": {
            "match_phrase": {
              "tags": "death of a friend"
            }
          },
          "weight": 2
        }
      ],
      "boost_mode": "max"
    }
  }
}

参考地址

主要参考:https://blog.csdn.net/weixin_40341116/article/details/80931573

本文内容由网友自发贡献,版权归原作者所有,本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容,请联系:hwhale#tublm.com(使用前将#替换为@)

ElasticSearch - function_score 实例说明 的相关文章

随机推荐

  • 【redis】redis的 key的命名规则

    key的命名规则 定义为 MS TEN SESSION KEY IN LOGIN NAME fqh 使用 进行分割 这样存入redis的是有层次结构的 如下
  • Java程序员常常为这个字所困扰!Java

    Java程序员常常为这个字所困扰 Java Java是一种流行的编程语言 广泛应用于开发各种类型的应用程序 然而 有一个字在Java开发中经常让程序员感到困惑和挫败 这个字就是 异常 异常在Java中是一个重要的概念 用于处理程序运行过程中
  • Peewee的坑

    db create tables Student 当如上使用时可能会报表Student不存在的错误 官方实例如db create tables Student Pet 改成db create tables Student safe True
  • .NET概述

    1 NET技术体系结构 NET平台是Microsoft在20世纪末为了迎接互联网的挑战而推出的应用程序平台 经过近年来的发展 它如今几乎可以在任何硬件平台上发挥作用 服务器 台式机 移动设备 游戏机 虚拟现实 增强现实环境 手表 甚至诸如R
  • TestBench编写_激励产生

    TestBench编写 激励产生 TestBench编写 激励产生 基本背景 读取函数介绍 a fopen函数使用 b fread函数使用 c fclose函数使用 实际使用 TestBench编写 激励产生 基本背景 最近遇到项目中需要对
  • 采用Vivado 配置xilinx GTX的SATA设计

    从Vivado开始 配置GTX的时候 多了一个SATA协议支持 但有些小地方还需要自己另外设置 整理了一下 分享给大家 首先打开Transceivers wizard 打开页签 线速率和参考时钟选择 在协议里面选择SATA2或者SATA3
  • leetcode696题计数二进制子串

    696题计数二进制子串 给定一个字符串 s 计算具有相同数量0和1的非空 连续 子字符串的数量 并且这些子字符串中的所有0和所有1都是组合在一起的 重复出现的子串要计算它们出现的次数 示例 1 输入 00110011 输出 6 解释 有6个
  • Python-opencv之目标定位

    最近团队准备参加一个无人机比赛 大致的规则是这样的 固定翼飞机从跑道起飞 然后在空中转体360 通过GPS粗定位飞行至一个高13米左右 宽6米左右八字形框前 距离约50米左右 这时依靠计算机视觉的方法 让飞机准确的穿过去 之后还有其他的动作
  • idea git提交忽略文件

    1 在idea的Plugin中 搜索 ignore插件 并安装使用 安装成功后会重启idea 2 在项目的根目录下创建一个创建一个 gitignore file 3 编辑gitignore文件 4 提交gitignore文件后 在该文件中限
  • unity3d 理解刚体(Rigidbody)和碰撞体(Collider)以及触发器(Is Trigger),边学边更新

    unity3d 理解刚体 Rigidbody 和碰撞体 Collider 以及触发器 Is Trigger 边学边更新 分类 Unity3D 2014 04 01 16 50 2755人阅读 评论 2 收藏 举报 刚体 Rigidbody
  • 8-0. 查找整数(10)

    本题要求从输入的N个整数中查找给定的X 如果找到 输出X的位置 从0开始数 如果没有找到 输出 Not Found 输入格式 输入在第1行中给出2个正整数N lt 20 和X 第2行给出N个整数 数字均不超过长整型 其间以空格分隔 输出格式
  • 强化学习实验中的绘图技巧-使用seaborn绘制paper中的图片

    强化学习实验中的绘图技巧 使用seaborn绘制paper中的图片 使用seaborn绘制折线图时参数数据可以传递ndarray或者pandas 不同的源数据对应的其他参数也略有不同 1 ndarray 先看一个小例子 def getdat
  • mysql指令清除临时表_MySQL如何删除#sql开头的临时表

    1 现象 巡检时发现服务器磁盘空间不足 通过查看大文件进行筛选是发现有几个 sql开头的文件 且存在超过100G及10G以上的文件 2 原因 如果MySQL在一个 ALTER TABLE操作 ALGORITHM INPLACE 的中间退出
  • 全网最详细中英文ChatGPT-GPT-4示例文档-智能聊天机器人从0到1快速入门——官网推荐的48种最佳应用场景(附python/node.js/curl命令源代码,小白也能学)

    从0到1快速入门智能聊天机器人应用场景 Introduce 简介 setting 设置 Prompt 提示 Sample response 回复样本 API request 接口请求 python接口请求示例 node js接口请求示例 c
  • Java之单元测试(JUnit单元测试框架)

    一 概述 单元测试就是针对最小的功能单元编写测试代码 Java程序最小的功能单元是方法 所以单元测试就是针对Java方法的测试 进而检查方法的正确性 常规测试有什么问题 只有一个main方法 如果一个方法的测试失败了 其他方法会受到影响 无
  • 都是分号惹的祸(ORA-00911: invalid character)

    今天在写SQL查询Oracle中的数据时遇到一个问题 在一般的SQL查询分析器中写好的SQL语句 运行一切正常 扔到用C 写的程序中就报错 错误代码如下 System Data OleDb OleDbException One or mor
  • PHP的语法

    h3 h3 p p 1 语言标记 开始标记 中间写PHP代码 2 echo 输出内容 3 语句结束符 是结束符 代码碰到 号 才表示一句代码完成 标记里的最后一行 可以不用 号 但是养成习惯
  • 2023年计算机科学与信息技术国际会议(ECCSIT 2023)

    会议简介 Brief Introduction 2023年计算机科学与信息技术国际会议 ECCSIT 2023 会议时间 2023年12月15日 17日 召开地点 中国 北海 大会官网 www eccsit org 2023年计算机科学与信
  • Linux查看当前文件夹的大小

    在Linux中 可以使用du disk usage 命令来查看当前文件夹的大小 以下是一些使用du的方法 查看当前文件夹的大小 为了查看当前文件夹的总大小 可以在文件夹中运行 du sh 这里 s 表示摘要模式 只显示总计 h 表示人类可读
  • ElasticSearch - function_score 实例说明

    官方说明 function score 通过实例说明 先准备数据和索引 在ES插入三笔数据 其中language是keywork类型 like是integer类型 代表点赞量 language java like 5 language py