
List

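Background

Ranking is a prediction task on a list of objects. The training task of point-wise and pair-wise methods therefore differs from how the model is actually used, so list-wise methods should in principle do better.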

list-wise ranking with S-IE

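For the paper, see reference [1].

Session Information Embedding (S-IE)

This is effectively a pre-training stage. The task is binary classification of positive vs. negative samples, in preparation for the list-wise stage that follows.

Figure: the clicked items and the exposed items are pooled separately, then concatenated with the target and user representations.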
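The pooling-and-concat step in the figure can be sketched roughly as follows. This is only an illustration under assumptions: mean pooling is assumed (the note only says "pooling"), and the function names and toy embeddings are hypothetical, not taken from the paper.

```python
def mean_pool(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def session_input(clicked, exposed, target, user):
    """Pool clicked and exposed item embeddings separately, then concatenate
    the two pooled vectors with the target-item and user embeddings.
    Mean pooling is an assumption; the figure only says "pooling"."""
    return mean_pool(clicked) + mean_pool(exposed) + list(target) + list(user)

# Toy 2-dimensional item embeddings and a 1-dimensional user embedding.
clicked = [[1.0, 2.0], [3.0, 4.0]]
exposed = [[0.0, 0.0], [2.0, 2.0], [4.0, 1.0]]
x = session_input(clicked, exposed, target=[0.5, 0.5], user=[9.0])
print(x)  # [2.0, 3.0, 2.0, 1.0, 0.5, 0.5, 9.0]
```

The concatenated vector would then feed the binary classifier used in the pre-training task.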

list-wise ranking

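Figure: the formulas in the paper are badly typeset and contain errors. In Eq. (1), the subscript i in the numerator should probably be $t$ and the subscript i in the denominator should probably be $l$; in Eq. (2), the i and the closing parenthesis should sit in superscript position.

Experiments

Dataset: CIKM Cup 2016, which consists of logs from the search engine of an e-commerce site.
Metric: nDCG. CTR prediction is usually framed as binary classification and evaluated with AUC/GAUC. Since the setting here is list-wise, nDCG is used instead.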
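For reference, nDCG can be computed along these lines. This is a minimal sketch that assumes the common gain $2^{rel}-1$ and discount $\log_2(rank+1)$; the note does not say which variant the paper uses, and the toy labels and scores are made up.

```python
import math

def dcg(relevances, k=None):
    """Discounted cumulative gain with gain 2^rel - 1 and discount log2(rank + 1)."""
    rels = relevances[:k] if k is not None else relevances
    return sum((2 ** r - 1) / math.log2(i + 2) for i, r in enumerate(rels))

def ndcg(scores, labels, k=None):
    """nDCG of the ranking obtained by sorting items by descending score."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    ranked = [labels[i] for i in order]
    best = dcg(sorted(labels, reverse=True), k)
    return dcg(ranked, k) / best if best > 0 else 0.0

labels = [1, 0, 0, 1]            # e.g. clicked = 1, not clicked = 0
perfect = [0.9, 0.1, 0.2, 0.8]   # clicked items scored highest
worst = [0.1, 0.9, 0.8, 0.2]     # clicked items scored lowest
print(ndcg(perfect, labels), ndcg(worst, labels))
```

Unlike AUC, the value depends on where in the list the relevant items land, which is why it suits a list-wise setting.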

My questions

The representation of session s is derived from the target, i.e. $rep(session)=f(target,other)$. So what is the relationship between the target and the n items in Figure 2? The paper says "each session with the contained item behaviors is treated as a list-wise training sample", but this is still not clear to me.
Why use search-engine logs? Wouldn't a recommendation dataset be more direct?

Paper 2: ListNet

Loss definition

The list-wise part of paper 1 borrows from reference [2], a Microsoft paper from ICML 2007.
The list-wise loss function is defined as:

$$\sum_{i=1}^{m} L\left(y^{(i)}, z^{(i)}\right) \tag{1}$$

where $m=|trainset|$ is the number of training lists, and $y^{(i)}=(y^{(i)}_1, y^{(i)}_2, \ldots, y^{(i)}_{n^{(i)}})$ is a list holding the relevance scores of the $n^{(i)}$ documents associated with query $q^{(i)}$. Similarly, $z^{(i)}=(f(x^{(i)}_1), f(x^{(i)}_2), \ldots, f(x^{(i)}_{n^{(i)}}))$ is the list of predicted relevance scores computed by the ranking function $f(\cdot)$.

probability model

Figure: permutation probability.
A list of size n has $n!$ permutations, so computing over all of them costs $O(n!)$, which is infeasible (NP-hard?). This is why the top one probability is proposed.

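Figure: top one probability.

Figure: Theorem 6, which gives the probability that doc j is ranked first.

Figure: once $\phi(\cdot)$ is defined as the exponential function, Theorem 6 can be rewritten and becomes a softmax.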
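The softmax form can be checked numerically on a tiny list. The sketch below assumes the permutation probability from the ListNet paper, $P_s(\pi)=\prod_{j=1}^{n}\phi(s_{\pi(j)})/\sum_{k=j}^{n}\phi(s_{\pi(k)})$ with $\phi=\exp$, sums it over all permutations that put item j first, and compares the result with a plain softmax. The function names and scores are illustrative only.

```python
import math
from itertools import permutations

def permutation_probability(perm, scores, phi=math.exp):
    """Probability of one permutation: at each position, phi(score of the chosen
    item) divided by the sum of phi over the items not yet placed."""
    prob = 1.0
    remaining = list(perm)
    for j in perm:
        prob *= phi(scores[j]) / sum(phi(scores[k]) for k in remaining)
        remaining.remove(j)
    return prob

def top_one_by_enumeration(j, scores):
    """Top one probability of item j, summed over the (n-1)! permutations
    that rank j first. Only feasible for very small n."""
    n = len(scores)
    return sum(permutation_probability(p, scores)
               for p in permutations(range(n)) if p[0] == j)

scores = [2.0, 1.0, 0.5]
total = sum(permutation_probability(p, scores) for p in permutations(range(3)))
z = sum(math.exp(s) for s in scores)
softmax = [math.exp(s) / z for s in scores]
brute = [top_one_by_enumeration(j, scores) for j in range(3)]
print(total, softmax, brute)
```

The brute-force values agree with the softmax up to floating-point error, while the softmax needs only $O(n)$ work instead of enumerating permutations.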

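Figure: softmax turns the label list and the prediction list into two probability distributions, and cross entropy between them is used as the loss function, which gives the final loss.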
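A minimal sketch of this loss for a single list, assuming the top one probability is $P(j)=\exp(s_j)/\sum_k \exp(s_k)$ for both the labels and the predicted scores. The function names and example values are hypothetical, and a real implementation would average the loss over all training lists as in Eq. (1).

```python
import math

def softmax(values):
    """Numerically stable softmax over a list of floats."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def listnet_loss(labels, scores):
    """Cross entropy between the top one distributions of the labels and of
    the predicted scores, for one list (one query or one session)."""
    p_true = softmax(labels)
    p_pred = softmax(scores)
    return -sum(p * math.log(q) for p, q in zip(p_true, p_pred))

labels = [4.0, 2.0, 0.0]
good = listnet_loss(labels, [3.0, 1.0, -1.0])  # same order and gaps as the labels
bad = listnet_loss(labels, [-1.0, 1.0, 3.0])   # reversed order
print(good < bad)  # True
```

Because softmax is invariant to adding a constant, scores that match the labels up to a shift already reach the minimum of this loss.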

Experiments

Datasets: one of them is CSearch, which comes from a commercial search engine. "This data set provides five levels of relevance judgments, ranging from 4 ("perfect match") to 0 ("bad match")."
Metrics: nDCG@5 and MAP (mean average precision).

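Discussion

What "list-wise" actually means
The so-called list-wise approach only concerns the loss function. At prediction time the model still scores each item point-wise and sorts by score to obtain the ranking.
The Seq2Slate paper from Google has a clear description of this: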

In listwise approaches the loss depends on the full permutation of items. Although these losses consider inter-item dependencies, the ranking function itself is pointwise, so at inference time the model still assigns a score to each item which does not depend on scores of other items (i.e., an item’s score will not change if it is placed in a different set).
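The point made in the quote can be illustrated with a toy scorer. Everything here is hypothetical: the dot-product `score` function stands in for any pointwise ranking function $f(x)$, and the item names and features are made up.

```python
def score(item_features, weights):
    """Hypothetical pointwise ranking function f(x): a plain dot product."""
    return sum(w * x for w, x in zip(weights, item_features))

def rank(candidates, weights):
    """Score each candidate independently, then sort by descending score."""
    scored = [(score(feats, weights), name) for name, feats in candidates.items()]
    return [name for _, name in sorted(scored, reverse=True)]

weights = [1.0, -0.5]
set_a = {"a": [2.0, 1.0], "b": [1.0, 0.0], "c": [0.0, 2.0]}
set_b = {"a": [2.0, 1.0], "d": [3.0, 0.0]}

print(rank(set_a, weights))  # ['a', 'b', 'c']
print(rank(set_b, weights))  # ['d', 'a']
# Item "a" gets the same score in both candidate sets.
print(score(set_a["a"], weights) == score(set_b["a"], weights))  # True
```

Only the position of "a" changes between the two sets, not its score, which is exactly the pointwise behaviour at inference time.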

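How does the loss compare with ordinary multi-class classification?
They are already very similar: in recsys, a retrieval task can be designed as a multi-class classification like the one in a transformer. The difference is that the usual label is one-hot (possibly with label smoothing), whereas here the target is a less peaked distribution.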
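To make the comparison concrete, the sketch below contrasts a one-hot target, a label-smoothed target, and a softmax-of-relevance target, using entropy as a rough measure of how peaked each one is. The relevance values and the smoothing factor `eps` are assumptions chosen for illustration; with widely spaced relevance grades the softmax target can itself become nearly one-hot.

```python
import math

def softmax(values):
    """Numerically stable softmax over a list of floats."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def entropy(p):
    """Shannon entropy in nats; zero-probability entries contribute nothing."""
    return -sum(x * math.log(x) for x in p if x > 0)

relevance = [2.0, 1.0, 0.0]   # graded relevance labels for three items
one_hot = [1.0, 0.0, 0.0]     # ordinary multi-class target
eps = 0.1                     # assumed label-smoothing factor
smoothed = [(1 - eps) * t + eps / len(one_hot) for t in one_hot]
listwise_target = softmax(relevance)

print(entropy(one_hot), entropy(smoothed), entropy(listwise_target))
```

For these values the entropy grows from the one-hot target to the smoothed one to the list-wise one, matching the "less peaked" description.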

References

[1] Session-aware Information Embedding for E-commerce Product Recommendation. Alibaba, CIKM 2017.
[2] Learning to Rank: From Pairwise Approach to Listwise Approach. ICML 2007.
