【The Task】

Given a sequence of click events performed by a user during a typical session on an e-commerce website, the goal is to predict whether the user will buy something and, if so, which items will be bought. The task can therefore be divided into two sub-goals:

  1. Is the user going to buy items in this session? Yes|No
  2. If yes, what are the items that are going to be bought?
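
The two sub-goals can be expressed as a pair of functions, where the second returns an empty set when no purchase is predicted. The sketch below is only an illustration of that structure: the function names, the integer item IDs, and the "clicked at least twice" rule are hypothetical and are not part of the challenge description.

```python
from collections import Counter
from typing import List, Set


def predict_items(clicked_items: List[int], min_clicks: int = 2) -> Set[int]:
    """Sub-goal 2: return the items predicted to be bought in this session.

    Hypothetical heuristic (an assumption, not the challenge's method):
    an item clicked at least `min_clicks` times is predicted as bought.
    """
    counts = Counter(clicked_items)
    return {item for item, n in counts.items() if n >= min_clicks}


def will_buy(clicked_items: List[int], min_clicks: int = 2) -> bool:
    """Sub-goal 1: the session is a "buy" session if any item is predicted."""
    return bool(predict_items(clicked_items, min_clicks))
```

Tying the Yes|No answer to the predicted item set keeps the two answers consistent: a session is never labelled "Yes" with no items attached.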

【The Data】

Training Data Files

The training data comprises two different files:

  1. yoochoose-clicks.dat - Click events. Each record/line in the file has the following fields:
    1. Session ID – the ID of the session. A session contains one or more clicks.
    2. Timestamp – the time when the click occurred.
    3. Item ID – the unique identifier of the item.
    4. Category – the category of the item.
  2. yoochoose-buys.dat - Buy events. Each record/line in the file has the following fields:
    1. Session ID – the ID of the session. A session contains one or more buy events.
    2. Timestamp – the time when the buy occurred.
    3. Item ID – the unique identifier of the item.
    4. Price – the price of the item.
    5. Quantity – how many of this item were bought.
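
The field lists above map directly onto two record types. The following is a minimal sketch of reading both files with the Python standard library. It assumes details the description does not state: comma-separated lines without a header row, timestamps in ISO 8601 form with a trailing `Z` (for example `2014-04-07T10:51:09.277Z`), integer IDs, and a numeric price. The class and function names are hypothetical.

```python
import csv
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Click:
    session_id: int
    timestamp: datetime
    item_id: int
    category: str  # kept as text; its encoding is not specified in the description


@dataclass
class Buy:
    session_id: int
    timestamp: datetime
    item_id: int
    price: float
    quantity: int


def parse_timestamp(text: str) -> datetime:
    # Assumed format: 2014-04-07T10:51:09.277Z
    return datetime.strptime(text, "%Y-%m-%dT%H:%M:%S.%fZ")


def read_clicks(lines):
    """Parse an iterable of text lines (for example an open file) into Click records."""
    return [
        Click(int(s), parse_timestamp(t), int(i), c)
        for s, t, i, c in csv.reader(lines)
    ]


def read_buys(lines):
    """Parse an iterable of text lines into Buy records."""
    return [
        Buy(int(s), parse_timestamp(t), int(i), float(p), int(q))
        for s, t, i, p, q in csv.reader(lines)
    ]
```

With these helpers, `read_clicks(open("yoochoose-clicks.dat"))` would return a list of `Click` records, provided the file matches the assumed layout.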

Every Session ID in yoochoose-buys.dat also exists in yoochoose-clicks.dat. The records that share a Session ID together form the sequence of click events of one user during that session. Depending on the user's activity, a session can be short (a few minutes) or very long (a few hours), and it can contain a single click or hundreds of clicks.
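
Because the Session ID links the two files, the click records can be grouped into sessions and each session labelled with the items bought in it. The sketch below is one possible way to do this, not a prescribed method. It assumes click records are given as `(session_id, timestamp, item_id)` tuples and buy records as `(session_id, item_id)` tuples; these shapes and all function names are hypothetical.

```python
from collections import defaultdict


def group_sessions(clicks):
    """Group (session_id, timestamp, item_id) tuples into time-ordered sessions."""
    sessions = defaultdict(list)
    for session_id, timestamp, item_id in clicks:
        sessions[session_id].append((timestamp, item_id))
    for events in sessions.values():
        events.sort()  # order each session's clicks by timestamp
    return dict(sessions)


def label_sessions(sessions, buys):
    """Map every session ID to the set of items bought in it (empty if none)."""
    bought = defaultdict(set)
    for session_id, item_id in buys:
        bought[session_id].add(item_id)
    return {sid: bought.get(sid, set()) for sid in sessions}


def session_summary(events, bought_items):
    """Summarise one session: click count, duration in seconds, buy flag."""
    duration = events[-1][0] - events[0][0]
    return {
        "clicks": len(events),
        "duration_seconds": duration.total_seconds(),
        "has_buy": bool(bought_items),
    }
```

The labels give the answer to both sub-goals for the training sessions: a non-empty set means "Yes", and its members are the items bought.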

Test File

The test data is a single file:

  1. yoochoose-test.dat - structured identically to the yoochoose-clicks.dat file of the training data:
    1. Session ID
    2. Timestamp
    3. Item ID
    4. Category

via:  http://2015.recsyschallenge.com/challenge.html
