http://math.mit.edu/~rothvoss/18.304.3PM/Presentations/1-Eric-Boosting304FinalRpdf.pdf

Consider MIT Admissions

【qualitative quantitative】

2-class system (Admit/Deny)
Both quantitative data and qualitative data
We treat yes/no (Y/N) answers as quantitative, encoded as values in {-1, +1}.
Region, for instance, is qualitative.
 
 
 
Rules of Thumb, Weak Classifiers
Easy to come up with rules of thumb that correctly classify the training data at
better than chance.
E.g. IF “GoodAtMath”==Y THEN predict “Admit”.
Difficult to find a single, highly accurate prediction rule. This is where our
boosting algorithm, AdaBoost, helps us.
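
To make the rule of thumb concrete, here is a minimal sketch that writes the "GoodAtMath" rule as a function and measures its training accuracy. The applicant records, the `Region` values, and the helper names are invented for illustration and are not from the slides.

```python
# Hypothetical toy applicant records (illustration only).
# Labels: +1 = Admit, -1 = Deny.
applicants = [
    ({"GoodAtMath": "Y", "Region": "Northeast"}, +1),
    ({"GoodAtMath": "Y", "Region": "Midwest"}, +1),
    ({"GoodAtMath": "N", "Region": "West"}, -1),
    ({"GoodAtMath": "Y", "Region": "South"}, -1),
    ({"GoodAtMath": "N", "Region": "Northeast"}, +1),
    ({"GoodAtMath": "N", "Region": "South"}, -1),
]


def good_at_math_rule(record):
    """Rule of thumb: predict Admit (+1) if GoodAtMath is 'Y', else Deny (-1)."""
    return +1 if record["GoodAtMath"] == "Y" else -1


def training_accuracy(rule, data):
    """Fraction of training examples that the rule labels correctly."""
    correct = sum(1 for record, label in data if rule(record) == label)
    return correct / len(data)


accuracy = training_accuracy(good_at_math_rule, applicants)
print(f"training accuracy: {accuracy:.2f}")
```

On this made-up data the rule is right on four of six applicants: better than chance, but far from a highly accurate classifier.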
 
 
 
What is a Weak Learner? 
【generalization error better than random guessing】
A weak learner is an algorithm that, for any distribution over the examples,
given polynomially many examples and polynomial time, finds with high probability
a classifier whose generalization error is better than random guessing
(below 1/2 for a two-class problem).
 
 
 
Weak Learning Assumption
 
We assume that our Weak Learning Algorithm (Weak
Learner) can consistently find weak classifiers (rules of
thumb that classify the data correctly more than 50% of the time).
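
One way to picture the assumption: given weights on the training examples, a weak learner returns some rule whose weighted error is below 0.5. The sketch below assumes a very simple weak learner that searches a fixed pool of threshold rules on one numeric feature. The data, the pool of thresholds, and the function names are all hypothetical.

```python
def weighted_error(predictions, labels, weights):
    """Total weight of the examples that were classified incorrectly."""
    return sum(w for p, y, w in zip(predictions, labels, weights) if p != y)


def make_stump(threshold):
    """Threshold rule: predict +1 below the threshold, -1 otherwise."""
    return lambda x: +1 if x < threshold else -1


def weak_learner(candidates, X, labels, weights):
    """Return (name, error) of the candidate rule with the lowest weighted error."""
    best_name, best_error = None, float("inf")
    for name, rule in candidates.items():
        err = weighted_error([rule(x) for x in X], labels, weights)
        if err < best_error:
            best_name, best_error = name, err
    return best_name, best_error


# Hypothetical one-dimensional training set with labels in {-1, +1}.
X = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
labels = [+1, +1, -1, +1, -1, -1]
weights = [1 / len(X)] * len(X)  # uniform distribution over the examples

candidates = {t: make_stump(t) for t in (1.5, 2.5, 3.5, 4.5, 5.5)}
best_threshold, best_error = weak_learner(candidates, X, labels, weights)
print(best_threshold, best_error)
```

Here the best rule still misclassifies one of the six points, so it is only a weak classifier, but its weighted error is well under 0.5, which is all the assumption asks for.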
 
【boosting】
 
Given this assumption, we can use boosting to generate a
single weighted classifier that classifies our
training data with 99%-100% accuracy.
 
 
 
【AdaBoost Specifics】
How does AdaBoost weight training examples optimally?
Focus on difficult data points: each round increases the weight of the
points that the previous weak classifier misclassified.
How does AdaBoost combine these weak classifiers into a
comprehensive prediction?
Use an optimally weighted majority vote of the weak classifiers.
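
A small sketch of what a weighted majority vote looks like. It uses the standard AdaBoost vote weight alpha = 0.5 * ln((1 - error) / error), which the slides presumably showed as an image. The three constant classifiers and their error rates are made up for illustration.

```python
import math


def classifier_weight(error):
    """Vote weight alpha = 0.5 * ln((1 - error) / error), for 0 < error < 1."""
    return 0.5 * math.log((1.0 - error) / error)


def weighted_vote(x, classifiers, alphas):
    """Sign of the alpha-weighted sum of the classifiers' +1/-1 predictions."""
    total = sum(alpha * h(x) for h, alpha in zip(classifiers, alphas))
    return +1 if total >= 0 else -1


# Hypothetical weak classifiers: one accurate rule voting +1,
# two barely-better-than-chance rules voting -1.
classifiers = [lambda x: +1, lambda x: -1, lambda x: -1]
errors = [0.1, 0.4, 0.4]
alphas = [classifier_weight(e) for e in errors]

decision = weighted_vote(0.0, classifiers, alphas)
print(alphas, decision)
```

The accurate classifier outvotes the two weaker ones even though it is in the minority, which is the difference between a weighted vote and a plain majority vote. A classifier with error exactly 0.5 would get weight 0.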
 
 
 
AdaBoost Technical Description
 
 
Missing details: How do we generate the distribution? How do we get a single classifier?
 
 
Constructing D_t
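
The formula for this step did not survive in the text, so the following is a sketch of the standard AdaBoost update rather than a transcription of the slide: start from the uniform distribution, then set D_{t+1}(i) proportional to D_t(i) * exp(-alpha_t * y_i * h_t(x_i)) and renormalize. The function name and the four-point example are assumptions.

```python
import math


def update_distribution(weights, predictions, labels, alpha):
    """One reweighting step: D_{t+1}(i) = D_t(i) * exp(-alpha * y_i * h(x_i)) / Z."""
    unnormalized = [
        w * math.exp(-alpha * y * p)
        for w, p, y in zip(weights, predictions, labels)
    ]
    z = sum(unnormalized)  # normalization constant Z_t
    return [u / z for u in unnormalized]


# Hypothetical round: four examples, uniform weights, the last one misclassified.
weights = [0.25, 0.25, 0.25, 0.25]
labels = [+1, +1, -1, -1]
predictions = [+1, +1, -1, +1]

error = sum(w for w, p, y in zip(weights, predictions, labels) if p != y)
alpha = 0.5 * math.log((1 - error) / error)
new_weights = update_distribution(weights, predictions, labels, alpha)
print(new_weights)
```

With this choice of alpha, the misclassified example ends up carrying half of the total weight, so the next weak classifier cannot do well by repeating the same mistake.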
 
 
 
 
Getting a Single Classifier
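
The slide content for this step is also missing. As a hedged reconstruction, the sketch below runs the usual AdaBoost loop end to end and returns the single classifier H(x) = sign(sum_t alpha_t * h_t(x)). The toy data, the pool of threshold rules, and the names `stump` and `adaboost` are illustrative assumptions.

```python
import math


def stump(threshold, polarity):
    """Weak classifier: predict `polarity` below the threshold, -polarity otherwise."""
    return lambda x: polarity if x < threshold else -polarity


def adaboost(X, labels, candidates, rounds):
    """Run AdaBoost for `rounds` rounds and return the combined classifier H."""
    m = len(X)
    weights = [1.0 / m] * m  # D_1 is uniform
    ensemble = []            # list of (alpha_t, h_t) pairs
    for _ in range(rounds):
        # Weak learner: pick the candidate with the smallest weighted error.
        best_h, best_err = None, float("inf")
        for h in candidates:
            err = sum(w for x, y, w in zip(X, labels, weights) if h(x) != y)
            if err < best_err:
                best_h, best_err = h, err
        # Clamp the error so the logarithm stays finite.
        eps = min(max(best_err, 1e-10), 1 - 1e-10)
        alpha = 0.5 * math.log((1 - eps) / eps)
        ensemble.append((alpha, best_h))
        # Reweight: misclassified points gain weight, the rest lose weight.
        weights = [
            w * math.exp(-alpha * y * best_h(x))
            for x, y, w in zip(X, labels, weights)
        ]
        z = sum(weights)
        weights = [w / z for w in weights]

    def H(x):
        """Weighted majority vote of the selected weak classifiers."""
        score = sum(alpha * h(x) for alpha, h in ensemble)
        return +1 if score >= 0 else -1

    return H


# Hypothetical training set that no single threshold rule classifies perfectly.
X = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
labels = [+1, +1, -1, +1, -1, -1]
candidates = [stump(t + 0.5, s) for t in range(7) for s in (+1, -1)]

H = adaboost(X, labels, candidates, rounds=3)
accuracy = sum(H(x) == y for x, y in zip(X, labels)) / len(X)
print(f"training accuracy after 3 rounds: {accuracy:.2f}")
```

On this toy data every individual rule misclassifies at least one point, yet the weighted vote of three rules classifies all six training points correctly.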

 
 
