Boosting AdaBoosting Algorithm
2024-10-20 11:56:08
http://math.mit.edu/~rothvoss/18.304.3PM/Presentations/1-Eric-Boosting304FinalRpdf.pdf
Consider MIT Admissions
【qualitative quantitative】
•
2-class system (Admit/Deny)
•
Both Quantitative Data and Qualitative Data
•
We consider (Y/N) answers to be Quantitative (-1,+1)
•
Region, for instance, is qualitative.
Rules of Thumb, Weak Classifiers
Easy to come up with rules of thumb that correctly classify the training data at
better than chance.
•
E.g. IF “GoodAtMath”==Y THEN predict “Admit”.
•
Difficult to find a single, highly accurate prediction rule. This is where our Weak
Learning Algorithm,AdaBoost, helps us.
What is a Weak Learner?
【generalization error better than random guessing】
For any distribution, with high probability, given polynomially many examples and polynomial time we can find a classifier with generalization error
better than random guessing.
Weak Learning Assumption
•
We assume that our Weak Learning Algorithm (Weak
Learner) can consistently find weak classifiers (rules of
thumb which classify the data correctly at better than 50%)
•
【boosting】
Given this assumption, we can use boosting to generate a
single weighted classifier which correctly classifies our
training data at 99%-100%.
【AdaBoost Specifics 】
•
How does AdaBoost weight training examples optimally?
•
Focus on difficult data points. The data points that have been
misclassified most by the previous weak classifier.
•
How does AdaBoost combine these weak classifiers into a
comprehensive prediction?
•
Use an optimally weighted majority vote of weak classifier.
AdaBoost Technical Description
Missing details: How to generate distribution? How to get single classifier?
Constructing Dt
Getting a Single Classifier
最新文章
- Python 装饰器学习
- 初识onselectstart
- 开创学习的四核时代-迅为iTOP4412学习开发板
- django 架构点点滴滴
- .NET事件的指导原则
- ReaderWriterLock类(转)
- REST响应处理
- noip2011 公交观光
- SRM 391(1-250pt)
- regular expression 基本语法
- 3、href和src的区别
- 扫描局域网内的ip和主机名
- js登录滑动验证,不滑动无法登陆
- ceph 高级运维
- 环境与工具3:从打字开始 | vim | sublime
- 【CF1151E】Number of Components
- BZOJ2870 最长道路
- Kubenetes 核心概念理解
- git branch 不显示的原因
- Docker与虚拟机技术