摘要: 本文是吴恩达 (Andrew Ng)老师《机器学习》课程,第一章《绪论:初识机器学习》中第2课时《什么是机器学习?》的视频原文字幕。为本人在视频学习过程中逐字逐句记录下来以便日后查阅使用。现分享给大家。如有错误,欢迎大家批评指正,在此表示诚挚地感谢!同时希望对大家的学习能有所帮助。

What is machine learning? In this article we will try to define what it is and also try to give you a sense of when you want to use machine learning. Even among machine learning practitioners, there isn't a well accepted definition of what is and what isn't machine learning. But let me show you a couple of examples of the ways that people have tried to define it.

Here's the definition of what is machine learning that is due to Arthur Samuel. He defined machine learning as the field of study that gives computers the ability to learn without being explicitly programmed. Samuel's claim to fame was that back in 1950's,he wrote a checkers playing program. And the amazing thing about this checkers playing program was that Arthur Samuel himself, wasn't a very good checkers player. But what he did was, he had a program play tens of thousands of games against itself. And by watching what sorts of board positions tended to lead to wins, and what sort of board positions tended to lead losses, the checkers playing program learns over time what are good board positions and what are bad board positions. And eventually learn to play checkers better than Arthur Samuel himself was able to. This was a remarkable result. Arthur Samuel himself turned out not to be a very good checkers player. But because the computer has the patience to play tens of thousands of games itself. No human has the patience to play that many games. By doing this the computer was able to get so much checkers-playing experience that it eventually became a better checkers player than Arthur Samuel himself. This is somewhat informal definition, and an older one.

Here's a slightly more recent definition by Tom Mitchell who's a friend in Carnegie Mellon. So Tom defines machine learning by saying that, a well-posed learning problem is defined as follows. He says, a computer program is said to learn from experience E, with respect to some task T, and some performance measurement P, if its performance on T as measured by P improves with experience E. I actually think he came up with this definition just to make it rhyme. For the checkers playing example, the experience E will be the experience of having the program play tens of thousands of games against itself. The task T, will be the task of playing checkers. And the performance measure P, will be the probability that it wins the next game of checkers against some new opponent.

Throughout these videos, besides me trying to teach you stuff, I will occasionally ask you a question to make sure you understand the content. Here's one.

On top is a definition of machine learning by Tom Mitchell. Let's say your email program watches which email you do or do not flag as spam. So in an email client like this (Gmail), you may click this spam button to report some email as spam, but not other emails. Based on which emails you mark as spam, so your email program learns better how to filter spam email. What is the task T in this setting? In a few seconds, the video will pause. And when it does so, you can use your mouse to select one of these four radio buttons to let me know which of these four you think is the right answer to this question. So hopefully you got that this (1st) is the right answer. Classifying emails is the task T. In fact, this definition defines task T, performance measure P, and this experience E. And so watching you label email as spam or not spam, this would be the experience E. And the fraction of emails correctly classified, that might be a performance measure P. And so, our system's performance on the task T, on the performance measure P will improve after the experience E.

In this class I hope to teach you about various different types of learning algorithms. There are several different types of learning algorithms. The main two types are what we call supervised learning and unsupervised learning. I'll define what these terms mean in the next couple videos (articles). But it turns out that in supervised learning, the idea is that we're going to teach the computer how to do something, whereas in unsupervised learning we're going to let it learn by itself. Don't worry if these two terms don't make sense yet, in the next two videos (articles), I'm going to say exactly what these two types of learning are. You will also hear other buzz terms, such as reinforcement learning and recommender systems. These are other types of learning algorithms that we'll talk about later. But the two most used types of learning algorithms are probably supervised learning and unsupervised learning and I'll define them in the next two videos (articles) and we'll spend most of this class talking about these two types of learning algorithms. It turns out one of the other things we'll spend a lot of time on in this class is practical advice for applying learning algorithms. This is something I feel pretty strong about, and it's actually something that I don't know of any other university teaches. Teaching about learning algorithm is like giving you a set of tools, and equally important or more important than giving you the tools is to teach you how to apply these tools. I like to make an analogy to learning to become a carpenter. Imagine that someone is teaching you how to be a carpenter, and they say here's a hammer, here's a screwdriver, here's a saw, good luck. Well, that's not good, right? You have all these tools, but the more important thing, is to learn how to use these tools properly. There's huge difference between people that know how to use these machine learning algorithms, versus people who don't know how to use these tools well. Here in Silicon Valley where I live, when I go visit different companies, even at the top Silicon Valley companies, very often I see people are trying to apply machine learning algorithms to some problem. And sometimes they have been going at it for six months. But sometimes when I look at what they're doing, I say, you know, I could have told them like, gee, I could have told you six months ago that you should be taking a learning algorithm and applying it like the slightly modified way, and your chance of success would have been much higher. So what we're going to do in this class is actually spend a lot of time talking about how, if you actually tried to develop a machine learning algorithm, how to make those best practices type decisions about the way in which you build the system, so that when you're applying learning algorithm, you're less likely to end up one of those people who end up pursuing some path for six months that, you know, someone else could have figured out it just wasn't gonna work at all, and it's just a waste of time for six months. So I'm actually going to spend a lot of the time teaching you those sorts of best practices in machine learning and AI, and how to get this stuff to work and how we do it, how the best people do it in Silicon Valley and around the world. I hope to make you one of the best people in knowing how to design and build serious machine learning and AI systems. So, that's machine learning and these are the main topics I hope to teach. In the next video (article), I'm going to define what is supervised learning and after that, what is unsupervised learning. And also, start to talk about when you would use each of them.

<end>

最新文章

  1. Android数据加密之Rsa加密
  2. Python之路Day14--html
  3. 并发框架Disruptor浅析
  4. java单例模式实现方式
  5. 如何在IE8设置透明背景
  6. #Mac技巧#如何在Mac系统上新建TXT文档,以及打开txt文稿的乱码问题如何解决
  7. 我是如何使用git的
  8. cocos2dx 安卓编译问题收集
  9. Objective-C ,ios,iphone开发基础:3分钟教你做一个iphone手机浏览器
  10. HDU - 4944 FSF’s game
  11. DELPHI加密字串(异或运算加密)
  12. Centos7架设NMP服务器笔记
  13. java数据结构与算法(一)
  14. SharePoint 2010 之寻找页面布局
  15. 吴恩达机器学习笔记61-应用实例:图片文字识别(Application Example: Photo OCR)【完结】
  16. git Disconnected:No supported authentication methods available问题解决
  17. Python科学计算和可视化
  18. JS 一张图理解prototype、proto和constructor的关系
  19. Python学习之旅(三十一)
  20. php $_REQUEST写法防注入突破

热门文章

  1. flask实现文件下载功能
  2. Postgresql pg_dump 与 pg_restore 使用举例
  3. bzoj 3629
  4. 特征工程学习01-sklearn单机特征工程
  5. 贴两个mysql优化的配置文件
  6. Mysql 生成指定时间段内的所有日期
  7. stdu1309(不老的传说)
  8. vue.js 中this.$router.push()的使用
  9. mysql解压文件安装
  10. Java 代码里乱打日志了,这才是正确的打日志姿势