4.2 The Regression Equation

Because we could draw many different lines through the cluster of data points, we need a method to choose the “best” line. The method, called the least-squares criterion, is based on an analysis of the errors made in using a line to fifit the data points.

存在有限个可能的的模型(可以使用之后的方法得到模型),从中取出最有可能的2个:并用最小二乘法计算error:

比如(a)中的e

最后得到:

计算,最后确定模型为b,这只是对模型的评价,生成模型可以使用以下方法:

推导:

Suppose that a scatterplot indicates a linear relationship between two variables. Then,within the range of the observed values of the predictor variable, we can reasonably use the regression equation to make predictions for the response variable. However,to do so outside that range, which is called extrapolation,

比如减价趋势下的产品价格,离开观测值范围后,价格可能会处于负值状态,所以线性关系必须注明自变量range

In the context of regression, an outlier is a data point that lies far from the regression line

Outliers and Influential Observations

Outliers是偏离直线太远的值

influential observation : a data point whose removal causes the regression equation (and line) to change considerably

Eg.在加入(2,169)前后的直线发生了巨大变化,所以(2,169)是一个influential observation

解决办法:

1.缩小x的range

2.添加influential observation 周围的点

Nonetheless, we may need either to remove it—thus limiting the analysis to Orions between 4 and 7 years old—or to obtain additional data on 2- and 3-year-old Orions so that the regression analysis is not so dependent on one data point

outlier和influential observation实际上很难分清:An outlier may or may not be an inflfluential observation, and an inflfluential observation may or may not be an outlier. Many statistical software packages identify potential outliers and inflfluential observations.

否则会出现:

该分布实际上应该为curvilinear regression

多重线性回归:

曲线回归:

最新文章

  1. Linux实战教学笔记08:Linux 文件的属性(上半部分)
  2. 运行第一个abp项目VS2015+localDB
  3. linux内核追踪——find_next_bit函数详详详解
  4. confluence wiki搭建使用
  5. 删除ecshop底部共执行个查询Gzip 已禁用,占用内存方法
  6. 进程间的通讯(IPC)方式
  7. cocos2dx动画Animation介绍
  8. unity中js脚本与c#脚本互相调用
  9. 浅析 GRUB 如何加载 linux kernel
  10. Oracle自治事务
  11. 持续交付工具ThoughtWorks Go部署step by step
  12. Android后台执行的定时器实现
  13. jQuery实现全选、反选和不选功能
  14. Xcode中编译iOS程序,运行出错:Thread 1: signal SIGABRT
  15. Java对数
  16. JavaScript基础笔记(八)DOM扩展
  17. php-parser在Aop编程中的使用
  18. LAMP平台部署
  19. 关于std::map的第三个参数
  20. HDU 2201 熊猫阿波的故事

热门文章

  1. XML--XML Schema Definition(二)
  2. 软件设计之基于Java的连连看小游戏(三)——所有功能的实现
  3. Map 查找表操作
  4. PAT Advanced 1053 Path of Equal Weight (30) [树的遍历]
  5. python学习笔记-模块和包
  6. Luogu P3031 高于中位数
  7. 记录一次URL中有特殊字符怎么处理?
  8. leetcode 690.员工的重要性
  9. MySQL--事务,隔离性和隔离级别
  10. Morse code(多模式串匹配)