Definition

目前关于机器学期比较公认的两种定义：

Arthur Samuel (1959) : Field of study that give computers the ability to learn without being explicitly programmed.

Arthur Samuel 主要想表达的是机器可以在无设定的条件下自主地学习，像人类对待未知的事物一样进行自主学习。这是一个对机器学习一个相对抽象的概述。

Tom Mitchell (1988) : A computer program is said to learn from experiece E with respect to some task T and some performance measure P, if its performce on T, as measured by P, improves with experience E.

Tom Mitchell 在这边提出了机器学习分别于三个参量E、T、P之间的联系，即机器不断从E活动中获取经验来更好地更有能力地完成任务T，P则是度量机器完成T的优劣情况。这就好比如人通过下 n 盘围棋来提高自己围棋的赢的胜算，P则是度量这个人下围棋的成功率。Tom Mitchell 将机器学习的定义更加细化了分工了，Amdrew Ng教授则诙谐地说这个定义可能是谐音有趣。

classification

机器学习算法主要有两大类：

—— Supervised learning （监督式学习）

( the idea is that we're goring to teach the computer how to do something)

—— Unsupervised learning (非监督式学习)

( the idea is that we're goirng let it learning by itself )

Others: Reinforcement learning, recommender systems

监督式学习主要是机器知道正确答案地学习，即邮件可判断为是垃圾邮件与否，标准答案已有，机机器则在已知答案的数据集中不断学习。

非监督式学习则不知道答案，只给数据集，让机器自主学习给出数据结构或解决公式。

Supervised Learning

Definition: in every example in our data set we are told what is the "correct answer" that we would have quite liked the algorithms have predicted on that examples.