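HTML data: extract from it the current PM2.5 concentration at each Beijing monitoring station, and output it in the format below. Requirement: write out each processing step and give an explanation.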
- Machine learning is the science of getting computers to learn without being explicitly programmed.
- Machine Learning Honor Code
- Grew out of work in AI
- New capability for computers
- Database mining: large datasets from the growth of automation and the web, e.g., web click data, medical records, biology, engineering.
- Applications we can't program by hand, e.g., autonomous helicopter, handwriting recognition, most of Natural Language Processing (NLP), computer vision.
- Self-customizing programs, e.g., Amazon and Netflix product recommendations.
- Understanding human learning (brain, real AI).
Goal: understand what machine learning is and get a sense of when you would want to use it.
- Arthur Samuel (1959). Machine Learning: Field of study that gives computers the ability to learn without being explicitly programmed.
- Tom Mitchell (1998). Well-posed Learning Problem: A computer program is said to learn from experience $E$ with respect to some task $T$ and some performance measure $P$, if its performance on $T$, as measured by $P$, improves with experience $E$.
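
Mitchell's definition can be made concrete with a toy sketch. Everything below is an illustrative assumption rather than part of the course material: the task $T$ is predicting $y = x^2$ on $[0, 1]$, the experience $E$ is a set of `n` labeled examples, the performance measure $P$ is mean absolute error on a fixed test grid, and the learner is a simple 1-nearest-neighbor predictor.

```python
# Toy illustration of Mitchell's definition (all names here are hypothetical).
# Task T: predict y = x^2 for x in [0, 1].
# Experience E: n labeled training examples (x, y).
# Performance P: mean absolute error on a fixed test grid (lower is better).

def target(x):
    return x * x

def make_training_set(n):
    """Return n evenly spaced labeled examples on [0, 1] (requires n >= 2)."""
    return [(i / (n - 1), target(i / (n - 1))) for i in range(n)]

def predict(training_set, x):
    """1-nearest-neighbor: return the label of the closest training example."""
    _, nearest_y = min(training_set, key=lambda example: abs(example[0] - x))
    return nearest_y

def mean_abs_error(training_set, test_xs):
    errors = [abs(predict(training_set, x) - target(x)) for x in test_xs]
    return sum(errors) / len(errors)

test_xs = [j / 200 for j in range(201)]
errors_by_experience = {
    n: mean_abs_error(make_training_set(n), test_xs) for n in (4, 16, 64)
}

for n, err in errors_by_experience.items():
    print(f"E = {n:2d} examples -> P (mean abs error) = {err:.4f}")
```

In this sketch the error shrinks as the number of examples grows, which is what "performance on $T$, as measured by $P$, improves with experience $E$" means in practice.
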
- Supervised learning
- Unsupervised learning
Others: reinforcement learning, recommender systems.
The course also covers practical advice for applying learning algorithms.
The “right answers” are given for every training example.
- Regression: predict a continuous-valued output (example: house prices).
- Classification: predict a discrete-valued output, such as 0 or 1 (example: breast cancer diagnosis).
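
The difference between the two problem types can be sketched in a few lines of plain Python. The data below is made up for illustration, and the two learners (a least-squares line and a midpoint threshold between class means) are deliberately simple stand-ins, not the algorithms the course goes on to develop.

```python
# Hypothetical toy data, made up for illustration only.

# --- Regression: predict a continuous value (price) from size ---
sizes = [50.0, 80.0, 100.0, 120.0, 150.0]
prices = [150.0, 240.0, 300.0, 360.0, 450.0]  # constructed as 3 * size

def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

slope, intercept = fit_line(sizes, prices)
predicted_price = slope * 110.0 + intercept  # continuous output

# --- Classification: predict a discrete label (0 or 1) from tumor size ---
tumor_sizes = [1.0, 1.5, 2.0, 3.5, 4.0, 4.5]
labels = [0, 0, 0, 1, 1, 1]  # 0 = benign, 1 = malignant

def fit_threshold(xs, ys):
    """Midpoint between the two class means.

    Assumes class 1 tends to have larger values than class 0.
    """
    class_0 = [x for x, y in zip(xs, ys) if y == 0]
    class_1 = [x for x, y in zip(xs, ys) if y == 1]
    return (sum(class_0) / len(class_0) + sum(class_1) / len(class_1)) / 2

def classify(x, threshold):
    return 1 if x > threshold else 0  # discrete output

threshold = fit_threshold(tumor_sizes, labels)

print("predicted price for size 110:", predicted_price)
print("label for tumor size 2.0:", classify(2.0, threshold))
print("label for tumor size 4.2:", classify(4.2, threshold))
```

Both learners use labeled examples; only the type of the output differs.
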
How do you deal with an infinite number of features, and how do you even store an infinite number of things on the computer?
The Support Vector Machine (SVM) relies on a neat mathematical trick that allows a computer to deal with an infinite number of features.
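
The notes do not name the trick; the sketch below assumes it is the kernel trick, in which a kernel function returns the inner product of two feature vectors without ever building those vectors. A degree-2 polynomial kernel is used first because its feature map is small enough to write out and check by hand. The Gaussian (RBF) kernel at the end is a standard example whose implicit feature space is infinite-dimensional. All function names are illustrative.

```python
import math

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def explicit_features(v):
    """Explicit feature map for the kernel (a . b)^2 on 2-D inputs."""
    x1, x2 = v
    return (x1 * x1, math.sqrt(2) * x1 * x2, x2 * x2)

def poly_kernel(a, b):
    """Same inner product as above, computed without building the features."""
    return dot(a, b) ** 2

def rbf_kernel(a, b, gamma=1.0):
    """Gaussian kernel; its implicit feature space is infinite-dimensional."""
    sq_dist = sum((p - q) ** 2 for p, q in zip(a, b))
    return math.exp(-gamma * sq_dist)

a = (1.0, 2.0)
b = (3.0, 0.5)

explicit = dot(explicit_features(a), explicit_features(b))
implicit = poly_kernel(a, b)

print("explicit feature-space inner product:", explicit)
print("kernel value:", implicit)
print("RBF kernel value:", rbf_kernel(a, b))
```

The two polynomial results agree, so the algorithm never needs to store the feature vectors themselves, only kernel values between pairs of examples.
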
Data does not have any labels.
- Google News (clusters news stories on the same topic).
- DNA microarray data.
- Organize large computer clusters.
- Social network analysis
- Market segmentation
- Astronomical data analysis (how galaxies are formed).
Clustering is just one type of Unsupervised Learning.
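
As a minimal sketch of what clustering does with unlabeled data, the block below runs k-means on a handful of made-up one-dimensional points. K-means is chosen here only as a familiar clustering algorithm; the notes at this point do not specify one, and the points and starting centroids are assumptions for illustration.

```python
def k_means_1d(points, centroids, iterations=10):
    """Plain k-means on 1-D points with caller-supplied starting centroids.

    Returns the final centroids and the clusters from the last assignment step.
    """
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        centroids = [
            sum(c) / len(c) if c else centroids[i] for i, c in enumerate(clusters)
        ]
    return centroids, clusters

# Unlabeled data: no "right answers" are supplied, only the points themselves.
points = [1.0, 1.2, 0.8, 5.0, 5.3, 4.7]
centroids, clusters = k_means_1d(points, centroids=[0.0, 6.0])

print("centroids:", centroids)
print("clusters:", clusters)
```

The algorithm is never told which group a point belongs to; the two groups emerge from the structure of the data alone.
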
Another example: recognize and separate the individual audio sources in a mixed recording.
When you use the right programming environment, many learning algorithms can be really short programs.
If you use Octave as your learning and prototyping tool, you can learn and prototype learning algorithms much more quickly.