Course Objectives
This course introduces basic and advanced techniques for mining very large data sets, meaning data too large to fit in main memory. The course takes an algorithmic point of view: data mining is about applying algorithms to data, rather than using data to “train” a machine-learning engine of some sort. The goal of the course is to help students understand and exploit the techniques of a new computing paradigm called data-intensive scalable computing.