SYLLABUS
Data Mining and Statistical Learning, 15 ECTS Credits
 
COURSE CATEGORY   Course within Master's Programme in Statistics, Data Analysis and Knowledge Discovery
MAIN FIELD OF STUDY   Statistics - STA
SUBJECT AREA  
COURSE CODE   732A20
AIM OF THE COURSE
The course lays the foundation for professional work and research in which large amounts of data are explored, modified, modelled and assessed to uncover previously unknown patterns and trends.

Having completed the course, the student should be able to:
- utilize powerful statistical software to explore large and complex data sets, derive data-based predictors and classifiers, and assess the performance of such tools,
- use knowledge about powerful techniques for data-based prediction and classification,
- display a good understanding of the major principles for statistical learning from data,
- demonstrate insightful assessment of the quality of given data sets and the information content on which predictions and classifications can be based.
CONTENTS
The course content comprises practical as well as theoretical elements, for example:
- computer exercises,
- basic concepts in statistical learning, in particular supervised learning,
- model selection strategies involving the use of training sets, validation sets, and test sets,
- decision trees and linear classification methods, such as discriminant analysis,
- classification and prediction based on neural networks, support vector machines, and generalized additive models, including logistic regression,
- ridge regression, spline smoothers and roughness penalty techniques,
- ensemble methods, including bagging and boosting.
TEACHING
Computer exercises in which the students have access to supervision provide practical experience of data analysis. The teaching comprises lectures, seminars, and computer exercises. The lectures are devoted to presentations of theories, concepts, and methods. The seminars comprise student presentations and discussions of assignments.
Language of instruction: English.
EXAMINATION
Assignments encompassing computer-based data analysis. One final written examination.



Students who have passed an examination may not retake it in order to improve their grades.
ADMISSION REQUIREMENTS

Students entering the course should have passed at least one course in basic statistics and be familiar with linear statistical models, in particular simple and multiple regression. Students must also have passed courses in calculus and linear algebra. Documented knowledge of English equivalent to ”Engelska” is required, or an international proficiency test, e.g. TOEFL, minimum score 550/213.
GRADING
The course is graded according to the ECTS grading scale A-F.
CERTIFICATE
COURSE LITERATURE
The course literature is decided upon by the department in question.
OTHER INFORMATION
The planning and implementation of a course must take the wording of the syllabus as their starting point. The course evaluation included in each course must therefore address the question of how well the course agrees with the syllabus.

The course is carried out in such a way that both men's and women's experience and knowledge are made visible and developed.
 
 
Department responsible for the course or equivalent: MAI - Department of Mathematics
Registrar No: 1330/06-41
Course Code: 732A20
Exam codes: see Local Computer System
Subject/Subject Area: Statistics - STA
Level: A1X
Education level: Advanced level
Subject Area Code: STA
Field of Education: SA