Study Guide@lith   Link to LiU Homepage
 

Linköping Institute of Technology

Link to LiU Homepage
 
Valid for year : 2011
 
TANA07 Data mining using matrix methods, 6 ECTS credits.
/Datautvinning med matrismetoder/

For:   CS   D   IT   MMAT  


OBS!

Overlapping course contents: TANA25/TANA08

 

Prel. scheduled hours: 38
Rec. self-study hours: 122

  Area of Education: Science

Main field of studies: Mathematics, Applied Mathematics

  Advancement level (G1, G2, A): A

Aim:
Many problems in data mining and patter recognition can be solved using matrix methods. Student should acquire knowledge of basic concepts in the area and be acquainted with selected algorithms. After the course the student is expected to be able to 1) use the singular value decomposition (SVD) and similar matrix factorizations to solve least squares problems and compute orthogonal bases, 2) use SVD, clustering and similar methods to do text mining, pattern recognition and computation of pagerank, 3) use simple software (parser) for text processing for text mining

Prerequisites: (valid for students admitted to programmes within which the course is offered)
Basic courses in numerical algorithms (scientific computing) and programming.

Note: Admission requirements for non-programme students usually also include admission requirements for the programme and threshhold requirements for progression within the programme, or corresponding.

Organisation:
The theory is presented mainly at seminars. Programming projects will give practical experience in solving applied problems and using suitable tools.

Course contents:
Orthogonal transformations, QR decomposition, least squares problems. Eigenvalue and singular value decomposition (SVD), separation of signal and noise. Data compression by SVD. Algorithmic aspects of matrix factorizations (robustness, efficiency, program libraries), the use of the algorithms as building blocks for solving applied problems. Pattern recognition (handwritten digits). Information retrieval and search engines, text-mining. Clustering and classification.

Course literature:
L. Eldén: Matrix methods in data mining and pattern recognition, SIAM 2007

Examination:
Written examination.
Laboratory work.
3 ECTS
3 ECTS
 



Course language is English.
Department offering the course: MAI.
Director of Studies: Torbjörn Larsson
Examiner: Lars Eldén
Link to the course homepage at the department


Course Syllabus in Swedish

Linköping Institute of Technology

Link to top of pagep


Contact: TFK , val@tfk.liu.se
Last updated: 04/26/2011