|Course Title||Machine Learning and Data Mining|
|Classes||Timetable for all classes|
|Units of Credit||6|
This course explores machine learning as the algorithmic approach to learning from data. The course also covers key aspects of data mining, which is understood as the application of machine learning tools to obtain insight from data. Algorithms are placed in the context of their theoretical foundations in order to understand their derivation and correct application. Topics include: linear models for regression and classification, local methods (nearest neighbour), neural networks, tree learning, kernel machines, unsupervised learning, ensemble learning, computational and statistical learning theory, and Bayesian learning. To extend the theory and algorithms presented in lectures, practical applications will be given in tutorials, programming tasks and assignments.
Before commencing this course, students should have completed the pre-requisite courses (or equivalent) and ensure they have acquired knowledge in the relevant areas:
After completing this course, students will be able to:
This course contributes to the development of the following graduate capabilities:
|Graduate Capability||Acquired in|
|Scholars capable of independent and collaborative enquiry, rigorous in their analysis, critique and reflection, and able to innovate by applying their knowledge and skills to the solution of novel as well as routine problems||lectures, tutorials, assignments and exams|
|Entrepreneurial leaders capable of initiating and embracing innovation and change, as well as engaging and enabling others to contribute to change||assignment 2 (project)|
|Professionals capable of ethical, self-directed practice and independent lifelong learning||suggested references, tutorials and assignments|
|Global citizens who are culturally adept and capable of respecting diversity and acting in a socially just and responsible way||lectures, tutorials and assignment 2 (project)|
This course is taught to emphasise that theory, algorithms and empirical work are essential inter-dependent components of machine learning. Teaching is mainly focused on lectures and assignments, with tutorials to expand and reinforce the lecture content. Assessment is by mid-session and final exams, and two marked assignments. The assignments are aimed at giving students an opportunity for active learning in a structured way with submission deadlines. The purpose is to give students practical experience of machine learning and relate lecture material to real applications. The second assignment has a broad scope and should be treated as a small-scale project with submission of software and a written report.
The Student Code of Conduct (Information, Policy) sets out what the University expects from students as members of the UNSW community. As well as the learning, teaching and research environment, the University aims to provide an environment that enables students to achieve their full potential and to provide an experience consistent with the University's values and guiding principles. A condition of enrolment is that students inform themselves of the University's rules and policies affecting them, and conduct themselves accordingly.
In particular, students have the responsibility to observe standards of equity and respect in dealing with every member of the University community. This applies to all activities on UNSW premises and all external activities related to study and research. This includes behaviour in person as well as behaviour on social media, for example Facebook groups set up for the purpose of discussing UNSW courses or course work. Behaviour that is considered in breach of the Student Code Policy as discriminatory, sexually inappropriate, bullying, harassing, invading another's privacy or causing any person to fear for their personal safety is serious misconduct and can lead to severe penalties, including suspension or exclusion from UNSW.
Plagiarism is defined as using the words or ideas of others and presenting them as your own. UNSW and CSE treat plagiarism as academic misconduct, which means that it carries penalties as severe as being excluded from further study at UNSW. There are several on-line sources to help you understand what plagiarism is and how it is dealt with at UNSW:
Make sure that you read and understand these. Ignorance is not accepted as an excuse for plagiarism. In particular, you are responsible for ensuring that your assignment files are not accessible to anyone but you, by setting the correct permissions on your CSE directory and on your code repository, if you use one. Note also that plagiarism includes paying or asking another person to do a piece of work for you and then submitting it as your own work.
UNSW has an ongoing commitment to fostering a culture of learning informed by academic integrity. All UNSW staff and students have a responsibility to adhere to this principle of academic integrity. Plagiarism undermines academic integrity and is not tolerated at UNSW. Plagiarism at UNSW is defined as using the words or ideas of others and passing them off as your own.
If you haven't done so yet, please take the time to read the full text of
The pages below describe the policies and procedures in more detail:
You should also read the following page which describes your rights and responsibilities in the CSE context:
|Item||Topics||Due||Marks||Contributes to learning outcomes|
|Assignment 1||Applications of machine learning algorithms||Week 4||10%||1,2,3,4,5|
|Midterm||All topics covered up to Week 5||Week 6||10%||1,2,3,4,5,6|
|Assignment 2||Machine learning project||Week 13||30%||1-6,9|
|Final Exam||All topics||Exam period||50%||1-8|
Assignment submission will be electronic via the CSE give system. Details of submission, deadlines and late penalties, etc. will be in the assignment specifications.
The overall course mark will be the sum of the marks for the course components.
The outcome of attaining a course mark in the range 45-49 will be decided on a case-by-case basis.
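As a rough illustration of how the component marks combine, the following minimal Python sketch sums the marks for the four assessment items. It assumes each item is marked out of its percentage weight (10, 10, 30 and 50), that a missing item counts as zero, and it uses a made-up set of marks; the dictionary keys and the function name `course_mark` are hypothetical and not part of any official marking tool.

```python
# Maximum marks per component, taken from the assessment weights above.
# Assumption: each component is marked out of its percentage weight.
MAX_MARKS = {
    "Assignment 1": 10,
    "Midterm": 10,
    "Assignment 2": 30,
    "Final Exam": 50,
}


def course_mark(marks):
    """Return the overall course mark as the sum of the component marks."""
    total = 0.0
    for item, maximum in MAX_MARKS.items():
        mark = marks.get(item, 0.0)  # assumption: a missing component counts as 0
        if not 0 <= mark <= maximum:
            raise ValueError(f"{item}: mark {mark} is outside 0..{maximum}")
        total += mark
    return total


# Hypothetical marks for one student (illustrative values only).
example = {"Assignment 1": 8, "Midterm": 7, "Assignment 2": 24, "Final Exam": 35}
print(course_mark(example))  # 74.0
```

The sketch only adds up marks; it does not model the case-by-case decision for course marks in the range 45-49.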
Note: this schedule may be subject to change!
|Week||Lectures||Tutorials||Assignments||Exams|
|1||Introduction to Machine Learning||-||-||-|
|2||Supervised Learning - Regression||Introductory topics||-||-|
|3||Supervised Learning - Classification||Regression||-||-|
|4||Supervised Learning - Tree Learning||Classification||Ass1 due||-|
|5||Supervised Learning - Neural Learning||Tree Learning||-||-|
|6||Supervised Learning - Kernel Methods||Neural Learning||-||Midterm|
|7||Unsupervised Learning||Kernel Methods||-||-|
|8||No classes (public holiday)||-||Ass2 proposals due||-|
|9||Learning Theory||Unsupervised Learning||-||-|
|10||Ensemble Learning||Learning Theory||-||-|
|11||Bayesian Inference - Supervised Learning||Ensemble Learning||-||-|
|12||Bayesian Inference - Unsupervised Learning||Bayes - supervised||-||-|
|13||-||Bayes - unsupervised||Ass2 due||-|
Owing to the expansion of machine learning in recent years, and the wide availability of online materials, it is no longer possible to recommend a single textbook for this course. However, below is a list of books (those with an asterisk have copies freely available online) that can be consulted to back up and expand on the course content. If you plan to continue with machine learning, any of these (and many others) are worth owning:
Other resources (e.g. links to on-line documentation) will be made available in the relevant course materials.
This course is evaluated each session using the myExperience system.
In the previous offering of this course, students requested that feedback on assessment results should be made available more quickly.
Based on these comments, we are restructuring some of the assessment methods to better address the needs of a larger class.
Resource created Wednesday 21 February 2018, 03:57:01 PM, last modified Thursday 06 June 2019, 04:26:49 PM.