This research-led course provides an introduction to recent developments in Random Matrix Theory and Online Learning that addresses the challenges and opportunities posed by the availability of large amounts of data.
In the first instance, we will review some classic results from multivariate statistical theory, matrix analysis, and probability theory. Then we will present the salient statistical features of big data (e.g., heterogeneity, noise accumulation, spurious correlation, and incidental endogeneity) and show how this impacts on traditional statistical methods and theory.
We follow with an introduction to modern Random Matrix theory and its application in statistics. Applications presented may include topics such as high-dimensional statistical inference, large covariance matrices, large-scale statistical learning through subsampling, sparsification of large matrices, principal component analysis, and dimension reduction.
We conclude with an introduction to the theory of online learning (aka. sequential prediction) to handle the situation of streaming data.
Students will use and learn about the latest computational tools to work with big and streaming data sets. Example data sets may be drawn from areas such finance, web analytics, digital marketing, and satellite imagery data.
Learning Outcomes
Upon successful completion, students will have the knowledge and skills to:
- Explain in detail how statistical features of big data impacts traditional statistical methods and theory.
- Discuss in depth random Matrix theory and its application in statistics on large scale.
- Critically discuss the theory of sequential prediction and management of streaming data.
- Demonstrate in detail the use of computational tools to work with big and streaming data sets.
Research-Led Teaching
This course is based on recent research papers and surveys applications of random matrix theory in statistics. The topic is rapidly advancing and recent results may be introduced into the course as they appear in the literature.
Examination Material or equipment
Working version of RStudio and R.
Required Resources
Working version of RStudio and R. Resources (research papers, etc) and lecture notes will be provided throughout the semester on Wattle.
Staff Feedback
Students will be given feedback in the following forms in this course:- Written comments
- Verbal comments
- Feedback to the whole class, to groups, to individuals, focus groups
Student Feedback
ANU is committed to the demonstration of educational excellence and regularly seeks feedback from students. Students are encouraged to offer feedback directly to their Course Convener or through their College and Course representatives (if applicable). The feedback given in these surveys is anonymous and provides the Colleges, University Education Committee and Academic Board with opportunities to recognise excellent teaching, and opportunities for improvement. The Surveys and Evaluation website provides more information on student surveys at ANU and reports on the feedback provided on ANU courses.Class Schedule
Week/Session | Summary of Activities | Assessment |
---|---|---|
1 | Introduction to the challenges of Big Data and overview of the course. Review of some prerequisite concepts. | |
2 | Further matrix analysis, eigenvalues and eigenvectors, the multivariate normal distribution. | |
3 | Fundamental tools for studying limiting spectral distributions, Marcenko-Pastur distributions, Fisher spectral distribution. | Assignment 1 Due |
4 | CLT for linear spectral statistics: Introduction and integration tools. | |
5 | Moments and statistics of the Marcenko-Pastur distribution. | |
6 | CLT for linear spectral statistics: Sample covariance matrix, Bai and Silverstein’s CLT, CLT for random Fisher matrices. | Assignment 2 Due |
7 | Generalised variance in higher dimensions. | Assignment 3 Due |
8 | Multiple correlation coefficient. | |
9 | Multivariate linear regression in the high-dimensional setting. | Assignment 4 Due |
10 | PCA and high-dimensional spiked population models. | |
11 | Applications and recent theoretical results. | |
12 | Applications and recent theoretical results. | Assignment 5 Due |
Tutorial Registration
Tutorial registration will be available two weeks prior to the beginning of the semester and will close at the end of week 1. More details can be found on the Timetable webpage. https://www.anu.edu.au/students/program-administration/timetabling.
Assessment Summary
Assessment task | Value | Due Date | Return of assessment | Learning Outcomes |
---|---|---|---|---|
Assignment 1 | 20 % | 08/08/2023 | 22/08/2023 | 1,2,3,4 |
Assignment 2 | 20 % | 29/08/2023 | 12/09/2023 | 1,2,3,4 |
Assignment 3 | 20 % | 19/09/2023 | 03/10/2023 | 1,2,3,4 |
Assignment 4 | 20 % | 04/10/2023 | 18/10/2023 | 1,2,3,4 |
Final Project | 20 % | 02/11/2023 | 30/11/2023 | 1,2,3,4 |
* If the Due Date and Return of Assessment date are blank, see the Assessment Tab for specific Assessment Task details
Policies
ANU has educational policies, procedures and guidelines, which are designed to ensure that staff and students are aware of the University’s academic standards, and implement them. Students are expected to have read the Academic Misconduct Rule before the commencement of their course. Other key policies and guidelines include:- Student Assessment (Coursework) Policy and Procedure
- Special Assessment Consideration Policy and General Information
- Student Surveys and Evaluations
- Deferred Examinations
- Student Complaint Resolution Policy and Procedure
Assessment Requirements
The ANU is using Turnitin to enhance student citation and referencing techniques, and to assess assignment submissions as a component of the University's approach to managing Academic Integrity. For additional information regarding Turnitin please visit the ANU Online website Students may choose not to submit assessment items through Turnitin. In this instance you will be required to submit, alongside the assessment item itself, hard copies of all references included in the assessment item.Moderation of Assessment
Marks that are allocated during Semester are to be considered provisional until formalised by the College examiners meeting at the end of each Semester. If appropriate, some moderation of marks might be applied prior to final results being released.Participation
This course has 4 hours of contact time per week, this consists of:
(a) In-person weekly lectures where the theory material is covered (2 hours). These lectures will be held on campus and recorded.
(b) A 1 hour weekly workshop where applications and illustrations of the theory are demonstrated. This will be a live RStudio session.
(c) A weekly tutorial held in a computer lab (1 hour) where you get the opportunity to practice what you have learnt.
Students are encouraged to attend all of these sessions on a weekly basis to take notes and ask questions.
Examination(s)
There is no final examination for this course. The final assessment is a take-home project (Assessment Task 5).
Assessment Task 1
Learning Outcomes: 1,2,3,4
Assignment 1
This assessment is to be done individually. The final project will be a take-home assessment that will typically involve a mix of ‘pen-and-paper’ questions and/or ‘computational’ questions. This assessment will match the style of questions given in the tutorials, so practicing on those questions would be quite beneficial. The question(s) will cover material that has been seen in previous lectures and are aimed at ensuring students are routinely studying the material. The assessment will be released two weeks before the due date. The assessment will be submitted in Wattle using TurnitIn by 9:00am on the due date and marks/feedback will be given on the 'Return of Assessment Date'.
Assessment Task 2
Learning Outcomes: 1,2,3,4
Assignment 2
This assessment is to be done individually. The final project will be a take-home assessment that will typically involve a mix of ‘pen-and-paper’ questions and/or ‘computational’ questions. This assessment will match the style of questions given in the tutorials, so practicing on those questions would be quite beneficial. The question(s) will cover material that has been seen in previous lectures and are aimed at ensuring students are routinely studying the material. The assessment will be released two weeks before the due date. The assessment will be submitted in Wattle using TurnitIn by 9:00am on the due date and marks/feedback will be given on the 'Return of Assessment Date'.
Assessment Task 3
Learning Outcomes: 1,2,3,4
Assignment 3
This assessment is to be done individually. The final project will be a take-home assessment that will typically involve a mix of ‘pen-and-paper’ questions and/or ‘computational’ questions. This assessment will match the style of questions given in the tutorials, so practicing on those questions would be quite beneficial. The question(s) will cover material that has been seen in previous lectures and are aimed at ensuring students are routinely studying the material. The assessment will be released two weeks before the due date. The assessment will be submitted in Wattle using TurnitIn by 9:00am on the due date and marks/feedback will be given on the 'Return of Assessment Date'.
Assessment Task 4
Learning Outcomes: 1,2,3,4
Assignment 4
This assessment is to be done individually. The final project will be a take-home assessment that will typically involve a mix of ‘pen-and-paper’ questions and/or ‘computational’ questions. This assessment will match the style of questions given in the tutorials, so practicing on those questions would be quite beneficial. The question(s) will cover material that has been seen in previous lectures and are aimed at ensuring students are routinely studying the material. The assessment will be released two weeks before the due date. The assessment will be submitted in Wattle using TurnitIn by 9:00am on the due date and marks/feedback will be given on the 'Return of Assessment Date'.
Assessment Task 5
Learning Outcomes: 1,2,3,4
Final Project
This assessment is to be done individually. The final project will be a take-home assessment that will typically involve a mix of ‘pen-and-paper’ questions and/or ‘computational’ questions. This assessment will match the style of questions given in the tutorials, so practicing on those questions would be quite beneficial. The question(s) will cover material that has been seen in previous lectures and are aimed at ensuring students are routinely studying the material. The assessment will be released two weeks before the due date. The assessment will be submitted in Wattle using TurnitIn by 9:00am on the due date and marks/feedback will be given on the 'Return of Assessment Date'.
Academic Integrity
Academic integrity is a core part of our culture as a community of scholars. At its heart, academic integrity is about behaving ethically. This means that all members of the community commit to honest and responsible scholarly practice and to upholding these values with respect and fairness. The Australian National University commits to embedding the values of academic integrity in our teaching and learning. We ensure that all members of our community understand how to engage in academic work in ways that are consistent with, and actively support academic integrity. The ANU expects staff and students to uphold high standards of academic integrity and act ethically and honestly, to ensure the quality and value of the qualification that you will graduate with. The University has policies and procedures in place to promote academic integrity and manage academic misconduct. Visit the following Academic honesty & plagiarism website for more information about academic integrity and what the ANU considers academic misconduct. The ANU offers a number of services to assist students with their assignments, examinations, and other learning activities. The Academic Skills and Learning Centre offers a number of workshops and seminars that you may find useful for your studies.Online Submission
The ANU uses Turnitin to enhance student citation and referencing techniques, and to assess assignment submissions as a component of the University's approach to managing Academic Integrity. The use of Turnitin is mandatory without an exemption. Any student identified, either during the current semester or in retrospect, as having used ghost writing services will be investigated under the University’s Academic Misconduct Rule. The assignments and final project must be done individually, any similarities between results will be investigated under the University’s Academic Misconduct Rule.
Hardcopy Submission
All assessment submission in the course is online.
Late Submission
Late submission of assignments will incur a penalty of 5% per working day. Further, any late assessment tasks will not to be accepted after the earlier of the following: (1) The tenth working day after the due date; or, (2) The date specied in the class summary for the return of the assessment item.
Referencing Requirements
Accepted academic practice for referencing sources that you use in presentations can be found via the links on the Wattle site, under the file named “ANU and College Policies, Program Information, Student Support Services and Assessment”. Alternatively, you can seek help through the Students Learning Development website.Returning Assignments
Assignments will be graded and returned on Wattle via TurnitIn. Students are encouraged to use the consultation time of the tutor and lecturer to obtain further feedback if necessary.
Extensions and Penalties
Extensions and late submission of assessment pieces are covered by the Student Assessment (Coursework) Policy and Procedure The Course Convener may grant extensions for assessment pieces that are not examinations or take-home examinations. If you need an extension, you must request an extension in writing on or before the due date. If you have documented and appropriate medical evidence that demonstrates you were not able to request an extension on or before the due date, you may be able to request it after the due date.Resubmission of Assignments
Assignments may not be resubmitted.
Privacy Notice
The ANU has made a number of third party, online, databases available for students to use. Use of each online database is conditional on student end users first agreeing to the database licensor’s terms of service and/or privacy policy. Students should read these carefully. In some cases student end users will be required to register an account with the database licensor and submit personal information, including their: first name; last name; ANU email address; and other information. In cases where student end users are asked to submit ‘content’ to a database, such as an assignment or short answers, the database licensor may only use the student’s ‘content’ in accordance with the terms of service — including any (copyright) licence the student grants to the database licensor. Any personal information or content a student submits may be stored by the licensor, potentially offshore, and will be used to process the database service in accordance with the licensors terms of service and/or privacy policy. If any student chooses not to agree to the database licensor’s terms of service or privacy policy, the student will not be able to access and use the database. In these circumstances students should contact their lecturer to enquire about alternative arrangements that are available.Distribution of grades policy
Academic Quality Assurance Committee monitors the performance of students, including attrition, further study and employment rates and grade distribution, and College reports on quality assurance processes for assessment activities, including alignment with national and international disciplinary and interdisciplinary standards, as well as qualification type learning outcomes. Since first semester 1994, ANU uses a grading scale for all courses. This grading scale is used by all academic areas of the University.Support for students
The University offers students support through several different services. You may contact the services listed below directly or seek advice from your Course Convener, Student Administrators, or your College and Course representatives (if applicable).- ANU Health, safety & wellbeing for medical services, counselling, mental health and spiritual support
- ANU Diversity and inclusion for students with a disability or ongoing or chronic illness
- ANU Dean of Students for confidential, impartial advice and help to resolve problems between students and the academic or administrative areas of the University
- ANU Academic Skills and Learning Centre supports you make your own decisions about how you learn and manage your workload.
- ANU Counselling Centre promotes, supports and enhances mental health and wellbeing within the University student community.
- ANUSA supports and represents undergraduate and ANU College students
- PARSA supports and represents postgraduate and research students
Convener
![]() |
|
|||
Research InterestsProbability theory, stochastic process, and applications to large-scale data science problems. |
AsPr Dale Roberts
![]() |
|
Instructor
![]() |
|
|||
Research Interests |
AsPr Dale Roberts
![]() |
|