JCU Logo

James Cook University Subject Handbook - 2024

For subject information from 2025 and onwards, please visit the new JCU Course and Subject Handbook website.

MA5405 - Data Mining

Credit points:03
Year:2024
Student Contribution Band:Band 1
Prerequisites:MA2405 OR MA2000 OR SC2202 OR SC2209 OR SC5202 OR MA5821
Administered by:College of Science and Engineering

Available to postgraduate science students.

Subject Description

    Recent advances in technology makes it possible to collect, store and analyse very large data sets. Consequently, the contemporary scientist must be skilled in extracting important information embedded in large and complex data sets if they are to offer advances in knowledge to industry, business, research and societies of the 21st century. Moreover, employers are increasingly demanding that graduates can make important discoveries by interrogating large data sets. This subject will provide the bridge between mathematical theory and applied computing methods via the R programming language to give students a strong grounding in statistical learning methods for analysing Big Data sets. A range of supervised and unsupervised learning methods will be covered.

Learning Outcomes

  • translate between mathematical, visual and conceptual characterisations of statistical learning methods suitable for Big Data
  • evaluate large and complex data sets using appropriate data mining techniques
  • design, implement and validate supervised and unsupervised machine learning systems
  • implement statistical models in the R computing environment
  • learn techniques for coping with the analysis of large data sets

Subject Assessment

  • Written > Examination (centrally administered) - (40%) - Individual
  • Written > Test/Quiz 1 - (10%) - Individual
  • Capstone assignment - (50%) - Individual

Note that minor variations might occur due to the continuous subject quality improvement process, and in case of minor variation(s) in assessment details, the Subject Outline represents the latest official information.

Assumed Knowledge:  Students must have a good understanding of STATISTICS which includes knowledge of basic probability, hypothesis testing, law of large numberes, central limit theorum and ability to use R for data analysis (or have done the JCU R Bootcamp). SC5202 or SC2202 or SC2209 or will have acquired equivalent knowledge through industry experience.

Availabilities

Cairns Nguma-bada, Study Period 2, Internal

Census date:Thursday, 22 Aug 2024
Study Period Dates:Monday, 22 Jul 2024 to Friday, 15 Nov 2024
Lecturer(s):
Professor Yvette Everingham
Assoc. Professor Wayne Read
Workload expectations:The student workload for this 3 credit point subject is approximately 130 hours.

    Townsville Bebegu Yumba, Study Period 2, Internal

    Census date:Thursday, 22 Aug 2024
    Study Period Dates:Monday, 22 Jul 2024 to Friday, 15 Nov 2024
    Coordinator(s):
    DR Carla Ewels
    Lecturer(s):
    DR Sourav Das
    DR Carla Ewels
    Professor Yvette Everingham
    Assoc. Professor Wayne Read
    Workload expectations:The student workload for this 3 credit point subject is approximately 130 hours.