Academic Catalog 2017-2018 
    
    Apr 19, 2024  
Academic Catalog 2017-2018 [ARCHIVED CATALOG]

Add to Portfolio (opens a new window)

STA 364 - Big Data Analytics

Course Units: 1
(Spring; Staff) This course focuses on the analysis of large data sets in diverse application areas using statistical programming languages. Students will develop an understanding of the role of machine learning methods within the context of the scientific method. They will analyze real data sets using downloadable statistical programming packages, including on a course project of their own choosing. This analysis will include exploratory data analysis, visualization, and use of more sophisticated classification and predictive algorithms including nearest neighbor, naïve Bayes, classification and regression trees (CART), neural networks, and others. During the course we will pay special attention to validating models using the “train and test” regimen, as well as through cross validation and bootstrapping. In the process of studying the machine learning methods themselves, students will develop an ability to manipulate big data to accomplish the previous objectives. This includes downloading, merging, appending and reshaping data, and creating new variables. Successful completion of this course would be advantageous for those considering graduate study or employment in the areas of statistics, data science, machine learning, computer science, econometrics, or related disciplines. Prerequisite(s): STA 264  or ECO 243  or permission from the Chair.



Add to Portfolio (opens a new window)