Get in Touch

Course Outline

Day One: Language Fundamentals

  • Course Introduction
  • Understanding Data Science
    • Defining Data Science
    • The Data Science Workflow
  • Introduction to the R Language
  • Variables and Data Types
  • Control Structures (Loops and Conditionals)
  • R Scalars, Vectors, and Matrices
    • Creating R Vectors
    • Working with Matrices
  • String and Text Manipulation
    • The Character Data Type
    • File Input and Output
  • Lists
  • Functions
    • Function Fundamentals
    • Closures
    • Using lapply and sapply Functions
  • DataFrames
  • Practical Labs for All Topics

Day Two: Intermediate R Programming

  • DataFrames and File Input/Output
  • Importing Data from Files
  • Data Preparation Techniques
  • Utilising Built-in Datasets
  • Data Visualization
    • The Graphics Package
    • plot(), barplot(), hist(), boxplot(), and scatter plots
    • Heat Maps
    • The ggplot2 Package (qplot(), ggplot())
  • Data Exploration with dplyr
  • Practical Labs for All Topics

Day Three: Advanced Programming with R

  • Statistical Modelling with R
    • Statistical Functions
    • Handling Missing Values (NA)
    • Probability Distributions (Binomial, Poisson, Normal)
  • Regression Analysis
    • Introduction to Linear Regression
  • Recommendation Systems
  • Text Processing (tm Package and Wordclouds)
  • Clustering
    • Clustering Fundamentals
    • K-Means Clustering
  • Classification
    • Classification Fundamentals
    • Naive Bayes
    • Decision Trees
    • Model Training using the caret Package
    • Algorithm Evaluation
  • R and Big Data
    • Connecting R to Databases
    • The Big Data Ecosystem
  • Practical Labs for All Topics

Requirements

  • A foundational background in programming is recommended

Prerequisites and Setup

  • A modern laptop
  • The latest version of R Studio and the R environment installed
 21 Hours

Testimonials (7)

Related Categories