Programming with Big Data in R Training Course
Big Data is a term that refers to solutions designed for storing and processing large data sets. Initially developed by Google, these Big Data solutions have evolved and inspired other similar projects, many of which are available as open source. R is a popular programming language in the financial industry.
This course is available as onsite live training in Botswana or online live training.
Course Outline
Introduction to Programming Big Data with R (pbdR)
- Setting up your environment to use pbdR
- Scope and tools available in pbdR
- Packages commonly used with Big Data alongside pbdR
Message Passing Interface (MPI)
- Using pbdR MPI
- Parallel processing
- Point-to-point communication
- Sending Matrices
- Summing Matrices
- Collective communication
- Summing Matrices with Reduce
- Scatter / Gather
- Other MPI communications
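The topics above map onto the SPMD-style functions of the pbdMPI package. Below is a minimal sketch touching point-to-point and collective communication, assuming pbdMPI and an MPI runtime such as Open MPI are installed (the script name and process count are illustrative):
```r
# A minimal SPMD sketch, assuming the pbdMPI package and an MPI runtime
# are installed. Run with: mpirun -np 2 Rscript mpi_demo.r
library(pbdMPI)
init()

# Point-to-point: rank 0 sends a matrix, rank 1 receives it
if (comm.rank() == 0) {
  send(matrix(1:4, nrow = 2), rank.dest = 1)
} else if (comm.rank() == 1) {
  m <- recv(rank.source = 0)
}

# Collective: every rank holds a local matrix; reduce() sums them onto rank 0
local.mat <- matrix(comm.rank() + 1, nrow = 2, ncol = 2)
total <- reduce(local.mat, op = "sum")
comm.print(total)        # printed by rank 0 only

finalize()
```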
Distributed Matrices
- Creating a distributed diagonal matrix
- SVD of a distributed matrix
- Building a distributed matrix in parallel
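As a hedged sketch of these topics, assuming the pbdDMAT package (which builds on pbdMPI and ScaLAPACK), creating and decomposing distributed matrices might look like this:
```r
# A minimal sketch, assuming pbdDMAT is installed on top of pbdMPI.
# Run with: mpirun -np 4 Rscript dmat_demo.r
library(pbdDMAT)
init.grid()                                      # set up the 2-d process grid

dd <- diag(1:10, type = "ddmatrix")              # distributed diagonal matrix
dx <- ddmatrix("rnorm", nrow = 500, ncol = 500)  # random matrix built in parallel
sv <- svd(dx)                                    # SVD computed across all processes

comm.print(sv$d[1:5])                            # leading singular values
finalize()
```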
Statistics Applications
- Monte Carlo Integration
- Reading Datasets
- Reading on all processes
- Broadcasting from one process
- Reading partitioned data
- Distributed Regression
- Distributed Bootstrap
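To illustrate the Monte Carlo integration topic above, here is a hedged sketch of the classic parallel estimate of pi, assuming pbdMPI (each process draws its own sample and the results are combined with allreduce):
```r
# A minimal parallel Monte Carlo sketch, assuming pbdMPI.
# Run with: mpirun -np 4 Rscript mc_demo.r
library(pbdMPI)
init()

comm.set.seed(1234, diff = TRUE)     # independent RNG streams per process
n <- 100000                          # samples per process

# Each rank samples the unit square and counts hits in the quarter circle
x <- runif(n); y <- runif(n)
hits <- sum(x^2 + y^2 <= 1)

total.hits <- allreduce(hits, op = "sum")
comm.print(4 * total.hits / (n * comm.size()))   # estimate of pi

finalize()
```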
Testimonials (2)
The subject matter and the pace were perfect.
Tim - Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
Course - Programming with Big Data in R
Michael the trainer is very knowledgeable and skillful about the subject of Big Data and R. He is very flexible and quickly customizes the training to meet clients' needs. He is also very capable of solving technical and subject matter problems on the go. Fantastic and professional training!
Xiaoyuan Geng - Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
Course - Programming with Big Data in R
Related Courses
Apache Ignite for Administrators
7 Hours
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP
21 Hours
This course is intended for developers and data scientists who want to understand and implement artificial intelligence in their applications. Special focus is placed on data analytics, distributed AI, and natural language processing.
Unified Batch and Stream Processing with Apache Beam
14 Hours
Apache Beam is an open source, unified programming model for defining and executing parallel data processing pipelines. Its power lies in its ability to run both batch and streaming pipelines, with execution being carried out by one of Beam's supported distributed processing back-ends: Apache Apex, Apache Flink, Apache Spark, and Google Cloud Dataflow. Apache Beam is useful for ETL (Extract, Transform, and Load) tasks such as moving data between different storage media and data sources, transforming data into a more desirable format, and loading data onto a new system.
In this instructor-led, live training (onsite or remote), participants will learn how to implement the Apache Beam SDKs in a Java or Python application that defines a data processing pipeline for decomposing a big data set into smaller chunks for independent, parallel processing.
By the end of this training, participants will be able to:
- Install and configure Apache Beam.
- Use a single programming model to carry out both batch and stream processing from within their Java or Python application.
- Execute pipelines across multiple environments.
Format of the Course
- Part lecture, part discussion, exercises and heavy hands-on practice
Note
- This course will be available in Scala in the future. Please contact us to arrange.
Data Vault: Building a Scalable Data Warehouse
28 Hours
In this instructor-led, live training in Botswana, participants will learn how to build a Data Vault.
By the end of this training, participants will be able to:
- Understand the architecture and design concepts behind Data Vault 2.0, and its interaction with Big Data, NoSQL and AI.
- Use data vaulting techniques to enable auditing, tracing, and inspection of historical data in a data warehouse.
- Develop a consistent and repeatable ETL (Extract, Transform, Load) process.
- Build and deploy highly scalable and repeatable warehouses.
Apache Flink Fundamentals
28 Hours
This instructor-led, live training in Botswana (online or onsite) introduces the principles and approaches behind distributed stream and batch data processing, and walks participants through the creation of a real-time, data streaming application in Apache Flink.
By the end of this training, participants will be able to:
- Set up an environment for developing data analysis applications.
- Understand how Apache Flink's graph-processing library (Gelly) works.
- Package, execute, and monitor Flink-based, fault-tolerant, data streaming applications.
- Manage diverse workloads.
- Perform advanced analytics.
- Set up a multi-node Flink cluster.
- Measure and optimize performance.
- Integrate Flink with different Big Data systems.
- Compare Flink capabilities with those of other big data processing frameworks.
Introduction to Graph Computing
28 Hours
In this instructor-led, live training in Botswana, participants will learn about the technology offerings and implementation approaches for processing graph data. The aim is to identify real-world objects, their characteristics and relationships, then model these relationships and process them as data using a Graph Computing (also known as Graph Analytics) approach. We start with a broad overview and narrow in on specific tools as we step through a series of case studies, hands-on exercises and live deployments.
By the end of this training, participants will be able to:
- Understand how graph data is persisted and traversed.
- Select the best framework for a given task (from graph databases to batch processing frameworks).
- Implement Hadoop, Spark, GraphX and Pregel to carry out graph computing across many machines in parallel.
- View real-world big data problems in terms of graphs, processes and traversals.
Hortonworks Data Platform (HDP) for Administrators
21 Hours
This instructor-led, live training in Botswana (online or onsite) introduces Hortonworks Data Platform (HDP) and walks participants through the deployment of a Spark + Hadoop solution.
By the end of this training, participants will be able to:
- Use Hortonworks to reliably run Hadoop at a large scale.
- Unify Hadoop's security, governance, and operations capabilities with Spark's agile analytic workflows.
- Use Hortonworks to investigate, validate, certify and support each of the components in a Spark project.
- Process different types of data, including structured, unstructured, in-motion, and at-rest.
Confluent KSQL
7 Hours
This instructor-led, live training in Botswana (online or onsite) is aimed at developers who wish to implement Apache Kafka stream processing without writing code.
By the end of this training, participants will be able to:
- Install and configure Confluent KSQL.
- Set up a stream processing pipeline using only SQL commands (no Java or Python coding).
- Carry out data filtering, transformations, aggregations, joins, windowing, and sessionization entirely in SQL.
- Design and deploy interactive, continuous queries for streaming ETL and real-time analytics.
Apache NiFi for Administrators
21 Hours
In this instructor-led, live training in Botswana (onsite or remote), participants will learn how to deploy and manage Apache NiFi in a live lab environment.
By the end of this training, participants will be able to:
- Install and configure Apache NiFi.
- Source, transform and manage data from disparate, distributed data sources, including databases and big data lakes.
- Automate dataflows.
- Enable streaming analytics.
- Apply various approaches for data ingestion.
- Transform Big Data into business insights.
Apache NiFi for Developers
7 Hours
In this instructor-led, live training in Botswana, participants will learn the fundamentals of flow-based programming as they develop a number of demo extensions, components and processors using Apache NiFi.
By the end of this training, participants will be able to:
- Understand NiFi's architecture and dataflow concepts.
- Develop extensions using NiFi and third-party APIs.
- Develop their own custom Apache NiFi processors.
- Ingest and process real-time data from disparate and uncommon file formats and data sources.
Python and Spark for Big Data (PySpark)
21 Hours
In this instructor-led, live training in Botswana, participants will learn how to use Python and Spark together to analyze big data as they work on hands-on exercises.
By the end of this training, participants will be able to:
- Learn how to use Spark with Python to analyze Big Data.
- Work on exercises that mimic real world cases.
- Use different tools and techniques for big data analysis using PySpark.
Spark Streaming with Python and Kafka
7 Hours
This instructor-led, live training in Botswana (online or onsite) is aimed at data engineers, data scientists, and programmers who wish to use Spark Streaming features in processing and analyzing real-time data.
By the end of this training, participants will be able to use Spark Streaming to process live data streams for use in databases, filesystems, and live dashboards.
Apache Spark MLlib
35 Hours
MLlib is Spark’s machine learning (ML) library. Its goal is to make practical machine learning scalable and easy. It consists of common learning algorithms and utilities, including classification, regression, clustering, collaborative filtering, and dimensionality reduction, as well as lower-level optimization primitives and higher-level pipeline APIs.
It divides into two packages:
- spark.mllib contains the original API built on top of RDDs.
- spark.ml provides a higher-level API built on top of DataFrames for constructing ML pipelines.
Audience
This course is directed at engineers and developers seeking to utilize the machine learning library built into Apache Spark.
Stratio: Rocket and Intelligence Modules with PySpark
14 Hours
Stratio is a data-centric platform that integrates big data, AI, and governance into a single solution. Its Rocket and Intelligence modules enable rapid data exploration, transformation, and advanced analytics in enterprise environments.
This instructor-led, live training (online or onsite) is aimed at intermediate-level data professionals who wish to use the Rocket and Intelligence modules in Stratio effectively with PySpark, focusing on looping structures, user-defined functions, and advanced data logic.
By the end of this training, participants will be able to:
- Navigate and work within the Stratio platform using Rocket and Intelligence modules.
- Apply PySpark in the context of data ingestion, transformation, and analysis.
- Use loops and conditional logic to control data workflows and feature engineering tasks.
- Create and manage user-defined functions (UDFs) for reusable data operations in PySpark.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Introduction to Data Visualization with Tidyverse and R
7 Hours
The Tidyverse is a collection of versatile R packages for cleaning, processing, modeling, and visualizing data. Some of the packages included are: ggplot2, dplyr, tidyr, readr, purrr, and tibble.
In this instructor-led, live training, participants will learn how to manipulate and visualize data using the tools included in the Tidyverse.
By the end of this training, participants will be able to:
- Perform data analysis and create appealing visualizations
- Draw useful conclusions from various datasets of sample data
- Filter, sort and summarize data to answer exploratory questions
- Turn processed data into informative line plots, bar plots, histograms
- Import and filter data from diverse data sources, including Excel, CSV, and SPSS files
Audience
- Beginners to the R language
- Beginners to data analysis and data visualization
Format of the course
- Part lecture, part discussion, exercises and heavy hands-on practice
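As a brief, hedged taste of the workflow this course covers, a filter-summarize-plot pipeline with dplyr and ggplot2 (using R's built-in mtcars data set) might look like this:
```r
# A minimal Tidyverse sketch: filter, group, summarize, then plot
library(dplyr)
library(ggplot2)

mtcars %>%
  filter(cyl %in% c(4, 6, 8)) %>%               # keep rows of interest
  group_by(cyl) %>%                             # group for summarizing
  summarize(mean_mpg = mean(mpg)) %>%           # one summary row per group
  ggplot(aes(x = factor(cyl), y = mean_mpg)) +  # map summary to a bar plot
  geom_col() +
  labs(x = "Cylinders", y = "Mean MPG",
       title = "Fuel economy by cylinder count")
```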