NVIDIA GPU Programming - Extended Training Course

This instructor-led, live training course provides guidance on programming GPUs for parallel computing, utilizing different platforms, working with the CUDA platform and its features, and executing various optimization techniques via CUDA. Applications encompass deep learning, analytics, image processing, and engineering solutions.

This course is available as onsite live training in Botswana or online live training.

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

Course Outline

Introduction

Grasping the Fundamentals of Heterogeneous Computing Methodology

Why Parallel Computing? Understanding the Need for Parallel Computing

Multi-Core Processors - Architecture and Design

Introduction to Threads, Thread Basics and Basic Concepts of Parallel Programming

Grasping the Fundamentals of GPU Software Optimization Processes

OpenMP - A Standard for Directive-Based Parallel Programming

Practical / Demonstration of Various Programs on Multicore Machines

Introduction to GPU Computing

GPUs for Parallel Computing

GPUs Programming Model

Practical / Demonstration of Various Programs on GPU

SDK, Toolkit and Installation of Environment for GPU

Working with Various Libraries

Demonstration of GPU and Tools with Sample Programs and OpenACC

Understanding the CUDA Programming Model

Learning the CUDA Architecture

Exploring and Setting Up the CUDA Development Environments

Working with the CUDA Runtime API

Understanding the CUDA Memory Model

Exploring Additional CUDA API Features

Accessing Global Memory Efficiently in CUDA: Global Memory Optimization

Optimizing Data Transfers in CUDA Using CUDA Streams

Using Shared Memory in CUDA

Understanding and Using Atomic Operations and Instructions in CUDA

Case Study: Basic Digital Image Processing with CUDA

Working with Multi-GPU Programming

Advanced Hardware Profiling and Sampling on NVIDIA / CUDA

Using CUDA Dynamic Parallelism API for Dynamic Kernel Launch

Summary and Conclusion

Requirements

C Programming
Linux GCC

21 Hours

Need help picking the right course?
southafrica@nobleprog.co.za or +27 (0)10 005 5793

NVIDIA GPU Programming - Extended Training Course

Course Outline

Requirements

Testimonials (1)

Tadeusz Kaluba - Nokia Solutions and Networks Sp. z o.o.

Course - NVIDIA GPU Programming - Extended

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

NVIDIA GPU Programming - Extended Training Course

Course Outline

Requirements

Testimonials (1)

Tadeusz Kaluba - Nokia Solutions and Networks Sp. z o.o.

Course - NVIDIA GPU Programming - Extended

Related Courses

Developing AI Applications with Huawei Ascend and CANN

Deploying AI Models with CANN and Ascend AI Processors

AI Inference and Deployment with CloudMatrix

GPU Programming on Biren AI Accelerators

Cambricon MLU Development with BANGPy and Neuware

Introduction to CANN for AI Framework Developers

CANN for Edge AI Deployment

Understanding Huawei’s AI Compute Stack: From CANN to MindSpore

Optimizing Neural Network Performance with CANN SDK

CANN SDK for Computer Vision and NLP Pipelines

Building Custom AI Operators with CANN TIK and TVM

Migrating CUDA Applications to Chinese GPU Architectures

Performance Optimization on Ascend, Biren, and Cambricon

Related Categories

GPU

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites