AI Inference and Deployment with CloudMatrix Training Course
CloudMatrix serves as Huawei's unified platform for AI development and deployment, specifically designed to facilitate scalable, production-ready inference pipelines.
This live training session, which can be conducted online or at your premises under the guidance of an instructor, targets beginner to intermediate AI professionals. The objective is to equip participants with the skills to deploy and monitor AI models using the CloudMatrix platform, leveraging its integration with CANN and MindSpore.
Upon completing this training, participants will be capable of:
- Utilising CloudMatrix for the packaging, deployment, and serving of models.
- Converting and optimising models specifically for Ascend chipsets.
- Establishing pipelines for both real-time and batch inference tasks.
- Monitoring deployments and tuning performance within production environments.
Course Format
- Interactive lectures and group discussions.
- Practical application of CloudMatrix through real-world deployment scenarios.
- Guided exercises focusing on conversion, optimisation, and scaling.
Customization Options
- If you require a tailored version of this course based on your specific AI infrastructure or cloud environment, please contact us to make arrangements.
Course Outline
Introduction to Huawei CloudMatrix
- Overview of the CloudMatrix ecosystem and deployment workflow.
- Supported models, data formats, and deployment modes.
- Typical use cases and compatible chipsets.
Preparing Models for Deployment
- Exporting models from training tools such as MindSpore, TensorFlow, and PyTorch.
- Employing ATC (Ascend Tensor Compiler) for format conversion.
- Distinguishing between static and dynamic shape models.
Deploying to CloudMatrix
- Creating services and registering models.
- Deploying inference services via the user interface or command line interface (CLI).
- Managing routing, authentication, and access control.
Serving Inference Requests
- Differentiating between batch and real-time inference flows.
- Implementing data preprocessing and postprocessing pipelines.
- Integrating CloudMatrix services into external applications.
Monitoring and Performance Tuning
- Tracking deployment logs and requests.
- Managing resource scaling and load balancing.
- Optimising latency and throughput.
Integration with Enterprise Tools
- Connecting CloudMatrix with OBS and ModelArts.
- Utilising workflows and model versioning.
- Implementing CI/CD for model deployment and rollback procedures.
End-to-End Inference Pipeline
- Deploying a complete image classification pipeline.
- Benchmarking and validating accuracy.
- Simulating failover scenarios and system alerts.
Summary and Next Steps
Requirements
- A foundational understanding of AI model training workflows.
- Practical experience with Python-based machine learning frameworks.
- Basic familiarity with cloud deployment concepts.
Target Audience
- AI operations teams.
- Machine learning engineers.
- Cloud deployment specialists working with Huawei infrastructure.
Need help picking the right course?
southafrica@nobleprog.co.za or +27 (0)10 005 5793
AI Inference and Deployment with CloudMatrix Training Course - Enquiry
Testimonials (2)
The extensive selection of tools presented
Miruna Buzduga - Aeronamic Eastern Europe
Course - AI Enablement Training for Engineers
Step by step training with a lot of exercises. It was like a workshop and I am very glad about that.
Ireneusz - Inter Cars S.A.
Course - Intelligent Applications Fundamentals
Related Courses
Developing AI Applications with Huawei Ascend and CANN
21 HoursHuawei Ascend comprises a series of AI processors engineered for high-performance inference and training tasks.
This instructor-led live training, available online or on-site, targets intermediate-level AI engineers and data scientists keen on developing and optimizing neural network models via Huawei's Ascend platform and the CANN toolkit.
Upon completion of this training, participants will be capable of:
- Setting up and configuring the CANN development environment.
- Creating AI applications employing MindSpore and CloudMatrix workflows.
- Enhancing performance on Ascend NPUs through custom operators and tiling techniques.
- Deploying models to either edge or cloud environments.
Course Format
- Interactive lectures and discussions.
- Practical application of Huawei Ascend and the CANN toolkit within sample applications.
- Guided exercises centred on model construction, training, and deployment.
Course Customization Options
- For customized training tailored to your specific infrastructure or datasets, please reach out to us to make arrangements.
Deploying AI Models with CANN and Ascend AI Processors
14 HoursCANN (Compute Architecture for Neural Networks) serves as Huawei's AI compute stack, designed to deploy and optimize AI models on Ascend AI processors.
This live training session, led by an instructor and available either online or onsite, targets intermediate-level AI developers and engineers. It focuses on efficiently deploying trained AI models onto Huawei Ascend hardware by leveraging the CANN toolkit alongside tools such as MindSpore, TensorFlow, or PyTorch.
Upon completing this training, participants will be able to:
- Grasp the CANN architecture and its critical role within the AI deployment pipeline.
- Convert and adapt models from widely used frameworks into formats compatible with Ascend.
- Utilise tools like ATC, OM model conversion, and MindSpore for both edge and cloud inference tasks.
- Troubleshoot deployment challenges and optimise performance on Ascend hardware.
Course Format
- Interactive lectures combined with practical demonstrations.
- Hands-on laboratory exercises using CANN tools and Ascend simulators or physical devices.
- Practical deployment scenarios grounded in real-world AI models.
Course Customisation Options
- To request a customised training programme for this course, please contact us to make arrangements.
AI Engineering Fundamentals
14 HoursThis instructor-led, live training in Botswana (online or on-site) is designed for beginner to intermediate-level AI engineers and software developers who wish to build a solid foundation in AI engineering principles and practices.
Upon completion of this training, participants will be capable of:
- Grasping the core concepts and technologies underlying AI and machine learning.
- Developing basic machine learning models using TensorFlow and PyTorch.
- Applying AI techniques to resolve practical issues in software development.
- Managing and maintaining AI projects in accordance with AI engineering best practices.
- Recognising the ethical implications and responsibilities associated with developing AI systems.
AI Enablement Training for Engineers
14 HoursThis intensive two-day practical workshop provides participants with the essential skills to utilise Generative AI tools effectively, securely, and consistently within technical and operational contexts. The course centres on transforming AI into a dependable daily assistant for drafting and refining documentation, structuring requirements, expediting analysis and reporting, facilitating problem-solving, and streamlining cross-functional communication. Attendees will master proven prompting strategies, validation methods to minimise errors and "hallucinations," and lightweight governance frameworks to safeguard confidential data and intellectual property. The curriculum is crafted to yield immediately applicable templates, workflows, and a reproducible methodology for AI-assisted work.
Building Intelligent Applications with AI and ML
28 HoursThis instructor-led, live training in Botswana (online or onsite) is designed for intermediate to advanced AI professionals and software developers aiming to build intelligent applications using AI and ML.
Upon completion of this training, participants will be able to:
- Comprehend the advanced concepts and technologies underpinning AI and ML.
- Analyze and visualize data to guide AI/ML model development.
- Construct, train, and deploy AI/ML models with effectiveness.
- Develop intelligent applications capable of addressing real-world challenges.
- Assess the ethical implications of AI applications across various industries.
Introduction to CANN for AI Framework Developers
7 HoursCANN (Compute Architecture for Neural Networks) is Huawei’s AI computing toolkit designed to compile, optimise, and deploy AI models on Ascend AI processors.
This instructor-led, live training (available online or onsite) is tailored for beginner-level AI developers who wish to understand how CANN fits into the model lifecycle from training to deployment, and how it works with frameworks like MindSpore, TensorFlow, and PyTorch.
By the end of this training, participants will be able to:
- Understand the purpose and architecture of the CANN toolkit.
- Set up a development environment with CANN and MindSpore.
- Convert and deploy a simple AI model to Ascend hardware.
- Gain foundational knowledge for future CANN optimisation or integration projects.
Format of the Course
- Interactive lecture and discussion.
- Hands-on labs with simple model deployment.
- Step-by-step walkthrough of the CANN toolchain and integration points.
Course Customisation Options
- To request a customised training for this course, please contact us to arrange.
CANN for Edge AI Deployment
14 HoursHuawei's Ascend CANN toolkit facilitates robust AI inference on edge devices like the Ascend 310. It offers critical tools for compiling, optimising, and deploying models in environments with limited compute power and memory.
This instructor-led, live training (available online or onsite) targets intermediate-level AI developers and integrators who wish to deploy and optimise models on Ascend edge devices using the CANN toolchain.
Upon completing this training, participants will be able to:
- Prepare and convert AI models for the Ascend 310 using CANN tools.
- Build lightweight inference pipelines using MindSpore Lite and AscendCL.
- Optimise model performance for environments with constrained compute and memory.
- Deploy and monitor AI applications in real-world edge use cases.
Course Format
- Interactive lectures and demonstrations.
- Hands-on laboratory work involving edge-specific models and scenarios.
- Live deployment examples on virtual or physical edge hardware.
Customisation Options
- To request tailored training for this course, please contact us to arrange.
Understanding Huawei’s AI Compute Stack: From CANN to MindSpore
14 HoursHuawei’s AI stack — spanning from the low-level CANN SDK to the high-level MindSpore framework — delivers a closely integrated environment for AI development and deployment, specifically optimised for Ascend hardware.
This instructor-led, live training (available online or on-site) targets beginner to intermediate-level technical professionals who wish to grasp how the CANN and MindSpore components collaborate to facilitate AI lifecycle management and infrastructure decision-making.
Upon completing this training, participants will be equipped to:
- Comprehend the layered architecture of Huawei’s AI compute stack.
- Identify how CANN facilitates model optimisation and hardware-level deployment.
- Assess the MindSpore framework and toolchain in comparison with industry alternatives.
- Position Huawei's AI stack within enterprise or cloud/on-premises environments.
Format of the Course
- Interactive lectures and discussions.
- Live system demonstrations and case-based walkthroughs.
- Optional guided labs focusing on the model flow from MindSpore to CANN.
Course Customisation Options
- To request bespoke training for this course, please contact us to make arrangements.
Optimizing Neural Network Performance with CANN SDK
14 HoursThe CANN SDK (Compute Architecture for Neural Networks) serves as Huawei’s foundational AI computing platform, enabling developers to fine-tune and maximise the performance of neural networks deployed on Ascend AI processors.
This instructor-led live training session, available online or onsite, is designed for advanced AI developers and system engineers who aim to optimise inference performance through CANN’s sophisticated toolkit, including the Graph Engine, TIK, and custom operator development.
Upon completion of this training, participants will be able to:
- Comprehend CANN’s runtime architecture and performance lifecycle.
- Utilise profiling tools and the Graph Engine for detailed performance analysis and optimisation.
- Develop and optimise custom operators using TIK and TVM.
- Address memory bottlenecks and enhance model throughput.
Course Format
- Interactive lectures and discussions.
- Practical hands-on labs featuring real-time profiling and operator tuning.
- Optimisation exercises based on edge-case deployment scenarios.
Course Customisation Options
- To request a tailored version of this course, please contact us to make arrangements.
CANN SDK for Computer Vision and NLP Pipelines
14 HoursThe CANN SDK (Compute Architecture for Neural Networks) offers robust deployment and optimization tools for real-time AI applications in computer vision and NLP, particularly on Huawei Ascend hardware.
This instructor-led, live training (available online or onsite) is designed for intermediate-level AI practitioners who want to build, deploy, and optimize vision and language models for production environments using the CANN SDK.
By the conclusion of this training, participants will be able to:
- Deploy and optimize CV and NLP models using CANN and AscendCL.
- Utilize CANN tools to convert models and integrate them into live pipelines.
- Optimize inference performance for tasks such as detection, classification, and sentiment analysis.
- Construct real-time CV/NLP pipelines for both edge and cloud-based deployment scenarios.
Course Format
- Interactive lectures and demonstrations.
- Hands-on labs involving model deployment and performance profiling.
- Live pipeline design using real-world CV and NLP use cases.
Course Customization Options
- To request a customized training session for this course, please contact us to make arrangements.
Building Custom AI Operators with CANN TIK and TVM
14 HoursCANN TIK (Tensor Instruction Kernel) and Apache TVM facilitate the advanced optimisation and customisation of AI model operators for Huawei Ascend hardware.
This instructor-led, live training session (available online or onsite) is designed for advanced-level system developers who wish to construct, deploy, and fine-tune custom operators for AI models using CANN’s TIK programming model and TVM compiler integration.
Upon completion of this training, participants will be capable of:
- Writing and testing custom AI operators using the TIK DSL for Ascend processors.
- Integrating custom operations into the CANN runtime and execution graph.
- Leveraging TVM for operator scheduling, auto-tuning, and benchmarking.
- Debugging and optimizing instruction-level performance for custom computation patterns.
Course Format
- Interactive lectures and demonstrations.
- Hands-on coding of operators utilizing TIK and TVM pipelines.
- Testing and tuning on Ascend hardware or simulators.
Course Customization Options
- To request a customized training session for this course, please contact us to arrange.
Migrating CUDA Applications to Chinese GPU Architectures
21 HoursChinese GPU architectures, including Huawei Ascend, Biren, and Cambricon MLUs, present viable alternatives to CUDA, specifically designed for local AI and HPC markets.
This instructor-led, live training (available online or onsite) is intended for advanced GPU programmers and infrastructure specialists seeking to migrate and optimize existing CUDA applications for deployment on Chinese hardware platforms.
Upon completing this training, participants will be able to:
- Evaluate the compatibility of existing CUDA workloads with Chinese chip alternatives.
- Port CUDA codebases to Huawei CANN, Biren SDK, and Cambricon BANGPy environments.
- Compare performance metrics and identify optimization opportunities across different platforms.
- Address practical challenges related to cross-architecture support and deployment.
Course Format
- Interactive lectures and discussions.
- Hands-on labs focusing on code translation and performance comparisons.
- Guided exercises centred on multi-GPU adaptation strategies.
Customization Options
- To request customized training for this course tailored to your specific platform or CUDA project, please contact us to arrange.
Intelligent Applications Fundamentals
14 HoursThis instructor-led, live training in Botswana (online or onsite) is aimed at beginner-level IT professionals who wish to gain a foundational understanding of intelligent applications and how they can be applied in various industries.
By the end of this training, participants will be able to:
- Understand the history, principles, and impact of artificial intelligence.
- Identify and apply different machine learning algorithms.
- Manage and analyze data effectively for AI applications.
- Recognize the practical applications and limitations of AI in different sectors.
- Discuss the ethical considerations and societal implications of AI technology.
Intelligent Applications Advanced
21 HoursThis instructor-led live training in Botswana (online or onsite) is designed for data scientists, engineers, and AI practitioners at intermediate to advanced levels who aim to master the nuances of intelligent applications and utilise them to address complex, real-world challenges.
Upon completion of this training, participants will be able to:
- Implement and analyse deep learning architectures.
- Apply machine learning at scale within a distributed computing environment.
- Design and execute reinforcement learning models for decision-making.
- Develop sophisticated NLP systems for language understanding.
- Utilise computer vision techniques for image and video analysis.
- Address ethical considerations in the development and deployment of AI systems.
Performance Optimization on Ascend, Biren, and Cambricon
21 HoursAscend, Biren, and Cambricon represent the forefront of AI hardware development in China, providing distinct acceleration and profiling capabilities tailored for large-scale AI operations.
This instructor-led training, available online or onsite, is designed for advanced AI infrastructure and performance engineers seeking to enhance model inference and training processes across various Chinese AI chip architectures.
Upon completion, participants will be equipped to:
- Evaluate model performance on Ascend, Biren, and Cambricon systems.
- Pinpoint system bottlenecks and inefficiencies in memory and computation.
- Implement optimizations at the graph, kernel, and operator levels.
- Refine deployment pipelines to maximise throughput and reduce latency.
Course Format
- Engaging lectures combined with interactive discussions.
- Practical application of profiling and optimization tools across each platform.
- Structured exercises focused on real-world tuning scenarios.
Customization Options
- For tailored training based on your specific performance environment or model requirements, please reach out to us to arrange.