- Strong understanding of deep learning model architectures and training workflows
- Experience with model deployment using CANN, TensorFlow, or PyTorch
- Familiarity with Linux CLI, shell scripting, and Python programming
Audience
- AI performance engineers
- Inference optimization specialists
- Developers working with edge AI or real-time systems
CANN SDK (Compute Architecture for Neural Networks) is Huawei’s AI compute foundation that allows developers to fine-tune and optimize the performance of deployed neural networks on Ascend AI processors.
This instructor-led, live training (online or onsite) is aimed at advanced-level AI developers and system engineers who wish to optimize inference performance using CANN’s advanced toolset, including the Graph Engine, TIK, and custom operator development.
By the end of this training, participants will be able to:
- Understand CANN's runtime architecture and performance lifecycle.
- Use profiling tools and Graph Engine for performance analysis and optimization.
- Create and optimize custom operators using TIK and TVM.
- Resolve memory bottlenecks and improve model throughput.
Format of the Course
- Interactive lecture and discussion.
- Hands-on labs with real-time profiling and operator tuning.
- Optimization exercises using edge-case deployment examples.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Overview of CANN Optimization Capabilities
- How inference performance is handled in CANN
- Optimization goals for edge and embedded AI systems
- Understanding AI Core utilization and memory allocation
Using Graph Engine for Analysis
- Introduction to the Graph Engine and execution pipeline
- Visualizing operator graphs and runtime metrics
- Modifying computational graphs for optimization
Profiling Tools and Performance Metrics
- Using CANN Profiling Tool (profiler) for workload analysis
- Analyzing kernel execution time and bottlenecks
- Memory access profiling and tiling strategies
Custom Operator Development with TIK
- Overview of TIK and operator programming model
- Implementing a custom operator using TIK DSL
- Testing and benchmarking operator performance
Advanced Operator Optimization with TVM
- Intro to TVM integration with CANN
- Auto-tuning strategies for computational graphs
- When and how to switch between TVM and TIK
Memory Optimization Techniques
- Managing memory layout and buffer placement
- Techniques to reduce on-chip memory consumption
- Best practices for asynchronous execution and reuse
Real-World Deployment and Case Studies
- Case study: performance tuning for smart city camera pipeline
- Case study: optimizing autonomous vehicle inference stack
- Guidelines for iterative profiling and continuous improvement
Summary and Next Steps
United Arab Emirates - Optimizing Neural Network Performance with CANN SDK
Qatar - Optimizing Neural Network Performance with CANN SDK
Egypt - Optimizing Neural Network Performance with CANN SDK
Saudi Arabia - Optimizing Neural Network Performance with CANN SDK
South Africa - Optimizing Neural Network Performance with CANN SDK
Brasil - Optimizing Neural Network Performance with CANN SDK
Canada - Optimizing Neural Network Performance with CANN SDK
中国 - Optimizing Neural Network Performance with CANN SDK
香港 - Optimizing Neural Network Performance with CANN SDK
澳門 - Optimizing Neural Network Performance with CANN SDK
台灣 - Optimizing Neural Network Performance with CANN SDK
USA - Optimizing Neural Network Performance with CANN SDK
Österreich - Optimizing Neural Network Performance with CANN SDK
Schweiz - Optimizing Neural Network Performance with CANN SDK
Deutschland - Optimizing Neural Network Performance with CANN SDK
Czech Republic - Optimizing Neural Network Performance with CANN SDK
Denmark - Optimizing Neural Network Performance with CANN SDK
Estonia - Optimizing Neural Network Performance with CANN SDK
Finland - Optimizing Neural Network Performance with CANN SDK
Greece - Optimizing Neural Network Performance with CANN SDK
Magyarország - Optimizing Neural Network Performance with CANN SDK
Ireland - Optimizing Neural Network Performance with CANN SDK
Luxembourg - Optimizing Neural Network Performance with CANN SDK
Latvia - Optimizing Neural Network Performance with CANN SDK
España - Optimizing Neural Network Performance with CANN SDK
Italia - Optimizing Neural Network Performance with CANN SDK
Lithuania - Optimizing Neural Network Performance with CANN SDK
Nederland - Optimizing Neural Network Performance with CANN SDK
Norway - Optimizing Neural Network Performance with CANN SDK
Portugal - Optimizing Neural Network Performance with CANN SDK
România - Optimizing Neural Network Performance with CANN SDK
Sverige - Optimizing Neural Network Performance with CANN SDK
Türkiye - Optimizing Neural Network Performance with CANN SDK
Malta - Optimizing Neural Network Performance with CANN SDK
Belgique - Optimizing Neural Network Performance with CANN SDK
France - Optimizing Neural Network Performance with CANN SDK
日本 - Optimizing Neural Network Performance with CANN SDK
Australia - Optimizing Neural Network Performance with CANN SDK
Malaysia - Optimizing Neural Network Performance with CANN SDK
New Zealand - Optimizing Neural Network Performance with CANN SDK
Philippines - Optimizing Neural Network Performance with CANN SDK
Singapore - Optimizing Neural Network Performance with CANN SDK
Thailand - Optimizing Neural Network Performance with CANN SDK
Vietnam - Optimizing Neural Network Performance with CANN SDK
India - Optimizing Neural Network Performance with CANN SDK
Argentina - Optimizing Neural Network Performance with CANN SDK
Chile - Optimizing Neural Network Performance with CANN SDK
Costa Rica - Optimizing Neural Network Performance with CANN SDK
Ecuador - Optimizing Neural Network Performance with CANN SDK
Guatemala - Optimizing Neural Network Performance with CANN SDK
Colombia - Optimizing Neural Network Performance with CANN SDK
México - Optimizing Neural Network Performance with CANN SDK
Panama - Optimizing Neural Network Performance with CANN SDK
Peru - Optimizing Neural Network Performance with CANN SDK
Uruguay - Optimizing Neural Network Performance with CANN SDK
Venezuela - Optimizing Neural Network Performance with CANN SDK
Polska - Optimizing Neural Network Performance with CANN SDK
United Kingdom - Optimizing Neural Network Performance with CANN SDK
South Korea - Optimizing Neural Network Performance with CANN SDK
Pakistan - Optimizing Neural Network Performance with CANN SDK
Sri Lanka - Optimizing Neural Network Performance with CANN SDK
Bulgaria - Optimizing Neural Network Performance with CANN SDK
Bolivia - Optimizing Neural Network Performance with CANN SDK
Indonesia - Optimizing Neural Network Performance with CANN SDK
Kazakhstan - Optimizing Neural Network Performance with CANN SDK
Moldova - Optimizing Neural Network Performance with CANN SDK
Morocco - Optimizing Neural Network Performance with CANN SDK
Tunisia - Optimizing Neural Network Performance with CANN SDK
Kuwait - Optimizing Neural Network Performance with CANN SDK
Oman - Optimizing Neural Network Performance with CANN SDK
Slovakia - Optimizing Neural Network Performance with CANN SDK
Kenya - Optimizing Neural Network Performance with CANN SDK
Nigeria - Optimizing Neural Network Performance with CANN SDK
Botswana - Optimizing Neural Network Performance with CANN SDK
Slovenia - Optimizing Neural Network Performance with CANN SDK
Croatia - Optimizing Neural Network Performance with CANN SDK
Serbia - Optimizing Neural Network Performance with CANN SDK
Bhutan - Optimizing Neural Network Performance with CANN SDK
Nepal - Optimizing Neural Network Performance with CANN SDK
Uzbekistan - Optimizing Neural Network Performance with CANN SDK