NPU “A3000 V2”
Product Overview and Features
- High-Performance NPU: A scalable NPU architecture delivering over 40 TOPS of compute, capable of handling a wide range of inference tasks.
- Multicore Parallel Processing: A multicore design that can run multiple different models concurrently.
- Mixed-Precision Computation: Uses mixed-precision computation to balance inference performance and accuracy.
- Extensive Data Format Support: Supports a broad range of data formats, including INT4/INT8 and FP4/FP8/FP16.
- Comprehensive Model Support: Covers a wide range of ONNX operators, enabling a diverse set of AI models.
- Edge Computing Optimization: Optimized for edge computing, with high performance, power efficiency, and area efficiency (PPA).
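
To make the trade-off between data formats concrete, the following minimal sketch compares the rounding error of INT8 and INT4 using plain symmetric per-tensor quantization. It is an illustration only: the quantization scheme, the function names, and the sample weights are assumptions chosen for the example and do not describe how the A3000 V2 or DMP's tools quantize models.

```python
def quantize_symmetric(values, bits):
    """Quantize floats to signed integers of the given bit width.

    Uses one scale for the whole list (symmetric, per-tensor).
    This is a generic textbook scheme, assumed here for illustration.
    """
    qmax = 2 ** (bits - 1) - 1  # 127 for INT8, 7 for INT4
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    quantized = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return quantized, scale


def dequantize(quantized, scale):
    """Map quantized integers back to approximate float values."""
    return [q * scale for q in quantized]


def mean_abs_error(values, bits):
    """Average absolute difference between original and round-tripped values."""
    quantized, scale = quantize_symmetric(values, bits)
    restored = dequantize(quantized, scale)
    return sum(abs(a - b) for a, b in zip(values, restored)) / len(values)


# Hypothetical weights, evenly spaced from -0.5 to 0.5.
weights = [0.02 * i - 0.5 for i in range(51)]

err_int8 = mean_abs_error(weights, 8)
err_int4 = mean_abs_error(weights, 4)

print(f"INT8 mean abs error: {err_int8:.5f}")
print(f"INT4 mean abs error: {err_int4:.5f}")
```

The narrower format halves storage and bandwidth per value but has a much coarser grid, which is why a mixed-precision approach typically keeps accuracy-sensitive layers in a wider format.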
Details and Main Functions
- Scalable MAC Support: Each core supports between 96 and 2048 MAC units, enabling advanced multicore processing.
- High-Performance Inference: Delivers over 40 TOPS of inference performance, executing complex computations at high speed.
- Edge Device Optimization: Designed for low power consumption and low cost, achieving high performance, power efficiency, and area efficiency (PPA). Compared with NPUs of the same class, its die size is around 50% smaller.
- Mixed-Precision Arithmetic: DMP's proprietary mixed-precision arithmetic unit combines high accuracy with high-speed processing.
- Broad Data Format Support: In addition to FP16 and INT8, it supports formats favored by recent ML trends, including INT4 and FP4, addressing diverse application needs.
- Proprietary Profiler for Performance and Accuracy Analysis: A custom profiler provides per-layer analysis of performance and accuracy degradation, facilitating the optimization of both speed and precision.
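
As a rough guide to how the MAC range relates to the TOPS figure, the sketch below estimates peak throughput from the number of MAC units, the number of cores, and the clock frequency, counting one multiply-accumulate as two operations. The page does not state the clock frequency, the core count, or the precision at which the 40 TOPS figure is measured, so the 1 GHz clock and the helper names here are hypothetical values chosen only to show the arithmetic.

```python
import math


def peak_tops(macs_per_core, num_cores, clock_hz):
    """Peak throughput in TOPS, counting each MAC as 2 operations per cycle."""
    ops_per_second = 2 * macs_per_core * num_cores * clock_hz
    return ops_per_second / 1e12


def cores_needed(target_tops, macs_per_core, clock_hz):
    """Smallest number of cores whose combined peak meets the target."""
    per_core = peak_tops(macs_per_core, 1, clock_hz)
    return math.ceil(target_tops / per_core)


# ASSUMPTION: 1 GHz clock. The source does not give a clock frequency.
ASSUMED_CLOCK_HZ = 1.0e9
TARGET_TOPS = 40

# 96 and 2048 are the per-core MAC limits quoted in the list above.
for macs in (96, 2048):
    per_core = peak_tops(macs, 1, ASSUMED_CLOCK_HZ)
    cores = cores_needed(TARGET_TOPS, macs, ASSUMED_CLOCK_HZ)
    print(f"{macs} MACs/core: {per_core:.3f} TOPS per core, "
          f"{cores} cores for {TARGET_TOPS} TOPS")
```

Peak figures of this kind are upper bounds; sustained throughput depends on memory bandwidth and on how well each layer maps onto the MAC array, which is the sort of gap a per-layer profiler is meant to expose.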