DMP DIGITAL MEDIA PROFESSIONALS

Transformative NPU Driving the Future of Edge AI: ZIA™ A3000

Product Overview and Features

  • High-Performance NPU: Featuring a scalable NPU architecture with over 40 TOPS of compute power, capable of handling a wide range of inference tasks.
  • Multicore Parallel Processing: A multicore design that can concurrently process multiple diverse models.
  • Mixed Precision Computation: Adopts a mixed precision computation approach to balance inference performance and accuracy.
  • Extensive Data Format Support: Supports a broad range of data formats, including INT4/INT8 and FP4/FP8/FP16.
  • Comprehensive Model Support: Wide coverage of ONNX operators, enabling a diverse set of AI models.
  • Edge Computing Optimization: Optimized for edge computing with high performance, power efficiency, and area efficiency (PPA).
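
The trade-off behind mixed precision and the low-bit formats listed above can be illustrated with a small, generic experiment. The sketch below is not DMP's arithmetic or tooling: it assumes plain symmetric per-tensor quantization, and the function names `quantize_symmetric` and `mean_abs_error` and the random test weights are hypothetical. It only shows why a narrower format such as INT4 costs more accuracy than INT8 on the same data.

```python
import random


def quantize_symmetric(values, bits):
    """Round values onto a signed integer grid of the given width, then map back to floats.

    Assumption: symmetric per-tensor scaling, a common textbook scheme,
    not necessarily what any particular NPU implements.
    """
    qmax = 2 ** (bits - 1) - 1  # 127 for 8 bits, 7 for 4 bits
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    quantized = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return [q * scale for q in quantized]


def mean_abs_error(reference, approx):
    """Average absolute difference between the original and quantized values."""
    return sum(abs(r - a) for r, a in zip(reference, approx)) / len(reference)


random.seed(0)
# Hypothetical "layer weights": 4096 samples from a standard normal distribution.
weights = [random.gauss(0.0, 1.0) for _ in range(4096)]

err_int8 = mean_abs_error(weights, quantize_symmetric(weights, 8))
err_int4 = mean_abs_error(weights, quantize_symmetric(weights, 4))

print(f"INT8 mean abs error: {err_int8:.5f}")
print(f"INT4 mean abs error: {err_int4:.5f}")
```

A mixed-precision deployment would typically keep the layers that are most sensitive to this error in a wider format and run the rest in a narrower one.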

Details and Main Functions

  • Scalable MAC Support: Each core supports from 96 to 2048 MAC units, enabling advanced multicore processing capabilities.
  • High-Performance Inference: Delivers over 40 TOPS of inference performance, executing complex computations at high speed.
  • Edge Device Optimization: Designed for low power consumption and low cost while delivering high performance, power efficiency, and area efficiency (PPA). Compared with NPUs of a comparable class, the die size is around 50% smaller.
  • Mixed Precision Arithmetic: DMP's proprietary mixed-precision arithmetic unit combines high accuracy with high-speed processing.
  • Broad Data Format Support: In addition to FP16 and INT8, newer low-precision formats such as INT4 and FP4 are supported, addressing diverse application needs.
  • Proprietary Profiler for Performance and Accuracy Analysis: A custom profiler reports per-layer performance and per-layer accuracy degradation, making it easier to optimize both speed and precision.
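
As a rough way to relate the per-core MAC counts above to the 40 TOPS figure, the following back-of-the-envelope sketch may help. It rests on assumptions that are not stated on this page: a hypothetical 1 GHz clock, and the common convention that one MAC counts as two operations (a multiply and an add). The helper names `peak_tops` and `cores_needed` are illustrative, and the results do not describe any actual ZIA A3000 configuration.

```python
import math


def peak_tops(macs_per_core, cores, clock_hz, ops_per_mac=2):
    """Peak throughput in TOPS, counting each MAC as ops_per_mac operations per cycle."""
    return macs_per_core * cores * clock_hz * ops_per_mac / 1e12


def cores_needed(target_tops, macs_per_core, clock_hz):
    """Smallest core count whose combined peak throughput reaches target_tops."""
    return math.ceil(target_tops / peak_tops(macs_per_core, 1, clock_hz))


# Assumption: the clock frequency is not given in the source; 1 GHz is a placeholder.
ASSUMED_CLOCK_HZ = 1.0e9

for macs in (96, 2048):  # the smallest and largest per-core MAC counts listed above
    per_core = peak_tops(macs, 1, ASSUMED_CLOCK_HZ)
    n = cores_needed(40, macs, ASSUMED_CLOCK_HZ)
    print(f"{macs} MACs/core: {per_core:.3f} TOPS per core, {n} cores for 40 TOPS")
```

Real figures depend on the actual clock, on how operations are counted for each data format, and on achieved utilization, so this is only an order-of-magnitude check.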
