atc-model-converter

Install command:
npx skills add https://github.com/ascend-ai-coding/awesome-ascend-skills --skill atc-model-converter


ATC Model Converter

Complete guide for converting ONNX models to the Ascend AI processor-compatible OM format using the ATC (Ascend Tensor Compiler) tool.

Supported CANN Versions: 8.3.RC1, 8.5.0


⚠️ Critical Compatibility Requirements

Before starting, ensure your environment meets these requirements:

| Component | Requirement | Why |
|-----------|-------------|-----|
| Python | 3.7, 3.8, 3.9, or 3.10 | Python 3.11+ is incompatible with CANN 8.1.RC1 |
| NumPy | < 2.0 (e.g., 1.26.4) | CANN uses a deprecated NumPy API |
| ONNX | Opset 11 or 13 (for CANN 8.1.RC1) | Higher opset versions are not supported |

Quick Environment Setup:

# Create Python 3.10 environment (recommended)
conda create -n atc_py310 python=3.10 -y
conda activate atc_py310

# Install compatible dependencies
pip install torch torchvision ultralytics onnx onnxruntime
pip install "numpy<2.0" --force-reinstall
pip install decorator attrs absl-py psutil protobuf sympy
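The compatibility table above can also be checked programmatically. The helper below is a hypothetical sketch (it is not part of this skill's scripts) that flags a Python or NumPy version outside the supported range:

```python
# Hypothetical compatibility check mirroring the table above -- not part of
# the skill's scripts, just an illustration of the constraints.
import sys
import numpy as np

def check_compat(py_version, numpy_version):
    """Return a list of problems; an empty list means the environment looks OK."""
    problems = []
    if not ((3, 7) <= tuple(py_version[:2]) <= (3, 10)):
        problems.append(f"Python {py_version[0]}.{py_version[1]} outside the 3.7-3.10 range")
    if int(numpy_version.split(".")[0]) >= 2:
        problems.append(f"NumPy {numpy_version} >= 2.0 (CANN uses deprecated NumPy APIs)")
    return problems

print(check_compat(sys.version_info, np.__version__) or "Environment OK")
```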

⚠️ IMPORTANT: SoC Version Must Match Exactly

SoC version in ATC conversion must exactly match your target device!

# Get exact SoC version from your device
npu-smi info | grep Name
# Output: Name: 910B3 → Use: --soc_version=Ascend910B3
# Output: Name: 310P3 → Use: --soc_version=Ascend310P3

Common Error:

[ACL ERROR] EE1001: supported socVersion=Ascend910B3, 
but the model socVersion=Ascend910B

Fix: Use the exact SoC version reported by npu-smi info, not the generic family name!
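The mapping from the npu-smi output to the --soc_version flag can be automated. The helper below is hypothetical (not part of this skill's scripts) and assumes the `Name : <chip>` output format shown above:

```shell
# Hypothetical helper: build the --soc_version value from an npu-smi "Name" line.
# On a real device the input would come from: npu-smi info | grep -m1 Name
soc_version_from_name() {
    # Drop everything up to the last colon or space, then prepend "Ascend"
    chip=$(echo "$1" | sed 's/.*[: ]//')
    echo "Ascend${chip}"
}

soc_version_from_name "Name : 910B3"   # prints Ascend910B3
```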


Quick Start

# 1. Check your CANN version and environment
./scripts/check_env_enhanced.sh

# 2. Source the appropriate environment
source /usr/local/Ascend/ascend-toolkit/set_env.sh  # For 8.1.RC1/8.3.RC1
# OR
source /usr/local/Ascend/cann/set_env.sh            # For 8.5.0+

# 3. Basic ONNX to OM conversion
atc --model=model.onnx --framework=5 --output=output_model \
    --soc_version=Ascend910B3

# With input shape specification
atc --model=model.onnx --framework=5 --output=output_model \
    --soc_version=Ascend910B3 \
    --input_shape="input:1,3,640,640"

YOLO Model Conversion & Inference

YOLO Task Types & Output Formats

| Task | Model Example | ONNX Output | Post-processing |
|------|---------------|-------------|-----------------|
| Detection | yolo26n.pt | (1, 84, 8400) | decode + NMS |
| Pose | yolo26n-pose.pt | (1, 300, 57) | filter only |
| Segmentation | yolo26n-seg.pt | (1, 116, 8400) | decode + NMS + mask |
| OBB | yolo26n-obb.pt | (1, 15, 8400) | decode + NMS |

Note: YOLO ONNX outputs are raw feature maps, not processed detections. The yolo_om_infer.py script handles decode + NMS automatically.
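To make that note concrete, here is a minimal NumPy sketch of decode + NMS for the detection layout (1, 84, 8400), i.e. 4 box values (cx, cy, w, h) plus 80 class scores per anchor. The real yolo_om_infer.py implementation may differ in details such as class-aware NMS and letterbox rescaling:

```python
# Illustrative decode + NMS for a raw YOLO detection head of shape (1, 84, 8400).
import numpy as np

def decode_detections(raw, conf_thres=0.25, iou_thres=0.45):
    preds = raw[0].T                          # (8400, 84): one row per anchor
    boxes, scores = preds[:, :4], preds[:, 4:]
    cls_ids = scores.argmax(axis=1)
    confs = scores.max(axis=1)
    keep = confs > conf_thres                 # confidence filter
    boxes, cls_ids, confs = boxes[keep], cls_ids[keep], confs[keep]
    # (cx, cy, w, h) -> (x1, y1, x2, y2)
    xyxy = np.empty_like(boxes)
    xyxy[:, 0] = boxes[:, 0] - boxes[:, 2] / 2
    xyxy[:, 1] = boxes[:, 1] - boxes[:, 3] / 2
    xyxy[:, 2] = boxes[:, 0] + boxes[:, 2] / 2
    xyxy[:, 3] = boxes[:, 1] + boxes[:, 3] / 2
    # Greedy class-agnostic NMS, highest confidence first
    order = confs.argsort()[::-1]
    keep_idx = []
    while order.size:
        i = order[0]
        keep_idx.append(i)
        if order.size == 1:
            break
        xx1 = np.maximum(xyxy[i, 0], xyxy[order[1:], 0])
        yy1 = np.maximum(xyxy[i, 1], xyxy[order[1:], 1])
        xx2 = np.minimum(xyxy[i, 2], xyxy[order[1:], 2])
        yy2 = np.minimum(xyxy[i, 3], xyxy[order[1:], 3])
        inter = np.clip(xx2 - xx1, 0, None) * np.clip(yy2 - yy1, 0, None)
        area_i = (xyxy[i, 2] - xyxy[i, 0]) * (xyxy[i, 3] - xyxy[i, 1])
        areas = (xyxy[order[1:], 2] - xyxy[order[1:], 0]) * \
                (xyxy[order[1:], 3] - xyxy[order[1:], 1])
        iou = inter / (area_i + areas - inter + 1e-9)
        order = order[1:][iou <= iou_thres]   # drop boxes overlapping the winner
    return xyxy[keep_idx], confs[keep_idx], cls_ids[keep_idx]
```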

Step 1: Export YOLO to ONNX

from ultralytics import YOLO

model = YOLO('yolo26n.pt')  # or yolo26n-pose.pt, yolo26n-seg.pt, etc.

# Export with opset 11 for CANN 8.1.RC1 compatibility
model.export(format='onnx', imgsz=640, opset=11, simplify=True)

Step 2: Convert to OM

# Get your SoC version first
npu-smi info | grep Name

# Convert
atc --model=yolo26n.onnx --framework=5 --output=yolo26n \
    --soc_version=Ascend910B3 \
    --input_shape="images:1,3,640,640"

Step 3: Run Inference

# Detection (default)
python3 scripts/yolo_om_infer.py --model yolo26n.om \
    --source image.jpg --task detect --output result.jpg

# Pose estimation
python3 scripts/yolo_om_infer.py --model yolo26n-pose.om \
    --source image.jpg --task pose --output result_pose.jpg

# Segmentation
python3 scripts/yolo_om_infer.py --model yolo26n-seg.om \
    --source image.jpg --task segment --output result_seg.jpg

# Oriented Bounding Box
python3 scripts/yolo_om_infer.py --model yolo26n-obb.om \
    --source image.jpg --task obb --output result_obb.jpg

YOLO Python API

from yolo_om_infer import YoloOMInferencer, draw_results

# Initialize for detection
inferencer = YoloOMInferencer(
    model_path="yolo26n.om",
    task="detect",  # or "pose", "segment", "obb"
    device_id=0,
    conf_thres=0.25,
    iou_thres=0.45
)

# Run inference
result = inferencer("image.jpg")

# Access results
print(f"Detections: {result['num_detections']}")
print(f"Inference time: {result['timing']['infer_ms']:.1f}ms")

for det in result['detections']:
    print(f"  {det['cls_name']}: {det['conf']:.2f} at {det['box']}")

# Cleanup
inferencer.free_resource()

For detailed YOLO guide, see YOLO_GUIDE.md.


OM Model Inference (General)

After converting your model to OM format, use ais_bench for Python inference.

Install ais_bench

# Download pre-built wheel packages (recommended)
# See: https://gitee.com/ascend/tools/blob/master/ais-bench_workload/tool/ais_bench/README.md

# Example for Python 3.10, aarch64:
wget https://aisbench.obs.myhuaweicloud.com/packet/ais_bench_infer/0.0.2/ait/aclruntime-0.0.2-cp310-cp310-linux_aarch64.whl
wget https://aisbench.obs.myhuaweicloud.com/packet/ais_bench_infer/0.0.2/ait/ais_bench-0.0.2-py3-none-any.whl

pip3 install ./aclruntime-*.whl ./ais_bench-*.whl

Basic Inference

# Print model info
python3 scripts/infer_om.py --model model.om --info

# Run inference with random input
python3 scripts/infer_om.py --model model.om --input-shape "1,3,640,640"

# Run inference with actual input
python3 scripts/infer_om.py --model model.om --input test.npy --output result.npy

Python API

from ais_bench.infer.interface import InferSession
import numpy as np

session = InferSession(device_id=0, model_path="model.om")
print("Inputs:", [(i.name, i.shape) for i in session.get_inputs()])
print("Outputs:", [(o.name, o.shape) for o in session.get_outputs()])

input_data = np.random.randn(1, 3, 640, 640).astype(np.float32)
outputs = session.infer([input_data], mode='static')

print(f"Inference time: {session.summary().exec_time_list[-1]:.3f} ms")
session.free_resource()

See INFERENCE.md for detailed ais_bench usage.


Precision Comparison

Verify conversion accuracy by comparing ONNX (CPU) vs OM (NPU) outputs.

# Basic comparison
python3 scripts/compare_precision.py --onnx model.onnx --om model.om --input test.npy

# With custom tolerances
python3 scripts/compare_precision.py --onnx model.onnx --om model.om --input test.npy \
    --atol 1e-3 --rtol 1e-2

| Metric | Description | Good Value |
|--------|-------------|------------|
| cosine_similarity | 1.0 = identical | > 0.99 |
| max_abs_diff | Maximum absolute difference | < 1e-3 (FP32) |
| is_close | Pass/fail based on atol/rtol | True |
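The three metrics can be reproduced with plain NumPy; the function below is an illustrative sketch rather than the actual compare_precision.py implementation:

```python
# Sketch of the comparison metrics from the table above (illustrative only).
import numpy as np

def compare_outputs(onnx_out, om_out, atol=1e-3, rtol=1e-2):
    a = onnx_out.ravel().astype(np.float64)
    b = om_out.ravel().astype(np.float64)
    cosine = float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))
    return {
        "cosine_similarity": cosine,                    # 1.0 = identical
        "max_abs_diff": float(np.max(np.abs(a - b))),   # worst-case error
        "is_close": bool(np.allclose(a, b, atol=atol, rtol=rtol)),
    }
```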

CANN Version Guide

| CANN Version | Environment Path | Notes |
|--------------|------------------|-------|
| 8.3.RC1 | /usr/local/Ascend/ascend-toolkit/set_env.sh | Standard installation |
| 8.5.0+ | /usr/local/Ascend/cann/set_env.sh | Must install matching ops package |

# Auto-detect CANN version
./scripts/setup_env.sh
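As a sketch of what such auto-detection might do (the real setup_env.sh may differ), one could probe the candidate paths from the table and source the first one that exists:

```shell
# Hypothetical sketch of CANN environment auto-detection: return the first
# env file that exists, trying the 8.5.0+ layout before the toolkit layout.
find_cann_env() {
    for env_file in "$@"; do
        if [ -f "$env_file" ]; then
            echo "$env_file"
            return 0
        fi
    done
    return 1
}

env_file=$(find_cann_env \
    /usr/local/Ascend/cann/set_env.sh \
    /usr/local/Ascend/ascend-toolkit/set_env.sh) \
    && . "$env_file" \
    || echo "No CANN environment found"
```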

Core Parameters

| Parameter | Required | Description | Example |
|-----------|----------|-------------|---------|
| --model | Yes | Input ONNX model path | --model=resnet50.onnx |
| --framework | Yes | Framework type (5 = ONNX) | --framework=5 |
| --output | Yes | Output OM model path | --output=resnet50 |
| --soc_version | Yes | Must match device exactly | --soc_version=Ascend910B3 |
| --input_shape | Optional | Input tensor shapes | --input_shape="input:1,3,224,224" |
| --precision_mode | Optional | Precision mode | --precision_mode=force_fp16 |

For complete parameters, see PARAMETERS.md.


SoC Version Reference

| Device | SoC Version | How to Check |
|--------|-------------|--------------|
| Atlas 910B3 | Ascend910B3 | npu-smi info \| grep Name |
| Atlas 310P | Ascend310P1/P3 | npu-smi info \| grep Name |
| Atlas 200I DK A2 | Ascend310B4 | npu-smi info \| grep Name |

Always verify with npu-smi info – do not assume the version!


Troubleshooting

Error: Opname not found in model

# Verify input names
python3 scripts/get_onnx_info.py model.onnx

# Use correct name in conversion
atc --model=model.onnx --input_shape="correct_name:1,3,224,224" ...

Error: Invalid soc_version

# Check actual chip version - must be EXACT match
npu-smi info | grep Name
# Use: Ascend + Name value (e.g., Ascend910B3, not Ascend910B)

Conversion Too Slow

export TE_PARALLEL_COMPILER=16
atc --model=model.onnx ...

YOLO Detection Results Look Wrong

  • Ensure you’re using the correct --task parameter
  • Detection models need decode + NMS (the script handles this)
  • Pose models output the top-300 detections (no NMS needed)

See FAQ.md for more troubleshooting.


Resources

scripts/

Conversion & Environment:

  • check_env_enhanced.sh – ⭐ Comprehensive compatibility check
  • get_onnx_info.py – Inspect ONNX model inputs/outputs
  • setup_env.sh – Auto-setup CANN environment with SoC warning
  • convert_onnx.sh – Batch conversion helper

Inference & Testing:

  • yolo_om_infer.py – ⭐ End-to-end YOLO inference (detect/pose/segment/obb)
  • infer_om.py – ⭐ Python inference for OM models using ais_bench
  • compare_precision.py – ⭐ Compare ONNX vs OM output precision

references/