onnx-spliter

When using NPU for inference with complex models, it may not be possible to convert the entire model into a format fully supported by the NPU due to insufficient operator support. As a result, the model needs to be split, with the computationally intensive parts converted to a format supported by the NPU, while the remaining parts run on the CPU. This repository is designed to facilitate that process, allowing users to easily split their ONNX models as needed.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
convert.py		convert.py
part1.onnx		part1.onnx
part2.onnx		part2.onnx
requirements.txt		requirements.txt
yolo11n.onnx		yolo11n.onnx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

onnx-spliter

About

Releases

Packages

Languages

License

LJ-Hao/onnx-spliter

Folders and files

Latest commit

History

Repository files navigation

onnx-spliter

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages