feat: add onnxslim #2258

inisis · 2025-05-12T12:37:59Z

What does this PR do?

This pr introduces onnxslim, a new high performance onnx optimization tool.

for gpt2 model, we can reduce number of operators and model size without losing accuracy, and applies to many other models.

optimum-cli export onnx --model gpt2 gpt2_onnx/ --slim

+------------------------------+------------------------------------------+------------------------------------------+
|             Add              |                   114                    |                   114                    |
|             Cast             |                    34                    |                    3                     |
|            Concat            |                   177                    |                   113                    |
|           Constant           |                   1054                   |                    0                     |
|       ConstantOfShape        |                    4                     |                    2                     |
|             Div              |                    37                    |                    25                    |
|            Equal             |                    2                     |                    2                     |
|            Expand            |                    2                     |                    2                     |
|            Gather            |                   186                    |                   185                    |
|             Gemm             |                    48                    |                    48                    |
|             Less             |                    1                     |                    1                     |
|            MatMul            |                    25                    |                    25                    |
|             Mul              |                    99                    |                    97                    |
|             Pow              |                    37                    |                    37                    |
|            Range             |                    1                     |                    1                     |
|          ReduceMean          |                    50                    |                    50                    |
|           Reshape            |                   150                    |                   145                    |
|            Shape             |                   250                    |                   100                    |
|            Slice             |                    76                    |                    14                    |
|           Softmax            |                    12                    |                    12                    |
|            Split             |                    12                    |                    12                    |
|             Sqrt             |                    61                    |                    25                    |
|           Squeeze            |                    52                    |                    2                     |
|             Sub              |                    27                    |                    27                    |
|             Tanh             |                    12                    |                    12                    |
|          Transpose           |                    61                    |                    61                    |
|          Unsqueeze           |                   271                    |                   187                    |
|            Where             |                    5                     |                    5                     |
+------------------------------+------------------------------------------+------------------------------------------+
|          Model Size          |                475.12 MB                 |                475.02 MB                 |
+------------------------------+------------------------------------------+------------------------------------------+
|         Elapsed Time         |                                        9.57 s                                       |
+------------------------------+------------------------------------------+------------------------------------------+

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Who can review?

@IlyasMoutawwakil @xenova

optimum/commands/export/onnx.py

optimum/exporters/onnx/convert.py

IlyasMoutawwakil

looks clean ! we will need some basic testing for this feature, maybe in test_export_cli.py

tests/exporters/onnx/test_export_cli.py

IlyasMoutawwakil · 2025-05-12T13:23:12Z

@inisis the table is very cool ! is it possible to get the "diff" i.e. how many operators were removed per operator type and overall reduction in number of ops ?

inisis · 2025-05-12T13:30:47Z

@inisis the table is very cool ! is it possible to get the "diff" i.e. how many operators were removed per operator type and overall reduction in number of ops ?

@IlyasMoutawwakil Currently, we don't support that, but this feature can be added, and we provides api for doing so, and there will be different colors in terminal, green means less.

HuggingFaceDocBuilderDev · 2025-05-12T13:37:37Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…or tests

inisis · 2025-05-13T00:48:59Z

@IlyasMoutawwakil It runs very slow locally, can I use your CI server to do it. I simply enable onnxslim in test_export_cli.py, and if all passed I will remove it.

IlyasMoutawwakil · 2025-05-13T08:08:54Z

can I use your CI server to do it

The CI runs on basic Github runners.

inisis · 2025-05-13T08:38:04Z

@IlyasMoutawwakil Currently, CI test will raise No module named 'onnxslim', so where to put the dependency, a possible solution is that when user set slim to True, optimum automatically installs onnxslim if an import error happened.

IlyasMoutawwakil · 2025-05-13T10:24:22Z

@inisis please add it to "tests" extras, and add a check when slim=True, something like if not is_onnx_slim_available(): raise ...

xenova · 2025-05-13T16:58:24Z

Just want to add my voice in support of onnxslim: it's an amazing library I use for all my Transformers.js models 🤗. So, having this built-in to Optimum could be really helpful too! 👍

Happy to provide a review when PR is ready too.

optimum/exporters/onnx/convert.py

Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>

tests/exporters/onnx/test_export_cli.py

setup.py

optimum/exporters/onnx/convert.py

tests/exporters/onnx/test_export_cli.py

Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>

optimum/exporters/onnx/convert.py

tests/exporters/onnx/test_export_cli.py

optimum/exporters/onnx/convert.py

IlyasMoutawwakil

LGTM ! thanks for iterating on this !

feat: add onnxslim

0c38796

IlyasMoutawwakil reviewed May 12, 2025

View reviewed changes

optimum/commands/export/onnx.py Outdated Show resolved Hide resolved

IlyasMoutawwakil reviewed May 12, 2025

View reviewed changes

optimum/exporters/onnx/convert.py Outdated Show resolved Hide resolved

IlyasMoutawwakil reviewed May 12, 2025

View reviewed changes

inisis added 2 commits May 12, 2025 21:06

fix style and rename simplify to slim

d7f971c

add onnxslim tests

69acb5b

IlyasMoutawwakil reviewed May 12, 2025

View reviewed changes

tests/exporters/onnx/test_export_cli.py Outdated Show resolved Hide resolved

fix format

92a3401

add slim args for main_export and make slim true in test_export_cli f…

fe1d005

…or tests

fix format

91cf8c4

add is_onnxslim_available func and add onnxslim to test dependency

065d9b0

refactor format and pin onnxslim to 0.1.53

a3513c0

IlyasMoutawwakil reviewed May 15, 2025

View reviewed changes

optimum/exporters/onnx/convert.py Outdated Show resolved Hide resolved

Update optimum/exporters/onnx/convert.py

662a9ca

Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>

IlyasMoutawwakil reviewed May 16, 2025

View reviewed changes

tests/exporters/onnx/test_export_cli.py Outdated Show resolved Hide resolved

IlyasMoutawwakil reviewed May 16, 2025

View reviewed changes

setup.py Outdated Show resolved Hide resolved

IlyasMoutawwakil reviewed May 16, 2025

View reviewed changes

optimum/exporters/onnx/convert.py Outdated Show resolved Hide resolved

IlyasMoutawwakil reviewed May 16, 2025

View reviewed changes

tests/exporters/onnx/test_export_cli.py Outdated Show resolved Hide resolved

inisis and others added 2 commits May 16, 2025 18:21

Update tests/exporters/onnx/test_export_cli.py

dbb9414

Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>

use glob and refactor tests

2e12460

IlyasMoutawwakil reviewed May 16, 2025

View reviewed changes

optimum/exporters/onnx/convert.py Outdated Show resolved Hide resolved

Update optimum/exporters/onnx/convert.py

8c62b74

IlyasMoutawwakil reviewed May 16, 2025

View reviewed changes

tests/exporters/onnx/test_export_cli.py Outdated Show resolved Hide resolved

IlyasMoutawwakil and others added 4 commits May 16, 2025 12:38

Update tests/exporters/onnx/test_export_cli.py

361f00d

remove glob

0cf4e9d

fix tests error

21c9c2c

add slim to _onnx_export

1876aa7

IlyasMoutawwakil reviewed May 16, 2025

View reviewed changes

optimum/exporters/onnx/convert.py Outdated Show resolved Hide resolved

Update optimum/exporters/onnx/convert.py

a899c37

IlyasMoutawwakil approved these changes May 16, 2025

View reviewed changes

IlyasMoutawwakil merged commit d1af494 into huggingface:main May 16, 2025
24 checks passed

inisis mentioned this pull request May 31, 2025

fix: wav2vec2_conformer shape inference bug inisis/OnnxSlim#113

Merged

feat: add onnxslim #2258

feat: add onnxslim #2258

Uh oh!

Conversation

inisis commented May 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

Uh oh!

Uh oh!

IlyasMoutawwakil left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

IlyasMoutawwakil commented May 12, 2025

Uh oh!

inisis commented May 12, 2025

Uh oh!

HuggingFaceDocBuilderDev commented May 12, 2025

Uh oh!

inisis commented May 13, 2025

Uh oh!

IlyasMoutawwakil commented May 13, 2025

Uh oh!

inisis commented May 13, 2025

Uh oh!

IlyasMoutawwakil commented May 13, 2025

Uh oh!

xenova commented May 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

IlyasMoutawwakil left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

inisis commented May 12, 2025 •

edited

Loading

xenova commented May 13, 2025 •

edited

Loading

IlyasMoutawwakil left a comment •

edited

Loading