Using Turbo for named entity recognition: no noticeable speedup in testing #245
Comments
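Maybe the input is too short? Do you get any speedup with onnxrt?
That looks fairly normal. Run the test several more times so the warmup overhead is excluded, and try setting the number of OMP threads.
OK. Does the number of OMP threads default to the number of CPUs on the machine?
OK, thank you very much!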
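The advice above (repeat the measurement, exclude warmup, set the OMP thread count) could be sketched roughly as follows. This is only an illustration, not code from the project: `benchmark` and `dummy_inference` are hypothetical names, the thread count of 4 is an arbitrary assumption, and `dummy_inference` stands in for the real model call.

```python
import os

# Assumption: OpenMP-based libraries read OMP_NUM_THREADS when they are loaded,
# so set it before importing them. The value 4 is an arbitrary example.
os.environ.setdefault("OMP_NUM_THREADS", "4")

import statistics
import time


def benchmark(fn, warmup=5, iters=20):
    """Call fn `warmup` times untimed, then return `iters` timed latencies in seconds."""
    for _ in range(warmup):
        fn()  # warmup runs are discarded
    timings = []
    for _ in range(iters):
        start = time.perf_counter()
        fn()
        timings.append(time.perf_counter() - start)
    return timings


def dummy_inference():
    # Hypothetical stand-in for the actual model forward pass.
    return sum(i * i for i in range(10_000))


if __name__ == "__main__":
    timings = benchmark(dummy_inference)
    print(f"mean:   {statistics.mean(timings) * 1e3:.3f} ms")
    print(f"median: {statistics.median(timings) * 1e3:.3f} ms")
```

To compare the two backends, pass each model's inference call to `benchmark` with the same inputs and compare the medians, which are less sensitive to occasional slow runs than the mean.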
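The PyTorch code was rewritten in C++ with operator fusion added; matrix multiplication uses MKL, and the other operations are parallelized with OpenMP.
Please note in the README that the current version of Turbo is actually just calling onnxruntime. Building the binary is quite a hassle, and the code does not even use it; one might as well call onnxruntime directly.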
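Hi, feifeibear
Following the example you provided (bert_for_sequence_classification_example.py), I wrote a test program for named entity recognition. When I ran it, the speedup from Turbo was not significant. Are there any caveats I should be aware of?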