Set vectorization
howetuft committed Jan 4, 2025
1 parent 5623593 commit dbdabc3
Showing 7 changed files with 23 additions and 8 deletions.
2 changes: 2 additions & 0 deletions .github/workflows/wheels.yml
@@ -28,6 +28,8 @@ jobs:
#os: [ubuntu-latest]
#python-minor: [ 11]
#os: [windows-latest]
#python-minor: [ 11]
#os: [macos-13]
exclude:
- os: macos-14
python-minor: 8
10 changes: 6 additions & 4 deletions conan-profiles/conan-profile-Linux-X64
@@ -8,9 +8,11 @@ compiler.version=12
compiler.cppstd={{ os.getenv("CXX_VERSION") }}
compiler.libcxx=libstdc++11

[conf]
openvdb/*:tools.cmake.cmaketoolchain:extra_variables={ 'USE_CCACHE':'OFF' }

[options]
openvdb/*:simd=AVX
embree3/*:avx512=True
embree3/*:avx=True

[conf]
tools.build:cflags+=["-ftree-vectorize", "-march=x86-64-v3", "-O3"]
tools.build:cxxflags+=["-ftree-vectorize", "-march=x86-64-v3", "-O3"]
openvdb/*:tools.cmake.cmaketoolchain:extra_variables={ 'USE_CCACHE':'OFF' }
3 changes: 2 additions & 1 deletion conan-profiles/conan-profile-Windows-X64
@@ -10,7 +10,8 @@ compiler.runtime=dynamic

[conf]
tools.env.virtualenv:powershell=False
tools.build:cxxflags=["/openmp", "/MP", "/DEBUG:NONE"]
tools.build:cxxflags=["/openmp", "/MP", "/DEBUG:NONE", "/arch:AVX", "/O2"]
embree3/*:tools.build:cxxflags=["/openmp", "/MP", "/DEBUG:NONE", "/O2"]

[options]
openvdb/*:simd=AVX
1 change: 1 addition & 0 deletions conan-profiles/conan-profile-common
@@ -16,3 +16,4 @@ yasm/*

[options]
oidn/*:shared=True
embree3/*:with_tbb=False
1 change: 1 addition & 0 deletions conan-profiles/conan-profile-macOS-ARM64
@@ -37,3 +37,4 @@ embree3/*:neon2x=True
[buildenv]
CXX=clang++
CC=clang
&:tools.build:cxxflags=["-O3"]
5 changes: 4 additions & 1 deletion conan-profiles/conan-profile-macOS-X64
@@ -13,9 +13,12 @@ compiler.libcxx=libc++
minizip-ng/*:with_zlib=True
minizip-ng/*:with_libcomp=False
openvdb/*:simd=AVX
embree3/*:avx512=True
embree3/*:avx=True
embree3/*:shared=True

[buildenv]
CXX=clang++
CC=clang

[conf]
&:tools.build:cxxflags+=["-ftree-vectorize", "-march=x86-64-v3", "-O3", "-mno-avx2"]
9 changes: 7 additions & 2 deletions doc/notes.md
@@ -91,7 +91,12 @@ author).
tried to have it minimal and it's 10.13. For the delocation to succeed, we also
have to set `MACOSX_DEPLOYMENT_TARGET` environment variable to 10.13.

## Vectorization
For Intel, we stick to x86-64-v3 (Haswell, 2013) and go no further. On all
platforms except macOS Intel, we rely on the `-O` optimization option.
On macOS Intel, the GitHub runner's CPU is not standard, so we have to disable
the AVX2 extensions; otherwise `pyluxcore.so` cannot be tested ("Illegal
instruction").
See https://github.com/ggerganov/whisper.cpp/issues/358 and the output of
`sysctl -a machdep.cpu`.


we
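
The Vectorization note in `doc/notes.md` points to `sysctl -a machdep.cpu` as the way to see why AVX2 has to be disabled on the macOS Intel runner. The sketch below shows one way that output could be checked for a required instruction set before running tests. It is not part of the commit: the function names, the `SAMPLE` text, and the assumption that feature lists appear under keys ending in `features` (for example `machdep.cpu.leaf7_features`) are illustrative and should be verified against the real runner output.

```python
# Hypothetical helper, not part of the repository: scan `sysctl -a machdep.cpu`
# style output for CPU feature names.

def parse_cpu_features(sysctl_text):
    """Collect feature tokens from every 'key: value' line whose key ends in 'features'."""
    features = set()
    for line in sysctl_text.splitlines():
        key, sep, value = line.partition(":")
        # Assumption: feature lists are space-separated and live under keys
        # such as 'machdep.cpu.features' or 'machdep.cpu.leaf7_features'.
        if sep and key.strip().endswith("features"):
            features.update(value.split())
    return features


def missing_features(sysctl_text, required=("AVX2",)):
    """Return the required feature names that the output does not list, sorted."""
    return sorted(set(required) - parse_cpu_features(sysctl_text))


# Made-up sample resembling a CPU that lacks AVX2 (illustrative only).
SAMPLE = """\
machdep.cpu.brand_string: Example CPU
machdep.cpu.features: FPU SSE4.2 AVX1.0 F16C
machdep.cpu.leaf7_features: BMI1 SMEP
"""
```

With the sample above, `missing_features(SAMPLE)` returns `['AVX2']`, which matches the situation the note describes: a binary built with AVX2 instructions would fail with "Illegal instruction" on such a machine, hence `-mno-avx2` in the macOS Intel profile.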
