Releases: EnzymeAD/Enzyme.jl
Releases · EnzymeAD/Enzyme.jl
v0.10.5
Enzyme v0.10.5
Closed issues:
- Custom pullbacks for PP AD example (#26)
- Crash with Tullio (#99)
- Correctly deal with array alloc of
eltype(A) = Any
(#118) - Union of Return (#154)
- TypeAnalysis fails on Julia 1.8 LLVM 13 (#171)
- Crash on multidimensional setindex (#175)
- Understanding a huge crash report (#181)
- Global leads to wrong gradient (#185)
- Kernel crashing issue (#192)
- Union leads to illegal type analysis (#236)
- Segfault when testing package on Julia 1.8 on a mac. (#250)
- Attributor pass broken on 1.8-dev/ LLVM13 (#267)
foldl
withCUDA.jl
results injl_lazy_load_and_lookup
(#272)- Assertion `getMinSignedBits() <= 64 && "Too many bits for int64_t"' failed. (#273)
- Verification or llvm inliner error? (#276)
- Out of place broadcasting creates unexpected results (#278)
- Segfault when used with
Optim.jl
(#298) - MethodError on dynamic
Val
construction (#309) - Support reshaping via
jl_array_reshape
(#310) - Segfault when using with BLAS (#315)
- Nested broadcast not working (#338)
- MethodError when trying to autodiff through a complex matrix (#359)
- Differentiating trigonometric functions with KernelGradients and CUDAKernels (#375)
- Jacobian producing unexpected results (#377)
- Problems for CUDA.jl kernel autodiff (#378)
- Mutation of gradient input in MatMul example for Square Matrices (#382)
- limitations of CUDA kernels differentiation (#383)
- guess_activity(::Type{Union{}}) is ambiguous (#386)
- Correctness bug with 2D array offsets (#388)
- enzyme doesn't work on Apple silicon (#394)
- Segfault with dense matrix multiplication (#391)
- Segfault with sparse matrix multiplication (#393)
- sum() sometimes returns incorrect result (#397)
- exp10 causes illegal type analyses error (#401)
- differentiation works on cpu, fails on gpu, for exponentiation (#412)
- memory error on second invocation of derivative GPU kernel (#413)
- v1.6 only LLVM-level error (#414)
- Concatenating AD for multiple GPU kernels (#415)
- can't
push!
twice (#417) - atomic operations error (#421)
- Uncaught Enzyme exception crashes Julia 1.8 (#423)
- Support for tan operation (#424)
- sqrt in CUDA kernel gives error (#426)
- Uncaught LLVM-level error from neural SDE (#429)
- Push Undefined Memory Error (#430)
- Gradients for struct fields are zero when a Bool field is present (#431)
BatchDuplicated
inForward
breaks forsinc
(#434)- BLAS fallback has short-circuiting zero path (#445)
- Gradient fails to deal with GPU movements (#448)
- Gradient through acos errors (#449)
- Pullback for array function gives zero gradients with forward mode (#450)
orig
is not defined (#454)
Merged pull requests:
- Mark aliasing property of error (#104) (@wsmoses)
- Set default activity of any (#187) (@wsmoses)
- Inactive arg copy (#279) (@wsmoses)
- Add fast exp (#339) (@wsmoses)
- Add ijl box variations (#374) (@wsmoses)
- Add preserve nvvm (#379) (@wsmoses)
- Add array reshape rev (#380) (@wsmoses)
- Add runtime activity for fwd (#385) (@wsmoses)
- Box tutorial documentation (#387) (@swilliamson7)
- Fix getwidth bug (#390) (@wsmoses)
- fix cuda tests (#395) (@wsmoses)
- Custom allocator support (#396) (@vchuravy)
- Handle TSCtx on 1.9 (#398) (@vchuravy)
- Use literate for box example (#399) (@vchuravy)
- Use gc_alloc_obj for tapes containing Julia objects (#400) (@vchuravy)
- Handle exp10 (#403) (@wsmoses)
- Better unimlemented vector assertions (#405) (@wsmoses)
- Fix movebefore (#406) (@wsmoses)
- Use Anonymous Struct for sret (#408) (@vchuravy)
- docs for Sparse Array Structure Initializer (#409) (@ryanstoner1)
- Copy over correct IR dumping code (#410) (@pchintalapudi)
- Bump JLL (#411) (@wsmoses)
- support double push (#420) (@vchuravy)
- Rules for sinh and cosh (#425) (@sriharikrishna)
- Detect integer form of eqtable (#427) (@wsmoses)
- Fix nightly (#433) (@wsmoses)
- Fix zero'ing in augmented forward (#436) (@wsmoses)
- Reuse julia type rules (#437) (@wsmoses)
- Dynamic val fix (#438) (@wsmoses)
- Handle not concrete type trees (#439) (@wsmoses)
- Remove preprocessed funcs & fix typeof (#440) (@wsmoses)
- Fix nightly pt2 (#441) (@wsmoses)
- Bump jll (#442) (@wsmoses)
- Add sinc (#444) (@wsmoses)
- Fix typo
delf
->delF
(#446) (@avik-pal) - Add stacktraces to errors (#452) (@wsmoses)
- Fix orig name bug (#455) (@wsmoses)
- Add acos (#456) (@wsmoses)
- Cleanup CI (#458) (@vchuravy)
v0.10.4
Enzyme v0.10.4
Closed issues:
v0.10.3
Enzyme v0.10.3
Merged pull requests:
v0.10.2
Enzyme v0.10.2
Closed issues:
- Differentiate
dsyevr_64_
(#64) - Crash with Tullio (#99)
- Enable parallel flag as option within autodiff (#106)
dgemm_64
support missing (#113)- dgbtrs support missing (#136)
sgemv
support missing (#162)- Unsupported
Val
in CUDA kernel - Enzyme v0.10.0 - GPUCompiler v0.15 (#358) - Cannot use Enzyme on clean Julia installation (#360)
- Wrong gradients (mutating buffers) (#362)
- Unsupported call through a literal pointer (call to
__libc_malloc
) (#367)
Merged pull requests:
v0.10.1
v0.10.0
Enzyme v0.10.0
Closed issues:
- Analysis error ("could not deduce type of integer") on GEP when the stars align correctly (#311)
Symbols not found: [ cblas_xerbla ]
(#312)- Segmentation fault during NVPTX emission due to debuginfo (#316)
- Forward over Reverse Crash (#320)
- Support for BigFloat (#348)
- AbstractDifferentiation (#349)
- Problem in Windows Subsystem Linux (WSL2) (#353)
Merged pull requests:
- Generalize activity sets (#296) (@wsmoses)
- merge autodiff and fwddiff API (#306) (@vchuravy)
- Handle xerbla (#317) (@wsmoses)
- add tests to Jacobian using LinAlg (#319) (@vchuravy)
- Fix metadata (#323) (@wsmoses)
- Fix debug info2 (#324) (@wsmoses)
- Fix scope bug for arg (#325) (@wsmoses)
- Fix in to haskey (#327) (@wsmoses)
- Add log1p (#328) (@wsmoses)
- Re-enable tests (#329) (@wsmoses)
- Move validation up (#330) (@vchuravy)
- Add type error and array fix (#332) (@wsmoses)
- Rand (#333) (@wsmoses)
- Fix type analysis on 1.8 (#335) (@wsmoses)
- Fix jl_arraycopy on phi (#336) (@wsmoses)
- Ensure that loading CUDA doesn't break things on 1.6 (#341) (@vchuravy)
- Rebase: Move compilation out of threadsfor (#342) (@vchuravy)
- Unbreak CI until JLL bump (#344) (@vchuravy)
- CompatHelper: bump compat for GPUCompiler to 0.16, (keep existing compat) (#345) (@github-actions[bot])
- Jacobian returns one array (#351) (@vchuravy)
- Remove tuple from gradient(::Forward) without chunking (#352) (@vchuravy)
- Bump jll version (#355) (@vchuravy)
v0.9.6
Enzyme v0.9.6
Merged pull requests: