jiashenC/attention-gym

Attention Gym

class 101

Implement the attention operator using basic PyTorch functions, matching the behavior of PyTorch's MultiheadAttention (torch.nn.MultiheadAttention).
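As a rough illustration of what this exercise involves (a minimal sketch, not the repository's solution), the snippet below rebuilds self-attention from basic tensor operations and checks it against `torch.nn.MultiheadAttention`. The function name `multihead_attention`, the tensor sizes, and the restriction to unmasked, dropout-free self-attention with `batch_first=True` are all assumptions made for the example.

```python
import math

import torch
import torch.nn.functional as F


def multihead_attention(x, mha):
    """Self-attention over x of shape (batch, seq, embed), reusing mha's weights.

    Hypothetical helper: assumes no masks, no dropout, and query = key = value.
    """
    B, T, E = x.shape
    H = mha.num_heads
    D = E // H  # per-head dimension

    # One packed linear layer produces queries, keys, and values.
    qkv = F.linear(x, mha.in_proj_weight, mha.in_proj_bias)  # (B, T, 3E)
    q, k, v = qkv.chunk(3, dim=-1)

    # Split the embedding dimension into heads: (B, H, T, D).
    q = q.view(B, T, H, D).transpose(1, 2)
    k = k.view(B, T, H, D).transpose(1, 2)
    v = v.view(B, T, H, D).transpose(1, 2)

    # Scaled dot-product attention per head.
    scores = q @ k.transpose(-2, -1) / math.sqrt(D)  # (B, H, T, T)
    weights = torch.softmax(scores, dim=-1)
    out = weights @ v  # (B, H, T, D)

    # Merge heads back and apply the output projection.
    out = out.transpose(1, 2).reshape(B, T, E)
    return F.linear(out, mha.out_proj.weight, mha.out_proj.bias)


torch.manual_seed(0)
mha = torch.nn.MultiheadAttention(embed_dim=16, num_heads=4, batch_first=True)
x = torch.randn(2, 5, 16)

expected, _ = mha(x, x, x, need_weights=False)
actual = multihead_attention(x, mha)
print(torch.allclose(actual, expected, atol=1e-5))
```

Reusing the module's own projection weights keeps the comparison about the attention arithmetic itself rather than about parameter initialization.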

class 102

Implement the attention operator in CUDA.

class 201

Implement the flash attention operator using basic PyTorch functions (emulation for understanding).
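One way to picture the emulation (a sketch under stated assumptions, not the repository's code) is to process keys and values block by block while keeping a running row maximum and softmax denominator, so the full score matrix is never materialized. The function name `flash_attention_emulated`, the block sizes, and the single-head, unmasked 2-D inputs are choices made for this example only.

```python
import math

import torch


def flash_attention_emulated(q, k, v, block_q=4, block_k=4):
    """Tiled attention with an online softmax for q, k, v of shape (seq, dim).

    Hypothetical helper: single head, no masking, no dropout.
    """
    T, D = q.shape
    scale = 1.0 / math.sqrt(D)
    out = torch.zeros_like(q)

    for qs in range(0, T, block_q):
        q_blk = q[qs:qs + block_q] * scale
        rows = q_blk.shape[0]
        m = torch.full((rows, 1), float("-inf"), dtype=q.dtype)  # running row max
        l = torch.zeros(rows, 1, dtype=q.dtype)  # running softmax denominator
        acc = torch.zeros(rows, D, dtype=q.dtype)  # unnormalized output

        for ks in range(0, k.shape[0], block_k):
            k_blk = k[ks:ks + block_k]
            v_blk = v[ks:ks + block_k]

            s = q_blk @ k_blk.T  # scores for this tile only
            m_new = torch.maximum(m, s.max(dim=-1, keepdim=True).values)
            p = torch.exp(s - m_new)
            # Rescale earlier partial results to the new running max.
            correction = torch.exp(m - m_new)
            l = l * correction + p.sum(dim=-1, keepdim=True)
            acc = acc * correction + p @ v_blk
            m = m_new

        # Normalize once per query block, after all key blocks are seen.
        out[qs:qs + block_q] = acc / l

    return out


torch.manual_seed(0)
q, k, v = (torch.randn(10, 8) for _ in range(3))

reference = torch.softmax(q @ k.T / math.sqrt(q.shape[1]), dim=-1) @ v
tiled = flash_attention_emulated(q, k, v)
print(torch.allclose(tiled, reference, atol=1e-5))
```

The sequence length of 10 is deliberately not a multiple of the block size, so the last block in each loop is shorter than the others and the slicing logic is exercised.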

class 202

Implement the flash attention operator in CUDA.
