Implement an attention operator using basic PyTorch functions, matching the behavior of torch.nn.MultiheadAttention.
Implement the attention operator in CUDA.
Implement the flash attention operator using basic PyTorch functions (an emulation, for understanding the algorithm).
Implement the flash attention operator in CUDA.
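The first task (plain-PyTorch attention) could start from a sketch like the one below: scaled dot-product attention built only from basic tensor ops, checked against PyTorch's reference kernel. The function name `attention` and the optional boolean `mask` argument are illustrative choices, not part of any existing API.

```python
import math
import torch

def attention(q, k, v, mask=None):
    # Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V,
    # using only basic PyTorch ops. `mask` (if given) is a boolean
    # tensor; True positions are excluded from attention.
    d = q.size(-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(d)
    if mask is not None:
        scores = scores.masked_fill(mask, float("-inf"))
    weights = torch.softmax(scores, dim=-1)
    return weights @ v
```

One way to validate such a sketch is to compare it against `torch.nn.functional.scaled_dot_product_attention` (available in PyTorch 2.0+) on random inputs.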
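For the flash attention emulation task, a minimal PyTorch sketch of the core idea is shown below: process K/V in blocks and maintain a running row-max and row-sum (the online-softmax rescaling trick), so the full attention matrix is never materialized. The function name and the `block_size` parameter are assumptions for illustration; a real CUDA version would additionally tile the query dimension and keep blocks in shared memory.

```python
import math
import torch

def flash_attention_emulated(q, k, v, block_size=2):
    # Emulates the FlashAttention loop structure in plain PyTorch:
    # iterate over K/V blocks, tracking a running max (row_max) and
    # normalizer (row_sum) per query row, rescaling partial outputs
    # whenever a new block raises the running max.
    scale = 1.0 / math.sqrt(q.size(-1))
    n = k.size(-2)
    out = torch.zeros_like(q)
    row_max = torch.full(q.shape[:-1], float("-inf"))
    row_sum = torch.zeros(q.shape[:-1])
    for start in range(0, n, block_size):
        kb = k[..., start:start + block_size, :]
        vb = v[..., start:start + block_size, :]
        scores = q @ kb.transpose(-2, -1) * scale      # (..., Lq, B)
        new_max = torch.maximum(row_max, scores.max(dim=-1).values)
        correction = torch.exp(row_max - new_max)      # rescale old stats
        p = torch.exp(scores - new_max.unsqueeze(-1))  # block probabilities
        row_sum = row_sum * correction + p.sum(dim=-1)
        out = out * correction.unsqueeze(-1) + p @ vb
        row_max = new_max
    return out / row_sum.unsqueeze(-1)
```

Because the rescaling is exact, the result should match ordinary softmax attention up to floating-point tolerance regardless of `block_size`.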