[Infer] Add head_dim=96 dispatch for block attention #724
