
Fix version compatibility issue with transformers>4.34.0 for flash-attention2 patch #2655

Open
wants to merge 1 commit into main

Conversation

@Trangle (Contributor) commented Nov 8, 2023

Why are these changes needed?

  1. The rotary_emb logic changed in transformers==4.35.0; this fixes the compatibility of the flash-attention2 patch.
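
A minimal sketch of the kind of version gate this needs (not the exact diff of this PR; the branch bodies are placeholders):

import transformers
from packaging import version

# True if the installed transformers has the post-4.35 rotary_emb behaviour.
TRANSFORMERS_GE_4_35 = version.parse(transformers.__version__) >= version.parse("4.35.0")

if TRANSFORMERS_GE_4_35:
    ...  # follow the new rotary_emb call convention (4.35+)
else:
    ...  # follow the legacy rotary_emb call convention (<= 4.34)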

Related issue number (if applicable)

#2648

Checks

  • I've run format.sh to lint the changes in this PR.
  • I've included any doc changes needed.
  • I've made sure the relevant tests are passing (if applicable).

@merrymercy (Member) commented Nov 9, 2023

Does this work for older versions of transformers?

@Trangle (Contributor, Author) commented Nov 9, 2023

Does this work for older versions of transformers?

Tested in 4.34 and 4.35; 4.35 is the version where the rotary_emb logic changed.

@Trangle (Contributor, Author) commented Nov 13, 2023

Does this work for older versions of transformers?

Also tested in 4.30.

However, I have one question: don't we need to expand (repeat) the kv heads back to the full number of attention heads here?

Roughly after line # 72:

if getattr(self, "num_key_value_groups", None):
    k = repeat_kv(k, self.num_key_value_groups)
    v = repeat_kv(v, self.num_key_value_groups)
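
For context, repeat_kv in transformers' Llama modeling code roughly does the following (paraphrased sketch, assuming tensors shaped [batch, num_kv_heads, seq_len, head_dim]):

import torch

def repeat_kv(hidden_states: torch.Tensor, n_rep: int) -> torch.Tensor:
    # Expand each key/value head n_rep times so grouped-query attention
    # key/value tensors match the number of query heads.
    batch, num_kv_heads, slen, head_dim = hidden_states.shape
    if n_rep == 1:
        return hidden_states
    hidden_states = hidden_states[:, :, None, :, :].expand(
        batch, num_kv_heads, n_rep, slen, head_dim
    )
    return hidden_states.reshape(batch, num_kv_heads * n_rep, slen, head_dim)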

@Niyx52094

When will this feature be ready for users? When I use this file with transformers 4.35.0 to fine-tune Llama-2 7B on 8x A100s, I get an "out of memory" error.

@Trangle (Contributor, Author) commented Nov 30, 2023

When will this feature be ready for users? When I use this file with transformers 4.35.0 to fine-tune Llama-2 7B on 8x A100s, I get an "out of memory" error.

This is an issue with transformers from 4.35 onward. Try again after reducing the batch size. It has not been fixed in 4.36 yet; you can pin a version below 4.35, such as 4.34.
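
For anyone hitting this, one hedged workaround (a suggested pin, not verified in this thread) is to constrain the library below 4.35, e.g.:

pip install "transformers>=4.34,<4.35"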
