how long convolution ensures causal language modeling #47

Open
0205090923 opened this issue Sep 5, 2024 · 5 comments

Comments

@0205090923

Hello, I would like to know how the long convolution ensures causality for language modeling. I couldn't find any explicit padding applied in the code.

@0205090923
Author

I noticed in another response that you mentioned zero padding is applied to the kernel. Where is this step performed in the code? Looking forward to your reply.
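As general background for this question (a sketch of a common pattern, not taken from this repository's code): FFT routines usually accept a transform length `n` and zero-pad shorter inputs implicitly, so an FFT-based convolution can be causal without any visible padding call. The NumPy example below assumes a length-`L` input and kernel; the helper names `fft_conv` and `direct_causal_conv` are hypothetical. It compares an FFT of size `2 * L` (linear, causal) against an FFT of size `L` (circular, where later positions wrap around into earlier outputs).

```python
import numpy as np


def fft_conv(u, k, n):
    """Convolve u with k using FFTs of size n; return the first len(u) outputs."""
    L = len(u)
    # rfft(..., n=n) zero-pads inputs shorter than n, so no explicit pad call is needed.
    y = np.fft.irfft(np.fft.rfft(u, n=n) * np.fft.rfft(k, n=n), n=n)
    return y[:L]


def direct_causal_conv(u, k):
    """Reference: y[t] = sum_{j <= t} k[j] * u[t - j], which only uses past inputs."""
    L = len(u)
    return np.array([sum(k[j] * u[t - j] for j in range(t + 1)) for t in range(L)])


rng = np.random.default_rng(0)
L = 8
u = rng.standard_normal(L)
k = rng.standard_normal(L)

y_ref = direct_causal_conv(u, k)
y_padded = fft_conv(u, k, n=2 * L)   # FFT size 2L: linear convolution, causal
y_circular = fft_conv(u, k, n=L)     # FFT size L: circular convolution, wraps around

# Perturb only the last input position and see which outputs change.
u_future = u.copy()
u_future[-1] += 1.0
y_padded_future = fft_conv(u_future, k, n=2 * L)
y_circular_future = fft_conv(u_future, k, n=L)

print("padded FFT matches causal reference:", np.allclose(y_padded, y_ref))
print("earlier outputs unchanged with 2L FFT:",
      np.allclose(y_padded[:-1], y_padded_future[:-1]))
print("earlier outputs unchanged with L FFT:",
      np.allclose(y_circular[:-1], y_circular_future[:-1]))
```

With the `2 * L` transform, changing the last input leaves all earlier outputs untouched. With the length-`L` transform the change leaks into earlier positions, which is the non-causal behavior. Whether this repository follows the same pattern is for the maintainers to confirm.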

@0205090923
Author

Hello, I noticed that long_conv_kernel.py sets self.L = L*2 if not causal else L. Does that mean we should use L = L for the causal case? This seems inconsistent with the explanations elsewhere. I'm confused; could you kindly explain how causality works for Longconv?

@DanFu09
Contributor

DanFu09 commented Sep 5, 2024 via email

@0205090923
Author

So we should set L = 2 * L for the causal case? Could you kindly explain how this works for causal modeling? It seems no explicit padding is applied in the code. Thank you.

@DanFu09
Contributor

DanFu09 commented Sep 5, 2024 via email
