Generating longer videos with lower resolution using AR model #82

Open
zlenyk opened this issue Jan 24, 2025 · 0 comments
Labels
question Further information is requested

Comments

zlenyk commented Jan 24, 2025

Hello! Quick question about my understanding of the current architecture of the autoregressive model for video generation.

The current limit on the number of generated frames comes from "max_seq_len", i.e. how many tokens we keep as context when generating new tokens. We shouldn't generate more tokens than that, or we start losing tokens from the beginning of the context.
But doesn't that mean that if we halve the video resolution in each dimension (so 4x fewer tokens per frame), we should be able to generate 4x longer videos? If I'm not mistaken, it's just a memory constraint.
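
To make the arithmetic concrete, here is a minimal sketch of the token budget I have in mind. The numbers are assumptions for illustration only: the 16x spatial compression factor, the 12800-token context, and the two resolutions are hypothetical, not values taken from the repository, and the helper names are made up.

```python
# Hypothetical token-budget arithmetic; none of these values come from the repo.

def tokens_per_latent_frame(height: int, width: int, spatial_factor: int = 16) -> int:
    """Tokens for one latent frame, assuming the tokenizer shrinks each
    spatial dimension by `spatial_factor` (an assumed value)."""
    return (height // spatial_factor) * (width // spatial_factor)


def max_latent_frames(max_seq_len: int, height: int, width: int,
                      spatial_factor: int = 16) -> int:
    """How many whole latent frames fit inside a context of max_seq_len tokens."""
    return max_seq_len // tokens_per_latent_frame(height, width, spatial_factor)


MAX_SEQ_LEN = 12800  # assumed context length, for illustration only

full_res = max_latent_frames(MAX_SEQ_LEN, height=640, width=1024)
half_res = max_latent_frames(MAX_SEQ_LEN, height=320, width=512)

print(f"full resolution: {full_res} latent frames")
print(f"half resolution: {half_res} latent frames")
print(f"ratio: {half_res / full_res:.1f}x")
```

This only counts tokens against the context budget: halving both height and width cuts tokens per frame by 4, so 4x as many frames fit in the same "max_seq_len".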

Is there an error in that logic?
Thank you!

@mharrim mharrim added the question Further information is requested label Jan 24, 2025