Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gunzip hg38.knownGene.gtf.gz not in gzip format #267

Open
nush320 opened this issue Jun 13, 2023 · 1 comment
Open

gunzip hg38.knownGene.gtf.gz not in gzip format #267

nush320 opened this issue Jun 13, 2023 · 1 comment

Comments

@nush320
Copy link

nush320 commented Jun 13, 2023

Hi, I am on Chapter VI-Data Sources - Downloading complete genomes.
The following command from "Get the GTF file" is not working: gunzip hg38.knownGene.gtf.gz

It says its not in gzip format.

When I check the file type its showing "ASCII text".

The command after that works: cat hg38.knownGene.gtf | wc -l if I replace hg38.knownGene.gtf as hg38.knownGene.gtf.gz but it prints out 3626358 instead of 3091269.

Thanks.

@ialbert
Copy link
Member

ialbert commented Jun 14, 2023

strangely enough, and for reasons I can't explain the file hg38.knownGene.gtf.gz is not a gzip file, it is already unpacked. I'm quite certain it did not use to be like that and that it used to be a gzipped file ...

I think the server automatically unpacks that file for us upon request.

I would have to investigate what happens there, evidently not the correct behavior there ... oh well ... bioinformatics

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants