Running GGD recipe: hg38 dbsnp 154-20210112 URL transformed to HTTPS due to an HST

thanks for figuring out! I've updated the recipe: <a href="https://github.com

Thanks Serhiy! I believe that also <a href="https://github.com/chapm

Hi Peter <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard

Error during install - dbsnp issue about bcbio-nextgen HOT 15 OPEN

jchien-ucd commented on July 17, 2024

Error during install - dbsnp issue

from bcbio-nextgen.

Comments (15)

wangpenhok commented on July 17, 2024

I met this problem, too. The reason is that version 154 is not available from 'ftp.ncbi.nih.gov' anymore. Instead, they offer 156 as the latest version. so you can edit the GGD recipe manually by updating the URL and version.

from bcbio-nextgen.

jchien-ucd commented on July 17, 2024

Hi Perter, Where can I find the recipe file? When I edited the ggd-run.sh in the genome folder under txtmp, it was overwritten during the upgrade. So, I must not editing the right file at the right place. Jeremy From: Peter Wang ***@***.***> Sent: Tuesday, February 14, 2023 6:52 PM To: bcbio/bcbio-nextgen ***@***.***> Cc: Jeremy R Chien ***@***.***>; Author ***@***.***> Subject: Re: [bcbio/bcbio-nextgen] Error during install (Issue #3699) I met this problem, too. The reason is that version 154 is not available from 'ftp.ncbi.nih.gov' anymore. Instead, they offer 156 as the latest version. so you can edit the GGD recipe manually by updating the URL and version. - Reply to this email directly, view it on GitHub<https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fbcbio%2Fbcbio-nextgen%2Fissues%2F3699%23issuecomment-1430677115&data=05%7C01%7Cjrchien%40ucdavis.edu%7Cab16cd8ca0174bf123ad08db0eff8bd8%7Ca8046f6466c04f009046c8daf92ff62b%7C0%7C0%7C638120262957530077%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=%2F%2BwvszvQnlWBIv%2B%2F6fBnrsr%2F%2FecmCQdDmBGXuVd%2Bqtk%3D&reserved=0>, or unsubscribe<https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAOCCTWG2IY3VPDU3TCQKHFTWXRADJANCNFSM6AAAAAAU4GHZDQ&data=05%7C01%7Cjrchien%40ucdavis.edu%7Cab16cd8ca0174bf123ad08db0eff8bd8%7Ca8046f6466c04f009046c8daf92ff62b%7C0%7C0%7C638120262957530077%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=5m6oN52%2Bp%2FBm4I%2BmySJj%2B0s%2FVJDz2eibcIbrZ6LZfMg%3D&reserved=0>. You are receiving this because you authored the thread.Message ID: ***@***.******@***.***>> **CONFIDENTIALITY NOTICE** This e-mail communication and any attachments are for the sole use of the intended recipient and may contain information that is confidential and privileged under state and federal privacy laws. If you received this e-mail in error, be aware that any unauthorized use, disclosure, copying, or distribution is strictly prohibited. If you received this e-mail in error, please contact the sender immediately and destroy/delete all copies of this message.

from bcbio-nextgen.

jchien-ucd commented on July 17, 2024

Hi Peter, I found where the recipe file was, and after the edit it is installed correctly. The issue is resolved. Jeremy From: Peter Wang ***@***.***> Sent: Tuesday, February 14, 2023 6:52 PM To: bcbio/bcbio-nextgen ***@***.***> Cc: Jeremy R Chien ***@***.***>; Author ***@***.***> Subject: Re: [bcbio/bcbio-nextgen] Error during install (Issue #3699) I met this problem, too. The reason is that version 154 is not available from 'ftp.ncbi.nih.gov' anymore. Instead, they offer 156 as the latest version. so you can edit the GGD recipe manually by updating the URL and version. - Reply to this email directly, view it on GitHub<https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fbcbio%2Fbcbio-nextgen%2Fissues%2F3699%23issuecomment-1430677115&data=05%7C01%7Cjrchien%40ucdavis.edu%7Cab16cd8ca0174bf123ad08db0eff8bd8%7Ca8046f6466c04f009046c8daf92ff62b%7C0%7C0%7C638120262957530077%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=%2F%2BwvszvQnlWBIv%2B%2F6fBnrsr%2F%2FecmCQdDmBGXuVd%2Bqtk%3D&reserved=0>, or unsubscribe<https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAOCCTWG2IY3VPDU3TCQKHFTWXRADJANCNFSM6AAAAAAU4GHZDQ&data=05%7C01%7Cjrchien%40ucdavis.edu%7Cab16cd8ca0174bf123ad08db0eff8bd8%7Ca8046f6466c04f009046c8daf92ff62b%7C0%7C0%7C638120262957530077%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=5m6oN52%2Bp%2FBm4I%2BmySJj%2B0s%2FVJDz2eibcIbrZ6LZfMg%3D&reserved=0>. You are receiving this because you authored the thread.Message ID: ***@***.******@***.***>> **CONFIDENTIALITY NOTICE** This e-mail communication and any attachments are for the sole use of the intended recipient and may contain information that is confidential and privileged under state and federal privacy laws. If you received this e-mail in error, be aware that any unauthorized use, disclosure, copying, or distribution is strictly prohibited. If you received this e-mail in error, please contact the sender immediately and destroy/delete all copies of this message.

from bcbio-nextgen.

lindakohn commented on July 17, 2024

I encountered this problem too, located and edited all 154 entries to 156 in <path_to-bcbio>/tmpbcbio-install/cloudbiolinux/ggd-recipes//dbsnp.yaml

from bcbio-nextgen.

naumenko-sa commented on July 17, 2024

thanks for figuring out!
I've updated the recipe:
https://github.com/chapmanb/cloudbiolinux/blob/master/ggd-recipes/hg38/dbsnp.yaml

from bcbio-nextgen.

lindakohn commented on July 17, 2024

Thanks Serhiy!

I believe that also https://github.com/chapmanb/cloudbiolinux/blob/master/ggd-recipes/hg19/dbsnp.yaml needs to be changed accordingly

/Linda

from bcbio-nextgen.

naumenko-sa commented on July 17, 2024

Thanks, Linda!
I've updated the hg19 recipe!
SN

from bcbio-nextgen.

wangpenhok commented on July 17, 2024

Thanks Serhiy!

I think the gnomad genome recipe should be updated according to the new field names of gnomad files as well

from bcbio-nextgen.

naumenko-sa commented on July 17, 2024

Hi Peter @wangpenhok !

I see 3.1 vcf sites are still there?
https://console.cloud.google.com/storage/browser/gcp-public-data--gnomad/release/3.1/vcf/genomes

What is the issue you are encountering?

from bcbio-nextgen.

wangpenhok commented on July 17, 2024

Hi Serhiy@naumenko-sa

The vcf sites are okay, but the file downloaded from $gnomad_fields_to_keep_url may be outdated.

As I had mentioned here, [https://github.com/chapmanb/cloudbiolinux/pull/400#issuecomment-1475735769],

The majority of field names of the latest vcf files are not compatible with names in the file gnomad_fields_to_keep,
Here are some headers from the latest vcfs:
##INFO=<ID=nhomalt-sas-XY,Number=A,Type=Integer,Description="Count of homozygous individuals in XY samples of South Asian ancestry"> ##INFO=<ID=AC-fin-XX,Number=A,Type=Integer,Description="Alternate allele count for XX samples of Finnish ancestry"> ##INFO=<ID=AN-fin-XX,Number=1,Type=Integer,Description="Total number of alleles in XX samples of Finnish ancestry"> ##INFO=<ID=AF-fin-XX,Number=A,Type=Float,Description="Alternate allele frequency in XX samples of Finnish ancestry"> ##INFO=<ID=nhomalt-fin-XX,Number=A,Type=Integer,Description="Count of homozygous individuals in XX samples of Finnish ancestry"> ##INFO=<ID=AC-nfe-XX,Number=A,Type=Integer,Description="Alternate allele count for XX samples of Non-Finnish European ancestry"> ##INFO=<ID=AN-nfe-XX,Number=1,Type=Integer,Description="Total number of alleles in XX samples of Non-Finnish European ancestry"> ##INFO=<ID=AF-nfe-XX,Number=A,Type=Float,Description="Alternate allele frequency in XX samples of Non-Finnish European ancestry"> ##INFO=<ID=nhomalt-nfe-XX,Number=A,Type=Integer,Description="Count of homozygous individuals in XX samples of Non-Finnish European ancestry"> ##INFO=<ID=AC-sas,Number=A,Type=Integer,Description="Alternate allele count for samples of South Asian ancestry"> ##INFO=<ID=AN-sas,Number=1,Type=Integer,Description="Total number of alleles in samples of South Asian ancestry"> ##INFO=<ID=AF-sas,Number=A,Type=Float,Description="Alternate allele frequency in samples of South Asian ancestry"> ##INFO=<ID=nhomalt-sas,Number=A,Type=Integer,Description="Count of homozygous individuals in samples of South Asian ancestry"> ##INFO=<ID=AC-oth-XX,Number=A,Type=Integer,Description="Alternate allele count for XX samples of Other ancestry"> ##INFO=<ID=AN-oth-XX,Number=1,Type=Integer,Description="Total number of alleles in XX samples of Other ancestry"> ##INFO=<ID=AF-oth-XX,Number=A,Type=Float,Description="Alternate allele frequency in XX samples of Other ancestry"> ##INFO=<ID=nhomalt-oth-XX,Number=A,Type=Integer,Description="Count of homozygous individuals in XX samples of Other ancestry"> ##INFO=<ID=AC-amr-XX,Number=A,Type=Integer,Description="Alternate allele count for XX samples of Latino ancestry"> ##INFO=<ID=AN-amr-XX,Number=1,Type=Integer,Description="Total number of alleles in XX samples of Latino ancestry"> ##INFO=<ID=AF-amr-XX,Number=A,Type=Float,Description="Alternate allele frequency in XX samples of Latino ancestry"> ##INFO=<ID=nhomalt-amr-XX,Number=A,Type=Integer,Description="Count of homozygous individuals in XX samples of Latino ancestry"> ##INFO=<ID=AC-XX,Number=A,Type=Integer,Description="Alternate allele count for XX samples"> ##INFO=<ID=AN-XX,Number=1,Type=Integer,Description="Total number of alleles in XX samples">

It is obvious that _ has been replaced with -. In addition, male/female seems to be XY/XX now.

I solved this problem by manually update the gnomad_fields_to_keep file, but this could be annoying for those who install bcbio relying on the pipeline.

Would you please update the file gnomad_fields_to_keep into the latest version?
Thanks~

from bcbio-nextgen.

btesson-lysarc commented on July 17, 2024

Hello, I think there is still a small issue with the hg38 recipe for dbsnp, the version number should also be updated:
version=GCF_000001405.38 should be version=GCF_000001405.40

Thanks a lot for all the work put into bcbio.

from bcbio-nextgen.

naumenko-sa commented on July 17, 2024

Thanks @wangpenhok !
I've updated the fields file and the recipe.
I have not changed the recipe version to avoid gnomad updates for users.

@matthdsm I think you compiled that list years ago, could you please review?
Smaller populations are not in Gnomad anymore.

from bcbio-nextgen.

matthdsm commented on July 17, 2024

@matthdsm I think you compiled that list years ago, could you please review?

Hi Serhiy,

It seems you already recompiled the list? Do you need me to do something more?

from bcbio-nextgen.

tt9756067 commented on July 17, 2024

Thanks Serhiy!

I believe that also https://github.com/chapmanb/cloudbiolinux/blob/master/ggd-recipes/hg19/dbsnp.yaml needs to be changed accordingly

/Linda

Hi Serhiy,

I got the same problem but for GRCh37. Would you mind also updating the dbsnp.yaml for GRCh37?

from bcbio-nextgen.

Roscutts commented on July 17, 2024

Just to note I have had the same problem now with GCF_000001405.38 should be version=GCF_000001405.40 but I see it has already been noted in the comments

from bcbio-nextgen.

Error during install - dbsnp issue about bcbio-nextgen HOT 15 OPEN

Comments (15)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent