
via Johns Hopkins University
Creation of Johns Hopkins-led team allows worldwide scientific collaboration for studies of human genetics and health
Harnessing the power of genomics to find risk factors for major diseases or search for relatives requires analyzing huge numbers of genomes, a costly and time-consuming task. A team co-led by a Johns Hopkins University computer scientist has leveled the playing field by creating a cloud-based platform that grants researchers easy access to one of the world’s largest genomics databases.
Known as AnVIL (Genomic Data Science Analysis, Visualization, and Informatics Lab-space), the new platform gives any researcher with an Internet connection access to thousands of analysis tools, patient records, and more than 300,000 genomes. The work, a project of the National Human Genome Research Institute (NHGRI), appears today in Cell Genomics.
“AnVIL is inverting the model of genomics data sharing, offering unprecedented new opportunities for science by connecting researchers and datasets in new ways and promising to enable exciting new discoveries,” said project co-leader Michael Schatz, Bloomberg Distinguished Professor of computer science and biology at Johns Hopkins.
Typically, genomic analysis starts with researchers downloading massive amounts of data from centralized warehouses to their own data centers, a process that is not only time-consuming, inefficient, and expensive, but also makes collaborating with researchers at other institutions difficult. Genetic risk factors for ailments such as cancer or cardiovascular disease are often very subtle, so researchers must analyze thousands of patients’ genomes to discover new associations. The raw data for a single human genome comprises about 40 GB, so downloading thousands of genomes to conduct such research can take several days to several weeks.
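To make the scale concrete, the download time can be estimated with simple arithmetic. The sketch below is illustrative only: the 40 GB per genome figure comes from the article, while the sustained link speed of 1 gigabit per second and the function name `download_days` are assumptions chosen for the example, not figures from the AnVIL team.

```python
# Rough estimate of how long it takes to download raw genome data.
# GB_PER_GENOME comes from the article; the link speed is an assumption.

GB_PER_GENOME = 40          # approximate raw data per human genome (article figure)
SECONDS_PER_DAY = 86_400


def download_days(num_genomes, link_gbit_per_s, gb_per_genome=GB_PER_GENOME):
    """Return the days needed to download num_genomes at a sustained link speed.

    Uses decimal units: 1 GB = 8 gigabits. Real transfers are usually slower
    because of protocol overhead and shared networks.
    """
    total_gigabits = num_genomes * gb_per_genome * 8
    seconds = total_gigabits / link_gbit_per_s
    return seconds / SECONDS_PER_DAY


if __name__ == "__main__":
    # Hypothetical study sizes at an assumed sustained 1 Gbit/s.
    for n in (1_000, 5_000, 10_000):
        print(f"{n:>6} genomes: {download_days(n, 1.0):.1f} days")
```

Under these assumptions, 1,000 genomes take roughly four days and 5,000 genomes take more than two weeks, which is in line with the range described above.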
“AnVIL will be transformative for institutions of all sizes, especially smaller institutions that don’t have the resources to build their own data centers. It is our hope that AnVIL levels the playing field, so that everyone has equal access to make discoveries,” Schatz said.
In addition, studies that integrate data collected at multiple institutions require each institution to download its own copy while ensuring that patient-data security is maintained. This challenge is expected to become even greater in the future, as researchers embark on ever-larger studies requiring the analysis of hundreds of thousands to millions of genomes at once.
“Connecting to AnVIL remotely eliminates the need for these massive downloads and saves on the overhead,” Schatz said. “Instead of painfully moving data to researchers, we allow researchers to effortlessly move to the data in the cloud. It also makes sharing datasets much easier so that data can be connected in new ways to find new associations, and it simplifies a lot of computing issues, like providing strong encryption and privacy for patient datasets.”
AnVIL also provides researchers with several major analysis tools, including Galaxy, developed in part at Johns Hopkins, along with other popular tools such as R/Bioconductor, Jupyter notebooks, WDLs, Gen3, and Dockstore to support both interactive analysis and large-scale batch computing. Collectively, these tools allow researchers to tackle even the largest studies without having to build out their own computing environments.
Researchers from all over the world currently use the AnVIL platform to study a variety of genetic diseases, including autism spectrum disorders, cardiovascular disease, and epilepsy. Schatz’s team, part of the Telomere-to-Telomere Consortium, used it to reanalyze thousands of human genomes with the new reference genome to discover more than 1 million new variants.
Already, the AnVIL team has collected petabytes of data (1 petabyte equals one million GB) from several of the largest NHGRI projects, including hundreds of thousands of genomes from the Genotype-Tissue Expression, Centers for Mendelian Genetics, and Centers for Common Disease Genomics projects, with plans to host many more projects in the near future.
Original Article: New cloud-based platform opens genomics data to all
More from: Johns Hopkins University | Broad Institute | Harvard University | Vanderbilt University | University of Chicago | Oregon Health & Science University | Yale School of Medicine | University of California Santa Cruz | Roswell Park Comprehensive Cancer Center | Pennsylvania State University | City University of New York | Carnegie Institution for Science | Washington University in St. Louis