Compiling Big Data in a Human-Centric Way

When a group of researchers in the Undiagnosed Disease Network at Baylor College of Medicine realized they were spending days combing through databases searching for information regarding gene variants, they decided to do something about it. By creating MARRVEL (Model organism Aggregated Resources for Rare Variant ExpLoration) they are now able to help not only their own lab but also researchers everywhere search databases all at once and in a matter of minutes.

This collaborative effort among Baylor, the Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital and Harvard Medical School is described in the latest online edition of the American Journal of Human Genetics.

Big data search engine
"One big problem we have is that tens of thousands of human genome variants and phenotypes are spread throughout a number of databases, each one with their own organization and nomenclature that aren't easily accessible," said Julia Wang, an M.D./Ph.D. candidate in the Medical Scientist Training Program at Baylor and a McNair Student Scholar in the Bellen lab, as well as first author on the publication. "MARRVEL is a way to assess the large volume of data, providing a concise summary of the most relevant information in a rapid user-friendly format."

MARRVEL displays information from OMIM, ExAC, ClinVar, Geno2MP, DGV, and DECIPHER, all separate databases to which researchers across the globe have contributed, sharing tens of thousands of human genome variants and phenotypes. Since there is not a set standard for recording this type of information, each one has a different approach and searching each database can yield results organized in different ways. Similarly, decades of research in various model organisms, from mouse to yeast, are also stored in their own individual databases with different sets of standards.

Dr. Zhandong Liu, assistant professor in pediatrics - neurology at Baylor, a member of the Jan and Dan Duncan Neurological Research Institute at Texas Children's and co-corresponding author on the publication, explains that MARRVEL acts similar to an internet search engine.

"This program helps to collate the information in a common language, drawing parallels and putting it together on one single page. Our program curates model organism specific databases to concurrently display a concise summary of the data," Liu said.

Supporting researchers
A user can first search for a gene or variant, Wang explains. Results may include what is known about this gene overall, whether or not that gene is associated with a disease, whether it is highly occurring in the general population and how it is affected by certain mutations.

"MARRVEL helps to facilitate analysis of human genes and variants by cross-disciplinary integration of 18 million records so we can speed up the discovery process through computation," Liu said. "All this information is basically inaccessible unless researchers can access it efficiently and apply it to their own work to find causes, treatments and hopefully identify new diseases."

Collaboration
This project started as a necessity for the Model Organism Screening Center for the Undiagnosed Disease Network at Baylor, but as it grew, the group began reaching out to researchers in different disciplines for feedback on how MARRVEL might benefit them.

"This program is just the start. I think our tool is going to be a model for us to help clinicians and basic scientists more efficiently use the information already publicly available," Wang said. "It will help us understand and process all of the different mutations that researchers are discovering."

"The most exciting part is how this project is bringing so many different researchers together," Liu said. "We are working with labs we might not have normally collaborated with, trying to put together a puzzle of all this data."

Both Wang and Liu are thankful to the contributions from the genetics communities allowing them access to the databases as they developed MARRVEL.

Julia Wang, Rami Al-Ouran, Yanhui Hu, Seon-Young Kim, Ying-Wooi Wan, Michael F. Wangler, Shinya Yamamoto, Hsiao-Tuan Chao, Aram Comjean, Stephanie E. Mohr, Norbert Perrimon, Zhandong Liu, Hugo J. Bellen.
MARRVEL: Integration of Human and Model Organism Genetic Resources to Facilitate Functional Annotation of the Human Genome.
The American Journal of Human Genetics, doi: 10.1016/j.ajhg.2017.04.010.

Most Popular Now

Virtual Humans Help Aspiring Doctors Lea…

For medical student Katie Goldrath, the first time delivering difficult health news came when she had to tell a young woman named Robin and her mom, Delmy, that Robin had...

Read more

'Smart Contact Lens Sensor' for Diabetic…

A recent study, affiliated with Ulsan National Institute of Science and Technology (UNIST), South Korea, has proposed the possibility of in situ human health monitoring simply by wearing a contact...

Read more

2017 eHealth Competition Awards SilverCl…

The eHealth Competition is an initiative that rewards the best digital health solutions produced by SMEs across Europe. This edition has been supported by Astrazeneca, Ship2B and Younoodle. This competition...

Read more

ECDC Report Shows Strong Potential of E-…

Twenty one EU/EEA countries have developed or are in the process of developing systems to digitally record information about vaccination, according to a new "ECDC survey report on immunisation information...

Read more

Devicare Raises 3 Million Euros in its C…

Devicare, a company specializing in innovative medical devices for chronic home care patients under Remote Patient Monitoring (RPM), has closed out a seed round of 3 million euros. This funding...

Read more

Successful Conclusion to conhIT 2017, th…

25 - 27 April 2017, Berlin, Germany. As conhIT, which took place from 25 to 27 April in Berlin, came to a close, 500 exhibitors, 9,500 participants from around the world...

Read more

Compiling Big Data in a Human-Centric Wa…

When a group of researchers in the Undiagnosed Disease Network at Baylor College of Medicine realized they were spending days combing through databases searching for information regarding gene variants, they...

Read more

Scopis Introduces the First Mixed-Realit…

Scopis, a company specializing in surgical navigation and medical augmented and mixed reality technologies, announced today the launch of its newest development, the Holographic Navigation Platform for use in surgery...

Read more

IMS MAXIMS Launches Vital Signs Mobile A…

Clinical technology specialist IMS MAXIMS will be launching its fully integrated vital signs application at eHealth Week on 3rd and 4th May in Olympia, London. Delegates will be the first...

Read more

Immunisation Information Systems in the …

Immunisation information systems (IIS) are defined as confidential, population-based, computerised databases that record all immunisation doses administered by participating providers to persons residing within a given geopolitical area. At the...

Read more

Abbott Announces CE Mark and First Use o…

Abbott (NYSE: ABT) today announced CE Mark and first use of the new Confirm RxTM Insertable Cardiac Monitor (ICM), the world's first smartphone compatible ICM that will help physicians identify...

Read more

Using a Smartphone to Screen for Male In…

More than 45 million couples worldwide grapple with infertility, but current standard methods for diagnosing male infertility can be expensive, labor-intensive and require testing in a clinical setting. Cultural and...

Read more