MARV: a tool for genome-wide multi-phenotype analysis of rare variants

Main Authors: Kaakinen, Marika, Mägi, Reedik, Fischer, Krista, Heikkinen, Jani, Järvelin, Marjo-Riitta, Morris, Andrew P., Prokopenko, Inga
Format: Article
Terbitan: , 2017
Subjects:
Online Access: https://zenodo.org/record/344076
ctrlnum 344076
fullrecord <?xml version="1.0"?> <dc schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><creator>Kaakinen, Marika</creator><creator>M&#xE4;gi, Reedik</creator><creator>Fischer, Krista</creator><creator>Heikkinen, Jani</creator><creator>J&#xE4;rvelin, Marjo-Riitta</creator><creator>Morris, Andrew P.</creator><creator>Prokopenko, Inga</creator><date>2017-02-16</date><description>Background: Genome-wide association studies have enabled identification of thousands of loci for hundreds of traits. Yet, for most human traits a substantial part of the estimated heritability is unexplained. This and recent advances in technology to produce high-dimensional data cost-effectively have led to method development beyond standard common variant analysis, including single-phenotype rare variant and multi-phenotype common variant analysis, with the latter increasing power for locus discovery and providing suggestions of pleiotropic effects. However, there are currently no optimal methods and tools for the combined analysis of rare variants and multiple phenotypes.Results: We propose a user-friendly software tool MARV for Multi-phenotype Analysis of Rare Variants. The tool is based on a method that collapses rare variants within a genomic region and models the proportion of minor alleles in the rare variants on a linear combination of multiple phenotypes. MARV provides analyses of all phenotype combinations within one run and calculates the Bayesian Information Criterion to facilitate model selection. The running time increases with the size of the genetic data while the number of phenotypes to analyse has little effect both on running time and required memory. We illustrate the use of MARV with analysis of triglycerides (TG), fasting insulin (FI) and waist-to-hip ratio (WHR) in 4,721 individuals from the Northern Finland Birth Cohort 1966. The analysis suggests novel multi-phenotype effects for these metabolic traits at APOA5 and ZNF259, and at ZNF259 provides stronger support for association (P TG+FI&#x2009;=&#x2009;1.8&#x2009;&#xD7;&#x2009;10&#x2212;9) than observed in single phenotype rare variant analyses (P TG&#x2009;=&#x2009;6.5&#x2009;&#xD7;&#x2009;10&#x2212;8 and P FI&#x2009;=&#x2009;0.27).Conclusions: MARV is a computationally efficient, flexible and user-friendly software tool allowing rapid identification of rare variant effects on multiple phenotypes, thus paving the way for novel discoveries and insights into biology of complex traits.</description><identifier>https://zenodo.org/record/344076</identifier><identifier>10.1186/s12859-017-1530-2</identifier><identifier>oai:zenodo.org:344076</identifier><relation>url:https://zenodo.org/communities/fp7-bmc</relation><rights>info:eu-repo/semantics/openAccess</rights><rights>https://creativecommons.org/licenses/by/4.0/legalcode</rights><source>BMC Bioinformatics 18(1) 110</source><subject>Rare variant analysis</subject><subject>Multi-phenotype analysis</subject><subject>High-dimensional data</subject><title>MARV: a tool for genome-wide multi-phenotype analysis of rare variants</title><type>Journal:Article</type><type>Journal:Article</type><recordID>344076</recordID></dc>
format Journal:Article
Journal
author Kaakinen, Marika
Mägi, Reedik
Fischer, Krista
Heikkinen, Jani
Järvelin, Marjo-Riitta
Morris, Andrew P.
Prokopenko, Inga
title MARV: a tool for genome-wide multi-phenotype analysis of rare variants
publishDate 2017
isbn 1285901715302
topic Rare variant analysis
Multi-phenotype analysis
High-dimensional data
url https://zenodo.org/record/344076
contents Background: Genome-wide association studies have enabled identification of thousands of loci for hundreds of traits. Yet, for most human traits a substantial part of the estimated heritability is unexplained. This and recent advances in technology to produce high-dimensional data cost-effectively have led to method development beyond standard common variant analysis, including single-phenotype rare variant and multi-phenotype common variant analysis, with the latter increasing power for locus discovery and providing suggestions of pleiotropic effects. However, there are currently no optimal methods and tools for the combined analysis of rare variants and multiple phenotypes.Results: We propose a user-friendly software tool MARV for Multi-phenotype Analysis of Rare Variants. The tool is based on a method that collapses rare variants within a genomic region and models the proportion of minor alleles in the rare variants on a linear combination of multiple phenotypes. MARV provides analyses of all phenotype combinations within one run and calculates the Bayesian Information Criterion to facilitate model selection. The running time increases with the size of the genetic data while the number of phenotypes to analyse has little effect both on running time and required memory. We illustrate the use of MARV with analysis of triglycerides (TG), fasting insulin (FI) and waist-to-hip ratio (WHR) in 4,721 individuals from the Northern Finland Birth Cohort 1966. The analysis suggests novel multi-phenotype effects for these metabolic traits at APOA5 and ZNF259, and at ZNF259 provides stronger support for association (P TG+FI = 1.8 × 10−9) than observed in single phenotype rare variant analyses (P TG = 6.5 × 10−8 and P FI = 0.27).Conclusions: MARV is a computationally efficient, flexible and user-friendly software tool allowing rapid identification of rare variant effects on multiple phenotypes, thus paving the way for novel discoveries and insights into biology of complex traits.
id IOS16997.344076
institution DEFAULT
institution_type library:public
library
library DEFAULT
collection DEFAULT
city DEFAULT
province DEFAULT
repoId IOS16997
first_indexed 2022-06-06T05:19:21Z
last_indexed 2022-06-06T05:19:21Z
recordtype dc
merged_child_boolean 1
_version_ 1739482615163912192
score 17.610363