WISARD[wɪzərd]
Workbench for Integrated Superfast Association study with Related Data
HOME  |   DOWNLOAD  |   OPTIONS  |   TROUBLE?  |   LOGIN
 

Epistasis Analysis

Available statistics [top]

Epistasis analysis is computationally very intensive and few statistics are available. The most efficient statistic depends on the ascertainment condition, the property of phenotypes (dichotomous/quantitative), the presence of covariates and the absence/presence of population stratification.

The meaning of gene x gene interaction has not been clearly understood, and statistical gene x gene interaction has often been confused with biological gene x gene interaction. Statistical interaction is defined as a departure from additivity in a linear model using a selected measurement scale, and to infer biological interactions, statistically modeled interactions and main effect terms should not be separately interpreted (Wang et al Nat Genet 2010).

Summary for available statistics:

  • BOOST: useful for case-control design. It is approximately similar to the logistic regreeion and covariate effect cannot be adjusted. Both statistical and biological interaction can be detected.
  • MDR: useful for case-control design. Covariate effect cannot be adjusted. It tests the marginal genetic effect and gene x gene statistical interaction effect, and useful for biological interaction.
  • Generalized MDR (GMDR): extended MDR for quantitative phenotype and it can adjust the covariate effect. It tests the marginal genetic effect and gene x gene statistical interaction effect, and useful for biological interaction.
  • FastEpistasis (PLINK): FastEpistasis method was orignally implemented in PLINK, to test SNP-SNP interaction in a relatively fast manner. However, it should be noted that the actual performance on an exhaustive search of SNP-SNP interaction of this method is actually slower than those of MDR or BOOST.
  • HiSCom-GGI: This method, Hierarchical Structural Component analysis for Gene-Gene interaction, investigates gene-gene interaction with natural hierarchy of variant-gene-phenotype. Unlike other analyses, this method has an advantage by providing variant-wise effect, gene-level effect and interaction effect.


BOolean Operation based Screening and Testing [top]

BOolean Operation based Screening and Testing (BOOST) method proposed by Wan et al(Am J Hum Genet 2010), can evaluate exhaustive two-way combination in genome-wide case/control dataset in feasible time.

BOOST is approximately similar to the result from logistic regression but the distribution for p-value calculation is not clear. Therefore, it is recommended to reanlyze most signification results from BOOST with logistic regression.

Example code

  • Calculating BOOST with WISARD
  • Run BOOST C:\Users\WISARD> wisard --bed test_miss0.bed --boost
  • Reporting more significant results larger than a certain threshold
  • Report significant results over T>10 C:\Users\WISARD> wisard --bed test_miss0.bed --boost --thrboost 10
    BOOST provides some measure which corresponds to the likelihood ratio test for logistic/log-linear model, and the threshold for this measure is used for screening significant statistical interaction. WISARD can set this threshold by using --thrboost option and the default vaule is 30. 30 corresponds to the unadjusted p-value =$4.89\times 10^{-6}$ and if larger value is set, smaller number of interaction analysis will be reported.

Multifactor Dimensionality Reduction [top]

Multifactor Dimensionality Reduction (MDR) was introduced by Ritchie et al. (Am J Hum Genet 2001), to identify gene-gene interaction in genotype data, against binary trait. The original MDR cannot adjust covariate effects, but it was extended to the generalized MDR (GMDR) which allows for covariate adjustment and the analysis of quantitative phenotypes (Lou et al Am J Hum Genet 2007).

WISARD can calculate both of MDR and GMDR.

Running MDR

Performing MDR analysis C:\Users\WISARD> wisard --ped test_miss0.ped --mdr --out mdr_1storder

In default, it investigates first-order combination. Hence, it is required to set an additional parameter to find a real "interaction", an order of the combination.

Performing MDR analysis for second-order exhaustive combination C:\Users\WISARD> wisard --ped test_miss0.ped --mdr --order 2 --out mdr_2ndorder

Moreover, MDR in WISARD reports ALL results of the tested interactions. Instead, an additonal parameter --top controls the number of outputs, by sorting the result in descending order by the comparison measure (In default, it is Balanced Accuracy (BA)).

Performing MDR analysis for second-order exhaustive combination C:\Users\WISARD> wisard --ped test_miss0.ped --mdr --order 2 --top 10 --out mdr_2ndorder_top10
mdr.res is... A result of MDR (TSV)
Detailed information about this format is not yet available

Running GMDR

Performing Generalized MDR analysis C:\Users\WISARD> wisard --ped test_miss0.ped --gmdr

Running GMDR with covariate effect adjustment

Performing Generalized MDR analysis with covariate adjustment of age and height C:\Users\WISARD> wisard --ped test_miss0.ped --gmdr --sampvar test_miss0_phen.txt --cname age,height

Running MDR with permutation

WISARD supports permutation scheme in MDR analysis, in order to assess the significance of given interaction pair.

Running MDR with permutation C:\Users\WISARD> wisard --bed test_miss0.bed --mdr --nperm 1000
NOTE!
This scheme will requires a lot of time to accomplish!

Some other options for MDR and GMDR are available:

  • --cv: for cross-validation
    !!! Experimental function !!!
  • --top N: this option make WISARD report top N gene$\times$gene interaction
  • --measure: for sorting standard
    !!! Experimental function !!!
  • --order: for an order of interaction investigation


!!! Experimental function !!!

Hierarchical MDR [top]

Although MDR can detect genetic interaction effectively, an identification of high-order intearction with exhaustive apporach can be impractical, due to an enormous number of possible combinations for high-order. In order to such limitation, we implemented a new approach called hierarchical MDR, an efficient method for high-order interaction in large-scale genetic data.

!!! Experimental function !!!

Gene-level MDR [top]

MDR is powerful to identify gene-gene interaction, but its computational burden is not achievable for large-scale genetic analysis. This problem is alleviated by using block-based MDR and gene is a reasonable choice to define block. WISARD provides gene-level MDR method originally proposed by Oh et al(BMC Bioinformatics 2012).

Example code

  • Running gene-level MDR
  • Performing gene-level MDR analysis C:\Users\WISARD> wisard --ped test_miss0.ped --set gene_definition.txt --genemdr
    Analysis of gene-level MDR requires the gene set file which defines variants for each gene by using --set option. The detila for gene-set file can be found at gene-level analysis page .
  • Options for GMDR and MDR such as --cv,--top N and --order also can be used for gene-level MDR.

FastEpistasis [top]

FastEpistasis is a SNP-SNP interaction analysis method that was implemented in PLINK.

Running FastEpistasis C:\Users\WISARD> wisard --bed test_miss0 --quickepi

Hierarchical Structural Component analyis for Gene-Gene Interaction [top]



Edit this page
Last modified : 2017-09-11 15:55:00