WISARD[wɪzərd] Workbench for Integrated Superfast Association study with Related Data |
|
This section describes about
Rare variant association is tested with a set of rare variants simultaneously because of large false negative rate, and thus a set of rare variants should be defined. WISARD supports four types of set file format, and it can be selected by using --set option.
NOTE! |
--set option is mandatory for running gene-level analysis! |
For type-I format, each line consists of two columnes for gene set name (e.g. SET_A) and variant name respectively, and they should be separated with whitespace (space or tab). Gene set name might be a gene name.
NOTE! |
Variants which belong to the same gene should be contiguously placed! |
Type-II file format is equal to t he set definition used in PLINK(see here for plink). Each set must start with a set name which can not have any spaces in it. The name is followed by a list of variants in that gene set, and the keyword END specifies the end of that particular set. You also can refer below example:
NOTE! |
Do not use END as a name of variant! |
Type-III file format is similar to the type-I definition, but all variants for each set should be enumerated in a single line. Type-III file format is equal to the set definition used in EPACTS.
Type-IV file format is different with the other three types of set.
It defines a set of multiple variants by allocating specific region to each set.
Each set can be overlapped among other sets, and a variant which is placed on overlapped region
will be assigned to every sets that occupies that region.
In many analysis toolsets such as Rvtests uses an existing format for representing gene information.
NOTE! |
Generated set from --makeset contains existing variants in given dataset |
WISARD provides several gene-set- or pathway-level analysis, which investigates association between pathway(s) and phenotype(s). In order to perform these type of analyses, an extra input similar to the gene-variant mapping, named 'pathway-gene mapping', is required.
Since the pathway-gene mapping(or gene-set) file associates a pathway(or a gene-set) with multiple genes, its format is similar to the gene-variant mapping. Currently, WISARD supports type-III format, which defines one pathway(or one gene-set) per a line by enumerating their beloging genes.
In the above example, three pathways are defined with their genes. Unlike gene-variant mapping, genes in the pathway not necessarily contiguous or within same chromosome.
Currently, the following analyses utilizes the pathway-gene mapping information.