WISARD[wɪzərd] Workbench for Integrated Superfast Association study with Related Data |
|
WISARD provides several helpful measures for variants in each chromosome. This section describes about
WISARD can provide a summary for variants in each chromosome, and
it is produced with --variantsummary option.
If --indel option is added, the summary for each chromosome includes information about indel and substitution.
Running WISARD with --variantsummary option provides the following information for variants in each chromosome:
Running WISARD with --variantsummary and --indel options provides the following information for variants in each chromosome:
From this code, summary.variant.res is generated and the meaning of each column is
summary.variant.res is... | A chromosome-wise summary for the various information from variants (TSV) | ||
Column | Format | Modifier | Description | CHR | 1~22/X/Y/Mt/Un | NONE | Target chromosome | MINPOS | integer | NONE | Minimally observed physical position for current chromosome in the dataset | MAXPOS | integer | NONE | Maximally observed physical position for current chromosome in the dataset | NSNV | integer | NONE | Number of Single Nucleotide Variants in current chromosome | NMONO | integer | NONE | Number of monotone variants in current chromosome | NSINGLETON | integer | NONE | Number of singleton variants in current chromosome | NDOUBLETON | integer | NONE | Number of doubleton variants in current chromosome | NSUB | real | --indel | Number of substitutions in current chromosome | NINS | real | --indel | Number of insertions in current chromosome | MININS | real | --indel | A length of shortest insertion in current chromosome | MAXINS | real | --indel | A length of longest insertion in current chromosome | AVGINS | real | --indel | An average length of insertions in current chromosome | NDEL | real | --indel | Number of deletions in current chromosome | MINDEL | real | --indel | A length of shortest deletion in current chromosome | MAXDEL | real | --indel | A length of longest deletion in current chromosome | AVGDEL | real | --indel | An average length of deletions in current chromosome |
---|
WISARD provides options to find out monotone (Have no alternative allele), singleton(single alternative allele in the dataset) and doubleton(two alternative alleles in the dataset). Monotone is not informative for genetic association analysis. Central limit theorem is hardly met for singletone and doubleton, and single variant analysis with them has large false negative/positive finding. Therefore, it is better to remove monotone, singleton and doubleton from the analysis.
Some variants are family-specific: i.e., minor alleles are observed only for individuals in a single family. Runnging WISARD with --famuniq option provides summary about the family-specific variants.
variant.famuniq.res is... | A list of family-specific variants (TSV) | ||
Column | Format | Modifier | Description | CHR | integer | --set | Number of variants in the group | VARIANT | integer | --set | Number of variants in the group | POS | integer | --set | Number of variants in the group | ALT | integer | --set | Number of variants in the group | ANNOT | integer | --set | Number of variants in the group | FID | string | NONE | Family ID having that variant | COUNT | string | NONE | Minor allele count for that varinat |
---|
Some variants are population-specific: i.e., minor alleles are observed only for individuals in a single population. Runnging WISARD with --popuniq option provides summary about the population-specific variants. Note that this option requires an assignment of population group for samples. To understand this, refer this section.
variant.popuniq.res is... | A list of population-specific variants (TSV) | ||
Column | Format | Modifier | Description | CHR | string | NONE | Minor allele count for that varinat | VARIANT | string | NONE | Minor allele count for that varinat | POS | string | NONE | Minor allele count for that varinat | ALT | string | NONE | Minor allele count for that varinat | ANNOT | string | NONE | Minor allele count for that varinat | POP_ID | string | NONE | Population ID having that variant | COUNT | string | NONE | Minor allele count for that varinat |
---|