2SNP: Phasing Package
To request the package, send an e-mail to alexz@cs.gsu.edu that includes:
    Subject: Request for 2SNP Package
    Body:
        1. Name
        2. Affiliation
        3. Maximum data size (number of genotypes / number of SNPs in your data)
On July 20, 2007, we released a new version of the 2SNP package.
Input parameters:
    input_genotype_file_name output_haplotype_file_name

Sample run:
    ./2snp genotypes.txt haplotypes.txt [-TRIO] [-HALFKNOWN]

Options:
    -TRIO       - parameter for trio data; if omitted, the data are phased as unrelated
    -HALFKNOWN  - parameter for missing data with one known allele (codes 3 and 4)

File formats:

input_genotype_file:
    One line per genotype. SNP values are in {0,1,2,?}, plus {3,4} for missing data with one known allele:
    0 - homozygous SNP with major allele
    1 - homozygous SNP with minor allele
    2 - heterozygous SNP
    ? - missing data
    3 - missing data with one known allele equal to 0
    4 - missing data with one known allele equal to 1

    For trio data, genotypes should be given in triplets:
    FATHER 1
    MOTHER 1
    CHILD 1
    FATHER 2
    MOTHER 2
    CHILD 2
    .......

output_haplotype_file:
    Two haplotypes per genotype, one line per haplotype. SNP values are in {0,1}:
    0 - major allele
    1 - minor allele

    For TRIO data, the output haplotype file contains 4 haplotypes for each trio:
    non-transmitted haplotype of FATHER 1
    non-transmitted haplotype of MOTHER 1
    haplotype of child received from FATHER 1
    haplotype of child received from MOTHER 1
    .........

    Mendelian errors present in TRIO data are reported in mendel.err.

Sample input and output:
    The input file genotypes.txt contains 9 genotypes (3 trios), each with 96 SNPs.
    The output file haplotypes-unrel.txt is the result of phasing genotypes.txt with 2SNP as unrelated; it contains 18 haplotypes, each with 96 SNPs.
    The output file haplotypes-trio.txt is the result of phasing genotypes.txt with 2SNP as trios; it contains 12 haplotypes, each with 96 SNPs.
    The file mendel.err tracks Mendelian errors.
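
The file formats described above can be checked with a short script. What follows is only a sketch in Python and is not part of the 2SNP package. It assumes that each SNP value is a single character and that any whitespace between values can be ignored, since the exact separator is not stated here. The names COMPATIBLE, parse_line, is_consistent, group_trios and expected_haplotypes are hypothetical helpers introduced for illustration.

```python
# Allele pairs compatible with each genotype code, per the format description.
# '?' is fully missing, so any pair is allowed; 3 and 4 fix one of the two alleles.
COMPATIBLE = {
    "0": {(0, 0)},                          # homozygous, major allele
    "1": {(1, 1)},                          # homozygous, minor allele
    "2": {(0, 1), (1, 0)},                  # heterozygous
    "?": {(0, 0), (0, 1), (1, 0), (1, 1)},  # missing data
    "3": {(0, 0), (0, 1), (1, 0)},          # missing, one known allele equal to 0
    "4": {(1, 1), (0, 1), (1, 0)},          # missing, one known allele equal to 1
}


def parse_line(line):
    """Return the SNP symbols of one line; whitespace is ignored (assumption)."""
    return [ch for ch in line if not ch.isspace()]


def is_consistent(genotype, hap_a, hap_b):
    """True if the two haplotype lines could be a phasing of the genotype line."""
    g, a, b = parse_line(genotype), parse_line(hap_a), parse_line(hap_b)
    if not (len(g) == len(a) == len(b)):
        return False
    for code, x, y in zip(g, a, b):
        if code not in COMPATIBLE or x not in "01" or y not in "01":
            return False
        if (int(x), int(y)) not in COMPATIBLE[code]:
            return False
    return True


def group_trios(genotype_lines):
    """Split genotype lines into (father, mother, child) triplets, in file order."""
    if len(genotype_lines) % 3 != 0:
        raise ValueError("trio data must contain a multiple of 3 genotypes")
    return [tuple(genotype_lines[i:i + 3])
            for i in range(0, len(genotype_lines), 3)]


def expected_haplotypes(n_genotypes, trio=False):
    """Number of output lines: 2 per genotype, or 4 per trio when phased as trios."""
    if trio:
        if n_genotypes % 3 != 0:
            raise ValueError("trio data must contain a multiple of 3 genotypes")
        return 4 * (n_genotypes // 3)
    return 2 * n_genotypes
```

For the sample files, expected_haplotypes(9) gives the 18 haplotypes of haplotypes-unrel.txt and expected_haplotypes(9, trio=True) gives the 12 haplotypes of haplotypes-trio.txt. Note that is_consistent only applies to unrelated output, where each genotype has its own pair of haplotype lines.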
Alexander Zelikovsky
    Phone: (404) 651-0676
    Fax: (815) 642-0052
    Email: alexz@cs.gsu.edu
    Office: 1443, Peachtree Str. 34
    web: http://www.cs.gsu.edu/~cscazz/

Dumitru Brinza
    Phone: (858) 822-2496
    Email: dima@cs.ucsd.edu
    Office: 9500 Gilman Dr., San Diego
    web: http://www-cse.ucsd.edu/~dbrinza/

---------------------------------------------------------
This code may be freely used for all non-commercial purposes.

(c) Copyright, 2005 by
Professor Alexander Zelikovsky
Department of Computer Science, Georgia State University
Atlanta, GA 30303
(404) 651-0676
alexz@cs.gsu.edu
http://www.cs.gsu.edu/~cscazz/