FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6659, 290 aa
  1>>>pF1KE6659 290 - 290 aa - 290 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4595+/-0.000865; mu= 14.4416+/- 0.052
 mean_var=60.9864+/-12.144, 0's: 0 Z-trim(104.8): 20 B-trim: 10 in 2/51
 Lambda= 0.164232
 statistics sampled from 8070 (8073) to 8070 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.627), E-opt: 0.2 (0.248), width: 16
 Scan time: 1.800

The best scores are:                            opt bits E(32554)
CCDS6007.1 NAT1 gene_id:9|Hs108|chr8    ( 290) 1951 470.7 5.5e-133
CCDS55205.1 NAT1 gene_id:9|Hs108|chr8   ( 352) 1951 470.7 6.5e-133
CCDS6008.1 NAT2 gene_id:10|Hs108|chr8   ( 290) 1597 386.8 9.7e-108

>>CCDS6007.1 NAT1 gene_id:9|Hs108|chr8 (290 aa)
 initn: 1951 init1: 1951 opt: 1951 Z-score: 2501.0 bits: 470.7 E(32554): 5.5e-133
Smith-Waterman score: 1951; 100.0% identity (100.0% similar) in 290 aa overlap (1-290:1-290)

               10        20        30        40        50        60
pF1KE6 MDIEAYLERIGYKKSRNKLDLETLTDILQHQIRAVPFENLNIHCGDAMDLGLEAIFDQVV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 MDIEAYLERIGYKKSRNKLDLETLTDILQHQIRAVPFENLNIHCGDAMDLGLEAIFDQVV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 RRNRGGWCLQVNHLLYWALTTIGFETTMLGGYVYSTPAKKYSTGMIHLLLQVTIDGRNYI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 RRNRGGWCLQVNHLLYWALTTIGFETTMLGGYVYSTPAKKYSTGMIHLLLQVTIDGRNYI
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 VDAGFGRSYQMWQPLELISGKDQPQVPCVFRLTEENGFWYLDQIRREQYIPNEEFLHSDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 VDAGFGRSYQMWQPLELISGKDQPQVPCVFRLTEENGFWYLDQIRREQYIPNEEFLHSDL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 LEDSKYRKIYSFTLKPRTIEDFESMNTYLQTSPSSVFTSKSFCSLQTPDGVHCLVGFTLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 LEDSKYRKIYSFTLKPRTIEDFESMNTYLQTSPSSVFTSKSFCSLQTPDGVHCLVGFTLT
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE6 HRRFNYKDNTDLIEFKTLSEEEIEKVLKNIFNISLQRKLVPKHGDRFFTI
       ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 HRRFNYKDNTDLIEFKTLSEEEIEKVLKNIFNISLQRKLVPKHGDRFFTI
              250       260       270       280       290

>>CCDS55205.1 NAT1 gene_id:9|Hs108|chr8 (352 aa)
 initn: 1951 init1: 1951 opt: 1951 Z-score: 2499.7 bits: 470.7 E(32554): 6.5e-133
Smith-Waterman score: 1951; 100.0% identity (100.0% similar) in 290 aa overlap (1-290:63-352)

                                             10        20        30
pF1KE6                               MDIEAYLERIGYKKSRNKLDLETLTDILQH
                                     ::::::::::::::::::::::::::::::
CCDS55 SGIQARKKQQSVFWIKTEDQPTFNLLRKGIMDIEAYLERIGYKKSRNKLDLETLTDILQH
             40        50        60        70        80        90

               40        50        60        70        80        90
pF1KE6 QIRAVPFENLNIHCGDAMDLGLEAIFDQVVRRNRGGWCLQVNHLLYWALTTIGFETTMLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 QIRAVPFENLNIHCGDAMDLGLEAIFDQVVRRNRGGWCLQVNHLLYWALTTIGFETTMLG
            100       110       120       130       140       150

              100       110       120       130       140       150
pF1KE6 GYVYSTPAKKYSTGMIHLLLQVTIDGRNYIVDAGFGRSYQMWQPLELISGKDQPQVPCVF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 GYVYSTPAKKYSTGMIHLLLQVTIDGRNYIVDAGFGRSYQMWQPLELISGKDQPQVPCVF
            160       170       180       190       200       210

              160       170       180       190       200       210
pF1KE6 RLTEENGFWYLDQIRREQYIPNEEFLHSDLLEDSKYRKIYSFTLKPRTIEDFESMNTYLQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 RLTEENGFWYLDQIRREQYIPNEEFLHSDLLEDSKYRKIYSFTLKPRTIEDFESMNTYLQ
            220       230       240       250       260       270

              220       230       240       250       260       270
pF1KE6 TSPSSVFTSKSFCSLQTPDGVHCLVGFTLTHRRFNYKDNTDLIEFKTLSEEEIEKVLKNI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 TSPSSVFTSKSFCSLQTPDGVHCLVGFTLTHRRFNYKDNTDLIEFKTLSEEEIEKVLKNI
            280       290       300       310       320       330

              280       290
pF1KE6 FNISLQRKLVPKHGDRFFTI
       ::::::::::::::::::::
CCDS55 FNISLQRKLVPKHGDRFFTI
            340       350

>>CCDS6008.1 NAT2 gene_id:10|Hs108|chr8 (290 aa)
 initn: 1597 init1: 1597 opt: 1597 Z-score: 2047.7 bits: 386.8 E(32554): 9.7e-108
Smith-Waterman score: 1597; 80.7% identity (92.8% similar) in 290 aa overlap (1-290:1-290)

               10        20        30        40        50        60
pF1KE6 MDIEAYLERIGYKKSRNKLDLETLTDILQHQIRAVPFENLNIHCGDAMDLGLEAIFDQVV
       ::::::.::::::.::::::::::::::.::::::::::::.:::.::.::::::::..:
CCDS60 MDIEAYFERIGYKNSRNKLDLETLTDILEHQIRAVPFENLNMHCGQAMELGLEAIFDHIV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 RRNRGGWCLQVNHLLYWALTTIGFETTMLGGYVYSTPAKKYSTGMIHLLLQVTIDGRNYI
       ::::::::::::.:::::::::::.::::::: :  :..::::::.::::::::::::::
CCDS60 RRNRGGWCLQVNQLLYWALTTIGFQTTMLGGYFYIPPVNKYSTGMVHLLLQVTIDGRNYI
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 VDAGFGRSYQMWQPLELISGKDQPQVPCVFRLTEENGFWYLDQIRREQYIPNEEFLHSDL
       :::: : : :::::::::::::::::::.: :::: :.:::::::::::: :.:::.: :
CCDS60 VDAGSGSSSQMWQPLELISGKDQPQVPCIFCLTEERGIWYLDQIRREQYITNKEFLNSHL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 LEDSKYRKIYSFTLKPRTIEDFESMNTYLQTSPSSVFTSKSFCSLQTPDGVHCLVGFTLT
       :  .:..::: :::.::::::::::::::::::.: : . ::::::::.::.::::: ::
CCDS60 LPKKKHQKIYLFTLEPRTIEDFESMNTYLQTSPTSSFITTSFCSLQTPEGVYCLVGFILT
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE6 HRRFNYKDNTDLIEFKTLSEEEIEKVLKNIFNISLQRKLVPKHGDRFFTI
       .:.:::::::::.:::::.:::.:.::.:::.::: :.:::: ::  .::
CCDS60 YRKFNYKDNTDLVEFKTLTEEEVEEVLRNIFKISLGRNLVPKPGDGSLTI
              250       260       270       280       290

290 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 15:11:45 2016 done: Tue Nov 8 15:11:45 2016
 Total Scan time: 1.800 Total Display time: -0.040

Function used was FASTA [36.3.4 Apr, 2011]