FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6666, 290 aa 1>>>pF1KE6666 290 - 290 aa - 290 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2823+/-0.000863; mu= 15.1211+/- 0.051 mean_var=59.6192+/-12.026, 0's: 0 Z-trim(105.1): 16 B-trim: 148 in 1/51 Lambda= 0.166104 statistics sampled from 8227 (8232) to 8227 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.638), E-opt: 0.2 (0.253), width: 16 Scan time: 2.240 The best scores are: opt bits E(32554) CCDS6008.1 NAT2 gene_id:10|Hs108|chr8 ( 290) 1936 472.3 1.8e-133 CCDS6007.1 NAT1 gene_id:9|Hs108|chr8 ( 290) 1594 390.4 8.5e-109 CCDS55205.1 NAT1 gene_id:9|Hs108|chr8 ( 352) 1594 390.4 1e-108 >>CCDS6008.1 NAT2 gene_id:10|Hs108|chr8 (290 aa) initn: 1936 init1: 1936 opt: 1936 Z-score: 2509.7 bits: 472.3 E(32554): 1.8e-133 Smith-Waterman score: 1936; 99.3% identity (100.0% similar) in 290 aa overlap (1-290:1-290) 10 20 30 40 50 60 pF1KE6 MDIEAYFERIGYKNSRNKLDLETLTDILEHQIRAVPFENLNMHCGQAMELGLEAIFDHIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 MDIEAYFERIGYKNSRNKLDLETLTDILEHQIRAVPFENLNMHCGQAMELGLEAIFDHIV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 RRNRGGWCLQVNQLLYWALTTIGFQTTMLGGYFYIPPVNKYSTGMVHLLLQVTIDGRNYI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 RRNRGGWCLQVNQLLYWALTTIGFQTTMLGGYFYIPPVNKYSTGMVHLLLQVTIDGRNYI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 VDAGSGSSSQMWQPLELISGKDQPQVPCIFCLTEERGIWYLDQIRREQYITNKEFLNSHL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 VDAGSGSSSQMWQPLELISGKDQPQVPCIFCLTEERGIWYLDQIRREQYITNKEFLNSHL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 LPKKKHQKIYLFTLEPQTIEDFESMNTYLQTSPTSSFITTSFCSLQTPEGVYCLVGFILT ::::::::::::::::.::::::::::::::::::::::::::::::::::::::::::: CCDS60 LPKKKHQKIYLFTLEPRTIEDFESMNTYLQTSPTSSFITTSFCSLQTPEGVYCLVGFILT 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 YRKFNYKDNTDLVEFKTLTEEEVEEVLKNIFKISLGRNLVPKPGDGSLTI :::::::::::::::::::::::::::.:::::::::::::::::::::: CCDS60 YRKFNYKDNTDLVEFKTLTEEEVEEVLRNIFKISLGRNLVPKPGDGSLTI 250 260 270 280 290 >>CCDS6007.1 NAT1 gene_id:9|Hs108|chr8 (290 aa) initn: 1594 init1: 1594 opt: 1594 Z-score: 2066.8 bits: 390.4 E(32554): 8.5e-109 Smith-Waterman score: 1594; 80.7% identity (92.8% similar) in 290 aa overlap (1-290:1-290) 10 20 30 40 50 60 pF1KE6 MDIEAYFERIGYKNSRNKLDLETLTDILEHQIRAVPFENLNMHCGQAMELGLEAIFDHIV ::::::.::::::.::::::::::::::.::::::::::::.:::.::.::::::::..: CCDS60 MDIEAYLERIGYKKSRNKLDLETLTDILQHQIRAVPFENLNIHCGDAMDLGLEAIFDQVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 RRNRGGWCLQVNQLLYWALTTIGFQTTMLGGYFYIPPVNKYSTGMVHLLLQVTIDGRNYI ::::::::::::.:::::::::::.::::::: : :..::::::.:::::::::::::: CCDS60 RRNRGGWCLQVNHLLYWALTTIGFETTMLGGYVYSTPAKKYSTGMIHLLLQVTIDGRNYI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 VDAGSGSSSQMWQPLELISGKDQPQVPCIFCLTEERGIWYLDQIRREQYITNKEFLNSHL :::: : : :::::::::::::::::::.: :::: :.:::::::::::: :.:::.: : CCDS60 VDAGFGRSYQMWQPLELISGKDQPQVPCVFRLTEENGFWYLDQIRREQYIPNEEFLHSDL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 LPKKKHQKIYLFTLEPQTIEDFESMNTYLQTSPTSSFITTSFCSLQTPEGVYCLVGFILT : .:..::: :::.:.::::::::::::::::.: : . ::::::::.::.::::: :: CCDS60 LEDSKYRKIYSFTLKPRTIEDFESMNTYLQTSPSSVFTSKSFCSLQTPDGVHCLVGFTLT 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 YRKFNYKDNTDLVEFKTLTEEEVEEVLKNIFKISLGRNLVPKPGDGSLTI .:.:::::::::.:::::.:::.:.::::::.::: :.:::: :: .:: CCDS60 HRRFNYKDNTDLIEFKTLSEEEIEKVLKNIFNISLQRKLVPKHGDRFFTI 250 260 270 280 290 >>CCDS55205.1 NAT1 gene_id:9|Hs108|chr8 (352 aa) initn: 1594 init1: 1594 opt: 1594 Z-score: 2065.4 bits: 390.4 E(32554): 1e-108 Smith-Waterman score: 1594; 80.7% identity (92.8% similar) in 290 aa overlap (1-290:63-352) 10 20 30 pF1KE6 MDIEAYFERIGYKNSRNKLDLETLTDILEH ::::::.::::::.::::::::::::::.: CCDS55 SGIQARKKQQSVFWIKTEDQPTFNLLRKGIMDIEAYLERIGYKKSRNKLDLETLTDILQH 40 50 60 70 80 90 40 50 60 70 80 90 pF1KE6 QIRAVPFENLNMHCGQAMELGLEAIFDHIVRRNRGGWCLQVNQLLYWALTTIGFQTTMLG :::::::::::.:::.::.::::::::..:::::::::::::.:::::::::::.::::: CCDS55 QIRAVPFENLNIHCGDAMDLGLEAIFDQVVRRNRGGWCLQVNHLLYWALTTIGFETTMLG 100 110 120 130 140 150 100 110 120 130 140 150 pF1KE6 GYFYIPPVNKYSTGMVHLLLQVTIDGRNYIVDAGSGSSSQMWQPLELISGKDQPQVPCIF :: : :..::::::.:::::::::::::::::: : : :::::::::::::::::::.: CCDS55 GYVYSTPAKKYSTGMIHLLLQVTIDGRNYIVDAGFGRSYQMWQPLELISGKDQPQVPCVF 160 170 180 190 200 210 160 170 180 190 200 210 pF1KE6 CLTEERGIWYLDQIRREQYITNKEFLNSHLLPKKKHQKIYLFTLEPQTIEDFESMNTYLQ :::: :.:::::::::::: :.:::.: :: .:..::: :::.:.::::::::::::: CCDS55 RLTEENGFWYLDQIRREQYIPNEEFLHSDLLEDSKYRKIYSFTLKPRTIEDFESMNTYLQ 220 230 240 250 260 270 220 230 240 250 260 270 pF1KE6 TSPTSSFITTSFCSLQTPEGVYCLVGFILTYRKFNYKDNTDLVEFKTLTEEEVEEVLKNI :::.: : . ::::::::.::.::::: ::.:.:::::::::.:::::.:::.:.::::: CCDS55 TSPSSVFTSKSFCSLQTPDGVHCLVGFTLTHRRFNYKDNTDLIEFKTLSEEEIEKVLKNI 280 290 300 310 320 330 280 290 pF1KE6 FKISLGRNLVPKPGDGSLTI :.::: :.:::: :: .:: CCDS55 FNISLQRKLVPKHGDRFFTI 340 350 290 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 15:14:57 2016 done: Tue Nov 8 15:14:57 2016 Total Scan time: 2.240 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]