FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6107, 157 aa 1>>>pF1KE6107 157 - 157 aa - 157 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9697+/-0.000768; mu= 13.4242+/- 0.046 mean_var=56.4552+/-11.291, 0's: 0 Z-trim(106.5): 34 B-trim: 10 in 1/49 Lambda= 0.170696 statistics sampled from 8964 (8997) to 8964 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.666), E-opt: 0.2 (0.276), width: 16 Scan time: 1.250 The best scores are: opt bits E(32554) CCDS14173.1 AP1S2 gene_id:8905|Hs108|chrX ( 157) 1021 259.2 7.5e-70 CCDS75958.1 AP1S2 gene_id:8905|Hs108|chrX ( 160) 1005 255.3 1.2e-68 CCDS47669.1 AP1S1 gene_id:1174|Hs108|chr7 ( 158) 927 236.1 7e-63 CCDS42827.1 AP1S3 gene_id:130340|Hs108|chr2 ( 154) 766 196.4 5.9e-51 CCDS33062.1 AP2S1 gene_id:1175|Hs108|chr19 ( 142) 460 121.0 2.7e-28 CCDS77322.1 AP2S1 gene_id:1175|Hs108|chr19 ( 144) 460 121.0 2.7e-28 CCDS77321.1 AP2S1 gene_id:1175|Hs108|chr19 ( 158) 460 121.1 2.9e-28 CCDS45093.1 AP4S1 gene_id:11154|Hs108|chr14 ( 144) 389 103.6 4.9e-23 CCDS10357.1 AP3S2 gene_id:10239|Hs108|chr15 ( 193) 349 93.8 5.8e-20 CCDS83021.1 AP3S1 gene_id:1176|Hs108|chr5 ( 162) 346 93.0 8.4e-20 CCDS4123.1 AP3S1 gene_id:1176|Hs108|chr5 ( 193) 344 92.5 1.4e-19 CCDS77323.1 AP2S1 gene_id:1175|Hs108|chr19 ( 156) 335 90.3 5.3e-19 CCDS58309.1 AP4S1 gene_id:11154|Hs108|chr14 ( 149) 301 81.9 1.7e-16 CCDS9642.1 AP4S1 gene_id:11154|Hs108|chr14 ( 159) 301 81.9 1.8e-16 CCDS55977.1 AP3S2 gene_id:100526783|Hs108|chr15 ( 394) 302 82.4 3.3e-16 CCDS58310.1 AP4S1 gene_id:11154|Hs108|chr14 ( 135) 280 76.7 5.6e-15 >>CCDS14173.1 AP1S2 gene_id:8905|Hs108|chrX (157 aa) initn: 1021 init1: 1021 opt: 1021 Z-score: 1367.6 bits: 259.2 E(32554): 7.5e-70 Smith-Waterman score: 1021; 100.0% identity (100.0% similar) in 157 aa overlap (1-157:1-157) 10 20 30 40 50 60 pF1KE6 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYKR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 YASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 YASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLGG 70 80 90 100 110 120 130 140 150 pF1KE6 EVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT ::::::::::::::::::::::::::::::::::::: CCDS14 EVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT 130 140 150 >>CCDS75958.1 AP1S2 gene_id:8905|Hs108|chrX (160 aa) initn: 954 init1: 940 opt: 1005 Z-score: 1346.2 bits: 255.3 E(32554): 1.2e-68 Smith-Waterman score: 1005; 98.1% identity (98.1% similar) in 160 aa overlap (1-157:1-160) 10 20 30 40 50 60 pF1KE6 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYKR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 YASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 YASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLGG 70 80 90 100 110 120 130 140 150 pF1KE6 EVQETSKKNVLKAIEQADLLQE---EAETPRSVLEEIGLT :::::::::::::::::::::: ::::::::::::::: CCDS75 EVQETSKKNVLKAIEQADLLQEDAKEAETPRSVLEEIGLT 130 140 150 160 >>CCDS47669.1 AP1S1 gene_id:1174|Hs108|chr7 (158 aa) initn: 927 init1: 927 opt: 927 Z-score: 1242.4 bits: 236.1 E(32554): 7e-63 Smith-Waterman score: 927; 87.8% identity (97.4% similar) in 156 aa overlap (1-156:2-157) 10 20 30 40 50 pF1KE6 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYK :.:::::::::::::::::. ::::.::..:::.:.::::::::::::::::::.::: CCDS47 MMRFMLLFSRQGKLRLQKWYLATSDKERKKMVRELMQVVLARKPKMCSFLEWRDLKVVYK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 RYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLG :::::::::::: :::::::::.:::::::::::::::::::::::::::::::::::.: CCDS47 RYASLYFCCAIEGQDNELITLELIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLMG 70 80 90 100 110 120 120 130 140 150 pF1KE6 GEVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT :.::.::::.:::::::::::::: :.:::::::.:: CCDS47 GDVQDTSKKSVLKAIEQADLLQEEDESPRSVLEEMGLA 130 140 150 >>CCDS42827.1 AP1S3 gene_id:130340|Hs108|chr2 (154 aa) initn: 785 init1: 753 opt: 766 Z-score: 1028.3 bits: 196.4 E(32554): 5.9e-51 Smith-Waterman score: 766; 75.2% identity (93.1% similar) in 145 aa overlap (1-145:2-146) 10 20 30 40 50 pF1KE6 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYK ..:.:::::::::::::::. : :::.::::::.:: .:.: . ::..:..::.::: CCDS42 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKELKLVYK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 RYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLG ::::::::::::.:::::.::::.::::::::::::.::::::::::::::::::::..: CCDS42 RYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFILDEFIIG 70 80 90 100 110 120 120 130 140 150 pF1KE6 GEVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT ::.:::::: ..::::..:.::: : CCDS42 GEIQETSKKIAVKAIEDSDMLQETMEEYMNKPTF 130 140 150 >>CCDS33062.1 AP2S1 gene_id:1175|Hs108|chr19 (142 aa) initn: 437 init1: 437 opt: 460 Z-score: 621.6 bits: 121.0 E(32554): 2.7e-28 Smith-Waterman score: 460; 47.8% identity (80.6% similar) in 134 aa overlap (1-134:2-135) 10 20 30 40 50 pF1KE6 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYK ..:.:. .: :: :: :::. ..: ::.:. .:. .: .: : .:.:.:..::.:. CCDS33 MIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDAKHTNFVEFRNFKIIYR 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 RYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLG :::.:::: .. .::.: :: :: .::.:..:: .:::::..::: :.: ..::..:. CCDS33 RYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLVFNFYKVYTVVDEMFLA 70 80 90 100 110 120 120 130 140 150 pF1KE6 GEVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT ::..:::. .::: . CCDS33 GEIRETSQTKVLKQLLMLQSLE 130 140 >>CCDS77322.1 AP2S1 gene_id:1175|Hs108|chr19 (144 aa) initn: 437 init1: 437 opt: 460 Z-score: 621.5 bits: 121.0 E(32554): 2.7e-28 Smith-Waterman score: 460; 47.8% identity (80.6% similar) in 134 aa overlap (1-134:4-137) 10 20 30 40 50 pF1KE6 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIV ..:.:. .: :: :: :::. ..: ::.:. .:. .: .: : .:.:.:..::. CCDS77 MVWIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDAKHTNFVEFRNFKII 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 YKRYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFL :.:::.:::: .. .::.: :: :: .::.:..:: .:::::..::: :.: ..::.. CCDS77 YRRYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLVFNFYKVYTVVDEMF 70 80 90 100 110 120 120 130 140 150 pF1KE6 LGGEVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT :.::..:::. .::: . CCDS77 LAGEIRETSQTKVLKQLLMLQSLE 130 140 >>CCDS77321.1 AP2S1 gene_id:1175|Hs108|chr19 (158 aa) initn: 437 init1: 437 opt: 460 Z-score: 620.9 bits: 121.1 E(32554): 2.9e-28 Smith-Waterman score: 460; 47.8% identity (80.6% similar) in 134 aa overlap (1-134:18-151) 10 20 30 40 pF1KE6 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKP ..:.:. .: :: :: :::. ..: ::.:. .:. .: .: CCDS77 MKLKGLGKRCKRREDLEIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 KMCSFLEWRDLKIVYKRYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDII : .:.:.:..::.:.:::.:::: .. .::.: :: :: .::.:..:: .:::::.. CCDS77 KHTNFVEFRNFKIIYRRYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLV 70 80 90 100 110 120 110 120 130 140 150 pF1KE6 FNFEKAYFILDEFLLGGEVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT ::: :.: ..::..:.::..:::. .::: . CCDS77 FNFYKVYTVVDEMFLAGEIRETSQTKVLKQLLMLQSLE 130 140 150 >>CCDS45093.1 AP4S1 gene_id:11154|Hs108|chr14 (144 aa) initn: 380 init1: 360 opt: 389 Z-score: 527.0 bits: 103.6 E(32554): 4.9e-23 Smith-Waterman score: 389; 36.6% identity (78.9% similar) in 142 aa overlap (1-142:2-143) 10 20 30 40 50 pF1KE6 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYK ..:.:. ..::. ::.:.: .. ... . :.... :.:. ..:::.:..:.:..:. CCDS45 MIKFFLMVNKQGQTRLSKYYEHVDINKRTLLETEVIKSCLSRSNEQCSFIEYKDFKLIYR 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 RYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLG .::.:.. ...: .::. :.:: .::.::.::. : ::::.::..:...::::..:. CCDS45 QYAALFIVVGVNDTENEMAIYEFIHNFVEVLDEYFSRVSELDIMFNLDKVHIILDEMVLN 70 80 90 100 110 120 120 130 140 150 pF1KE6 GEVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT : . ::.. .: . : ..: CCDS45 GCIVETNRARILAPLLILDKMSES 130 140 >>CCDS10357.1 AP3S2 gene_id:10239|Hs108|chr15 (193 aa) initn: 331 init1: 227 opt: 349 Z-score: 471.8 bits: 93.8 E(32554): 5.8e-20 Smith-Waterman score: 349; 37.2% identity (72.3% similar) in 148 aa overlap (1-142:2-149) 10 20 30 40 50 pF1KE6 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEW------RD .: .:.:. .:: :: ..: . .. ...:.:: . :: : ..:.::: : CCDS10 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 LKIVYKRYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFIL :..:..::.::: ......:: :..:. .:: ::: : .:::::.::...:...:: CCDS10 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL 70 80 90 100 110 120 120 130 140 150 pF1KE6 DEFLLGGEVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT .: ..:: : ::. .... :: . :.. CCDS10 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG 130 140 150 160 170 180 >>CCDS83021.1 AP3S1 gene_id:1176|Hs108|chr5 (162 aa) initn: 340 init1: 213 opt: 346 Z-score: 469.0 bits: 93.0 E(32554): 8.4e-20 Smith-Waterman score: 346; 38.2% identity (70.4% similar) in 152 aa overlap (1-146:2-152) 10 20 30 40 50 pF1KE6 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEW------RD .. .:.:. .:: ::.:.: : :. ...: :: . : : ..:.::: : CCDS83 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 LKIVYKRYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFIL :..:..::.::: ......:: :..:. .:: ::: : .:::::.::. .:.. :: CCDS83 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL 70 80 90 100 110 120 120 130 140 150 pF1KE6 DEFLLGGEVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT :...:: : ::. .... :. . : :..:: CCDS83 AEMVMGGMVLETNMNEIVTQIDAQNKL-EKSETFIFQSPRQDR 130 140 150 160 157 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 09:30:29 2016 done: Tue Nov 8 09:30:29 2016 Total Scan time: 1.250 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]