FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6185, 164 aa 1>>>pF1KE6185 164 - 164 aa - 164 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1588+/-0.000738; mu= 12.4820+/- 0.044 mean_var=53.0736+/-10.268, 0's: 0 Z-trim(106.9): 38 B-trim: 0 in 0/50 Lambda= 0.176050 statistics sampled from 9204 (9238) to 9204 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.681), E-opt: 0.2 (0.284), width: 16 Scan time: 1.450 The best scores are: opt bits E(32554) CCDS42827.1 AP1S3 gene_id:130340|Hs108|chr2 ( 154) 933 244.5 2.1e-65 CCDS14173.1 AP1S2 gene_id:8905|Hs108|chrX ( 157) 776 204.6 2.2e-53 CCDS75958.1 AP1S2 gene_id:8905|Hs108|chrX ( 160) 768 202.6 8.9e-53 CCDS47669.1 AP1S1 gene_id:1174|Hs108|chr7 ( 158) 761 200.8 3e-52 CCDS33062.1 AP2S1 gene_id:1175|Hs108|chr19 ( 142) 453 122.5 9.7e-29 CCDS77322.1 AP2S1 gene_id:1175|Hs108|chr19 ( 144) 446 120.8 3.4e-28 CCDS77321.1 AP2S1 gene_id:1175|Hs108|chr19 ( 158) 446 120.8 3.7e-28 CCDS45093.1 AP4S1 gene_id:11154|Hs108|chr14 ( 144) 366 100.5 4.4e-22 CCDS10357.1 AP3S2 gene_id:10239|Hs108|chr15 ( 193) 348 95.9 1.4e-20 CCDS83021.1 AP3S1 gene_id:1176|Hs108|chr5 ( 162) 322 89.3 1.1e-18 CCDS4123.1 AP3S1 gene_id:1176|Hs108|chr5 ( 193) 319 88.6 2.3e-18 CCDS77323.1 AP2S1 gene_id:1175|Hs108|chr19 ( 156) 311 86.5 7.6e-18 CCDS58309.1 AP4S1 gene_id:11154|Hs108|chr14 ( 149) 291 81.4 2.5e-16 CCDS9642.1 AP4S1 gene_id:11154|Hs108|chr14 ( 159) 289 80.9 3.7e-16 CCDS55977.1 AP3S2 gene_id:100526783|Hs108|chr15 ( 394) 290 81.3 7.1e-16 CCDS58310.1 AP4S1 gene_id:11154|Hs108|chr14 ( 135) 268 75.6 1.3e-14 >>CCDS42827.1 AP1S3 gene_id:130340|Hs108|chr2 (154 aa) initn: 933 init1: 933 opt: 933 Z-score: 1287.7 bits: 244.5 E(32554): 2.1e-65 Smith-Waterman score: 933; 100.0% identity (100.0% similar) in 143 aa overlap (1-143:1-143) 10 20 30 40 50 60 pF1KE6 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKELKLVYK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKELKLVYK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 RYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFILDEFIIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 RYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFILDEFIIG 70 80 90 100 110 120 130 140 150 160 pF1KE6 GEIQETSKKIAVKAIEDSDMLQENRLSPRGRDCSEPRSCHCTLA ::::::::::::::::::::::: CCDS42 GEIQETSKKIAVKAIEDSDMLQETMEEYMNKPTF 130 140 150 >>CCDS14173.1 AP1S2 gene_id:8905|Hs108|chrX (157 aa) initn: 776 init1: 776 opt: 776 Z-score: 1072.1 bits: 204.6 E(32554): 2.2e-53 Smith-Waterman score: 776; 74.3% identity (93.2% similar) in 148 aa overlap (2-149:1-148) 10 20 30 40 50 60 pF1KE6 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKELKLVYK ..:.:::::::::::::::. : :::.::::::.:: .:.: . ::..:..::.::: CCDS14 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYK 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 RYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFILDEFIIG ::::::::::::.:::::.::::.::::::::::::.::::::::::::::::::::..: CCDS14 RYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLG 60 70 80 90 100 110 130 140 150 160 pF1KE6 GEIQETSKKIAVKAIEDSDMLQENRLSPRGRDCSEPRSCHCTLA ::.:::::: ..::::..:.:::. .:: CCDS14 GEVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT 120 130 140 150 >>CCDS75958.1 AP1S2 gene_id:8905|Hs108|chrX (160 aa) initn: 783 init1: 751 opt: 768 Z-score: 1060.9 bits: 202.6 E(32554): 8.9e-53 Smith-Waterman score: 768; 70.7% identity (89.8% similar) in 157 aa overlap (2-158:1-152) 10 20 30 40 50 60 pF1KE6 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKELKLVYK ..:.:::::::::::::::. : :::.::::::.:: .:.: . ::..:..::.::: CCDS75 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYK 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 RYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFILDEFIIG ::::::::::::.:::::.::::.::::::::::::.::::::::::::::::::::..: CCDS75 RYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLG 60 70 80 90 100 110 130 140 150 160 pF1KE6 GEIQETSKKIAVKAIEDSDMLQENRLSPRGRDCSEPRSCHCTLA ::.:::::: ..::::..:.:::. ... ::: CCDS75 GEVQETSKKNVLKAIEQADLLQED-----AKEAETPRSVLEEIGLT 120 130 140 150 160 >>CCDS47669.1 AP1S1 gene_id:1174|Hs108|chr7 (158 aa) initn: 761 init1: 761 opt: 761 Z-score: 1051.4 bits: 200.8 E(32554): 3e-52 Smith-Waterman score: 761; 71.1% identity (94.0% similar) in 149 aa overlap (1-149:1-149) 10 20 30 40 50 60 pF1KE6 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKELKLVYK :..:.:::::::::::::::.. ::::::..::..:..:.: . ::..:..::.::: CCDS47 MMRFMLLFSRQGKLRLQKWYLATSDKERKKMVRELMQVVLARKPKMCSFLEWRDLKVVYK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 RYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFILDEFIIG ::::::::::::.:::::.:::..::::::::::::.::::::::::::::::::::..: CCDS47 RYASLYFCCAIEGQDNELITLELIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLMG 70 80 90 100 110 120 130 140 150 160 pF1KE6 GEIQETSKKIAVKAIEDSDMLQENRLSPRGRDCSEPRSCHCTLA :..:.:::: ..::::..:.:::. ::: CCDS47 GDVQDTSKKSVLKAIEQADLLQEEDESPRSVLEEMGLA 130 140 150 >>CCDS33062.1 AP2S1 gene_id:1175|Hs108|chr19 (142 aa) initn: 467 init1: 452 opt: 453 Z-score: 629.4 bits: 122.5 E(32554): 9.7e-29 Smith-Waterman score: 453; 44.4% identity (78.9% similar) in 142 aa overlap (1-142:1-139) 10 20 30 40 50 60 pF1KE6 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKELKLVYK ::.:::. .: :: :: :::. . : :..:. .:. .. : . ..::.....:..:. CCDS33 MIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDAKHTNFVEFRNFKIIYR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 RYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFILDEFIIG :::.:::: .. .::.: :: .: .::.:..:: ::::::..::: :.: ..::.... CCDS33 RYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLVFNFYKVYTVVDEMFLA 70 80 90 100 110 120 130 140 150 160 pF1KE6 GEIQETSKKIAVKAIEDSDMLQENRLSPRGRDCSEPRSCHCTLA :::.:::. .:.... ::: CCDS33 GEIRETSQ---TKVLKQLLMLQSLE 130 140 >>CCDS77322.1 AP2S1 gene_id:1175|Hs108|chr19 (144 aa) initn: 455 init1: 440 opt: 446 Z-score: 619.7 bits: 120.8 E(32554): 3.4e-28 Smith-Waterman score: 446; 44.0% identity (78.7% similar) in 141 aa overlap (2-142:4-141) 10 20 30 40 50 pF1KE6 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKELKLV :.:::. .: :: :: :::. . : :..:. .:. .. : . ..::.....:.. CCDS77 MVWIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDAKHTNFVEFRNFKII 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 YKRYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFILDEFI :.:::.:::: .. .::.: :: .: .::.:..:: ::::::..::: :.: ..::.. CCDS77 YRRYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLVFNFYKVYTVVDEMF 70 80 90 100 110 120 120 130 140 150 160 pF1KE6 IGGEIQETSKKIAVKAIEDSDMLQENRLSPRGRDCSEPRSCHCTLA ..:::.:::. .:.... ::: CCDS77 LAGEIRETSQ---TKVLKQLLMLQSLE 130 140 >>CCDS77321.1 AP2S1 gene_id:1175|Hs108|chr19 (158 aa) initn: 455 init1: 440 opt: 446 Z-score: 619.0 bits: 120.8 E(32554): 3.7e-28 Smith-Waterman score: 446; 44.0% identity (78.7% similar) in 141 aa overlap (2-142:18-155) 10 20 30 40 pF1KE6 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGH :.:::. .: :: :: :::. . : :..:. .:. .. : CCDS77 MKLKGLGKRCKRREDLEIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 RTSSFVDWKELKLVYKRYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDII . ..::.....:..:.:::.:::: .. .::.: :: .: .::.:..:: ::::::.. CCDS77 KHTNFVEFRNFKIIYRRYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLV 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 FNFEKAYFILDEFIIGGEIQETSKKIAVKAIEDSDMLQENRLSPRGRDCSEPRSCHCTLA ::: :.: ..::....:::.:::. .:.... ::: CCDS77 FNFYKVYTVVDEMFLAGEIRETSQ---TKVLKQLLMLQSLE 130 140 150 >>CCDS45093.1 AP4S1 gene_id:11154|Hs108|chr14 (144 aa) initn: 365 init1: 365 opt: 366 Z-score: 509.9 bits: 100.5 E(32554): 4.4e-22 Smith-Waterman score: 366; 36.1% identity (76.4% similar) in 144 aa overlap (1-144:1-144) 10 20 30 40 50 60 pF1KE6 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKELKLVYK ::.:.:. ..::. ::.:.: . ..: . :... :::... ::...:..::.:. CCDS45 MIKFFLMVNKQGQTRLSKYYEHVDINKRTLLETEVIKSCLSRSNEQCSFIEYKDFKLIYR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 RYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFILDEFIIG .::.:.. .... .::. :..: .::.::.::. : ::::.::..:...::::.... CCDS45 QYAALFIVVGVNDTENEMAIYEFIHNFVEVLDEYFSRVSELDIMFNLDKVHIILDEMVLN 70 80 90 100 110 120 130 140 150 160 pF1KE6 GEIQETSKKIAVKAIEDSDMLQENRLSPRGRDCSEPRSCHCTLA : : ::.. . . : ..:. CCDS45 GCIVETNRARILAPLLILDKMSES 130 140 >>CCDS10357.1 AP3S2 gene_id:10239|Hs108|chr15 (193 aa) initn: 350 init1: 257 opt: 348 Z-score: 483.1 bits: 95.9 E(32554): 1.4e-20 Smith-Waterman score: 348; 36.7% identity (72.0% similar) in 150 aa overlap (1-144:1-150) 10 20 30 40 50 pF1KE6 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKEL----- ::. ::.:. .:: :: ..: .:.. ...:.:: ...:.: .:.. : CCDS10 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 -KLVYKRYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFIL ::.:..::.::: ......:: :.... .:: ::: : ::::::.::...:...:: CCDS10 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL 70 80 90 100 110 120 120 130 140 150 160 pF1KE6 DEFIIGGEIQETSKKIAVKAIEDSDMLQENRLSPRGRDCSEPRSCHCTLA .: ..:: . ::. . : :: .. :... CCDS10 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG 130 140 150 160 170 180 >>CCDS83021.1 AP3S1 gene_id:1176|Hs108|chr5 (162 aa) initn: 320 init1: 233 opt: 322 Z-score: 448.7 bits: 89.3 E(32554): 1.1e-18 Smith-Waterman score: 322; 35.2% identity (67.9% similar) in 159 aa overlap (1-149:1-159) 10 20 30 40 50 pF1KE6 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKEL----- ::. ::.:. .:: ::.:.: . ...: :: ... .: . . .:.. : CCDS83 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 -KLVYKRYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFIL ::.:..::.::: ......:: :.... .:: ::: : ::::::.::. .:.. :: CCDS83 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL 70 80 90 100 110 120 120 130 140 150 160 pF1KE6 DEFIIGGEIQETSKKIAVKAIEDSDMLQENRL----SPRGRDCSEPRSCHCTLA :...:: . ::. . : :. .. :.... ::: CCDS83 AEMVMGGMVLETNMNEIVTQIDAQNKLEKSETFIFQSPRQDR 130 140 150 160 164 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 10:10:31 2016 done: Tue Nov 8 10:10:31 2016 Total Scan time: 1.450 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]