FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6229, 193 aa 1>>>pF1KE6229 193 - 193 aa - 193 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.0587+/-0.000884; mu= 14.1328+/- 0.053 mean_var=54.0930+/-11.027, 0's: 0 Z-trim(103.8): 29 B-trim: 5 in 1/51 Lambda= 0.174383 statistics sampled from 7587 (7609) to 7587 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.625), E-opt: 0.2 (0.234), width: 16 Scan time: 1.700 The best scores are: opt bits E(32554) CCDS4123.1 AP3S1 gene_id:1176|Hs108|chr5 ( 193) 1260 324.9 2e-89 CCDS10357.1 AP3S2 gene_id:10239|Hs108|chr15 ( 193) 1089 281.8 1.8e-76 CCDS83021.1 AP3S1 gene_id:1176|Hs108|chr5 ( 162) 994 257.9 2.3e-69 CCDS55977.1 AP3S2 gene_id:100526783|Hs108|chr15 ( 394) 966 251.0 6.8e-67 CCDS33062.1 AP2S1 gene_id:1175|Hs108|chr19 ( 142) 365 99.6 9.1e-22 CCDS77323.1 AP2S1 gene_id:1175|Hs108|chr19 ( 156) 359 98.1 2.8e-21 CCDS77322.1 AP2S1 gene_id:1175|Hs108|chr19 ( 144) 358 97.9 3.1e-21 CCDS77321.1 AP2S1 gene_id:1175|Hs108|chr19 ( 158) 358 97.9 3.4e-21 CCDS75958.1 AP1S2 gene_id:8905|Hs108|chrX ( 160) 348 95.4 2e-20 CCDS14173.1 AP1S2 gene_id:8905|Hs108|chrX ( 157) 344 94.4 3.9e-20 CCDS47669.1 AP1S1 gene_id:1174|Hs108|chr7 ( 158) 343 94.1 4.6e-20 CCDS45093.1 AP4S1 gene_id:11154|Hs108|chr14 ( 144) 336 92.3 1.4e-19 CCDS42827.1 AP1S3 gene_id:130340|Hs108|chr2 ( 154) 321 88.6 2.1e-18 CCDS58309.1 AP4S1 gene_id:11154|Hs108|chr14 ( 149) 233 66.4 9.4e-12 CCDS9642.1 AP4S1 gene_id:11154|Hs108|chr14 ( 159) 233 66.4 9.9e-12 >>CCDS4123.1 AP3S1 gene_id:1176|Hs108|chr5 (193 aa) initn: 1260 init1: 1260 opt: 1260 Z-score: 1719.1 bits: 324.9 E(32554): 2e-89 Smith-Waterman score: 1260; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KE6 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 AEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSAVKNMNLPEIPRNINIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 AEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSAVKNMNLPEIPRNINIG 130 140 150 160 170 180 190 pF1KE6 DISIKVPNLPSFK ::::::::::::: CCDS41 DISIKVPNLPSFK 190 >>CCDS10357.1 AP3S2 gene_id:10239|Hs108|chr15 (193 aa) initn: 1107 init1: 1089 opt: 1089 Z-score: 1486.6 bits: 281.8 E(32554): 1.8e-76 Smith-Waterman score: 1089; 84.4% identity (94.8% similar) in 192 aa overlap (1-192:1-192) 10 20 30 40 50 60 pF1KE6 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD ::.:::.::::::::: .::: . :. ::::.::::::: :::.:.::::::: :::::: CCDS10 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL :::::::::::::::::::::::::::::::::::::::::::::::::::.:::: :: CCDS10 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 AEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSAVKNMNLPEIPRNINIG :.:::::::::::::::.::.:::.:::::.::..:::::::::::.:::::::::::: CCDS10 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG 130 140 150 160 170 180 190 pF1KE6 DISIKVPNLPSFK :..:::::: .: CCDS10 DLNIKVPNLSQFV 190 >>CCDS83021.1 AP3S1 gene_id:1176|Hs108|chr5 (162 aa) initn: 990 init1: 990 opt: 994 Z-score: 1358.6 bits: 257.9 E(32554): 2.3e-69 Smith-Waterman score: 994; 96.2% identity (98.1% similar) in 158 aa overlap (1-158:1-158) 10 20 30 40 50 60 pF1KE6 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 AEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSAVKNMNLPEIPRNINIG :::::::::::::::::::::::::::::::. . .: CCDS83 AEMVMGGMVLETNMNEIVTQIDAQNKLEKSETFIFQSPRQDR 130 140 150 160 190 pF1KE6 DISIKVPNLPSFK >>CCDS55977.1 AP3S2 gene_id:100526783|Hs108|chr15 (394 aa) initn: 975 init1: 957 opt: 966 Z-score: 1314.4 bits: 251.0 E(32554): 6.8e-67 Smith-Waterman score: 966; 84.3% identity (94.8% similar) in 172 aa overlap (21-192:222-393) 10 20 30 40 50 pF1KE6 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFL . . :. ::::.::::::: :::.:.:::: CCDS55 GASWTDNIMAQKCSKGAAAEIREQGDGAEDEEWPEEIQQQIVRETFHLVLKRDDNICNFL 200 210 220 230 240 250 60 70 80 90 100 110 pF1KE6 EGGLLIGGSDNKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLI ::: :::::: ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 EGGSLIGGSDYKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLI 260 270 280 290 300 310 120 130 140 150 160 170 pF1KE6 FHVDKVHNILAEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSAVKNMNL ::.:::: :: :.:::::::::::::::.::.:::.:::::.::..:::::::::::.:: CCDS55 FHMDKVHYILQEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINL 320 330 340 350 360 370 180 190 pF1KE6 PEIPRNINIGDISIKVPNLPSFK :::::::::::..:::::: .: CCDS55 PEIPRNINIGDLNIKVPNLSQFV 380 390 >>CCDS33062.1 AP2S1 gene_id:1175|Hs108|chr19 (142 aa) initn: 380 init1: 257 opt: 365 Z-score: 504.3 bits: 99.6 E(32554): 9.1e-22 Smith-Waterman score: 365; 40.5% identity (73.6% similar) in 148 aa overlap (1-148:1-142) 10 20 30 40 50 60 pF1KE6 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD ::. ::: : :: ::.:.:. ...: .:..:.:. .:. :: . ::.: . CCDS33 MIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDAKHTNFVEF------RN 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL :.:::.:: ::: .::: ....:. :. :. :::.:.. :.:::::::.:. ::.... CCDS33 FKIIYRRYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLVFNFYKVYTVV 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 AEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSAVKNMNLPEIPRNINIG :: ..: . ::...... :. ..:: CCDS33 DEMFLAGEIRETSQTKVLKQLLMLQSLE 120 130 140 >>CCDS77323.1 AP2S1 gene_id:1175|Hs108|chr19 (156 aa) initn: 380 init1: 257 opt: 359 Z-score: 495.5 bits: 98.1 E(32554): 2.8e-21 Smith-Waterman score: 364; 40.1% identity (72.6% similar) in 157 aa overlap (1-148:1-156) 10 20 30 40 50 60 pF1KE6 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD ::. ::: : :: ::.:.:. ...: .:..:.:. .:. :: . ::.: : :. .: CCDS77 MIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDAKHTNFVEV-LAISVAD 10 20 30 40 50 70 80 90 100 110 pF1KE6 N---------KLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIF . :.:::.:: ::: .::: ....:. :. :. :::.:.. :.:::::::.: CCDS77 SLSVLQFRNFKIIYRRYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLVF 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 HVDKVHNILAEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSAVKNMNLP . ::.... :: ..: . ::...... :. ..:: CCDS77 NFYKVYTVVDEMFLAGEIRETSQTKVLKQLLMLQSLE 120 130 140 150 180 190 pF1KE6 EIPRNINIGDISIKVPNLPSFK >>CCDS77322.1 AP2S1 gene_id:1175|Hs108|chr19 (144 aa) initn: 368 init1: 257 opt: 358 Z-score: 494.7 bits: 97.9 E(32554): 3.1e-21 Smith-Waterman score: 358; 40.1% identity (73.5% similar) in 147 aa overlap (2-148:4-144) 10 20 30 40 50 pF1KE6 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGG :. ::: : :: ::.:.:. ...: .:..:.:. .:. :: . ::.: CCDS77 MVWIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDAKHTNFVEF------ 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 SDNKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHN . :.:::.:: ::: .::: ....:. :. :. :::.:.. :.:::::::.:. ::.. CCDS77 RNFKIIYRRYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLVFNFYKVYT 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 ILAEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSAVKNMNLPEIPRNIN .. :: ..: . ::...... :. ..:: CCDS77 VVDEMFLAGEIRETSQTKVLKQLLMLQSLE 120 130 140 >>CCDS77321.1 AP2S1 gene_id:1175|Hs108|chr19 (158 aa) initn: 368 init1: 257 opt: 358 Z-score: 494.1 bits: 97.9 E(32554): 3.4e-21 Smith-Waterman score: 358; 40.1% identity (73.5% similar) in 147 aa overlap (2-148:18-158) 10 20 30 40 pF1KE6 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDE :. ::: : :: ::.:.:. ...: .:..:.:. .:. :: CCDS77 MKLKGLGKRCKRREDLEIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 NVCNFLEGGLLIGGSDNKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENV . ::.: . :.:::.:: ::: .::: ....:. :. :. :::.:.. :.:: CCDS77 KHTNFVEF------RNFKIIYRRYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNV 70 80 90 100 110 110 120 130 140 150 160 pF1KE6 CELDLIFHVDKVHNILAEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSA :::::.:. ::.... :: ..: . ::...... :. ..:: CCDS77 CELDLVFNFYKVYTVVDEMFLAGEIRETSQTKVLKQLLMLQSLE 120 130 140 150 170 180 190 pF1KE6 VKNMNLPEIPRNINIGDISIKVPNLPSFK >>CCDS75958.1 AP1S2 gene_id:8905|Hs108|chrX (160 aa) initn: 327 init1: 213 opt: 348 Z-score: 480.4 bits: 95.4 E(32554): 2e-20 Smith-Waterman score: 348; 37.9% identity (68.9% similar) in 161 aa overlap (2-162:1-153) 10 20 30 40 50 60 pF1KE6 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD .. .:.:. .:: ::.:.: : :. ...: :: . : : ..:.::: : CCDS75 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEW------RD 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL :..:..::.::: ......:: :..:. .:: ::: : .:::::.::. .:.. :: CCDS75 LKIVYKRYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFIL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 AEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSAVKNMNLPEIPRNINIG :...:: : ::. .... :. : : . .: : .: :.: CCDS75 DEFLLGGEVQETSKKNVLKAIE-QADLLQEDAKEAETP-RSVLEEIGLT 120 130 140 150 160 190 pF1KE6 DISIKVPNLPSFK >>CCDS14173.1 AP1S2 gene_id:8905|Hs108|chrX (157 aa) initn: 340 init1: 213 opt: 344 Z-score: 475.1 bits: 94.4 E(32554): 3.9e-20 Smith-Waterman score: 344; 38.4% identity (69.5% similar) in 151 aa overlap (2-152:1-144) 10 20 30 40 50 60 pF1KE6 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD .. .:.:. .:: ::.:.: : :. ...: :: . : : ..:.::: : CCDS14 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLE------WRD 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL :..:..::.::: ......:: :..:. .:: ::: : .:::::.::. .:.. :: CCDS14 LKIVYKRYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFIL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 AEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSAVKNMNLPEIPRNINIG :...:: : ::. .... :. : : . :: CCDS14 DEFLLGGEVQETSKKNVLKAIE-QADLLQEEAETPRSVLEEIGLT 120 130 140 150 190 pF1KE6 DISIKVPNLPSFK 193 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:17:01 2016 done: Tue Nov 8 11:17:01 2016 Total Scan time: 1.700 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]