FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6208, 193 aa 1>>>pF1KE6208 193 - 193 aa - 193 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1761+/-0.000861; mu= 13.3775+/- 0.052 mean_var=57.3573+/-11.448, 0's: 0 Z-trim(104.5): 28 B-trim: 0 in 0/51 Lambda= 0.169348 statistics sampled from 7932 (7959) to 7932 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.63), E-opt: 0.2 (0.244), width: 16 Scan time: 1.520 The best scores are: opt bits E(32554) CCDS10357.1 AP3S2 gene_id:10239|Hs108|chr15 ( 193) 1263 316.7 5.6e-87 CCDS55977.1 AP3S2 gene_id:100526783|Hs108|chr15 ( 394) 1109 279.2 2.2e-75 CCDS4123.1 AP3S1 gene_id:1176|Hs108|chr5 ( 193) 1089 274.2 3.5e-74 CCDS83021.1 AP3S1 gene_id:1176|Hs108|chr5 ( 162) 873 221.4 2.3e-58 CCDS47669.1 AP1S1 gene_id:1174|Hs108|chr7 ( 158) 353 94.3 4e-20 CCDS33062.1 AP2S1 gene_id:1175|Hs108|chr19 ( 142) 352 94.1 4.3e-20 CCDS77323.1 AP2S1 gene_id:1175|Hs108|chr19 ( 156) 350 93.6 6.5e-20 CCDS75958.1 AP1S2 gene_id:8905|Hs108|chrX ( 160) 350 93.6 6.7e-20 CCDS42827.1 AP1S3 gene_id:130340|Hs108|chr2 ( 154) 349 93.4 7.7e-20 CCDS14173.1 AP1S2 gene_id:8905|Hs108|chrX ( 157) 349 93.4 7.8e-20 CCDS77322.1 AP2S1 gene_id:1175|Hs108|chr19 ( 144) 345 92.4 1.4e-19 CCDS77321.1 AP2S1 gene_id:1175|Hs108|chr19 ( 158) 345 92.4 1.5e-19 CCDS45093.1 AP4S1 gene_id:11154|Hs108|chr14 ( 144) 340 91.1 3.3e-19 CCDS58309.1 AP4S1 gene_id:11154|Hs108|chr14 ( 149) 236 65.7 1.5e-11 CCDS9642.1 AP4S1 gene_id:11154|Hs108|chr14 ( 159) 236 65.8 1.6e-11 >>CCDS10357.1 AP3S2 gene_id:10239|Hs108|chr15 (193 aa) initn: 1263 init1: 1263 opt: 1263 Z-score: 1675.0 bits: 316.7 E(32554): 5.6e-87 Smith-Waterman score: 1263; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KE6 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG 130 140 150 160 170 180 190 pF1KE6 DLNIKVPNLSQFV ::::::::::::: CCDS10 DLNIKVPNLSQFV 190 >>CCDS55977.1 AP3S2 gene_id:100526783|Hs108|chr15 (394 aa) initn: 1109 init1: 1109 opt: 1109 Z-score: 1466.8 bits: 279.2 E(32554): 2.2e-75 Smith-Waterman score: 1109; 98.3% identity (100.0% similar) in 173 aa overlap (21-193:222-394) 10 20 30 40 50 pF1KE6 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFL ...::::::::::::::::::::::::::: CCDS55 GASWTDNIMAQKCSKGAAAEIREQGDGAEDEEWPEEIQQQIVRETFHLVLKRDDNICNFL 200 210 220 230 240 250 60 70 80 90 100 110 pF1KE6 EGGSLIGGSDYKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 EGGSLIGGSDYKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLI 260 270 280 290 300 310 120 130 140 150 160 170 pF1KE6 FHMDKVHYILQEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 FHMDKVHYILQEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINL 320 330 340 350 360 370 180 190 pF1KE6 PEIPRNINIGDLNIKVPNLSQFV ::::::::::::::::::::::: CCDS55 PEIPRNINIGDLNIKVPNLSQFV 380 390 >>CCDS4123.1 AP3S1 gene_id:1176|Hs108|chr5 (193 aa) initn: 1107 init1: 1089 opt: 1089 Z-score: 1445.3 bits: 274.2 E(32554): 3.5e-74 Smith-Waterman score: 1089; 84.4% identity (94.8% similar) in 192 aa overlap (1-192:1-192) 10 20 30 40 50 60 pF1KE6 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD ::.:::.::::::::: .::: . :. ::::.::::::: :::.:.::::::: :::::: CCDS41 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL :::::::::::::::::::::::::::::::::::::::::::::::::::.:::: :: CCDS41 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG :.:::::::::::::::.::.:::.:::::.::..:::::::::::.:::::::::::: CCDS41 AEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPARAVSAVKNMNLPEIPRNINIG 130 140 150 160 170 180 190 pF1KE6 DLNIKVPNLSQFV :..:::::: .: CCDS41 DISIKVPNLPSFK 190 >>CCDS83021.1 AP3S1 gene_id:1176|Hs108|chr5 (162 aa) initn: 870 init1: 870 opt: 873 Z-score: 1161.3 bits: 221.4 E(32554): 2.3e-58 Smith-Waterman score: 873; 82.3% identity (91.8% similar) in 158 aa overlap (1-158:1-158) 10 20 30 40 50 60 pF1KE6 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD ::.:::.::::::::: .::: . :. ::::.::::::: :::.:.::::::: :::::: CCDS83 MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDENVCNFLEGGLLIGGSD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL :::::::::::::::::::::::::::::::::::::::::::::::::::.:::: :: CCDS83 NKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHVDKVHNIL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG :.:::::::::::::::.::.:::.::::: . .: CCDS83 AEMVMGGMVLETNMNEIVTQIDAQNKLEKSETFIFQSPRQDR 130 140 150 160 190 pF1KE6 DLNIKVPNLSQFV >>CCDS47669.1 AP1S1 gene_id:1174|Hs108|chr7 (158 aa) initn: 350 init1: 228 opt: 353 Z-score: 474.8 bits: 94.3 E(32554): 4e-20 Smith-Waterman score: 353; 37.1% identity (72.2% similar) in 151 aa overlap (1-151:1-145) 10 20 30 40 50 60 pF1KE6 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD :.. .:.:. .:: :: ..: .. ....::: ...:: : ..:.::: : CCDS47 MMRFMLLFSRQGKLRLQKWYLATSDKERKKMVRELMQVVLARKPKMCSFLEW------RD 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL :..:..::.::: ......:: :.::. .:: ::: : .:::::.::...:...:: CCDS47 LKVVYKRYASLYFCCAIEGQDNELITLELIHRYVELLDKYFGSVCELDIIFNFEKAYFIL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG .: .::: : .:. . .. :: . :.. . CCDS47 DEFLMGGDVQDTSKKSVLKAIEQADLLQEEDESPRSVLEEMGLA 120 130 140 150 190 pF1KE6 DLNIKVPNLSQFV >>CCDS33062.1 AP2S1 gene_id:1175|Hs108|chr19 (142 aa) initn: 365 init1: 258 opt: 352 Z-score: 474.2 bits: 94.1 E(32554): 4.3e-20 Smith-Waterman score: 352; 37.8% identity (73.6% similar) in 148 aa overlap (1-148:1-142) 10 20 30 40 50 60 pF1KE6 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD ::. ::. : :: ::...:..: .. .:....:. .: :: . ::.: . CCDS33 MIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDAKHTNFVEF------RN 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL .:.:::.:: ::: .::: ....:. :. :. :::.:.. :.:::::::.:.. ::. .. CCDS33 FKIIYRRYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLVFNFYKVYTVV 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG .:. ..: . ::...... :. . :: CCDS33 DEMFLAGEIRETSQTKVLKQLLMLQSLE 120 130 140 >>CCDS77323.1 AP2S1 gene_id:1175|Hs108|chr19 (156 aa) initn: 365 init1: 258 opt: 350 Z-score: 470.9 bits: 93.6 E(32554): 6.5e-20 Smith-Waterman score: 350; 36.5% identity (72.4% similar) in 156 aa overlap (1-148:1-156) 10 20 30 40 50 pF1KE6 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLE--------G ::. ::. : :: ::...:..: .. .:....:. .: :: . ::.: . CCDS77 MIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDAKHTNFVEVLAISVADS 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 GSLIGGSDYKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFH :.. ..:.:::.:: ::: .::: ....:. :. :. :::.:.. :.:::::::.:. CCDS77 LSVLQFRNFKIIYRRYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLVFN 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE6 MDKVHYILQEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPE . ::. ...:. ..: . ::...... :. . :: CCDS77 FYKVYTVVDEMFLAGEIRETSQTKVLKQLLMLQSLE 130 140 150 180 190 pF1KE6 IPRNINIGDLNIKVPNLSQFV >>CCDS75958.1 AP1S2 gene_id:8905|Hs108|chrX (160 aa) initn: 331 init1: 227 opt: 350 Z-score: 470.8 bits: 93.6 E(32554): 6.7e-20 Smith-Waterman score: 350; 36.6% identity (69.6% similar) in 161 aa overlap (2-162:1-153) 10 20 30 40 50 60 pF1KE6 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD .: .:.:. .:: :: ..: . .. ...:.:: . :: : ..:.::: : CCDS75 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEW------RD 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL :..:..::.::: ......:: :..:. .:: ::: : .:::::.::...:...:: CCDS75 LKIVYKRYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFIL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG .: ..:: : ::. .... :: . :. : . : :.: CCDS75 DEFLLGGEVQETSKKNVLKAIEQADLLQ--EDAKEAETPRSVLEEIGLT 120 130 140 150 160 190 pF1KE6 DLNIKVPNLSQFV >>CCDS42827.1 AP1S3 gene_id:130340|Hs108|chr2 (154 aa) initn: 351 init1: 258 opt: 349 Z-score: 469.7 bits: 93.4 E(32554): 7.7e-20 Smith-Waterman score: 349; 36.7% identity (72.0% similar) in 150 aa overlap (1-150:1-144) 10 20 30 40 50 60 pF1KE6 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD ::. ::.:. .:: :: ..: .:.. ...:.:: ...:.: .:.. : CCDS42 MIHFILLFSRQGKLRLQKWYITLPDKERKKITREIVQIILSRGHRTSSFVDWKEL----- 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL ::.:..::.::: ......:: :.... .:: ::: : ::::::.::...:...:: CCDS42 -KLVYKRYASLYFCCAIENQDNELLTLEIVHRYVELLDKYFGNVCELDIIFNFEKAYFIL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG .: ..:: . ::. . : :: .. :... CCDS42 DEFIIGGEIQETSKKIAVKAIEDSDMLQETMEEYMNKPTF 120 130 140 150 >>CCDS14173.1 AP1S2 gene_id:8905|Hs108|chrX (157 aa) initn: 331 init1: 227 opt: 349 Z-score: 469.6 bits: 93.4 E(32554): 7.8e-20 Smith-Waterman score: 349; 37.2% identity (72.3% similar) in 148 aa overlap (2-149:1-142) 10 20 30 40 50 60 pF1KE6 MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDDNICNFLEGGSLIGGSD .: .:.:. .:: :: ..: . .. ...:.:: . :: : ..:.::: : CCDS14 MQFMLLFSRQGKLRLQKWYVPLSDKEKKKITRELVQTVLARKPKMCSFLEW------RD 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 YKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFENVCELDLIFHMDKVHYIL :..:..::.::: ......:: :..:. .:: ::: : .:::::.::...:...:: CCDS14 LKIVYKRYASLYFCCAIEDQDNELITLEIIHRYVELLDKYFGSVCELDIIFNFEKAYFIL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 QEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPARAVSAVKNINLPEIPRNINIG .: ..:: : ::. .... :: . :.. CCDS14 DEFLLGGEVQETSKKNVLKAIEQADLLQEEAETPRSVLEEIGLT 120 130 140 150 193 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:05:45 2016 done: Tue Nov 8 11:05:46 2016 Total Scan time: 1.520 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]