FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8893, 217 aa 1>>>pF1KB8893 217 - 217 aa - 217 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.8574+/-0.000754; mu= 7.3933+/- 0.046 mean_var=129.2091+/-27.849, 0's: 0 Z-trim(112.9): 170 B-trim: 801 in 2/49 Lambda= 0.112831 statistics sampled from 13379 (13569) to 13379 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.76), E-opt: 0.2 (0.417), width: 16 Scan time: 2.290 The best scores are: opt bits E(32554) CCDS11532.1 HOXB7 gene_id:3217|Hs108|chr17 ( 217) 1466 249.0 1.6e-66 CCDS5408.1 HOXA7 gene_id:3204|Hs108|chr7 ( 230) 810 142.3 2.4e-34 CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17 ( 224) 472 87.3 8.6e-18 CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7 ( 233) 455 84.5 6e-17 CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12 ( 153) 432 80.6 5.8e-16 CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7 ( 270) 435 81.3 6.5e-16 CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12 ( 235) 432 80.8 8.1e-16 CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17 ( 269) 423 79.3 2.5e-15 CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12 ( 242) 417 78.3 4.5e-15 CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2 ( 289) 418 78.5 4.6e-15 CCDS11533.1 HOXB8 gene_id:3218|Hs108|chr17 ( 243) 410 77.2 1e-14 CCDS2268.1 HOXD8 gene_id:3234|Hs108|chr2 ( 290) 406 76.6 1.8e-14 CCDS56149.1 HOXD8 gene_id:3234|Hs108|chr2 ( 106) 393 74.1 3.6e-14 CCDS8872.1 HOXC5 gene_id:3222|Hs108|chr12 ( 222) 396 74.9 4.5e-14 CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2 ( 255) 391 74.1 8.8e-14 CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12 ( 264) 376 71.7 4.9e-13 CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7 ( 320) 366 70.1 1.8e-12 CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 ( 251) 362 69.4 2.3e-12 >>CCDS11532.1 HOXB7 gene_id:3217|Hs108|chr17 (217 aa) initn: 1466 init1: 1466 opt: 1466 Z-score: 1307.6 bits: 249.0 E(32554): 1.6e-66 Smith-Waterman score: 1466; 99.5% identity (100.0% similar) in 217 aa overlap (1-217:1-217) 10 20 30 40 50 60 pF1KB8 MSSLYYANALFSKYPASSSVFATGAFPEQTSCAFASNPQRPGYGAGSGASFAASMQGLYP ::::::::.::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MSSLYYANTLFSKYPASSSVFATGAFPEQTSCAFASNPQRPGYGAGSGASFAASMQGLYP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 GGGGMAGQSAAGVYAAGYGLEPSSFNMHCAPFEQNLSGVCPGDSAKAAGAKEQRDSDLAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GGGGMAGQSAAGVYAAGYGLEPSSFNMHCAPFEQNLSGVCPGDSAKAAGAKEQRDSDLAA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 ESNFRIYPWMRSSGTDRKRGRQTYTRYQTLELEKEFHYNRYLTRRRRIEIAHTLCLTERQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 ESNFRIYPWMRSSGTDRKRGRQTYTRYQTLELEKEFHYNRYLTRRRRIEIAHTLCLTERQ 130 140 150 160 170 180 190 200 210 pF1KB8 IKIWFQNRRMKWKKENKTAGPGTTGQDRAEAEEEEEE ::::::::::::::::::::::::::::::::::::: CCDS11 IKIWFQNRRMKWKKENKTAGPGTTGQDRAEAEEEEEE 190 200 210 >>CCDS5408.1 HOXA7 gene_id:3204|Hs108|chr7 (230 aa) initn: 791 init1: 586 opt: 810 Z-score: 730.1 bits: 142.3 E(32554): 2.4e-34 Smith-Waterman score: 817; 58.8% identity (76.4% similar) in 233 aa overlap (1-217:1-224) 10 20 30 40 50 60 pF1KB8 MSSLYYANALFSKYPASSSVFATGAFPEQTSCAFASNPQRPGYGAGSGASFAASMQGLYP ::: ::.::::::: :..:.: .. : :::.:: : :: :::::.:: ::... ::: CCDS54 MSSSYYVNALFSKYTAGASLFQNA---EPTSCSFAPNSQRSGYGAGAGA-FASTVPGLYN 10 20 30 40 50 70 80 90 100 110 pF1KB8 GGGGMAGQSAAGVYAAGYGLEPSSF-NMHCAPFEQNLSGVCPGDSAKAAGAKEQRDS-DL .. . :: .:.:::: ... :. :: ..::. :.: .: ::.: : .. . CCDS54 VNSPLY-QSP---FASGYGLGADAYGNLPCASYDQNIPGLC-SDLAKGACDKTDEGALHG 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB8 AAESNFRIYPWMRSSGTDRKRGRQTYTRYQTLELEKEFHYNRYLTRRRRIEIAHTLCLTE :::.:::::::::::: ::::::::::::::::::::::.::::::::::::::.::::: CCDS54 AAEANFRIYPWMRSSGPDRKRGRQTYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTE 120 130 140 150 160 170 180 190 200 210 pF1KB8 RQIKIWFQNRRMKWKKENKTAGP--------------GTTGQDRAEAEEEEEE :::::::::::::::::.: :: .:.. :.:. :...:: CCDS54 RQIKIWFQNRRMKWKKEHKDEGPTAAAAPEGAVPSAAATAAADKADEEDDDEEEEDEEE 180 190 200 210 220 230 >>CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17 (224 aa) initn: 481 init1: 393 opt: 472 Z-score: 432.9 bits: 87.3 E(32554): 8.6e-18 Smith-Waterman score: 505; 45.1% identity (63.4% similar) in 235 aa overlap (1-216:1-222) 10 20 30 40 50 pF1KB8 MSSLYYANALFSKYPASSSVFATGAFPEQTSCAFASNPQR--PG-YGAGSGASFAASMQG ::: :..:. : ::.. : .: .: ..: .: : :. :: : : . . . .. CCDS11 MSS-YFVNSTFPVTLASGQESFLGQLPLYSS-GYA-DPLRHYPAPYGPGPGQDKGFATSS 10 20 30 40 50 60 70 80 90 100 pF1KB8 LYPGGGGMAGQSAA---GVYAAGYGLEPSSFNMHCA----PFE-QNLSGVCPGDSAKAAG :: .:: :..: : : : . :. . : ::. . .. : :.. . CCDS11 YYPPAGGGYGRAAPCDYGPAPAFYREKESACALSGADEQPPFHPEPRKSDCAQDKSVFGE 60 70 80 90 100 110 110 120 130 140 150 160 pF1KB8 AKEQRDSDLAAESNFRIYPWMR--------SSGTDRKRGRQTYTRYQTLELEKEFHYNRY ..::. : .::::. : : . .::::::::::::::::::::::: CCDS11 TEEQKCST-------PVYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYNRY 120 130 140 150 160 170 170 180 190 200 210 pF1KB8 LTRRRRIEIAHTLCLTERQIKIWFQNRRMKWKKENKTAGPGTTGQDRAEAEEEEEE :::::::::::.::::::::::::::::::::::.: ...: :: :::.. CCDS11 LTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKESKLL---SASQLSAEEEEEKQAE 180 190 200 210 220 >>CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7 (233 aa) initn: 458 init1: 392 opt: 455 Z-score: 417.7 bits: 84.5 E(32554): 6e-17 Smith-Waterman score: 456; 43.3% identity (62.3% similar) in 215 aa overlap (12-215:40-233) 10 20 30 pF1KB8 MSSLYYANALFSKYPASS---SVFATGAFPEQTSCAFASNP ..: ::: ..... : .:.. ..: : CCDS54 FPGSLPSGQDSFLGQLPLYQAGYDALRPFPASYGASSLPDKTYTSPCFYQQSNSVLACNR 10 20 30 40 50 60 40 50 60 70 80 90 pF1KB8 QRPGYGAGSGASFAASMQGLYPGGGGMAGQSAAGVYAAGYGLEPSSFNMHCAPFEQNLSG :::. : ...: :.:.: : . : : .: .: ::. . CCDS54 ASYEYGASCFYS-DKDLSGASPSGSGK--QRGPGDY------------LHFSP-EQQYK- 70 80 90 100 110 100 110 120 130 140 150 pF1KB8 VCPGDSAKAAGAKEQRDSDLAAESNFRIYPWMRSS--------GTDRKRGRQTYTRYQTL : ::... : : .: . . .::::. :. .:::::::::::: CCDS54 --P-DSSSGQG-KALHDEGADRKYTSPVYPWMQRMNSCAGAVYGSHGRRGRQTYTRYQTL 120 130 140 150 160 160 170 180 190 200 210 pF1KB8 ELEKEFHYNRYLTRRRRIEIAHTLCLTERQIKIWFQNRRMKWKKENKTAGPGTTGQDRAE :::::::.:::::::::::::..:::::::::::::::::::::::: . . . .: CCDS54 ELEKEFHFNRYLTRRRRIEIANALCLTERQIKIWFQNRRMKWKKENKLINSTQPSGEDSE 170 180 190 200 210 220 pF1KB8 AEEEEEE :. : CCDS54 AKAGE 230 >>CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12 (153 aa) initn: 430 init1: 383 opt: 432 Z-score: 400.0 bits: 80.6 E(32554): 5.8e-16 Smith-Waterman score: 434; 53.2% identity (71.9% similar) in 139 aa overlap (94-217:7-145) 70 80 90 100 110 120 pF1KB8 GMAGQSAAGVYAAGYGLEPSSFNMHCAPFEQNLSGVCPGDSAKAAGAKEQ-RDSDLAAES :: : : ..:: : . .. CCDS41 MLSNCRQNTLGHNTQTSIAQDFSSEQGRTAPQDQKA 10 20 30 130 140 150 160 170 pF1KB8 NFRIYPWMR--------SSGTDRKRGRQTYTRYQTLELEKEFHYNRYLTRRRRIEIAHTL ...:::::. . :.::.:::: :.::::::::::::.:::::::::::::..: CCDS41 SIQIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRRIEIANAL 40 50 60 70 80 90 180 190 200 210 pF1KB8 CLTERQIKIWFQNRRMKWKKENKTA------GPGTTGQDRAEAEEEEEE :::::::::::::::::::::.. . : :.:... . ::..:: CCDS41 CLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETEEEKQKE 100 110 120 130 140 150 >>CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7 (270 aa) initn: 442 init1: 379 opt: 435 Z-score: 399.2 bits: 81.3 E(32554): 6.5e-16 Smith-Waterman score: 438; 39.6% identity (64.5% similar) in 197 aa overlap (16-197:59-255) 10 20 30 40 pF1KB8 MSSLYYANALFSKYPASSSVFATGAFPEQTSCAFASNPQRPGYGA ..:. :..: .. . . .. : .: :. CCDS54 SVSEQFRDSASMHSGRYGYGYNGMDLSVGRSGSGHFGSGERARSYAASASAAPAEPRYSQ 30 40 50 60 70 80 50 60 70 80 90 pF1KB8 GSGASFAASMQGL-------YPGGGGMAGQSAAGVYAAGYGLEPSSFNMHCAPFEQNLSG . .. . . . : ::. . : . . ..: . . .: .. . :: CCDS54 PATSTHSPQPDPLPCSAVAPSPGSDSHHGGKNSLSNSSGASADAGSTHISSREGVGTASG 90 100 110 120 130 140 100 110 120 130 140 150 pF1KB8 VCPGDSAKAAGAKEQRDSDLAAESNFRIYPWMRS--------SGTDRKRGRQTYTRYQTL . :.. :. : . . : .. .::::::. .: . ::.: .::::::: CCDS54 AEEDAPASSEQASAQSEPSPAPPAQPQIYPWMRKLHISHDNIGGPEGKRARTAYTRYQTL 150 160 170 180 190 200 160 170 180 190 200 210 pF1KB8 ELEKEFHYNRYLTRRRRIEIAHTLCLTERQIKIWFQNRRMKWKKENKTAGPGTTGQDRAE :::::::.::::::::::::::.:::.:::::::::::::::::.:: CCDS54 ELEKEFHFNRYLTRRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAF 210 220 230 240 250 260 pF1KB8 AEEEEEE CCDS54 RP 270 >>CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12 (235 aa) initn: 415 init1: 383 opt: 432 Z-score: 397.4 bits: 80.8 E(32554): 8.1e-16 Smith-Waterman score: 449; 41.1% identity (63.2% similar) in 231 aa overlap (8-217:2-227) 10 20 30 40 50 pF1KB8 MSSLYYANALFSKYPASSSVFATG--AFPEQTSCAFASNPQR--PGYGAGSGASFAASMQ :. :.. :. : .: : ..:. . . : .: : :::. . . : CCDS88 MNSYFTN-PSLSCHLAGGQDVLPNVALNSTAYDPVRHFSTYGAAVAQNRIYSTP 10 20 30 40 50 60 70 80 90 100 110 pF1KB8 GLYPGGGGMAGQSAAGVYAAGYGLEPSSFNM--HCAPFEQNLSGVCPGDSAKAAGAKEQ- .: ... .:. : : : . . .: .: .:: : : ..:: CCDS88 -FYSPQENVVFSSSRGPYDYGSNSFYQEKDMLSNC---RQNTLGHNTQTSIAQDFSSEQG 60 70 80 90 100 120 130 140 150 160 pF1KB8 RDSDLAAESNFRIYPWMR--------SSGTDRKRGRQTYTRYQTLELEKEFHYNRYLTRR : . .....:::::. . :.::.:::: :.::::::::::::.::::::: CCDS88 RTAPQDQKASIQIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRR 110 120 130 140 150 160 170 180 190 200 210 pF1KB8 RRIEIAHTLCLTERQIKIWFQNRRMKWKKENKTA------GPGTTGQDRAEAEEEEEE ::::::..::::::::::::::::::::::.. . : :.:... . ::..:: CCDS88 RRIEIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETE 170 180 190 200 210 220 CCDS88 EEKQKE 230 >>CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17 (269 aa) initn: 441 init1: 387 opt: 423 Z-score: 388.7 bits: 79.3 E(32554): 2.5e-15 Smith-Waterman score: 435; 44.1% identity (64.7% similar) in 204 aa overlap (12-197:57-254) 10 20 30 40 pF1KB8 MSSLYYANALFSKYPASSSVFATGAFPEQTSCAFASNPQRP .. :::: : :: : .: :: . :.: CCDS11 GSSLSGSYRDPAAMHTGSYGYNYNGMDLSVNRSSASSSHF--GAVGE-SSRAFPAPAQEP 30 40 50 60 70 80 50 60 70 80 90 pF1KB8 GY-GAGSGASFAA--SM-------QGLYPGGGGMAGQSAAGVYAAGYGLEPSSFNMHCAP . :.:. :... :. .: :.... . :.... .:.. : . . : CCDS11 RFRQAASSCSLSSPESLPCTNGDSHGAKPSASSPSDQATSASSSANF-TEIDEASASSEP 90 100 110 120 130 140 100 110 120 130 140 pF1KB8 FEQNLSGVCPGDSAKAAGAKEQRDSDLAAESNF-RIYPWMRS-------SGTDRKRGRQT : . : : : . . : : :.. .:.::::. .: : ::.: . CCDS11 EEAASQLSSP--SLARAQPEPMATSTAAPEGQTPQIFPWMRKLHISHDMTGPDGKRARTA 150 160 170 180 190 200 150 160 170 180 190 200 pF1KB8 YTRYQTLELEKEFHYNRYLTRRRRIEIAHTLCLTERQIKIWFQNRRMKWKKENKTAGPGT ::::::::::::::.::::::::::::::.:::.:::::::::::::::::.:: CCDS11 YTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSL 210 220 230 240 250 260 210 pF1KB8 TGQDRAEAEEEEEE CCDS11 ATAGSAFQP >>CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12 (242 aa) initn: 452 init1: 382 opt: 417 Z-score: 384.1 bits: 78.3 E(32554): 4.5e-15 Smith-Waterman score: 467; 40.4% identity (60.8% similar) in 240 aa overlap (1-217:1-230) 10 20 30 40 50 pF1KB8 MSSLYYANALFSKYPASSSVFATGAFPEQTSCAFASNPQRPGYGA----GSGASFAASMQ ::: :..: ::::: :. :. : .: : :: : . : :.: : ..: CCDS88 MSS-YFVNPLFSKYKAGESLE-----PAYYDCRF---PQSVGRSHALVYGPGGS-APGFQ 10 20 30 40 50 60 70 80 90 100 pF1KB8 GLYPGGGGMAGQSAAGVYAAGYGLEPSSFNMH--CAPF-------EQNLSGVC------- . ....:. .:: .: :.. : . : .:.: :. CCDS88 HASHHVQDFFHHGTSGISNSGYQQNPCSLSCHGDASKFYGYEALPRQSLYGAQQEASVVQ 60 70 80 90 100 110 110 120 130 140 150 pF1KB8 -PGDSAKAAGAKEQRDSDLAAESNFRI-YPWMRSSGTDRKRGRQTYTRYQTLELEKEFHY : ...: . . .. : .:. . .:::: . :. :::::.::::::::::: . CCDS88 YPDCKSSANTNSSEGQGHLNQNSSPSLMFPWMRPHAPGRRSGRQTYSRYQTLELEKEFLF 120 130 140 150 160 170 160 170 180 190 200 210 pF1KB8 NRYLTRRRRIEIAHTLCLTERQIKIWFQNRRMKWKKEN-KTAGPGTTGQDRAEAEEEEEE : ::::.::::..:.: :::::.::::::::::::::: : ::. ....: : .::: CCDS88 NPYLTRKRRIEVSHALGLTERQVKIWFQNRRMKWKKENNKDKLPGARDEEKVEEEGNEEE 180 190 200 210 220 230 CCDS88 EKEEEEKEENKD 240 >>CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2 (289 aa) initn: 438 init1: 388 opt: 418 Z-score: 383.9 bits: 78.5 E(32554): 4.6e-15 Smith-Waterman score: 446; 40.2% identity (61.1% similar) in 229 aa overlap (22-217:58-282) 10 20 30 40 50 pF1KB8 MSSLYYANALFSKYPASSSVFATGAFPEQTSCAFAS-NPQRPGYGAGSGAS :.: ::. : : .:. : :.: :. CCDS56 NPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG-FPHAPPQAHAHPHPSPPPSGTGCGGR 30 40 50 60 70 80 60 70 80 pF1KB8 FAASMQGLYPGGGGMAG--QSA----------------AGVYAAGYGLEPSSF----NMH . ... ..::::. :. :.: .:. : ::..: :.. CCDS56 EGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPPPPPCGGIACHG---EPAKFYGYDNLQ 90 100 110 120 130 140 90 100 110 120 130 140 pF1KB8 CAPF---EQNLSGVCPGDSAKAAGA-KEQRDSDLAAESNFRIYPWMRSSGTDRKRGRQTY :. .:. : : ...: :. : . : ...:::: .. :.:::::: CCDS56 RQPIFTTQQEAELVQYPDCKSSSGNIGEDPDHLNQSSSPSQMFPWMRPQAPGRRRGRQTY 150 160 170 180 190 200 150 160 170 180 190 200 pF1KB8 TRYQTLELEKEFHYNRYLTRRRRIEIAHTLCLTERQIKIWFQNRRMKWKKEN-KTAGPGT .:.::::::::: .: ::::.::::..:.: :::::.::::::::::::::: : : . CCDS56 SRFQTLELEKEFLFNPYLTRKRRIEVSHALALTERQVKIWFQNRRMKWKKENNKDKFPVS 210 220 230 240 250 260 210 pF1KB8 T-----GQDRAEAEEEEEE :. . ::.: ::. CCDS56 RQEVKDGETKKEAQELEEDRAEGLTN 270 280 217 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 23:20:57 2016 done: Mon Nov 7 23:20:57 2016 Total Scan time: 2.290 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]