FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8954, 330 aa 1>>>pF1KB8954 330 - 330 aa - 330 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.3060+/-0.000924; mu= 7.7514+/- 0.057 mean_var=354.8446+/-73.449, 0's: 0 Z-trim(116.9): 71 B-trim: 0 in 0/53 Lambda= 0.068086 statistics sampled from 17450 (17521) to 17450 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.825), E-opt: 0.2 (0.538), width: 16 Scan time: 2.720 The best scores are: opt bits E(32554) CCDS8865.1 HOXC13 gene_id:3229|Hs108|chr12 ( 330) 2320 240.9 1.1e-63 CCDS11536.1 HOXB13 gene_id:10481|Hs108|chr17 ( 284) 848 96.2 3.4e-20 CCDS2264.2 HOXD13 gene_id:3239|Hs108|chr2 ( 343) 814 92.9 3.9e-19 CCDS5412.1 HOXA13 gene_id:3209|Hs108|chr7 ( 388) 804 92.0 8.2e-19 >>CCDS8865.1 HOXC13 gene_id:3229|Hs108|chr12 (330 aa) initn: 2320 init1: 2320 opt: 2320 Z-score: 1256.7 bits: 240.9 E(32554): 1.1e-63 Smith-Waterman score: 2320; 100.0% identity (100.0% similar) in 330 aa overlap (1-330:1-330) 10 20 30 40 50 60 pF1KB8 MTTSLLLHPRWPESLMYVYEDSAAESGIGGGGGGGGGGTGGAGGGCSGASPGKAPSMDGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS88 MTTSLLLHPRWPESLMYVYEDSAAESGIGGGGGGGGGGTGGAGGGCSGASPGKAPSMDGL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 GSSCPASHCRDLLPHPVLGRPPAPLGAPQGAVYTDIPAPEAARQCAPPPAPPTSSSATLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS88 GSSCPASHCRDLLPHPVLGRPPAPLGAPQGAVYTDIPAPEAARQCAPPPAPPTSSSATLG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 YGYPFGGSYYGCRLSHNVNLQQKPCAYHPGDKYPEPSGALPGDDLSSRAKEFAFYPSFAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS88 YGYPFGGSYYGCRLSHNVNLQQKPCAYHPGDKYPEPSGALPGDDLSSRAKEFAFYPSFAS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 SYQAMPGYLDVSVVPGISGHPEPRHDALIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS88 SYQAMPGYLDVSVVPGISGHPEPRHDALIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLW 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 KSPFPDVVPLQPEVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS88 KSPFPDVVPLQPEVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLS 250 260 270 280 290 300 310 320 330 pF1KB8 ERQVTIWFQNRRVKEKKVVSKSKAPHLHST :::::::::::::::::::::::::::::: CCDS88 ERQVTIWFQNRRVKEKKVVSKSKAPHLHST 310 320 330 >>CCDS11536.1 HOXB13 gene_id:10481|Hs108|chr17 (284 aa) initn: 787 init1: 351 opt: 848 Z-score: 476.0 bits: 96.2 E(32554): 3.4e-20 Smith-Waterman score: 848; 48.2% identity (71.6% similar) in 282 aa overlap (51-323:3-279) 30 40 50 60 70 pF1KB8 DSAAESGIGGGGGGGGGGTGGAGGGCSGASPGKAPSMDG---LGSSCPASHCRDLLPH-P ::. ..:: . . :. :.:. : : CCDS11 MEPGNYATLDGAKDIEGLLGAGGGRNLVAHSP 10 20 30 80 90 100 110 120 130 pF1KB8 VLGRPPAPLGAPQ-GAVYTDIP-APEAARQCAP-PPAPPTSSSATLGYGYPFGGSYYGCR . ..: :: : . . :.: . : .:: : : .: .: : . ::: :::.::.:: CCDS11 LTSHPAAPTLMPAVNYAPLDLPGSAEPPKQCHPCPGVPQGTSPAPVPYGY-FGGGYYSCR 40 50 60 70 80 90 140 150 160 170 180 190 pF1KB8 LSHNVNLQQKPCAYHPG-DKYPEPSGALPGDDLSSRAKEFAFYPSFASSYQAMPGYLDVS .:.. . :::: :: . :.. :: ::::::.. ..:: : .::::: CCDS11 VSRS---SLKPCAQAATLAAYPAET-PTAGEEYPSRPTEFAFYPGYPGTYQPMASYLDVS 100 110 120 130 140 200 210 220 230 240 250 pF1KB8 VVPGISGHPEPRHDALIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLWKSPFPDVVPLQP :: ... :::::.:.::..:: :::..::.::. :. ::. . .::. : : .: CCDS11 VVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNSQMCCQGEQNPPGPFWKAAFADSSGQHP 150 160 170 180 190 200 260 270 280 290 300 310 pF1KB8 -EVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLSERQVTIWFQNR .. ..:::::::.::.: ::.:::.::::.:::::.:::.:::.:.:::::.::::::: CCDS11 PDACAFRRGRKKRIPYSKGQLRELEREYAANKFITKDKRRKISAATSLSERQITIWFQNR 210 220 230 240 250 260 320 330 pF1KB8 RVKEKKVVSKSKAPHLHST :::::::..: : CCDS11 RVKEKKVLAKVKNSATP 270 280 >>CCDS2264.2 HOXD13 gene_id:3239|Hs108|chr2 (343 aa) initn: 888 init1: 392 opt: 814 Z-score: 457.1 bits: 92.9 E(32554): 3.9e-19 Smith-Waterman score: 898; 48.3% identity (70.8% similar) in 329 aa overlap (29-323:16-339) 10 20 30 40 50 pF1KB8 MTTSLLLHPRWPESLMYVYEDSAAESGIGGGGGGGGGGTGG-------AGGGCSG--ASP :::.::. ..... :.: : : ..: CCDS22 MSRAGSWDMDGLRADGGGAGGAPASSSSSSVAAAAASGQCRGFLSAP 10 20 30 40 60 70 80 90 100 pF1KB8 GKAPSMDGLGSSCPASHCRDLLPHPVLGRPPAP--LGAPQGAVYTDIPA--PEA--ARQC : . .: ... :. .. : . :. ... . . : ::: :..: CCDS22 VFAGTHSGRAAAAAAAAAAAAAAASGFAYPGTSERTGSSSSSSSSAVVAARPEAPPAKEC 50 60 70 80 90 100 110 120 130 140 150 pF1KB8 -APPPA-----PPTSSSATLGYGYPFGGSYYGCRLSHNVNLQQK-----PCAY---HPGD :: :: :: :. .::::: ::..::.::.::.:.:::. : : : . CCDS22 PAPTPAAAAAAPP--SAPALGYGYHFGNGYYSCRMSHGVGLQQNALKSSPHASLGGFPVE 110 120 130 140 150 160 160 170 180 190 200 pF1KB8 KYPEPSG----ALPGDDLSSRAKEFAFYPSFASSYQAMPGYLDVSVVPGISGHPEPRHDA :: . :: ..:.... .:::: .:: ...: :: .:::.:. . : :: ::::.: CCDS22 KYMDVSGLASSSVPANEVPARAKEVSFYQGYTSPYQHVPGYIDMVSTFG-SG--EPRHEA 170 180 190 200 210 220 210 220 230 240 250 260 pF1KB8 LIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLWKSPFP-DVVPLQPEVSSYRRGRKKRVP : .:::: :.:.:::.:::::.:.: :..:.::: :: ::. ::.. :::::::::: CCDS22 YISMEGYQSWTLANGWNSQVYCTKDQPQGSHFWKSSFPGDVALNQPDMCVYRRGRKKRVP 230 240 250 260 270 280 270 280 290 300 310 320 pF1KB8 YTKVQLKELEKEYAASKFITKEKRRRISATTNLSERQVTIWFQNRRVKEKKVVSKSKAPH :::.::::::.::: .:::.:.:::::::.::::::::::::::::::.::.::: : CCDS22 YTKLQLKELENEYAINKFINKDKRRRISAATNLSERQVTIWFQNRRVKDKKIVSKLKDTV 290 300 310 320 330 340 330 pF1KB8 LHST CCDS22 S >>CCDS5412.1 HOXA13 gene_id:3209|Hs108|chr7 (388 aa) initn: 941 init1: 585 opt: 804 Z-score: 451.2 bits: 92.0 E(32554): 8.2e-19 Smith-Waterman score: 907; 48.6% identity (67.1% similar) in 350 aa overlap (22-323:45-385) 10 20 30 40 50 pF1KB8 MTTSLLLHPRWPESLMYVYEDSAAESGIGGGGGGGGGGTGGAGGGCSGASP .:: .. :.:::: ...:.:: ... CCDS54 TVMFLYDNGGGLVADELNKNMEGAAAAAAAAAAAAAAGAGGGGFPHPAAAAAGGNFSVAA 20 30 40 50 60 70 60 70 80 90 pF1KB8 GKAPSMDGLGSSCPASHCRDLLPHPVLGRP-------PAPLGAPQGAVYTDI-------- . : . .. :..::.:. ::. : :: :: .:. . CCDS54 AAAAA-----AAAAANQCRNLMAHPAPLAPGAASAYSSAPGEAPPSAAAAAAAAAAAAAA 80 90 100 110 120 100 110 120 130 pF1KB8 ---------PAP------EAARQCAPPPAPPTSSS--ATLGYGYPFGGSYYGC-RLSHNV :.: :::.::.: : ::: :.: ::: ::..:: : :.. . CCDS54 AAAASSSGGPGPAGPAGAEAAKQCSPCSAAAQSSSGPAALPYGY-FGSGYYPCARMGPHP 130 140 150 160 170 180 140 150 160 170 180 pF1KB8 NLQQKPCAYHPG---------DKYPEPSGALPGDDLSSRAKEFAFY-PSFASS----YQA : : :: .:. ::: . .: ....:::::::::: ..:.. .: CCDS54 N-AIKSCA-QPASAAAAAAFADKYMDTAGP-AAEEFSSRAKEFAFYHQGYAAGPYHHHQP 190 200 210 220 230 240 190 200 210 220 230 240 pF1KB8 MPGYLDVSVVPGISGHPEPRHDAL-IPVEGYQHWALSNGWDSQVYCSKEQSQSAHLWKSP ::::::. ::::..: : ::. : .:.:.:: ::: :::..:.:: :::.: ::::: CCDS54 MPGYLDMPVVPGLGGPGESRHEPLGLPMESYQPWALPNGWNGQMYCPKEQAQPPHLWKST 250 260 270 280 290 300 250 260 270 280 290 300 pF1KB8 FPDVVPLQPEVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLSERQ .:::: ..::::::::::::::::::::::.:::..:::::.::::::::::::::: CCDS54 LPDVVSHPSDASSYRRGRKKRVPYTKVQLKELEREYATNKFITKDKRRRISATTNLSERQ 310 320 330 340 350 360 310 320 330 pF1KB8 VTIWFQNRRVKEKKVVSKSKAPHLHST :::::::::::::::..: : CCDS54 VTIWFQNRRVKEKKVINKLKTTS 370 380 330 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:23:20 2016 done: Tue Nov 8 04:23:20 2016 Total Scan time: 2.720 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]