FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8954, 330 aa
  1>>>pF1KB8954 330 - 330 aa - 330 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.0344+/-0.000377; mu= 2.9305+/- 0.024
 mean_var=345.0164+/-70.232, 0's: 0 Z-trim(124.3): 88 B-trim: 1068 in 1/57
 Lambda= 0.069049
 statistics sampled from 45720 (45850) to 45720 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.829), E-opt: 0.2 (0.538), width: 16
 Scan time: 8.790

The best scores are:                                       opt  bits E(85289)
NP_059106 (OMIM: 142976,614931) homeobox protein H ( 330) 2320 244.1 3.2e-64
NP_006352 (OMIM: 604607) homeobox protein Hox-B13  ( 284)  848  97.3   4e-20
NP_000514 (OMIM: 113200,113300,142989,186000,18630 ( 343)  814  94.1 4.7e-19
NP_000513 (OMIM: 140000,142959,176305) homeobox pr ( 388)  804  93.1   1e-18
XP_011509370 (OMIM: 113200,113300,142989,186000,18 ( 324)  392  52.0   2e-06
NP_776272 (OMIM: 142975) homeobox protein Hox-C12  ( 282)  282  41.0  0.0037
NP_055027 (OMIM: 605559) homeobox protein Hox-C11  ( 304)  277  40.5  0.0055
NP_067016 (OMIM: 142988) homeobox protein Hox-D12  ( 270)  275  40.2  0.0059

>>NP_059106 (OMIM: 142976,614931) homeobox protein Hox-C (330 aa)
 initn: 2320 init1: 2320 opt: 2320 Z-score: 1274.1 bits: 244.1 E(85289): 3.2e-64
Smith-Waterman score: 2320; 100.0% identity (100.0% similar) in 330 aa overlap (1-330:1-330)

               10        20        30        40        50        60
pF1KB8 MTTSLLLHPRWPESLMYVYEDSAAESGIGGGGGGGGGGTGGAGGGCSGASPGKAPSMDGL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_059 MTTSLLLHPRWPESLMYVYEDSAAESGIGGGGGGGGGGTGGAGGGCSGASPGKAPSMDGL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 GSSCPASHCRDLLPHPVLGRPPAPLGAPQGAVYTDIPAPEAARQCAPPPAPPTSSSATLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_059 GSSCPASHCRDLLPHPVLGRPPAPLGAPQGAVYTDIPAPEAARQCAPPPAPPTSSSATLG
               70        80        90       100       110       120
130 140 150 160 170 180 pF1KB8 YGYPFGGSYYGCRLSHNVNLQQKPCAYHPGDKYPEPSGALPGDDLSSRAKEFAFYPSFAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_059 YGYPFGGSYYGCRLSHNVNLQQKPCAYHPGDKYPEPSGALPGDDLSSRAKEFAFYPSFAS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 SYQAMPGYLDVSVVPGISGHPEPRHDALIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_059 SYQAMPGYLDVSVVPGISGHPEPRHDALIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLW 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 KSPFPDVVPLQPEVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_059 KSPFPDVVPLQPEVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLS 250 260 270 280 290 300 310 320 330 pF1KB8 ERQVTIWFQNRRVKEKKVVSKSKAPHLHST :::::::::::::::::::::::::::::: NP_059 ERQVTIWFQNRRVKEKKVVSKSKAPHLHST 310 320 330 >>NP_006352 (OMIM: 604607) homeobox protein Hox-B13 [Hom (284 aa) initn: 787 init1: 351 opt: 848 Z-score: 482.3 bits: 97.3 E(85289): 4e-20 Smith-Waterman score: 848; 48.2% identity (71.6% similar) in 282 aa overlap (51-323:3-279) 30 40 50 60 70 pF1KB8 DSAAESGIGGGGGGGGGGTGGAGGGCSGASPGKAPSMDG---LGSSCPASHCRDLLPH-P ::. ..:: . . :. :.:. : : NP_006 MEPGNYATLDGAKDIEGLLGAGGGRNLVAHSP 10 20 30 80 90 100 110 120 130 pF1KB8 VLGRPPAPLGAPQ-GAVYTDIP-APEAARQCAP-PPAPPTSSSATLGYGYPFGGSYYGCR . ..: :: : . . :.: . : .:: : : .: .: : . ::: :::.::.:: NP_006 LTSHPAAPTLMPAVNYAPLDLPGSAEPPKQCHPCPGVPQGTSPAPVPYGY-FGGGYYSCR 40 50 60 70 80 90 140 150 160 170 180 190 pF1KB8 LSHNVNLQQKPCAYHPG-DKYPEPSGALPGDDLSSRAKEFAFYPSFASSYQAMPGYLDVS .:.. . :::: :: . :.. :: ::::::.. ..:: : .::::: NP_006 VSRS---SLKPCAQAATLAAYPAET-PTAGEEYPSRPTEFAFYPGYPGTYQPMASYLDVS 100 110 120 130 140 200 210 220 230 240 250 pF1KB8 VVPGISGHPEPRHDALIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLWKSPFPDVVPLQP :: ... :::::.:.::..:: :::..::.::. :. ::. . .::. 
: : .: NP_006 VVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNSQMCCQGEQNPPGPFWKAAFADSSGQHP 150 160 170 180 190 200 260 270 280 290 300 310 pF1KB8 -EVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLSERQVTIWFQNR .. ..:::::::.::.: ::.:::.::::.:::::.:::.:::.:.:::::.::::::: NP_006 PDACAFRRGRKKRIPYSKGQLRELEREYAANKFITKDKRRKISAATSLSERQITIWFQNR 210 220 230 240 250 260 320 330 pF1KB8 RVKEKKVVSKSKAPHLHST :::::::..: : NP_006 RVKEKKVLAKVKNSATP 270 280 >>NP_000514 (OMIM: 113200,113300,142989,186000,186300,19 (343 aa) initn: 888 init1: 392 opt: 814 Z-score: 463.1 bits: 94.1 E(85289): 4.7e-19 Smith-Waterman score: 898; 48.3% identity (70.8% similar) in 329 aa overlap (29-323:16-339) 10 20 30 40 50 pF1KB8 MTTSLLLHPRWPESLMYVYEDSAAESGIGGGGGGGGGGTGG-------AGGGCSG--ASP :::.::. ..... :.: : : ..: NP_000 MSRAGSWDMDGLRADGGGAGGAPASSSSSSVAAAAASGQCRGFLSAP 10 20 30 40 60 70 80 90 100 pF1KB8 GKAPSMDGLGSSCPASHCRDLLPHPVLGRPPAP--LGAPQGAVYTDIPA--PEA--ARQC : . .: ... :. .. : . :. ... . . : ::: :..: NP_000 VFAGTHSGRAAAAAAAAAAAAAAASGFAYPGTSERTGSSSSSSSSAVVAARPEAPPAKEC 50 60 70 80 90 100 110 120 130 140 150 pF1KB8 -APPPA-----PPTSSSATLGYGYPFGGSYYGCRLSHNVNLQQK-----PCAY---HPGD :: :: :: :. .::::: ::..::.::.::.:.:::. : : : . NP_000 PAPTPAAAAAAPP--SAPALGYGYHFGNGYYSCRMSHGVGLQQNALKSSPHASLGGFPVE 110 120 130 140 150 160 160 170 180 190 200 pF1KB8 KYPEPSG----ALPGDDLSSRAKEFAFYPSFASSYQAMPGYLDVSVVPGISGHPEPRHDA :: . :: ..:.... .:::: .:: ...: :: .:::.:. . : :: ::::.: NP_000 KYMDVSGLASSSVPANEVPARAKEVSFYQGYTSPYQHVPGYIDMVSTFG-SG--EPRHEA 170 180 190 200 210 220 210 220 230 240 250 260 pF1KB8 LIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLWKSPFP-DVVPLQPEVSSYRRGRKKRVP : .:::: :.:.:::.:::::.:.: :..:.::: :: ::. ::.. 
:::::::::: NP_000 YISMEGYQSWTLANGWNSQVYCTKDQPQGSHFWKSSFPGDVALNQPDMCVYRRGRKKRVP 230 240 250 260 270 280 270 280 290 300 310 320 pF1KB8 YTKVQLKELEKEYAASKFITKEKRRRISATTNLSERQVTIWFQNRRVKEKKVVSKSKAPH :::.::::::.::: .:::.:.:::::::.::::::::::::::::::.::.::: : NP_000 YTKLQLKELENEYAINKFINKDKRRRISAATNLSERQVTIWFQNRRVKDKKIVSKLKDTV 290 300 310 320 330 340 330 pF1KB8 LHST NP_000 S >>NP_000513 (OMIM: 140000,142959,176305) homeobox protei (388 aa) initn: 941 init1: 585 opt: 804 Z-score: 457.1 bits: 93.1 E(85289): 1e-18 Smith-Waterman score: 907; 48.6% identity (67.1% similar) in 350 aa overlap (22-323:45-385) 10 20 30 40 50 pF1KB8 MTTSLLLHPRWPESLMYVYEDSAAESGIGGGGGGGGGGTGGAGGGCSGASP .:: .. :.:::: ...:.:: ... NP_000 TVMFLYDNGGGLVADELNKNMEGAAAAAAAAAAAAAAGAGGGGFPHPAAAAAGGNFSVAA 20 30 40 50 60 70 60 70 80 90 pF1KB8 GKAPSMDGLGSSCPASHCRDLLPHPVLGRP-------PAPLGAPQGAVYTDI-------- . : . .. :..::.:. ::. : :: :: .:. . NP_000 AAAAA-----AAAAANQCRNLMAHPAPLAPGAASAYSSAPGEAPPSAAAAAAAAAAAAAA 80 90 100 110 120 100 110 120 130 pF1KB8 ---------PAP------EAARQCAPPPAPPTSSS--ATLGYGYPFGGSYYGC-RLSHNV :.: :::.::.: : ::: :.: ::: ::..:: : :.. . NP_000 AAAASSSGGPGPAGPAGAEAAKQCSPCSAAAQSSSGPAALPYGY-FGSGYYPCARMGPHP 130 140 150 160 170 180 140 150 160 170 180 pF1KB8 NLQQKPCAYHPG---------DKYPEPSGALPGDDLSSRAKEFAFY-PSFASS----YQA : : :: .:. ::: . .: ....:::::::::: ..:.. .: NP_000 N-AIKSCA-QPASAAAAAAFADKYMDTAGP-AAEEFSSRAKEFAFYHQGYAAGPYHHHQP 190 200 210 220 230 240 190 200 210 220 230 240 pF1KB8 MPGYLDVSVVPGISGHPEPRHDAL-IPVEGYQHWALSNGWDSQVYCSKEQSQSAHLWKSP ::::::. ::::..: : ::. 
: .:.:.:: ::: :::..:.:: :::.: ::::: NP_000 MPGYLDMPVVPGLGGPGESRHEPLGLPMESYQPWALPNGWNGQMYCPKEQAQPPHLWKST 250 260 270 280 290 300 250 260 270 280 290 300 pF1KB8 FPDVVPLQPEVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLSERQ .:::: ..::::::::::::::::::::::.:::..:::::.::::::::::::::: NP_000 LPDVVSHPSDASSYRRGRKKRVPYTKVQLKELEREYATNKFITKDKRRRISATTNLSERQ 310 320 330 340 350 360 310 320 330 pF1KB8 VTIWFQNRRVKEKKVVSKSKAPHLHST :::::::::::::::..: : NP_000 VTIWFQNRRVKEKKVINKLKTTS 370 380 >>XP_011509370 (OMIM: 113200,113300,142989,186000,186300 (324 aa) initn: 454 init1: 392 opt: 392 Z-score: 236.2 bits: 52.0 E(85289): 2e-06 Smith-Waterman score: 422; 36.3% identity (54.3% similar) in 289 aa overlap (41-323:63-320) 20 30 40 50 60 70 pF1KB8 WPESLMYVYEDSAAESGIGGGGGGGGGGTGGAGGGCSGASPGKAPSMDGLGSSCPASHCR ::.: :::. . :.. . :: XP_011 SLALLLRGGLRAIGADNLRSRLGTHACSRAGAAG-CSGTVGPRKPGLRASGS-------- 40 50 60 70 80 80 90 100 110 120 pF1KB8 DLLPHPVLGRPPAPLGAPQGAV-YTDIPAPEAARQCAPPPAPP--TSSSATLGYGYPFGG : . : . .. .: . . . : .. : . ::: . : :: XP_011 --LSSGEVRFPRVQVSLHKGRLRFRALGRPASSSLSLPGLSGCFCLSSSHSRRNPAPHGG 90 100 110 120 130 140 130 140 150 160 170 180 pF1KB8 SYYGCRLSHNVNLQQKPCAYHPGDKYPEPS---GALPGDDLSSRAKEFAFYPSFASSYQA . : .. . : . : :: : : : . . :.. : : .:. . XP_011 RSW-CGSWGILSSWARSTQTHRLPRVPVPSDATGKLAGTSSARRGELRAAGP--GSGAEH 150 160 170 180 190 190 200 210 220 230 240 pF1KB8 MPGYLDVSVVPGISGHPEPRHDALIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLWKSPF :. ..:. : . :: : .::. . .:: :. : . XP_011 CPSA-SLSAPPFVLGHFPHLHPFALPVRTMWIPHRDNG-----LCGME-----------I 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 PDVVPLQPEVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLSERQV ::. ::.. 
:::::::::::::.::::::.::: .:::.:.:::::::.:::::::: XP_011 GDVALNQPDMCVYRRGRKKRVPYTKLQLKELENEYAINKFINKDKRRRISAATNLSERQV 250 260 270 280 290 300 310 320 330 pF1KB8 TIWFQNRRVKEKKVVSKSKAPHLHST ::::::::::.::.::: : XP_011 TIWFQNRRVKDKKIVSKLKDTVS 310 320 >>NP_776272 (OMIM: 142975) homeobox protein Hox-C12 [Hom (282 aa) initn: 448 init1: 241 opt: 282 Z-score: 177.6 bits: 41.0 E(85289): 0.0037 Smith-Waterman score: 282; 30.2% identity (53.7% similar) in 255 aa overlap (81-324:37-278) 60 70 80 90 100 pF1KB8 PGKAPSMDGLGSSCPASHCRDLLPHPVLGRPPAP-LGAPQGAVYTDIPAPEAARQCAPPP : : :. :. .. : .:. : : NP_776 LNPGFVGPLVNIHTGDTFYFPNFRASGAQLPGLPSLSYPRRDNVCSLSWP-SAEPCNGYP 10 20 30 40 50 60 110 120 130 140 150 160 pF1KB8 APPTSSSATLGYGYPFGGSYYGCRLSHNVNLQQKPCAYHPGDKYP-EPSGALPGDDLSSR : .: ..:. ::: . :. . . ..::: : : : :: .. NP_776 QPYLGSPVSLNP--PFGRTCELARVEDGKGYYREPCAEGGGGGLKREERGRDPGAGPGA- 70 80 90 100 110 120 170 180 190 200 210 220 pF1KB8 AKEFAFYPSFASSYQAMPGYLDVSVVPGIS---GHPEPRHD--ALIPVEGYQHWALSNGW :. : :. :. : .. : . : : :: . .:. . .: : NP_776 ----ALLPLEPSGPPALGFKYDYAAGGGGGDGGGGAGPPHDPPSCQSLESDSSSSLLNEG 130 140 150 160 170 230 240 250 260 270 pF1KB8 DSQVYCSKEQSQSAHLWKSPFPDV----VPLQPEVSSYRRGRKKRVPYTKVQLKELEKEY .. . . : . : .: . .: : ..: :.:::: ::.:.:: ::: :. NP_776 NKGAGAGDPGSLVSPL--NPGGGLSASGAPWYP-INS--RSRKKRKPYSKLQLAELEGEF 180 190 200 210 220 230 280 290 300 310 320 330 pF1KB8 AASKFITKEKRRRISATTNLSERQVTIWFQNRRVKEKKVVSKSKAPHLHST ...:::...::..: :::..:: :::::::.:.:... . .: NP_776 LVNEFITRQRRRELSDRLNLSDQQVKIWFQNRRMKKKRLLLREQALSFF 240 250 260 270 280 >>NP_055027 (OMIM: 605559) homeobox protein Hox-C11 [Hom (304 aa) initn: 324 init1: 235 opt: 277 Z-score: 174.6 bits: 40.5 E(85289): 0.0055 Smith-Waterman score: 282; 30.8% identity (53.0% similar) in 247 aa overlap (99-318:52-290) 70 80 90 100 110 120 pF1KB8 CRDLLPHPVLGRPPAPLGAPQGAVYTDIPAPEA-ARQCAPPPAPPTSSSATLGYGY-PFG :.: .:: . : . . 
..:: : : NP_055 FGERGSCASNLYLPSCTYYMPEFSTVSSFLPQAPSRQISYPYSAQVPPVREVSYGLEPSG 30 40 50 60 70 80 130 140 150 160 pF1KB8 -----GSYYGCRLSHNVNLQQKPC--------------AYHPGDKYPEPSGALPGDDLSS .:: .: . . .:... : . . : ..: : :. :: NP_055 KWHHRNSYSSCYAAAD-ELMHRECLPPSTVTEILMKNEGSYGGHHHPSAPHATPAGFYSS 90 100 110 120 130 140 170 180 190 200 210 220 pF1KB8 RAKEFAFYPSFASSYQ-AMPGYLDVSVVPGISGHPEPRHDALIP-VEGYQHWALSNGWDS :. .. .: .. :. : : . : ::. : . . : . : : : .. NP_055 VNKNSVLPQAFDRFFDNAYCGGGDPPAEPPCSGKGEAKGEPEAPPASGLASRA-EAGAEA 150 160 170 180 190 230 240 250 260 270 280 pF1KB8 QVY---CSKEQSQSAH-LWKSPFPDVVPLQPEVSSYRRGRKKRVPYTKVQLKELEKEYAA .. . .: ::: . : : ..: : : :::: ::.: :..:::.:. NP_055 EAEEENTNPSSSGSAHSVAKEPAKGAAPNAP------RTRKKRCPYSKFQIRELEREFFF 200 210 220 230 240 250 290 300 310 320 330 pF1KB8 SKFITKEKRRRISATTNLSERQVTIWFQNRRVKEKKVVSKSKAPHLHST . .:.:::: ..: ::..::: :::::::.::::. NP_055 NVYINKEKRLQLSRMLNLTDRQVKIWFQNRRMKEKKLSRDRLQYFSGNPLL 260 270 280 290 300 >>NP_067016 (OMIM: 142988) homeobox protein Hox-D12 [Hom (270 aa) initn: 321 init1: 242 opt: 275 Z-score: 174.1 bits: 40.2 E(85289): 0.0059 Smith-Waterman score: 293; 29.4% identity (56.9% similar) in 255 aa overlap (84-328:41-270) 60 70 80 90 100 110 pF1KB8 APSMDGLGSSCPASHCRDLLPHPVLGRPPAPLGAPQGAVYTDIPAPEAARQCAPPPAPPT :.. :.:: .: . .::: : :. NP_067 YVGSLLNLQSPDSFYFSNLRPNGGQLAALPPISYPRGA----LPWAATPASCAP--AQPA 20 30 40 50 60 120 130 140 150 160 170 pF1KB8 SSSATLGYGYPFGGSYYGCRLSHNVNLQQKPCAYHPGD--KYPEPSGALPGDDLSSRAKE ...: :.. :. .. : ..:: : . :. : .: : . .:.. NP_067 GATAFGGFSQPYLAG------SGPLGLQPPTAKDGPEEQAKFYAPEAA-AGPEERGRTR- 70 80 90 100 110 180 190 200 210 220 pF1KB8 FAFYPSFASSYQAMPGYLDVSVVP----GISGHPEPRHDALI---P-VEGYQHWALSNGW :::: . :. .... :. :. : .:. : . :.. ..: NP_067 ----PSFAPESSLAPAVAALKAAKYDYAGV-GRATPGSTTLLQGAPCAPGFKD--DTKG- 120 130 140 150 160 230 240 250 260 270 280 pF1KB8 DSQVYCSKEQSQSAHLWKSPFPDVVPLQPEVSSYRRGRKKRVPYTKVQLKELEKEYAASK .. . . . : . .:: .: .. :.:::: :::: :. :::.:. ... 
NP_067 PLNLNMTVQAAGVASCLRPSLPDGLPWG---AAPGRARKKRKPYTKQQIAELENEFLVNE 170 180 190 200 210 220 290 300 310 320 330 pF1KB8 FITKEKRRRISATTNLSERQVTIWFQNRRVKEKKVVSKSKAPHLHST ::...::...: :::..:: :::::::.:.:.:: . .: :. NP_067 FINRQKRKELSNRLNLSDQQVKIWFQNRRMKKKRVVLREQALALY 230 240 250 260 270 330 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:23:21 2016 done: Tue Nov 8 04:23:22 2016 Total Scan time: 8.790 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]