FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7642, 387 aa
1>>>pF1KB7642 387 - 387 aa - 387 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 12.4489+/-0.000409; mu= -12.9474+/- 0.026
mean_var=484.3548+/-97.923, 0's: 0 Z-trim(125.4): 127 B-trim: 2234 in 1/61
Lambda= 0.058276
statistics sampled from 48950 (49121) to 48950 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.829), E-opt: 0.2 (0.576), width: 16
Scan time: 10.430
The best scores are:                                       opt  bits E(85289)
NP_064447 (OMIM: 605212) barH-like 2 homeobox prot ( 387) 2639 235.5 1.6e-61
NP_064448 (OMIM: 605211) barH-like 1 homeobox prot ( 327) 1003  97.9 3.6e-20
NP_001099044 (OMIM: 613380) homeobox protein HMX3  ( 357)  351  43.2  0.0012
NP_005213 (OMIM: 600030) homeobox protein DLX-6 [H ( 293)  328  41.1   0.004
>>NP_064447 (OMIM: 605212) barH-like 2 homeobox protein (387 aa)
initn: 2639 init1: 2639 opt: 2639 Z-score: 1225.6 bits: 235.5 E(85289): 1.6e-61
Smith-Waterman score: 2639; 100.0% identity (100.0% similar) in 387 aa overlap (1-387:1-387)
               10        20        30        40        50        60
pF1KB7 MTMEGASGSSFGIDTILSSASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 MTMEGASGSSFGIDTILSSASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 APSSPISVTMEPPEPHLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 APSSPISVTMEPPEPHLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 PPPPPQQLGSAASAPRTSTSSFLIKDILGDSKPLAACAPYSTSVSSPHHTPKQESNAVHE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 PPPPPQQLGSAASAPRTSTSSFLIKDILGDSKPLAACAPYSTSVSSPHHTPKQESNAVHE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 SFRPKLEQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVRAKKPRKARTAFS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 SFRPKLEQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVRAKKPRKARTAFS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 DHQLNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELLAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 DHQLNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELLAE
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 AGNYSALQRMFPSPYFYHPSLLGSMDSTTAAAAAAAMYSSMYRTPPAPHPQLQRPLVPRV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 AGNYSALQRMFPSPYFYHPSLLGSMDSTTAAAAAAAMYSSMYRTPPAPHPQLQRPLVPRV
              310       320       330       340       350       360

              370       380
pF1KB7 LIHGLGPGGQPALNPLSSPIPGTPHPR
       :::::::::::::::::::::::::::
NP_064 LIHGLGPGGQPALNPLSSPIPGTPHPR
              370       380
>>NP_064448 (OMIM: 605211) barH-like 1 homeobox protein (327 aa)
initn: 1075 init1: 723 opt: 1003 Z-score: 483.2 bits: 97.9 E(85289): 3.6e-20
Smith-Waterman score: 1075; 53.0% identity (68.4% similar) in 389 aa overlap (3-387:1-327)
10 20 30 40 50 60
pF1KB7 MTMEGASGSSFGIDTILSSASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGT
:::..: ::::.::: .:::.. .:: :: .: :: :: :: .. .
NP_064 MEGSNG--FGIDSILSH-RAGSPALPKGD--PL----LGDCRSPLELSPRSESSSDCS
10 20 30 40
70 80 90 100 110 120
pF1KB7 APSSPISVTMEPPEPHLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQP
.:.:: .: :. : :..: : .. ::
NP_064 SPASPGRDCLETGTPR------------------PGGASGPG-----LDSHLQP------
50 60 70 80
130 140 150 160 170
pF1KB7 PPPPPQQLGSAASAPRTSTSSFLIKDILGDSKPLAACAPYSTS--VSSPHHTPKQESNAV
:: :: . :: ::::::.:::.: ::::::::::.: ..:. . ..:.
NP_064 -----GQL-SAPAQSRTVTSSFLIRDILADCKPLAACAPYSSSGQPAAPEPGGRLAAKAA
90 100 110 120 130
180 190 200 210 220 230
pF1KB7 HESFRPKLEQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVRAKKPRKARTA
:.:: ::.. :... :.:. : .::::::::.:::.::::: :::::::::
NP_064 -EDFRDKLDKSGSNAS------SDSEYK---VKEEGDREISSSRDSPPVRLKKPRKARTA
140 150 160 170 180
240 250 260 270 280 290
pF1KB7 FSDHQLNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELL
:.:::: ::::::::::::::::::.:::.::::::::::::::::::::::::::::::
NP_064 FTDHQLAQLERSFERQKYLSVQDRMELAASLNLTDTQVKTWYQNRRTKWKRQTAVGLELL
190 200 210 220 230 240
300 310 320 330 340 350
pF1KB7 AEAGNYSALQRMFPSPYFYHPSLLGSMDSTTAAAAAAAMYSSMYRTPPAPHPQLQRPLVP
::::::::::::::::::: ::....: .::.: .:: : :: : :::::::
NP_064 AEAGNYSALQRMFPSPYFYPQSLVSNLD------PGAALY--LYRGPSAPPPALQRPLVP
250 260 270 280 290
360 370 380
pF1KB7 RVLIHGLGPGGQPA--LNPLSSPIPGTPHPR
:.::::: ...: : ::.. .: . .::
NP_064 RILIHGLQGASEPPPPLPPLAGVLPRAAQPR
300 310 320
>>NP_001099044 (OMIM: 613380) homeobox protein HMX3 [Hom (357 aa)
initn: 383 init1: 250 opt: 351 Z-score: 186.4 bits: 43.2 E(85289): 0.0012
Smith-Waterman score: 357; 29.7% identity (51.1% similar) in 364 aa overlap (49-360:4-351)
20 30 40 50 60 70
pF1KB7 SASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGTAPSSPISVTMEPPEPHLV
:.: :..::: ..: :: :.
NP_001 MPEPGP----DAAGTASAQPQPPPPPPPAPKES
10 20
80 90 100
pF1KB7 ADATQH-----HHHLHHSQQPPP-----PA-----------AAPTQSLQ-----------
. .. ::. . :::: :: :: .:.
NP_001 PFSIKNLLNGDHHRPPPKPQPPPRTLFAPASAAAAAAAAAAAAAKGALEGAAGFALSQVG
30 40 50 60 70 80
110 120 130 140 150
pF1KB7 ----P---LPQQQQPLPPQQPPPPP----PQQLGSAASA-PRTSTSSFLIKDILGDSKPL
: .: :. :: . : : : :.. :: .: : .: ::.:
NP_001 DLAFPRFEIPAQRFALPAHYLERSPAWWYPYTLTPAGGHLPRPEASE---KALLRDSSPA
90 100 110 120 130 140
160 170 180 190 200 210
pF1KB7 AAC---APYSTSVSSPHHTPKQESNAVHESFRPKLEQEDSKTKLDKREDSQSDIKCHGTK
.. .: ..: : . .:.. : . . ..:.:: . . . . ..
NP_001 SGTDRDSPEPLLKADPDHK-ELDSKSPDEIILEESDSEESKKEGEAAPGAAGASVGAAAA
150 160 170 180 190 200
220 230 240 250 260 270
pF1KB7 EEGDREITSSRESPPVR-AKKPRKARTAFSDHQLNQLERSFERQKYLSVQDRMDLAAALN
: .. .. ::: . : . .:.::.:: :. ::: .:. ..::: ..: :::.:.
NP_001 TPGAEDWKKGAESPEKKPACRKKKTRTVFSRSQVFQLESTFDMKRYLSSSERAGLAASLH
210 220 230 240 250 260
280 290 300 310 320 330
pF1KB7 LTDTQVKTWYQNRRTKWKRQTAVGLELLAEAGNYSALQRMFPSPYFYHPSLLGSMDSTTA
::.:::: :.::::.::::: :. :: : ...: ::. : .:: .:..
NP_001 LTETQVKIWFQNRRNKWKRQLAAELE--AANLSHAAAQRIVRVPILYHE------NSAAE
270 280 290 300 310
340 350 360 370 380
pF1KB7 AAAAAAMYSSMYRTPPA---PHP-QLQRPLVPRVLIHGLGPGGQPALNPLSSPIPGTPHP
.::::: . . . : ::: ..:.: :
NP_001 GAAAAAAGAPVPVSQPLLTFPHPVYYSHPVVSSVPLLRPV
320 330 340 350
pF1KB7 R
>>NP_005213 (OMIM: 600030) homeobox protein DLX-6 [Homo (293 aa)
initn: 469 init1: 207 opt: 328 Z-score: 177.1 bits: 41.1 E(85289): 0.004
Smith-Waterman score: 328; 29.8% identity (55.9% similar) in 272 aa overlap (103-355:32-290)
80 90 100 110 120 130
pF1KB7 PEPHLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQPPPPPPQQLGSAA
:. : :::: :: :::::: : :
NP_005 MTMTTMADGLEGQDSSKSAFMEFGQQQQQQQQQQQQQQQQQQQPP--PPPPPPPQPHSQQ
10 20 30 40 50
140 150 160 170 180
pF1KB7 SAPRTSTSSFLIKDILGDSKPLAACAPYSTSVSSPHHTPKQESNA----VHESF--RPKL
:.: . . . .. . . . :: . . . :: :.. :.:. : .
NP_005 SSPAMAGAHYPLHCLHSAAAAAAAGSHHHHHHQHHHHGSPYASGGGNSYNHRSLAAYPYM
60 70 80 90 100 110
190 200 210 220 230 240
pF1KB7 EQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVR----AKKPRKARTAFSDH
. . . :.. ..:.. . .: .. :.. :. :. .: .:: :: :: .:.
NP_005 SHSQHSPYLQSYHNSSAAAQTRG--DDTDQQKTTVIENGEIRFNGKGKKIRKPRTIYSSL
120 130 140 150 160 170
250 260 270 280 290 300
pF1KB7 QLNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELLAEAG
::. :.. :.. .::.. .: .:::.:.::.:::: :.::.:.:.:. : ..
NP_005 QLQALNHRFQQTQYLALPERAELAASLGLTQTQVKIWFQNKRSKFKKLLKQG----SNPH
180 190 200 210 220 230
310 320 330 340 350
pF1KB7 NYSALQRMFP-SPYFYHPSLLGSMDSTTAAAAAAAM--------YSSMYRTPPAPHPQLQ
. . :: :: :.: : ..:.: ...: :: : .: . .:
NP_005 ESDPLQGSAALSPR--SPALPPVWD-VSASAKGVSMPPNSYMPGYSHWYSSP--HQDTMQ
240 250 260 270 280
360 370 380
pF1KB7 RPLVPRVLIHGLGPGGQPALNPLSSPIPGTPHPR
::
NP_005 RPQMM
290
387 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 03:10:58 2016 done: Sun Nov 6 03:10:59 2016
Total Scan time: 10.430 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]