FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7642, 387 aa
  1>>>pF1KB7642 387 - 387 aa - 387 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 12.4489+/-0.000409; mu= -12.9474+/- 0.026
 mean_var=484.3548+/-97.923, 0's: 0 Z-trim(125.4): 127 B-trim: 2234 in 1/61
 Lambda= 0.058276
 statistics sampled from 48950 (49121) to 48950 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.829), E-opt: 0.2 (0.576), width: 16
 Scan time: 10.430

The best scores are:                                       opt  bits E(85289)
NP_064447 (OMIM: 605212) barH-like 2 homeobox prot ( 387) 2639 235.5  1.6e-61
NP_064448 (OMIM: 605211) barH-like 1 homeobox prot ( 327) 1003  97.9  3.6e-20
NP_001099044 (OMIM: 613380) homeobox protein HMX3  ( 357)  351  43.2   0.0012
NP_005213 (OMIM: 600030) homeobox protein DLX-6 [H ( 293)  328  41.1    0.004

>>NP_064447 (OMIM: 605212) barH-like 2 homeobox protein (387 aa)
 initn: 2639 init1: 2639 opt: 2639 Z-score: 1225.6 bits: 235.5 E(85289): 1.6e-61
Smith-Waterman score: 2639; 100.0% identity (100.0% similar) in 387 aa overlap (1-387:1-387)

               10        20        30        40        50        60
pF1KB7 MTMEGASGSSFGIDTILSSASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 MTMEGASGSSFGIDTILSSASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 APSSPISVTMEPPEPHLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 APSSPISVTMEPPEPHLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 PPPPPQQLGSAASAPRTSTSSFLIKDILGDSKPLAACAPYSTSVSSPHHTPKQESNAVHE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 PPPPPQQLGSAASAPRTSTSSFLIKDILGDSKPLAACAPYSTSVSSPHHTPKQESNAVHE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 SFRPKLEQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVRAKKPRKARTAFS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 SFRPKLEQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVRAKKPRKARTAFS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 DHQLNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELLAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 DHQLNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELLAE
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 AGNYSALQRMFPSPYFYHPSLLGSMDSTTAAAAAAAMYSSMYRTPPAPHPQLQRPLVPRV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_064 AGNYSALQRMFPSPYFYHPSLLGSMDSTTAAAAAAAMYSSMYRTPPAPHPQLQRPLVPRV
              310       320       330       340       350       360

              370       380
pF1KB7 LIHGLGPGGQPALNPLSSPIPGTPHPR
       :::::::::::::::::::::::::::
NP_064 LIHGLGPGGQPALNPLSSPIPGTPHPR
              370       380

>>NP_064448 (OMIM: 605211) barH-like 1 homeobox protein (327 aa)
 initn: 1075 init1: 723 opt: 1003 Z-score: 483.2 bits: 97.9 E(85289): 3.6e-20
Smith-Waterman score: 1075; 53.0% identity (68.4% similar) in 389 aa overlap (3-387:1-327)

       10 20 30 40 50 60
pF1KB7 MTMEGASGSSFGIDTILSSASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGT
       :::..: ::::.::: .:::.. .:: :: .: :: :: :: .. .
NP_064 MEGSNG--FGIDSILSH-RAGSPALPKGD--PL----LGDCRSPLELSPRSESSSDCS
       10 20 30 40

       70 80 90 100 110 120
pF1KB7 APSSPISVTMEPPEPHLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQP
       .:.:: .: :. : :..: : .. ::
NP_064 SPASPGRDCLETGTPR------------------PGGASGPG-----LDSHLQP------
       50 60 70 80

       130 140 150 160 170
pF1KB7 PPPPPQQLGSAASAPRTSTSSFLIKDILGDSKPLAACAPYSTS--VSSPHHTPKQESNAV
       :: :: . :: ::::::.:::.: ::::::::::.: ..:. . ..:.
NP_064 -----GQL-SAPAQSRTVTSSFLIRDILADCKPLAACAPYSSSGQPAAPEPGGRLAAKAA
       90 100 110 120 130

       180 190 200 210 220 230
pF1KB7 HESFRPKLEQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVRAKKPRKARTA
       :.:: ::.. :... :.:. : .::::::::.:::.::::: :::::::::
NP_064 -EDFRDKLDKSGSNAS------SDSEYK---VKEEGDREISSSRDSPPVRLKKPRKARTA
       140 150 160 170 180

       240 250 260 270 280 290
pF1KB7 FSDHQLNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELL
       :.:::: ::::::::::::::::::.:::.::::::::::::::::::::::::::::::
NP_064 FTDHQLAQLERSFERQKYLSVQDRMELAASLNLTDTQVKTWYQNRRTKWKRQTAVGLELL
       190 200 210 220 230 240

       300 310 320 330 340 350
pF1KB7 AEAGNYSALQRMFPSPYFYHPSLLGSMDSTTAAAAAAAMYSSMYRTPPAPHPQLQRPLVP
       ::::::::::::::::::: ::....: .::.: .:: : :: : :::::::
NP_064 AEAGNYSALQRMFPSPYFYPQSLVSNLD------PGAALY--LYRGPSAPPPALQRPLVP
       250 260 270 280 290

       360 370 380
pF1KB7 RVLIHGLGPGGQPA--LNPLSSPIPGTPHPR
       :.::::: ...: : ::.. .: . .::
NP_064 RILIHGLQGASEPPPPLPPLAGVLPRAAQPR
       300 310 320

>>NP_001099044 (OMIM: 613380) homeobox protein HMX3 [Hom (357 aa)
 initn: 383 init1: 250 opt: 351 Z-score: 186.4 bits: 43.2 E(85289): 0.0012
Smith-Waterman score: 357; 29.7% identity (51.1% similar) in 364 aa overlap (49-360:4-351)

       20 30 40 50 60 70
pF1KB7 SASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGTAPSSPISVTMEPPEPHLV
       :.: :..::: ..: :: :.
NP_001 MPEPGP----DAAGTASAQPQPPPPPPPAPKES
       10 20

       80 90 100
pF1KB7 ADATQH-----HHHLHHSQQPPP-----PA-----------AAPTQSLQ-----------
       . .. ::. . :::: :: :: .:.
NP_001 PFSIKNLLNGDHHRPPPKPQPPPRTLFAPASAAAAAAAAAAAAAKGALEGAAGFALSQVG
       30 40 50 60 70 80

       110 120 130 140 150
pF1KB7 ----P---LPQQQQPLPPQQPPPPP----PQQLGSAASA-PRTSTSSFLIKDILGDSKPL
       : .: :. :: . : : : :.. :: .: : .: ::.:
NP_001 DLAFPRFEIPAQRFALPAHYLERSPAWWYPYTLTPAGGHLPRPEASE---KALLRDSSPA
       90 100 110 120 130 140

       160 170 180 190 200 210
pF1KB7 AAC---APYSTSVSSPHHTPKQESNAVHESFRPKLEQEDSKTKLDKREDSQSDIKCHGTK
       .. .: ..: : . .:.. : . . ..:.:: . . . . ..
NP_001 SGTDRDSPEPLLKADPDHK-ELDSKSPDEIILEESDSEESKKEGEAAPGAAGASVGAAAA
       150 160 170 180 190 200

       220 230 240 250 260 270
pF1KB7 EEGDREITSSRESPPVR-AKKPRKARTAFSDHQLNQLERSFERQKYLSVQDRMDLAAALN
       : .. .. ::: . : . .:.::.:: :. ::: .:. ..::: ..: :::.:.
NP_001 TPGAEDWKKGAESPEKKPACRKKKTRTVFSRSQVFQLESTFDMKRYLSSSERAGLAASLH
       210 220 230 240 250 260

       280 290 300 310 320 330
pF1KB7 LTDTQVKTWYQNRRTKWKRQTAVGLELLAEAGNYSALQRMFPSPYFYHPSLLGSMDSTTA
       ::.:::: :.::::.::::: :. :: : ...: ::. : .:: .:..
NP_001 LTETQVKIWFQNRRNKWKRQLAAELE--AANLSHAAAQRIVRVPILYHE------NSAAE
       270 280 290 300 310

       340 350 360 370 380
pF1KB7 AAAAAAMYSSMYRTPPA---PHP-QLQRPLVPRVLIHGLGPGGQPALNPLSSPIPGTPHP
       .::::: . . . : ::: ..:.: :
NP_001 GAAAAAAGAPVPVSQPLLTFPHPVYYSHPVVSSVPLLRPV
       320 330 340 350

pF1KB7 R

>>NP_005213 (OMIM: 600030) homeobox protein DLX-6 [Homo (293 aa)
 initn: 469 init1: 207 opt: 328 Z-score: 177.1 bits: 41.1 E(85289): 0.004
Smith-Waterman score: 328; 29.8% identity (55.9% similar) in 272 aa overlap (103-355:32-290)

       80 90 100 110 120 130
pF1KB7 PEPHLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQPPPPPPQQLGSAA
       :. : :::: :: :::::: : :
NP_005 MTMTTMADGLEGQDSSKSAFMEFGQQQQQQQQQQQQQQQQQQQPP--PPPPPPPQPHSQQ
       10 20 30 40 50

       140 150 160 170 180
pF1KB7 SAPRTSTSSFLIKDILGDSKPLAACAPYSTSVSSPHHTPKQESNA----VHESF--RPKL
       :.: . . . .. . . . :: . . . :: :.. :.:. : .
NP_005 SSPAMAGAHYPLHCLHSAAAAAAAGSHHHHHHQHHHHGSPYASGGGNSYNHRSLAAYPYM
       60 70 80 90 100 110

       190 200 210 220 230 240
pF1KB7 EQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVR----AKKPRKARTAFSDH
       . . . :.. ..:.. . .: .. :.. :. :. .: .:: :: :: .:.
NP_005 SHSQHSPYLQSYHNSSAAAQTRG--DDTDQQKTTVIENGEIRFNGKGKKIRKPRTIYSSL
       120 130 140 150 160 170

       250 260 270 280 290 300
pF1KB7 QLNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELLAEAG
       ::. :.. :.. .::.. .: .:::.:.::.:::: :.::.:.:.:. : ..
NP_005 QLQALNHRFQQTQYLALPERAELAASLGLTQTQVKIWFQNKRSKFKKLLKQG----SNPH
       180 190 200 210 220 230

       310 320 330 340 350
pF1KB7 NYSALQRMFP-SPYFYHPSLLGSMDSTTAAAAAAAM--------YSSMYRTPPAPHPQLQ
       . . :: :: :.: : ..:.: ...: :: : .: . .:
NP_005 ESDPLQGSAALSPR--SPALPPVWD-VSASAKGVSMPPNSYMPGYSHWYSSP--HQDTMQ
       240 250 260 270 280

       360 370 380
pF1KB7 RPLVPRVLIHGLGPGGQPALNPLSSPIPGTPHPR
       ::
NP_005 RPQMM
       290

387 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 03:10:58 2016 done: Sun Nov 6 03:10:59 2016
 Total Scan time: 10.430 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]