FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5154, 100 aa 1>>>pF1KE5154 100 - 100 aa - 100 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8833+/-0.000252; mu= 8.3896+/- 0.016 mean_var=104.2172+/-20.918, 0's: 0 Z-trim(123.2): 172 B-trim: 2100 in 1/59 Lambda= 0.125633 statistics sampled from 42456 (42701) to 42456 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.844), E-opt: 0.2 (0.501), width: 16 Scan time: 4.150 The best scores are: opt bits E(85289) NP_067545 (OMIM: 603260) homeobox protein BarH-lik ( 254) 667 129.9 1.7e-30 XP_011541346 (OMIM: 604823) PREDICTED: homeobox pr ( 134) 447 89.8 1e-18 XP_011541345 (OMIM: 604823) PREDICTED: homeobox pr ( 233) 447 90.0 1.6e-18 NP_003649 (OMIM: 604823) homeobox protein BarH-lik ( 279) 447 90.1 1.8e-18 NP_001091639 (OMIM: 611074) brain-specific homeobo ( 233) 237 52.0 4.5e-07 NP_068777 (OMIM: 142995) H2.0-like homeobox protei ( 488) 232 51.3 1.5e-06 NP_835221 (OMIM: 600029) homeobox protein DLX-1 is ( 255) 212 47.5 1.1e-05 NP_064447 (OMIM: 605212) barH-like 2 homeobox prot ( 387) 209 47.1 2.2e-05 NP_002720 (OMIM: 604420) hematopoietically-express ( 270) 206 46.4 2.5e-05 NP_064448 (OMIM: 605211) barH-like 1 homeobox prot ( 327) 207 46.7 2.5e-05 NP_004396 (OMIM: 126255) homeobox protein DLX-2 [H ( 328) 204 46.1 3.7e-05 XP_016867292 (OMIM: 220600,600028) PREDICTED: home ( 161) 198 44.8 4.6e-05 XP_005250242 (OMIM: 220600,600028) PREDICTED: home ( 161) 198 44.8 4.6e-05 NP_005213 (OMIM: 600030) homeobox protein DLX-6 [H ( 293) 201 45.5 4.9e-05 XP_016868632 (OMIM: 610772) PREDICTED: homeobox pr ( 265) 198 44.9 6.7e-05 NP_005212 (OMIM: 220600,600028) homeobox protein D ( 289) 198 45.0 7.1e-05 NP_005211 (OMIM: 104510,190320,600525) homeobox pr ( 287) 194 44.2 0.00012 NP_001925 (OMIM: 601911,616788) homeobox protein D ( 168) 189 43.1 0.00015 XP_016879780 (OMIM: 601911,616788) PREDICTED: home ( 168) 189 43.1 0.00015 NP_149416 (OMIM: 607808) homeobox protein Nkx-2.4 ( 354) 193 44.1 0.00016 NP_612138 (OMIM: 601911,616788) homeobox protein D ( 240) 189 43.3 0.00019 NP_001476 (OMIM: 601135) homeobox protein GBX-2 is ( 348) 191 43.8 0.0002 NP_076920 (OMIM: 142965) homeobox protein Hox-B4 [ ( 251) 189 43.3 0.0002 NP_796374 (OMIM: 605955) homeobox protein Nkx-6.2 ( 277) 189 43.3 0.00021 XP_016872278 (OMIM: 605955) PREDICTED: homeobox pr ( 277) 189 43.3 0.00021 NP_002135 (OMIM: 142968,614744) homeobox protein H ( 301) 189 43.4 0.00023 NP_006159 (OMIM: 602563) homeobox protein Nkx-6.1 ( 367) 190 43.6 0.00023 NP_001180 (OMIM: 602183,613330) homeobox protein N ( 333) 189 43.4 0.00025 XP_016867454 (OMIM: 603354) PREDICTED: homeobox pr ( 203) 185 42.5 0.00028 XP_016867453 (OMIM: 603354) PREDICTED: homeobox pr ( 204) 185 42.5 0.00028 NP_066305 (OMIM: 604640) T-cell leukemia homeobox ( 291) 187 43.0 0.00029 XP_011530999 (OMIM: 600034) PREDICTED: homeobox pr ( 119) 179 41.2 0.0004 NP_001129743 (OMIM: 217095,611770) homeobox protei ( 301) 184 42.5 0.00043 NP_001092304 (OMIM: 603354) homeobox protein GBX-1 ( 363) 185 42.7 0.00043 NP_055435 (OMIM: 142974) homeobox protein Hox-C4 [ ( 264) 183 42.2 0.00044 NP_705897 (OMIM: 142974) homeobox protein Hox-C4 [ ( 264) 183 42.2 0.00044 NP_001035091 (OMIM: 214300,600147) homeobox protei ( 139) 179 41.3 0.00045 NP_660328 (OMIM: 606727) homeobox protein Nkx-2.3 ( 364) 184 42.5 0.00049 NP_001158727 (OMIM: 142994,176450) motor neuron an ( 189) 180 41.6 0.0005 NP_005512 (OMIM: 186770) T-cell leukemia homeobox ( 330) 183 42.3 0.00052 XP_011538046 (OMIM: 186770) PREDICTED: T-cell leuk ( 342) 183 42.3 0.00053 NP_001182446 (OMIM: 186770) T-cell leukemia homeob ( 257) 180 41.7 0.00063 XP_011538047 (OMIM: 186770) PREDICTED: T-cell leuk ( 269) 180 41.7 0.00065 NP_008827 (OMIM: 142950) homeobox protein Hox-A7 [ ( 230) 179 41.4 0.00066 NP_000200 (OMIM: 125853,260370,600733,606176,60639 ( 283) 180 41.7 0.00067 NP_004518 (OMIM: 214300,600147) homeobox protein M ( 254) 179 41.5 0.00071 NP_005513 (OMIM: 142955,601536) homeobox protein H ( 335) 180 41.8 0.00076 NP_004088 (OMIM: 600034) homeobox protein EMX1 [Ho ( 290) 179 41.5 0.00078 NP_710160 (OMIM: 142972) homeobox protein Hox-C6 i ( 153) 175 40.6 0.0008 NP_004378 (OMIM: 108900,187500,217095,225250,60058 ( 324) 179 41.6 0.00084 >>NP_067545 (OMIM: 603260) homeobox protein BarH-like 1 (254 aa) initn: 667 init1: 667 opt: 667 Z-score: 668.6 bits: 129.9 E(85289): 1.7e-30 Smith-Waterman score: 667; 100.0% identity (100.0% similar) in 100 aa overlap (1-100:155-254) 10 20 30 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQL :::::::::::::::::::::::::::::: NP_067 GKLEAAGPGEPGTKAKKGRRSRTVFTELQLMGLEKRFEKQKYLSTPDRIDLAESLGLSQL 130 140 150 160 170 180 40 50 60 70 80 90 pF1KE5 QVKTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_067 QVKTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVP 190 200 210 220 230 240 100 pF1KE5 GEPSDRSRED :::::::::: NP_067 GEPSDRSRED 250 >>XP_011541346 (OMIM: 604823) PREDICTED: homeobox protei (134 aa) initn: 464 init1: 315 opt: 447 Z-score: 456.8 bits: 89.8 E(85289): 1e-18 Smith-Waterman score: 447; 68.7% identity (86.9% similar) in 99 aa overlap (1-99:1-97) 10 20 30 40 50 60 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQLQVKTWYQNRRMKWKKIVLQGGGLESPTKPK :::::.:.::::::::::.:::.::::.:::::::::::::::::.::.:: :.::::: XP_011 MGLEKKFQKQKYLSTPDRLDLAQSLGLTQLQVKTWYQNRRMKWKKMVLKGGQ-EAPTKPK 10 20 30 40 50 70 80 90 100 pF1KE5 GRPKKNSIPTSEQLTEQERAKDAEKPAEVPGEPSDRSRED ::::::::::::.. .:. .. . : :::. ..: XP_011 GRPKKNSIPTSEEIEAEEKMNSQAQGQE-QLEPSQGQEELCEAQEPKARDVPLEMAEPPD 60 70 80 90 100 110 XP_011 PPQELPIPSSEPPPLS 120 130 >>XP_011541345 (OMIM: 604823) PREDICTED: homeobox protei (233 aa) initn: 454 init1: 315 opt: 447 Z-score: 453.6 bits: 90.0 E(85289): 1.6e-18 Smith-Waterman score: 447; 68.7% identity (86.9% similar) in 99 aa overlap (1-99:100-196) 10 20 30 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQL :::::.:.::::::::::.:::.::::.:: XP_011 LASSESETEQPTPRQKKPRRSRTIFTELQLMGLEKKFQKQKYLSTPDRLDLAQSLGLTQL 70 80 90 100 110 120 40 50 60 70 80 90 pF1KE5 QVKTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVP :::::::::::::::.::.:: :.:::::::::::::::::.. .:. .. . : XP_011 QVKTWYQNRRMKWKKMVLKGGQ-EAPTKPKGRPKKNSIPTSEEIEAEEKMNSQAQGQE-Q 130 140 150 160 170 180 100 pF1KE5 GEPSDRSRED :::. ..: XP_011 LEPSQGQEELCEAQEPKARDVPLEMAEPPDPPQELPIPSSEPPPLS 190 200 210 220 230 >>NP_003649 (OMIM: 604823) homeobox protein BarH-like 2 (279 aa) initn: 454 init1: 315 opt: 447 Z-score: 452.6 bits: 90.1 E(85289): 1.8e-18 Smith-Waterman score: 447; 68.7% identity (86.9% similar) in 99 aa overlap (1-99:146-242) 10 20 30 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQL :::::.:.::::::::::.:::.::::.:: NP_003 LASSESETEQPTPRQKKPRRSRTIFTELQLMGLEKKFQKQKYLSTPDRLDLAQSLGLTQL 120 130 140 150 160 170 40 50 60 70 80 90 pF1KE5 QVKTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVP :::::::::::::::.::.:: :.:::::::::::::::::.. .:. .. . : NP_003 QVKTWYQNRRMKWKKMVLKGGQ-EAPTKPKGRPKKNSIPTSEEIEAEEKMNSQAQGQE-Q 180 190 200 210 220 230 100 pF1KE5 GEPSDRSRED :::. ..: NP_003 LEPSQGQEELCEAQEPKARDVPLEMAEPPDPPQELPIPSSEPPPLS 240 250 260 270 >>NP_001091639 (OMIM: 611074) brain-specific homeobox pr (233 aa) initn: 219 init1: 219 opt: 237 Z-score: 247.9 bits: 52.0 E(85289): 4.5e-07 Smith-Waterman score: 237; 48.7% identity (73.7% similar) in 76 aa overlap (2-73:124-199) 10 20 30 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQLQ ::::::: :.:::::.:..:: .:.::. : NP_001 LFPHPQHAELPGKHCRRRKARTVFSDSQLSGLEKRFEIQRYLSTPERVELATALSLSETQ 100 110 120 130 140 150 40 50 60 70 80 pF1KE5 VKTWYQNRRMKWKKIVLQGGGL----ESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPA ::::.:::::: :: . .. ..: .:.: :. . :. . NP_001 VKTWFQNRRMKHKKQLRKSQDEPKAPDGPESPEGSPRGSEAATAAEARLSLPAGPFVLTE 160 170 180 190 200 210 90 100 pF1KE5 EVPGEPSDRSRED NP_001 PEDEVDIGDEGELGSGPHVL 220 230 >>NP_068777 (OMIM: 142995) H2.0-like homeobox protein [H (488 aa) initn: 209 init1: 209 opt: 232 Z-score: 238.8 bits: 51.3 E(85289): 1.5e-06 Smith-Waterman score: 232; 39.4% identity (69.7% similar) in 99 aa overlap (2-100:290-385) 10 20 30 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQLQ ::::::: :::.. ::: .:: :::.. : NP_068 YAVLTKDTMPQTYKRKRSWSRAVFSNLQRKGLEKRFEIQKYVTKPDRKQLAAMLGLTDAQ 260 270 280 290 300 310 40 50 60 70 80 90 pF1KE5 VKTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVPG ::.:.:::::::.. .. . .. : :. ... :... ::.. . ... .:. . NP_068 VKVWFQNRRMKWRH-SKEAQAQKDKDKEAGEKPSGGAPAADG--EQDERSPSRSEGEAES 320 330 340 350 360 370 100 pF1KE5 EPSDRSRED : :: : NP_068 ESSDSESLDMAPSDTERTEGSERSLHQTTVIKAPVTGALITASSAGSGGSSGGGGNSFSF 380 390 400 410 420 430 >>NP_835221 (OMIM: 600029) homeobox protein DLX-1 isofor (255 aa) initn: 219 init1: 201 opt: 212 Z-score: 222.9 bits: 47.5 E(85289): 1.1e-05 Smith-Waterman score: 212; 47.1% identity (76.5% similar) in 68 aa overlap (3-69:143-210) 10 20 30 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQLQV :..::.. .::. :.: .:: ::::.: :: NP_835 VVEGGEVRFNGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQV 120 130 140 150 160 170 40 50 60 70 80 90 pF1KE5 KTWYQNRRMKWKKIVLQGGG-LESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVPG : :.::.: :.::.. :::. ::. . .:: . . : NP_835 KIWFQNKRSKFKKLMKQGGAALEGSALANGRALSAGSPPVPPGWNPNSSSGKGSGGNAGS 180 190 200 210 220 230 100 pF1KE5 EPSDRSRED NP_835 YIPSYTSWYPSAHQEAMQQPQLM 240 250 >>NP_064447 (OMIM: 605212) barH-like 2 homeobox protein (387 aa) initn: 209 init1: 209 opt: 209 Z-score: 217.6 bits: 47.1 E(85289): 2.2e-05 Smith-Waterman score: 209; 67.4% identity (88.4% similar) in 43 aa overlap (3-45:247-289) 10 20 30 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQLQV ::. ::.:::::. ::.::: .:.:.. :: NP_064 EITSSRESPPVRAKKPRKARTAFSDHQLNQLERSFERQKYLSVQDRMDLAAALNLTDTQV 220 230 240 250 260 270 40 50 60 70 80 90 pF1KE5 KTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVPGE :::::::: :::. NP_064 KTWYQNRRTKWKRQTAVGLELLAEAGNYSALQRMFPSPYFYHPSLLGSMDSTTAAAAAAA 280 290 300 310 320 330 >>NP_002720 (OMIM: 604420) hematopoietically-expressed h (270 aa) initn: 214 init1: 196 opt: 206 Z-score: 216.7 bits: 46.4 E(85289): 2.5e-05 Smith-Waterman score: 206; 40.6% identity (64.4% similar) in 101 aa overlap (3-100:152-245) 10 20 30 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQLQV :::.:: ::::: :.: ::. : ::. :: NP_002 KPLLWSPFLQRPLHKRKGGQVRFSNDQTIELEKKFETQKYLSPPERKRLAKMLQLSERQV 130 140 150 160 170 180 40 50 60 70 80 90 pF1KE5 KTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVPGE :::.:::: ::... :.: . : . . .:. .: . .:. .. .: : . . NP_002 KTWFQNRRAKWRRL-----KQENPQSNK-KEELESLDSSCD-QRQDLPSEQNKGASLDSS 190 200 210 220 230 100 pF1KE5 ---PSDRSRED :: :.:: NP_002 QCSPSPASQEDLESEISEDSDQEVDIEGDKSYFNAG 240 250 260 270 >>NP_064448 (OMIM: 605211) barH-like 1 homeobox protein (327 aa) initn: 207 init1: 207 opt: 207 Z-score: 216.6 bits: 46.7 E(85289): 2.5e-05 Smith-Waterman score: 207; 67.4% identity (88.4% similar) in 43 aa overlap (3-45:193-235) 10 20 30 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQLQV ::. ::.:::::. ::..:: ::.:.. :: NP_064 EISSSRDSPPVRLKKPRKARTAFTDHQLAQLERSFERQKYLSVQDRMELAASLNLTDTQV 170 180 190 200 210 220 40 50 60 70 80 90 pF1KE5 KTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVPGE :::::::: :::. NP_064 KTWYQNRRTKWKRQTAVGLELLAEAGNYSALQRMFPSPYFYPQSLVSNLDPGAALYLYRG 230 240 250 260 270 280 100 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:01:57 2016 done: Mon Nov 7 22:01:58 2016 Total Scan time: 4.150 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]