FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5229, 186 aa
  1>>>pF1KE5229 186 - 186 aa - 186 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6680+/-0.000312; mu= 12.9283+/- 0.020
 mean_var=107.8411+/-21.016, 0's: 0 Z-trim(118.7): 183 B-trim: 160 in 1/52
 Lambda= 0.123504
 statistics sampled from 31759 (31982) to 31759 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.74), E-opt: 0.2 (0.375), width: 16
 Scan time: 4.820

The best scores are:                                       opt  bits E(85289)
NP_954582 (OMIM: 604294,614402) ventral anterior h ( 186) 1240 230.7 1.1e-60
NP_001106175 (OMIM: 604294,614402) ventral anterio ( 334)  960 181.1 1.7e-45
NP_036608 (OMIM: 604295) ventral anterior homeobox ( 290)  488  96.9 3.1e-20
XP_006712045 (OMIM: 604295) PREDICTED: ventral ant ( 150)  468  93.0 2.3e-19
XP_011531052 (OMIM: 604295) PREDICTED: ventral ant ( 150)  468  93.0 2.3e-19
XP_011531053 (OMIM: 604295) PREDICTED: ventral ant ( 150)  468  93.0 2.3e-19
XP_011530999 (OMIM: 600034) PREDICTED: homeobox pr ( 119)  200  45.2 4.7e-05
NP_004088 (OMIM: 600034) homeobox protein EMX1 [Ho ( 290)  200  45.6 8.8e-05
NP_064448 (OMIM: 605211) barH-like 1 homeobox prot ( 327)  193  44.4 0.00023
NP_004089 (OMIM: 269160,600035) homeobox protein E ( 252)  186  43.0 0.00045
NP_001417 (OMIM: 131290) homeobox protein engraile ( 392)  186  43.2 0.00061
NP_076920 (OMIM: 142965) homeobox protein Hox-B4 [ ( 251)  180  41.9 0.00094
NP_001418 (OMIM: 131310) homeobox protein engraile ( 333)  179  41.9 0.0013
NP_001158727 (OMIM: 142994,176450) motor neuron an ( 189)  171  40.2 0.0023
NP_001073927 (OMIM: 142991) homeobox even-skipped  ( 476)  175  41.3 0.0027
NP_005506 (OMIM: 142994,176450) motor neuron and p ( 401)  171  40.5 0.004
XP_016868632 (OMIM: 610772) PREDICTED: homeobox pr ( 265)  166  39.5 0.0055
XP_005264260 (OMIM: 600034)
PREDICTED: homeobox pr ( 280) 163 39.0 0.0083 >>NP_954582 (OMIM: 604294,614402) ventral anterior homeo (186 aa) initn: 1240 init1: 1240 opt: 1240 Z-score: 1210.7 bits: 230.7 E(85289): 1.1e-60 Smith-Waterman score: 1240; 100.0% identity (100.0% similar) in 186 aa overlap (1-186:1-186) 10 20 30 40 50 60 pF1KE5 MFGKPDKMDVRCHSDAEAARVSKNAHKESRESKGAEGNLPAAFLKEPQGAFSASGAAEDC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_954 MFGKPDKMDVRCHSDAEAARVSKNAHKESRESKGAEGNLPAAFLKEPQGAFSASGAAEDC 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 NKSKSNSAADPDYCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_954 NKSKSNSAADPDYCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 RCQYVVGRERTELARQLNLSETQANSEENNERFKRGIKKQKKKRKKEPANDESRRGDSGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_954 RCQYVVGRERTELARQLNLSETQANSEENNERFKRGIKKQKKKRKKEPANDESRRGDSGG 130 140 150 160 170 180 pF1KE5 RGWQPL :::::: NP_954 RGWQPL >>NP_001106175 (OMIM: 604294,614402) ventral anterior ho (334 aa) initn: 961 init1: 944 opt: 960 Z-score: 937.9 bits: 181.1 E(85289): 1.7e-45 Smith-Waterman score: 960; 89.8% identity (93.4% similar) in 167 aa overlap (1-167:1-162) 10 20 30 40 50 60 pF1KE5 MFGKPDKMDVRCHSDAEAARVSKNAHKESRESKGAEGNLPAAFLKEPQGAFSASGAAEDC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MFGKPDKMDVRCHSDAEAARVSKNAHKESRESKGAEGNLPAAFLKEPQGAFSASGAAEDC 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 NKSKSNSAADPDYCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 NKSKSNSAADPDYCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 RCQYVVGRERTELARQLNLSETQANSEENNERFKRGIKKQKKKRKKEPANDESRRGDSGG :::::::::::::::::::::::.. .:.: :::: . :. 
NP_001 RCQYVVGRERTELARQLNLSETQVKVWFQNRR-----TKQKKDQGKDSELRSVVSETAAT 130 140 150 160 170 pF1KE5 RGWQPL NP_001 CSVLRLLEQGRLLSPPGLPALLPPCATGALGSALRGPSLPALGAGAAAGSAAAAAAAAPG 180 190 200 210 220 230 >>NP_036608 (OMIM: 604295) ventral anterior homeobox 2 [ (290 aa) initn: 508 init1: 461 opt: 488 Z-score: 484.2 bits: 96.9 E(85289): 3.1e-20 Smith-Waterman score: 488; 62.6% identity (80.2% similar) in 131 aa overlap (46-173:45-174) 20 30 40 50 60 70 pF1KE5 AEAARVSKNAHKESRESKGAEGNLPAAFLKEPQGAFSASGAAEDCNKSKSNSAADP---D : :. ..: :. . . :.. : : NP_036 RAESGGGGGRCGDRSGAGDLRADGGGHSPTEVAGTSASSPAGSRESGADSDGQPGPGEAD 20 30 40 50 60 70 80 90 100 110 120 130 pF1KE5 YCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGRERTE .:::::::::::.::::.:::::::::::::::::::::::::::::::::::::::::: NP_036 HCRRILVRDAKGTIREIVLPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGRERTE 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 LARQLNLSETQANSEENNERFKRGIKKQKKKRKKEPANDESRRGDSGGRGWQPL :::::::::::.. .:.: :. : :.. .:. ... : NP_036 LARQLNLSETQVKVWFQNRRTKQK-KDQSRDLEKRASSSASEAFATSNILRLLEQGRLLS 140 150 160 170 180 190 NP_036 VPRAPSLLALTPSLPGLPASHRGTSLGDPRNSSPRLNPLSSASASPPLPPPLPAVCFSSA 200 210 220 230 240 250 >>XP_006712045 (OMIM: 604295) PREDICTED: ventral anterio (150 aa) initn: 492 init1: 461 opt: 468 Z-score: 468.5 bits: 93.0 E(85289): 2.3e-19 Smith-Waterman score: 468; 74.3% identity (85.1% similar) in 101 aa overlap (46-143:45-145) 20 30 40 50 60 70 pF1KE5 AEAARVSKNAHKESRESKGAEGNLPAAFLKEPQGAFSASGAAEDCNKSKSNSAADP---D : :. ..: :. . . :.. 
: : XP_006 RAESGGGGGRCGDRSGAGDLRADGGGHSPTEVAGTSASSPAGSRESGADSDGQPGPGEAD 20 30 40 50 60 70 80 90 100 110 120 130 pF1KE5 YCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGRERTE .:::::::::::.::::.:::::::::::::::::::::::::::::::::::::::::: XP_006 HCRRILVRDAKGTIREIVLPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGRERTE 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 LARQLNLSETQANSEENNERFKRGIKKQKKKRKKEPANDESRRGDSGGRGWQPL ::::::::::: XP_006 LARQLNLSETQNQPQS 140 150 >>XP_011531052 (OMIM: 604295) PREDICTED: ventral anterio (150 aa) initn: 492 init1: 461 opt: 468 Z-score: 468.5 bits: 93.0 E(85289): 2.3e-19 Smith-Waterman score: 468; 74.3% identity (85.1% similar) in 101 aa overlap (46-143:45-145) 20 30 40 50 60 70 pF1KE5 AEAARVSKNAHKESRESKGAEGNLPAAFLKEPQGAFSASGAAEDCNKSKSNSAADP---D : :. ..: :. . . :.. : : XP_011 RAESGGGGGRCGDRSGAGDLRADGGGHSPTEVAGTSASSPAGSRESGADSDGQPGPGEAD 20 30 40 50 60 70 80 90 100 110 120 130 pF1KE5 YCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGRERTE .:::::::::::.::::.:::::::::::::::::::::::::::::::::::::::::: XP_011 HCRRILVRDAKGTIREIVLPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGRERTE 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 LARQLNLSETQANSEENNERFKRGIKKQKKKRKKEPANDESRRGDSGGRGWQPL ::::::::::: XP_011 LARQLNLSETQNQPQS 140 150 >>XP_011531053 (OMIM: 604295) PREDICTED: ventral anterio (150 aa) initn: 492 init1: 461 opt: 468 Z-score: 468.5 bits: 93.0 E(85289): 2.3e-19 Smith-Waterman score: 468; 74.3% identity (85.1% similar) in 101 aa overlap (46-143:45-145) 20 30 40 50 60 70 pF1KE5 AEAARVSKNAHKESRESKGAEGNLPAAFLKEPQGAFSASGAAEDCNKSKSNSAADP---D : :. ..: :. . . :.. 
: : XP_011 RAESGGGGGRCGDRSGAGDLRADGGGHSPTEVAGTSASSPAGSRESGADSDGQPGPGEAD 20 30 40 50 60 70 80 90 100 110 120 130 pF1KE5 YCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGRERTE .:::::::::::.::::.:::::::::::::::::::::::::::::::::::::::::: XP_011 HCRRILVRDAKGTIREIVLPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGRERTE 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 LARQLNLSETQANSEENNERFKRGIKKQKKKRKKEPANDESRRGDSGGRGWQPL ::::::::::: XP_011 LARQLNLSETQNQPQS 140 150 >>XP_011530999 (OMIM: 600034) PREDICTED: homeobox protei (119 aa) initn: 176 init1: 163 opt: 200 Z-score: 211.7 bits: 45.2 E(85289): 4.7e-05 Smith-Waterman score: 200; 39.5% identity (69.8% similar) in 86 aa overlap (99-184:20-102) 70 80 90 100 110 120 pF1KE5 ADPDYCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGR .::: ::.:. :: ::: :.. .:::: XP_011 MVASDVPQDGLLLHGPFARKPKRIRTAFSPSQLLRLERAFEKNHYVVGA 10 20 30 40 130 140 150 160 170 180 pF1KE5 ERTELARQLNLSETQANSEENNERFKRGIKKQKKKRKKEPANDESRRGDSGGRGWQPL :: .:: .:.:::::.. .:.: : :..: ... : ......:. :. XP_011 ERKQLAGSLSLSETQVKVWFQNRRTKY---KRQKLEEEGPESEQKKKGSHHINRWRIATK 50 60 70 80 90 100 XP_011 QANGEDIDVTSND 110 >>NP_004088 (OMIM: 600034) homeobox protein EMX1 [Homo s (290 aa) initn: 180 init1: 163 opt: 200 Z-score: 206.8 bits: 45.6 E(85289): 8.8e-05 Smith-Waterman score: 200; 39.5% identity (69.8% similar) in 86 aa overlap (99-184:191-273) 70 80 90 100 110 120 pF1KE5 ADPDYCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGR .::: ::.:. :: ::: :.. .:::: NP_004 WVLRNRFFGHRFQASDVPQDGLLLHGPFARKPKRIRTAFSPSQLLRLERAFEKNHYVVGA 170 180 190 200 210 220 130 140 150 160 170 180 pF1KE5 ERTELARQLNLSETQANSEENNERFKRGIKKQKKKRKKEPANDESRRGDSGGRGWQPL :: .:: .:.:::::.. .:.: : :..: ... : ......:. :. 
NP_004 ERKQLAGSLSLSETQVKVWFQNRRTKY---KRQKLEEEGPESEQKKKGSHHINRWRIATK 230 240 250 260 270 NP_004 QANGEDIDVTSND 280 290 >>NP_064448 (OMIM: 605211) barH-like 1 homeobox protein (327 aa) initn: 120 init1: 120 opt: 193 Z-score: 199.4 bits: 44.4 E(85289): 0.00023 Smith-Waterman score: 193; 38.5% identity (66.4% similar) in 122 aa overlap (40-154:120-232) 10 20 30 40 50 60 pF1KE5 VRCHSDAEAARVSKNAHKESRESKGAEGNLPAAFLKEPQGAFSASGAAEDC----NKSKS ::: :: : ..:. :::: .:: : NP_064 RTVTSSFLIRDILADCKPLAACAPYSSSGQPAA--PEPGGRLAAK-AAEDFRDKLDKSGS 90 100 110 120 130 140 70 80 90 100 110 120 pF1KE5 NSAADPDYCRRILVRDAKGSIREIILPKG---LDLDRPKRTRTSFTAEQLYRLEMEFQRC :...: .: :.. .:. ::: . . : .:...::.:: .:: .:: :.: NP_064 NASSDSEYK----VKE-EGD-REISSSRDSPPVRLKKPRKARTAFTDHQLAQLERSFERQ 150 160 170 180 190 200 130 140 150 160 170 180 pF1KE5 QYVVGRERTELARQLNLSETQANSEENNERFKRGIKKQKKKRKKEPANDESRRGDSGGRG .:. ..: ::: .:::..::... .:.: : NP_064 KYLSVQDRMELAASLNLTDTQVKTWYQNRRTKWKRQTAVGLELLAEAGNYSALQRMFPSP 210 220 230 240 250 260 pF1KE5 WQPL NP_064 YFYPQSLVSNLDPGAALYLYRGPSAPPPALQRPLVPRILIHGLQGASEPPPPLPPLAGVL 270 280 290 300 310 320 >>NP_004089 (OMIM: 269160,600035) homeobox protein EMX2 (252 aa) initn: 164 init1: 164 opt: 186 Z-score: 194.1 bits: 43.0 E(85289): 0.00045 Smith-Waterman score: 186; 38.4% identity (69.8% similar) in 86 aa overlap (99-184:153-235) 70 80 90 100 110 120 pF1KE5 ADPDYCRRILVRDAKGSIREIILPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGR .::: ::.:. :: ::: :.. .:::: NP_004 LIHRYRYLGHRFQGNDTSPESFLLHNALARKPKRIRTAFSPSQLLRLEHAFEKNHYVVGA 130 140 150 160 170 180 130 140 150 160 170 180 pF1KE5 ERTELARQLNLSETQANSEENNERFKRGIKKQKKKRKKEPANDESRRGDSGGRGWQPL :: .::..:.:.:::.. .:.: : .:.:: ... .. ....: :. 
NP_004 ERKQLAHSLSLTETQVKVWFQNRRTK--FKRQKLEEEGSDSQ-QKKKGTHHINRWRIATK 190 200 210 220 230 NP_004 QASPEEIDVTSDD 240 250

186 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 22:42:17 2016 done: Mon Nov 7 22:42:18 2016
 Total Scan time: 4.820 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]