FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9715, 392 aa
1>>>pF1KB9715 392 - 392 aa - 392 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 11.6417+/-0.000423; mu= -6.3236+/- 0.027
mean_var=579.3323+/-118.381, 0's: 0 Z-trim(126.3): 32 B-trim: 1016 in 1/61
Lambda= 0.053286
statistics sampled from 51953 (52018) to 51953 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.84), E-opt: 0.2 (0.61), width: 16
Scan time: 10.180
The best scores are:                                       opt  bits E(85289)
NP_001417 (OMIM: 131290) homeobox protein engraile ( 392) 2686 220.4 5.9e-57
NP_001418 (OMIM: 131310) homeobox protein engraile ( 333)  774  73.3 9.4e-13
>>NP_001417 (OMIM: 131290) homeobox protein engrailed-1 (392 aa)
initn: 2686 init1: 2686 opt: 2686 Z-score: 1143.5 bits: 220.4 E(85289): 5.9e-57
Smith-Waterman score: 2686; 100.0% identity (100.0% similar) in 392 aa overlap (1-392:1-392)
               10        20        30        40        50        60
pF1KB9 MEEQQPEPKSQRDSALGAAAAATPGGLSLSLSPGASGSSGSGSDGDSVPVSPQPAPPSPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MEEQQPEPKSQRDSALGAAAAATPGGLSLSLSPGASGSSGSGSDGDSVPVSPQPAPPSPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 AAPCLPPLAHHPHLPPHPPPPPPQHLAAPAHQPQPAAQLHRTTNFFIDNILRPDFGCKKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AAPCLPPLAHHPHLPPHPPPPPPQHLAAPAHQPQPAAQLHRTTNFFIDNILRPDFGCKKE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 QPPPQLLVAAAARGGAGGGGRVERDRGQTAAGRDPVHPLGTRAPGAASLLCAPDANCGPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 QPPPQLLVAAAARGGAGGGGRVERDRGQTAAGRDPVHPLGTRAPGAASLLCAPDANCGPP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 DGSQPAAAGAGASKAGNPAAAAAAAAAAVAAAAAAAAAKPSDTGGGGSGGGAGSPGAQGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 DGSQPAAAGAGASKAGNPAAAAAAAAAAVAAAAAAAAAKPSDTGGGGSGGGAGSPGAQGT
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 KYPEHGNPAILLMGSANGGPVVKTDSQQPLVWPAWVYCTRYSDRPSSGPRTRKLKKKKNE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 KYPEHGNPAILLMGSANGGPVVKTDSQQPLVWPAWVYCTRYSDRPSSGPRTRKLKKKKNE
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 KEDKRPRTAFTAEQLQRLKAEFQANRYITEQRRQTLAQELSLNESQIKIWFQNKRAKIKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 KEDKRPRTAFTAEQLQRLKAEFQANRYITEQRRQTLAQELSLNESQIKIWFQNKRAKIKK
              310       320       330       340       350       360

              370       380       390
pF1KB9 ATGIKNGLALHLMAQGLYNHSTTTVQDKDESE
       ::::::::::::::::::::::::::::::::
NP_001 ATGIKNGLALHLMAQGLYNHSTTTVQDKDESE
              370       380       390
>>NP_001418 (OMIM: 131310) homeobox protein engrailed-2 (333 aa)
initn: 897 init1: 740 opt: 774 Z-score: 350.0 bits: 73.3 E(85289): 9.4e-13
Smith-Waterman score: 919; 47.1% identity (62.3% similar) in 395 aa overlap (1-392:1-333)
10 20 30 40 50 60
pF1KB9 MEEQQPEPKSQRDSALGAAAAATPGGLSLSLSPGASGSSGSGSDGDSVPVSPQPAPPSPP
:::..:.: : ::::. : . :::. :::. : : :: : .
NP_001 MEENDPKP--------GEAAAAVEGQRQPESSPGG----GSGGGGGS---SPGEADTGRR
10 20 30 40
70 80 90 100 110 120
pF1KB9 AAPCLPPLAHHPHLPPHPPPPPPQHLAAPAHQPQPAAQLHRTTNFFIDNILRPDFGCKKE
: :: . : ::... .: :: :::::::::::.:: .:.
NP_001 RALMLPAV-----------------LQAPGNHQHP----HRITNFFIDNILRPEFGRRKD
50 60 70 80
130 140 150 160 170
pF1KB9 QPPPQLLVAAAARGGAGG-GGRVERDRGQTAAGRDPVHPLGTRAPGAASLLCAPDANCGP
.... ::::: :: . : :.: . . :.: : . ::: :. ::
NP_001 AGTCCAGAGGGRGGGAGGEGGASGAEGGGGAGGSEQLLGSGSREP-RQNPPCAPGAG-GP
90 100 110 120 130 140
180 190 200 210 220 230
pF1KB9 -PD-GSQPAAAGAGASKAGNPAAAAAAAAAAVAAAAAAAAAKPSDTGGGGSGGGAGSPGA
: ::. . : :.::. : ::. .:: :.:
NP_001 LPAAGSDSPGDGEGGSKTL------------------------SLHGGAKKGGDPGGPLD
150 160 170
240 250 260 270 280 290
pF1KB9 QGTKYPEHGNPAILLMGSANGGPVVKTDSQQPLVWPAWVYCTRYSDRPSSGPRTRKLKKK
. : :. . . ...... . . . ::..:::::::::::::::::::.:: :::
NP_001 GSLKARGLGGGDLSVSSDSDSSQAGANLGAQPMLWPAWVYCTRYSDRPSSGPRSRKPKKK
180 190 200 210 220 230
300 310 320 330 340 350
pF1KB9 KNEKEDKRPRTAFTAEQLQRLKAEFQANRYITEQRRQTLAQELSLNESQIKIWFQNKRAK
. .:::::::::::::::::::::::.:::.::::::.::::::::::::::::::::::
NP_001 NPNKEDKRPRTAFTAEQLQRLKAEFQTNRYLTEQRRQSLAQELSLNESQIKIWFQNKRAK
240 250 260 270 280 290
360 370 380 390
pF1KB9 IKKATGIKNGLALHLMAQGLYNHSTTTVQDKDESE
:::::: :: ::.:::::::::::::. . :..::
NP_001 IKKATGNKNTLAVHLMAQGLYNHSTTAKEGKSDSE
300 310 320 330
392 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 06:08:11 2016 done: Sun Nov 6 06:08:12 2016
Total Scan time: 10.180 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]