FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3677, 268 aa
1>>>pF1KE3677 268 - 268 aa - 268 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.9767+/-0.000273; mu= 5.6989+/- 0.017
mean_var=200.8772+/-39.417, 0's: 0 Z-trim(125.8): 24 B-trim: 0 in 0/60
Lambda= 0.090492
statistics sampled from 50217 (50250) to 50217 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.855), E-opt: 0.2 (0.589), width: 16
Scan time: 7.870
The best scores are:                                       opt  bits E(85289)
NP_061140 (OMIM: 608689) mesoderm posterior protei ( 268) 1889 257.8 1.5e-68
NP_001035047 (OMIM: 277300,605195,608681) mesoderm ( 397)  702 103.0 9.2e-22
NP_001099039 (OMIM: 612209) mesogenin-1 [Homo sapi ( 193)  241  42.5 0.00071
>>NP_061140 (OMIM: 608689) mesoderm posterior protein 1 (268 aa)
initn: 1889 init1: 1889 opt: 1889 Z-score: 1351.4 bits: 257.8 E(85289): 1.5e-68
Smith-Waterman score: 1889; 100.0% identity (100.0% similar) in 268 aa overlap (1-268:1-268)
               10        20        30        40        50        60
pF1KE3 MAQPLCPPLSESWMLSAAWGPTRRPPPSDKDCGRSLVSSPDSWGSTPADSPVASPARPGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_061 MAQPLCPPLSESWMLSAAWGPTRRPPPSDKDCGRSLVSSPDSWGSTPADSPVASPARPGT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE3 LRDPRAPSVGRRGARSSRLGSGQRQSASEREKLRMRTLARALHELRRFLPPSVAPAGQSL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_061 LRDPRAPSVGRRGARSSRLGSGQRQSASEREKLRMRTLARALHELRRFLPPSVAPAGQSL
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE3 TKIETLRLAIRYIGHLSAVLGLSEESLQRRCRQRGDAGSPRGCPLCPDDCPAQMQTRTQA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_061 TKIETLRLAIRYIGHLSAVLGLSEESLQRRCRQRGDAGSPRGCPLCPDDCPAQMQTRTQA
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE3 EGQGQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_061 EGQGQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPS
              190       200       210       220       230       240
              250       260
pF1KE3 PLLPGDVLALLETWMPLSPLEWLPEEPK
       ::::::::::::::::::::::::::::
NP_061 PLLPGDVLALLETWMPLSPLEWLPEEPK
              250       260
>>NP_001035047 (OMIM: 277300,605195,608681) mesoderm pos (397 aa)
initn: 720 init1: 620 opt: 702 Z-score: 511.7 bits: 103.0 E(85289): 9.2e-22
Smith-Waterman score: 794; 52.1% identity (62.7% similar) in 303 aa overlap (1-268:1-286)
10 20 30 40 50
pF1KE3 MAQ-PLCPPLS----ESWMLSAAWGPTRRPPPSDKDCGRSLVSSPDSWGSTPADSP--VA
::: : :: : . :... .:: . . : . : .:: :: :: : :. .
NP_001 MAQSP--PPQSLLGHDHWIFAQGWGWA-----GHWD-STSPASSSDSSGSCPCDGARGLP
10 20 30 40 50
60 70 80 90 100 110
pF1KE3 SPARPG-TLRDPRAPSVGRRGARSSRLGSGQRQSASEREKLRMRTLARALHELRRFLPPS
.: :. . : .: .. : ::.. : :::::::::::::::::::::::::::::::
NP_001 QPQPPSCSSRAAEAAATTPRRARTGPAG-GQRQSASEREKLRMRTLARALHELRRFLPPS
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE3 VAPAGQSLTKIETLRLAIRYIGHLSAVLGLSEESLQRRCRQRGDAGSPRGCPLCPDDCPA
.::::::::::::::::::::::::::::::::::: : ::::::::: ::::::: ::
NP_001 LAPAGQSLTKIETLRLAIRYIGHLSAVLGLSEESLQCRRRQRGDAGSPWGCPLCPDRGPA
120 130 140 150 160 170
180 190 200 210
pF1KE3 QMQTRTQAEGQGQGRGLG--------------------LVSAVRAGASWGSPPACPGARA
. ::.....:::::.: : ::::: : :::::: :::::.:
NP_001 EAQTQAEGQGQGQGQGQGQGQGQGQGQGQGQGQGRRPGLVSAVLAEASWGSPSACPGAQA
180 190 200 210 220 230
220 230 240 250 260
pF1KE3 APE-------PRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPE
::: :: : . ::. : ::: : : : : . :
NP_001 APERLGRGVHDTDPWA--TPPYCPKIQ----SPPYSSQGTTSDASL--WTPPQGCPWTQS
240 250 260 270 280
pF1KE3 EPK
:.
NP_001 SPEPRNPPVPWTAAPATLELAAVYQGLSVSPEPCLSLGAPSLLPHPSCQRLQPQTPGRCW
290 300 310 320 330 340
>>NP_001099039 (OMIM: 612209) mesogenin-1 [Homo sapiens] (193 aa)
initn: 261 init1: 220 opt: 241 Z-score: 190.5 bits: 42.5 E(85289): 0.00071
Smith-Waterman score: 241; 47.2% identity (76.4% similar) in 89 aa overlap (57-145:101-187)
30 40 50 60 70 80
pF1KE3 PSDKDCGRSLVSSPDSWGSTPADSPVASPARPGTLRDPRAPSVGRRGARSSRLGSGQRQS
.: :. .:.. ..:.. :.. .:..
NP_001 HGGASSGGSEGCSVGGASGLVEVDYNMLAFQPTHLQGGGGPKA-QKGTKV-RMSVQRRRK
80 90 100 110 120
90 100 110 120 130 140
pF1KE3 ASEREKLRMRTLARALHELRRFLPPSVAPAGQSLTKIETLRLAIRYIGHLSAVLGLSEES
::::::::::::: ::: :: .::: . :: ::::.::. .:.:::.:. .:. ..:
NP_001 ASEREKLRMRTLADALHTLRNYLPPVYSQRGQPLTKIQTLKYTIKYIGELTDLLNRGREP
130 140 150 160 170 180
150 160 170 180 190 200
pF1KE3 LQRRCRQRGDAGSPRGCPLCPDDCPAQMQTRTQAEGQGQGRGLGLVSAVRAGASWGSPPA
NP_001 RAQSA
190
268 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 14:10:01 2016 done: Sun Nov 6 14:10:02 2016
Total Scan time: 7.870 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]
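
Note on the reported statistics: the bit scores and E(85289) values in this report appear consistent with the relation E ≈ m · n · D · 2^(-bits), where m is the query length (268 aa), n is the length of the library sequence, and D is the number of library sequences (85289). The sketch below is only a rough cross-check under that assumption. It is not presented as the way FASTA itself derives E(), and the names `expect_from_bits` and `HITS` are hypothetical, introduced here for illustration. The hit values are copied from the report above.

```python
# Rough cross-check of the reported E() values from the reported bit scores.
# Assumption (not stated in the report): E ~= m * n * D * 2**(-bits).

QUERY_LEN = 268        # "268 residues in 1 query sequences"
LIBRARY_SEQS = 85289   # "60827320 residues in 85289 library sequences"

# (accession, library sequence length, bits, reported E(85289))
HITS = [
    ("NP_061140", 268, 257.8, 1.5e-68),
    ("NP_001035047", 397, 103.0, 9.2e-22),
    ("NP_001099039", 193, 42.5, 0.00071),
]


def expect_from_bits(bits, query_len, subject_len, n_seqs):
    """Estimate an expectation value from a bit score (assumed relation)."""
    return query_len * subject_len * n_seqs * 2.0 ** (-bits)


if __name__ == "__main__":
    for name, subject_len, bits, reported in HITS:
        estimate = expect_from_bits(bits, QUERY_LEN, subject_len, LIBRARY_SEQS)
        print(f"{name}: estimated {estimate:.2g}, reported {reported:.2g}")
```

Because the bit scores are printed to one decimal place, the estimates can only be expected to agree with the reported values to within a few percent.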