
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0185.17
(469 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 126 2e-29
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 100 1e-21
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 82 4e-16
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr... 78 6e-15
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 58 2e-14
BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vu... 74 1e-13
CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotia... 50 3e-09
BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sati... 52 6e-07
BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T... 51 1e-06
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid... 45 5e-05
TC84531 37 0.021
BG586293 weakly similar to PIR|E84473|E84 probable retroelement ... 35 0.047
TC86088 calmodulin-like protein 1 [Medicago truncatula] 29 3.4
BE941052 weakly similar to PIR|B85188|B85 retrotransposon like p... 29 3.4
TC85566 homologue to SP|P27880|HS12_MEDSA 18.2 kDa class I heat ... 29 4.4
AW560353 similar to PIR|T00660|T006 hypothetical protein F3I6.23... 28 9.9
BG644690 weakly similar to GP|18542179|gb putative pol protein {... 28 9.9
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 126 bits (316), Expect = 2e-29
Identities = 77/221 (34%), Positives = 122/221 (54%), Gaps = 2/221 (0%)
Frame = +1
Query: 195 IKIIRDGEIIGLSQSHYIEKVLKKFDHFDCKSVSTPFDQNTKL-QPHKRCPVAQLEYSKA 253
+++I++ E I + Q Y+ +L++F P KL + V +Y +
Sbjct: 1 VEVIQNEEGIYICQRKYVTDLLERFGMEKSNLSRNPIAPRCKLIKDENGVKVDATKYKQI 180
Query: 254 IRCLMYAMTCTRPDIAYVVGRLSRYTCNPSKDHWHVVNRVLKYLNGTINYSLIYNGYPSV 313
+ CLMY + TRPD+ YV+ +SR+ P++ H H V RVL+YLNGTIN ++Y S
Sbjct: 181 VGCLMY-LAATRPDLMYVLSLISRFMNCPTELHMHAVKRVLRYLNGTINLGIMYKRNGSE 357
Query: 314 -LEGYTYASWVTCVKDHASTSG*IFNLGGGAVSWGSKKQTCIADSTMAAEFIALAACSKE 372
LE YT + + + D STSG +F L GAVSW SKKQ + ST AEFIA A C+ +
Sbjct: 358 KLEAYTDSDYAGDLDDRKSTSGYVFMLSSGAVSWSSKKQPVVTLSTTKAEFIAAAFCACQ 537
Query: 373 TV*LRNLLYEILIWPKLMRQVSIHCDSQCTLSKAYSQVYNG 413
+V +R +L ++ ++++CD+ T+ + + V +G
Sbjct: 538 SVWMRRVLEKLGYTQS--GSITMYCDNNSTIKLSKNPVLHG 654
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 100 bits (249), Expect = 1e-21
Identities = 59/183 (32%), Positives = 101/183 (54%), Gaps = 3/183 (1%)
Frame = +2
Query: 155 LYVDDMLIFGTDLNEVEKTKSFLSNSFDMKDLGEADVILGIKIIRDGEIIGLSQSHYIEK 214
+YVDD+++ G D++E++ K FL + F +KDLG LG+++ R + I L+Q Y +
Sbjct: 179 VYVDDIVLAGNDISEIQHVKCFLIDRFKIKDLGSLRYFLGLEVARSKQGILLNQRKYTLE 358
Query: 215 VLKKFDHFDCKSVSTPFDQNTKLQPHKRCPV--AQLEYSKAIRCLMYAMTCTRPDIAYVV 272
+L+ + KS TP+D + KL + P+ + +Y + I L+Y +T TRPDI++ V
Sbjct: 359 LLEDSGNLAVKSTLTPYDISLKLH-NSDSPLYNDETQYRRLIGKLIY-LTTTRPDISFAV 532
Query: 273 GRLSRYTCNPSKDHWHVVNRVLKYLNGTINYSLIYNGYPSV-LEGYTYASWVTCVKDHAS 331
+LS++ P + H+ RVL+YL L Y+ ++ L + + W TC S
Sbjct: 533 QQLSQFVSKPQQVHYQAAIRVLQYLKTAPAKGLFYSATSNLKLSSFADSDWATCPTTRKS 712
Query: 332 TSG 334
+G
Sbjct: 713 VTG 721
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 82.0 bits (201), Expect = 4e-16
Identities = 51/147 (34%), Positives = 78/147 (52%), Gaps = 3/147 (2%)
Frame = +1
Query: 293 VLKYLNGTINYSLIYNGYPS---VLEGYTYASWVTCVKDHASTSG*IFNLGGGAVSWGSK 349
VLKYLN ++ SL Y LEGY A + V S SG +F L G +SW +
Sbjct: 16 VLKYLNESLKSSLKYTKAAQEEDALEGYVDADYAGNVDTRKSLSGFVFTLYGTTISWKAN 195
Query: 350 KQTCIADSTMAAEFIALAACSKETV*LRNLLYEILIWPKLMRQVSIHCDSQCTLSKAYSQ 409
+Q+ + ST AE+IA K+ + L+ ++ E+ I V IHCDSQ + A Q
Sbjct: 196 QQSVVTLSTTQAEYIAFVEGVKDAIWLKGMIGELGI---TQEYVKIHCDSQSAIHLANHQ 366
Query: 410 VYNGKSRHIVLRHSHVKDLITNGVISI 436
VY+ +++HI +R ++D+I + I +
Sbjct: 367 VYHERTKHIDIRLHFIRDMIESKEIVV 447
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 78.2 bits (191), Expect = 6e-15
Identities = 41/122 (33%), Positives = 72/122 (58%)
Frame = +2
Query: 320 ASWVTCVKDHASTSG*IFNLGGGAVSWGSKKQTCIADSTMAAEFIALAACSKETV*LRNL 379
+ W + STSG F+LG GA+SW SKKQ +A ST AE+IA +C+ +TV LR +
Sbjct: 2 SDWAGDTETRKSTSGYAFHLGTGAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRI 181
Query: 380 LYEILIWPKLMRQVSIHCDSQCTLSKAYSQVYNGKSRHIVLRHSHVKDLITNGVISIVFV 439
L ++ + I+CD++ ++ + + V++G+S+HI ++ +++LI + I +
Sbjct: 182 LE--VMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQFHKIRELIAEKEVVIEYC 355
Query: 440 RT 441
T
Sbjct: 356 PT 361
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 58.2 bits (139), Expect(2) = 2e-14
Identities = 29/76 (38%), Positives = 44/76 (57%)
Frame = +1
Query: 290 VNRVLKYLNGTINYSLIYNGYPSVLEGYTYASWVTCVKDHASTSG*IFNLGGGAVSWGSK 349
V R+++Y+ GT ++ + G + GY + + ST+G +F L GGAVSW SK
Sbjct: 1 VKRIMRYIKGTSGVAVCFGGSELTVRGYVDSDFAGDHDKRKSTTGYVFTLAGGAVSWLSK 180
Query: 350 KQTCIADSTMAAEFIA 365
QT +A ST AE++A
Sbjct: 181 LQTVVALSTTEAEYMA 228
Score = 38.5 bits (88), Expect(2) = 2e-14
Identities = 17/77 (22%), Positives = 43/77 (55%)
Frame = +3
Query: 365 ALAACSKETV*LRNLLYEILIWPKLMRQVSIHCDSQCTLSKAYSQVYNGKSRHIVLRHSH 424
+L KE + ++ L+ E+ Q++++CDSQ L A + ++ +++HI +++
Sbjct: 228 SLPQACKEAIWMQRLMEEL---GHKQEQITVYCDSQSALHIARNPAFHSRTKHIGIQYHF 398
Query: 425 VKDLITNGVISIVFVRT 441
V++++ G + + + T
Sbjct: 399 VREVVEEGSVDMQKIHT 449
>BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vulgaris},
partial (13%)
Length = 494
Score = 73.9 bits (180), Expect = 1e-13
Identities = 44/123 (35%), Positives = 66/123 (52%), Gaps = 2/123 (1%)
Frame = +1
Query: 263 CTRPDIAYVVGRLSRYTCNPSKDHWHVVNRVLKYLNGTINYSLI--YNGYPSVLEGYTYA 320
C RPDI Y V +S++ +P K H NR+L+Y+ GT+ Y L+ Y V E Y+
Sbjct: 115 C*RPDICYSVSVISKFMHDPRKPHLIAANRILRYVRGTMEYGLLFPYGAKSEVYELICYS 294
Query: 321 SWVTCVKDHASTSG*IFNLGGGAVSWGSKKQTCIADSTMAAEFIALAACSKETV*LRNLL 380
C D STSG +F A+SW +KKQ A S+ AE+IA + + + L +++
Sbjct: 295 DSDWC-GDRRSTSGYVFKFNDAAISWCTKKQPITALSSYEAEYIAGTFATFQALWLDSVI 471
Query: 381 YEI 383
E+
Sbjct: 472 KEL 480
>CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotiana tabacum},
partial (7%)
Length = 780
Score = 49.7 bits (117), Expect(2) = 3e-09
Identities = 30/87 (34%), Positives = 53/87 (60%), Gaps = 2/87 (2%)
Frame = -3
Query: 160 MLIFGTDLNEVEKTKSFLSNSFDMKDLGEADVILGIKIIRDGE--IIGLSQSHYIEKVLK 217
+L+ G++++E++ K+ S DMKDLG A I+G++I+ D + ++ LSQ YI +VL+
Sbjct: 271 LLVVGSNIDEIKNLKTRFSKEIDMKDLGPAKKIIGMQIMIDKQKGVL*LSQVEYITRVLQ 92
Query: 218 KFDHFDCKSVSTPFDQNTKLQPHKRCP 244
F+ + VST + L H++ P
Sbjct: 91 IFNMGNAILVSTTLASHFCLS-HEQSP 14
Score = 29.3 bits (64), Expect(2) = 3e-09
Identities = 16/52 (30%), Positives = 25/52 (47%)
Frame = -1
Query: 73 GC*DCFPKWRVG*GCVYETTWRVCYQRPRT*SV*ID*VVIWVKTSTQTMASK 124
GC DC WR+ *G ++ T R+ + + +W KT ++TM K
Sbjct: 474 GCEDCISSWRLS*GYIHAPT*RILIRSGENGGKTKE-EHVWTKTRSKTMYMK 322
>BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sativa (japonica
cultivar-group)}, partial (8%)
Length = 503
Score = 51.6 bits (122), Expect = 6e-07
Identities = 25/86 (29%), Positives = 47/86 (54%)
Frame = +1
Query: 149 KAVMTCLYVDDMLIFGTDLNEVEKTKSFLSNSFDMKDLGEADVILGIKIIRDGEIIGLSQ 208
K + +YVDD+ + G +++ K+ L+ F++KDLG LG+++ R + +SQ
Sbjct: 109 KKAILIVYVDDIFLTGDHGK*IKRLKNLLAEEFEIKDLGNLKYFLGMEVARWKKGSSISQ 288
Query: 209 SHYIEKVLKKFDHFDCKSVSTPFDQN 234
Y+ +LK+ CK++ P+ N
Sbjct: 289 RKYVLDLLKETRMIGCKTIRDPYGCN 366
>BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T15F17.l
{Arabidopsis thaliana}, partial (3%)
Length = 539
Score = 50.8 bits (120), Expect = 1e-06
Identities = 33/99 (33%), Positives = 51/99 (51%), Gaps = 1/99 (1%)
Frame = -3
Query: 261 MTCTRPDIAYVVGRLSRYTCNPSKDHWHVVNRVLKYLNGTINYSLIYNGYPSV-LEGYTY 319
+T P+I + + LSRY+ P+ H + + + KYL G I+ L Y+ S L GY
Sbjct: 525 LTLQGPNITFSINLLSRYSSAPTMRH*NGIKHICKYLKGIIDMGLFYSKDCSPDLIGYVN 346
Query: 320 ASWVTCVKDHASTSG*IFNLGGGAVSWGSKKQTCIADST 358
A +++ S +G IF G +SW S K + IA S+
Sbjct: 345 A*YLSDPHKARS*TGYIFTCGNTVISWRSTK*STIATSS 229
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 45.4 bits (106), Expect = 5e-05
Identities = 19/53 (35%), Positives = 32/53 (59%)
Frame = +3
Query: 148 GKAVMTCLYVDDMLIFGTDLNEVEKTKSFLSNSFDMKDLGEADVILGIKIIRD 200
GK ++ LYVDD++ G D N E+ K + F+M DLG+ LG+++ ++
Sbjct: 597 GKILIISLYVDDLIFIGNDENMFEEFKKSMKKEFNMSDLGKMHYFLGVEVTQN 755
>TC84531
Length = 655
Score = 36.6 bits (83), Expect = 0.021
Identities = 14/34 (41%), Positives = 24/34 (70%)
Frame = -3
Query: 408 SQVYNGKSRHIVLRHSHVKDLITNGVISIVFVRT 441
++ YNGK R I +HS +++ ++NG + + FVRT
Sbjct: 647 NRYYNGKRRQIRRKHSTIREYLSNGTVRVDFVRT 546
>BG586293 weakly similar to PIR|E84473|E84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(7%)
Length = 763
Score = 35.4 bits (80), Expect = 0.047
Identities = 23/80 (28%), Positives = 41/80 (50%)
Frame = +1
Query: 2 KIQTNRLQMDL*KENDSCWYH**I*S*TCC*KIYTKGRY*LF*YLCTCCKNCIN*SVVGT 61
+ +T+R ++DL* + W I S C ++ R+ L +CT C N + + +G
Sbjct: 82 RCETDRFEVDL*D*EE*RWNVDQIQSKASCKRLRETTRHRLRRSVCTSCSNRNHMTTLGV 261
Query: 62 SFSL*VCNP*NGC*DCFPKW 81
S + + +P + C +C PKW
Sbjct: 262 SSN*WMLDPSHRCKNCIPKW 321
>TC86088 calmodulin-like protein 1 [Medicago truncatula]
Length = 867
Score = 29.3 bits (64), Expect = 3.4
Identities = 20/63 (31%), Positives = 30/63 (46%), Gaps = 5/63 (7%)
Frame = -1
Query: 248 LEYSKAIRCLMYAMTCTRPDIAYVVGRLSRYTCNPS----KDHWHVVNRVLK-YLNGTIN 302
L Y I + +C R ++Y + + C PS H H +++LK YL+ TIN
Sbjct: 693 LSYLSHIIS*ILKFSCIRVPVSYRTRTRTMFICTPSDHNGSTHGHSFDKLLKVYLSITIN 514
Query: 303 YSL 305
SL
Sbjct: 513 ISL 505
>BE941052 weakly similar to PIR|B85188|B85 retrotransposon like protein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 480
Score = 29.3 bits (64), Expect = 3.4
Identities = 13/50 (26%), Positives = 25/50 (50%)
Frame = +2
Query: 395 IHCDSQCTLSKAYSQVYNGKSRHIVLRHSHVKDLITNGVISIVFVRTVKE 444
+ CD ++ VY+ + +HI + V+DL+ G + + V TV +
Sbjct: 23 LRCDYLSATYLTHNPVYHSRMKHISIDIHFVRDLVQQGKLKVQHVCTVDQ 172
>TC85566 homologue to SP|P27880|HS12_MEDSA 18.2 kDa class I heat shock
protein. [Alfalfa] {Medicago sativa}, complete
Length = 946
Score = 28.9 bits (63), Expect = 4.4
Identities = 13/46 (28%), Positives = 27/46 (58%)
Frame = -1
Query: 193 LGIKIIRDGEIIGLSQSHYIEKVLKKFDHFDCKSVSTPFDQNTKLQ 238
LG+++ +D + + + Q Y +++K DCK ++TP + KL+
Sbjct: 811 LGMEVRQDNK*VLICQMKYTREIMK-----DCKRINTPVNLKEKLE 689
>AW560353 similar to PIR|T00660|T006 hypothetical protein F3I6.23 -
Arabidopsis thaliana, partial (16%)
Length = 623
Score = 27.7 bits (60), Expect = 9.9
Identities = 17/39 (43%), Positives = 22/39 (55%), Gaps = 2/39 (5%)
Frame = +3
Query: 394 SIHCDSQCTLSKAYSQVYNGKSR-HIVLRHSH-VKDLIT 430
S HC C ++ SQV N + HI LRHS+ + LIT
Sbjct: 495 SSHCSINCYRFRSQSQVSNTLCKNHITLRHSYKMTSLIT 611
>BG644690 weakly similar to GP|18542179|gb putative pol protein {Zea mays},
partial (22%)
Length = 629
Score = 27.7 bits (60), Expect = 9.9
Identities = 29/92 (31%), Positives = 41/92 (44%)
Frame = -1
Query: 33 KIYTKGRY*LF*YLCTCCKNCIN*SVVGTSFSL*VCNP*NGC*DCFPKWRVG*GCVYETT 92
+I +K R L * TCC+N *+ V NGC +C WR G V + T
Sbjct: 392 RIQSKRRNRL**GFFTCCQNGSY*NFNSFCCIHGVQAVPNGCEECIY*WRSQRGGVCQAT 213
Query: 93 WRVCYQRPRT*SV*ID*VVIWVKTSTQTMASK 124
+ R V I+* IW + S+++M K
Sbjct: 212 SWI*RCRGTKSCVQIE*DTIWSEASSKSMV*K 117
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.351 0.154 0.546
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,875,052
Number of Sequences: 36976
Number of extensions: 246130
Number of successful extensions: 2034
Number of sequences better than 10.0: 34
Number of HSP's better than 10.0 without gapping: 1488
Number of HSP's successfully gapped in prelim test: 65
Number of HSP's that attempted gapping in prelim test: 523
Number of HSP's gapped (non-prelim): 1578
length of query: 469
length of database: 9,014,727
effective HSP length: 100
effective length of query: 369
effective length of database: 5,317,127
effective search space: 1962019863
effective search space used: 1962019863
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.9 bits)
S2: 60 (27.7 bits)
Lotus: description of TM0185.17