
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0220.7
(720 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative ret... 245 4e-65
BG587170 similar to PIR|F86470|F8 probable retroelement polyprot... 139 5e-33
BI262917 weakly similar to GP|19920130|g Putative retroelement {... 135 1e-32
BG586273 weakly similar to PIR|F86470|F8 probable retroelement p... 77 2e-14
BF635063 weakly similar to PIR|F84486|F84 probable retroelement ... 71 2e-12
BE239977 weakly similar to GP|23237899|db polyprotein-like {Oryz... 60 2e-09
BF631997 weakly similar to GP|18542925|gb Putative pol polyprote... 51 2e-06
AJ499215 weakly similar to GP|6642775|gb| gag-pol polyprotein {V... 51 2e-06
AW773859 47 3e-05
AL366725 39 0.005
AW736233 GP|21105451|gb small nuclear ribonucleoprotein D1 {Dani... 39 0.007
BG586308 weakly similar to PIR|F84528|F8 probable retroelement p... 39 0.009
BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR p... 37 0.020
AW684891 37 0.027
TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 35 0.10
BQ122739 34 0.17
TC91742 similar to GP|18481720|gb|AAL73542.1 hypothetical protei... 33 0.38
TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finge... 32 0.86
TC78254 similar to GP|15795167|dbj|BAB03155. gene_id:MQC3.30~unk... 32 1.1
TC89711 weakly similar to PIR|T10725|T10725 protein kinase Xa21 ... 32 1.1
>TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative retroelement
{Oryza sativa} [Oryza sativa (japonica cultivar-group)],
partial (10%)
Length = 823
Score = 245 bits (626), Expect = 4e-65
Identities = 129/275 (46%), Positives = 180/275 (64%), Gaps = 5/275 (1%)
Frame = +1
Query: 448 LGVCEHCIL-GKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDST 506
L C+H + G ++KVSFS A ++K L+ +H+D+WGP+ V S GG RY +T IDD
Sbjct: 13 LEFCKHLLFFGNRKKVSFSTATHRTKGI-LDYIHSDLWGPSKVTSYGGRRYMMTIIDDFP 189
Query: 507 RKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRM 566
RKVWVYFL+ K++ F FKKW+ VE QTG +K L +DN E+ S +F +FC+ +GI
Sbjct: 190 RKVWVYFLRYKNETFPTFKKWRILVETQTGKNVKKLITDN*LEFCSSDFNEFCTNHGIAR 369
Query: 567 IKTIPGTPEQNGVAERMNRTLNERARCMRIQSGL--PKMFWPDAINTAAYLINRGPSVPL 624
KTIP P+QNGVAERM RTL ERARCM +GL + W +A +TA +L+NR P L
Sbjct: 370 HKTIPRNPQQNGVAERMIRTLLERARCMLSNAGL*N*RDLWVEAASTACHLVNRSPHSAL 549
Query: 625 DYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFW 684
D+++PE++W G V S+L++FGC +Y L++ KL P+A +C F+ Y S+ GYR W
Sbjct: 550 DFKVPEDIWSGNLVDYSNLRIFGCPAYALVND---GKLAPRAGECIFLSYASESKGYRLW 720
Query: 685 --DEQNRKIIRSINVTFNESVLYKDRSSAESMSSS 717
D +++K+I S +VTFNE L S +S SS
Sbjct: 721 CSDPKSQKLILSRDVTFNEDALLS--SGKQSFVSS 819
>BG587170 similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (13%)
Length = 718
Score = 139 bits (349), Expect = 5e-33
Identities = 84/248 (33%), Positives = 129/248 (51%), Gaps = 7/248 (2%)
Frame = -3
Query: 381 KVTKGNLVVARGKKRGSLYMVAEEDMIA-------VTEAINSSSIWHQRLGHMSEKGMKI 433
+V + + ++ G +G LYM+ + D ++ + ++N ++WH RLGH + + +
Sbjct: 716 EVIESSQLIGEGVTKGDLYMLEKLDPVSNYKCSFTSSSSLNKDALWHARLGHPHGRALNL 537
Query: 434 MASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLG 493
M N CE CILGK K F + ++ +L++TD+W AP S
Sbjct: 536 MLPGVVFENKN------CEACILGKHCKNVFPRTSTVYEN-CFDLIYTDLW-TAPSLSRD 381
Query: 494 GSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQ 553
+Y+VTFID+ ++ W+ + SK V FK ++ V N KIK L+SDNGGEY S
Sbjct: 380 NHKYFVTFIDEKSKYTWLTLIPSKDRVIDAFKNFQAYVTNHYHAKIKILRSDNGGEYTSY 201
Query: 554 EFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAA 613
FK +GI + P TP+QNGVA+R N+ L E AR + Q+ ++TA
Sbjct: 200 AFKSHLDHHGILHQTSCPYTPQQNGVAKRKNKHLMEVARSLMFQAN---------VSTAC 48
Query: 614 YLINRGPS 621
YLIN P+
Sbjct: 47 YLINWIPT 24
>BI262917 weakly similar to GP|19920130|g Putative retroelement {Oryza
sativa} [Oryza sativa (japonica cultivar-group)],
partial (8%)
Length = 426
Score = 135 bits (341), Expect(2) = 1e-32
Identities = 62/113 (54%), Positives = 78/113 (68%)
Frame = +1
Query: 573 TPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEV 632
TP+QNGVAERMNRTL ER R M +G+ K FW +A+ TA Y+INR PS +D + P E+
Sbjct: 85 TPQQNGVAERMNRTLLERTRAMLKTAGMAKSFWAEAVKTACYVINRSPSTVIDLKTPMEM 264
Query: 633 WYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWD 685
W GK V S L VFGC YV+ +S +R KLDPK+ KC F+GY ++ GY WD
Sbjct: 265 WKGKPVDYSSLHVFGCPVYVMYNSQERTKLDPKSRKCIFLGYADNVKGYXLWD 423
Score = 23.1 bits (48), Expect(2) = 1e-32
Identities = 9/17 (52%), Positives = 10/17 (57%)
Frame = +3
Query: 548 GEYDSQEFKKFCSENGI 564
GEY EF FC + GI
Sbjct: 9 GEYVDGEFLAFCKQEGI 59
>BG586273 weakly similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (7%)
Length = 705
Score = 77.4 bits (189), Expect = 2e-14
Identities = 39/103 (37%), Positives = 64/103 (61%)
Frame = -2
Query: 612 AAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFF 671
A YLINR P+ L Q P EV ++ SL++++VFGC+ YVL+ + R+KL+ ++ K F
Sbjct: 704 ACYLINRIPTRVLKDQAPFEVLNQRKPSLTYMRVFGCLCYVLVPGELRNKLEARSRKAMF 525
Query: 672 IGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDRSSAESM 714
IGY + GY+ +D + R+++ S +V F E Y + + E +
Sbjct: 524 IGYSTTQKGYKCYDPEARRVLVSRDVKFIEERGYYEEKNQEDL 396
>BF635063 weakly similar to PIR|F84486|F84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(4%)
Length = 677
Score = 70.9 bits (172), Expect = 2e-12
Identities = 45/152 (29%), Positives = 78/152 (50%), Gaps = 3/152 (1%)
Frame = -2
Query: 5 KVKIERFDG-RDFGFWKMLMEDYLYQKMLYQPLTGK--KPNDMKQEDWDLLDRQALGVIR 61
K IE+F G DFG WK+ ME L Q+ + L G+ P M + + + +A I
Sbjct: 559 KRDIEKFTGDNDFGLWKVKMEAVLIQQKCEKALKGEVSLPVTMSRAEKTEMVDKARSAIV 380
Query: 62 LTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINS 121
L L V + E+T A + L ++Y A++ L ++L++ RM E ++ E +
Sbjct: 379 LCLGDKVLREVAKERTAASMWAKL*SLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTE 200
Query: 122 FNTIISQLSSVKITFDNELMVLSLLQSLPDSW 153
FN I+ L ++++ ++E + LL +LP S+
Sbjct: 199 FNKILDDLENIEVQLEDEEKAILLLCALPKSF 104
>BE239977 weakly similar to GP|23237899|db polyprotein-like {Oryza sativa
(japonica cultivar-group)}, partial (2%)
Length = 514
Score = 60.5 bits (145), Expect = 2e-09
Identities = 31/76 (40%), Positives = 44/76 (57%)
Frame = -2
Query: 644 KVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESV 703
++FGC S+V I SD R K D +A+KC FI Y S GYR + +RK S +VTF+E
Sbjct: 456 RIFGCTSFVHIHSDGRSKFDHRALKCVFIRYSSTQKGYRCYHPPSRKYFVSRDVTFHEQE 277
Query: 704 LYKDRSSAESMSSSKQ 719
Y ++ + S K+
Sbjct: 276 SYFGQN*SSSGGKYKE 229
>BF631997 weakly similar to GP|18542925|gb Putative pol polyprotein {Oryza
sativa}, partial (6%)
Length = 650
Score = 50.8 bits (120), Expect = 2e-06
Identities = 51/158 (32%), Positives = 77/158 (48%), Gaps = 8/158 (5%)
Frame = -1
Query: 524 FKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFC----SENGIRMIKTIPG----TPE 575
FK+WK VE+QTG K+K L+++ + FC + G K I G T E
Sbjct: 608 FKEWKILVESQTGKKVKRLQTE*------RLGNLFCRVQ*TLQGRTHYKAIYGEKYSTKE 447
Query: 576 QNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYG 635
A N ERARCM +GL + F +AI+T YL+N P + +L + YG
Sbjct: 446 WCC*ANEYN--FLERARCMFSNAGLNRSFQAEAISTKCYLVNVLPLLL*TVRL--HLRYG 279
Query: 636 KEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIG 673
++L ++ GC +Y ++ KL+P++ K F G
Sbjct: 278 L-INLLITQILGCPAYYHVN---EGKLEPRSKKGLFWG 177
Score = 48.5 bits (114), Expect = 9e-06
Identities = 27/75 (36%), Positives = 39/75 (52%)
Frame = -2
Query: 512 YFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIP 571
+ +K KS+ F + + + G + K +NG E S EF + C E I T+
Sbjct: 643 FMMKHKSEAFYFSRNGRFWSRVKQGRR*NVFKLNNGLEICSAEFNELCKEEHITRQYTVR 464
Query: 572 GTPEQNGVAERMNRT 586
TP++NGVAERMN T
Sbjct: 463 NTPQKNGVAERMNIT 419
>AJ499215 weakly similar to GP|6642775|gb| gag-pol polyprotein {Vitis
vinifera}, partial (18%)
Length = 567
Score = 50.8 bits (120), Expect = 2e-06
Identities = 26/89 (29%), Positives = 47/89 (52%)
Frame = +3
Query: 287 PVDSWVIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWT 346
P W+IDSG + H ++L KV + +G +++ GIG + ++S +G
Sbjct: 96 PSKYWLIDSGCTHHMTHDRDLFKELNKSTISKVRMLNGAHIEVEGIGTVLVKSHSG-YKQ 272
Query: 347 LHNVRHVPGIKRNLISIGQLDDEGYHTTF 375
+ NV + P + ++L+S+ QL +GY F
Sbjct: 273 ISNVLYAPKLNQSLLSVPQLLTKGYKVLF 359
>AW773859
Length = 538
Score = 46.6 bits (109), Expect = 3e-05
Identities = 29/121 (23%), Positives = 55/121 (44%), Gaps = 12/121 (9%)
Frame = -3
Query: 388 VVARGKKRGSLYMVAEEDMIAVTEA------------INSSSIWHQRLGHMSEKGMKIMA 435
++ G+ LY + +D + T A + ++WH RLGH+S + K+++
Sbjct: 356 MIGSGELFEGLYFLTTQDTPSATIASSQVQPQPQPTFLPQEALWHFRLGHLSNR--KLLS 183
Query: 436 SKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGS 495
+ VC+ C + +K+ F + ++ S+ EL H D+WGP +S+
Sbjct: 182 LHSNFPFITIDQNSVCDICHYSRHKKLPFQLSTNRA-SKCYELFHFDIWGPFSTQSIHNQ 6
Query: 496 R 496
R
Sbjct: 5 R 3
>AL366725
Length = 485
Score = 39.3 bits (90), Expect = 0.005
Identities = 13/24 (54%), Positives = 19/24 (79%)
Frame = +2
Query: 232 RNDITCWNCDRKGHFTNQCKAPRK 255
RNDI C+NC+ +GH +QCK P++
Sbjct: 413 RNDIVCFNCNEEGHIGSQCKQPKR 484
>AW736233 GP|21105451|gb small nuclear ribonucleoprotein D1 {Danio rerio},
partial (14%)
Length = 510
Score = 38.9 bits (89), Expect = 0.007
Identities = 31/102 (30%), Positives = 42/102 (40%), Gaps = 1/102 (0%)
Frame = +1
Query: 200 NTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVR-NDITCWNCDRKGHFTNQCKAPRKKKN 258
N+ES+ G++ +N +GRGR + RGR + R N + C C R H +C
Sbjct: 187 NSESQEVGTEY-YNAGRGRGRGRGRGRGRGRSNSNRLQCQICARNNHDAARCCF------ 345
Query: 259 YQKR*DDDESANAATEEVADTLICSLDSPVDSWVIDSGASFH 300
R D S+ A S W DSGAS H
Sbjct: 346 ---RYDQASSSQAHHRAPPSEYAASSSYSEAPWYPDSGASHH 462
>BG586308 weakly similar to PIR|F84528|F8 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(7%)
Length = 686
Score = 38.5 bits (88), Expect = 0.009
Identities = 20/67 (29%), Positives = 35/67 (51%)
Frame = -2
Query: 544 SDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKM 603
+DNG + S +F++FC IR+ P P+ NG AE N+ + + ++ + L K
Sbjct: 661 TDNGSHFISNKFREFCERWRIRLNTASPRYPQSNGQAEASNKIIIDG---LKKRLDLKKG 491
Query: 604 FWPDAIN 610
W D ++
Sbjct: 490 CWADELD 470
>BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (54%)
Length = 364
Score = 37.4 bits (85), Expect = 0.020
Identities = 16/43 (37%), Positives = 22/43 (50%)
Frame = +1
Query: 209 QKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCK 251
Q SHN G G + GR R +D+ C+ C GHF +C+
Sbjct: 238 QLSHNSKSGGGGGRGGGRGG-RGGDDLKCYECGEPGHFARECR 363
>AW684891
Length = 539
Score = 37.0 bits (84), Expect = 0.027
Identities = 26/88 (29%), Positives = 43/88 (48%)
Frame = -3
Query: 494 GSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQ 553
G ++++ FI +ST + FLK K +++ F K I+ L S+ G Y Q
Sbjct: 348 GXKWFLAFIHNST-SIT*LFLK-KHEIWFQFVK-----------SIERLCSNKGKVYVDQ 208
Query: 554 EFKKFCSENGIRMIKTIPGTPEQNGVAE 581
++ ENG+ +P+QNGV+E
Sbjct: 207 NLSQYLEENGVSQELKCVNSPQQNGVSE 124
>TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (91%)
Length = 860
Score = 35.0 bits (79), Expect = 0.10
Identities = 24/83 (28%), Positives = 36/83 (42%)
Frame = +3
Query: 169 LKFDDIRDLILSEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQ 228
+ FDD RD ++D R G++ + N+ S G G GRGR G
Sbjct: 168 IDFDDRRD---AQDAIRDLDGKNGWRVELSHNSRSGGGGG----GGGGGRGRGGGGGGG- 323
Query: 229 TRVRNDITCWNCDRKGHFTNQCK 251
+D+ C+ C GHF +C+
Sbjct: 324 ----SDLKCYECGEPGHFARECR 380
>BQ122739
Length = 575
Score = 34.3 bits (77), Expect = 0.17
Identities = 22/81 (27%), Positives = 40/81 (49%), Gaps = 6/81 (7%)
Frame = +1
Query: 366 LDDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYMVAEEDMIAVTEAIN------SSSIW 419
LDD+GY G ++T ++V+ +GK L ++ + +AI SS
Sbjct: 223 LDDQGYIFNIEDGDLRITNDSMVLMKGKLENGLSLL*GRTSMDTADAIYIRCNLVSSRSS 402
Query: 420 HQRLGHMSEKGMKIMASKGKM 440
R+GH+S+ G+K + + G +
Sbjct: 403 AYRMGHVSKGGIKKLNTIGAL 465
>TC91742 similar to GP|18481720|gb|AAL73542.1 hypothetical protein {Sorghum
bicolor}, partial (32%)
Length = 669
Score = 33.1 bits (74), Expect = 0.38
Identities = 22/62 (35%), Positives = 30/62 (47%), Gaps = 4/62 (6%)
Frame = -3
Query: 247 TNQCK-APRKKKNYQKR*DDDESANAATEEVADTLICSLDSPVDSW---VIDSGASFHTI 302
+ CK +P +N R + AA+ V+ LIC SPV+SW V +G SF
Sbjct: 262 SRHCKVSPLNTRNLASRDLPKQLDQAASRSVSPNLICKRGSPVESWPFEVDQTGTSFGPG 83
Query: 303 PS 304
PS
Sbjct: 82 PS 77
>TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
{Oryza sativa (japonica cultivar-group)}, partial (16%)
Length = 421
Score = 32.0 bits (71), Expect = 0.86
Identities = 22/76 (28%), Positives = 34/76 (43%)
Frame = +3
Query: 175 RDLILSEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRND 234
++L +S D R + S + S + R + S+ + R R SRG S+ D
Sbjct: 180 KNLKMSSDSRSRSRSRSRSRSRSRSPRIRKIRSDRHSYRDAPYR-RDSSRGFSR-----D 341
Query: 235 ITCWNCDRKGHFTNQC 250
C NC R GH+ +C
Sbjct: 342 NLCKNCKRPGHYAREC 389
>TC78254 similar to GP|15795167|dbj|BAB03155. gene_id:MQC3.30~unknown
protein {Arabidopsis thaliana}, partial (51%)
Length = 1464
Score = 31.6 bits (70), Expect = 1.1
Identities = 20/62 (32%), Positives = 28/62 (44%)
Frame = +1
Query: 184 RRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRK 243
RRK +G+ N + RG+ + H GRGR + RGR + R R R+
Sbjct: 589 RRKAAGDDGN---DSDEEAKRGKMLELGHTSPTGRGRGRGRGRGRGRGRGRA----IQRE 747
Query: 244 GH 245
GH
Sbjct: 748 GH 753
>TC89711 weakly similar to PIR|T10725|T10725 protein kinase Xa21 (EC
2.7.1.-) A1 receptor type - long-staminate rice,
partial (7%)
Length = 1391
Score = 31.6 bits (70), Expect = 1.1
Identities = 13/25 (52%), Positives = 16/25 (64%)
Frame = -2
Query: 462 VSFSKAGRKSKSEKLELVHTDVWGP 486
+S K+ E LEL+HTDVWGP
Sbjct: 910 ISPFKSSTSHAQEVLELIHTDVWGP 836
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.317 0.134 0.396
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 21,525,730
Number of Sequences: 36976
Number of extensions: 290932
Number of successful extensions: 1472
Number of sequences better than 10.0: 48
Number of HSP's better than 10.0 without gapping: 1410
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1442
length of query: 720
length of database: 9,014,727
effective HSP length: 103
effective length of query: 617
effective length of database: 5,206,199
effective search space: 3212224783
effective search space used: 3212224783
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (28.5 bits)
Lotus: description of TM0220.7