
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0112.2
(150 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
NP595172 polyprotein [Glycine max] 107 2e-24
TC211627 92 1e-19
TC213413 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, parti... 90 4e-19
BM731326 weakly similar to GP|21740635|em OSJNBb0043H09.2 {Oryza... 87 3e-18
TC211973 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, parti... 62 1e-16
TC212592 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, parti... 80 4e-16
TC213114 weakly similar to UP|Q8W150 (Q8W150) Polyprotein, parti... 74 3e-14
BE807407 42 1e-07
BU549069 44 2e-05
TC204929 similar to GB|AAD39665.1|5103835|F9L1 ESTs gb|T22508, g... 40 3e-04
BQ272766 weakly similar to GP|28558781|gb| pol protein {Cucumis ... 36 0.008
BE211654 33 0.039
AW348690 30 0.33
BU578017 30 0.43
TC218053 weakly similar to UP|Q6I621 (Q6I621) Protein phosphatas... 30 0.56
BU547665 29 0.96
CA784736 28 1.6
TC211469 28 2.1
TC205313 similar to UP|TL15_ARATH (O22160) Thylakoid lumenal 15 ... 27 4.8
BG790334 weakly similar to GP|2160154|gb| F19K23.18 gene product... 27 4.8
>NP595172 polyprotein [Glycine max]
Length = 4659
Score = 107 bits (268), Expect = 2e-24
Identities = 53/144 (36%), Positives = 80/144 (54%)
Frame = +1
Query: 7 VFIKLRALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 66
V +KL+ R++S V R +L+ Y+GP+ ++ +IG VAY+L+LP R+HPVFH S LK
Sbjct: 4072 VLVKLQPYRQHSAVLRKNQKLSMRYFGPFKVLAKIGDVAYKLELPSAARIHPVFHVSQLK 4251
Query: 67 EAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWK 126
G L E + P ++ SR R + + Q+ +QW+ DE TW+
Sbjct: 4252 PFNGTAQDPYLPLPLTVTEMGPVMQPVKILASRIIIRGHNQIEQILVQWENGLQDEATWE 4431
Query: 127 DTLNIRSQFPVFNLEDKVDLSAGG 150
D +I++ +P FNLEDKV G
Sbjct: 4432 DIEDIKASYPTFNLEDKVVFKGEG 4503
>TC211627
Length = 1034
Score = 91.7 bits (226), Expect = 1e-19
Identities = 49/146 (33%), Positives = 76/146 (51%)
Frame = +3
Query: 5 EWVFIKLRALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASL 64
+WV+++L R+ S+ + +L+ +YGPY I R+G VAYRLQLP ++HP+FH SL
Sbjct: 432 DWVYVRL*PYRQTSIQST-YTKLSKRFYGPYQIQARVGQVAYRLQLPPTSKIHPIFHVSL 608
Query: 65 LKEAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPT 124
LK G EL L + V P + + +PQV +QW ++ T
Sbjct: 609 LKVHHGPIPPELLALPPFSTTNHPLVQPLQFLDWKMDESTTPPIPQVLVQWTNLAPEDTT 788
Query: 125 WKDTLNIRSQFPVFNLEDKVDLSAGG 150
W+ ++ +++LEDKV GG
Sbjct: 789 WESWTQLKD---IYDLEDKVCFQTGG 857
>TC213413 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, partial (5%)
Length = 761
Score = 90.1 bits (222), Expect = 4e-19
Identities = 51/141 (36%), Positives = 74/141 (52%), Gaps = 1/141 (0%)
Frame = +1
Query: 5 EWVFIKLRALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASL 64
+WV +KLR R+ S +LT YYGP+ + +R+G V YRL+L R+HPVFH SL
Sbjct: 10 DWVLVKLRPHRQGSASETTYSKLTKRYYGPFEVQERLGKVVYRLKLTAHSRIHPVFHVSL 189
Query: 65 LKEAVGN-NSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEP 123
LK VG+ + L + EE + P +VI S+ V +QW +
Sbjct: 190 LKAFVGDPETTHAGPLPVMRTEEATNT-PLTVIDSKLVPADNGPRRMVLVQWPSASRQDA 366
Query: 124 TWKDTLNIRSQFPVFNLEDKV 144
+W+D +R + +NLEDKV
Sbjct: 367 SWEDWQVLRER---YNLEDKV 420
>BM731326 weakly similar to GP|21740635|em OSJNBb0043H09.2 {Oryza sativa
(japonica cultivar-group)}, partial (3%)
Length = 424
Score = 87.0 bits (214), Expect = 3e-18
Identities = 45/111 (40%), Positives = 71/111 (63%), Gaps = 1/111 (0%)
Frame = +1
Query: 5 EWVFIKLRALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASL 64
+WV++K+R R+ S+ R P+LTA +YGPY +++++GAVA++LQLP R+HPVFH S
Sbjct: 97 DWVYLKIRPHRQGSMPPRLHPKLTARFYGPYLVMRQVGAVAFQLQLPSEARIHPVFHVSQ 276
Query: 65 LKEAVGNNSVELQLLDHLTGEEVASVH-PFSVITSRFTTRQESTLPQVWIQ 114
LK A+GN+ + +L L E A ++ P ++ R +Q QV I+
Sbjct: 277 LKRALGNHQAQEELPPDL--EHQAELYFPVQILKIREVQKQHEVERQVLIR 423
>TC211973 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, partial (4%)
Length = 730
Score = 61.6 bits (148), Expect(2) = 1e-16
Identities = 26/65 (40%), Positives = 42/65 (64%)
Frame = +1
Query: 7 VFIKLRALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 66
VF+K++ R S+ R +L+ +Y P+ + ++G +AY+L LP ++HPVFH SLLK
Sbjct: 154 VFLKMQPYRRRSLAKRINEKLSPRFYAPFQVFNKVGTIAYKLDLPSHIKIHPVFHVSLLK 333
Query: 67 EAVGN 71
+A N
Sbjct: 334 KASPN 348
Score = 40.4 bits (93), Expect(2) = 1e-16
Identities = 16/41 (39%), Positives = 25/41 (60%)
Frame = +3
Query: 110 QVWIQWQGKPADEPTWKDTLNIRSQFPVFNLEDKVDLSAGG 150
+V IQW+ P E +W+ ++ F +++LEDKV L GG
Sbjct: 438 KVLIQWKNLPPSENSWESVAKLQEIFSIYHLEDKVSLLGGG 560
>TC212592 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, partial (4%)
Length = 664
Score = 80.1 bits (196), Expect = 4e-16
Identities = 46/138 (33%), Positives = 75/138 (54%), Gaps = 2/138 (1%)
Frame = +2
Query: 9 IKLRALRENSVVTRD--CPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 66
+KLR R++S + +L+ ++GP+ +++R+G VAYRLQLP ++HPVFH SLLK
Sbjct: 17 LKLRPHRQSSAKGPEPITGKLSKRFFGPFQVVERVGKVAYRLQLPVDAKLHPVFHCSLLK 196
Query: 67 EAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWK 126
GN L + + + P ++ +R + +V +QWQG D+ TW+
Sbjct: 197 PFQGNPPDTAAPLPPTLFDHQSVIAPLVILATRTVNDDDI---EVLVQWQGLSPDDATWE 367
Query: 127 DTLNIRSQFPVFNLEDKV 144
+ + F+LEDKV
Sbjct: 368 KWTELCKE---FHLEDKV 412
>TC213114 weakly similar to UP|Q8W150 (Q8W150) Polyprotein, partial (7%)
Length = 810
Score = 73.9 bits (180), Expect = 3e-14
Identities = 49/143 (34%), Positives = 77/143 (53%), Gaps = 4/143 (2%)
Frame = +3
Query: 5 EWVFIKLRALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASL 64
+WV++KL+ R S+ + +L+ +YGPY I ++IG VA+ L LP ++HPVFHASL
Sbjct: 54 DWVYLKLQPYRLKSLAKKRNEKLSPRFYGPYQIKKQIGLVAFELDLPPARKIHPVFHASL 233
Query: 65 LKEAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQ----GKPA 120
LK+AV + Q L + E+++S F + S+ + L + W+ KP
Sbjct: 234 LKKAVA-ATANPQPLPLMLSEDLSS--EFFQLKSKLSITILMVLLKFSFNWKIFRILKPL 404
Query: 121 DEPTWKDTLNIRSQFPVFNLEDK 143
P WK + R+ FP F L +
Sbjct: 405 GNP-WKSS---RNSFPHFTLRTR 461
Score = 38.5 bits (88), Expect = 0.001
Identities = 19/42 (45%), Positives = 25/42 (59%)
Frame = +2
Query: 108 LPQVWIQWQGKPADEPTWKDTLNIRSQFPVFNLEDKVDLSAG 149
+ +V IQ + P E TW+ I+ QFP F+LEDKV L G
Sbjct: 353 IAEVLIQLEDLPDFEATWESVEVIKEQFPSFHLEDKVTLLGG 478
>BE807407
Length = 341
Score = 41.6 bits (96), Expect(2) = 1e-07
Identities = 16/28 (57%), Positives = 24/28 (85%)
Frame = +2
Query: 25 PQLTAPYYGPYPIIQRIGAVAYRLQLPE 52
P+LTA YYGPY ++ ++G VA++LQLP+
Sbjct: 155 PKLTARYYGPYLVLIKVGVVAFQLQLPK 238
Score = 29.6 bits (65), Expect(2) = 1e-07
Identities = 14/26 (53%), Positives = 18/26 (68%)
Frame = +1
Query: 57 HPVFHASLLKEAVGNNSVELQLLDHL 82
+PVFHA LK AVG++ VE +L L
Sbjct: 250 YPVFHALQLKRAVGHHLVEAELHQEL 327
>BU549069
Length = 615
Score = 44.3 bits (103), Expect = 2e-05
Identities = 30/113 (26%), Positives = 52/113 (45%), Gaps = 2/113 (1%)
Frame = -1
Query: 26 QLTAPYYGPYPIIQRIGAVAYRLQLPEG-GRVHPVFHASLLKEAVGNNSVELQLLDHLTG 84
+LT + G + I++R VAY++ LP +H VFH S L+ + + S ++L D
Sbjct: 567 KLTPHFIGHFQILKRAXPVAYQIALPPSLSNLHNVFHVSQLRMYIHDPSHVVKLDDVQVK 388
Query: 85 EEVA-SVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQFP 136
E + P + R + P V + W G ++ TW+ +R +P
Sbjct: 387 ENLTYETLPLRIEDRRTKHLRRKENPLVKVIWGGTSGEDATWELESQMRVAYP 229
>TC204929 similar to GB|AAD39665.1|5103835|F9L1 ESTs gb|T22508, gb|H36196 and
gb|AI100134 come from this gene. {Arabidopsis thaliana;}
, partial (53%)
Length = 823
Score = 40.4 bits (93), Expect = 3e-04
Identities = 19/61 (31%), Positives = 35/61 (57%)
Frame = -2
Query: 90 VHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQFPVFNLEDKVDLSAG 149
VHPF+++ + + + + +V +QW+G P +E +W+ ++R + NL DKV G
Sbjct: 636 VHPFAILDWKLDSSGMTPVKRVLVQWEGLPPEEASWELWDDLRD---LHNLADKVVFDEG 466
Query: 150 G 150
G
Sbjct: 465 G 463
>BQ272766 weakly similar to GP|28558781|gb| pol protein {Cucumis melo},
partial (9%)
Length = 410
Score = 35.8 bits (81), Expect = 0.008
Identities = 26/101 (25%), Positives = 47/101 (45%), Gaps = 2/101 (1%)
Frame = -3
Query: 26 QLTAPYYGPYPIIQRIGAVAYRLQLPEG-GRVHPVFHASLLKEAVGNNSVELQLLDHLTG 84
+LT + GP+ I++++ VAY++ LP +H VFH S + + + + S + L
Sbjct: 303 KLTPHFIGPFQILKKVDFVAYQIALPPSLTSLHNVFHVSQIHKYINDPSHMVNLDVVQVK 124
Query: 85 EEVA-SVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPT 124
E +A P + R + + V + W G +E T
Sbjct: 123 ENLAHETLPLRIKDMRTKHLRGKEILLVKVIWGGASGEEAT 1
>BE211654
Length = 454
Score = 33.5 bits (75), Expect = 0.039
Identities = 15/40 (37%), Positives = 24/40 (59%)
Frame = -3
Query: 111 VWIQWQGKPADEPTWKDTLNIRSQFPVFNLEDKVDLSAGG 150
V I+W+G P E +W+ +++ FP +LE+KV L G
Sbjct: 452 VLIKWRGLPHHEDSWELADEMQAVFPQLDLENKVHLEGRG 333
>AW348690
Length = 644
Score = 30.4 bits (67), Expect = 0.33
Identities = 25/70 (35%), Positives = 32/70 (45%), Gaps = 4/70 (5%)
Frame = +3
Query: 81 HLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRS----QFP 136
H T + V P SV+ + +E +V I W D P KD L + S QFP
Sbjct: 330 HSTDDMELQVSPESVLIAHSNASREV---EVLIHWH----DIPISKDFLYLFSLMQQQFP 488
Query: 137 VFNLEDKVDL 146
F+ EDKV L
Sbjct: 489 SFHFEDKVSL 518
>BU578017
Length = 425
Score = 30.0 bits (66), Expect = 0.43
Identities = 20/70 (28%), Positives = 33/70 (46%), Gaps = 2/70 (2%)
Frame = +2
Query: 55 RVHPVFHASLLKEAVGNNSVE--LQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVW 112
R+H VFH SLLK G +L D ++ + P +++ R ++ + +V
Sbjct: 8 RIHLVFHCSLLKPFHGTPDSHNFAELPDKFINDQ-PFLTPLAILDYRRSSSAANAPWEVL 184
Query: 113 IQWQGKPADE 122
+QW G DE
Sbjct: 185 VQWHGLSPDE 214
>TC218053 weakly similar to UP|Q6I621 (Q6I621) Protein phosphatase 2A B'kappa
subunit, partial (74%)
Length = 2136
Score = 29.6 bits (65), Expect = 0.56
Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 5/51 (9%)
Frame = -3
Query: 98 SRFTTRQESTLPQ-----VWIQWQGKPADEPTWKDTLNIRSQFPVFNLEDK 143
+RF Q + PQ +++QWQGKPA++ T K+ +R LE+K
Sbjct: 1291 NRFRFVQRNQSPQQKYLVLFLQWQGKPAND-TPKNLQQLRDSIMSLGLENK 1142
>BU547665
Length = 653
Score = 28.9 bits (63), Expect = 0.96
Identities = 11/19 (57%), Positives = 13/19 (67%)
Frame = -3
Query: 132 RSQFPVFNLEDKVDLSAGG 150
++ FP FNL DK D SA G
Sbjct: 369 KNSFPTFNLRDKADFSASG 313
>CA784736
Length = 438
Score = 28.1 bits (61), Expect = 1.6
Identities = 12/25 (48%), Positives = 17/25 (68%)
Frame = +2
Query: 122 EPTWKDTLNIRSQFPVFNLEDKVDL 146
E TW+D +I S +P NLE KV++
Sbjct: 50 EATWEDEDSIGSTYPDLNLEHKVEV 124
>TC211469
Length = 480
Score = 27.7 bits (60), Expect = 2.1
Identities = 14/61 (22%), Positives = 29/61 (46%)
Frame = +2
Query: 90 VHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQFPVFNLEDKVDLSAG 149
V P +++ + + V +QW G P ++ +W+ +++ +NLED+V
Sbjct: 71 VTPLTILDWKLDSSVTPPRRLVLVQWMGLPPEDTSWELWDDLQQS---YNLEDEVVFGGA 241
Query: 150 G 150
G
Sbjct: 242 G 244
>TC205313 similar to UP|TL15_ARATH (O22160) Thylakoid lumenal 15 kDa protein,
chloroplast precursor (p15), partial (76%)
Length = 945
Score = 26.6 bits (57), Expect = 4.8
Identities = 15/48 (31%), Positives = 21/48 (43%)
Frame = -1
Query: 11 LRALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHP 58
+R L S R CP +T+ YGP P+ G+ + RV P
Sbjct: 342 IRVLPLKSFPVRSCPLVTSCPYGPPPLNASAGSTKMKEAERRATRVAP 199
>BG790334 weakly similar to GP|2160154|gb| F19K23.18 gene product
{Arabidopsis thaliana}, partial (19%)
Length = 390
Score = 26.6 bits (57), Expect = 4.8
Identities = 9/20 (45%), Positives = 11/20 (55%)
Frame = +3
Query: 106 STLPQVWIQWQGKPADEPTW 125
STLP +W W GK + W
Sbjct: 144 STLPPLWTSWDGKGSFRKLW 203
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.320 0.138 0.426
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,999,242
Number of Sequences: 63676
Number of extensions: 89037
Number of successful extensions: 415
Number of sequences better than 10.0: 46
Number of HSP's better than 10.0 without gapping: 412
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 412
length of query: 150
length of database: 12,639,632
effective HSP length: 89
effective length of query: 61
effective length of database: 6,972,468
effective search space: 425320548
effective search space used: 425320548
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 54 (25.4 bits)
Lotus: description of TM0112.2