
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0189.9
(214 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC211815 67 3e-22
TC224482 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part... 82 1e-16
NP595172 polyprotein [Glycine max] 68 4e-12
BE660092 weakly similar to GP|9884624|dbj retroelement pol polyp... 62 1e-10
TC234722 similar to UP|Q6WAY9 (Q6WAY9) Pol (Fragment), partial (... 56 1e-08
BI424213 55 3e-08
TC211627 50 1e-06
BE801213 weakly similar to GP|6691193|gb| F7F22.17 {Arabidopsis ... 35 3e-06
CO981347 43 9e-05
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 43 1e-04
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 43 1e-04
TC232772 weakly similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial ... 41 5e-04
BI425021 37 0.005
BQ299538 33 0.097
TC213008 weakly similar to UP|Q94AA3 (Q94AA3) AT3g09210/F3L24_8,... 30 1.1
TC213114 weakly similar to UP|Q8W150 (Q8W150) Polyprotein, parti... 28 2.4
TC205815 similar to UP|Q9SUY5 (Q9SUY5) Dihydrolipoamide S-acetyl... 28 3.1
BE804087 28 3.1
BQ272766 weakly similar to GP|28558781|gb| pol protein {Cucumis ... 28 4.1
>TC211815
Length = 704
Score = 67.0 bits (162), Expect(2) = 3e-22
Identities = 30/67 (44%), Positives = 45/67 (66%)
Frame = +3
Query: 3 DNGMQFASNQTKEFCEEMGIQRRFSSVEHPQTNGQADSANKVIQQGLKRPLSEAKGAWLD 62
DNG+QF + + EF + I+ R +SV+HPQTN +A++ANKVI LK+ L A G W++
Sbjct: 3 DNGLQFTNRKLNEFPSGLNIKHRVTSVKHPQTNRRAEAANKVILGDLKKLLDGANGRWVE 182
Query: 63 ELPIVFW 69
+L + W
Sbjct: 183DLVEILW 203
Score = 54.7 bits (130), Expect(2) = 3e-22
Identities = 39/131 (29%), Positives = 62/131 (46%), Gaps = 3/131 (2%)
Frame = +2
Query: 77 STTGKTPFRMTYGADVMLPVEIDNSSWRTTPKFEGENSSNMAVELDLLSETHNEARLKEA 136
STT +TPF + YG VMLP+E+ E N + ++LDL+ + + +
Sbjct: 233 STTHETPF*LIYGISVMLPIEVGEVFL*RHYFAEV*NKEALQIDLDLIKQVREDTVIMT* 412
Query: 137 AMKQRAAAKYDTKVKPREMQEG---DLVLKKRTGVTGNKLSPIWEGPYRILKALGRGAYH 193
A KQR +++K+ G + + + + +K + EGP++I GAY
Sbjct: 413 AFKQRMTRCFNSKLPSTV*GRGPSMEGIQRSLEVLVRSKFTTN*EGPFKIRHNSKNGAYK 592
Query: 194 LESLDGKRVPR 204
LE L GK V R
Sbjct: 593 LEELSGKVVLR 625
>TC224482 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (6%)
Length = 669
Score = 82.4 bits (202), Expect = 1e-16
Identities = 53/171 (30%), Positives = 89/171 (51%), Gaps = 6/171 (3%)
Frame = +1
Query: 39 DSANKVIQQGLKRPLSEAKGAWLDELPIVFWSYNTTQHSTTGKTPFRMTYGADVMLPVEI 98
++ANK I++ +++ K W + LP Y T+ ++TG TPF + YG + +LP E+
Sbjct: 1 EAANKNIKKIIQKMTVSYKD-WHEMLPFALHGYRTSVRTSTGATPFSLVYGMEAVLPFEV 177
Query: 99 DNSSWRTTPKF---EGENSSNMAVELDLLSETHNEARLKEAAMKQRAAAKYDTKVKPREM 155
+ S R + E E + +L+L+ A +QR + +D KV R+
Sbjct: 178 EVPSLRILAESGLKESEWAQTRYDQLNLIEGKRLTAMSHGRLYQQRMKSAFDKKVCLRKF 357
Query: 156 QEGDLVLKKRTGVTGN---KLSPIWEGPYRILKALGRGAYHLESLDGKRVP 203
EGDLVLKK + + K +P +EGP+ + +A GA L ++DG+ +P
Sbjct: 358 HEGDLVLKKMSHAVKDHRGKWAPNYEGPFVVKRAFSGGALVLTNMDGEELP 510
>NP595172 polyprotein [Glycine max]
Length = 4659
Score = 67.8 bits (164), Expect = 4e-12
Identities = 56/210 (26%), Positives = 93/210 (43%), Gaps = 8/210 (3%)
Frame = +1
Query: 1 VSDNGMQFASNQTKEFCEEMGIQRRFSSVEHPQTNGQADSANKVIQQGLKRPLSEAKGAW 60
VSD F S + + G SS HPQ++GQ++ NK ++ L+ E W
Sbjct: 3619 VSDRDRVFTSTFWQHLFKLQGTTLAMSSAYHPQSDGQSEVLNKCLEMYLRCFTYEHPKGW 3798
Query: 61 LDELPIVFWSYNTTQHSTTGKTPFRMTYGAD----VMLPVEIDNSSWRTTPKFEGENSSN 116
+ LP + YNT H + G TPFR YG + ID+ + + ++
Sbjct: 3799 VKALPWAEFWYNTAYHMSLGMTPFRALYGREPPTLTRQACSIDDPA-EVREQLTDRDALL 3975
Query: 117 MAVELDLLSETHNEARLKEAAMKQRAAAKY----DTKVKPREMQEGDLVLKKRTGVTGNK 172
++++L T + +K A K+R + + VK + ++ VL+K K
Sbjct: 3976 AKLKINL---TRAQQVMKRQADKKRLDVSFQIGDEVLVKLQPYRQHSAVLRK-----NQK 4131
Query: 173 LSPIWEGPYRILKALGRGAYHLESLDGKRV 202
LS + GP+++L +G AY LE R+
Sbjct: 4132 LSMRYFGPFKVLAKIGDVAYKLELPSAARI 4221
>BE660092 weakly similar to GP|9884624|dbj retroelement pol polyprotein-like
{Arabidopsis thaliana}, partial (13%)
Length = 378
Score = 62.4 bits (150), Expect = 1e-10
Identities = 27/86 (31%), Positives = 52/86 (60%)
Frame = -3
Query: 18 EEMGIQRRFSSVEHPQTNGQADSANKVIQQGLKRPLSEAKGAWLDELPIVFWSYNTTQHS 77
++ G+ R S+ HPQTNGQA+ +N+ I++ L++ + ++ W L W++ T +
Sbjct: 361 KKYGVVHRVSTPYHPQTNGQAEISNREIKRILEKIVQPSRKDWSTRLDDALWAHRTAYKA 182
Query: 78 TTGKTPFRMTYGADVMLPVEIDNSSW 103
G +P+R+ +G LPVEI++ ++
Sbjct: 181 PIGMSPYRVVFGKACHLPVEIEHKAY 104
>TC234722 similar to UP|Q6WAY9 (Q6WAY9) Pol (Fragment), partial (32%)
Length = 482
Score = 55.8 bits (133), Expect = 1e-08
Identities = 25/73 (34%), Positives = 44/73 (60%)
Frame = -3
Query: 31 HPQTNGQADSANKVIQQGLKRPLSEAKGAWLDELPIVFWSYNTTQHSTTGKTPFRMTYGA 90
HPQTNGQA+ +NK I++ L+ + ++ W +L FW+Y + G +PF++ YG
Sbjct: 231 HPQTNGQAEVSNKEIKRVLENIVVSSRKDWALKLDDAFWAYRIAFKTPIGLSPFQLVYGK 52
Query: 91 DVMLPVEIDNSSW 103
L VE+++ ++
Sbjct: 51 ACHLSVELEHKAY 13
>BI424213
Length = 426
Score = 54.7 bits (130), Expect = 3e-08
Identities = 27/85 (31%), Positives = 47/85 (54%)
Frame = +1
Query: 14 KEFCEEMGIQRRFSSVEHPQTNGQADSANKVIQQGLKRPLSEAKGAWLDELPIVFWSYNT 73
K ++G + FS+ HPQT+GQ N+ + L+ L +W + LP V ++YN
Sbjct: 16 KTLWAKLGTKLLFSTTCHPQTDGQTKVVNRSLSTLLRALLKGNHKSWDEYLPHVEFAYNR 195
Query: 74 TQHSTTGKTPFRMTYGADVMLPVEI 98
H TT ++PF + YG + + P+++
Sbjct: 196 GVHRTTKQSPFEVVYGFNPLTPLDL 270
>TC211627
Length = 1034
Score = 49.7 bits (117), Expect = 1e-06
Identities = 26/77 (33%), Positives = 40/77 (51%)
Frame = +2
Query: 8 FASNQTKEFCEEMGIQRRFSSVEHPQTNGQADSANKVIQQGLKRPLSEAKGAWLDELPIV 67
F S E G + RFS+ HPQT+GQ + N++++Q L+ + + W L +
Sbjct: 5 FISGLWHELFHISGTKLRFSTAYHPQTDGQTEVINRILEQYLRAFVHDHPQHWFKFLSLA 184
Query: 68 FWSYNTTQHSTTGKTPF 84
YNT+ HS G +PF
Sbjct: 185E*CYNTSVHSGIGFSPF 235
>BE801213 weakly similar to GP|6691193|gb| F7F22.17 {Arabidopsis thaliana},
partial (3%)
Length = 416
Score = 34.7 bits (78), Expect(2) = 3e-06
Identities = 14/54 (25%), Positives = 29/54 (52%)
Frame = +2
Query: 50 KRPLSEAKGAWLDELPIVFWSYNTTQHSTTGKTPFRMTYGADVMLPVEIDNSSW 103
++ ++ ++ W +L W+ T + + G TPF+M Y LPVE+ + ++
Sbjct: 224 EKNVASSRKDWSSKLEDALWACKTAKKTPIGLTPFQMVYRKACHLPVELKHKAY 385
Score = 33.1 bits (74), Expect(2) = 3e-06
Identities = 14/42 (33%), Positives = 23/42 (54%)
Frame = +3
Query: 1 VSDNGMQFASNQTKEFCEEMGIQRRFSSVEHPQTNGQADSAN 42
+SD G F +Q + + ++ + + HPQTNGQA +N
Sbjct: 75 ISDGGSHFYYSQLNKVLKHDSVRHKVETSYHPQTNGQAKVSN 200
>CO981347
Length = 624
Score = 43.1 bits (100), Expect = 9e-05
Identities = 33/98 (33%), Positives = 46/98 (46%), Gaps = 10/98 (10%)
Frame = +2
Query: 2 SDNGMQFASNQTKEFCEEMGIQRRFSSVEHPQTNGQADSANKVIQQGLKRPLSEAKGAWL 61
+DNG++F Q EFC ++GI+R P NG A+ N I + ++ L A+
Sbjct: 176 TDNGLEFVLEQFNEFCRKIGIKRHKIVPHTP*QNGLAERMNMTILERVRCMLLSAR---- 343
Query: 62 DELPIVFW--SYNTTQH-------STTG-KTPFRMTYG 89
LP FW + NTT + ST G KTP G
Sbjct: 344 --LPKTFWGEAANTTSYLINRCPSSTLGFKTPMEAWSG 451
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 42.7 bits (99), Expect = 1e-04
Identities = 24/69 (34%), Positives = 36/69 (51%)
Frame = +1
Query: 2 SDNGMQFASNQTKEFCEEMGIQRRFSSVEHPQTNGQADSANKVIQQGLKRPLSEAKGAWL 61
SD+G +F +++ EFC GI FS+ PQ NG + N+ +Q+ R + AK
Sbjct: 2461 SDHGREFENSKFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEA-ARVMLHAK---- 2625
Query: 62 DELPIVFWS 70
ELP W+
Sbjct: 2626 -ELPYNLWA 2649
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 42.7 bits (99), Expect = 1e-04
Identities = 24/69 (34%), Positives = 36/69 (51%)
Frame = +1
Query: 2 SDNGMQFASNQTKEFCEEMGIQRRFSSVEHPQTNGQADSANKVIQQGLKRPLSEAKGAWL 61
SD+G +F +++ EFC GI FS+ PQ NG + N+ +Q+ R + AK
Sbjct: 2458 SDHGREFENSRFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEA-ARVMLHAK---- 2622
Query: 62 DELPIVFWS 70
ELP W+
Sbjct: 2623 -ELPYNLWA 2646
>TC232772 weakly similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial (7%)
Length = 729
Score = 40.8 bits (94), Expect = 5e-04
Identities = 29/137 (21%), Positives = 62/137 (45%), Gaps = 5/137 (3%)
Frame = +1
Query: 64 LPIVFWSYNTTQHSTTGKTPFRMTYGADVMLPVEIDNSSWRTTPKFEGENSSNMAVELDL 123
LP V ++YN HSTT PF + Y + + P++ S G + +++
Sbjct: 7 LPHVEFAYNRAVHSTTQHFPFEVVYDFNPLTPLDSLPLS-----NISGFKHKDAHAKVEY 171
Query: 124 LSETHNEARLK-----EAAMKQRAAAKYDTKVKPREMQEGDLVLKKRTGVTGNKLSPIWE 178
+ H +A+ + E+ +KQ + ++P + + ++ +KL P +
Sbjct: 172 IKRLHEQAKTQIAKKNESYVKQTNKNRKKVVLEPSDWVWVHMRKERFPKQRMSKLQPRGD 351
Query: 179 GPYRILKALGRGAYHLE 195
GP+++L+ + AY ++
Sbjct: 352 GPFQVLERINYNAYKID 402
>BI425021
Length = 426
Score = 37.4 bits (85), Expect = 0.005
Identities = 20/50 (40%), Positives = 28/50 (56%)
Frame = -1
Query: 1 VSDNGMQFASNQTKEFCEEMGIQRRFSSVEHPQTNGQADSANKVIQQGLK 50
VSD F S+ ++ G R SS HPQT+GQ + N+VI+Q L+
Sbjct: 204 VSDRDPLFISHFWQDLFRLSGTVLRMSSAYHPQTDGQTEVLNRVIEQYLR 55
>BQ299538
Length = 426
Score = 33.1 bits (74), Expect = 0.097
Identities = 20/69 (28%), Positives = 35/69 (49%)
Frame = +3
Query: 21 GIQRRFSSVEHPQTNGQADSANKVIQQGLKRPLSEAKGAWLDELPIVFWSYNTTQHSTTG 80
G + S+ HP +GQ + N ++ L+ +++ + L + YNT H++TG
Sbjct: 144 GTYLKMSTSYHP*IDGQ--TVNHCLETFLRCFVADQPKM*VQWLSWAEYWYNTNFHASTG 317
Query: 81 KTPFRMTYG 89
TPF + YG
Sbjct: 318 TTPFEVVYG 344
>TC213008 weakly similar to UP|Q94AA3 (Q94AA3) AT3g09210/F3L24_8, partial
(17%)
Length = 422
Score = 29.6 bits (65), Expect = 1.1
Identities = 13/42 (30%), Positives = 24/42 (56%)
Frame = +1
Query: 23 QRRFSSVEHPQTNGQADSANKVIQQGLKRPLSEAKGAWLDEL 64
+RR + E +T A + +V+++ L + KG+W+DEL
Sbjct: 196 ERRRARSERRETRSGAKNWREVVEERLMEKPKKQKGSWMDEL 321
>TC213114 weakly similar to UP|Q8W150 (Q8W150) Polyprotein, partial (7%)
Length = 810
Score = 28.5 bits (62), Expect = 2.4
Identities = 14/41 (34%), Positives = 22/41 (53%)
Frame = +3
Query: 162 LKKRTGVTGNKLSPIWEGPYRILKALGRGAYHLESLDGKRV 202
LK KLSP + GPY+I K +G A+ L+ +++
Sbjct: 87 LKSLAKKRNEKLSPRFYGPYQIKKQIGLVAFELDLPPARKI 209
>TC205815 similar to UP|Q9SUY5 (Q9SUY5) Dihydrolipoamide S-acetyltransferase,
partial (60%)
Length = 1893
Score = 28.1 bits (61), Expect = 3.1
Identities = 21/69 (30%), Positives = 30/69 (43%)
Frame = +2
Query: 131 ARLKEAAMKQRAAAKYDTKVKPREMQEGDLVLKKRTGVTGNKLSPIWEGPYRILKALGRG 190
+ +KE A K RA K+KP E Q G + +K I P + A+GRG
Sbjct: 1001 SEVKELAAKARAG-----KLKPHEFQGGTFSISNLGMFPVDKFCAIINPPQACILAVGRG 1165
Query: 191 AYHLESLDG 199
+E + G
Sbjct: 1166 NKVVEPVIG 1192
>BE804087
Length = 160
Score = 28.1 bits (61), Expect = 3.1
Identities = 14/53 (26%), Positives = 26/53 (48%)
Frame = +2
Query: 39 DSANKVIQQGLKRPLSEAKGAWLDELPIVFWSYNTTQHSTTGKTPFRMTYGAD 91
++ANK I++ + + K W + Y T ++TG TP+ + YG +
Sbjct: 2 EAANKNIKKNI*KMTVSYKD-WHEMFSFALHMYRTLVRTSTGATPYSLVYGKE 157
>BQ272766 weakly similar to GP|28558781|gb| pol protein {Cucumis melo},
partial (9%)
Length = 410
Score = 27.7 bits (60), Expect = 4.1
Identities = 19/56 (33%), Positives = 29/56 (50%), Gaps = 7/56 (12%)
Frame = -3
Query: 146 YDTKVKPREMQEGDLVLKKRTGVTGN-------KLSPIWEGPYRILKALGRGAYHL 194
+D + K E + GD V + T TG KL+P + GP++ILK + AY +
Sbjct: 402 HDKRRKDLEFEVGDHVFLRVTP*TGVGRALKS*KLTPHFIGPFQILKKVDFVAYQI 235
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.317 0.131 0.386
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,104,248
Number of Sequences: 63676
Number of extensions: 91058
Number of successful extensions: 438
Number of sequences better than 10.0: 42
Number of HSP's better than 10.0 without gapping: 431
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 433
length of query: 214
length of database: 12,639,632
effective HSP length: 93
effective length of query: 121
effective length of database: 6,717,764
effective search space: 812849444
effective search space used: 812849444
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 57 (26.6 bits)
Lotus: description of TM0189.9