
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0229.10
(277 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BE211208 80 8e-16
CO982036 80 1e-15
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 65 3e-11
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 64 1e-10
BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {... 39 1e-07
BF068614 similar to GP|27817858|db OJ1081_B12.7 {Oryza sativa (j... 49 2e-06
BI321712 49 2e-06
TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (F... 46 2e-05
BU548243 45 3e-05
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 40 0.001
BU764568 40 0.002
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 37 0.010
BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberos... 37 0.013
TC232995 36 0.017
AI855899 similar to GP|2244960|emb| retrotransposon like protein... 36 0.017
BI972503 weakly similar to GP|18378607|gb polyprotein {Oryza sat... 33 0.11
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 33 0.11
BM731813 weakly similar to PIR|T02323|T02 nodulin-like protein [... 33 0.14
AW759031 weakly similar to GP|22093573|db polyprotein {Oryza sat... 33 0.18
BM143109 32 0.31
>BE211208
Length = 413
Score = 80.5 bits (197), Expect = 8e-16
Identities = 49/121 (40%), Positives = 69/121 (56%), Gaps = 13/121 (10%)
Frame = +2
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
DIIITG S+ L+ + + LN F LKQLG L++FLGIEV G+++LTQ +YI DL
Sbjct: 50 DIIITGRSNYLIQSLVHHLNSNFSLKQLGQLDYFLGIEVHHTPTGSVLLTQSKYICDLLH 229
Query: 61 KVNISYGEKLQIPHNRGC-------------SFYGSIAGSLQ*LTITRPEQSYSVKKVCP 107
K +++ + + P + Y S+ G+LQ TITRPE S++ KVC
Sbjct: 230 KTDMAEAKPISSPMVTNLRLSKNGDDLLSDPTMYRSVVGALQYPTITRPEISFAANKVCQ 409
Query: 108 F 108
F
Sbjct: 410 F 412
>CO982036
Length = 674
Score = 79.7 bits (195), Expect = 1e-15
Identities = 53/133 (39%), Positives = 73/133 (54%), Gaps = 13/133 (9%)
Frame = -2
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
DIIITGSS L+ N SKLN F LK LG L++F+ IEVKS+ D L+ + I ++
Sbjct: 631 DIIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPD--LLFSLRTSIFEIFC 458
Query: 61 KVNISYGEKLQIPHNRGC-------------SFYGSIAGSLQ*LTITRPEQSYSVKKVCP 107
+ + + P C +FY S+ G+LQ T+ RPE S++V KVC
Sbjct: 457 RKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVIRPEISFAVNKVCQ 278
Query: 108 FMSSLHDSHRTAI 120
FMS+ DSH T +
Sbjct: 277 FMSNPLDSHWTEV 239
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 65.1 bits (157), Expect = 3e-11
Identities = 75/309 (24%), Positives = 132/309 (42%), Gaps = 32/309 (10%)
Frame = +1
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
DI+ G S+ ++ +F+ ++ F + +G L +FLG++VK + D ++ L+Q RY +++
Sbjct: 3790 DIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMED-SIFLSQSRYAKNIVK 3966
Query: 61 KVNISYGEKLQIP-------------HNRGCSFYGSIAGSLQ*LTITRPEQSYSVKKVCP 107
K + + P + S Y S+ GSL LT +RP+ +Y+V
Sbjct: 3967 KFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCAR 4146
Query: 108 FMSSLHDSHRTAINASSTNSKAMSLLASSQACFSRLSFI----SSWV**C*LGHGS*RSS 163
+ ++ SH T + S S + + W G R S
Sbjct: 4147 YQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWA-----GSADDRKS 4311
Query: 164 FYLRCVYLLRPILIFWRS*KQTVVSGSSSEAEYRSLGLSSC-----------RYVKDPDS 212
C YL LI W S KQ VS S++EAEY + G SSC Y + D
Sbjct: 4312 TSGGCFYLGNN-LISWFSKKQNCVSLSTAEAEYIAAG-SSCSQLVWMKQMLKEYNVEQDV 4485
Query: 213 ITW----LKVVNLNTAPSIF*QVYHHFT*L*HYPTC*YYMELDILFFRKIVINRQLTAQH 268
+T + +N++ P V H T ++++ + R +V ++ +T +H
Sbjct: 4486 MTLYCDNMSAINISKNP-----VQHSRT---------KHIDIRHHYIRDLVDDKVITLKH 4623
Query: 269 IPADLQLAN 277
+ + Q+A+
Sbjct: 4624 VDTEEQIAD 4650
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 63.5 bits (153), Expect = 1e-10
Identities = 77/311 (24%), Positives = 136/311 (42%), Gaps = 34/311 (10%)
Frame = +1
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
DI+ G S+ ++ +F+ ++ F + +G L +FLG++VK + D ++ L+Q +Y +++
Sbjct: 3793 DIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMED-SIFLSQSKYAKNIVK 3969
Query: 61 KVNISYGEKLQIP-------------HNRGCSFYGSIAGSLQ*LTITRPEQSYSVKKVC- 106
K + + P + S Y S+ GSL LT +RP+ +Y+V VC
Sbjct: 3970 KFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAV-GVCA 4146
Query: 107 -----PFMSSLHDSHRTAINASSTNSKAMSLLASSQACFSRLSFISSWV**C*LGHGS*R 161
P +S L+ R + T+ + S + + W G R
Sbjct: 4147 RYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYC-DADWA-----GSADDR 4308
Query: 162 SSFYLRCVYLLRPILIFWRS*KQTVVSGSSSEAEYRSLGLSSC-----------RYVKDP 210
S C Y L LI W S KQ VS S++EAEY + G SSC Y +
Sbjct: 4309 KSTSGGCFY-LGTNLISWFSKKQNCVSLSTAEAEYIAAG-SSCSQLVWMKQMLKEYNVEQ 4482
Query: 211 DSITW----LKVVNLNTAPSIF*QVYHHFT*L*HYPTC*YYMELDILFFRKIVINRQLTA 266
D +T + +N++ P V H T ++++ + R +V ++ +T
Sbjct: 4483 DVMTLYCDNMSAINISKNP-----VQHSRT---------KHIDIRHHYIRDLVDDKVITL 4620
Query: 267 QHIPADLQLAN 277
+H+ + Q+A+
Sbjct: 4621 EHVDTEEQIAD 4653
>BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {Vitis
vinifera}, partial (19%)
Length = 437
Score = 38.9 bits (89), Expect(2) = 1e-07
Identities = 22/58 (37%), Positives = 32/58 (54%)
Frame = +2
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL 58
DI+ITG+ + L F K LG E+FLGIEV DG ++++Q +Y D+
Sbjct: 59 DIVITGNDQGKIAQLKGHLFSHFQTKDLGKFEYFLGIEVAQSKDG-IIISQRKYALDI 229
Score = 33.9 bits (76), Expect(2) = 1e-07
Identities = 18/37 (48%), Positives = 21/37 (56%)
Frame = +1
Query: 81 YGSIAGSLQ*LTITRPEQSYSVKKVCPFMSSLHDSHR 117
Y + G L LTITRP S+ V V FM S H+ HR
Sbjct: 325 YRILVGKLIYLTITRPNISFVVGVVSQFMQSPHNDHR 435
>BF068614 similar to GP|27817858|db OJ1081_B12.7 {Oryza sativa (japonica
cultivar-group)}, partial (3%)
Length = 413
Score = 49.3 bits (116), Expect = 2e-06
Identities = 27/56 (48%), Positives = 36/56 (64%)
Frame = +2
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIR 56
DI ITG+ L+ IS+LN F LKQLG L++FLGIEVK L D + + ++ R
Sbjct: 65 DITITGNCVLLIQQLISQLNSQFALKQLGLLDYFLGIEVKYLPDKGISCPRLKWQR 232
Score = 47.4 bits (111), Expect = 7e-06
Identities = 27/68 (39%), Positives = 35/68 (50%), Gaps = 13/68 (19%)
Frame = +1
Query: 56 RDL*AKVNISYGEKLQIPHNRGC-------------SFYGSIAGSLQ*LTITRPEQSYSV 102
RDL K ++ + + P C + YGS+ G+LQ T+TRPE SYSV
Sbjct: 199 RDLLPKTKMAEAQPISSPMVSSCKLSKSGSDLFQDPTLYGSVVGALQYATLTRPEFSYSV 378
Query: 103 KKVCPFMS 110
KVC FMS
Sbjct: 379 NKVCQFMS 402
>BI321712
Length = 399
Score = 48.9 bits (115), Expect = 2e-06
Identities = 35/122 (28%), Positives = 60/122 (48%), Gaps = 13/122 (10%)
Frame = -3
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
D+I TG++ ++ F ++ F + +G + ++LGIEVK D + +TQ Y +++
Sbjct: 367 DLIFTGNNPSMFEEFKKDMSNEFEMTDMGLMAYYLGIEVKQ-EDKGIFITQEGYAKEVLK 191
Query: 61 KVNISYGEKLQIP---------HNRG----CSFYGSIAGSLQ*LTITRPEQSYSVKKVCP 107
K + + P H +G + Y S+ GSL+ LT TRP+ Y V V
Sbjct: 190 KFKMDDANPVGTPMECGSKLSKHEKGENVDPTLYKSLIGSLRYLTCTRPDILYVVGVVSR 11
Query: 108 FM 109
+M
Sbjct: 10 YM 5
>TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (Fragment),
partial (16%)
Length = 562
Score = 45.8 bits (107), Expect = 2e-05
Identities = 29/115 (25%), Positives = 56/115 (48%), Gaps = 13/115 (11%)
Frame = +1
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
D+++T + L+ F ++ F + LG + +FLGIE+K S +++ Q +Y +++
Sbjct: 217 DLLVTRDDARLVEEFKQEMMQAFEMTNLGLMTYFLGIEIKQ-SQNKVLICQRKYAKEILK 393
Query: 61 KVNISYGEKLQIPHNRGCSF-------------YGSIAGSLQ*LTITRPEQSYSV 102
K + + + P N+ F Y S+ G L LT TRP+ +++
Sbjct: 394 KFQMEECKSVSTPMNQKEKFNKVDGADKIDEGYYRSLIGCLMYLTATRPDILFAI 558
>BU548243
Length = 599
Score = 45.4 bits (106), Expect = 3e-05
Identities = 37/116 (31%), Positives = 54/116 (45%), Gaps = 16/116 (13%)
Frame = -1
Query: 172 LRPILIFWRS*KQTVVSGSSSEAEYRSLGLSSCRYVKDPDSITWLKVVNLN-----TAPS 226
L P LI W S KQ V + SS+EAEYRS+ +S +TW++ + + T P
Sbjct: 527 LGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSA-------ELTWIQALLMELQIPFTPPV 369
Query: 227 IF*Q-----------VYHHFT*L*HYPTC*YYMELDILFFRKIVINRQLTAQHIPA 271
I V+H T +ME+D+ F + V+++QL HIPA
Sbjct: 368 ILCDNKSAVAIAHNLVFHSRT---------KHMEIDVFFVHEKVLSKQLQIFHIPA 228
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment),
partial (30%)
Length = 687
Score = 40.0 bits (92), Expect = 0.001
Identities = 19/42 (45%), Positives = 28/42 (66%)
Frame = +2
Query: 176 LIFWRS*KQTVVSGSSSEAEYRSLGLSSCRYVKDPDSITWLK 217
L+ W+S KQTVV+ SS+EAEYRS+ + +C + W+K
Sbjct: 104 LVSWKSKKQTVVARSSAEAEYRSMAMVTC-------ELMWIK 208
>BU764568
Length = 420
Score = 39.7 bits (91), Expect = 0.002
Identities = 20/42 (47%), Positives = 28/42 (66%)
Frame = +3
Query: 176 LIFWRS*KQTVVSGSSSEAEYRSLGLSSCRYVKDPDSITWLK 217
LI W+S KQ+VV+ SS+EAEYR++ L +C + WLK
Sbjct: 198 LISWKSKKQSVVAKSSAEAEYRAMALVTCELI-------WLK 302
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 37.0 bits (84), Expect = 0.010
Identities = 22/58 (37%), Positives = 34/58 (57%)
Frame = -2
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL 58
DII+ G S + + L+L F +K LG L++FLG+EV G + ++Q +Y DL
Sbjct: 264 DIILAGDSIDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLG-ITISQRKYCLDL 94
>BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial
(21%)
Length = 421
Score = 36.6 bits (83), Expect = 0.013
Identities = 32/114 (28%), Positives = 50/114 (43%), Gaps = 13/114 (11%)
Frame = +2
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
DI+IT + + L F K L L++FLGIEV DG +V++Q +Y D+
Sbjct: 83 DIMITKKDATKIVQLKEHLFNHFQTKDLRYLKYFLGIEVAQSGDG-VVISQRKYALDILE 259
Query: 61 KVNISYGEKLQIPHNRGCSF-------------YGSIAGSLQ*LTITRPEQSYS 101
+ + + P + Y + G L LTITRP+ S++
Sbjct: 260 ETGMQNCRLVDSPMDPNLKLMAYQSEVYPDPERYRRLVGKLIYLTITRPDISFA 421
>TC232995
Length = 1009
Score = 36.2 bits (82), Expect = 0.017
Identities = 18/78 (23%), Positives = 39/78 (49%)
Frame = +2
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
DII ++ +L F + F + +G L++FLG+++K G + + Q +Y ++L
Sbjct: 218 DIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*G-IFINQSKYCKELIK 394
Query: 61 KVNISYGEKLQIPHNRGC 78
+ + + + P + C
Sbjct: 395 RFGMDSAKHMSTPMSTNC 448
>AI855899 similar to GP|2244960|emb| retrotransposon like protein
{Arabidopsis thaliana}, partial (18%)
Length = 418
Score = 36.2 bits (82), Expect = 0.017
Identities = 17/31 (54%), Positives = 22/31 (70%)
Frame = +1
Query: 81 YGSIAGSLQ*LTITRPEQSYSVKKVCPFMSS 111
Y I G+LQ +T+TRP +Y+V KV FMSS
Sbjct: 103 YRDIVGALQYVTLTRPNIAYNVNKVSEFMSS 195
>BI972503 weakly similar to GP|18378607|gb polyprotein {Oryza sativa
(japonica cultivar-group)}, partial (5%)
Length = 327
Score = 33.5 bits (75), Expect = 0.11
Identities = 30/109 (27%), Positives = 48/109 (43%), Gaps = 13/109 (11%)
Frame = +2
Query: 5 TGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*AKVNI 64
TG+ + + L F K LG L++FLGIEV + S +V++Q +Y D+ + +
Sbjct: 2 TGNDATKIVQLKEHLFSHFQTKDLGYLKYFLGIEV-AQSGVDVVISQRKYALDILEETGM 178
Query: 65 SYGEKLQIPHNRGCSF-------------YGSIAGSLQ*LTITRPEQSY 100
+ P + Y + G L LTITRP+ S+
Sbjct: 179 QNCRPVDSPMDPNLKLMADQSEIYHDPERYRRLVGKLIYLTITRPDISF 325
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 33.5 bits (75), Expect = 0.11
Identities = 13/32 (40%), Positives = 24/32 (74%)
Frame = +2
Query: 176 LIFWRS*KQTVVSGSSSEAEYRSLGLSSCRYV 207
L+ W+S K VV+ SS+EAEY+++ +++C +
Sbjct: 80 LVLWKSNK*NVVARSSAEAEYKAMTVATCELI 175
>BM731813 weakly similar to PIR|T02323|T02 nodulin-like protein [imported] -
Arabidopsis thaliana, partial (2%)
Length = 406
Score = 33.1 bits (74), Expect = 0.14
Identities = 29/70 (41%), Positives = 37/70 (52%), Gaps = 3/70 (4%)
Frame = -2
Query: 127 SKAMSLLASSQACFSRLSFISSWV**C*LGHGS*RSS---FYLRCVYLLRPILIFWRS*K 183
S+++ SS ACFS SSW+ S SS F LRC +LR L+F+RS K
Sbjct: 309 SRSLLESKSSSACFSFFHSPSSWIIRRPSESTSTSSSNLDFLLRCSDVLRADLVFFRSSK 130
Query: 184 QTVVSGSSSE 193
SGSSS+
Sbjct: 129 S---SGSSSD 109
>AW759031 weakly similar to GP|22093573|db polyprotein {Oryza sativa
(japonica cultivar-group)}, partial (5%)
Length = 430
Score = 32.7 bits (73), Expect = 0.18
Identities = 20/47 (42%), Positives = 25/47 (52%)
Frame = +3
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTL 47
DI+ITG ++ S L+ F K LG +FLGIEV S G L
Sbjct: 135 DIVITGDDFDGIHRLKSHLHNKFQTKDLGPPNYFLGIEVAQSSSGFL 275
>BM143109
Length = 415
Score = 32.0 bits (71), Expect = 0.31
Identities = 16/58 (27%), Positives = 31/58 (52%)
Frame = +1
Query: 1 DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL 58
DII ++ +L F + F + + L FLG+++K +G + ++Q +Y +DL
Sbjct: 214 DIIFGSTNDSLCKKFSQDMQNEFEMSMMRELNFFLGLQIKQTKNG-IFISQSKYCKDL 384
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.348 0.152 0.497
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,250,689
Number of Sequences: 63676
Number of extensions: 199415
Number of successful extensions: 1516
Number of sequences better than 10.0: 53
Number of HSP's better than 10.0 without gapping: 1496
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1504
length of query: 277
length of database: 12,639,632
effective HSP length: 96
effective length of query: 181
effective length of database: 6,526,736
effective search space: 1181339216
effective search space used: 1181339216
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.8 bits)
S2: 58 (26.9 bits)
Lotus: description of TM0229.10