
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0279a.6
(1582 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
CF922226 156 6e-38
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 146 4e-35
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 144 4e-34
BU549979 102 1e-21
AW185460 101 3e-21
BE802806 95 3e-19
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 94 5e-19
BE474381 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, pa... 64 3e-16
CO982036 84 6e-16
TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement p... 81 3e-15
BG508993 79 2e-14
BI784757 73 9e-13
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 68 3e-11
BU764568 50 1e-10
BI424202 65 2e-10
AI855899 similar to GP|2244960|emb| retrotransposon like protein... 65 2e-10
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 64 7e-10
TC203574 weakly similar to UP|Q850I3 (Q850I3) Gag-pol polyprotei... 63 1e-09
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 62 2e-09
CO983062 40 2e-08
>CF922226
Length = 667
Score = 156 bits (395), Expect = 6e-38
Identities = 90/228 (39%), Positives = 127/228 (55%), Gaps = 13/228 (5%)
Frame = -3
Query: 90 MSKTLMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSL 149
M+K+L+N+L+ KQ LYS KM E + + FN ++ DL + VT+DDED+ ++LLC L
Sbjct: 665 MTKSLVNRLYXKQSLYSFKMHEDRSVGEQLDLFNKLILDLENIDVTIDDEDQALLLLCYL 486
Query: 150 PGSYDHLVTTLTYGKDSITLDSISSTLLQHAQRRRSVEEGGGSSGEGLFVKGGQDRGRGK 209
P SY H TL +G+DS++LD + T L + E+ +SGEGL RGK
Sbjct: 485 PKSYSHFKETLLFGRDSVSLDEV-QTALNSKELNERKEKKSSASGEGL-------TARGK 330
Query: 210 GKAVDSG-KKKRSKSKDRKTAE-------CYSCKQIGHWKRDCPNRSGKSGNS-----SS 256
DS KK+ K +++K E CY CK+ GH ++ CP R G++ S
Sbjct: 329 TFKKDSEFDKKKQKPENQKNGEGNIFKIRCYHCKKEGHTRKVCPERQKNGGSNNRKKDSG 150
Query: 257 AANVVQSDGSCSEEDLLCVSYVKCTDAWVLDSGCSCHMTPHREWFNSF 304
A +VQ DG S E L+ VS W++DSGCS HMTP++ WF F
Sbjct: 149 NAAIVQDDGYESAEALM-VSEKNPETKWIMDSGCSWHMTPNKSWFEQF 9
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 146 bits (369), Expect(2) = 4e-35
Identities = 98/332 (29%), Positives = 161/332 (47%), Gaps = 12/332 (3%)
Frame = +1
Query: 220 RSKSKDRKTAECYSCKQIGHWKRDC------PNRSGKSGNSSSAANVVQSDGSCSEEDLL 273
+ K RK C+ C + GH K C P+ +S NS V + S L+
Sbjct: 1477 QQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSNSRKKMMWVPKHKAVS---LV 1647
Query: 274 CVSYVKCT--DAWVLDSGCSCHMTPHREWFNSFKSCDFGYVYLGDDKPCIIKGM*QVKIA 331
+ ++ + + W LDSGCS HMT +E+ + + C YV GD I GM +
Sbjct: 1648 VHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGK---- 1815
Query: 332 LDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTA 391
L G+ +L++V V +T NLIS+ L + G++ ++ ++ K + +M+ R+
Sbjct: 1816 LVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEV-LMKGSRSK 1992
Query: 392 GNIYKLLGSTVMGDVASVETDDDATKLWHMRLGHLSERGMMELYKRNLLKGVRSCTIG-- 449
N Y + + +D ++WH R GHL RGM ++ + ++G+ + I
Sbjct: 1993 DNCYLWTPQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEG 2172
Query: 450 -LCKYCVLGKQCRVRFKTGQHKTKG-ILDYVHSDVWGPTKEPSVGGFRYFVTFTDDFSRK 507
+C C +GKQ ++ + QH+T +L+ +H D+ GP + S+GG RY DDFSR
Sbjct: 2173 RICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRF 2352
Query: 508 VWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
WV F++ KSE F FK ++ + IK
Sbjct: 2353 TWVNFIREKSETFEVFKELSLRLQREKDCVIK 2448
Score = 129 bits (323), Expect = 1e-29
Identities = 69/207 (33%), Positives = 117/207 (56%)
Frame = +1
Query: 1304 LGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTASEIE 1363
LG+++ + + + ++LSQ+ Y + ++ +F M A+ TP H KLS ++ + +
Sbjct: 3892 LGLQVKQMEDS--IFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQS- 4062
Query: 1364 GMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKGTADR 1423
+ S +G L+Y + +RPD+ A ++ + P H VK IL+Y+ GT+D
Sbjct: 4063 -----LYRSMIGSLLY-LTASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSDY 4224
Query: 1424 GIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAMSTTE 1483
GIM+ P++VGY D+D+AG DDR+ST+G F L I W S Q+ V++ST E
Sbjct: 4225 GIMYCHCSN--PMLVGYCDADWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAE 4398
Query: 1484 AEYMAVAEAVKEALWLTGLVKKLGVEQ 1510
AEY+A + + +W+ ++K+ VEQ
Sbjct: 4399 AEYIAAGSSCSQLVWMKQMLKEYNVEQ 4479
Score = 30.4 bits (67), Expect = 6.4
Identities = 42/142 (29%), Positives = 57/142 (39%), Gaps = 2/142 (1%)
Frame = +2
Query: 1091 RRWSP*KRMRHGILFSCHMERELLVASGCTRKSQQ*QKKKVK--SSRLA*LQRGIHNRRG 1148
+ WS K M+ G F E L SG +R KKV +R L + +
Sbjct: 3257 KNWSNSKGMKSGS*FLGLRELM*LAPSGSSRTKPM---KKVS*PETRPDWLLKATLRLKV 3427
Query: 1149 LIMMRFSVWWLDTLLSGQY*LW*PAGICTESRWM*KQFSSMVI*RSKSTWRSQMDSVKLE 1208
+MR LD S Y + + + +RWM*+ M * KS W SQ D
Sbjct: 3428 *TLMRLLPQLLDLSPSDYYLV*LVSSNSSCTRWM*RAHF*MDT*MKKSMWSSQRDLQTRL 3607
Query: 1209 MADLFAN*RGLCMA*SSLQGSG 1230
+ ++ R L M *S LQ G
Sbjct: 3608 IQIMYTGSRRLSMD*SKLQELG 3673
Score = 21.9 bits (45), Expect(2) = 4e-35
Identities = 11/21 (52%), Positives = 14/21 (66%)
Frame = +2
Query: 572 HHSRMVLQRG*TGL*QRRQGA 592
HH+RM RG TGL +R G+
Sbjct: 2546 HHNRMG*LRGKTGLCKRLLGS 2608
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 144 bits (362), Expect = 4e-34
Identities = 96/329 (29%), Positives = 160/329 (48%), Gaps = 9/329 (2%)
Frame = +1
Query: 220 RSKSKDRKTAECYSCKQIGHWKRDCPNRSGKSGN---SSSAANVVQSDGSCSEEDLLCVS 276
+ K RK C+ C + GH K C + G + SSS+ + L+ +
Sbjct: 1480 QQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSSSGRKMMWVPKHKIVSLVVHT 1659
Query: 277 YVKCT--DAWVLDSGCSCHMTPHREWFNSFKSCDFGYVYLGDDKPCIIKGM*QVKIALDD 334
++ + + W LDSGCS HMT +E+ + + C YV GD I GM + L
Sbjct: 1660 SLRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGK----LVH 1827
Query: 335 GGVRTLSQVRYVPEVTKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNI 394
G+ +L++V V +T NLIS+ L + G++ ++ ++ K + +M+ R+ N
Sbjct: 1828 DGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEV-LMKGSRSKDNC 2004
Query: 395 YKLLGSTVMGDVASVETDDDATKLWHMRLGHLSERGMMELYKRNLLKGVRSCTIG---LC 451
Y + + +D K+WH R GHL RGM ++ + ++G+ + I +C
Sbjct: 2005 YLWTPQETSYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRIC 2184
Query: 452 KYCVLGKQCRVRFKTGQHKTKG-ILDYVHSDVWGPTKEPSVGGFRYFVTFTDDFSRKVWV 510
C +GKQ ++ + QH+T +L+ +H D+ GP + S+GG RY DDFSR WV
Sbjct: 2185 GECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWV 2364
Query: 511 YFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
F++ KS+ F FK ++ + IK
Sbjct: 2365 NFIREKSDTFEVFKELSLRLQREKDCVIK 2451
Score = 125 bits (314), Expect = 1e-28
Identities = 68/207 (32%), Positives = 116/207 (55%)
Frame = +1
Query: 1304 LGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTASEIE 1363
LG+++ + + + ++LSQ+ Y + ++ +F M A+ TP H KLS ++ + +
Sbjct: 3895 LGLQVKQMEDS--IFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQS- 4065
Query: 1364 GMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKGTADR 1423
+ S +G L+Y + +RPD+ A ++ + P H VK IL+Y+ GT+D
Sbjct: 4066 -----LYRSMIGSLLY-LTASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDY 4227
Query: 1424 GIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAMSTTE 1483
GIM+ ++VGY D+D+AG DDR+ST+G F L I W S Q+ V++ST E
Sbjct: 4228 GIMYCHCSD--SMLVGYCDADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAE 4401
Query: 1484 AEYMAVAEAVKEALWLTGLVKKLGVEQ 1510
AEY+A + + +W+ ++K+ VEQ
Sbjct: 4402 AEYIAAGSSCSQLVWMKQMLKEYNVEQ 4482
Score = 35.0 bits (79), Expect = 0.26
Identities = 51/223 (22%), Positives = 80/223 (35%), Gaps = 42/223 (18%)
Frame = +1
Query: 91 SKTLMNKL-FAKQRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSL 149
SK M++L + +LKM+E + I T LG + DE +L SL
Sbjct: 358 SKVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSL 537
Query: 150 PGSYDHLVTTLTYGKD--SITLDSISSTL------LQHAQRRRSVEEGGGSSGEGLFVKG 201
P +D VT + +D ++ +D + +L L ++S S+ EG +
Sbjct: 538 PKRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRTEKKSKNLAFVSNDEGEEDEY 717
Query: 202 GQDRGRGKGKAV---------------------------------DSGKKKRSKSKDRKT 228
D G AV + KK K K
Sbjct: 718 DLDTDEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGSEYQKKSDEKPSHSKG 897
Query: 229 AECYSCKQIGHWKRDCPNRSGKSGNSSSAANVVQSDGSCSEED 271
+C+ C+ GH K +CP K S V +SD + SE++
Sbjct: 898 IQCHGCEGYGHIKAECPTHLKKQRKGLS---VCRSDDTESEQE 1017
>BU549979
Length = 615
Score = 102 bits (254), Expect = 1e-21
Identities = 47/112 (41%), Positives = 71/112 (62%)
Frame = -1
Query: 1397 KFMSKPGKQHWEAVKWILRYLKGTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTT 1456
++ S PG HW+ K ++RYL+GT D +M+ + + V+GY DSD+AG +D RRST+
Sbjct: 606 RYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLE--VIGYSDSDFAGCVDSRRSTS 433
Query: 1457 GYVFTLAGGPICWKSSVQSIVAMSTTEAEYMAVAEAVKEALWLTGLVKKLGV 1508
GY+F LA G + W+SS Q+++A ST E E++ EA +WL + L V
Sbjct: 432 GYIFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRV 277
>AW185460
Length = 411
Score = 101 bits (251), Expect = 3e-21
Identities = 53/108 (49%), Positives = 69/108 (63%)
Frame = +2
Query: 1384 TRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKGTADRGIMFSREQGVVPLVVGYVDS 1443
TRPD+ A S + +FM P + H+ A K ILRYL+GT GI ++ E ++GY DS
Sbjct: 92 TRPDIMYATSLLSRFMQSPSQIHFGAGKRILRYLQGTKAFGIWYTTETNSE--LLGYTDS 265
Query: 1444 DYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAMSTTEAEYMAVAE 1491
D+AG DD +ST+GY F+L G W S Q+ VA ST EAEY+AVAE
Sbjct: 266 DWAGSTDDMKSTSGYAFSLGSGMFSWASKKQATVAQSTAEAEYVAVAE 409
>BE802806
Length = 285
Score = 94.7 bits (234), Expect = 3e-19
Identities = 43/89 (48%), Positives = 68/89 (76%)
Frame = -2
Query: 1299 AAKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKT 1358
+A++ILG++IHR++ +L+LSQ++Y++ V+ RF M ++ PVSTPL +H KLS+ Q P+T
Sbjct: 269 SARRILGIDIHRDRAKGELFLSQSNYLKKVVERFRMHQSKPVSTPLGHHTKLSVTQAPET 90
Query: 1359 ASEIEGMSKISHASAVGCLMYVMVCTRPD 1387
A E M++ +A+ VG +MY MVC+RPD
Sbjct: 89 AEERSKMNQTPYANGVGSIMYGMVCSRPD 3
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 94.0 bits (232), Expect = 5e-19
Identities = 47/138 (34%), Positives = 82/138 (59%), Gaps = 1/138 (0%)
Frame = +1
Query: 1374 VGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKGTADRGIMFS-REQG 1432
+G L Y + +RP++ A S + +FM +P H +A K +LR +KGT G++F + +
Sbjct: 34 IGSLRY-LCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVLFPFKAKS 210
Query: 1433 VVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAMSTTEAEYMAVAEA 1492
P ++GY DSD+ D + +ST GY+F P+ S Q ++A+ST EAEY+A +
Sbjct: 211 GKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAEYVAASLG 390
Query: 1493 VKEALWLTGLVKKLGVEQ 1510
+A+W+ L+++L + +
Sbjct: 391 ACQAVWMMNLLEELKLRE 444
>BE474381 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial (20%)
Length = 406
Score = 64.3 bits (155), Expect(2) = 3e-16
Identities = 33/70 (47%), Positives = 47/70 (67%)
Frame = +3
Query: 1439 GYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAMSTTEAEYMAVAEAVKEALW 1498
GY D+D+AG + DRRST+GY T GG + S QS+VA S+ EAE+ A+A + E LW
Sbjct: 153 GYTDADWAGSVTDRRSTSGYC-TFVGGNLVS*SKKQSVVARSSAEAEFRALAHGICETLW 329
Query: 1499 LTGLVKKLGV 1508
+ L+++L V
Sbjct: 330 VKKLLQELKV 359
Score = 40.8 bits (94), Expect(2) = 3e-16
Identities = 20/38 (52%), Positives = 25/38 (65%)
Frame = +1
Query: 1388 LAQAASQVCKFMSKPGKQHWEAVKWILRYLKGTADRGI 1425
+A A S V +FM PG +H EA ILRYLKG+ RG+
Sbjct: 7 IAFAVSMVSQFMHAPGHEHLEAAFRILRYLKGSPGRGL 120
>CO982036
Length = 674
Score = 83.6 bits (205), Expect = 6e-16
Identities = 56/166 (33%), Positives = 83/166 (49%), Gaps = 1/166 (0%)
Frame = -2
Query: 1312 KGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTASEIEGMSKISHA 1371
K L S + + + R +A P+S+P+ KLS K+ S++ +
Sbjct: 514 KSMPDLLFSLRTSIFEIFCRKPR*QAQPISSPMTTTCKLS-----KSDSDLFS-GPTFYR 353
Query: 1372 SAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKGTADRGIMFSREQ 1431
S VG L Y V RP+++ A ++VC+FMS P HW VK ILRYLKG+ G+
Sbjct: 352 SVVGALQYTTVI-RPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAI 176
Query: 1432 GVVPLVV-GYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSI 1476
PL + G+ D+D+A +DD+RST+G L I W Q +
Sbjct: 175 SSQPLPIRGFCDADWASAVDDKRSTSGAAVFLGPNLISWWXXKQQV 38
>TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement pol
polyprotein, partial (4%)
Length = 919
Score = 81.3 bits (199), Expect = 3e-15
Identities = 43/92 (46%), Positives = 58/92 (62%), Gaps = 1/92 (1%)
Frame = +1
Query: 449 GLCKYCVLGKQCRVRFKTGQH-KTKGILDYVHSDVWGPTKEPSVGGFRYFVTFTDDFSRK 507
G+C C +GK+ R F TG+ + K +L VH D+ + P+ G YF+TF DDFS+K
Sbjct: 325 GVCDTCEIGKKHRESFPTGKSWRMKKLLKIVHLDLC-TVEIPTHGDNNYFITFIDDFSKK 501
Query: 508 VWVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
+WVYFLK KSE FK++KA E Q G K+K
Sbjct: 502 MWVYFLKQKSEACNAFKMFKAFAEKQNGCKVK 597
>BG508993
Length = 374
Score = 78.6 bits (192), Expect = 2e-14
Identities = 40/93 (43%), Positives = 56/93 (60%)
Frame = +1
Query: 1418 KGTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIV 1477
KGT D G+ +S +VG+ DSD+AGD+DDR+STTG+VF + W S Q IV
Sbjct: 4 KGTIDFGLFYSPSNNYK--LVGFCDSDFAGDVDDRKSTTGFVFFMGDCVFTWSSKKQGIV 177
Query: 1478 AMSTTEAEYMAVAEAVKEALWLTGLVKKLGVEQ 1510
+ T EAEY+A A+WL L+++L + Q
Sbjct: 178 TLFTCEAEYVAATSCTCHAIWLRRLLEELQLLQ 276
>BI784757
Length = 430
Score = 73.2 bits (178), Expect = 9e-13
Identities = 37/91 (40%), Positives = 56/91 (60%), Gaps = 1/91 (1%)
Frame = +1
Query: 450 LCKYCVLGKQCRVRFKTGQH-KTKGILDYVHSDVWGPTKEPSVGGFRYFVTFTDDFSRKV 508
+C C+ KQ R FK + K L+ ++SDV GP + S+GG RYF++F D+ +RKV
Sbjct: 64 VCDGCLQCKQSRSTFKQNVPIRAKEKLEVIYSDVCGPMQTESLGGNRYFISFIDELTRKV 243
Query: 509 WVYFLKYKSEVFAKFKLWKAEVENQTGRKIK 539
WVY ++ KS+ F F+ +K + Q+G IK
Sbjct: 244 WVYLIRRKSDFFEVFEKFKNMAKKQSGSLIK 336
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 68.2 bits (165), Expect = 3e-11
Identities = 38/98 (38%), Positives = 58/98 (58%), Gaps = 2/98 (2%)
Frame = +3
Query: 1413 ILRYLKGTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSS 1472
+L+YLKG +G+ FSRE + ++G+ D+D+A +D +S T Y F L I WK+
Sbjct: 30 VLKYLKGCPRKGLSFSRESPIQ--ILGFSDADWATCIDSSKSITWYCFFLGSSLISWKAK 203
Query: 1473 VQSIV--AMSTTEAEYMAVAEAVKEALWLTGLVKKLGV 1508
Q+ V + S++EA+Y A+ E WLT L+K L V
Sbjct: 204 KQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLHV 317
>BU764568
Length = 420
Score = 49.7 bits (117), Expect(2) = 1e-10
Identities = 25/56 (44%), Positives = 34/56 (60%)
Frame = +3
Query: 1455 TTGYVFTLAGGPICWKSSVQSIVAMSTTEAEYMAVAEAVKEALWLTGLVKKLGVEQ 1510
T+GY + G I WKS QS+VA S+ EAEY A+A E +WL L+ +L E+
Sbjct: 165 TSGYCVLIGGNLISWKSKKQSVVAKSSAEAEYRAMALVTCELIWLKQLL*ELKFEE 332
Score = 36.6 bits (83), Expect(2) = 1e-10
Identities = 19/62 (30%), Positives = 31/62 (49%)
Frame = +1
Query: 1397 KFMSKPGKQHWEAVKWILRYLKGTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTT 1456
+F++ P + HW AV IL+ K +G+++ E ++GY D+D G DR
Sbjct: 4 QFLNSPCQDHWNAVS*ILK*TKSAPGKGLIY--EDKGHSQIIGYSDAD*VGSPSDRHQDI 177
Query: 1457 GY 1458
Y
Sbjct: 178 VY 183
>BI424202
Length = 421
Score = 65.5 bits (158), Expect = 2e-10
Identities = 33/128 (25%), Positives = 68/128 (52%)
Frame = -3
Query: 412 DDDATKLWHMRLGHLSERGMMELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKT 471
+++++ LWH RLGH+S + L +L + C+ GKQ + K G ++
Sbjct: 395 NEESSMLWHRRLGHISIERIKRLVNEGVLSTLDFADFETYVDCIKGKQTN-KSKKGAKRS 219
Query: 472 KGILDYVHSDVWGPTKEPSVGGFRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVE 531
+L+ +H+D+ P + + +YF+TF DD+SR +++YF+ + ++ K + ++
Sbjct: 218 SNLLEIIHTDICCPDMDAN--SLKYFITFIDDYSRYMYLYFI-LRMKL*MSLKFLRLKLR 48
Query: 532 NQTGRKIK 539
N K++
Sbjct: 47 NNVENKLR 24
>AI855899 similar to GP|2244960|emb| retrotransposon like protein {Arabidopsis
thaliana}, partial (18%)
Length = 418
Score = 65.5 bits (158), Expect = 2e-10
Identities = 47/143 (32%), Positives = 71/143 (48%), Gaps = 1/143 (0%)
Frame = +1
Query: 1333 DMSKANPVSTPLANHFKLSLEQCPKTASEIEGMSKISHASAVGCLMYVMVCTRPDLAQAA 1392
+M N +STP+ + +KLS K SE+ + + VG L YV + TRP++A
Sbjct: 10 NMLDCNGISTPMVSSYKLS-----KFGSELLPNAH-QYRDIVGALQYVTL-TRPNIAYNV 168
Query: 1393 SQVCKFMSKPGKQHWEAVKWILRYLKGTADRGIMFSREQGVVPLVV-GYVDSDYAGDLDD 1451
++V +FMS P + + VK ILRYL GT +G++ + + Y D D+ D +
Sbjct: 169 NKVSEFMSSPLQSY*LTVKRILRYLSGTVTQGLLLQPAHMDAKISLRAYNDLDWGSDPAE 348
Query: 1452 RRSTTGYVFTLAGGPICWKSSVQ 1474
RST+G I W S Q
Sbjct: 349 MRSTSGSCIFSGSNLIAWSSKKQ 417
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 63.5 bits (153), Expect = 7e-10
Identities = 29/68 (42%), Positives = 44/68 (64%)
Frame = +2
Query: 1439 GYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAMSTTEAEYMAVAEAVKEALW 1498
GY D+D+AG DRRST+GY + G + WKS Q++VA S+ EAEY ++A E +W
Sbjct: 23 GYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCELMW 202
Query: 1499 LTGLVKKL 1506
+ +++L
Sbjct: 203 IKQFLQEL 226
>TC203574 weakly similar to UP|Q850I3 (Q850I3) Gag-pol polyprotein (Fragment),
partial (25%)
Length = 1037
Score = 62.8 bits (151), Expect = 1e-09
Identities = 46/133 (34%), Positives = 70/133 (52%)
Frame = +1
Query: 1374 VGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKGTADRGIMFSREQGV 1433
VG L Y+ + T D+ + V +F + P HW V ILR +KG +G++ ++G
Sbjct: 640 VGKLNYLTL-TITDIYFHVNVVSQFFNAPHDSHWNVVVQILR*IKGLPGKGLI-DMDKGQ 813
Query: 1434 VPLVVGYVDSDYAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAMSTTEAEYMAVAEAV 1493
+V + ++++ D DRRST GY + G I WKS Q A S+TE Y AVA
Sbjct: 814 ANTIV-HRNANWERDASDRRSTIGYCVFIGGDLILWKSKKQI*AARSSTEV-YWAVANTT 987
Query: 1494 KEALWLTGLVKKL 1506
E L L ++++L
Sbjct: 988 CELL*LKLMLQEL 1026
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 62.0 bits (149), Expect = 2e-09
Identities = 46/156 (29%), Positives = 74/156 (46%)
Frame = +3
Query: 1301 KKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTAS 1360
+K+ G+ IH+EK Y + L RF M +A P++TP+ + ++ S
Sbjct: 633 QKVYGIFIHQEK-----------YTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS 779
Query: 1361 EIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKGT 1420
E ++ + L Y + +RPD+ +F S P H AVK ILRYL GT
Sbjct: 780 *KE------YSGMIDSLSY-LTSSRPDIVFVVCLCARFQSYPKISHVTAVKRILRYLVGT 938
Query: 1421 ADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTT 1456
+ + F + ++GY D +AGD +R+ST+
Sbjct: 939 TNHCLWFKKRSEFD--LLGYCDVYFAGDKVERKSTS 1040
>CO983062
Length = 390
Score = 40.4 bits (93), Expect(2) = 2e-08
Identities = 20/54 (37%), Positives = 28/54 (51%)
Frame = +3
Query: 1445 YAGDLDDRRSTTGYVFTLAGGPICWKSSVQSIVAMSTTEAEYMAVAEAVKEALW 1498
+A ++DDRRST+ I W S Q + A S+TEAEY + + E W
Sbjct: 228 WASNIDDRRSTSRAAIFPGRNLISWWSKKQKVTARSSTEAEYQILVQTSIELTW 389
Score = 38.1 bits (87), Expect(2) = 2e-08
Identities = 16/35 (45%), Positives = 23/35 (65%)
Frame = +2
Query: 1384 TRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLK 1418
TRP++ ++V ++M+ P HW VK ILRYLK
Sbjct: 44 TRPEINYVVNKVXQYMTNPLDSHWAVVKRILRYLK 148
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.341 0.149 0.500
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 76,301,870
Number of Sequences: 63676
Number of extensions: 1189326
Number of successful extensions: 12840
Number of sequences better than 10.0: 122
Number of HSP's better than 10.0 without gapping: 10129
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 11850
length of query: 1582
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1472
effective length of database: 5,635,272
effective search space: 8295120384
effective search space used: 8295120384
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (22.0 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0279a.6