
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0234.6
(1379 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 570 e-162
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 564 e-160
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 148 1e-35
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 144 3e-34
CO982036 144 3e-34
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 142 1e-33
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 133 6e-31
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 129 7e-30
BU548243 129 1e-29
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 128 2e-29
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 127 3e-29
TC221132 weakly similar to UP|O23529 (O23529) RETROTRANSPOSON li... 76 5e-29
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 126 8e-29
TC232995 121 2e-27
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 117 5e-26
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 116 6e-26
BU549979 115 1e-25
CO983516 115 1e-25
BE211208 112 9e-25
AI855899 similar to GP|2244960|emb| retrotransposon like protein... 112 1e-24
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 570 bits (1468), Expect = e-162
Identities = 348/991 (35%), Positives = 515/991 (51%), Gaps = 19/991 (1%)
Frame = +1
Query: 407 SLSHVLHTPQIVKNLISVRQLTTDNNVSVCFDPYGFSVIDFQTGIPLMRCNSPGDLYPVT 466
SL+ VL + NLIS+ QL D +V F V + ++ + + S + Y T
Sbjct: 1837 SLNKVLLVKGLTANLISISQLC-DEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWT 2013
Query: 467 P---SFPFAGLAQS-----LWHSRLGHPSSSALQSLRSN---KFISYEHLNSSPVCESCV 515
P S+ L+ +WH R GH ++ + + I + +C C
Sbjct: 2014 PQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQ 2193
Query: 516 FGKHVRLPFVS-SNNVTVMPFDILHSDLW-TSPVLSSAGHRFYVLFLDDFTDFLWTFPLS 573
GK V++ + T ++LH DL V S G R+ + +DDF+ F W +
Sbjct: 2194 IGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIR 2373
Query: 574 NKSQVFEMFISLSNQIRTHFSQTIKCLQCDNGREFDNKSFHDYCAANGLIFRFSCPHTSS 633
KS+ FE+F LS +++ IK ++ D+GREF+N F ++C + G+ FS T
Sbjct: 2374 EKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAITPQ 2553
Query: 634 QNGKAERKIRTINNMIRTLLAHASVPPSFWHHALQMATYLLNIIPRKNLSNLSPTQLLYR 693
QNG ERK RT+ R +L +P + W A+ A Y+ N + + + + ++
Sbjct: 2554 QNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKG 2733
Query: 694 RDPSYTHLRVFGCLCYPLVPSSTINKLQPRSTPCVFLGYPLHHRGYKCFDLSHRKVIISR 753
R PS H +FG CY L K+ P+S +FLGY + R Y+ F+ R V+ S
Sbjct: 2734 RKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESI 2913
Query: 754 HVIFDETQFPFANLTPTPSSTYEWLSDDIHPSVIHRWTTQTPSPDLQPTPVAPSATATPP 813
+V+ D+ L+P E +D+ S D A
Sbjct: 2914 NVVVDD-------LSPARKKDVE---EDVRTS-----------GDNVADAAKSGENAENS 3030
Query: 814 TSTASSSSPSDPSPSSSTPQSPPQP-----APPVRTMATRSMRGIYKPRKLFNLSVTIDD 868
S S+ + P SST P P R + TRS + + +
Sbjct: 3031 DSATDESNINQPDKRSSTRIQKMHPKELIIGDPNRGVTTRSRE----------VEIVSNS 3180
Query: 869 PTISPL-PKNPKLALSDPNWKSAMQSEFDALIRNNTWDLVPRPCDVNIIRCMWIFRHKTK 927
+S + PKN K AL+D W +AMQ E + RN W+LVPRP N+I WIF++KT
Sbjct: 3181 CFVSKIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTN 3360
Query: 928 ANGCFERYKARLVGDGRSQIAGVDCDETFSHVVKPATIRTVLTIALSRSWPIHQLDVQNA 987
G R KARLV G +QI GVD DETF+ V + +IR +L +A + ++Q+DV++A
Sbjct: 3361 EEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSA 3540
Query: 988 FLHGDLHETVYMHQPLGFRDPNHPDYVCRLRKSLYGLKQAPRAWYQCFADYVSTIGFQHS 1047
FL+G L+E VY+ QP GF DP HPD+V RL+K+LYGLKQAPRAWY+ ++++ G++
Sbjct: 3541 FLNGYLNEEVYVEQPKGFADPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKG 3720
Query: 1048 TSDHSLFIYRRGSDMAYLLLYVDDIILISSSHDLRKSIMALLASEFAMKDLGPLSYFLGI 1107
D +LF+ + ++ +YVDDI+ S+++ + + + SEF M +G L+YFLG+
Sbjct: 3721 GIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGL 3900
Query: 1108 AVTRHAGGLFLSQSTYARDIIARAGMASCHPSATPVDTKQKLSTSAGTPCDDPTLYRSLV 1167
V + +FLSQS YA++I+ + GM + TP T KLS D +LYRS++
Sbjct: 3901 QVKQMEDSIFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMI 4080
Query: 1168 GALQYLTFTRPDISYAVQQVCLHMHAPRTEHMLALKRILRYVQGTLQLGLHLYPSPIEKL 1227
G+L YLT +RPDI+YAV + P+ H+ +KRIL+YV GT G+ L
Sbjct: 4081 GSLLYLTASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPML 4260
Query: 1228 ISYTDADWGGCLDTRRSTSGYCVFLGDNLISWSSKRQPTLSRSSAEAEYRGVANVVSESC 1287
+ Y DADW G D R+STSG C +LG+NLISW SK+Q +S S+AEAEY + S+
Sbjct: 4261 VGYCDADWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLV 4440
Query: 1288 WLRNLLLELHFPLS*ATLVYCDNVSAIYLSGNPVQHQRTKHIEMDIHFVREKVARGQARV 1347
W++ +L E + TL YCDN+SAI +S NPVQH RTKHI++ H++R+ V +
Sbjct: 4441 WMKQMLKEYNVEQDVMTL-YCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITL 4617
Query: 1348 LHVPSRHQIADIFTKGLPRVLFDDFRSSLSV 1378
HV + QIADIFTK L F+ R L +
Sbjct: 4618 KHVDTEEQIADIFTKALDANQFEKLRGKLGI 4710
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 564 bits (1453), Expect = e-160
Identities = 347/989 (35%), Positives = 518/989 (52%), Gaps = 17/989 (1%)
Frame = +1
Query: 407 SLSHVLHTPQIVKNLISVRQLTTDNNVSVCFDPYGFSVIDFQTGIPLMRCNSPGDLYPVT 466
SL+ VL + NLIS+ QL D +V F V + ++ + + S + Y T
Sbjct: 1840 SLNKVLLVKGLTANLISISQLC-DEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWT 2016
Query: 467 P---SFPFAGLAQS-----LWHSRLGHPSSSALQSLRSN---KFISYEHLNSSPVCESCV 515
P S+ L +WH R GH ++ + + I + +C C
Sbjct: 2017 PQETSYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQ 2196
Query: 516 FGKHVRLPFVS-SNNVTVMPFDILHSDLW-TSPVLSSAGHRFYVLFLDDFTDFLWTFPLS 573
GK V++ + T ++LH DL V S G R+ + +DDF+ F W +
Sbjct: 2197 IGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIR 2376
Query: 574 NKSQVFEMFISLSNQIRTHFSQTIKCLQCDNGREFDNKSFHDYCAANGLIFRFSCPHTSS 633
KS FE+F LS +++ IK ++ D+GREF+N F ++C + G+ FS T
Sbjct: 2377 EKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITPQ 2556
Query: 634 QNGKAERKIRTINNMIRTLLAHASVPPSFWHHALQMATYLLNIIPRKNLSNLSPTQLLYR 693
QNG ERK RT+ R +L +P + W A+ A Y+ N + + + + ++
Sbjct: 2557 QNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKG 2736
Query: 694 RDPSYTHLRVFGCLCYPLVPSSTINKLQPRSTPCVFLGYPLHHRGYKCFDLSHRKVIISR 753
R P+ H +FG CY L K+ P+S +FLGY + R Y+ F+ R V+ S
Sbjct: 2737 RKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESI 2916
Query: 754 HVIFDETQFPFANLTPTPSSTYEWLSDDIHPSVIHRWTTQTPSPDLQPTPVAPSATATPP 813
+V+ D+ LTP E +D+ S + T + + + + SAT P
Sbjct: 2917 NVVVDD-------LTPARKKDVE---EDVRTSGDNVADTAKSAENAENSD---SATDEP- 3054
Query: 814 TSTASSSSPSDPSPSSSTPQSPPQP---APPVRTMATRSMRGIYKPRKLFNLSVTIDDPT 870
+ + D PS + P+ P R + TRS + + +
Sbjct: 3055 -----NINQPDKRPSIRIQKMHPKELIIGDPNRGVTTRSRE----------IEIVSNSCF 3189
Query: 871 ISPL-PKNPKLALSDPNWKSAMQSEFDALIRNNTWDLVPRPCDVNIIRCMWIFRHKTKAN 929
+S + PKN K AL+D W +AMQ E + RN W+LVPRP N+I WIF++KT
Sbjct: 3190 VSKIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEE 3369
Query: 930 GCFERYKARLVGDGRSQIAGVDCDETFSHVVKPATIRTVLTIALSRSWPIHQLDVQNAFL 989
G R KARLV G +QI GVD DETF+ V + +IR +L +A + ++Q+DV++AFL
Sbjct: 3370 GVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFL 3549
Query: 990 HGDLHETVYMHQPLGFRDPNHPDYVCRLRKSLYGLKQAPRAWYQCFADYVSTIGFQHSTS 1049
+G L+E Y+ QP GF DP HPD+V RL+K+LYGLKQAPRAWY+ ++++ G++
Sbjct: 3550 NGYLNEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGI 3729
Query: 1050 DHSLFIYRRGSDMAYLLLYVDDIILISSSHDLRKSIMALLASEFAMKDLGPLSYFLGIAV 1109
D +LF+ + ++ +YVDDI+ S+++ + + + SEF M +G L+YFLG+ V
Sbjct: 3730 DKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQV 3909
Query: 1110 TRHAGGLFLSQSTYARDIIARAGMASCHPSATPVDTKQKLSTSAGTPCDDPTLYRSLVGA 1169
+ +FLSQS YA++I+ + GM + TP T KLS D +LYRS++G+
Sbjct: 3910 KQMEDSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGS 4089
Query: 1170 LQYLTFTRPDISYAVQQVCLHMHAPRTEHMLALKRILRYVQGTLQLGLHLYPSPIEKLIS 1229
L YLT +RPDI+YAV + P+ H+ +KRIL+YV GT G+ L+
Sbjct: 4090 LLYLTASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVG 4269
Query: 1230 YTDADWGGCLDTRRSTSGYCVFLGDNLISWSSKRQPTLSRSSAEAEYRGVANVVSESCWL 1289
Y DADW G D R+STSG C +LG NLISW SK+Q +S S+AEAEY + S+ W+
Sbjct: 4270 YCDADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWM 4449
Query: 1290 RNLLLELHFPLS*ATLVYCDNVSAIYLSGNPVQHQRTKHIEMDIHFVREKVARGQARVLH 1349
+ +L E + TL YCDN+SAI +S NPVQH RTKHI++ H++R+ V + H
Sbjct: 4450 KQMLKEYNVEQDVMTL-YCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEH 4626
Query: 1350 VPSRHQIADIFTKGLPRVLFDDFRSSLSV 1378
V + QIADIFTK L F+ R L +
Sbjct: 4627 VDTEEQIADIFTKALDANQFEKLRGKLGI 4713
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 148 bits (374), Expect = 1e-35
Identities = 71/139 (51%), Positives = 92/139 (66%)
Frame = +2
Query: 1226 KLISYTDADWGGCLDTRRSTSGYCVFLGDNLISWSSKRQPTLSRSSAEAEYRGVANVVSE 1285
+L Y DADW GC RRSTSGYCVF+G NL+SW SK+Q ++RSSAEAEYR +A V E
Sbjct: 14 QLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCE 193
Query: 1286 SCWLRNLLLELHFPLS*ATLVYCDNVSAIYLSGNPVQHQRTKHIEMDIHFVREKVARGQA 1345
W++ L EL F +YCDN +A++++ NPV H+RTKHIE+D HF+REK+ +
Sbjct: 194 LMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEI 373
Query: 1346 RVLHVPSRHQIADIFTKGL 1364
+ S Q DI TK L
Sbjct: 374 VTEFIGSNDQPVDILTKSL 430
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 144 bits (363), Expect = 3e-34
Identities = 94/282 (33%), Positives = 142/282 (50%), Gaps = 3/282 (1%)
Frame = +2
Query: 381 VGSGQGIPIQGSGHTTLLTSYKTKPLSLSHVLHTPQIVKNLISVRQLTTDNNVSVCFDPY 440
+ G + G GH + T LSL+ V+ N+ S+ QLT N SV FD
Sbjct: 26 LADGSRVVATGIGHVS-----PTSSLSLNSVVFILGCPFNITSLSQLTRFRNCSVTFDAN 190
Query: 441 GFSVIDFQTGIPLMRCNSPGDLYPVTPSFPF---AGLAQSLWHSRLGHPSSSALQSLRSN 497
F + + TG + LY + P+ + A + L H RLGHP S L+ +
Sbjct: 191 SFVIQECGTGWTIGVGIESHGLYYLKPNLSWVCSAVTSPKLLHERLGHPHLSKLKIMVP- 367
Query: 498 KFISYEHLNSSPVCESCVFGKHVRLPFVSSNNVTVMPFDILHSDLWTSPVLSSAGHRFYV 557
S E + CESC GKHVR + PF ++H D+W +SS +R++V
Sbjct: 368 ---SLEKIKDL-FCESCQLGKHVRSSXRHVESRVDSPFLVIHXDIWGPNRVSSMSYRYFV 535
Query: 558 LFLDDFTDFLWTFPLSNKSQVFEMFISLSNQIRTHFSQTIKCLQCDNGREFDNKSFHDYC 617
F+D+F+ F + +S++ F++ N+I+T F +TIK L+ DN +E+ + +
Sbjct: 536 TFIDEFSQCTRVFLMKERSEILS-FLTSVNKIKTQFGKTIKILRSDNAKEYFSSVISPFX 712
Query: 618 AANGLIFRFSCPHTSSQNGKAERKIRTINNMIRTLLAHASVP 659
+A G++ +FSCPHT QN AERK R + RTLL HA+ P
Sbjct: 713 SAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANEP 838
>CO982036
Length = 674
Score = 144 bits (363), Expect = 3e-34
Identities = 89/204 (43%), Positives = 117/204 (56%), Gaps = 3/204 (1%)
Frame = -2
Query: 1064 YLLLYVDDIILISSSHDLRKSIMALLASEFAMKDLGPLSYFLGIAVTRHAGGLFLSQSTY 1123
YLL+YVD II+ SS L +++ + L S F +K LG L YF+ I V + L S T
Sbjct: 649 YLLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEV-KSMPDLLFSLRTS 476
Query: 1124 ARDIIARAGMASCHPSATPVDTKQKLSTSAGTPCDDPTLYRSLVGALQYLTFTRPDISYA 1183
+I R P ++P+ T KLS S PT YRS+VGALQY T RP+IS+A
Sbjct: 475 IFEIFCRKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVIRPEISFA 296
Query: 1184 VQQVCLHMHAPRTEHMLALKRILRYVQGTLQLGLHLYPSPIEK---LISYTDADWGGCLD 1240
V +VC M P H +KRILRY++G+L GL L P+ + + + DADW +D
Sbjct: 295 VNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCDADWASAVD 116
Query: 1241 TRRSTSGYCVFLGDNLISWSSKRQ 1264
+RSTSG VFLG NLISW +Q
Sbjct: 115 DKRSTSGAAVFLGPNLISWWXXKQ 44
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 142 bits (357), Expect = 1e-33
Identities = 81/222 (36%), Positives = 125/222 (55%), Gaps = 4/222 (1%)
Frame = +1
Query: 1159 DPTLYRSLVGALQYLTFTRPDISYAVQQVCLHMHAPRTEHMLALKRILRYVQGTLQLGLH 1218
D T +R L+G+L+YL +RP+I +AV + M PR HM A KR+LR ++GT+ G+
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGV- 186
Query: 1219 LYP----SPIEKLISYTDADWGGCLDTRRSTSGYCVFLGDNLISWSSKRQPTLSRSSAEA 1274
L+P S L+ YTD+DW + +ST GY D ++ SSK+Q ++ S+ EA
Sbjct: 187 LFPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEA 366
Query: 1275 EYRGVANVVSESCWLRNLLLELHFPLS*ATLVYCDNVSAIYLSGNPVQHQRTKHIEMDIH 1334
EY + ++ W+ NLL EL + DN SAI L+ +P H R+KHIE+ H
Sbjct: 367 EYVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFH 546
Query: 1335 FVREKVARGQARVLHVPSRHQIADIFTKGLPRVLFDDFRSSL 1376
++R++V++G V + + Q+AD+ TK + F S L
Sbjct: 547 YIRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 133 bits (334), Expect = 6e-31
Identities = 64/150 (42%), Positives = 98/150 (64%)
Frame = -3
Query: 890 AMQSEFDALIRNNTWDLVPRPCDVNIIRCMWIFRHKTKANGCFERYKARLVGDGRSQIAG 949
AMQ E + RNN W LV +P + +I W+FR+K +G R KARLV G +Q G
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 950 VDCDETFSHVVKPATIRTVLTIALSRSWPIHQLDVQNAFLHGDLHETVYMHQPLGFRDPN 1009
+D +ET++ V + IR +L ++ ++Q+DV++AFL+G + E VY+ QP GF P+
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 1010 HPDYVCRLRKSLYGLKQAPRAWYQCFADYV 1039
P +V +L+K+LYGLKQAPRAWY+ ++++
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFL 9
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 129 bits (325), Expect = 7e-30
Identities = 63/133 (47%), Positives = 82/133 (61%)
Frame = -2
Query: 907 VPRPCDVNIIRCMWIFRHKTKANGCFERYKARLVGDGRSQIAGVDCDETFSHVVKPATIR 966
VP P + C W++ K G +R KARLV G +Q+ G+D +TFS V K T+R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 967 TVLTIALSRSWPIHQLDVQNAFLHGDLHETVYMHQPLGFRDPNHPDYVCRLRKSLYGLKQ 1026
L +A WP+HQLD++NAFLHGDL E +YM QP GF VC+L +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 1027 APRAWYQCFADYV 1039
+PRAW+ F+ V
Sbjct: 46 SPRAWFGKFSHVV 8
>BU548243
Length = 599
Score = 129 bits (323), Expect = 1e-29
Identities = 65/147 (44%), Positives = 91/147 (61%)
Frame = -1
Query: 1232 DADWGGCLDTRRSTSGYCVFLGDNLISWSSKRQPTLSRSSAEAEYRGVANVVSESCWLRN 1291
DA W +D RST G +FLG NLISW S++Q ++SS EAEYR +A +E W++
Sbjct: 587 DAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELTWIQA 408
Query: 1292 LLLELHFPLS*ATLVYCDNVSAIYLSGNPVQHQRTKHIEMDIHFVREKVARGQARVLHVP 1351
LL+EL P + ++ CDN SA+ ++ N V H RTKH+E+D+ FV EKV Q ++ H+P
Sbjct: 407 LLMELQIPFT-PPVILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQIFHIP 231
Query: 1352 SRHQIADIFTKGLPRVLFDDFRSSLSV 1378
+ Q A I TK L F +S L+V
Sbjct: 230 ALDQWAGILTKPLSSARFTFLKSKLTV 150
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 128 bits (321), Expect = 2e-29
Identities = 74/180 (41%), Positives = 101/180 (56%), Gaps = 1/180 (0%)
Frame = +1
Query: 929 NGCFERYKARLVGDGRSQIAGVDCDETFSHVVKPATIRTVLTIALSRSWPIHQLDVQNAF 988
+G +++KARLV +Q+ G D TFS V K A + + ++A+ WP+ LD +NAF
Sbjct: 25 SGTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAF 204
Query: 989 LHGDLHETVYMHQPLGF-RDPNHPDYVCRLRKSLYGLKQAPRAWYQCFADYVSTIGFQHS 1047
LHG L E VYM QPLGF + VC+L +S YGLKQ+PRAW + + I +
Sbjct: 205 LHGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAWPFLYCG--AAIWYDSH 378
Query: 1048 TSDHSLFIYRRGSDMAYLLLYVDDIILISSSHDLRKSIMALLASEFAMKDLGPLSYFLGI 1107
+DHS+F YL++YVDDI + S + L +F KDLG L YFLGI
Sbjct: 379 EADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 127 bits (319), Expect = 3e-29
Identities = 75/186 (40%), Positives = 109/186 (58%), Gaps = 3/186 (1%)
Frame = +3
Query: 1196 TEHMLALKRILRYVQGTLQLGLHLYPSPIEKLISYTDADWGGCLDTRRSTSGYCVFLGDN 1255
T + A R+L+Y++G + GL +++ ++DADW C+D+ +S + YC FLG +
Sbjct: 3 TRPLCAATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSS 182
Query: 1256 LISWSSKRQPTLSR--SSAEAEYRGVANVVSESCWLRNLLLELHFPLS*ATLVYCDNVSA 1313
LISW +K+Q T+SR SS+EA+YR + + E WL LL +LH TL+YCDN SA
Sbjct: 183 LISWKAKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLH-----VTLIYCDNQSA 347
Query: 1314 IYLSGNPVQHQRTKHIEMDIHFVREKVARGQAR-VLHVPSRHQIADIFTKGLPRVLFDDF 1372
L P++ +E+D H VREK +G +L V S +Q+ADIFTK L LF
Sbjct: 348 --LQ*LPIKVIYHGQLEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSN 521
Query: 1373 RSSLSV 1378
S L +
Sbjct: 522 LSKLGL 539
>TC221132 weakly similar to UP|O23529 (O23529) RETROTRANSPOSON like protein,
partial (5%)
Length = 799
Score = 76.3 bits (186), Expect(3) = 5e-29
Identities = 38/96 (39%), Positives = 55/96 (56%)
Frame = +1
Query: 1191 MHAPRTEHMLALKRILRYVQGTLQLGLHLYPSPIEKLISYTDADWGGCLDTRRSTSGYCV 1250
M P M A KR+LRY++GT+ GL L SP + L ++ DA+W RST Y V
Sbjct: 142 MKDPTKIRMQATKRVLRYLKGTIDFGLQLRSSPDQHLRAFYDANWVDNTSDIRSTGAYVV 321
Query: 1251 FLGDNLISWSSKRQPTLSRSSAEAEYRGVANVVSES 1286
+ G ++ISWS K+Q + +SS + EY + + ES
Sbjct: 322 YFGLSVISWSCKKQSIIDKSSTKVEYHKITTTIIES 429
Score = 57.0 bits (136), Expect(3) = 5e-29
Identities = 29/59 (49%), Positives = 37/59 (62%)
Frame = +2
Query: 1306 VYCDNVSAIYLSGNPVQHQRTKHIEMDIHFVREKVARGQARVLHVPSRHQIADIFTKGL 1364
+Y N+ A+YL NPV H KH+ +D FV++ VA Q RV HVPS H D+FTK L
Sbjct: 455 MYSYNIGAMYLCANPVFHLCMKHLTIDHLFVQDLVANKQLRVSHVPSCH*HVDLFTKAL 631
Score = 34.7 bits (78), Expect(3) = 5e-29
Identities = 15/39 (38%), Positives = 24/39 (61%)
Frame = +3
Query: 1152 SAGTPCDDPTLYRSLVGALQYLTFTRPDISYAVQQVCLH 1190
S P D +Y LV +LQYL+ T PDI++ + ++ +H
Sbjct: 27 SGDVPSCDGIVYCQLVDSLQYLSLTCPDIAFPINKLSVH 143
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 126 bits (316), Expect = 8e-29
Identities = 59/129 (45%), Positives = 78/129 (59%)
Frame = +3
Query: 874 LPKNPKLALSDPNWKSAMQSEFDALIRNNTWDLVPRPCDVNIIRCMWIFRHKTKANGCFE 933
+P + AL P W+ AM E AL N TW+LVP P + C W++ K NG +
Sbjct: 18 VPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGKVD 197
Query: 934 RYKARLVGDGRSQIAGVDCDETFSHVVKPATIRTVLTIALSRSWPIHQLDVQNAFLHGDL 993
R KARLV G +Q+ G++ +TFS V T+R L +A R WP+HQLD++NAFLHGDL
Sbjct: 198 RLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHGDL 377
Query: 994 HETVYMHQP 1002
E +YM QP
Sbjct: 378 EEDIYMEQP 404
>TC232995
Length = 1009
Score = 121 bits (303), Expect = 2e-27
Identities = 64/170 (37%), Positives = 98/170 (57%)
Frame = +2
Query: 999 MHQPLGFRDPNHPDYVCRLRKSLYGLKQAPRAWYQCFADYVSTIGFQHSTSDHSLFIYRR 1058
+ QP GF + P++V +L+K+LYGLKQAPRAWY+ ++++ F D +LFI R+
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRK 181
Query: 1059 GSDMAYLLLYVDDIILISSSHDLRKSIMALLASEFAMKDLGPLSYFLGIAVTRHAGGLFL 1118
+D+ + +YVDDII S++ L K + SEF M +G L YFLG+ + + G+F+
Sbjct: 182 HNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFI 361
Query: 1119 SQSTYARDIIARAGMASCHPSATPVDTKQKLSTSAGTPCDDPTLYRSLVG 1168
+QS Y +++I R GM S +TP+ T L D YR +G
Sbjct: 362 NQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIG 511
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 117 bits (292), Expect = 5e-26
Identities = 61/138 (44%), Positives = 84/138 (60%)
Frame = -2
Query: 1019 KSLYGLKQAPRAWYQCFADYVSTIGFQHSTSDHSLFIYRRGSDMAYLLLYVDDIILISSS 1078
KSLYGLKQA R WY+ + + G+ S SD+SLF +G+ LL+YVDDIIL S
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241
Query: 1079 HDLRKSIMALLASEFAMKDLGPLSYFLGIAVTRHAGGLFLSQSTYARDIIARAGMASCHP 1138
D I +L F +K+LG L YFLG+ V G+ +SQ Y D++ +G+ C P
Sbjct: 240 IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61
Query: 1139 SATPVDTKQKLSTSAGTP 1156
++TP+DT KL ++AGTP
Sbjct: 60 ASTPLDTSIKLHSAAGTP 7
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 116 bits (291), Expect = 6e-26
Identities = 59/139 (42%), Positives = 85/139 (60%)
Frame = +3
Query: 1240 DTRRSTSGYCVFLGDNLISWSSKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELHFP 1299
D R+ST+G+ F+GD +W SK+QP ++ S+ EAEY + V + WLRNLL EL P
Sbjct: 9 DDRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKMP 188
Query: 1300 LS*ATLVYCDNVSAIYLSGNPVQHQRTKHIEMDIHFVREKVARGQARVLHVPSRHQIADI 1359
+ DN SA+ L+ NPV H+++KHI+ HF+RE + + + ++ +V S+ Q ADI
Sbjct: 189 QEEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAADI 368
Query: 1360 FTKGLPRVLFDDFRSSLSV 1378
FTK L F RS L V
Sbjct: 369 FTKPLKLETFVKLRSMLGV 425
>BU549979
Length = 615
Score = 115 bits (289), Expect = 1e-25
Identities = 59/180 (32%), Positives = 98/180 (53%), Gaps = 2/180 (1%)
Frame = -1
Query: 1194 PRTEHMLALKRILRYVQGTLQLGLHLYPSPIEKLISYTDADWGGCLDTRRSTSGYCVFLG 1253
P +H K+++RY+QGT L + ++I Y+D+D+ GC+D+RRSTSGY L
Sbjct: 591 PGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTSGYIFMLA 412
Query: 1254 DNLISWSSKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELHF--PLS*ATLVYCDNV 1311
D ++SW S +Q ++ S+ E E+ S WL++ + L +S +YCDN
Sbjct: 411 DGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPLKLYCDNF 232
Query: 1312 SAIYLSGNPVQHQRTKHIEMDIHFVREKVARGQARVLHVPSRHQIADIFTKGLPRVLFDD 1371
+A++++ N R+KHI++ +RE+V + + HV + I D TKG+ F D
Sbjct: 231 AAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGMTPKNFKD 52
>CO983516
Length = 724
Score = 115 bits (288), Expect = 1e-25
Identities = 53/122 (43%), Positives = 83/122 (67%)
Frame = +2
Query: 952 CDETFSHVVKPATIRTVLTIALSRSWPIHQLDVQNAFLHGDLHETVYMHQPLGFRDPNHP 1011
CD+ F V + +IR +L +A + ++Q+DV++AFL+G L+E VY+ QP GF DP HP
Sbjct: 353 CDKEFHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHP 532
Query: 1012 DYVCRLRKSLYGLKQAPRAWYQCFADYVSTIGFQHSTSDHSLFIYRRGSDMAYLLLYVDD 1071
D+V RL+K+LYGLKQAPRAWY+ + ++ G++ D +LF+ + ++ +YVDD
Sbjct: 533 DHVYRLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDD 712
Query: 1072 II 1073
I+
Sbjct: 713 IV 718
>BE211208
Length = 413
Score = 112 bits (281), Expect = 9e-25
Identities = 59/133 (44%), Positives = 86/133 (64%), Gaps = 1/133 (0%)
Frame = +2
Query: 1057 RRGSDMAYLLLYVDDIILISSSHDLRKSIMALLASEFAMKDLGPLSYFLGIAVTRH-AGG 1115
++ ++ YLL+YVDDII+ S+ L +S++ L S F++K LG L YFLGI V G
Sbjct: 8 KKDRNLVYLLVYVDDIIITGRSNYLIQSLVHHLNSNFSLKQLGQLDYFLGIEVHHTPTGS 187
Query: 1116 LFLSQSTYARDIIARAGMASCHPSATPVDTKQKLSTSAGTPCDDPTLYRSLVGALQYLTF 1175
+ L+QS Y D++ + MA P ++P+ T +LS + DPT+YRS+VGALQY T
Sbjct: 188 VLLTQSKYICDLLHKTDMAEAKPISSPMVTNLRLSKNGDDLLSDPTMYRSVVGALQYPTI 367
Query: 1176 TRPDISYAVQQVC 1188
TRP+IS+A +VC
Sbjct: 368 TRPEISFAANKVC 406
>AI855899 similar to GP|2244960|emb| retrotransposon like protein {Arabidopsis
thaliana}, partial (18%)
Length = 418
Score = 112 bits (280), Expect = 1e-24
Identities = 61/135 (45%), Positives = 82/135 (60%), Gaps = 3/135 (2%)
Frame = +1
Query: 1133 MASCHPSATPVDTKQKLSTSAGTPCDDPTLYRSLVGALQYLTFTRPDISYAVQQVCLHMH 1192
M C+ +TP+ + KLS + YR +VGALQY+T TRP+I+Y V +V M
Sbjct: 13 MLDCNGISTPMVSSYKLSKFGSELLPNAHQYRDIVGALQYVTLTRPNIAYNVNKVSEFMS 192
Query: 1193 APRTEHMLALKRILRYVQGTLQLGLHLYPSPIEKLIS---YTDADWGGCLDTRRSTSGYC 1249
+P + L +KRILRY+ GT+ GL L P+ ++ IS Y D DWG RSTSG C
Sbjct: 193 SPLQSY*LTVKRILRYLSGTVTQGLLLQPAHMDAKISLRAYNDLDWGSDPAEMRSTSGSC 372
Query: 1250 VFLGDNLISWSSKRQ 1264
+F G NLI+WSSK+Q
Sbjct: 373 IFSGSNLIAWSSKKQ 417
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.329 0.141 0.453
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 78,750,988
Number of Sequences: 63676
Number of extensions: 1533607
Number of successful extensions: 31684
Number of sequences better than 10.0: 1704
Number of HSP's better than 10.0 without gapping: 15864
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 23571
length of query: 1379
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1270
effective length of database: 5,698,948
effective search space: 7237663960
effective search space used: 7237663960
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0234.6