
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0490.3
(1199 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 583 e-166
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 583 e-166
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 153 5e-37
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 141 2e-33
CO982036 140 3e-33
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 139 6e-33
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 139 1e-32
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 132 7e-31
BU548243 129 6e-30
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 129 8e-30
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 129 8e-30
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 125 1e-28
TC232995 124 2e-28
CO983516 123 6e-28
TC221132 weakly similar to UP|O23529 (O23529) RETROTRANSPOSON li... 77 6e-28
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 117 4e-27
AI855899 similar to GP|2244960|emb| retrotransposon like protein... 119 6e-27
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 117 2e-26
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 116 5e-26
BU549979 112 1e-24
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 583 bits (1504), Expect = e-166
Identities = 375/1078 (34%), Positives = 546/1078 (49%), Gaps = 33/1078 (3%)
Frame = +1
Query: 145 PAPTDIAAAMHTMSLTPPDTPWYMDTGASSHTAASQGTL-------TSYSNLSHLNQKLI 197
P ++ +HT WY+D+G S H + L TSY ++ I
Sbjct: 1621 PKHKAVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKI 1800
Query: 198 VGSGQGIPIQGSGYTSIPTPHKPLALNHVLHTPRIIKNLISVRQLTTDNNVSVSFDPFGF 257
+G G+ + + +P+ LN VL + NLIS+ QL D +V+F
Sbjct: 1801 IGMGKLV------HDGLPS------LNKVLLVKGLTANLISISQLC-DEGFNVNFTKSEC 1941
Query: 258 SVSDFKTGMPLLRCNSLGDLY---PVTRSSPFAGLASS-----VWHNRLGHPASSALNHL 309
V++ K+ + + S + Y P S L+S +WH R GH HL
Sbjct: 1942 LVTNEKSEVLMKGSRSKDNCYLWTPQETSYSSTCLSSKEDEVRIWHQRFGHL------HL 2103
Query: 310 RNNKLIFCEPS---------RSSSVCDSCVLGKHVRLPFSS-SAIITLRPFDILHSDLW- 358
R K I + + +C C +GK V++ T R ++LH DL
Sbjct: 2104 RGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMG 2283
Query: 359 TSPVLSTAGHRYYVLFLDDHTDFLWTFPISKKSQVYETFTTLATLIKTQFSANIKCLQCD 418
V S G RY + +DD + F W I +KS+ +E F L+ ++ + IK ++ D
Sbjct: 2284 PMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSD 2463
Query: 419 NGREYDNDSFHRYCDANGLIFRFSCPHTSSQNGKAERKIRTINNMIRTLLAHSSVPPSFW 478
+GRE++N F +C + G+ FS T QNG ERK RT+ R +L +P + W
Sbjct: 2464 HGREFENSRFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLW 2643
Query: 479 HHALQMATYLLNILPRKTLQNDSPTQLLY---HRDPSYSHLRVFGCLCFPLFPSATINKL 535
A+ A Y+ N R TL+ +PT L R PS H +FG C+ L K+
Sbjct: 2644 AEAMNTACYIHN---RVTLRRGTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKM 2814
Query: 536 QPRSTPCVFLGYPMNHRGYKCYDLSHRKILISRHVIFDETRFPFADLSLTPAPSYECFTE 595
P+S +FLGY N R Y+ ++ R ++ S +V+ D+ L+PA + E
Sbjct: 2815 DPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDD---------LSPARKKDV-EE 2964
Query: 596 DLPPSLIHHWQTVSSRPPDPPVQPSSPTDSSPPLSALVSPTASPSPLPLPPVPP----AP 651
D+ S + S + + +DS+ S + P S P
Sbjct: 2965 DVRTSGDNVADAAKSG------ENAENSDSATDESNINQPDKRSSTRIQKMHPKELIIGD 3126
Query: 652 PVRTMTTRSMHGISKPKKPFSLSVSIDDPSISPLPHNPKQALSDPNWKSAMQSEFNALIR 711
P R +TTRS F + P N K+AL+D W +AMQ E R
Sbjct: 3127 PNRGVTTRSREVEIVSNSCFVSKIE---------PKNVKEALTDEFWINAMQEELEQFKR 3279
Query: 712 SNTWELVPRPCDVNVIRCMWIFRHKKQSNGLFERYKARLVGDGRSQIAGVDCDETFSPVV 771
+ WELVPRP NVI WIF++K G+ R KARLV G +QI GVD DETF+PV
Sbjct: 3280 NEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVA 3459
Query: 772 KPATIRTVLSIALSRSWPIHQLDVQNAFLHGDLHETIYMHHPLGFRDPHHPDYVCRLKKS 831
+ +IR +L +A + ++Q+DV++AFL+G L+E +Y+ P GF DP HPD+V RLKK+
Sbjct: 3460 RLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVYRLKKA 3639
Query: 832 LYGLKQAPRAWYQRFADYVSSIGFRHSTSDHSLFIFRQGSDIAYILLYVDDIILVASSHD 891
LYGLKQAPRAWY+R ++++ G+R D +LF+ + ++ +YVDDI+ S++
Sbjct: 3640 LYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNE 3819
Query: 892 LRKSFMALLASEFAMKDLGPLSYFLGIAVTRHAGGLFLSQSTYATEIIARAGMASCNPSA 951
+ + F+ + SEF M +G L+YFLG+ V + +FLSQS YA I+ + GM + +
Sbjct: 3820 MLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKFGMENASHKR 3999
Query: 952 TPVDTKQKLSSSSGTPCEDASLYRSLDGALQYLTFTRPDISYAVQQVCLHMHAPHTKHML 1011
TP T KLS D SLYRS+ G+L YLT +RPDI+YAV + P H+
Sbjct: 4000 TPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLT 4179
Query: 1012 ALKRVLRYVRGTLTYGLHLYPSPVETLVSYTDADWGGCPDTRRSTSGYCVFLGDNLISWS 1071
+KR+L+YV GT YG+ LV Y DADW G D R+STSG C +LG+NLISW
Sbjct: 4180 QVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGGCFYLGNNLISWF 4359
Query: 1072 SKRQPTLSRSSAEAEYRGVANVVSESCWIRNLLLELHFPLSQATLVHCDNVSAIYLSGNP 1131
SK+Q +S S+AEAEY + S+ W++ +L E + TL +CDN+SAI +S NP
Sbjct: 4360 SKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTL-YCDNMSAINISKNP 4536
Query: 1132 VHHQRTKHIEMDIHFVREKVVRGQARVLHVPSRHQIADIFTKGLPRLLFDDFRSSLSV 1189
V H RTKHI++ H++R+ V + HV + QIADIFTK L F+ R L +
Sbjct: 4537 VQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDANQFEKLRGKLGI 4710
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 583 bits (1502), Expect = e-166
Identities = 375/1071 (35%), Positives = 546/1071 (50%), Gaps = 31/1071 (2%)
Frame = +1
Query: 150 IAAAMHTMSLTPPDTPWYMDTGASSHTAASQGTLTSYSNLSHLNQKLIVGSGQGIPIQGS 209
++ +HT WY+D+G S H + L + S + G G I G
Sbjct: 1639 VSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCS--TSYVTFGDGSKGKITGM 1812
Query: 210 G---YTSIPTPHKPLALNHVLHTPRIIKNLISVRQLTTDNNVSVSFDPFGFSVSDFKTGM 266
G + +P+ LN VL + NLIS+ QL D +V+F V++ K+ +
Sbjct: 1813 GKLVHDGLPS------LNKVLLVKGLTANLISISQLC-DEGFNVNFTKSECLVTNEKSEV 1971
Query: 267 PLLRCNSLGDLY---PVTRSSPFAGLASS-----VWHNRLGHPASSALNHLRNNKLIFCE 318
+ S + Y P S L S +WH R GH HLR K I +
Sbjct: 1972 LMKGSRSKDNCYLWTPQETSYSSTCLFSKEDEVKIWHQRFGHL------HLRGMKKIIDK 2133
Query: 319 PS---------RSSSVCDSCVLGKHVRLPFSS-SAIITLRPFDILHSDLW-TSPVLSTAG 367
+ +C C +GK V++ T R ++LH DL V S G
Sbjct: 2134 GAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGG 2313
Query: 368 HRYYVLFLDDHTDFLWTFPISKKSQVYETFTTLATLIKTQFSANIKCLQCDNGREYDNDS 427
RY + +DD + F W I +KS +E F L+ ++ + IK ++ D+GRE++N
Sbjct: 2314 KRYAYVVVDDFSRFTWVNFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSK 2493
Query: 428 FHRYCDANGLIFRFSCPHTSSQNGKAERKIRTINNMIRTLLAHSSVPPSFWHHALQMATY 487
F +C + G+ FS T QNG ERK RT+ R +L +P + W A+ A Y
Sbjct: 2494 FTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACY 2673
Query: 488 LLNILPRKTLQNDSPTQLLY---HRDPSYSHLRVFGCLCFPLFPSATINKLQPRSTPCVF 544
+ N R TL+ +PT L R P+ H +FG C+ L K+ P+S +F
Sbjct: 2674 IHN---RVTLRRGTPTTLYEIWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIF 2844
Query: 545 LGYPMNHRGYKCYDLSHRKILISRHVIFDETRFPFADLSLTPAPSYECFTEDLPPSLIHH 604
LGY N R Y+ ++ R ++ S +V+ D+ LTPA + ED+ S +
Sbjct: 2845 LGYSTNSRAYRVFNSRTRTVMESINVVVDD---------LTPARKKDV-EEDVRTSGDNV 2994
Query: 605 WQTVSSRPPDPPVQPSSPTDSSPPLSALVSPTASPSPLPLPPVPP-----APPVRTMTTR 659
T S + + +DS+ + P PS + + + P P R +TTR
Sbjct: 2995 ADTAKS------AENAENSDSATDEPNINQPDKRPS-IRIQKMHPKELIIGDPNRGVTTR 3153
Query: 660 SMHGISKPKKPFSLSVSIDDPSISPL-PHNPKQALSDPNWKSAMQSEFNALIRSNTWELV 718
S + + + +S + P N K+AL+D W +AMQ E R+ WELV
Sbjct: 3154 SRE----------IEIVSNSCFVSKIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELV 3303
Query: 719 PRPCDVNVIRCMWIFRHKKQSNGLFERYKARLVGDGRSQIAGVDCDETFSPVVKPATIRT 778
PRP NVI WIF++K G+ R KARLV G +QI GVD DETF+PV + +IR
Sbjct: 3304 PRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRL 3483
Query: 779 VLSIALSRSWPIHQLDVQNAFLHGDLHETIYMHHPLGFRDPHHPDYVCRLKKSLYGLKQA 838
+L +A + ++Q+DV++AFL+G L+E Y+ P GF DP HPD+V RLKK+LYGLKQA
Sbjct: 3484 LLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQA 3663
Query: 839 PRAWYQRFADYVSSIGFRHSTSDHSLFIFRQGSDIAYILLYVDDIILVASSHDLRKSFMA 898
PRAWY+R ++++ G+R D +LF+ + ++ +YVDDI+ S+++ + F+
Sbjct: 3664 PRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQ 3843
Query: 899 LLASEFAMKDLGPLSYFLGIAVTRHAGGLFLSQSTYATEIIARAGMASCNPSATPVDTKQ 958
+ SEF M +G L+YFLG+ V + +FLSQS YA I+ + GM + + TP T
Sbjct: 3844 QMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHL 4023
Query: 959 KLSSSSGTPCEDASLYRSLDGALQYLTFTRPDISYAVQQVCLHMHAPHTKHMLALKRVLR 1018
KLS D SLYRS+ G+L YLT +RPDI+YAV + P H+ +KR+L+
Sbjct: 4024 KLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLNQVKRILK 4203
Query: 1019 YVRGTLTYGLHLYPSPVETLVSYTDADWGGCPDTRRSTSGYCVFLGDNLISWSSKRQPTL 1078
YV GT YG+ LV Y DADW G D R+STSG C +LG NLISW SK+Q +
Sbjct: 4204 YVNGTSDYGIMYCHCSDSMLVGYCDADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCV 4383
Query: 1079 SRSSAEAEYRGVANVVSESCWIRNLLLELHFPLSQATLVHCDNVSAIYLSGNPVHHQRTK 1138
S S+AEAEY + S+ W++ +L E + TL +CDN+SAI +S NPV H RTK
Sbjct: 4384 SLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTL-YCDNMSAINISKNPVQHSRTK 4560
Query: 1139 HIEMDIHFVREKVVRGQARVLHVPSRHQIADIFTKGLPRLLFDDFRSSLSV 1189
HI++ H++R+ V + HV + QIADIFTK L F+ R L +
Sbjct: 4561 HIDIRHHYIRDLVDDKVITLEHVDTEEQIADIFTKALDANQFEKLRGKLGI 4713
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 153 bits (386), Expect = 5e-37
Identities = 72/138 (52%), Positives = 93/138 (67%)
Frame = +2
Query: 1038 LVSYTDADWGGCPDTRRSTSGYCVFLGDNLISWSSKRQPTLSRSSAEAEYRGVANVVSES 1097
L Y DADW GCP RRSTSGYCVF+G NL+SW SK+Q ++RSSAEAEYR +A V E
Sbjct: 17 LSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCEL 196
Query: 1098 CWIRNLLLELHFPLSQATLVHCDNVSAIYLSGNPVHHQRTKHIEMDIHFVREKVVRGQAR 1157
WI+ L EL F ++CDN +A++++ NPV H+RTKHIE+D HF+REK++ +
Sbjct: 197 MWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEIV 376
Query: 1158 VLHVPSRHQIADIFTKGL 1175
+ S Q DI TK L
Sbjct: 377 TEFIGSNDQPVDILTKSL 430
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 141 bits (355), Expect = 2e-33
Identities = 80/222 (36%), Positives = 125/222 (56%), Gaps = 4/222 (1%)
Frame = +1
Query: 970 DASLYRSLDGALQYLTFTRPDISYAVQQVCLHMHAPHTKHMLALKRVLRYVRGTLTYGLH 1029
D + +R L G+L+YL +RP+I +AV + M P HM A KRVLR ++GT+ G+
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGV- 186
Query: 1030 LYPSPVET----LVSYTDADWGGCPDTRRSTSGYCVFLGDNLISWSSKRQPTLSRSSAEA 1085
L+P ++ L+ YTD+DW P+ +ST GY D ++ SSK+Q ++ S+ EA
Sbjct: 187 LFPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEA 366
Query: 1086 EYRGVANVVSESCWIRNLLLELHFPLSQATLVHCDNVSAIYLSGNPVHHQRTKHIEMDIH 1145
EY + ++ W+ NLL EL + + DN SAI L+ +P H R+KHIE+ H
Sbjct: 367 EYVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFH 546
Query: 1146 FVREKVVRGQARVLHVPSRHQIADIFTKGLPRLLFDDFRSSL 1187
++R++V +G V + + Q+AD+ TK + F S L
Sbjct: 547 YIRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>CO982036
Length = 674
Score = 140 bits (353), Expect = 3e-33
Identities = 87/204 (42%), Positives = 114/204 (55%), Gaps = 3/204 (1%)
Frame = -2
Query: 875 YILLYVDDIILVASSHDLRKSFMALLASEFAMKDLGPLSYFLGIAVTRHAGGLFLSQSTY 934
Y+L+YVD II+ SS L ++ + L S F +K LG L YF+ I V + L S T
Sbjct: 649 YLLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEV-KSMPDLLFSLRTS 476
Query: 935 ATEIIARAGMASCNPSATPVDTKQKLSSSSGTPCEDASLYRSLDGALQYLTFTRPDISYA 994
EI R P ++P+ T KLS S + YRS+ GALQY T RP+IS+A
Sbjct: 475 IFEIFCRKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVIRPEISFA 296
Query: 995 VQQVCLHMHAPHTKHMLALKRVLRYVRGTLTYGLHLYP---SPVETLVSYTDADWGGCPD 1051
V +VC M P H +KR+LRY++G+L+YGL L P S + + DADW D
Sbjct: 295 VNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCDADWASAVD 116
Query: 1052 TRRSTSGYCVFLGDNLISWSSKRQ 1075
+RSTSG VFLG NLISW +Q
Sbjct: 115 DKRSTSGAAVFLGPNLISWWXXKQ 44
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 139 bits (351), Expect = 6e-33
Identities = 65/150 (43%), Positives = 101/150 (67%)
Frame = -3
Query: 701 AMQSEFNALIRSNTWELVPRPCDVNVIRCMWIFRHKKQSNGLFERYKARLVGDGRSQIAG 760
AMQ E N R+N W+LV +P + VI W+FR+K +G+ R KARLV G +Q G
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 761 VDCDETFSPVVKPATIRTVLSIALSRSWPIHQLDVQNAFLHGDLHETIYMHHPLGFRDPH 820
+D +ET++PV + IR +L+ ++ ++Q+DV++AFL+G + E +Y+ P GF P
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 821 HPDYVCRLKKSLYGLKQAPRAWYQRFADYV 850
P +V +L+K+LYGLKQAPRAWY+R ++++
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFL 9
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 139 bits (349), Expect = 1e-32
Identities = 92/283 (32%), Positives = 139/283 (48%), Gaps = 4/283 (1%)
Frame = +2
Query: 196 LIVGSGQGIPIQGSGYTSIPTPHKPLALNHVLHTPRIIKNLISVRQLTTDNNVSVSFDPF 255
+ + G + G G+ S P L+LN V+ N+ S+ QLT N SV+FD
Sbjct: 20 ITLADGSRVVATGIGHVS---PTSSLSLNSVVFILGCPFNITSLSQLTRFRNCSVTFDAN 190
Query: 256 GFSVSDFKTGMPL---LRCNSLGDLYPVTRSSPFAGLASSVWHNRLGHPASSALNHLRNN 312
F + + TG + + + L L P A + + H RLGHP HL
Sbjct: 191 SFVIQECGTGWTIGVGIESHGLYYLKPNLSWVCSAVTSPKLLHERLGHP------HLSKL 352
Query: 313 KLIFCEPSRSSSV-CDSCVLGKHVRLPFSSSAIITLRPFDILHSDLWTSPVLSTAGHRYY 371
K++ + + C+SC LGKHVR PF ++H D+W +S+ +RY+
Sbjct: 353 KIMVPSLEKIKDLFCESCQLGKHVRSSXRHVESRVDSPFLVIHXDIWGPNRVSSMSYRYF 532
Query: 372 VLFLDDHTDFLWTFPISKKSQVYETFTTLATLIKTQFSANIKCLQCDNGREYDNDSFHRY 431
V F+D+ + F + ++S++ +F T IKTQF IK L+ DN +EY + +
Sbjct: 533 VTFIDEFSQCTRVFLMKERSEIL-SFLTSVNKIKTQFGKTIKILRSDNAKEYFSSVISPF 709
Query: 432 CDANGLIFRFSCPHTSSQNGKAERKIRTINNMIRTLLAHSSVP 474
A G++ +FSCPHT QN AERK R + RTLL H++ P
Sbjct: 710 XSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANEP 838
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 132 bits (333), Expect = 7e-31
Identities = 64/133 (48%), Positives = 84/133 (63%)
Frame = -2
Query: 718 VPRPCDVNVIRCMWIFRHKKQSNGLFERYKARLVGDGRSQIAGVDCDETFSPVVKPATIR 777
VP P + C W++ K G +R KARLV G +Q+ G+D +TFSPV K T+R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 778 TVLSIALSRSWPIHQLDVQNAFLHGDLHETIYMHHPLGFRDPHHPDYVCRLKKSLYGLKQ 837
L++A WP+HQLD++NAFLHGDL E IYM P GF VC+L +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 838 APRAWYQRFADYV 850
+PRAW+ +F+ V
Sbjct: 46 SPRAWFGKFSHVV 8
>BU548243
Length = 599
Score = 129 bits (325), Expect = 6e-30
Identities = 66/147 (44%), Positives = 91/147 (61%)
Frame = -1
Query: 1043 DADWGGCPDTRRSTSGYCVFLGDNLISWSSKRQPTLSRSSAEAEYRGVANVVSESCWIRN 1102
DA W D RST G +FLG NLISW S++Q ++SS EAEYR +A +E WI+
Sbjct: 587 DAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELTWIQA 408
Query: 1103 LLLELHFPLSQATLVHCDNVSAIYLSGNPVHHQRTKHIEMDIHFVREKVVRGQARVLHVP 1162
LL+EL P + ++ CDN SA+ ++ N V H RTKH+E+D+ FV EKV+ Q ++ H+P
Sbjct: 407 LLMELQIPFTPPVIL-CDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQIFHIP 231
Query: 1163 SRHQIADIFTKGLPRLLFDDFRSSLSV 1189
+ Q A I TK L F +S L+V
Sbjct: 230 ALDQWAGILTKPLSSARFTFLKSKLTV 150
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 129 bits (324), Expect = 8e-30
Identities = 74/180 (41%), Positives = 101/180 (56%), Gaps = 1/180 (0%)
Frame = +1
Query: 740 NGLFERYKARLVGDGRSQIAGVDCDETFSPVVKPATIRTVLSIALSRSWPIHQLDVQNAF 799
+G +++KARLV +Q+ G D TFSPV K A + + S+A+ WP+ LD +NAF
Sbjct: 25 SGTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAF 204
Query: 800 LHGDLHETIYMHHPLGF-RDPHHPDYVCRLKKSLYGLKQAPRAWYQRFADYVSSIGFRHS 858
LHG L E +YM PLGF + VC+L +S YGLKQ+PRAW F ++I +
Sbjct: 205 LHGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAW--PFLYCGAAIWYDSH 378
Query: 859 TSDHSLFIFRQGSDIAYILLYVDDIILVASSHDLRKSFMALLASEFAMKDLGPLSYFLGI 918
+DHS+F Y+++YVDDI + S L +F KDLG L YFLGI
Sbjct: 379 EADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 129 bits (324), Expect = 8e-30
Identities = 60/129 (46%), Positives = 80/129 (61%)
Frame = +3
Query: 685 LPHNPKQALSDPNWKSAMQSEFNALIRSNTWELVPRPCDVNVIRCMWIFRHKKQSNGLFE 744
+P ++AL P W+ AM E AL + TWELVP P + C W++ K NG +
Sbjct: 18 VPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGKVD 197
Query: 745 RYKARLVGDGRSQIAGVDCDETFSPVVKPATIRTVLSIALSRSWPIHQLDVQNAFLHGDL 804
R KARLV G +Q+ G++ +TFSPV T+R L++A R WP+HQLD++NAFLHGDL
Sbjct: 198 RLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHGDL 377
Query: 805 HETIYMHHP 813
E IYM P
Sbjct: 378 EEDIYMEQP 404
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 125 bits (313), Expect = 1e-28
Identities = 77/195 (39%), Positives = 113/195 (57%), Gaps = 7/195 (3%)
Frame = +3
Query: 1007 TKHMLALKRVLRYVRGTLTYGLHLYPSPVETLVSYTDADWGGCPDTRRSTSGYCVFLGDN 1066
T+ + A RVL+Y++G GL ++ ++DADW C D+ +S + YC FLG +
Sbjct: 3 TRPLCAATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSS 182
Query: 1067 LISWSSKRQPTLSR--SSAEAEYRGVANVVSESCWIRNLLLELHFPLSQATLVHCDNVSA 1124
LISW +K+Q T+SR SS+EA+YR + + E W+ LL +LH TL++CDN SA
Sbjct: 183 LISWKAKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLH-----VTLIYCDNQSA 347
Query: 1125 IY-LSGNPVHHQRTKHIEMDIHFVREKVVRGQAR-VLHVPSRHQIADIFTKGLPRLLFDD 1182
+ L ++H + +E+D H VREK +G +L V S +Q+ADIFTK L LF
Sbjct: 348 LQ*LPIKVIYHGQ---LEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSS 518
Query: 1183 FRSSLSVGE---PPA 1194
S L + + PPA
Sbjct: 519 NLSKLGLSDIFLPPA 563
>TC232995
Length = 1009
Score = 124 bits (311), Expect = 2e-28
Identities = 66/170 (38%), Positives = 96/170 (55%)
Frame = +2
Query: 810 MHHPLGFRDPHHPDYVCRLKKSLYGLKQAPRAWYQRFADYVSSIGFRHSTSDHSLFIFRQ 869
+ P GF P++V +L+K+LYGLKQAPRAWY+R ++++ F D +LFI R+
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRK 181
Query: 870 GSDIAYILLYVDDIILVASSHDLRKSFMALLASEFAMKDLGPLSYFLGIAVTRHAGGLFL 929
+DI + +YVDDII +++ L K F + SEF M +G L YFLG+ + + G+F+
Sbjct: 182 HNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFI 361
Query: 930 SQSTYATEIIARAGMASCNPSATPVDTKQKLSSSSGTPCEDASLYRSLDG 979
+QS Y E+I R GM S +TP+ T L D YR G
Sbjct: 362 NQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIG 511
>CO983516
Length = 724
Score = 123 bits (308), Expect = 6e-28
Identities = 55/122 (45%), Positives = 84/122 (68%)
Frame = +2
Query: 763 CDETFSPVVKPATIRTVLSIALSRSWPIHQLDVQNAFLHGDLHETIYMHHPLGFRDPHHP 822
CD+ F PV + +IR +L +A + ++Q+DV++AFL+G L+E +Y+ P GF DP HP
Sbjct: 353 CDKEFHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHP 532
Query: 823 DYVCRLKKSLYGLKQAPRAWYQRFADYVSSIGFRHSTSDHSLFIFRQGSDIAYILLYVDD 882
D+V RLKK+LYGLKQAPRAWY+R + ++ G+R D +LF+ + ++ +YVDD
Sbjct: 533 DHVYRLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDD 712
Query: 883 II 884
I+
Sbjct: 713 IV 718
>TC221132 weakly similar to UP|O23529 (O23529) RETROTRANSPOSON like protein,
partial (5%)
Length = 799
Score = 77.0 bits (188), Expect(3) = 6e-28
Identities = 39/96 (40%), Positives = 56/96 (57%)
Frame = +1
Query: 1002 MHAPHTKHMLALKRVLRYVRGTLTYGLHLYPSPVETLVSYTDADWGGCPDTRRSTSGYCV 1061
M P M A KRVLRY++GT+ +GL L SP + L ++ DA+W RST Y V
Sbjct: 142 MKDPTKIRMQATKRVLRYLKGTIDFGLQLRSSPDQHLRAFYDANWVDNTSDIRSTGAYVV 321
Query: 1062 FLGDNLISWSSKRQPTLSRSSAEAEYRGVANVVSES 1097
+ G ++ISWS K+Q + +SS + EY + + ES
Sbjct: 322 YFGLSVISWSCKKQSIIDKSSTKVEYHKITTTIIES 429
Score = 54.7 bits (130), Expect(3) = 6e-28
Identities = 27/55 (49%), Positives = 34/55 (61%)
Frame = +2
Query: 1121 NVSAIYLSGNPVHHQRTKHIEMDIHFVREKVVRGQARVLHVPSRHQIADIFTKGL 1175
N+ A+YL NPV H KH+ +D FV++ V Q RV HVPS H D+FTK L
Sbjct: 467 NIGAMYLCANPVFHLCMKHLTIDHLFVQDLVANKQLRVSHVPSCH*HVDLFTKAL 631
Score = 32.3 bits (72), Expect(3) = 6e-28
Identities = 14/40 (35%), Positives = 24/40 (60%)
Frame = +3
Query: 962 SSSGTPCEDASLYRSLDGALQYLTFTRPDISYAVQQVCLH 1001
+S P D +Y L +LQYL+ T PDI++ + ++ +H
Sbjct: 24 ASGDVPSCDGIVYCQLVDSLQYLSLTCPDIAFPINKLSVH 143
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 117 bits (293), Expect(2) = 4e-27
Identities = 55/109 (50%), Positives = 74/109 (67%)
Frame = +2
Query: 1043 DADWGGCPDTRRSTSGYCVFLGDNLISWSSKRQPTLSRSSAEAEYRGVANVVSESCWIRN 1102
DA+W P R ST GYCV +G+NL+ W S + ++RSSAEAEY+ + E WI+
Sbjct: 8 DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187
Query: 1103 LLLELHFPLSQATLVHCDNVSAIYLSGNPVHHQRTKHIEMDIHFVREKV 1151
LL EL F +Q + CDN +A++++ NPV H+RTKHIE+D HFVREKV
Sbjct: 188 LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
Score = 23.9 bits (50), Expect(2) = 4e-27
Identities = 16/38 (42%), Positives = 19/38 (49%)
Frame = +3
Query: 1161 VPSRHQIADIFTKGLPRLLFDDFRSSLSVGEPPASTAG 1198
V S Q+A+IFTK L + S L E AST G
Sbjct: 363 VSSNDQLANIFTKSLRGPRIQNICSKLGAFELYAST*G 476
>AI855899 similar to GP|2244960|emb| retrotransposon like protein {Arabidopsis
thaliana}, partial (18%)
Length = 418
Score = 119 bits (299), Expect = 6e-27
Identities = 62/135 (45%), Positives = 84/135 (61%), Gaps = 3/135 (2%)
Frame = +1
Query: 944 MASCNPSATPVDTKQKLSSSSGTPCEDASLYRSLDGALQYLTFTRPDISYAVQQVCLHMH 1003
M CN +TP+ + KLS +A YR + GALQY+T TRP+I+Y V +V M
Sbjct: 13 MLDCNGISTPMVSSYKLSKFGSELLPNAHQYRDIVGALQYVTLTRPNIAYNVNKVSEFMS 192
Query: 1004 APHTKHMLALKRVLRYVRGTLTYGLHLYPSPVETLVS---YTDADWGGCPDTRRSTSGYC 1060
+P + L +KR+LRY+ GT+T GL L P+ ++ +S Y D DWG P RSTSG C
Sbjct: 193 SPLQSY*LTVKRILRYLSGTVTQGLLLQPAHMDAKISLRAYNDLDWGSDPAEMRSTSGSC 372
Query: 1061 VFLGDNLISWSSKRQ 1075
+F G NLI+WSSK+Q
Sbjct: 373 IFSGSNLIAWSSKKQ 417
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 117 bits (294), Expect = 2e-26
Identities = 58/139 (41%), Positives = 86/139 (61%)
Frame = +3
Query: 1051 DTRRSTSGYCVFLGDNLISWSSKRQPTLSRSSAEAEYRGVANVVSESCWIRNLLLELHFP 1110
D R+ST+G+ F+GD +W SK+QP ++ S+ EAEY + V + W+RNLL EL P
Sbjct: 9 DDRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKMP 188
Query: 1111 LSQATLVHCDNVSAIYLSGNPVHHQRTKHIEMDIHFVREKVVRGQARVLHVPSRHQIADI 1170
+ + DN SA+ L+ NPV H+++KHI+ HF+RE + + + ++ +V S+ Q ADI
Sbjct: 189 QEEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAADI 368
Query: 1171 FTKGLPRLLFDDFRSSLSV 1189
FTK L F RS L V
Sbjct: 369 FTKPLKLETFVKLRSMLGV 425
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 116 bits (291), Expect = 5e-26
Identities = 58/138 (42%), Positives = 84/138 (60%)
Frame = -2
Query: 830 KSLYGLKQAPRAWYQRFADYVSSIGFRHSTSDHSLFIFRQGSDIAYILLYVDDIILVASS 889
KSLYGLKQA R WY++ + + G+ S SD+SLF +G+ +L+YVDDIIL S
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241
Query: 890 HDLRKSFMALLASEFAMKDLGPLSYFLGIAVTRHAGGLFLSQSTYATEIIARAGMASCNP 949
D +L F +K+LG L YFLG+ V G+ +SQ Y +++ +G+ C P
Sbjct: 240 IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61
Query: 950 SATPVDTKQKLSSSSGTP 967
++TP+DT KL S++GTP
Sbjct: 60 ASTPLDTSIKLHSAAGTP 7
>BU549979
Length = 615
Score = 112 bits (280), Expect = 1e-24
Identities = 57/180 (31%), Positives = 97/180 (53%), Gaps = 2/180 (1%)
Frame = -1
Query: 1005 PHTKHMLALKRVLRYVRGTLTYGLHLYPSPVETLVSYTDADWGGCPDTRRSTSGYCVFLG 1064
P H K+V+RY++GT Y L + ++ Y+D+D+ GC D+RRSTSGY L
Sbjct: 591 PGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTSGYIFMLA 412
Query: 1065 DNLISWSSKRQPTLSRSSAEAEYRGVANVVSESCWIRNLLLELHF--PLSQATLVHCDNV 1122
D ++SW S +Q ++ S+ E E+ S W+++ + L +S+ ++CDN
Sbjct: 411 DGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPLKLYCDNF 232
Query: 1123 SAIYLSGNPVHHQRTKHIEMDIHFVREKVVRGQARVLHVPSRHQIADIFTKGLPRLLFDD 1182
+A++++ N R+KHI++ +RE+V + + HV + I D TKG+ F D
Sbjct: 231 AAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGMTPKNFKD 52
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.319 0.134 0.425
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 68,761,334
Number of Sequences: 63676
Number of extensions: 1452628
Number of successful extensions: 29919
Number of sequences better than 10.0: 1518
Number of HSP's better than 10.0 without gapping: 13422
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 21565
length of query: 1199
length of database: 12,639,632
effective HSP length: 108
effective length of query: 1091
effective length of database: 5,762,624
effective search space: 6287022784
effective search space used: 6287022784
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (29.3 bits)
Lotus: description of TM0490.3
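
Note on the search statistics: the effective lengths and search space reported above can be re-derived from the other values in this output. The following is a minimal sketch in Python, not the BLAST implementation. It assumes that the translated database length is the nucleotide letter count divided by 3 (which matches the reported "length of database"), and that the E-value follows the standard bit-score relation E = m * n * 2^(-S'). The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the statistics reported in this output.
QUERY_LENGTH = 1199        # "length of query"
DB_LETTERS = 37_918_896    # "Number of letters in database" (nucleotides)
DB_SEQUENCES = 63_676      # "Number of sequences in database"
HSP_LENGTH = 108           # "effective HSP length"


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    # Assumption: the translated database is measured in codons (letters // 3).
    db_len = db_letters // 3
    eff_query = query_len - hsp_len
    # The HSP length correction is applied once per database sequence.
    eff_db = db_len - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def expect_from_bits(bit_score, search_space):
    """Approximate E-value for a bit score: E = search_space * 2**(-bit_score)."""
    return search_space * 2.0 ** (-bit_score)


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, HSP_LENGTH
)
print(eff_query, eff_db, space)  # 1091 5762624 6287022784
print(expect_from_bits(153, space))
```

The three printed integers agree with the reported effective length of query, effective length of database, and effective search space. For the 153-bit hit (TC231899) the relation gives roughly 5e-37, in line with the reported Expect value. Agreement for other hits is only approximate because the displayed bit scores are rounded.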