
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0133.5
(1481 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 512 e-145
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 511 e-145
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 184 3e-46
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 165 1e-40
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 161 2e-39
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 152 8e-37
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 152 1e-36
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 152 1e-36
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 147 3e-35
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 145 1e-34
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 139 1e-32
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 121 1e-29
BM307983 127 3e-29
BU548243 127 5e-29
TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 ... 124 3e-28
BU764568 103 2e-26
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 117 5e-26
BE474381 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, pa... 89 5e-26
BU549979 115 1e-25
BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberos... 113 5e-25
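The summary lines above share one shape: a sequence identifier, an optional description, then the bit score and E-value as the last two whitespace-separated fields. A minimal Python sketch for reading them is shown here. The function name `parse_hit_line` is hypothetical, and treating an E-value printed without a mantissa (such as `e-145`) as `1e-145` is an assumption about this report's formatting.

```python
def parse_hit_line(line):
    """Split one summary line into (identifier, description, bits, evalue).

    Assumes the last two whitespace-separated fields are the bit score
    and the E-value, as in the hit list of this report.
    """
    head, score, evalue = line.rsplit(None, 2)
    # Assumption: an E-value written as "e-145" stands for "1e-145".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    # The identifier is the first token; the rest (possibly empty) is the description.
    identifier, _, description = head.partition(" ")
    return identifier, description, float(score), float(evalue)


hit = parse_hit_line(
    "TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 512 e-145"
)
print(hit[0], hit[2], hit[3])
```

Lines that carry no description, such as the `BM307983` entry, parse the same way and yield an empty description string.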
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 512 bits (1318), Expect = e-145
Identities = 316/1058 (29%), Positives = 524/1058 (48%), Gaps = 10/1058 (0%)
Frame = +1
Query: 425 WILDTGATDHICNTLSYFSSYKHVAPIPVSLPNGNIESATIKGSIQLSPSFILINVLFLP 484
W LD+G + H+ + + + + V+ +G+ G + L VL +
Sbjct: 1684 WYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKLVHDGLPSLNKVLLVK 1863
Query: 485 NFEFNLISVHKLVKSLRFRLTFSDDDCLIQDSNACKMIGTARAVRSLYILNNDSLSPFPS 544
NLIS+ +L F + F+ +CL+ + + ++ +R+ + Y+ S
Sbjct: 1864 GLTANLISISQLCDE-GFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQETS---- 2028
Query: 545 CNSVSTCKSQNLVCEFSPSVQNLWHYRLGHPSFVKGQSIKDL-----FPYVQYTQDHVCE 599
STC S +WH R GH + I D P ++ + +C
Sbjct: 2029 --YSSTCLSSK------EDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICG 2184
Query: 600 VCPIAKQKRLKFP-LSNSTSDCIFQMIHVDIWGPVSVLSLNGFSYFLTIVDDYSRYTWVY 658
C I KQ ++ L + T+ + +++H+D+ GP+ V SL G Y +VDD+SR+TWV
Sbjct: 2185 ECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVN 2364
Query: 659 LLKSKDEVQTLVKDFCAFVTNQFGVSVKIVRSDNGKEFVLS---QFYAEKGIIHHTSCVE 715
++ K E + K+ + + +K +RSD+G+EF S +F +GI H S
Sbjct: 2365 FIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAI 2544
Query: 716 TPQQNSIVERKHQHILNVARALLFQAHLPKIFWAHAIIHSIFLINRLPTPILDNQCPFQL 775
TPQQN IVERK++ + AR +L LP WA A+ + ++ NR+ +++
Sbjct: 2545 TPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEI 2724
Query: 776 LHKQLPDITFLKVFGSLCFASTLASHRTKFDHRAKRCVFLGFKSGTKGYLVYDLNTRDIS 835
+ P + +FGS C+ R K D ++ +FLG+ + ++ Y V++ TR +
Sbjct: 2725 WKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVM 2904
Query: 836 ISRNVIFHENIFPYKLHRDDHECESSTFPQIPCLPSEPFDYEYPIPPDNPVQIQPLSSGP 895
S NV+ +++ P + + + +S DN
Sbjct: 2905 ESINVVV-DDLSPARKKDVEEDVRTSG--------------------DNVADAAKSGENA 3021
Query: 896 SQNSEQPIVHRVSQRPRKQPTYLQDYHCTLAATSTVVPANSSSKGTSHPLSQVISYHKLD 955
+ ++Q ++ T +Q H ++ + T +++S
Sbjct: 3022 ENSDSATDESNINQPDKRSSTRIQKMH----PKELIIGDPNRGVTTRSREVEIVSNS--- 3180
Query: 956 PSYHAFIMNITTTVEPTRYSEAVKHECWRVAMN*EIEALERNNTWLLVDKPPDKTPIGCK 1015
F+ I EP EA+ E W AM E+E +RN W LV +P IG K
Sbjct: 3181 ----CFVSKI----EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTK 3336
Query: 1016 WVYRIKYKQDGTIDRYKARLVVKGYTQIEGIDFMDTFSPVAKMTTLRVLLSLVSTKNWFL 1075
W+++ K ++G I R KARLV +GYTQIEG+DF +TF+PVA++ ++R+LL + + L
Sbjct: 3337 WIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKL 3516
Query: 1076 HQLDVDNAFLHAKLDEEIYMSLPQGM-NSDKPNQVCLLQKSLYGLKQASRQWFSTLCQAL 1134
+Q+DV +AFL+ L+EE+Y+ P+G + P+ V L+K+LYGLKQA R W+ L + L
Sbjct: 3517 YQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVYRLKKALYGLKQAPRAWYERLTEFL 3696
Query: 1135 QGLGFTQSFADHTLYVKKSATGSFTALLLYVDDVLLAGNDMDEIKLVKHSLHQKFRIKDL 1194
G+ + D TL+VK+ A A + YVDD++ G + ++ + +F + +
Sbjct: 3697 TQQGYRKGGIDKTLFVKQDAENLMIAQI-YVDDIVFGGMSNEMLRHFVQQMQSEFEMSLV 3873
Query: 1195 GEAKFFLGLAIARSQKGIILNQRKYALELLSDSGLLGGKSATTPMDCSQKLSASSGTPLS 1254
GE +FLGL + + + I L+Q +YA ++ G+ TP KLS
Sbjct: 3874 GELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSV 4053
Query: 1255 DISSYRRLIGRLLYLTTTRPDIAYVVNQLSQFLSAPTDMHEAAAHRVLRYIKGSPGCGLF 1314
D S YR +IG LLYLT +RPDI Y V +++ + P H R+L+Y+ G+ G+
Sbjct: 4054 DQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSDYGIM 4233
Query: 1315 YPAASSTVLTAFSDSDGAGCVDTRKSITGYCMFLGSSLVSWRSKKQTTTSRSSCEAEYRA 1374
Y S+ +L + D+D AG D RKS +G C +LG++L+SW SKKQ S S+ EAEY A
Sbjct: 4234 YCHCSNPMLVGYCDADWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIA 4413
Query: 1375 MAATVCEVQWLIYLLQDLQVSQTSPVSMFCDNQSAMHIAHNPSYHERTKHIELDCHIVRE 1434
++ ++ W+ +L++ V Q ++++CDN SA++I+ NP H RTKHI++ H +R+
Sbjct: 4414 AGSSCSQLVWMKQMLKEYNVEQ-DVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRD 4590
Query: 1435 KITQGLVHLLPVTSSLQLADIFTKPLSPAPFRHIFSKL 1472
+ ++ L V + Q+ADIFTK L F + KL
Sbjct: 4591 LVDDKVITLKHVDTEEQIADIFTKALDANQFEKLRGKL 4704
Score = 31.2 bits (69), Expect = 3.6
Identities = 18/68 (26%), Positives = 27/68 (39%)
Frame = +1
Query: 257 PLIAARENSNDGRSSQSNSGYQSNSGYQSGSGNYHSNSGRNRYSNKKCSYYGKMGHTVED 316
P A R + +++ +G + G S R ++ +C Y GK GH
Sbjct: 1378 PKSAGRTTMTEFVPAKNRTGATMSQHRSRHHGMQQKKSKRKKW---RCHYCGKYGHIKPF 1548
Query: 317 CYKKHGFP 324
CY HG P
Sbjct: 1549 CYHLHGHP 1572
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 511 bits (1316), Expect = e-145
Identities = 323/1060 (30%), Positives = 535/1060 (50%), Gaps = 12/1060 (1%)
Frame = +1
Query: 425 WILDTGATDHICNTLSYFSSYKHVAPIPVSLPNGNIESATIKGSIQLSPSFILINVLFLP 484
W LD+G + H+ + + + + V+ +G+ T G + L VL +
Sbjct: 1687 WYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKLVHDGLPSLNKVLLVK 1866
Query: 485 NFEFNLISVHKLVKSLRFRLTFSDDDCLIQDSNACKMIGTARAVRSLYILNNDSLSPFPS 544
NLIS+ +L F + F+ +CL+ + + ++ +R+ + Y+ P
Sbjct: 1867 GLTANLISISQLCDE-GFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWT-------PQ 2022
Query: 545 CNSVSTCKSQNLVCEFSPSVQ-NLWHYRLGHPSFVKGQSIKDL-----FPYVQYTQDHVC 598
S S+ C FS + +WH R GH + I D P ++ + +C
Sbjct: 2023 ETSYSS------TCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRIC 2184
Query: 599 EVCPIAKQKRLKFP-LSNSTSDCIFQMIHVDIWGPVSVLSLNGFSYFLTIVDDYSRYTWV 657
C I KQ ++ L + T+ + +++H+D+ GP+ V SL G Y +VDD+SR+TWV
Sbjct: 2185 GECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWV 2364
Query: 658 YLLKSKDEVQTLVKDFCAFVTNQFGVSVKIVRSDNGKEFVLS---QFYAEKGIIHHTSCV 714
++ K + + K+ + + +K +RSD+G+EF S +F +GI H S
Sbjct: 2365 NFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAA 2544
Query: 715 ETPQQNSIVERKHQHILNVARALLFQAHLPKIFWAHAIIHSIFLINRLPTPILDNQCPFQ 774
TPQQN IVERK++ + AR +L LP WA A+ + ++ NR+ ++
Sbjct: 2545 ITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYE 2724
Query: 775 LLHKQLPDITFLKVFGSLCFASTLASHRTKFDHRAKRCVFLGFKSGTKGYLVYDLNTRDI 834
+ + P + +FGS C+ R K D ++ +FLG+ + ++ Y V++ TR +
Sbjct: 2725 IWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTV 2904
Query: 835 SISRNVIFHENIFPYKLHRDDHECESSTFPQIPCLPSEPFDYEYPIPPDNPVQIQPLSSG 894
S NV+ +++ P + + + +S DN V S+
Sbjct: 2905 MESINVVV-DDLTPARKKDVEEDVRTSG--------------------DN-VADTAKSAE 3018
Query: 895 PSQNSEQPIVHRVSQRPRKQPTY-LQDYHCTLAATSTVVPANSSSKGTSHPLSQVISYHK 953
++NS+ +P K+P+ +Q H ++ + T +++S
Sbjct: 3019 NAENSDSATDEPNINQPDKRPSIRIQKMH----PKELIIGDPNRGVTTRSREIEIVSNS- 3183
Query: 954 LDPSYHAFIMNITTTVEPTRYSEAVKHECWRVAMN*EIEALERNNTWLLVDKPPDKTPIG 1013
F+ I EP EA+ E W AM E+E +RN W LV +P IG
Sbjct: 3184 ------CFVSKI----EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIG 3333
Query: 1014 CKWVYRIKYKQDGTIDRYKARLVVKGYTQIEGIDFMDTFSPVAKMTTLRVLLSLVSTKNW 1073
KW+++ K ++G I R KARLV +GYTQIEG+DF +TF+PVA++ ++R+LL + +
Sbjct: 3334 TKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKF 3513
Query: 1074 FLHQLDVDNAFLHAKLDEEIYMSLPQG-MNSDKPNQVCLLQKSLYGLKQASRQWFSTLCQ 1132
L+Q+DV +AFL+ L+EE Y+ P+G ++ P+ V L+K+LYGLKQA R W+ L +
Sbjct: 3514 KLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERLTE 3693
Query: 1133 ALQGLGFTQSFADHTLYVKKSATGSFTALLLYVDDVLLAGNDMDEIKLVKHSLHQKFRIK 1192
L G+ + D TL+VK+ A A + YVDD++ G + ++ + +F +
Sbjct: 3694 FLTQQGYRKGGIDKTLFVKQDAENLMIAQI-YVDDIVFGGMSNEMLRHFVQQMQSEFEMS 3870
Query: 1193 DLGEAKFFLGLAIARSQKGIILNQRKYALELLSDSGLLGGKSATTPMDCSQKLSASSGTP 1252
+GE +FLGL + + + I L+Q KYA ++ G+ TP KLS
Sbjct: 3871 LVGELTYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGT 4050
Query: 1253 LSDISSYRRLIGRLLYLTTTRPDIAYVVNQLSQFLSAPTDMHEAAAHRVLRYIKGSPGCG 1312
D S YR +IG LLYLT +RPDI Y V +++ + P H R+L+Y+ G+ G
Sbjct: 4051 SVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYG 4230
Query: 1313 LFYPAASSTVLTAFSDSDGAGCVDTRKSITGYCMFLGSSLVSWRSKKQTTTSRSSCEAEY 1372
+ Y S ++L + D+D AG D RKS +G C +LG++L+SW SKKQ S S+ EAEY
Sbjct: 4231 IMYCHCSDSMLVGYCDADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEY 4410
Query: 1373 RAMAATVCEVQWLIYLLQDLQVSQTSPVSMFCDNQSAMHIAHNPSYHERTKHIELDCHIV 1432
A ++ ++ W+ +L++ V Q ++++CDN SA++I+ NP H RTKHI++ H +
Sbjct: 4411 IAAGSSCSQLVWMKQMLKEYNVEQ-DVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYI 4587
Query: 1433 REKITQGLVHLLPVTSSLQLADIFTKPLSPAPFRHIFSKL 1472
R+ + ++ L V + Q+ADIFTK L F + KL
Sbjct: 4588 RDLVDDKVITLEHVDTEEQIADIFTKALDANQFEKLRGKL 4707
Score = 31.2 bits (69), Expect = 3.6
Identities = 13/41 (31%), Positives = 20/41 (48%)
Frame = +1
Query: 284 QSGSGNYHSNSGRNRYSNKKCSYYGKMGHTVEDCYKKHGFP 324
Q S ++ + +++ +C Y GK GH CY HG P
Sbjct: 1453 QHRSRHHGTQQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHP 1575
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 184 bits (466), Expect = 3e-46
Identities = 104/188 (55%), Positives = 129/188 (68%), Gaps = 4/188 (2%)
Frame = +3
Query: 1297 AAHRVLRYIKGSPGCGLFYPAASSTVLTAFSDSDGAGCVDTRKSITGYCMFLGSSLVSWR 1356
AA RVL+Y+KG P GL + S + FSD+D A C+D+ KSIT YC FLGSSL+SW+
Sbjct: 18 AATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLISWK 197
Query: 1357 SKKQTTTSR--SSCEAEYRAMAATVCEVQWLIYLLQDLQVSQTSPVSMFCDNQSAMH-IA 1413
+KKQ T SR SS EA+YRA+ +T CE+QWL YLL+DL V+ ++CDNQSA+ +
Sbjct: 198 AKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLHVT-----LIYCDNQSALQ*LP 362
Query: 1414 HNPSYHERTKHIELDCHIVREKITQGLVH-LLPVTSSLQLADIFTKPLSPAPFRHIFSKL 1472
YH + +E+DCHIVREK QGL+H LLPV+SS QLADIFTK LSP F SKL
Sbjct: 363 IKVIYHGQ---LEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSNLSKL 533
Query: 1473 RLYDIHSP 1480
L DI P
Sbjct: 534 GLSDIFLP 557
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 165 bits (418), Expect = 1e-40
Identities = 76/161 (47%), Positives = 110/161 (68%)
Frame = +2
Query: 1320 STVLTAFSDSDGAGCVDTRKSITGYCMFLGSSLVSWRSKKQTTTSRSSCEAEYRAMAATV 1379
+T L+ + D+D AGC R+S +GYC+F+G +LVSW+SKKQT +RSS EAEYR+MA
Sbjct: 8 NTQLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVT 187
Query: 1380 CEVQWLIYLLQDLQVSQTSPVSMFCDNQSAMHIAHNPSYHERTKHIELDCHIVREKITQG 1439
CE+ W+ LQ+L+ + + ++CDNQ+A+HIA NP +HERTKHIE+DCH +REK+
Sbjct: 188 CELMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSK 367
Query: 1440 LVHLLPVTSSLQLADIFTKPLSPAPFRHIFSKLRLYDIHSP 1480
+ + S+ Q DI TK L + + SKL YD+++P
Sbjct: 368 EIVTEFIGSNDQPVDILTKSLRGPKIQIVCSKLGAYDLYAP 490
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 161 bits (408), Expect = 2e-39
Identities = 84/221 (38%), Positives = 137/221 (61%), Gaps = 3/221 (1%)
Frame = +1
Query: 1255 DISSYRRLIGRLLYLTTTRPDIAYVVNQLSQFLSAPTDMHEAAAHRVLRYIKGSPGCGLF 1314
D++ +RRLIG L YL +RP+I + V+ +S+F+ P H AA RVLR IKG+ G G+
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVL 189
Query: 1315 YPAASSTV---LTAFSDSDGAGCVDTRKSITGYCMFLGSSLVSWRSKKQTTTSRSSCEAE 1371
+P + + L ++DSD + KS GY + V+ SKKQ + S+CEAE
Sbjct: 190 FPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAE 369
Query: 1372 YRAMAATVCEVQWLIYLLQDLQVSQTSPVSMFCDNQSAMHIAHNPSYHERTKHIELDCHI 1431
Y A + C+ W++ LL++L++ + PV++ DN+SA+++A +P+ H R+KHIEL H
Sbjct: 370 YVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHY 549
Query: 1432 VREKITQGLVHLLPVTSSLQLADIFTKPLSPAPFRHIFSKL 1472
+R+++++G V + + QLAD+ TKP+ + F+ I S+L
Sbjct: 550 IRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 152 bits (385), Expect = 8e-37
Identities = 102/298 (34%), Positives = 158/298 (52%), Gaps = 3/298 (1%)
Frame = +2
Query: 450 PIPVSLPNGNIESATIKGSIQLSPSFILINVLFLPNFEFNLISVHKLVKSLRFRLTFSDD 509
P ++L +G+ AT G + + S L +V+F+ FN+ S+ +L + +TF +
Sbjct: 11 PYFITLADGSRVVATGIGHVSPTSSLSLNSVVFILGCPFNITSLSQLTRFRNCSVTFDAN 190
Query: 510 DCLIQDSNACKMIGTARAVRSLYILNNDSLSPFPSCNSVSTCKSQNLVCEFSPSVQNLWH 569
+IQ+ IG LY L +LS C++V++ K L H
Sbjct: 191 SFVIQECGTGWTIGVGIESHGLYYLK-PNLSWV--CSAVTSPK--------------LLH 319
Query: 570 YRLGHPSFVKGQSIKDLFPYVQYTQDHVCEVCPIAKQKRLKFPLSNSTSDCIFQMIHVDI 629
RLGHP K +K + P ++ +D CE C + K R S D F +IH DI
Sbjct: 320 ERLGHPHLSK---LKIMVPSLEKIKDLFCESCQLGKHVRSSXRHVESRVDSPFLVIHXDI 490
Query: 630 WGPVSVLSLNGFSYFLTIVDDYSRYTWVYLLKSKDEVQTLVKDFCAFVTNQFGVSVKIVR 689
WGP V S++ + YF+T +D++S+ T V+L+K + E+ + + T QFG ++KI+R
Sbjct: 491 WGPNRVSSMS-YRYFVTFIDEFSQCTRVFLMKERSEILSFLTSVNKIKT-QFGKTIKILR 664
Query: 690 SDNGKEF---VLSQFYAEKGIIHHTSCVETPQQNSIVERKHQHILNVARALLFQAHLP 744
SDN KE+ V+S F + +GI+H SC TPQQN I ERK++H++ AR LL A+ P
Sbjct: 665 SDNAKEYFSSVISPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANEP 838
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 152 bits (384), Expect = 1e-36
Identities = 69/134 (51%), Positives = 95/134 (70%)
Frame = +3
Query: 965 ITTTVEPTRYSEAVKHECWRVAMN*EIEALERNNTWLLVDKPPDKTPIGCKWVYRIKYKQ 1024
+++ P+ EA+ H WR AM E++ALE N TW LV PP KT +GC+WVY +K
Sbjct: 3 LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182
Query: 1025 DGTIDRYKARLVVKGYTQIEGIDFMDTFSPVAKMTTLRVLLSLVSTKNWFLHQLDVDNAF 1084
+G +DR KARLV KGYTQ+ GI++ DTFSPV +TT+R+ L++ + ++W LHQLD+ NAF
Sbjct: 183 NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362
Query: 1085 LHAKLDEEIYMSLP 1098
LH L+E+IYM P
Sbjct: 363 LHGDLEEDIYMEQP 404
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 152 bits (384), Expect = 1e-36
Identities = 71/126 (56%), Positives = 91/126 (71%), Gaps = 1/126 (0%)
Frame = -2
Query: 1003 VDKPPDKTPIGCKWVYRIKYKQDGTIDRYKARLVVKGYTQIEGIDFMDTFSPVAKMTTLR 1062
V PP KTP+GC+WVY +K G +DR KARLV KGYTQ+ GID+ DTFSPVAK+TT+R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 1063 VLLSLVSTKNWFLHQLDVDNAFLHAKLDEEIYMSLPQG-MNSDKPNQVCLLQKSLYGLKQ 1121
+ L++ + +W LHQLD+ NAFLH L+E+IYM P G + + VC L +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 1122 ASRQWF 1127
+ R WF
Sbjct: 46 SPRAWF 29
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 147 bits (371), Expect = 3e-35
Identities = 74/151 (49%), Positives = 100/151 (66%), Gaps = 1/151 (0%)
Frame = -3
Query: 985 VAMN*EIEALERNNTWLLVDKPPDKTPIGCKWVYRIKYKQDGTIDRYKARLVVKGYTQIE 1044
+AM E+ ERNN W LV+KP + IG KWV+R K + G I R KARLV KGY Q E
Sbjct: 461 IAMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEE 282
Query: 1045 GIDFMDTFSPVAKMTTLRVLLSLVSTKNWFLHQLDVDNAFLHAKLDEEIYMSLPQGMN-S 1103
GID+ +T++PVA++ +R+LL+ VS N+ L+Q+DV +AFL+ + EE+Y+ P G
Sbjct: 281 GIDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIP 102
Query: 1104 DKPNQVCLLQKSLYGLKQASRQWFSTLCQAL 1134
DKP V LQK+LYGLKQA R W+ + L
Sbjct: 101 DKPTHVYKLQKALYGLKQAPRAWYERISNFL 9
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 145 bits (367), Expect = 1e-34
Identities = 73/139 (52%), Positives = 102/139 (72%)
Frame = -2
Query: 1114 KSLYGLKQASRQWFSTLCQALQGLGFTQSFADHTLYVKKSATGSFTALLLYVDDVLLAGN 1173
KSLYGLKQASR+W+ L L G+ QS +D++L+ +FTALL+YVDD++LAG+
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGN-TFTALLVYVDDIILAGD 244
Query: 1174 DMDEIKLVKHSLHQKFRIKDLGEAKFFLGLAIARSQKGIILNQRKYALELLSDSGLLGGK 1233
+DE +K+ L F+IK+LG+ K+FLGL +A S+ GI ++QRKY L+LL DSGLLG K
Sbjct: 243 SIDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCK 64
Query: 1234 SATTPMDCSQKLSASSGTP 1252
A+TP+D S KL +++GTP
Sbjct: 63 PASTPLDTSIKLHSAAGTP 7
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 139 bits (349), Expect = 1e-32
Identities = 78/180 (43%), Positives = 108/180 (59%), Gaps = 2/180 (1%)
Frame = +1
Query: 1026 GTIDRYKARLVVKGYTQIEGIDFMDTFSPVAKMTTLRVLLSLVSTKNWFLHQLDVDNAFL 1085
GTID++KARLV K YTQ+ G D+ TFSPVAKM + +L S+ +W L LD NAFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 1086 HAKLDEEIYMSLPQGM--NSDKPNQVCLLQKSLYGLKQASRQWFSTLCQALQGLGFTQSF 1143
H L+EE+YM P G + N VC L +S YGLKQ+ R W C A + +
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAWPFLYCGA--AIWYDSHE 381
Query: 1144 ADHTLYVKKSATGSFTALLLYVDDVLLAGNDMDEIKLVKHSLHQKFRIKDLGEAKFFLGL 1203
ADH+++ S G L++YVDD+ + G+D I +K L +F+ KDLG+ ++FLG+
Sbjct: 382 ADHSVFYCHSPQGCI-YLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 121 bits (304), Expect(2) = 1e-29
Identities = 53/99 (53%), Positives = 70/99 (70%)
Frame = +2
Query: 1338 RKSITGYCMFLGSSLVSWRSKKQTTTSRSSCEAEYRAMAATVCEVQWLIYLLQDLQVSQT 1397
R S GYC+ +G +LV W+S K +RSS EAEY+AM CE+ W+ LLQ+L+ T
Sbjct: 38 RGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQLLQELKFGST 217
Query: 1398 SPVSMFCDNQSAMHIAHNPSYHERTKHIELDCHIVREKI 1436
+ + CDNQ+A+HIA NP +HERTKHIE+DCH VREK+
Sbjct: 218 QQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
Score = 28.1 bits (61), Expect(2) = 1e-29
Identities = 14/34 (41%), Positives = 24/34 (70%)
Frame = +3
Query: 1446 VTSSLQLADIFTKPLSPAPFRHIFSKLRLYDIHS 1479
V+S+ QLA+IFTK L ++I SKL +++++
Sbjct: 363 VSSNDQLANIFTKSLRGPRIQNICSKLGAFELYA 464
>BM307983
Length = 406
Score = 127 bits (320), Expect = 3e-29
Identities = 64/133 (48%), Positives = 86/133 (64%), Gaps = 2/133 (1%)
Frame = +2
Query: 1012 IGCKWVYRIKYKQDGTIDRYKARLVVKGYTQIEGIDFMDTFSPVAK-MTTLRVLLSLVST 1070
+GC+W+Y +KY D T+DRYKARLV KGY Q GID+ +TF+ K + + +
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 1071 KNWFLHQLDVDNAFLHAKLDEEIYMSLPQGMN-SDKPNQVCLLQKSLYGLKQASRQWFST 1129
W +HQ DV NAFLH L+EE+YM +P G S+ N+VC L+K+LYGLKQ+ R WF
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 1130 LCQALQGLGFTQS 1142
QA+ LG+ QS
Sbjct: 362 FTQAMLSLGYKQS 400
>BU548243
Length = 599
Score = 127 bits (318), Expect = 5e-29
Identities = 68/149 (45%), Positives = 94/149 (62%)
Frame = -1
Query: 1324 TAFSDSDGAGCVDTRKSITGYCMFLGSSLVSWRSKKQTTTSRSSCEAEYRAMAATVCEVQ 1383
TA D+ A VD +S G +FLG +L+SW S+KQ T++SS EAEYR++A T E+
Sbjct: 599 TALCDAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELT 420
Query: 1384 WLIYLLQDLQVSQTSPVSMFCDNQSAMHIAHNPSYHERTKHIELDCHIVREKITQGLVHL 1443
W+ LL +LQ+ T PV + CDN+SA+ IAHN +H RTKH+E+D V EK+ + +
Sbjct: 419 WIQALLMELQIPFTPPV-ILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQI 243
Query: 1444 LPVTSSLQLADIFTKPLSPAPFRHIFSKL 1472
+ + Q A I TKPLS A F + SKL
Sbjct: 242 FHIPALDQWAGILTKPLSSARFTFLKSKL 156
>TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 (Fragment), partial (21%)
Length = 912
Score = 124 bits (311), Expect = 3e-28
Identities = 51/103 (49%), Positives = 78/103 (75%)
Frame = -2
Query: 743 LPKIFWAHAIIHSIFLINRLPTPILDNQCPFQLLHKQLPDITFLKVFGSLCFASTLASHR 802
+P FW +A++H+ +LIN +PTP L N P++ LH +PDI+ L++FG LC+AST+ ++R
Sbjct: 911 MPPNFWNYALLHAAYLINCIPTPFLQNTSPYERLHGHIPDISHLRIFGCLCYASTIKANR 732
Query: 803 TKFDHRAKRCVFLGFKSGTKGYLVYDLNTRDISISRNVIFHEN 845
K + RA C+F+GFK TKGY++YDL++ +I SRNV+F+EN
Sbjct: 731 KKLEPRAHPCIFIGFKPNTKGYMLYDLHSHNIITSRNVVFYEN 603
>BU764568
Length = 420
Score = 103 bits (257), Expect(2) = 2e-26
Identities = 44/84 (52%), Positives = 64/84 (75%)
Frame = +3
Query: 1342 TGYCMFLGSSLVSWRSKKQTTTSRSSCEAEYRAMAATVCEVQWLIYLLQDLQVSQTSPVS 1401
+GYC+ +G +L+SW+SKKQ+ ++SS EAEYRAMA CE+ WL LL +L+ + + ++
Sbjct: 168 SGYCVLIGGNLISWKSKKQSVVAKSSAEAEYRAMALVTCELIWLKQLL*ELKFEEDTQMT 347
Query: 1402 MFCDNQSAMHIAHNPSYHERTKHI 1425
+ CDNQ+A+HIA NP +H RTKHI
Sbjct: 348 LICDNQAALHIASNPIFH*RTKHI 419
Score = 35.8 bits (81), Expect(2) = 2e-26
Identities = 19/55 (34%), Positives = 28/55 (50%)
Frame = +1
Query: 1284 SQFLSAPTDMHEAAAHRVLRYIKGSPGCGLFYPAASSTVLTAFSDSDGAGCVDTR 1338
SQFL++P H A +L+ K +PG GL Y + + +SD+D G R
Sbjct: 1 SQFLNSPCQDHWNAVS*ILK*TKSAPGKGLIYEDKGHSQIIGYSDAD*VGSPSDR 165
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 117 bits (292), Expect = 5e-26
Identities = 61/137 (44%), Positives = 84/137 (60%)
Frame = +3
Query: 1336 DTRKSITGYCMFLGSSLVSWRSKKQTTTSRSSCEAEYRAMAATVCEVQWLIYLLQDLQVS 1395
D RKS TG+ F+G + +W SKKQ + S+CEAEY A + VC WL LL++L++
Sbjct: 9 DDRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKMP 188
Query: 1396 QTSPVSMFCDNQSAMHIAHNPSYHERTKHIELDCHIVREKITQGLVHLLPVTSSLQLADI 1455
Q P+ + DN+SA+ +A NP +HE++KHI+ H +RE I + V L V S Q ADI
Sbjct: 189 QEEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAADI 368
Query: 1456 FTKPLSPAPFRHIFSKL 1472
FTKPL F + S L
Sbjct: 369 FTKPLKLETFVKLRSML 419
>BE474381 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial (20%)
Length = 406
Score = 89.4 bits (220), Expect(2) = 5e-26
Identities = 42/84 (50%), Positives = 62/84 (73%)
Frame = +3
Query: 1326 FSDSDGAGCVDTRKSITGYCMFLGSSLVSWRSKKQTTTSRSSCEAEYRAMAATVCEVQWL 1385
++D+D AG V R+S +GYC F+G +LVS SKKQ+ +RSS EAE+RA+A +CE W+
Sbjct: 156 YTDADWAGSVTDRRSTSGYCTFVGGNLVS*-SKKQSVVARSSAEAEFRALAHGICETLWV 332
Query: 1386 IYLLQDLQVSQTSPVSMFCDNQSA 1409
LLQ+L+V + P+ ++CDN+SA
Sbjct: 333 KKLLQELKVHSSPPIKLYCDNKSA 404
Score = 48.5 bits (114), Expect(2) = 5e-26
Identities = 22/39 (56%), Positives = 29/39 (73%)
Frame = +1
Query: 1276 IAYVVNQLSQFLSAPTDMHEAAAHRVLRYIKGSPGCGLF 1314
IA+ V+ +SQF+ AP H AA R+LRY+KGSPG GL+
Sbjct: 7 IAFAVSMVSQFMHAPGHEHLEAAFRILRYLKGSPGRGLY 123
>BU549979
Length = 615
Score = 115 bits (289), Expect = 1e-25
Identities = 63/194 (32%), Positives = 105/194 (53%), Gaps = 2/194 (1%)
Frame = -1
Query: 1283 LSQFLSAPTDMHEAAAHRVLRYIKGSPGCGLFYPAASSTVLTAFSDSDGAGCVDTRKSIT 1342
L ++ S P H A +V+RY++G+ L Y + + +SDSD AGCVD+R+S +
Sbjct: 612 LGRYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTS 433
Query: 1343 GYCMFLGSSLVSWRSKKQTTTSRSSCEAEYRAMAATVCEVQWLIYLLQDLQV--SQTSPV 1400
GY L +VSWRS KQT + S+ E E+ WL + L+V S + P+
Sbjct: 432 GYIFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPL 253
Query: 1401 SMFCDNQSAMHIAHNPSYHERTKHIELDCHIVREKITQGLVHLLPVTSSLQLADIFTKPL 1460
++CDN +A+ +A N R+KHI++ ++RE++ + V + V + L + D TK +
Sbjct: 252 KLYCDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGM 73
Query: 1461 SPAPFRHIFSKLRL 1474
+P F+ ++ L
Sbjct: 72 TPKNFKDHVVRMEL 31
>BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial
(21%)
Length = 421
Score = 113 bits (283), Expect = 5e-25
Identities = 54/137 (39%), Positives = 88/137 (63%)
Frame = +2
Query: 1142 SFADHTLYVKKSATGSFTALLLYVDDVLLAGNDMDEIKLVKHSLHQKFRIKDLGEAKFFL 1201
S ADH+++ ++ G L++YVDD+++ D +I +K L F+ KDL K+FL
Sbjct: 8 SEADHSVFYCHTSPGKCVYLMVYVDDIMITKKDATKIVQLKEHLFNHFQTKDLRYLKYFL 187
Query: 1202 GLAIARSQKGIILNQRKYALELLSDSGLLGGKSATTPMDCSQKLSASSGTPLSDISSYRR 1261
G+ +A+S G++++QRKYAL++L ++G+ + +PMD + KL A D YRR
Sbjct: 188 GIEVAQSGDGVVISQRKYALDILEETGMQNCRLVDSPMDPNLKLMAYQSEVYPDPERYRR 367
Query: 1262 LIGRLLYLTTTRPDIAY 1278
L+G+L+YLT TRPDI++
Sbjct: 368 LVGKLIYLTITRPDISF 418
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.326 0.138 0.426
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 72,661,035
Number of Sequences: 63676
Number of extensions: 1126813
Number of successful extensions: 8338
Number of sequences better than 10.0: 169
Number of HSP's better than 10.0 without gapping: 8075
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 8259
length of query: 1481
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1372
effective length of database: 5,698,948
effective search space: 7818956656
effective search space used: 7818956656
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 65 (29.6 bits)
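The search-space figures in the parameter block above can be reproduced with simple arithmetic, and the reported bit scores follow from the raw scores through the standard normalization S' = (lambda * S - ln K) / ln 2. The Python sketch below is a consistency check under stated assumptions, not BLAST's own code: the function names are hypothetical, the subtraction of the effective HSP length from the query and from each database sequence is inferred from the numbers in this report, and lambda and K are the rounded gapped values printed above, so results can differ from the report in the last digit.

```python
import math


def effective_lengths(query_len, db_len, num_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: the effective HSP length is subtracted once from the query
    and once per database sequence, which matches the figures in this report.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# All inputs are copied from the report: query length 1481, database length
# 12,639,632 (the 37,918,896 nucleotide letters divided by 3), 63,676
# sequences, and an effective HSP length of 109.
eff_q, eff_db, space = effective_lengths(1481, 12_639_632, 63_676, 109)
print(eff_q, eff_db, space)  # 1372 5698948 7818956656

# Raw score 1318 of the top hit, with the gapped lambda and K from the report.
print(round(bit_score(1318, 0.267, 0.0410)))  # 512
```

The three printed lengths agree with the `effective length of query`, `effective length of database`, and `effective search space` lines, and the converted score agrees with the 512 bits reported for TC204439.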