
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0366.1
(1346 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 505 e-143
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 502 e-142
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 189 5e-48
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 171 2e-42
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 171 2e-42
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 155 1e-37
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 154 2e-37
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 154 2e-37
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 147 2e-35
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 136 5e-32
BU764568 108 9e-31
CO982036 130 3e-30
CO981879 91 5e-30
BU549979 130 5e-30
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 129 9e-30
BM086359 124 2e-28
BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberos... 124 4e-28
BM307983 120 3e-27
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 119 1e-26
BU548243 117 3e-26
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 505 bits (1301), Expect = e-143
Identities = 312/971 (32%), Positives = 499/971 (51%), Gaps = 13/971 (1%)
Frame = +1
Query: 389 SSSSSTCFSIQHSSTSEINDFGHIIPPGALWHFRLGHLSHDRLL-------ALHTVQNSI 441
+S SSTC S S E+ +WH R GHL H R + A+ + N +
Sbjct: 2023 TSYSSTCLS---SKEDEVR----------IWHQRFGHL-HLRGMKKIIDKGAVRGIPN-L 2157
Query: 442 SVSKSIVCDVCHLAKQ-KRKMFTVSVSKAQKCFDLVHMDIWGPLAQASVHNHKYFLTVLD 500
+ + +C C + KQ K + + +L+HMD+ GP+ S+ +Y V+D
Sbjct: 2158 KIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVD 2337
Query: 501 DYSRFVWVVLLNNKGEVQQQVKNFITLVKTQFGQIVKAIRSDNGPEF---LLPAFYSAQG 557
D+SRF WV + K E + K ++ + ++K IRSD+G EF F +++G
Sbjct: 2338 DFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEG 2517
Query: 558 IVHQKSCVSTPQQNGRVERKHQHILNIARALLFQSKLPKKMWCYSVLHAVFLMNRIPSKL 617
I H+ S TPQQNG VERK++ + AR +L +LP +W ++ A ++ NR+ +
Sbjct: 2518 ITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRR 2697
Query: 618 LKNKSPYELLYREAVDLEMMKVFGSLCFATTLTNNRSKLDPRARKCIFLGYKQGMKGYVL 677
+ YE+ ++ +FGS C+ R K+DP++ IFLGY + Y +
Sbjct: 2698 GTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRV 2877
Query: 678 MDLITQEIFISRDVIFHEHVLPYKGSNSPAWTCLDLQTQAESTADKLNVASEIQSAKGDA 737
+ T+ + S +V+ + SPA D++ ++ D NVA +S G+
Sbjct: 2878 FNSRTRTVMESINVVVDDL--------SPARK-KDVEEDVRTSGD--NVADAAKS--GEN 3018
Query: 738 SVSTPCASIEDGV-QHEMPSGASIEDEIPNTQTVDEIPQPGSITAEGEVEVRRSTRPLKR 796
+ ++ A+ E + Q + S I+ P + + P G T EVE+
Sbjct: 3019 AENSDSATDESNINQPDKRSSTRIQKMHPKELIIGD-PNRGVTTRSREVEI--------- 3168
Query: 797 PVHLANFQFSALPIRSSTAHSIVHHYSDERLSTAHRAYALNIAQDQEPSTFAEANKDLHW 856
++ + I EP EA D W
Sbjct: 3169 --------------------------------VSNSCFVSKI----EPKNVKEALTDEFW 3240
Query: 857 REAMQAEIKALEKNGTWKLVDLPEGVKPIGNKWVYRVKRNVDGTLARYKARLVAKGYNQV 916
AMQ E++ ++N W+LV PEG IG KW+++ K N +G + R KARLVA+GY Q+
Sbjct: 3241 INAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQI 3420
Query: 917 EGLDYFDTFSPVAKLTTVRVILALAASQNWHLHQLDVDNAFLHGNLDEDVYMTIPAGVPS 976
EG+D+ +TF+PVA+L ++R++L +A + L+Q+DV +AFL+G L+E+VY+ P G
Sbjct: 3421 EGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFAD 3600
Query: 977 -VKPNQVCKLLKSLYGLKQASRKWYEKLSAHLETLGFKQTASDHSLFVKFQGSSFTGLLV 1035
P+ V +L K+LYGLKQA R WYE+L+ L G+++ D +LFVK + +
Sbjct: 3601 PTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQI 3780
Query: 1036 YVDDVILFGNTVTEFQLVKDSLHQAFGIKDLGVLKYFLGLEVAHSTSGISLCQRKYCLDL 1095
YVDD++ G + + + F + +G L YFLGL+V I L Q +Y ++
Sbjct: 3781 YVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNI 3960
Query: 1096 LQETGTLGSKPVATPLDPSIRLSQEQGKPYDDPAAYRRLVGRLLYLTTTRPDISHATQQL 1155
+++ G + TP ++LS+++ D + YR ++G LLYLT +RPDI++A
Sbjct: 3961 VKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVC 4140
Query: 1156 SQYMSNPMDGHFKAAQRVLRYLKASPGLGLLFPRNSTINIQGYSDADWAGCPDTRRSISG 1215
++Y +NP H +R+L+Y+ + G+++ S + GY DADWAG D R+S SG
Sbjct: 4141 ARYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTSG 4320
Query: 1216 YCFYIGRSLVSWKAKKQTTVSRSSNEAEYRALAYATCELQWLLYLLQDLKVTCTATPVLF 1275
CFY+G +L+SW +KKQ VS S+ EAEY A + +L W+ +L++ V L+
Sbjct: 4321 GCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVE-QDVMTLY 4497
Query: 1276 CDNQSALHIAANPVFHERTKHLDIDCHVVREKLQAGILKLLPIPTTLQVADVFTKALQPR 1335
CDN SA++I+ NPV H RTKH+DI H +R+ + ++ L + T Q+AD+FTKAL
Sbjct: 4498 CDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDAN 4677
Query: 1336 VFQGFATKLAM 1346
F+ KL +
Sbjct: 4678 QFEKLRGKLGI 4710
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 502 bits (1292), Expect = e-142
Identities = 300/944 (31%), Positives = 484/944 (50%), Gaps = 15/944 (1%)
Frame = +1
Query: 418 LWHFRLGHLSHDRLL-------ALHTVQNSISVSKSIVCDVCHLAKQ-KRKMFTVSVSKA 469
+WH R GHL H R + A+ + N + + + +C C + KQ K +
Sbjct: 2074 IWHQRFGHL-HLRGMKKIIDKGAVRGIPN-LKIEEGRICGECQIGKQVKMSHQKLQHQTT 2247
Query: 470 QKCFDLVHMDIWGPLAQASVHNHKYFLTVLDDYSRFVWVVLLNNKGEVQQQVKNFITLVK 529
+ +L+HMD+ GP+ S+ +Y V+DD+SRF WV + K + + K ++
Sbjct: 2248 SRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSDTFEVFKELSLRLQ 2427
Query: 530 TQFGQIVKAIRSDNGPEF---LLPAFYSAQGIVHQKSCVSTPQQNGRVERKHQHILNIAR 586
+ ++K IRSD+G EF F +++GI H+ S TPQQNG VERK++ + AR
Sbjct: 2428 REKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAAR 2607
Query: 587 ALLFQSKLPKKMWCYSVLHAVFLMNRIPSKLLKNKSPYELLYREAVDLEMMKVFGSLCFA 646
+L +LP +W ++ A ++ NR+ + + YE+ ++ +FGS C+
Sbjct: 2608 VMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPTVKHFHIFGSPCYI 2787
Query: 647 TTLTNNRSKLDPRARKCIFLGYKQGMKGYVLMDLITQEIFISRDVIFHEHVLPYKGSNSP 706
R K+DP++ IFLGY + Y + + T+ + S +V+ + + P + +
Sbjct: 2788 LADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDD-LTPARKKDVE 2964
Query: 707 AWTCLDLQTQAESTADKLNVASEIQ---SAKGDASVSTPCASIEDGVQHEMPSGASIEDE 763
D++T ++ AD A + SA + +++ P +Q P I D
Sbjct: 2965 E----DVRTSGDNVADTAKSAENAENSDSATDEPNINQPDKRPSIRIQKMHPKELIIGD- 3129
Query: 764 IPNTQTVDEIPQPGSITAEGEVEVRRSTRPLKRPVHLANFQFSALPIRSSTAHSIVHHYS 823
P G T E+E+
Sbjct: 3130 ----------PNRGVTTRSREIEI------------------------------------ 3171
Query: 824 DERLSTAHRAYALNIAQDQEPSTFAEANKDLHWREAMQAEIKALEKNGTWKLVDLPEGVK 883
++ + I EP EA D W AMQ E++ ++N W+LV PEG
Sbjct: 3172 -----VSNSCFVSKI----EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTN 3324
Query: 884 PIGNKWVYRVKRNVDGTLARYKARLVAKGYNQVEGLDYFDTFSPVAKLTTVRVILALAAS 943
IG KW+++ K N +G + R KARLVA+GY Q+EG+D+ +TF+PVA+L ++R++L +A
Sbjct: 3325 VIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACI 3504
Query: 944 QNWHLHQLDVDNAFLHGNLDEDVYMTIPAG-VPSVKPNQVCKLLKSLYGLKQASRKWYEK 1002
+ L+Q+DV +AFL+G L+E+ Y+ P G V P+ V +L K+LYGLKQA R WYE+
Sbjct: 3505 LKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYER 3684
Query: 1003 LSAHLETLGFKQTASDHSLFVKFQGSSFTGLLVYVDDVILFGNTVTEFQLVKDSLHQAFG 1062
L+ L G+++ D +LFVK + +YVDD++ G + + + F
Sbjct: 3685 LTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFE 3864
Query: 1063 IKDLGVLKYFLGLEVAHSTSGISLCQRKYCLDLLQETGTLGSKPVATPLDPSIRLSQEQG 1122
+ +G L YFLGL+V I L Q KY +++++ G + TP ++LS+++
Sbjct: 3865 MSLVGELTYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEA 4044
Query: 1123 KPYDDPAAYRRLVGRLLYLTTTRPDISHATQQLSQYMSNPMDGHFKAAQRVLRYLKASPG 1182
D + YR ++G LLYLT +RPDI++A ++Y +NP H +R+L+Y+ +
Sbjct: 4045 GTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSD 4224
Query: 1183 LGLLFPRNSTINIQGYSDADWAGCPDTRRSISGYCFYIGRSLVSWKAKKQTTVSRSSNEA 1242
G+++ S + GY DADWAG D R+S SG CFY+G +L+SW +KKQ VS S+ EA
Sbjct: 4225 YGIMYCHCSDSMLVGYCDADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEA 4404
Query: 1243 EYRALAYATCELQWLLYLLQDLKVTCTATPVLFCDNQSALHIAANPVFHERTKHLDIDCH 1302
EY A + +L W+ +L++ V L+CDN SA++I+ NPV H RTKH+DI H
Sbjct: 4405 EYIAAGSSCSQLVWMKQMLKEYNVE-QDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHH 4581
Query: 1303 VVREKLQAGILKLLPIPTTLQVADVFTKALQPRVFQGFATKLAM 1346
+R+ + ++ L + T Q+AD+FTKAL F+ KL +
Sbjct: 4582 YIRDLVDDKVITLEHVDTEEQIADIFTKALDANQFEKLRGKLGI 4713
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 189 bits (481), Expect = 5e-48
Identities = 100/182 (54%), Positives = 131/182 (71%), Gaps = 4/182 (2%)
Frame = +3
Query: 1169 AAQRVLRYLKASPGLGLLFPRNSTINIQGYSDADWAGCPDTRRSISGYCFYIGRSLVSWK 1228
AA RVL+YLK P GL F R S I I G+SDADWA C D+ +SI+ YCF++G SL+SWK
Sbjct: 18 AATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLISWK 197
Query: 1229 AKKQTTVSR--SSNEAEYRALAYATCELQWLLYLLQDLKVTCTATPVLFCDNQSALH-IA 1285
AKKQ TVSR SS+EA+YRAL TCELQWL YLL+DL VT +++CDNQSAL +
Sbjct: 198 AKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLHVT-----LIYCDNQSALQ*LP 362
Query: 1286 ANPVFHERTKHLDIDCHVVREKLQAGILK-LLPIPTTLQVADVFTKALQPRVFQGFATKL 1344
++H + L+IDCH+VREK Q G++ LLP+ ++ Q+AD+FTKAL P++F +KL
Sbjct: 363 IKVIYHGQ---LEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSNLSKL 533
Query: 1345 AM 1346
+
Sbjct: 534 GL 539
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 171 bits (434), Expect = 2e-42
Identities = 83/150 (55%), Positives = 106/150 (70%)
Frame = +2
Query: 1195 IQGYSDADWAGCPDTRRSISGYCFYIGRSLVSWKAKKQTTVSRSSNEAEYRALAYATCEL 1254
+ GY DADWAGCP RRS SGYC +IG +LVSWK+KKQT V+RSS EAEYR++A TCEL
Sbjct: 17 LSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCEL 196
Query: 1255 QWLLYLLQDLKVTCTATPVLFCDNQSALHIAANPVFHERTKHLDIDCHVVREKLQAGILK 1314
W+ LQ+L+ L+CDNQ+ALHIA+NPVFHERTKH++IDCH +REKL + +
Sbjct: 197 MWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEIV 376
Query: 1315 LLPIPTTLQVADVFTKALQPRVFQGFATKL 1344
I + Q D+ TK+L+ Q +KL
Sbjct: 377 TEFIGSNDQPVDILTKSLRGPKIQIVCSKL 466
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 171 bits (434), Expect = 2e-42
Identities = 86/139 (61%), Positives = 107/139 (76%)
Frame = -2
Query: 987 KSLYGLKQASRKWYEKLSAHLETLGFKQTASDHSLFVKFQGSSFTGLLVYVDDVILFGNT 1046
KSLYGLKQASRKWYEKL+ L G+ Q+ SD+SLF +G++FT LLVYVDD+IL G++
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241
Query: 1047 VTEFQLVKDSLHQAFGIKDLGVLKYFLGLEVAHSTSGISLCQRKYCLDLLQETGTLGSKP 1106
+ EF +K+ L AF IK+LG LKYFLGLEVAHS GI++ QRKYCLDLL+++G LG KP
Sbjct: 240 IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61
Query: 1107 VATPLDPSIRLSQEQGKPY 1125
+TPLD SI+L G PY
Sbjct: 60 ASTPLDTSIKLHSAAGTPY 4
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 155 bits (391), Expect = 1e-37
Identities = 78/151 (51%), Positives = 105/151 (68%), Gaps = 2/151 (1%)
Frame = -3
Query: 859 AMQAEIKALEKNGTWKLVDLPEGVKPIGNKWVYRVKRNVDGTLARYKARLVAKGYNQVEG 918
AMQ E+ E+N WKLV+ PE IG KWV+R K + G + R KARLVAKGYNQ EG
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 919 LDYFDTFSPVAKLTTVRVILALAASQNWHLHQLDVDNAFLHGNLDEDVYMTIPAG--VPS 976
+DY +T++PVA+L +R++LA + N+ L+Q+DV +AFL+G + E+VY+ P G +P
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 977 VKPNQVCKLLKSLYGLKQASRKWYEKLSAHL 1007
KP V KL K+LYGLKQA R WYE++S L
Sbjct: 98 -KPTHVYKLQKALYGLKQAPRAWYERISNFL 9
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 154 bits (390), Expect = 2e-37
Identities = 79/130 (60%), Positives = 94/130 (71%), Gaps = 1/130 (0%)
Frame = -2
Query: 876 VDLPEGVKPIGNKWVYRVKRNVDGTLARYKARLVAKGYNQVEGLDYFDTFSPVAKLTTVR 935
V LP G P+G +WVY VK G + R KARLVAKGY QV G+DY DTFSPVAKLTTVR
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 936 VILALAASQNWHLHQLDVDNAFLHGNLDEDVYMTIPAG-VPSVKPNQVCKLLKSLYGLKQ 994
+ LA+AA +W LHQLD+ NAFLHG+L+ED+YM P G V + VCKL +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 995 ASRKWYEKLS 1004
+ R W+ K S
Sbjct: 46 SPRAWFGKFS 17
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 154 bits (390), Expect = 2e-37
Identities = 76/128 (59%), Positives = 94/128 (73%)
Frame = +3
Query: 844 PSTFAEANKDLHWREAMQAEIKALEKNGTWKLVDLPEGVKPIGNKWVYRVKRNVDGTLAR 903
PST EA WR+AM E++ALE NGTW+LV LP G +G +WVY VK +G + R
Sbjct: 21 PSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGKVDR 200
Query: 904 YKARLVAKGYNQVEGLDYFDTFSPVAKLTTVRVILALAASQNWHLHQLDVDNAFLHGNLD 963
KARLVAKGY QV G++Y DTFSPV LTTVR+ LA+AA ++W LHQLD+ NAFLHG+L+
Sbjct: 201 LKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHGDLE 380
Query: 964 EDVYMTIP 971
ED+YM P
Sbjct: 381 EDIYMEQP 404
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 147 bits (372), Expect = 2e-35
Identities = 76/221 (34%), Positives = 130/221 (58%), Gaps = 3/221 (1%)
Frame = +1
Query: 1127 DPAAYRRLVGRLLYLTTTRPDISHATQQLSQYMSNPMDGHFKAAQRVLRYLKASPGLGLL 1186
D +RRL+G L YL +RP+I A +S++M P H +AA+RVLR +K + G G+L
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVL 189
Query: 1187 FP---RNSTINIQGYSDADWAGCPDTRRSISGYCFYIGRSLVSWKAKKQTTVSRSSNEAE 1243
FP ++ ++ GY+D+DW P+ +S GY F + V+ +KKQ ++ S+ EAE
Sbjct: 190 FPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAE 369
Query: 1244 YRALAYATCELQWLLYLLQDLKVTCTATPVLFCDNQSALHIAANPVFHERTKHLDIDCHV 1303
Y A + C+ W++ LL++LK+ L DN+SA+++A +P H R+KH+++ H
Sbjct: 370 YVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHY 549
Query: 1304 VREKLQAGILKLLPIPTTLQVADVFTKALQPRVFQGFATKL 1344
+R+++ G + + Q+AD+ TK +Q F+ ++L
Sbjct: 550 IRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 136 bits (343), Expect = 5e-32
Identities = 66/109 (60%), Positives = 79/109 (71%)
Frame = +2
Query: 1200 DADWAGCPDTRRSISGYCFYIGRSLVSWKAKKQTTVSRSSNEAEYRALAYATCELQWLLY 1259
DA+WA P R S GYC IG +LV WK+ K V+RSS EAEY+A+ ATCEL W+
Sbjct: 8 DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187
Query: 1260 LLQDLKVTCTATPVLFCDNQSALHIAANPVFHERTKHLDIDCHVVREKL 1308
LLQ+LK T L CDNQ+ALHIA+NPVFHERTKH++IDCH VREK+
Sbjct: 188 LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
>BU764568
Length = 420
Score = 108 bits (269), Expect(2) = 9e-31
Identities = 51/84 (60%), Positives = 63/84 (74%)
Frame = +3
Query: 1214 SGYCFYIGRSLVSWKAKKQTTVSRSSNEAEYRALAYATCELQWLLYLLQDLKVTCTATPV 1273
SGYC IG +L+SWK+KKQ+ V++SS EAEYRA+A TCEL WL LL +LK
Sbjct: 168 SGYCVLIGGNLISWKSKKQSVVAKSSAEAEYRAMALVTCELIWLKQLL*ELKFEEDTQMT 347
Query: 1274 LFCDNQSALHIAANPVFHERTKHL 1297
L CDNQ+ALHIA+NP+FH RTKH+
Sbjct: 348 LICDNQAALHIASNPIFH*RTKHI 419
Score = 45.4 bits (106), Expect(2) = 9e-31
Identities = 21/55 (38%), Positives = 32/55 (58%)
Frame = +1
Query: 1156 SQYMSNPMDGHFKAAQRVLRYLKASPGLGLLFPRNSTINIQGYSDADWAGCPDTR 1210
SQ++++P H+ A +L+ K++PG GL++ I GYSDAD G P R
Sbjct: 1 SQFLNSPCQDHWNAVS*ILK*TKSAPGKGLIYEDKGHSQIIGYSDAD*VGSPSDR 165
>CO982036
Length = 674
Score = 130 bits (328), Expect = 3e-30
Identities = 76/203 (37%), Positives = 117/203 (57%), Gaps = 3/203 (1%)
Frame = -2
Query: 1033 LLVYVDDVILFGNTVTEFQLVKDSLHQAFGIKDLGVLKYFLGLEVAHSTSGISLCQRKYC 1092
LLVYVD +I+ G++ T Q + L+ +F +K LG L YF+ +EV S + R
Sbjct: 646 LLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVK-SMPDLLFSLRTSI 473
Query: 1093 LDLLQETGTLGSKPVATPLDPSIRLSQEQGKPYDDPAAYRRLVGRLLYLTTTRPDISHAT 1152
++ ++P+++P+ + +LS+ + P YR +VG L Y T RP+IS A
Sbjct: 472 FEIFCRKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVIRPEISFAV 293
Query: 1153 QQLSQYMSNPMDGHFKAAQRVLRYLKASPGLGL-LFPRNST--INIQGYSDADWAGCPDT 1209
++ Q+MSNP+D H+ +R+LRYLK S GL L P S+ + I+G+ DADWA D
Sbjct: 292 NKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCDADWASAVDD 113
Query: 1210 RRSISGYCFYIGRSLVSWKAKKQ 1232
+RS SG ++G +L+SW KQ
Sbjct: 112 KRSTSGAAVFLGPNLISWWXXKQ 44
>CO981879
Length = 576
Score = 90.9 bits (224), Expect(2) = 5e-30
Identities = 44/94 (46%), Positives = 59/94 (61%), Gaps = 3/94 (3%)
Frame = -1
Query: 522 KNFITLVKTQFGQIVKAIRSDNGPEFL---LPAFYSAQGIVHQKSCVSTPQQNGRVERKH 578
K F +++TQF +K RSDNG E+ L GI+HQ SCV TPQQNG ERK+
Sbjct: 564 KTFFQMIQTQFQVKIKVFRSDNGREYFNKHLSKXXLENGIIHQSSCVDTPQQNGVAERKN 385
Query: 579 QHILNIARALLFQSKLPKKMWCYSVLHAVFLMNR 612
+H+ +ARALLFQ+K PK W ++L +L N+
Sbjct: 384 RHLXEVARALLFQNKAPKYXWGEAILTGTYLKNK 283
Score = 60.5 bits (145), Expect(2) = 5e-30
Identities = 30/89 (33%), Positives = 50/89 (55%), Gaps = 5/89 (5%)
Frame = -2
Query: 612 RIPSKLLKNKSPYELLYREAVDLEM-----MKVFGSLCFATTLTNNRSKLDPRARKCIFL 666
R+PSK+L ++P ++ + + +K+FG F N+ KL+PRA+KC+F+
Sbjct: 284 RMPSKILNFRTPLDVFTSAFPNNRLSCTLPLKIFGCTVFVHIHEPNQGKLEPRAKKCVFV 105
Query: 667 GYKQGMKGYVLMDLITQEIFISRDVIFHE 695
GY KGY D +++ F++ DV F E
Sbjct: 104 GYAPNQKGYKCFDPTSKKTFVTIDVTFFE 18
>BU549979
Length = 615
Score = 130 bits (326), Expect = 5e-30
Identities = 67/194 (34%), Positives = 115/194 (58%), Gaps = 2/194 (1%)
Frame = -1
Query: 1155 LSQYMSNPMDGHFKAAQRVLRYLKASPGLGLLFPRNSTINIQGYSDADWAGCPDTRRSIS 1214
L +Y SNP H+K A++V+RYL+ + L++ + + + + GYSD+D+AGC D+RRS S
Sbjct: 612 LGRYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTS 433
Query: 1215 GYCFYIGRSLVSWKAKKQTTVSRSSNEAEYRALAYATCELQWLLYLLQDLKVT-CTATPV 1273
GY F + +VSW++ KQT ++ S+ E E+ AT WL + L+V + P+
Sbjct: 432 GYIFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPL 253
Query: 1274 -LFCDNQSALHIAANPVFHERTKHLDIDCHVVREKLQAGILKLLPIPTTLQVADVFTKAL 1332
L+CDN +A+ +A N R+KH+DI V+RE+++ + + + T L + D TK +
Sbjct: 252 KLYCDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGM 73
Query: 1333 QPRVFQGFATKLAM 1346
P+ F+ ++ +
Sbjct: 72 TPKNFKDHVVRMEL 31
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 129 bits (324), Expect = 9e-30
Identities = 76/179 (42%), Positives = 101/179 (55%), Gaps = 2/179 (1%)
Frame = +1
Query: 899 GTLARYKARLVAKGYNQVEGLDYFDTFSPVAKLTTVRVILALAASQNWHLHQLDVDNAFL 958
GT+ ++KARLVAK Y QV G DY TFSPVAK+ V ++ ++A +W L LD NAFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 959 HGNLDEDVYMTIPAG--VPSVKPNQVCKLLKSLYGLKQASRKWYEKLSAHLETLGFKQTA 1016
HG L+E+VYM P G N VC+L +S YGLKQ+ R W + +
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAWPFLYCG--AAIWYDSHE 381
Query: 1017 SDHSLFVKFQGSSFTGLLVYVDDVILFGNTVTEFQLVKDSLHQAFGIKDLGVLKYFLGL 1075
+DHS+F L+VYVDD+ + G+ +K L F KDLG L+YFLG+
Sbjct: 382 ADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>BM086359
Length = 427
Score = 124 bits (312), Expect = 2e-28
Identities = 65/142 (45%), Positives = 87/142 (60%)
Frame = +1
Query: 1073 LGLEVAHSTSGISLCQRKYCLDLLQETGTLGSKPVATPLDPSIRLSQEQGKPYDDPAAYR 1132
LG++VA S+ GI + Q KY LD+L ETG L P TP+DP+++L QG+ +DP
Sbjct: 1 LGIDVAQSSYGIVISQWKYALDILTETGMLDCLPSNTPMDPNVKLLSGQGEALEDPGR*C 180
Query: 1133 RLVGRLLYLTTTRPDISHATQQLSQYMSNPMDGHFKAAQRVLRYLKASPGLGLLFPRNST 1192
LVGRL YLT TR DI+ A LSQ++ +P D + A R+LRY+K +PG GLL+
Sbjct: 181 CLVGRLNYLTVTRLDITFAVGVLSQFLKDPTDSQWNATIRILRYIKNAPGPGLLYEDKGN 360
Query: 1193 INIQGYSDADWAGCPDTRRSIS 1214
+ Y DADW G P + S S
Sbjct: 361 GKVVCYFDADWPGSPSDKSSTS 426
>BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial
(21%)
Length = 421
Score = 124 bits (310), Expect = 4e-28
Identities = 65/140 (46%), Positives = 89/140 (63%), Gaps = 1/140 (0%)
Frame = +2
Query: 1013 KQTASDHSLFVKFQG-SSFTGLLVYVDDVILFGNTVTEFQLVKDSLHQAFGIKDLGVLKY 1071
K + +DHS+F L+VYVDD+++ T+ +K+ L F KDL LKY
Sbjct: 2 K*SEADHSVFYCHTSPGKCVYLMVYVDDIMITKKDATKIVQLKEHLFNHFQTKDLRYLKY 181
Query: 1072 FLGLEVAHSTSGISLCQRKYCLDLLQETGTLGSKPVATPLDPSIRLSQEQGKPYDDPAAY 1131
FLG+EVA S G+ + QRKY LD+L+ETG + V +P+DP+++L Q + Y DP Y
Sbjct: 182 FLGIEVAQSGDGVVISQRKYALDILEETGMQNCRLVDSPMDPNLKLMAYQSEVYPDPERY 361
Query: 1132 RRLVGRLLYLTTTRPDISHA 1151
RRLVG+L+YLT TRPDIS A
Sbjct: 362 RRLVGKLIYLTITRPDISFA 421
>BM307983
Length = 406
Score = 120 bits (302), Expect = 3e-27
Identities = 64/133 (48%), Positives = 87/133 (65%), Gaps = 2/133 (1%)
Frame = +2
Query: 885 IGNKWVYRVKRNVDGTLARYKARLVAKGYNQVEGLDYFDTFSPVAK-LTTVRVILALAAS 943
+G +W+Y VK D TL RYKARLVAKGY Q G+DY +TF+ K + + A
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 944 QNWHLHQLDVDNAFLHGNLDEDVYMTIPAGV-PSVKPNQVCKLLKSLYGLKQASRKWYEK 1002
W +HQ DV NAFLHG+L+E+VYM IP G S N+VC+L K+LYGLKQ+ R W+ +
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 1003 LSAHLETLGFKQT 1015
+ + +LG+KQ+
Sbjct: 362 FTQAMLSLGYKQS 400
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 119 bits (297), Expect = 1e-26
Identities = 67/181 (37%), Positives = 103/181 (56%), Gaps = 3/181 (1%)
Frame = +2
Query: 418 LWHFRLGHLSHDRLLALHTVQNSISVSKSIVCDVCHLAKQKRKMFTVSVSKAQKCFDLVH 477
L H RLGH L L + S+ K + C+ C L K R S+ F ++H
Sbjct: 311 LLHERLGH---PHLSKLKIMVPSLEKIKDLFCESCQLGKHVRSSXRHVESRVDSPFLVIH 481
Query: 478 MDIWGPLAQASVHNHKYFLTVLDDYSRFVWVVLLNNKGEVQQQVKNFITLVKTQFGQIVK 537
DIWGP +S+ +++YF+T +D++S+ V L+ + E+ + + + +KTQFG+ +K
Sbjct: 482 XDIWGPNRVSSM-SYRYFVTFIDEFSQCTRVFLMKERSEILSFLTS-VNKIKTQFGKTIK 655
Query: 538 AIRSDNGPEF---LLPAFYSAQGIVHQKSCVSTPQQNGRVERKHQHILNIARALLFQSKL 594
+RSDN E+ ++ F SAQGI+HQ SC TPQQN ERK++H++ AR LL +
Sbjct: 656 ILRSDNAKEYFSSVISPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANE 835
Query: 595 P 595
P
Sbjct: 836 P 838
>BU548243
Length = 599
Score = 117 bits (293), Expect = 3e-26
Identities = 62/147 (42%), Positives = 89/147 (60%)
Frame = -1
Query: 1200 DADWAGCPDTRRSISGYCFYIGRSLVSWKAKKQTTVSRSSNEAEYRALAYATCELQWLLY 1259
DA WA D RS G ++G +L+SW ++KQ ++SS EAEYR++A + EL W+
Sbjct: 587 DAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELTWIQA 408
Query: 1260 LLQDLKVTCTATPVLFCDNQSALHIAANPVFHERTKHLDIDCHVVREKLQAGILKLLPIP 1319
LL +L++ T PV+ CDN+SA+ IA N VFH RTKH++ID V EK+ + L++ IP
Sbjct: 407 LLMELQIPFT-PPVILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQIFHIP 231
Query: 1320 TTLQVADVFTKALQPRVFQGFATKLAM 1346
Q A + TK L F +KL +
Sbjct: 230 ALDQWAGILTKPLSSARFTFLKSKLTV 150
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.335 0.144 0.447
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 56,867,658
Number of Sequences: 63676
Number of extensions: 789006
Number of successful extensions: 6061
Number of sequences better than 10.0: 149
Number of HSP's better than 10.0 without gapping: 5915
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6009
length of query: 1346
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1237
effective length of database: 5,698,948
effective search space: 7049598676
effective search space used: 7049598676
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.6 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0366.1