
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148395.6 - phase: 0
(1336 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 402 e-112
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 402 e-112
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 204 3e-52
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 199 7e-51
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 178 2e-44
BQ296988 similar to GP|21740616|em OSJNBb0089K24.12 {Oryza sativ... 176 5e-44
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 171 3e-42
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 155 1e-37
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 150 4e-36
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 150 5e-36
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 144 3e-34
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 137 2e-33
BU764568 107 1e-31
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 110 3e-31
BU549979 129 7e-30
TC211663 128 1e-29
CO982036 126 6e-29
BU548243 125 1e-28
BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberos... 124 2e-28
BM307983 124 4e-28
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 402 bits (1033), Expect = e-112
Identities = 203/505 (40%), Positives = 307/505 (60%), Gaps = 1/505 (0%)
Frame = +1
Query: 821 EPISYKEASQQDCWVKAMNSELSALNHNKTWIFVDTPPNIKPIGSKWVYKIKHKSDGTIE 880
EP + KEA + W+ AM EL N+ W V P IG+KW++K K +G I
Sbjct: 3199 EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVIT 3378
Query: 881 RYKARLVAKGYTQVEGIDFFDTFSPVAKITTVRILIALASINSWHLHQLDVNNAFLHGDL 940
R KARLVA+GYTQ+EG+DF +TF+PVA++ ++R+L+ +A I + L+Q+DV +AFL+G L
Sbjct: 3379 RNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYL 3558
Query: 941 SENVYMSVPQGVHSPK-PNQVCKLLKSLYGLKQASRKWYEKLTGFLLLQGYKQSASDHSL 999
+E VY+ P+G P P+ V +L K+LYGLKQA R WYE+LT FL QGY++ D +L
Sbjct: 3559 NEEVYVEQPKGFADPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTL 3738
Query: 1000 FILHSDTCFTALLVYVDDVILAGNSMTEIDRIKAVLDVEFKIKDLGKLKYFLGIEVAHSK 1059
F+ +YVDD++ G S + + EF++ +G+L YFLG++V +
Sbjct: 3739 FVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQME 3918
Query: 1060 LGISICQRKYCLDLLKDTGLLGAKPVTTPLDPSIKLHQDNSPPHDDILSYRRLVGKLLYL 1119
I + Q +Y +++K G+ A TP +KL +D + D YR ++G LLYL
Sbjct: 3919 DSIFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYL 4098
Query: 1120 TTTRPDIAFVVQQLSQFLNAPTITHYDTACRVVKYLKGTPGQGLLFRRDSQLQLLGFTDA 1179
T +RPDI + V +++ P I+H R++KY+ GT G+++ S L+G+ DA
Sbjct: 4099 TASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDA 4278
Query: 1180 DWAGCPDSRRSTSGYCFFLGSSLISWRAKKQHTVARSSSEAEYRALSFASCELQWLLYLL 1239
DWAG D R+STSG CF+LG++LISW +KKQ+ V+ S++EAEY A + +L W+ +L
Sbjct: 4279 DWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQML 4458
Query: 1240 KDLRVKCNKLPVLFCDNQSAIHIAGNPVFHERTKHLEIDCHFVREKLQQNIFKLLPIKSQ 1299
K+ V+ + L+CDN SAI+I+ NPV H RTKH++I H++R+ + + L + ++
Sbjct: 4459 KEYNVE-QDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTE 4635
Query: 1300 SQLADFFTKPLPPKNFHSFLPKLNM 1324
Q+AD FTK L F KL +
Sbjct: 4636 EQIADIFTKALDANQFEKLRGKLGI 4710
Score = 135 bits (341), Expect = 9e-32
Identities = 133/521 (25%), Positives = 212/521 (40%), Gaps = 18/521 (3%)
Frame = +1
Query: 188 GVMNAKHQHEITRSIRFLTGLNDSFDLVRSQILLMNPLPTINKIFSMVMQHERQFKISIP 247
G +N+K ++ +T+SI+ L +D+ D V +L N F+ +P
Sbjct: 1249 GFLNSKLEN-MTKSIKMLNKGSDTLDEVL--LLGKNAGNQRGLGFNPKSAGRTTMTEFVP 1419
Query: 248 VEDSTILVNSAGKSQGRGRGNGNSNSNGKRS---CSFCGRDGHTIDICYRKHGFPPNYGN 304
++ T A SQ R R +G KR C +CG+ GH CY HG P++G
Sbjct: 1420 AKNRT----GATMSQHRSRHHGMQQKKSKRKKWRCHYCGKYGHIKPFCYHLHGH-PHHGT 1584
Query: 305 KNFAKVNNSSIEQNEEREDLDDSKSCKGNSNTEPSFGITKEQYEQLVTLLQSQQASSSKV 364
Q+S+S+
Sbjct: 1585 -----------------------------------------------------QSSNSRK 1605
Query: 365 NLSHVTNH-VTSGITRTSYTMNHSSFGTWIVDSGASDHICSSIKFFDSYAPIIPVHIKLP 423
+ V H S + TS + S+ W +DSG S H+ +F + P ++
Sbjct: 1606 KMMWVPKHKAVSLVVHTS--LRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFG 1779
Query: 424 NGNMAIAKFSGTVNFSPGLIARN-VLYVAEFKLNLLSVPKLCMDNNCIVTFDNDKCFI-Q 481
+G+ G + GL + N VL V NL+S+ +LC D V F +C +
Sbjct: 1780 DGSKGKIIGMGKL-VHDGLPSLNKVLLVKGLTANLISISQLC-DEGFNVNFTKSECLVTN 1953
Query: 482 DKSNLKMTGLGELIEGLYFLNLGSQVFSQFNHQPVIALSQSSSSTTFLPQE---ALWHFR 538
+KS + M G S+ N S SST +E +WH R
Sbjct: 1954 EKSEVLMKGSR----------------SKDNCYLWTPQETSYSSTCLSSKEDEVRIWHQR 2085
Query: 539 LGHLSNDRLLRM-----QQHFPCIKVDSNSVCDICHYSRHKKLPF-KLSMNKASHCYELI 592
GHL + ++ + P +K++ +C C + K+ KL S EL+
Sbjct: 2086 FGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELL 2265
Query: 593 HFDIWGPISTHSIHGHKYFITALDDYSRFTWIMLCKTKSEVSKSVQNFILNIENQFNCKV 652
H D+ GP+ S+ G +Y +DD+SRFTW+ + KSE + + L ++ + +C +
Sbjct: 2266 HMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVI 2445
Query: 653 KTVRTDNGPEF---LMPDFYSSKGIEHQTSCVETPQQNGRV 690
K +R+D+G EF +F +S+GI H+ S TPQQNG V
Sbjct: 2446 KRIRSDHGREFENSRFTEFCTSEGITHEFSAAITPQQNGIV 2568
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 402 bits (1033), Expect = e-112
Identities = 203/505 (40%), Positives = 307/505 (60%), Gaps = 1/505 (0%)
Frame = +1
Query: 821 EPISYKEASQQDCWVKAMNSELSALNHNKTWIFVDTPPNIKPIGSKWVYKIKHKSDGTIE 880
EP + KEA + W+ AM EL N+ W V P IG+KW++K K +G I
Sbjct: 3202 EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVIT 3381
Query: 881 RYKARLVAKGYTQVEGIDFFDTFSPVAKITTVRILIALASINSWHLHQLDVNNAFLHGDL 940
R KARLVA+GYTQ+EG+DF +TF+PVA++ ++R+L+ +A I + L+Q+DV +AFL+G L
Sbjct: 3382 RNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYL 3561
Query: 941 SENVYMSVPQGVHSPK-PNQVCKLLKSLYGLKQASRKWYEKLTGFLLLQGYKQSASDHSL 999
+E Y+ P+G P P+ V +L K+LYGLKQA R WYE+LT FL QGY++ D +L
Sbjct: 3562 NEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTL 3741
Query: 1000 FILHSDTCFTALLVYVDDVILAGNSMTEIDRIKAVLDVEFKIKDLGKLKYFLGIEVAHSK 1059
F+ +YVDD++ G S + + EF++ +G+L YFLG++V +
Sbjct: 3742 FVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQME 3921
Query: 1060 LGISICQRKYCLDLLKDTGLLGAKPVTTPLDPSIKLHQDNSPPHDDILSYRRLVGKLLYL 1119
I + Q KY +++K G+ A TP +KL +D + D YR ++G LLYL
Sbjct: 3922 DSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYL 4101
Query: 1120 TTTRPDIAFVVQQLSQFLNAPTITHYDTACRVVKYLKGTPGQGLLFRRDSQLQLLGFTDA 1179
T +RPDI + V +++ P I+H + R++KY+ GT G+++ S L+G+ DA
Sbjct: 4102 TASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDA 4281
Query: 1180 DWAGCPDSRRSTSGYCFFLGSSLISWRAKKQHTVARSSSEAEYRALSFASCELQWLLYLL 1239
DWAG D R+STSG CF+LG++LISW +KKQ+ V+ S++EAEY A + +L W+ +L
Sbjct: 4282 DWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQML 4461
Query: 1240 KDLRVKCNKLPVLFCDNQSAIHIAGNPVFHERTKHLEIDCHFVREKLQQNIFKLLPIKSQ 1299
K+ V+ + L+CDN SAI+I+ NPV H RTKH++I H++R+ + + L + ++
Sbjct: 4462 KEYNVE-QDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTE 4638
Query: 1300 SQLADFFTKPLPPKNFHSFLPKLNM 1324
Q+AD FTK L F KL +
Sbjct: 4639 EQIADIFTKALDANQFEKLRGKLGI 4713
Score = 139 bits (349), Expect = 1e-32
Identities = 132/525 (25%), Positives = 216/525 (41%), Gaps = 22/525 (4%)
Frame = +1
Query: 188 GVMNAKHQHEITRSIRFLTGLNDSFDLVRSQILLMNPLPTINKIFSMVMQHERQFKIS-- 245
G +N+K ++ +T+SI+ L +D D ++L + + + H+ + +
Sbjct: 1252 GFLNSKLEN-MTKSIKMLNKGSDMLD----EVLQLGK--NVGNQRGLGFNHKSAGRTTMT 1410
Query: 246 --IPVEDSTILVNSAGKSQGRGRGNGNSNSNGKRS---CSFCGRDGHTIDICYRKHGFPP 300
+P ++ST A SQ R R +G KR C +CG+ GH CY HG P
Sbjct: 1411 EFVPAKNST----GATMSQHRSRHHGTQQKKSKRKKWRCHYCGKYGHIKPFCYHLHGH-P 1575
Query: 301 NYGNKNFAKVNNSSIEQNEEREDLDDSKSCKGNSNTEPSFGITKEQYEQLVTLLQSQQAS 360
++G Q+S
Sbjct: 1576 HHGT-----------------------------------------------------QSS 1596
Query: 361 SSKVNLSHVTNH-VTSGITRTSYTMNHSSFGTWIVDSGASDHICSSIKFFDSYAPIIPVH 419
SS + V H + S + TS + S+ W +DSG S H+ +F + P +
Sbjct: 1597 SSGRKMMWVPKHKIVSLVVHTS--LRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCSTSY 1770
Query: 420 IKLPNGNMAIAKFSGTVNFSPGLIARN-VLYVAEFKLNLLSVPKLCMDNNCIVTFDNDKC 478
+ +G+ G + GL + N VL V NL+S+ +LC D V F +C
Sbjct: 1771 VTFGDGSKGKITGMGKL-VHDGLPSLNKVLLVKGLTANLISISQLC-DEGFNVNFTKSEC 1944
Query: 479 FI-QDKSNLKMTGLGELIEGLYFLNLGSQVFSQFNHQPVIALSQSSSSTTFLPQE---AL 534
+ +KS + M G S+ N S SST +E +
Sbjct: 1945 LVTNEKSEVLMKGSR----------------SKDNCYLWTPQETSYSSTCLFSKEDEVKI 2076
Query: 535 WHFRLGHLSNDRLLRM-----QQHFPCIKVDSNSVCDICHYSRHKKLPF-KLSMNKASHC 588
WH R GHL + ++ + P +K++ +C C + K+ KL S
Sbjct: 2077 WHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRV 2256
Query: 589 YELIHFDIWGPISTHSIHGHKYFITALDDYSRFTWIMLCKTKSEVSKSVQNFILNIENQF 648
EL+H D+ GP+ S+ G +Y +DD+SRFTW+ + KS+ + + L ++ +
Sbjct: 2257 LELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSDTFEVFKELSLRLQREK 2436
Query: 649 NCKVKTVRTDNGPEF---LMPDFYSSKGIEHQTSCVETPQQNGRV 690
+C +K +R+D+G EF +F +S+GI H+ S TPQQNG V
Sbjct: 2437 DCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITPQQNGIV 2571
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 204 bits (518), Expect = 3e-52
Identities = 102/139 (73%), Positives = 115/139 (82%)
Frame = -2
Query: 965 KSLYGLKQASRKWYEKLTGFLLLQGYKQSASDHSLFILHSDTCFTALLVYVDDVILAGNS 1024
KSLYGLKQASRKWYEKLT LL +GY QS SD+SLF L FTALLVYVDD+ILAG+S
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241
Query: 1025 MTEIDRIKAVLDVEFKIKDLGKLKYFLGIEVAHSKLGISICQRKYCLDLLKDTGLLGAKP 1084
+ E DRIK VLD+ FKIK+LGKLKYFLG+EVAHS+LGI+I QRKYCLDLLKD+GLLG KP
Sbjct: 240 IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61
Query: 1085 VTTPLDPSIKLHQDNSPPH 1103
+TPLD SIKLH P+
Sbjct: 60 ASTPLDTSIKLHSAAGTPY 4
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 199 bits (506), Expect = 7e-51
Identities = 107/185 (57%), Positives = 135/185 (72%), Gaps = 4/185 (2%)
Frame = +3
Query: 1148 ACRVVKYLKGTPGQGLLFRRDSQLQLLGFTDADWAGCPDSRRSTSGYCFFLGSSLISWRA 1207
A RV+KYLKG P +GL F R+S +Q+LGF+DADWA C DS +S + YCFFLGSSLISW+A
Sbjct: 21 ATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLISWKA 200
Query: 1208 KKQHTVAR--SSSEAEYRALSFASCELQWLLYLLKDLRVKCNKLPVLFCDNQSAIH-IAG 1264
KKQ+TV+R SSSEA+YRAL+ +CELQWL YLLKDL V +++CDNQSA+ +
Sbjct: 201 KKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLHV-----TLIYCDNQSALQ*LPI 365
Query: 1265 NPVFHERTKHLEIDCHFVREKLQQNIFK-LLPIKSQSQLADFFTKPLPPKNFHSFLPKLN 1323
++H + LEIDCH VREK QQ + LLP+ S +QLAD FTK L PK F S L KL
Sbjct: 366 KVIYHGQ---LEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSNLSKLG 536
Query: 1324 MIDLY 1328
+ D++
Sbjct: 537 LSDIF 551
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 178 bits (451), Expect = 2e-44
Identities = 87/158 (55%), Positives = 108/158 (68%), Gaps = 1/158 (0%)
Frame = +2
Query: 1172 QLLGFTDADWAGCPDSRRSTSGYCFFLGSSLISWRAKKQHTVARSSSEAEYRALSFASCE 1231
QL G+ DADWAGCP RRSTSGYC F+G +L+SW++KKQ VARSS+EAEYR+++ +CE
Sbjct: 14 QLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCE 193
Query: 1232 LQWLLYLLKDLRVKCNKLPV-LFCDNQSAIHIAGNPVFHERTKHLEIDCHFVREKLQQNI 1290
L W+ L++LR C +L + L+CDNQ+A+HIA NPVFHERTKH+EIDCHF+REKL
Sbjct: 194 LMWIKQFLQELRF-CEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKE 370
Query: 1291 FKLLPIKSQSQLADFFTKPLPPKNFHSFLPKLNMIDLY 1328
I S Q D TK L KL DLY
Sbjct: 371 IVTEFIGSNDQPVDILTKSLRGPKIQIVCSKLGAYDLY 484
>BQ296988 similar to GP|21740616|em OSJNBb0089K24.12 {Oryza sativa (japonica
cultivar-group)}, partial (1%)
Length = 408
Score = 176 bits (447), Expect = 5e-44
Identities = 82/137 (59%), Positives = 106/137 (76%)
Frame = -1
Query: 91 RCNMLVHSWIMNSVEDSIAQSIVFLENAIDVWNELKERFSQGDFIRISELQCEIFSLKQD 150
RCNML+HSWI+NSVE SI++SIVF++NA DVW +LKERFSQGD +R+SE+Q EI++L Q
Sbjct: 408 RCNMLIHSWILNSVEPSISRSIVFMDNASDVWLDLKERFSQGDLVRVSEIQQEIYALTQG 229
Query: 151 SRSVTEFFTALKVLWEELEAYLPTPVCACPHRCMCITGVMNAKHQHEITRSIRFLTGLND 210
+RSVT F++ K LWEELE Y+P P C C HRC C + +H H + +RFLTGLND
Sbjct: 228 TRSVTTFYSDKKALWEELEIYMPIPNCTCHHRCSCDAMRLARRHHHTL-HVMRFLTGLND 52
Query: 211 SFDLVRSQILLMNPLPT 227
F+ V+SQILL+ PLP+
Sbjct: 51 EFNAVKSQILLIEPLPS 1
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 171 bits (432), Expect = 3e-42
Identities = 85/228 (37%), Positives = 139/228 (60%), Gaps = 5/228 (2%)
Frame = +1
Query: 1105 DILSYRRLVGKLLYLTTTRPDIAFVVQQLSQFLNAPTITHYDTACRVVKYLKGTPGQGLL 1164
D+ +RRL+G L YL +RP+I F V +S+F+ P ++H A RV++ +KGT G G+L
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVL 189
Query: 1165 F---RRDSQLQLLGFTDADWAGCPDSRRSTSGYCFFLGSSLISWRAKKQHTVARSSSEAE 1221
F + + LLG+TD+DW P+ +ST GY F + ++ +KKQ +A S+ EAE
Sbjct: 190 FPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAE 369
Query: 1222 YRALSFASCELQWLLYLLKDLRVKCNKLPVLFCDNQSAIHIAGNPVFHERTKHLEIDCHF 1281
Y A S +C+ W++ LL++L+++ K L DN+SAI++A +P H R+KH+E+ H+
Sbjct: 370 YVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHY 549
Query: 1282 VREKLQQNIFKLLPIKSQSQLADFFTKPLPPKNFHSFLPKL--NMIDL 1327
+R+++ + + K++ QLAD TKP+ F +L N+ DL
Sbjct: 550 IRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSELVNNLEDL 693
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 155 bits (391), Expect = 1e-37
Identities = 75/130 (57%), Positives = 94/130 (71%), Gaps = 1/130 (0%)
Frame = -2
Query: 854 VDTPPNIKPIGSKWVYKIKHKSDGTIERYKARLVAKGYTQVEGIDFFDTFSPVAKITTVR 913
V PP P+G +WVY +K G ++R KARLVAKGYTQV GID+ DTFSPVAK+TTVR
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 914 ILIALASINSWHLHQLDVNNAFLHGDLSENVYMSVPQG-VHSPKPNQVCKLLKSLYGLKQ 972
+ +A+A+I W LHQLD+ NAFLHGDL E++YM P G V + VCKL +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 973 ASRKWYEKLT 982
+ R W+ K +
Sbjct: 46 SPRAWFGKFS 17
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 150 bits (379), Expect = 4e-36
Identities = 85/179 (47%), Positives = 108/179 (59%), Gaps = 2/179 (1%)
Frame = +1
Query: 877 GTIERYKARLVAKGYTQVEGIDFFDTFSPVAKITTVRILIALASINSWHLHQLDVNNAFL 936
GTI+++KARLVAK YTQV G D+ TFSPVAK+ V +L ++A + W L LD NAFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 937 HGDLSENVYMSVPQG--VHSPKPNQVCKLLKSLYGLKQASRKWYEKLTGFLLLQGYKQSA 994
HG L E VYM P G N VC+L +S YGLKQ+ R W G + Y
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAWPFLYCGAAI--WYDSHE 381
Query: 995 SDHSLFILHSDTCFTALLVYVDDVILAGNSMTEIDRIKAVLDVEFKIKDLGKLKYFLGI 1053
+DHS+F HS L+VYVDD+ + G+ I +K L +F+ KDLGKL+YFLGI
Sbjct: 382 ADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 150 bits (378), Expect = 5e-36
Identities = 75/151 (49%), Positives = 101/151 (66%), Gaps = 1/151 (0%)
Frame = -3
Query: 837 AMNSELSALNHNKTWIFVDTPPNIKPIGSKWVYKIKHKSDGTIERYKARLVAKGYTQVEG 896
AM EL+ N W V+ P N IG+KWV++ K G I R KARLVAKGY Q EG
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 897 IDFFDTFSPVAKITTVRILIALASINSWHLHQLDVNNAFLHGDLSENVYMSVPQGVHSP- 955
ID+ +T++PVA++ +R+L+A SI ++ L+Q+DV +AFL+G + E VY+ P G P
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 956 KPNQVCKLLKSLYGLKQASRKWYEKLTGFLL 986
KP V KL K+LYGLKQA R WYE+++ FLL
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFLL 6
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 144 bits (362), Expect = 3e-34
Identities = 67/134 (50%), Positives = 91/134 (67%)
Frame = +3
Query: 816 ITDASEPISYKEASQQDCWVKAMNSELSALNHNKTWIFVDTPPNIKPIGSKWVYKIKHKS 875
++ + P + +EA W +AM E+ AL +N TW V PP +G +WVY +K
Sbjct: 3 LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182
Query: 876 DGTIERYKARLVAKGYTQVEGIDFFDTFSPVAKITTVRILIALASINSWHLHQLDVNNAF 935
+G ++R KARLVAKGYTQV GI++ DTFSPV +TTVR+ +A+A+I W LHQLD+ NAF
Sbjct: 183 NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362
Query: 936 LHGDLSENVYMSVP 949
LHGDL E++YM P
Sbjct: 363 LHGDLEEDIYMEQP 404
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 137 bits (344), Expect(2) = 2e-33
Identities = 62/109 (56%), Positives = 83/109 (75%)
Frame = +2
Query: 1178 DADWAGCPDSRRSTSGYCFFLGSSLISWRAKKQHTVARSSSEAEYRALSFASCELQWLLY 1237
DA+WA P R ST GYC +G +L+ W++ K + VARSS+EAEY+A++ A+CEL W+
Sbjct: 8 DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187
Query: 1238 LLKDLRVKCNKLPVLFCDNQSAIHIAGNPVFHERTKHLEIDCHFVREKL 1286
LL++L+ + L CDNQ+A+HIA NPVFHERTKH+EIDCHFVREK+
Sbjct: 188 LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
Score = 25.8 bits (55), Expect(2) = 2e-33
Identities = 12/33 (36%), Positives = 16/33 (48%)
Frame = +3
Query: 1296 IKSQSQLADFFTKPLPPKNFHSFLPKLNMIDLY 1328
+ S QLA+ FTK L + KL +LY
Sbjct: 363 VSSNDQLANIFTKSLRGPRIQNICSKLGAFELY 461
>BU764568
Length = 420
Score = 107 bits (267), Expect(2) = 1e-31
Identities = 48/85 (56%), Positives = 65/85 (76%)
Frame = +3
Query: 1191 TSGYCFFLGSSLISWRAKKQHTVARSSSEAEYRALSFASCELQWLLYLLKDLRVKCNKLP 1250
TSGYC +G +LISW++KKQ VA+SS+EAEYRA++ +CEL WL LL +L+ + +
Sbjct: 165 TSGYCVLIGGNLISWKSKKQSVVAKSSAEAEYRAMALVTCELIWLKQLL*ELKFEEDTQM 344
Query: 1251 VLFCDNQSAIHIAGNPVFHERTKHL 1275
L CDNQ+A+HIA NP+FH RTKH+
Sbjct: 345 TLICDNQAALHIASNPIFH*RTKHI 419
Score = 49.3 bits (116), Expect(2) = 1e-31
Identities = 22/61 (36%), Positives = 34/61 (55%)
Frame = +1
Query: 1134 SQFLNAPTITHYDTACRVVKYLKGTPGQGLLFRRDSQLQLLGFTDADWAGCPDSRRSTSG 1193
SQFLN+P H++ ++K K PG+GL++ Q++G++DAD G P R
Sbjct: 1 SQFLNSPCQDHWNAVS*ILK*TKSAPGKGLIYEDKGHSQIIGYSDAD*VGSPSDRHQDIV 180
Query: 1194 Y 1194
Y
Sbjct: 181 Y 183
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 110 bits (275), Expect(2) = 3e-31
Identities = 63/185 (34%), Positives = 96/185 (51%)
Frame = +3
Query: 1008 FTALLVYVDDVILAGNSMTEIDRIKAVLDVEFKIKDLGKLKYFLGIEVAHSKLGISICQR 1067
F + +YVDD+I S ++ F+ G+LK+ LG+++ GI I Q
Sbjct: 486 FLIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQE 665
Query: 1068 KYCLDLLKDTGLLGAKPVTTPLDPSIKLHQDNSPPHDDILSYRRLVGKLLYLTTTRPDIA 1127
KY LK + AKP+ TP+ S + +D H Y ++ L YLT++RPDI
Sbjct: 666 KYTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRPDIV 845
Query: 1128 FVVQQLSQFLNAPTITHYDTACRVVKYLKGTPGQGLLFRRDSQLQLLGFTDADWAGCPDS 1187
FVV ++F + P I+H R+++YL GT L F++ S+ LLG+ D +AG
Sbjct: 846 FVVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGDKVE 1025
Query: 1188 RRSTS 1192
R+STS
Sbjct: 1026 RKSTS 1040
Score = 44.7 bits (104), Expect(2) = 3e-31
Identities = 19/39 (48%), Positives = 27/39 (68%)
Frame = +2
Query: 962 KLLKSLYGLKQASRKWYEKLTGFLLLQGYKQSASDHSLF 1000
K L +YGLKQA R WYE+L+ FL+ G+ + +D +LF
Sbjct: 347 KTLSCVYGLKQALRAWYERLSSFLVSNGFTRGITDPALF 463
>BU549979
Length = 615
Score = 129 bits (325), Expect = 7e-30
Identities = 65/194 (33%), Positives = 117/194 (59%), Gaps = 2/194 (1%)
Frame = -1
Query: 1133 LSQFLNAPTITHYDTACRVVKYLKGTPGQGLLFRRDSQLQLLGFTDADWAGCPDSRRSTS 1192
L ++ + P I H+ TA +V++YL+GT L++++ + L+++G++D+D+AGC DSRRSTS
Sbjct: 612 LGRYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTS 433
Query: 1193 GYCFFLGSSLISWRAKKQHTVARSSSEAEYRALSFASCELQWLLYLLKDLRV--KCNKLP 1250
GY F L ++SWR+ KQ +A S+ E E+ A+ WL + LRV ++
Sbjct: 432 GYIFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPL 253
Query: 1251 VLFCDNQSAIHIAGNPVFHERTKHLEIDCHFVREKLQQNIFKLLPIKSQSQLADFFTKPL 1310
L+CDN +A+ +A N R+KH++I +RE++++ + + ++ + D TK +
Sbjct: 252 KLYCDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGM 73
Query: 1311 PPKNFHSFLPKLNM 1324
PKNF + ++ +
Sbjct: 72 TPKNFKDHVVRMEL 31
>TC211663
Length = 426
Score = 128 bits (322), Expect = 1e-29
Identities = 67/151 (44%), Positives = 95/151 (62%)
Frame = -3
Query: 108 IAQSIVFLENAIDVWNELKERFSQGDFIRISELQCEIFSLKQDSRSVTEFFTALKVLWEE 167
IAQ +++ ++A ++WN+LKE FSQGD ++I+ELQ EI+ LKQ S +V +FFT LK +WEE
Sbjct: 424 IAQIVIYFDHATNIWNDLKEGFSQGDLLQIAELQEEIYRLKQGSHTVLDFFTKLKFVWEE 245
Query: 168 LEAYLPTPVCACPHRCMCITGVMNAKHQHEITRSIRFLTGLNDSFDLVRSQILLMNPLPT 227
L+ Y +C CP R HQ + I FL GL++ F +V S++LLM+ LP+
Sbjct: 244 LDNYGLMNLCTCPSR---------TYHQQDFV--IHFLKGLDERFSVVCSEVLLMDHLPS 98
Query: 228 INKIFSMVMQHERQFKISIPVEDSTILVNSA 258
+IFSMV+QHE Q ED +N A
Sbjct: 97 TKRIFSMVIQHETQHASHTSAEDQNRFINVA 5
>CO982036
Length = 674
Score = 126 bits (317), Expect = 6e-29
Identities = 77/203 (37%), Positives = 113/203 (54%), Gaps = 3/203 (1%)
Frame = -2
Query: 1011 LLVYVDDVILAGNSMTEIDRIKAVLDVEFKIKDLGKLKYFLGIEVAHSKLGISICQRKYC 1070
LLVYVD +I+ G+S T I + + L+ F +K LGKL YF+ IEV S + R
Sbjct: 646 LLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVK-SMPDLLFSLRTSI 473
Query: 1071 LDLLKDTGLLGAKPVTTPLDPSIKLHQDNSPPHDDILSYRRLVGKLLYLTTTRPDIAFVV 1130
++ A+P+++P+ + KL + +S YR +VG L Y T RP+I+F V
Sbjct: 472 FEIFCRKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVIRPEISFAV 293
Query: 1131 QQLSQFLNAPTITHYDTACRVVKYLKGTPGQGLLFR---RDSQLQLLGFTDADWAGCPDS 1187
++ QF++ P +H+ R+++YLKG+ GL + L + GF DADWA D
Sbjct: 292 NKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCDADWASAVDD 113
Query: 1188 RRSTSGYCFFLGSSLISWRAKKQ 1210
+RSTSG FLG +LISW KQ
Sbjct: 112 KRSTSGAAVFLGPNLISWWXXKQ 44
>BU548243
Length = 599
Score = 125 bits (314), Expect = 1e-28
Identities = 67/147 (45%), Positives = 88/147 (59%)
Frame = -1
Query: 1178 DADWAGCPDSRRSTSGYCFFLGSSLISWRAKKQHTVARSSSEAEYRALSFASCELQWLLY 1237
DA WA D RST G FLG +LISW ++KQ A+SS+EAEYR+++ S EL W+
Sbjct: 587 DAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELTWIQA 408
Query: 1238 LLKDLRVKCNKLPVLFCDNQSAIHIAGNPVFHERTKHLEIDCHFVREKLQQNIFKLLPIK 1297
LL +L++ PV+ CDN+SA+ IA N VFH RTKH+EID FV EK+ ++ I
Sbjct: 407 LLMELQIPFTP-PVILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQIFHIP 231
Query: 1298 SQSQLADFFTKPLPPKNFHSFLPKLNM 1324
+ Q A TKPL F KL +
Sbjct: 230 ALDQWAGILTKPLSSARFTFLKSKLTV 150
>BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial
(21%)
Length = 421
Score = 124 bits (312), Expect = 2e-28
Identities = 69/140 (49%), Positives = 93/140 (66%), Gaps = 2/140 (1%)
Frame = +2
Query: 991 KQSASDHSLFILHSDT--CFTALLVYVDDVILAGNSMTEIDRIKAVLDVEFKIKDLGKLK 1048
K S +DHS+F H+ C L+VYVDD+++ T+I ++K L F+ KDL LK
Sbjct: 2 K*SEADHSVFYCHTSPGKC-VYLMVYVDDIMITKKDATKIVQLKEHLFNHFQTKDLRYLK 178
Query: 1049 YFLGIEVAHSKLGISICQRKYCLDLLKDTGLLGAKPVTTPLDPSIKLHQDNSPPHDDILS 1108
YFLGIEVA S G+ I QRKY LD+L++TG+ + V +P+DP++KL S + D
Sbjct: 179 YFLGIEVAQSGDGVVISQRKYALDILEETGMQNCRLVDSPMDPNLKLMAYQSEVYPDPER 358
Query: 1109 YRRLVGKLLYLTTTRPDIAF 1128
YRRLVGKL+YLT TRPDI+F
Sbjct: 359 YRRLVGKLIYLTITRPDISF 418
>BM307983
Length = 406
Score = 124 bits (310), Expect = 4e-28
Identities = 64/133 (48%), Positives = 86/133 (64%), Gaps = 2/133 (1%)
Frame = +2
Query: 863 IGSKWVYKIKHKSDGTIERYKARLVAKGYTQVEGIDFFDTFSPVAK-ITTVRILIALASI 921
+G +W+Y +K+ +D T++RYKARLVAKGY Q GID+ +TF+ K I + +
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 922 NSWHLHQLDVNNAFLHGDLSENVYMSVPQGV-HSPKPNQVCKLLKSLYGLKQASRKWYEK 980
W +HQ DV NAFLHG L E VYM +P G S N+VC+L K+LYGLKQ+ R W+ +
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 981 LTGFLLLQGYKQS 993
T +L GYKQS
Sbjct: 362 FTQAMLSLGYKQS 400
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.319 0.134 0.408
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 67,634,520
Number of Sequences: 63676
Number of extensions: 1086731
Number of successful extensions: 6298
Number of sequences better than 10.0: 164
Number of HSP's better than 10.0 without gapping: 6134
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6245
length of query: 1336
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1227
effective length of database: 5,698,948
effective search space: 6992609196
effective search space used: 6992609196
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)
Medicago: description of AC148395.6