
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0211.5
(1428 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 523 e-148
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 516 e-146
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 194 2e-49
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 174 2e-43
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 161 2e-39
BQ296988 similar to GP|21740616|em OSJNBb0089K24.12 {Oryza sativ... 153 6e-37
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 152 1e-36
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 151 2e-36
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 144 2e-34
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 144 3e-34
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 141 2e-33
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 122 6e-31
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 133 6e-31
BU548243 131 2e-30
TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 ... 127 5e-29
BM307983 124 4e-28
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 121 2e-27
BE474381 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, pa... 93 6e-27
CO982036 119 1e-26
TC211663 117 5e-26
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 523 bits (1346), Expect = e-148
Identities = 366/1301 (28%), Positives = 611/1301 (46%), Gaps = 21/1301 (1%)
Frame = +1
Query: 142 IKNIEDERNKNQV-VRFLRG----LNDQFSGVRSQLMLL----DNLPNVNRVFALIAQQE 192
I N+E E+ ++ + L+G LN + + + +L D L V ++ + Q
Sbjct: 1186 IANLEAEKEAHEEEISELKGEVGFLNSKLENMTKSIKMLNKGSDMLDEVLQLGKNVGNQ- 1362
Query: 193 RQFSFENVSGSRALIASRENSNDNRGSQSDHNRNSQSNYGGCQSSSGNNRYSSKKCSYCG 252
R F + S R + + ++ G+ +R+ ++G Q S ++ +C YCG
Sbjct: 1363 RGLGFNHKSAGRTTMTEFVPAKNSTGATMSQHRSR--HHGTQQKKSKRKKW---RCHYCG 1527
Query: 253 KMGHTVEDCYKKHGFPPGFKFKNPKYAQRSANLAHSTGEDQDSVDQENASGQDAARFGFT 312
K GH CY HG +P + +S+ S+G V +
Sbjct: 1528 KYGHIKPFCYHLHG--------HPHHGTQSS----SSGRKMMWVPK-------------- 1629
Query: 313 ADQYHHLLALLPPSESKASSSQHTASVNSCAQVLPTKNGNGTSQVLPTRNGNPLDTTWIL 372
H +++L+ + +AS+ + W L
Sbjct: 1630 ----HKIVSLVVHTSLRASAKED----------------------------------WYL 1695
Query: 373 DTGATDHICNTLSYFSSYKHVEPIPVSLPNGIVETTTIKGTIQITPSFILANVLFLPNFE 432
D+G + H+ + + + V+ +G T G + L VL +
Sbjct: 1696 DSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKLVHDGLPSLNKVLLVKGLT 1875
Query: 433 FNLISVHKLVKCLRYRLIFEDDLCLIQDSNACKMIGTVRAVKGLYIFNKSSISSLASCNS 492
NLIS+ +L + + F CL+ + + ++ R+ Y++ S
Sbjct: 1876 ANLISISQLCD-EGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQETSY------ 2034
Query: 493 ISTSVNPSVHSSSICTFQSNVH-NLWHYRLGHPSLVKGQSINEL-----FPYVQCSKAHV 546
SS C F +WH R GH L + I + P ++ + +
Sbjct: 2035 -----------SSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRI 2181
Query: 547 CDVCPVAKQKRMSFPLSVTQSTA-IFQLIHVDIWGPVSIVSLHGFSYFLTIVDDYSRFTW 605
C C + KQ +MS Q+T+ + +L+H+D+ GP+ + SL G Y +VDD+SRFTW
Sbjct: 2182 CGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTW 2361
Query: 606 IYLLKS*AEVKNLVQEFCALVANQFETAVKTIRSDNGKEFS---LPQFYATKGIVHQTSC 662
+ ++ ++ + +E + + + +K IRSD+G+EF +F ++GI H+ S
Sbjct: 2362 VNFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSA 2541
Query: 663 VETPQQNSIVERKHQHILNVARALLFQAHLPKIFWAHAIVHAVFLINRLPSPVLDGKCPF 722
TPQQN IVERK++ + AR +L LP WA A+ A ++ NR+ +
Sbjct: 2542 AITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLY 2721
Query: 723 QILHKVLPDLTNLKVFGSLCFASTLVSHRTKFDPRAKRCVFLGFKPGTKGYIVYDLKSND 782
+I P + + +FGS C+ R K DP++ +FLG+ ++ Y V++ ++
Sbjct: 2722 EIWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRT 2901
Query: 783 IAISRNVVFHENMFPYPTQPSDLNQHQSCPLPQASFLSDEPFEYATQSPSEALTQPNTSE 842
+ S NVV + P + D+ + + + E A S S A +PN ++
Sbjct: 2902 VMESINVVVDDLT---PARKKDVEEDVRTSGDNVADTAKSA-ENAENSDS-ATDEPNINQ 3066
Query: 843 PSSDPVLDNNHRTSTRTRKQ-PSYLQDYHCSLIASTAVSSSSSSKGTSYPLSKVISYCNL 901
P P S R +K P L +I ++ S+ + V + C
Sbjct: 3067 PDKRP--------SIRIQKMHPKEL------IIGDPNRGVTTRSR----EIEIVSNSC-- 3186
Query: 902 APAYHTFVMNITAVVEPKRYSEAVKHDSWRKAMDQEIEALERNHTWILVDKPHDKTPIGC 961
+ +EPK EA+ + W AM +E+E +RN W LV +P IG
Sbjct: 3187 ----------FVSKIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGT 3336
Query: 962 KWVYRIKYKQDGTLDRYKARLVVKGYTQLEGIDFIDTFSPVAKMTTLRVLLALASSYNWF 1021
KW+++ K ++G + R KARLV +GYTQ+EG+DF +TF+PVA++ ++R+LL +A +
Sbjct: 3337 KWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFK 3516
Query: 1022 LHQLDVDNAFLHAQLDEEIYMSLPQG-LHTEKPNQVCLLQKSLYGLKQASRQWYTTLCKA 1080
L+Q+DV +AFL+ L+EE Y+ P+G + P+ V L+K+LYGLKQA R WY L +
Sbjct: 3517 LYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERLTEF 3696
Query: 1081 LHTLGFSPSSADHTLYIKKGTTGSFTALLLYVDDVLLTGNDLHEIQLVKESLHAQFRIKD 1140
L G+ D TL++K+ A + YVDD++ G ++ + + ++F +
Sbjct: 3697 LTQQGYRKGGIDKTLFVKQDAENLMIAQI-YVDDIVFGGMSNEMLRHFVQQMQSEFEMSL 3873
Query: 1141 MGEAKFFLGLEIARSKAGIVLNQRKYALELLSDSGLLGGKPTTTPMDSSQKFGLSTDTPL 1200
+GE +FLGL++ + + I L+Q KYA ++ G+ TP + K
Sbjct: 3874 VGELTYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTS 4053
Query: 1201 SDISSYRRLIGKLLYLTTTRPDIAYVVNQLSQFLSAPTNVHEAAAHRVLRYIKGNPGCGL 1260
D S YR +IG LLYLT +RPDI Y V +++ + P H R+L+Y+ G G+
Sbjct: 4054 VDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYGI 4233
Query: 1261 FYPADSSTTLTAFSDSDWAGCLDTRKSITGYCMFLGSSLISWRSKKQTTTSRSSCEAEYR 1320
Y S + L + D+DWAG D RKS +G C +LG++LISW SKKQ S S+ EAEY
Sbjct: 4234 MYCHCSDSMLVGYCDADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYI 4413
Query: 1321 AMAATVCEVQWLSYLLHDLQAPPSAPVSMYCDNQSAMHIAHNPSYHERTKHIEVDCHIVR 1380
A ++ ++ W+ +L + +++YCDN SA++I+ NP H RTKHI++ H +R
Sbjct: 4414 AAGSSCSQLVWMKQMLKEYNVEQDV-MTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIR 4590
Query: 1381 EKVQQGLVHLLPIASSHQLADIFTKPLTPAPFRHIFSKLGM 1421
+ V ++ L + + Q+ADIFTK L F + KLG+
Sbjct: 4591 DLVDDKVITLEHVDTEEQIADIFTKALDANQFEKLRGKLGI 4713
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 516 bits (1330), Expect = e-146
Identities = 325/1096 (29%), Positives = 542/1096 (48%), Gaps = 10/1096 (0%)
Frame = +1
Query: 336 TASVNSCAQVLPTKNGNGTSQVLPTRNGNPLDTTWILDTGATDHICNTLSYFSSYKHVEP 395
T S NS +++ S V+ T W LD+G + H+ + + +
Sbjct: 1582 TQSSNSRKKMMWVPKHKAVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCST 1761
Query: 396 IPVSLPNGIVETTTIKGTIQITPSFILANVLFLPNFEFNLISVHKLVKCLRYRLIFEDDL 455
V+ +G G + L VL + NLIS+ +L + + F
Sbjct: 1762 SYVTFGDGSKGKIIGMGKLVHDGLPSLNKVLLVKGLTANLISISQLCD-EGFNVNFTKSE 1938
Query: 456 CLIQDSNACKMIGTVRAVKGLYIFNKSSISSLASCNSISTSVNPSVHSSSICTFQSNVHN 515
CL+ + + ++ R+ Y++ S +SS+ + + +
Sbjct: 1939 CLVTNEKSEVLMKGSRSKDNCYLWTPQETS----------------YSSTCLSSKEDEVR 2070
Query: 516 LWHYRLGHPSLVKGQSINEL-----FPYVQCSKAHVCDVCPVAKQKRMSFPLSVTQSTA- 569
+WH R GH L + I + P ++ + +C C + KQ +MS Q+T+
Sbjct: 2071 IWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSR 2250
Query: 570 IFQLIHVDIWGPVSIVSLHGFSYFLTIVDDYSRFTWIYLLKS*AEVKNLVQEFCALVANQ 629
+ +L+H+D+ GP+ + SL G Y +VDD+SRFTW+ ++ +E + +E + +
Sbjct: 2251 VLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSETFEVFKELSLRLQRE 2430
Query: 630 FETAVKTIRSDNGKEFS---LPQFYATKGIVHQTSCVETPQQNSIVERKHQHILNVARAL 686
+ +K IRSD+G+EF +F ++GI H+ S TPQQN IVERK++ + AR +
Sbjct: 2431 KDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVM 2610
Query: 687 LFQAHLPKIFWAHAIVHAVFLINRLPSPVLDGKCPFQILHKVLPDLTNLKVFGSLCFAST 746
L LP WA A+ A ++ NR+ ++I P + + +FGS C+
Sbjct: 2611 LHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPSVKHFHIFGSPCYILA 2790
Query: 747 LVSHRTKFDPRAKRCVFLGFKPGTKGYIVYDLKSNDIAISRNVVFHENMFPYPTQPSDLN 806
R K DP++ +FLG+ ++ Y V++ ++ + S NVV DL+
Sbjct: 2791 DREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVV-----------DDLS 2937
Query: 807 QHQSCPLPQASFLSDEPFEYATQSPSEALTQPNTSEPSSDPVLDNNHRTSTRTRKQPSYL 866
+ + + S + A +S A + ++ S+ + + R+STR +K
Sbjct: 2938 PARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDESN--INQPDKRSSTRIQKM---- 3099
Query: 867 QDYHCSLIASTAVSSSSSSKGTSYPLSKVISYCNLAPAYHTFVMNITAVVEPKRYSEAVK 926
+ LI ++G + +V N + +EPK EA+
Sbjct: 3100 --HPKELIIG------DPNRGVTTRSREVEIVSNSC---------FVSKIEPKNVKEALT 3228
Query: 927 HDSWRKAMDQEIEALERNHTWILVDKPHDKTPIGCKWVYRIKYKQDGTLDRYKARLVVKG 986
+ W AM +E+E +RN W LV +P IG KW+++ K ++G + R KARLV +G
Sbjct: 3229 DEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQG 3408
Query: 987 YTQLEGIDFIDTFSPVAKMTTLRVLLALASSYNWFLHQLDVDNAFLHAQLDEEIYMSLPQ 1046
YTQ+EG+DF +TF+PVA++ ++R+LL +A + L+Q+DV +AFL+ L+EE+Y+ P+
Sbjct: 3409 YTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPK 3588
Query: 1047 GL-HTEKPNQVCLLQKSLYGLKQASRQWYTTLCKALHTLGFSPSSADHTLYIKKGTTGSF 1105
G P+ V L+K+LYGLKQA R WY L + L G+ D TL++K+
Sbjct: 3589 GFADPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLM 3768
Query: 1106 TALLLYVDDVLLTGNDLHEIQLVKESLHAQFRIKDMGEAKFFLGLEIARSKAGIVLNQRK 1165
A + YVDD++ G ++ + + ++F + +GE +FLGL++ + + I L+Q +
Sbjct: 3769 IAQI-YVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSR 3945
Query: 1166 YALELLSDSGLLGGKPTTTPMDSSQKFGLSTDTPLSDISSYRRLIGKLLYLTTTRPDIAY 1225
YA ++ G+ TP + K D S YR +IG LLYLT +RPDI Y
Sbjct: 3946 YAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITY 4125
Query: 1226 VVNQLSQFLSAPTNVHEAAAHRVLRYIKGNPGCGLFYPADSSTTLTAFSDSDWAGCLDTR 1285
V +++ + P H R+L+Y+ G G+ Y S+ L + D+DWAG D R
Sbjct: 4126 AVGVCARYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDR 4305
Query: 1286 KSITGYCMFLGSSLISWRSKKQTTTSRSSCEAEYRAMAATVCEVQWLSYLLHDLQAPPSA 1345
KS +G C +LG++LISW SKKQ S S+ EAEY A ++ ++ W+ +L +
Sbjct: 4306 KSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDV 4485
Query: 1346 PVSMYCDNQSAMHIAHNPSYHERTKHIEVDCHIVREKVQQGLVHLLPIASSHQLADIFTK 1405
+++YCDN SA++I+ NP H RTKHI++ H +R+ V ++ L + + Q+ADIFTK
Sbjct: 4486 -MTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTK 4662
Query: 1406 PLTPAPFRHIFSKLGM 1421
L F + KLG+
Sbjct: 4663 ALDANQFEKLRGKLGI 4710
Score = 37.7 bits (86), Expect = 0.037
Identities = 22/80 (27%), Positives = 34/80 (42%)
Frame = +1
Query: 189 AQQERQFSFENVSGSRALIASRENSNDNRGSQSDHNRNSQSNYGGCQSSSGNNRYSSKKC 248
A +R F S R + + + G+ +R+ ++G Q S ++ +C
Sbjct: 1348 AGNQRGLGFNPKSAGRTTMTEFVPAKNRTGATMSQHRSR--HHGMQQKKSKRKKW---RC 1512
Query: 249 SYCGKMGHTVEDCYKKHGFP 268
YCGK GH CY HG P
Sbjct: 1513 HYCGKYGHIKPFCYHLHGHP 1572
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 194 bits (494), Expect = 2e-49
Identities = 105/188 (55%), Positives = 132/188 (69%), Gaps = 4/188 (2%)
Frame = +3
Query: 1244 AAHRVLRYIKGNPGCGLFYPADSSTTLTAFSDSDWAGCLDTRKSITGYCMFLGSSLISWR 1303
AA RVL+Y+KG P GL + +S + FSD+DWA C+D+ KSIT YC FLGSSLISW+
Sbjct: 18 AATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLISWK 197
Query: 1304 SKKQTTTSR--SSCEAEYRAMAATVCEVQWLSYLLHDLQAPPSAPVSMYCDNQSAMH-IA 1360
+KKQ T SR SS EA+YRA+ +T CE+QWL+YLL DL +YCDNQSA+ +
Sbjct: 198 AKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLHV-----TLIYCDNQSALQ*LP 362
Query: 1361 HNPSYHERTKHIEVDCHIVREKVQQGLVH-LLPIASSHQLADIFTKPLTPAPFRHIFSKL 1419
YH + +E+DCHIVREK QQGL+H LLP++SS+QLADIFTK L+P F SKL
Sbjct: 363 IKVIYHGQ---LEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSNLSKL 533
Query: 1420 GMYDIHSP 1427
G+ DI P
Sbjct: 534 GLSDIFLP 557
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 174 bits (441), Expect = 2e-43
Identities = 78/161 (48%), Positives = 110/161 (67%)
Frame = +2
Query: 1267 STTLTAFSDSDWAGCLDTRKSITGYCMFLGSSLISWRSKKQTTTSRSSCEAEYRAMAATV 1326
+T L+ + D+DWAGC R+S +GYC+F+G +L+SW+SKKQT +RSS EAEYR+MA
Sbjct: 8 NTQLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVT 187
Query: 1327 CEVQWLSYLLHDLQAPPSAPVSMYCDNQSAMHIAHNPSYHERTKHIEVDCHIVREKVQQG 1386
CE+ W+ L +L+ + +YCDNQ+A+HIA NP +HERTKHIE+DCH +REK+
Sbjct: 188 CELMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSK 367
Query: 1387 LVHLLPIASSHQLADIFTKPLTPAPFRHIFSKLGMYDIHSP 1427
+ I S+ Q DI TK L + + SKLG YD+++P
Sbjct: 368 EIVTEFIGSNDQPVDILTKSLRGPKIQIVCSKLGAYDLYAP 490
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 161 bits (408), Expect = 2e-39
Identities = 84/221 (38%), Positives = 132/221 (59%), Gaps = 3/221 (1%)
Frame = +1
Query: 1202 DISSYRRLIGKLLYLTTTRPDIAYVVNQLSQFLSAPTNVHEAAAHRVLRYIKGNPGCGLF 1261
D++ +RRLIG L YL +RP+I + V+ +S+F+ P H AA RVLR IKG G G+
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVL 189
Query: 1262 YPADSST---TLTAFSDSDWAGCLDTRKSITGYCMFLGSSLISWRSKKQTTTSRSSCEAE 1318
+P + + L ++DSDW + KS GY + ++ SKKQ + S+CEAE
Sbjct: 190 FPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAE 369
Query: 1319 YRAMAATVCEVQWLSYLLHDLQAPPSAPVSMYCDNQSAMHIAHNPSYHERTKHIEVDCHI 1378
Y A + C+ W+ LL +L+ PV++ DN+SA+++A +P+ H R+KHIE+ H
Sbjct: 370 YVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHY 549
Query: 1379 VREKVQQGLVHLLPIASSHQLADIFTKPLTPAPFRHIFSKL 1419
+R++V +G V + + QLAD+ TKP+ + F+ I S+L
Sbjct: 550 IRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>BQ296988 similar to GP|21740616|em OSJNBb0089K24.12 {Oryza sativa (japonica
cultivar-group)}, partial (1%)
Length = 408
Score = 153 bits (386), Expect = 6e-37
Identities = 66/136 (48%), Positives = 102/136 (74%)
Frame = -1
Query: 45 RCNMLVLSWLIKSISVEIAQSILWRDKATDVWNELRERFAQADLFRISELQEEIFSLKQG 104
RCNML+ SW++ S+ I++SI++ D A+DVW +L+ERF+Q DL R+SE+Q+EI++L QG
Sbjct: 408 RCNMLIHSWILNSVEPSISRSIVFMDNASDVWLDLKERFSQGDLVRVSEIQQEIYALTQG 229
Query: 105 DNSVSKFYTSMKTLWDELDILNPLPVCTCNPRCACGAIKNIEDERNKNQVVRFLRGLNDQ 164
SV+ FY+ K LW+EL+I P+P CTC+ RC+C A++ + V+RFL GLND+
Sbjct: 228 TRSVTTFYSDKKALWEELEIYMPIPNCTCHHRCSCDAMRLARRHHHTLHVMRFLTGLNDE 49
Query: 165 FSGVRSQLMLLDNLPN 180
F+ V+SQ++L++ LP+
Sbjct: 48 FNAVKSQILLIEPLPS 1
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 152 bits (383), Expect = 1e-36
Identities = 70/134 (52%), Positives = 95/134 (70%)
Frame = +3
Query: 912 ITAVVEPKRYSEAVKHDSWRKAMDQEIEALERNHTWILVDKPHDKTPIGCKWVYRIKYKQ 971
++++ P EA+ H WR+AM E++ALE N TW LV P KT +GC+WVY +K
Sbjct: 3 LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182
Query: 972 DGTLDRYKARLVVKGYTQLEGIDFIDTFSPVAKMTTLRVLLALASSYNWFLHQLDVDNAF 1031
+G +DR KARLV KGYTQ+ GI++ DTFSPV +TT+R+ LA+A+ +W LHQLD+ NAF
Sbjct: 183 NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362
Query: 1032 LHAQLDEEIYMSLP 1045
LH L+E+IYM P
Sbjct: 363 LHGDLEEDIYMEQP 404
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 151 bits (381), Expect = 2e-36
Identities = 72/134 (53%), Positives = 93/134 (68%), Gaps = 1/134 (0%)
Frame = -2
Query: 950 VDKPHDKTPIGCKWVYRIKYKQDGTLDRYKARLVVKGYTQLEGIDFIDTFSPVAKMTTLR 1009
V P KTP+GC+WVY +K G +DR KARLV KGYTQ+ GID+ DTFSPVAK+TT+R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 1010 VLLALASSYNWFLHQLDVDNAFLHAQLDEEIYMSLPQGLHTE-KPNQVCLLQKSLYGLKQ 1068
+ LA+A+ +W LHQLD+ NAFLH L+E+IYM P G + + VC L +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 1069 ASRQWYTTLCKALH 1082
+ R W+ +H
Sbjct: 46 SPRAWFGKFSHVVH 5
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 144 bits (364), Expect = 2e-34
Identities = 79/180 (43%), Positives = 110/180 (60%), Gaps = 2/180 (1%)
Frame = +1
Query: 973 GTLDRYKARLVVKGYTQLEGIDFIDTFSPVAKMTTLRVLLALASSYNWFLHQLDVDNAFL 1032
GT+D++KARLV K YTQ+ G D+ TFSPVAKM + +L ++A +W L LD NAFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 1033 HAQLDEEIYMSLPQGL--HTEKPNQVCLLQKSLYGLKQASRQWYTTLCKALHTLGFSPSS 1090
H L+EE+YM P G E N VC L +S YGLKQ+ R W C A + +
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAWPFLYCGA--AIWYDSHE 381
Query: 1091 ADHTLYIKKGTTGSFTALLLYVDDVLLTGNDLHEIQLVKESLHAQFRIKDMGEAKFFLGL 1150
ADH+++ G L++YVDD+ +TG+D H I +K L QF+ KD+G+ ++FLG+
Sbjct: 382 ADHSVFYCHSPQGCI-YLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 144 bits (363), Expect = 3e-34
Identities = 72/150 (48%), Positives = 99/150 (66%), Gaps = 1/150 (0%)
Frame = -3
Query: 933 AMDQEIEALERNHTWILVDKPHDKTPIGCKWVYRIKYKQDGTLDRYKARLVVKGYTQLEG 992
AM +E+ ERN+ W LV+KP + IG KWV+R K + G + R KARLV KGY Q EG
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 993 IDFIDTFSPVAKMTTLRVLLALASSYNWFLHQLDVDNAFLHAQLDEEIYMSLPQGLH-TE 1051
ID+ +T++PVA++ +R+LLA S N+ L+Q+DV +AFL+ + EE+Y+ P G +
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 1052 KPNQVCLLQKSLYGLKQASRQWYTTLCKAL 1081
KP V LQK+LYGLKQA R WY + L
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFL 9
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 141 bits (355), Expect = 2e-33
Identities = 72/140 (51%), Positives = 99/140 (70%), Gaps = 1/140 (0%)
Frame = -2
Query: 1061 KSLYGLKQASRQWYTTLCKALHTLGFSPSSADHTLY-IKKGTTGSFTALLLYVDDVLLTG 1119
KSLYGLKQASR+WY L L G+ S +D++L+ + KG T FTALL+YVDD++L G
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNT--FTALLVYVDDIILAG 247
Query: 1120 NDLHEIQLVKESLHAQFRIKDMGEAKFFLGLEIARSKAGIVLNQRKYALELLSDSGLLGG 1179
+ + E +K L F+IK++G+ K+FLGLE+A S+ GI ++QRKY L+LL DSGLLG
Sbjct: 246 DSIDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGC 67
Query: 1180 KPTTTPMDSSQKFGLSTDTP 1199
KP +TP+D+S K + TP
Sbjct: 66 KPASTPLDTSIKLHSAAGTP 7
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 122 bits (306), Expect(2) = 6e-31
Identities = 54/109 (49%), Positives = 74/109 (67%)
Frame = +2
Query: 1275 DSDWAGCLDTRKSITGYCMFLGSSLISWRSKKQTTTSRSSCEAEYRAMAATVCEVQWLSY 1334
D++WA R S GYC+ +G +L+ W+S K +RSS EAEY+AM CE+ W+
Sbjct: 8 DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187
Query: 1335 LLHDLQAPPSAPVSMYCDNQSAMHIAHNPSYHERTKHIEVDCHIVREKV 1383
LL +L+ + + + CDNQ+A+HIA NP +HERTKHIE+DCH VREKV
Sbjct: 188 LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
Score = 32.0 bits (71), Expect(2) = 6e-31
Identities = 14/34 (41%), Positives = 25/34 (73%)
Frame = +3
Query: 1393 IASSHQLADIFTKPLTPAPFRHIFSKLGMYDIHS 1426
++S+ QLA+IFTK L ++I SKLG +++++
Sbjct: 363 VSSNDQLANIFTKSLRGPRIQNICSKLGAFELYA 464
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 133 bits (334), Expect = 6e-31
Identities = 91/302 (30%), Positives = 154/302 (50%), Gaps = 3/302 (0%)
Frame = +2
Query: 395 PIPVSLPNGIVETTTIKGTIQITPSFILANVLFLPNFEFNLISVHKLVKCLRYRLIFEDD 454
P ++L +G T G + T S L +V+F+ FN+ S+ +L + + F+ +
Sbjct: 11 PYFITLADGSRVVATGIGHVSPTSSLSLNSVVFILGCPFNITSLSQLTRFRNCSVTFDAN 190
Query: 455 LCLIQDSNACKMIGTVRAVKGLYIFNKSSISSLASCNSISTSVNPSVHSSSICTFQSNVH 514
+IQ+ IG GLY + K ++S + C+++++
Sbjct: 191 SFVIQECGTGWTIGVGIESHGLY-YLKPNLSWV--CSAVTSP------------------ 307
Query: 515 NLWHYRLGHPSLVKGQSINELFPYVQCSKAHVCDVCPVAKQKRMSFPLSVTQSTAIFQLI 574
L H RLGHP L K + + P ++ K C+ C + K R S ++ + F +I
Sbjct: 308 KLLHERLGHPHLSK---LKIMVPSLEKIKDLFCESCQLGKHVRSSXRHVESRVDSPFLVI 478
Query: 575 HVDIWGPVSIVSLHGFSYFLTIVDDYSRFTWIYLLKS*AEVKNLVQEFCALVANQFETAV 634
H DIWGP + S+ + YF+T +D++S+ T ++L+K +E+ + + + QF +
Sbjct: 479 HXDIWGPNRVSSM-SYRYFVTFIDEFSQCTRVFLMKERSEILSFLTSVNK-IKTQFGKTI 652
Query: 635 KTIRSDNGKEFS---LPQFYATKGIVHQTSCVETPQQNSIVERKHQHILNVARALLFQAH 691
K +RSDN KE+ + F + +GI+HQ SC TPQQN I ERK++H++ AR LL A+
Sbjct: 653 KILRSDNAKEYFSSVISPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHAN 832
Query: 692 LP 693
P
Sbjct: 833 EP 838
>BU548243
Length = 599
Score = 131 bits (330), Expect = 2e-30
Identities = 70/149 (46%), Positives = 95/149 (62%)
Frame = -1
Query: 1271 TAFSDSDWAGCLDTRKSITGYCMFLGSSLISWRSKKQTTTSRSSCEAEYRAMAATVCEVQ 1330
TA D+ WA +D +S G +FLG +LISW S+KQ T++SS EAEYR++A T E+
Sbjct: 599 TALCDAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELT 420
Query: 1331 WLSYLLHDLQAPPSAPVSMYCDNQSAMHIAHNPSYHERTKHIEVDCHIVREKVQQGLVHL 1390
W+ LL +LQ P + PV + CDN+SA+ IAHN +H RTKH+E+D V EKV + +
Sbjct: 419 WIQALLMELQIPFTPPV-ILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQI 243
Query: 1391 LPIASSHQLADIFTKPLTPAPFRHIFSKL 1419
I + Q A I TKPL+ A F + SKL
Sbjct: 242 FHIPALDQWAGILTKPLSSARFTFLKSKL 156
>TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 (Fragment)
, partial (21%)
Length = 912
Score = 127 bits (318), Expect = 5e-29
Identities = 53/103 (51%), Positives = 80/103 (77%)
Frame = -2
Query: 692 LPKIFWAHAIVHAVFLINRLPSPVLDGKCPFQILHKVLPDLTNLKVFGSLCFASTLVSHR 751
+P FW +A++HA +LIN +P+P L P++ LH +PD+++L++FG LC+AST+ ++R
Sbjct: 911 MPPNFWNYALLHAAYLINCIPTPFLQNTSPYERLHGHIPDISHLRIFGCLCYASTIKANR 732
Query: 752 TKFDPRAKRCVFLGFKPGTKGYIVYDLKSNDIAISRNVVFHEN 794
K +PRA C+F+GFKP TKGY++YDL S++I SRNVVF+EN
Sbjct: 731 KKLEPRAHPCIFIGFKPNTKGYMLYDLHSHNIITSRNVVFYEN 603
>BM307983
Length = 406
Score = 124 bits (310), Expect = 4e-28
Identities = 61/133 (45%), Positives = 85/133 (63%), Gaps = 2/133 (1%)
Frame = +2
Query: 959 IGCKWVYRIKYKQDGTLDRYKARLVVKGYTQLEGIDFIDTFSPVAK-MTTLRVLLALASS 1017
+GC+W+Y +KY D TLDRYKARLV KGY Q GID+ +TF+ K + + +
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 1018 YNWFLHQLDVDNAFLHAQLDEEIYMSLPQGLHTEK-PNQVCLLQKSLYGLKQASRQWYTT 1076
+ W +HQ DV NAFLH L+EE+YM +P G N+VC L+K+LYGLKQ+ R W+
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 1077 LCKALHTLGFSPS 1089
+A+ +LG+ S
Sbjct: 362 FTQAMLSLGYKQS 400
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 121 bits (304), Expect = 2e-27
Identities = 60/139 (43%), Positives = 85/139 (60%)
Frame = +3
Query: 1283 DTRKSITGYCMFLGSSLISWRSKKQTTTSRSSCEAEYRAMAATVCEVQWLSYLLHDLQAP 1342
D RKS TG+ F+G + +W SKKQ + S+CEAEY A + VC WL LL +L+ P
Sbjct: 9 DDRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKMP 188
Query: 1343 PSAPVSMYCDNQSAMHIAHNPSYHERTKHIEVDCHIVREKVQQGLVHLLPIASSHQLADI 1402
P+ + DN+SA+ +A NP +HE++KHI+ H +RE +++ V L + S Q ADI
Sbjct: 189 QEEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAADI 368
Query: 1403 FTKPLTPAPFRHIFSKLGM 1421
FTKPL F + S LG+
Sbjct: 369 FTKPLKLETFVKLRSMLGV 425
>BE474381 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial (20%)
Length = 406
Score = 92.8 bits (229), Expect(2) = 6e-27
Identities = 42/91 (46%), Positives = 64/91 (70%)
Frame = +3
Query: 1266 SSTTLTAFSDSDWAGCLDTRKSITGYCMFLGSSLISWRSKKQTTTSRSSCEAEYRAMAAT 1325
+S+ ++D+DWAG + R+S +GYC F+G +L+S SKKQ+ +RSS EAE+RA+A
Sbjct: 135 TSSGRKGYTDADWAGSVTDRRSTSGYCTFVGGNLVS*-SKKQSVVARSSAEAEFRALAHG 311
Query: 1326 VCEVQWLSYLLHDLQAPPSAPVSMYCDNQSA 1356
+CE W+ LL +L+ S P+ +YCDN+SA
Sbjct: 312 ICETLWVKKLLQELKVHSSPPIKLYCDNKSA 404
Score = 48.1 bits (113), Expect(2) = 6e-27
Identities = 21/39 (53%), Positives = 30/39 (76%)
Frame = +1
Query: 1223 IAYVVNQLSQFLSAPTNVHEAAAHRVLRYIKGNPGCGLF 1261
IA+ V+ +SQF+ AP + H AA R+LRY+KG+PG GL+
Sbjct: 7 IAFAVSMVSQFMHAPGHEHLEAAFRILRYLKGSPGRGLY 123
>CO982036
Length = 674
Score = 119 bits (297), Expect = 1e-26
Identities = 70/203 (34%), Positives = 115/203 (56%), Gaps = 3/203 (1%)
Frame = -2
Query: 1108 LLLYVDDVLLTGNDLHEIQLVKESLHAQFRIKDMGEAKFFLGLEIARSKAGIVLNQRKYA 1167
LL+YVD +++TG+ IQ + L++ F +K +G+ +F+ +E+ +S ++ + R
Sbjct: 646 LLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEV-KSMPDLLFSLRTSI 473
Query: 1168 LELLSDSGLLGGKPTTTPMDSSQKFGLSTDTPLSDISSYRRLIGKLLYLTTTRPDIAYVV 1227
E+ +P ++PM ++ K S S + YR ++G L Y T RP+I++ V
Sbjct: 472 FEIFCRKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVIRPEISFAV 293
Query: 1228 NQLSQFLSAPTNVHEAAAHRVLRYIKGNPGCGL-FYPADSSTTL--TAFSDSDWAGCLDT 1284
N++ QF+S P + H R+LRY+KG+ GL PA SS L F D+DWA +D
Sbjct: 292 NKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCDADWASAVDD 113
Query: 1285 RKSITGYCMFLGSSLISWRSKKQ 1307
++S +G +FLG +LISW KQ
Sbjct: 112 KRSTSGAAVFLGPNLISWWXXKQ 44
>TC211663
Length = 426
Score = 117 bits (292), Expect = 5e-26
Identities = 55/133 (41%), Positives = 91/133 (68%)
Frame = -3
Query: 62 IAQSILWRDKATDVWNELRERFAQADLFRISELQEEIFSLKQGDNSVSKFYTSMKTLWDE 121
IAQ +++ D AT++WN+L+E F+Q DL +I+ELQEEI+ LKQG ++V F+T +K +W+E
Sbjct: 424 IAQIVIYFDHATNIWNDLKEGFSQGDLLQIAELQEEIYRLKQGSHTVLDFFTKLKFVWEE 245
Query: 122 LDILNPLPVCTCNPRCACGAIKNIEDERNKNQVVRFLRGLNDQFSGVRSQLMLLDNLPNV 181
LD + +CTC R ++ V+ FL+GL+++FS V S+++L+D+LP+
Sbjct: 244 LDNYGLMNLCTCPSR----------TYHQQDFVIHFLKGLDERFSVVCSEVLLMDHLPST 95
Query: 182 NRVFALIAQQERQ 194
R+F+++ Q E Q
Sbjct: 94 KRIFSMVIQHETQ 56
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.320 0.134 0.405
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 70,855,314
Number of Sequences: 63676
Number of extensions: 1126041
Number of successful extensions: 6869
Number of sequences better than 10.0: 198
Number of HSP's better than 10.0 without gapping: 6628
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6788
length of query: 1428
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1319
effective length of database: 5,698,948
effective search space: 7516912412
effective search space used: 7516912412
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0211.5