
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC137510.5 - phase: 0
(1519 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 384 e-106
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 383 e-106
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 182 1e-45
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 161 2e-39
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 154 3e-37
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 147 5e-35
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 141 2e-33
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 140 4e-33
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 139 1e-32
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 129 7e-32
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 134 4e-31
BM307983 130 3e-30
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 127 4e-29
BU764568 103 7e-29
BI969608 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, pa... 124 3e-28
CO983154 122 9e-28
CO981347 72 4e-27
BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberos... 119 1e-26
BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {... 70 1e-26
BM086359 118 2e-26
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 384 bits (986), Expect = e-106
Identities = 209/505 (41%), Positives = 302/505 (59%), Gaps = 1/505 (0%)
Frame = +1
Query: 989 EPNNFKEAVKDSGWRDAMRNEIQALEDNETWVMEKLPPGKKALGSKWVYKIKHHSDGSIE 1048
EP N KEA+ D W +AM+ E++ + NE W + P G +G+KW++K K + +G I
Sbjct: 3202 EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVIT 3381
Query: 1049 RLKARLVVFGHHQIEGIDYDETFAPVAKMVTVRTFLAVAAIKKWEVHQMDVHNAFLHGDL 1108
R KARLV G+ QIEG+D+DETFAPVA++ ++R L VA I K++++QMDV +AFL+G L
Sbjct: 3382 RNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYL 3561
Query: 1109 EEEVYMKVPPGFKN-TDPNLVCRLKKSLYGLKQAPRCWFAKLVTALKRYGFVQSYSDYSL 1167
EE Y++ P GF + T P+ V RLKK+LYGLKQAPR W+ +L L + G+ + D +L
Sbjct: 3562 NEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTL 3741
Query: 1168 FTLHRGEIQINVLVYVDDLIIAGNDIAALKIFKAYLGVCFHMKDLGVLKYFLGLEVARNH 1227
F E + +YVDD++ G L+ F + F M +G L YFLGL+V +
Sbjct: 3742 FVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQME 3921
Query: 1228 EGIYLCQRKYALEIIDETGLLGAKPADFPMEQHHKLALVSGKPLEDPEPYRRLIGRLIYL 1287
+ I+L Q KYA I+ + G+ A P H KL+ D YR +IG L+YL
Sbjct: 3922 DSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYL 4101
Query: 1288 SVTRPDLAYSVHILSQFMQKPCEEHWEAALRVVRYLKKHPGQGILLRSDSELKLEGWCDS 1347
+ +RPD+ Y+V + +++ P H R+++Y+ GI+ S+ L G+CD+
Sbjct: 4102 TASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDA 4281
Query: 1348 DWASCPLTRRSLTGWVVLLDLSPVSWKTKKQPTVSRSSAEAEYRSMAMTTCELKWLKQLL 1407
DWA R+S +G L + +SW +KKQ VS S+AEAEY + + +L W+KQ+L
Sbjct: 4282 DWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQML 4461
Query: 1408 GDLGVSHSQGMQLYCDSKSALHIAQNPVFHERTKHIEADCHFVRDAVVAGIICPLYVPTS 1467
+ V M LYCD+ SA++I++NPV H RTKHI+ H++RD V +I +V T
Sbjct: 4462 KEYNVEQDV-MTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTE 4638
Query: 1468 VQLADIFTKALGKAQFEFLLRKLGI 1492
Q+ADIFTKAL QFE L KLGI
Sbjct: 4639 EQIADIFTKALDANQFEKLRGKLGI 4713
Score = 184 bits (466), Expect = 3e-46
Identities = 130/456 (28%), Positives = 207/456 (44%), Gaps = 10/456 (2%)
Frame = +1
Query: 353 HAPPHSSEAAGISGITPAQWQQILDALNISKTKDRLHGKNDISWIIDTGASHHVTGNFSC 412
H PH + SG + + T R K D W +D+G S H+TG
Sbjct: 1564 HGHPHHGTQSSSSGRKMMWVPKHKIVSLVVHTSLRASAKED--WYLDSGCSRHMTGVKEF 1737
Query: 413 LINGKRITNTPVGLPNGKDATAIQEGSVILDGGLRLNNVLFVPQLTCNLISVTQLIDDSN 472
L+N + + + V +G G ++ DG LN VL V LT NLIS++QL D+
Sbjct: 1738 LVNIEPCSTSYVTFGDGSKGKITGMGKLVHDGLPSLNKVLLVKGLTANLISISQLCDEG- 1914
Query: 473 CIVQFTNALCVIQDRTTRTLIGAGERIDGLYFFRGVPKVHA--LMVEGDSAMDLWHKRLG 530
V FT + C++ + + L+ D Y + ++ + + + +WH+R G
Sbjct: 1915 FNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQETSYSSTCLFSKEDEVKIWHQRFG 2094
Query: 531 HPSEKVLKFIPHVSQ-----HSRSKNNRPCDVCPRAKQHRDSFP-LSENNAASLFELVHC 584
H + +K I + + + R C C KQ + S L + + EL+H
Sbjct: 2095 HLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHM 2274
Query: 585 DLWGSYRTRSSCGAQYYLTIVNDYSRAVWVYLLCNKTEIETMFLNFVAFVDRQFDKKIKK 644
DL G + S G +Y +V+D+SR WV + K++ +F + R+ D IK+
Sbjct: 2275 DLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSDTFEVFKELSLRLQREKDCVIKR 2454
Query: 645 VRSDNGTEFNCLR--DYFFNNGIVFETSCVGTPQQNGRVERKHQHIMNVARALRFQGHLP 702
+RSD+G EF + ++ + GI E S TPQQNG VERK++ + AR + LP
Sbjct: 2455 IRSDHGREFENSKFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELP 2634
Query: 703 MQFWGECVLTACYLINRTPSSVLNYKTPYEKLFGKVPKFDNMKIFGCLCYAHNQRRDGDK 762
W E + TACY+ NR T YE G+ P + IFG CY R K
Sbjct: 2635 YNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPTVKHFHIFGSPCYILADREQRRK 2814
Query: 763 FASRSRKCIFVGYPYGKKGWKLYDLESKEYIVSRDV 798
+S IF+GY + +++++ ++ + S +V
Sbjct: 2815 MDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINV 2922
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 383 bits (983), Expect = e-106
Identities = 209/505 (41%), Positives = 302/505 (59%), Gaps = 1/505 (0%)
Frame = +1
Query: 989 EPNNFKEAVKDSGWRDAMRNEIQALEDNETWVMEKLPPGKKALGSKWVYKIKHHSDGSIE 1048
EP N KEA+ D W +AM+ E++ + NE W + P G +G+KW++K K + +G I
Sbjct: 3199 EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVIT 3378
Query: 1049 RLKARLVVFGHHQIEGIDYDETFAPVAKMVTVRTFLAVAAIKKWEVHQMDVHNAFLHGDL 1108
R KARLV G+ QIEG+D+DETFAPVA++ ++R L VA I K++++QMDV +AFL+G L
Sbjct: 3379 RNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYL 3558
Query: 1109 EEEVYMKVPPGFKN-TDPNLVCRLKKSLYGLKQAPRCWFAKLVTALKRYGFVQSYSDYSL 1167
EEVY++ P GF + T P+ V RLKK+LYGLKQAPR W+ +L L + G+ + D +L
Sbjct: 3559 NEEVYVEQPKGFADPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTL 3738
Query: 1168 FTLHRGEIQINVLVYVDDLIIAGNDIAALKIFKAYLGVCFHMKDLGVLKYFLGLEVARNH 1227
F E + +YVDD++ G L+ F + F M +G L YFLGL+V +
Sbjct: 3739 FVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQME 3918
Query: 1228 EGIYLCQRKYALEIIDETGLLGAKPADFPMEQHHKLALVSGKPLEDPEPYRRLIGRLIYL 1287
+ I+L Q +YA I+ + G+ A P H KL+ D YR +IG L+YL
Sbjct: 3919 DSIFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYL 4098
Query: 1288 SVTRPDLAYSVHILSQFMQKPCEEHWEAALRVVRYLKKHPGQGILLRSDSELKLEGWCDS 1347
+ +RPD+ Y+V + +++ P H R+++Y+ GI+ S L G+CD+
Sbjct: 4099 TASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDA 4278
Query: 1348 DWASCPLTRRSLTGWVVLLDLSPVSWKTKKQPTVSRSSAEAEYRSMAMTTCELKWLKQLL 1407
DWA R+S +G L + +SW +KKQ VS S+AEAEY + + +L W+KQ+L
Sbjct: 4279 DWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQML 4458
Query: 1408 GDLGVSHSQGMQLYCDSKSALHIAQNPVFHERTKHIEADCHFVRDAVVAGIICPLYVPTS 1467
+ V M LYCD+ SA++I++NPV H RTKHI+ H++RD V +I +V T
Sbjct: 4459 KEYNVEQDV-MTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTE 4635
Query: 1468 VQLADIFTKALGKAQFEFLLRKLGI 1492
Q+ADIFTKAL QFE L KLGI
Sbjct: 4636 EQIADIFTKALDANQFEKLRGKLGI 4710
Score = 186 bits (473), Expect = 5e-47
Identities = 128/425 (30%), Positives = 200/425 (46%), Gaps = 10/425 (2%)
Frame = +1
Query: 384 TKDRLHGKNDISWIIDTGASHHVTGNFSCLINGKRITNTPVGLPNGKDATAIQEGSVILD 443
T R K D W +D+G S H+TG L+N + + + V +G I G ++ D
Sbjct: 1654 TSLRASAKED--WYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKLVHD 1827
Query: 444 GGLRLNNVLFVPQLTCNLISVTQLIDDSNCIVQFTNALCVIQDRTTRTLIGAGERIDGLY 503
G LN VL V LT NLIS++QL D+ V FT + C++ + + L+ D Y
Sbjct: 1828 GLPSLNKVLLVKGLTANLISISQLCDEG-FNVNFTKSECLVTNEKSEVLMKGSRSKDNCY 2004
Query: 504 FFRGVPKVHA--LMVEGDSAMDLWHKRLGHPSEKVLKFIPHVSQ-----HSRSKNNRPCD 556
+ ++ + + + +WH+R GH + +K I + + + R C
Sbjct: 2005 LWTPQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICG 2184
Query: 557 VCPRAKQHRDSFP-LSENNAASLFELVHCDLWGSYRTRSSCGAQYYLTIVNDYSRAVWVY 615
C KQ + S L + + EL+H DL G + S G +Y +V+D+SR WV
Sbjct: 2185 ECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVN 2364
Query: 616 LLCNKTEIETMFLNFVAFVDRQFDKKIKKVRSDNGTEFNCLR--DYFFNNGIVFETSCVG 673
+ K+E +F + R+ D IK++RSD+G EF R ++ + GI E S
Sbjct: 2365 FIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAI 2544
Query: 674 TPQQNGRVERKHQHIMNVARALRFQGHLPMQFWGECVLTACYLINRTPSSVLNYKTPYEK 733
TPQQNG VERK++ + AR + LP W E + TACY+ NR T YE
Sbjct: 2545 TPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEI 2724
Query: 734 LFGKVPKFDNMKIFGCLCYAHNQRRDGDKFASRSRKCIFVGYPYGKKGWKLYDLESKEYI 793
G+ P + IFG CY R K +S IF+GY + +++++ ++ +
Sbjct: 2725 WKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVM 2904
Query: 794 VSRDV 798
S +V
Sbjct: 2905 ESINV 2919
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 182 bits (461), Expect = 1e-45
Identities = 86/162 (53%), Positives = 116/162 (71%)
Frame = +2
Query: 1340 KLEGWCDSDWASCPLTRRSLTGWVVLLDLSPVSWKTKKQPTVSRSSAEAEYRSMAMTTCE 1399
+L G+CD+DWA CP+ RRS +G+ V + + VSWK+KKQ V+RSSAEAEYRSMAM TCE
Sbjct: 14 QLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCE 193
Query: 1400 LKWLKQLLGDLGVSHSQGMQLYCDSKSALHIAQNPVFHERTKHIEADCHFVRDAVVAGII 1459
L W+KQ L +L M+LYCD+++ALHIA NPVFHERTKHIE DCHF+R+ +++ I
Sbjct: 194 LMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEI 373
Query: 1460 CPLYVPTSVQLADIFTKALGKAQFEFLLRKLGIRDLHAPEGG 1501
++ ++ Q DI TK+L + + + KLG DL+AP G
Sbjct: 374 VTEFIGSNDQPVDILTKSLRGPKIQIVCSKLGAYDLYAPA*G 499
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 161 bits (408), Expect = 2e-39
Identities = 75/126 (59%), Positives = 96/126 (75%), Gaps = 1/126 (0%)
Frame = -2
Query: 1024 LPPGKKALGSKWVYKIKHHSDGSIERLKARLVVFGHHQIEGIDYDETFAPVAKMVTVRTF 1083
LPPGK +G +WVY +K G ++RLKARLV G+ Q+ GIDY +TF+PVAK+ TVR F
Sbjct: 400 LPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVRLF 221
Query: 1084 LAVAAIKKWEVHQMDVHNAFLHGDLEEEVYMKVPPGF-KNTDPNLVCRLKKSLYGLKQAP 1142
LA+AAI W +HQ+D+ NAFLHGDLEE++YM+ PPGF + LVC+L +SLYGLKQ+P
Sbjct: 220 LAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQSP 41
Query: 1143 RCWFAK 1148
R WF K
Sbjct: 40 RAWFGK 23
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 154 bits (389), Expect = 3e-37
Identities = 69/128 (53%), Positives = 96/128 (74%)
Frame = +3
Query: 990 PNNFKEAVKDSGWRDAMRNEIQALEDNETWVMEKLPPGKKALGSKWVYKIKHHSDGSIER 1049
P+ +EA+ GWR AM +E+QALE+N TW + LPPGK +G +WVY +K +G ++R
Sbjct: 21 PSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGKVDR 200
Query: 1050 LKARLVVFGHHQIEGIDYDETFAPVAKMVTVRTFLAVAAIKKWEVHQMDVHNAFLHGDLE 1109
LKARLV G+ Q+ GI+Y +TF+PV + TVR FLA+AAI+ W +HQ+D+ NAFLHGDLE
Sbjct: 201 LKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHGDLE 380
Query: 1110 EEVYMKVP 1117
E++YM+ P
Sbjct: 381 EDIYMEQP 404
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 147 bits (370), Expect = 5e-35
Identities = 77/221 (34%), Positives = 131/221 (58%), Gaps = 3/221 (1%)
Frame = +1
Query: 1273 DPEPYRRLIGRLIYLSVTRPDLAYSVHILSQFMQKPCEEHWEAALRVVRYLKKHPGQGIL 1332
D +RRLIG L YL +RP++ ++V ++S+FM++P H +AA RV+R +K G G+L
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVL 189
Query: 1333 L---RSDSELKLEGWCDSDWASCPLTRRSLTGWVVLLDLSPVSWKTKKQPTVSRSSAEAE 1389
+ L G+ DSDW P +S G++ + + +PV+ +KKQ ++ S+ EAE
Sbjct: 190 FPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAE 369
Query: 1390 YRSMAMTTCELKWLKQLLGDLGVSHSQGMQLYCDSKSALHIAQNPVFHERTKHIEADCHF 1449
Y + ++ C+ W+ LL +L + + + L D+KSA+++A++P H R+KHIE H+
Sbjct: 370 YVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHY 549
Query: 1450 VRDAVVAGIICPLYVPTSVQLADIFTKALGKAQFEFLLRKL 1490
+RD V G + Y QLAD+ TK + ++F+ + +L
Sbjct: 550 IRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 141 bits (356), Expect = 2e-33
Identities = 83/188 (44%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Frame = +3
Query: 1315 AALRVVRYLKKHPGQGILLRSDSELKLEGWCDSDWASCPLTRRSLTGWVVLLDLSPVSWK 1374
AA RV++YLK P +G+ +S +++ G+ D+DWA+C + +S+T + L S +SWK
Sbjct: 18 AATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLISWK 197
Query: 1375 TKKQPTVSR--SSAEAEYRSMAMTTCELKWLKQLLGDLGVSHSQGMQLYCDSKSALH-IA 1431
KKQ TVSR SS+EA+YR++ TTCEL+WL LL DL V+ +YCD++SAL +
Sbjct: 198 AKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLHVT-----LIYCDNQSALQ*LP 362
Query: 1432 QNPVFHERTKHIEADCHFVRDAVVAGII-CPLYVPTSVQLADIFTKALGKAQFEFLLRKL 1490
++H + +E DCH VR+ G++ C L V +S QLADIFTKAL F L KL
Sbjct: 363 IKVIYHGQ---LEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSNLSKL 533
Query: 1491 GIRDLHAP 1498
G+ D+ P
Sbjct: 534 GLSDIFLP 557
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 140 bits (353), Expect = 4e-33
Identities = 68/146 (46%), Positives = 101/146 (68%), Gaps = 1/146 (0%)
Frame = -3
Query: 1005 AMRNEIQALEDNETWVMEKLPPGKKALGSKWVYKIKHHSDGSIERLKARLVVFGHHQIEG 1064
AM+ E+ E N W + + P +G+KWV++ K G I R KARLV G++Q EG
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 1065 IDYDETFAPVAKMVTVRTFLAVAAIKKWEVHQMDVHNAFLHGDLEEEVYMKVPPGFKNTD 1124
IDY+ET+APVA++ +R LA +I ++++QMDV +AFL+G ++EEVY++ PPGF+ D
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 1125 -PNLVCRLKKSLYGLKQAPRCWFAKL 1149
P V +L+K+LYGLKQAPR W+ ++
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERI 21
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 139 bits (349), Expect = 1e-32
Identities = 71/138 (51%), Positives = 93/138 (66%)
Frame = -2
Query: 1133 KSLYGLKQAPRCWFAKLVTALKRYGFVQSYSDYSLFTLHRGEIQINVLVYVDDLIIAGND 1192
KSLYGLKQA R W+ KL L + G++QS SDYSLFTL +G +LVYVDD+I+AG+
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241
Query: 1193 IAALKIFKAYLGVCFHMKDLGVLKYFLGLEVARNHEGIYLCQRKYALEIIDETGLLGAKP 1252
I K L + F +K+LG LKYFLGLEVA + GI + QRKY L+++ ++GLLG KP
Sbjct: 240 IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61
Query: 1253 ADFPMEQHHKLALVSGKP 1270
A P++ KL +G P
Sbjct: 60 ASTPLDTSIKLHSAAGTP 7
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 129 bits (323), Expect(2) = 7e-32
Identities = 61/109 (55%), Positives = 80/109 (72%)
Frame = +2
Query: 1346 DSDWASCPLTRRSLTGWVVLLDLSPVSWKTKKQPTVSRSSAEAEYRSMAMTTCELKWLKQ 1405
D++WA P+ R S G+ V + + V WK+ K V+RSSAEAEY++M + TCEL W+KQ
Sbjct: 8 DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187
Query: 1406 LLGDLGVSHSQGMQLYCDSKSALHIAQNPVFHERTKHIEADCHFVRDAV 1454
LL +L +Q M+L CD+++ALHIA NPVFHERTKHIE DCHFVR+ V
Sbjct: 188 LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
Score = 28.5 bits (62), Expect(2) = 7e-32
Identities = 14/35 (40%), Positives = 24/35 (68%)
Frame = +3
Query: 1463 YVPTSVQLADIFTKALGKAQFEFLLRKLGIRDLHA 1497
+V ++ QLA+IFTK+L + + + KLG +L+A
Sbjct: 360 FVSSNDQLANIFTKSLRGPRIQNICSKLGAFELYA 464
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 134 bits (336), Expect = 4e-31
Identities = 78/179 (43%), Positives = 106/179 (58%), Gaps = 2/179 (1%)
Frame = +1
Query: 1045 GSIERLKARLVVFGHHQIEGIDYDETFAPVAKMVTVRTFLAVAAIKKWEVHQMDVHNAFL 1104
G+I++ KARLV + Q+ G DY TF+PVAKM V ++A + W + +D NAFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 1105 HGDLEEEVYMKVPPGF--KNTDPNLVCRLKKSLYGLKQAPRCWFAKLVTALKRYGFVQSY 1162
HG LEEEVYM+ P GF + N+VC+L +S YGLKQ+PR W A Y
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAWPFLYCGAAIWYD--SHE 381
Query: 1163 SDYSLFTLHRGEIQINVLVYVDDLIIAGNDIAALKIFKAYLGVCFHMKDLGVLKYFLGL 1221
+D+S+F H + I ++VYVDD+ I G+D + K L F KDLG L+YFLG+
Sbjct: 382 ADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>BM307983
Length = 406
Score = 130 bits (328), Expect = 3e-30
Identities = 66/133 (49%), Positives = 89/133 (66%), Gaps = 2/133 (1%)
Frame = +2
Query: 1031 LGSKWVYKIKHHSDGSIERLKARLVVFGHHQIEGIDYDETFAPVAKMVTVRTFLA-VAAI 1089
+G +W+Y +K+ +D +++R KARLV G+ Q GIDY+ETFA K + + A
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 1090 KKWEVHQMDVHNAFLHGDLEEEVYMKVPPGF-KNTDPNLVCRLKKSLYGLKQAPRCWFAK 1148
WE+HQ DV NAFLHG LEEEVYM++PPG+ + N VCRLKK+LYGLKQ+PR WF +
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 1149 LVTALKRYGFVQS 1161
A+ G+ QS
Sbjct: 362 FTQAMLSLGYKQS 400
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 127 bits (319), Expect = 4e-29
Identities = 91/283 (32%), Positives = 143/283 (50%), Gaps = 3/283 (1%)
Frame = +2
Query: 424 VGLPNGKDATAIQEGSVILDGGLRLNNVLFVPQLTCNLISVTQLIDDSNCIVQFTNALCV 483
+ L +G A G V L LN+V+F+ N+ S++QL NC V F V
Sbjct: 20 ITLADGSRVVATGIGHVSPTSSLSLNSVVFILGCPFNITSLSQLTRFRNCSVTFDANSFV 199
Query: 484 IQDRTTRTLIGAGERIDGLYFFRGVPKVHALMVEGDSAMDLWHKRLGHPSEKVLK-FIPH 542
IQ+ T IG G GLY+ + P + + + ++ L H+RLGHP LK +P
Sbjct: 200 IQECGTGWTIGVGIESHGLYYLK--PNL-SWVCSAVTSPKLLHERLGHPHLSKLKIMVPS 370
Query: 543 VSQHSRSKNNRPCDVCPRAKQHRDSFPLSENNAASLFELVHCDLWGSYRTRSSCGAQYYL 602
+ + + C+ C K R S E+ S F ++H D+WG R SS +Y++
Sbjct: 371 LEK----IKDLFCESCQLGKHVRSSXRHVESRVDSPFLVIHXDIWGPNRV-SSMSYRYFV 535
Query: 603 TIVNDYSRAVWVYLLCNKTEIETMFLNFVAFVDRQFDKKIKKVRSDNGTEF--NCLRDYF 660
T ++++S+ V+L+ ++EI FL V + QF K IK +RSDN E+ + + +
Sbjct: 536 TFIDEFSQCTRVFLMKERSEI-LSFLTSVNKIKTQFGKTIKILRSDNAKEYFSSVISPFX 712
Query: 661 FNNGIVFETSCVGTPQQNGRVERKHQHIMNVARALRFQGHLPM 703
GI+ + SC TPQQN ERK++H++ AR L + P+
Sbjct: 713 SAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANEPI 841
>BU764568
Length = 420
Score = 103 bits (257), Expect(2) = 7e-29
Identities = 49/84 (58%), Positives = 64/84 (75%)
Frame = +3
Query: 1360 TGWVVLLDLSPVSWKTKKQPTVSRSSAEAEYRSMAMTTCELKWLKQLLGDLGVSHSQGMQ 1419
+G+ VL+ + +SWK+KKQ V++SSAEAEYR+MA+ TCEL WLKQLL +L M
Sbjct: 168 SGYCVLIGGNLISWKSKKQSVVAKSSAEAEYRAMALVTCELIWLKQLL*ELKFEEDTQMT 347
Query: 1420 LYCDSKSALHIAQNPVFHERTKHI 1443
L CD+++ALHIA NP+FH RTKHI
Sbjct: 348 LICDNQAALHIASNPIFH*RTKHI 419
Score = 43.9 bits (102), Expect(2) = 7e-29
Identities = 17/55 (30%), Positives = 30/55 (53%)
Frame = +1
Query: 1302 SQFMQKPCEEHWEAALRVVRYLKKHPGQGILLRSDSELKLEGWCDSDWASCPLTR 1356
SQF+ PC++HW A +++ K PG+G++ ++ G+ D+D P R
Sbjct: 1 SQFLNSPCQDHWNAVS*ILK*TKSAPGKGLIYEDKGHSQIIGYSDAD*VGSPSDR 165
>BI969608 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial (19%)
Length = 454
Score = 124 bits (311), Expect = 3e-28
Identities = 62/127 (48%), Positives = 90/127 (70%)
Frame = -2
Query: 1375 TKKQPTVSRSSAEAEYRSMAMTTCELKWLKQLLGDLGVSHSQGMQLYCDSKSALHIAQNP 1434
TK + +RSSA+AEYR+MA CE+ LK++L +L + + M+LYCD+K+A++I+QNP
Sbjct: 426 TKNKMXXARSSAKAEYRAMAQGVCEIL*LKRILEELQLPMTLPMKLYCDNKAAINISQNP 247
Query: 1435 VFHERTKHIEADCHFVRDAVVAGIICPLYVPTSVQLADIFTKALGKAQFEFLLRKLGIRD 1494
V H RTKH+E D F+++ V AG IC ++P S Q+ADIFTK L + FEF + KL + D
Sbjct: 246 VQHGRTKHVEIDRPFIKEKVDAGQICMPFIPFSQQVADIFTKGLFRPNFEFFVSKLDMLD 67
Query: 1495 LHAPEGG 1501
++AP G
Sbjct: 66 IYAPT*G 46
>CO983154
Length = 568
Score = 122 bits (307), Expect = 9e-28
Identities = 60/144 (41%), Positives = 89/144 (61%), Gaps = 1/144 (0%)
Frame = +3
Query: 674 TPQQNGRVERKHQHIMNVARALRFQGHLPMQFWGECVLTACYLINRTPSSVLNYKTPYEK 733
TPQQNG ERK++H++ AR+L ++P+ WG+ VLT+C+LINR PSS L + P+
Sbjct: 6 TPQQNGIAERKNRHLLETARSLMLNLNVPIHHWGDAVLTSCFLINRMPSSSLENQIPHSL 185
Query: 734 LFGKVPKFD-NMKIFGCLCYAHNQRRDGDKFASRSRKCIFVGYPYGKKGWKLYDLESKEY 792
+F P F + K+FGC C+ H+ DK ++RS KC+F+GY +KG+K Y + Y
Sbjct: 186 VFPHDPLFHVSPKVFGCTCFVHDLSPGLDKLSARSVKCVFLGYSRLQKGYKCYSPTMRRY 365
Query: 793 IVSRDVKFYEHEFPFDVQLDTTHS 816
+S DV F+E F +D + S
Sbjct: 366 YMSADVTFFEDTPFFSPSVDHSSS 437
>CO981347
Length = 624
Score = 72.0 bits (175), Expect(3) = 4e-27
Identities = 41/106 (38%), Positives = 52/106 (48%), Gaps = 2/106 (1%)
Frame = +2
Query: 634 VDRQFDKKIKKVRSDNGTEF--NCLRDYFFNNGIVFETSCVGTPQQNGRVERKHQHIMNV 691
+ Q K+K +R+DNG EF ++ GI TP QNG ER + I+
Sbjct: 137 IGNQLGTKLKVLRTDNGLEFVLEQFNEFCRKIGIKRHKIVPHTP*QNGLAERMNMTILER 316
Query: 692 ARALRFQGHLPMQFWGECVLTACYLINRTPSSVLNYKTPYEKLFGK 737
R + LP FWGE T YLINR PSS L +KTP E G+
Sbjct: 317 VRCMLLSARLPKTFWGEAANTTSYLINRCPSSTLGFKTPMEAWSGE 454
Score = 47.8 bits (112), Expect(3) = 4e-27
Identities = 22/55 (40%), Positives = 34/55 (61%)
Frame = +3
Query: 737 KVPKFDNMKIFGCLCYAHNQRRDGDKFASRSRKCIFVGYPYGKKGWKLYDLESKE 791
K P + +K+FG L + H ++ K +R+ KC+F+GYP G K +KL+ LE E
Sbjct: 453 KPPNYSGLKVFGSLAFDHVKQ---GKLDARAVKCVFIGYPKGVKRYKLWKLEPGE 608
Score = 42.0 bits (97), Expect(3) = 4e-27
Identities = 16/32 (50%), Positives = 26/32 (81%)
Frame = +3
Query: 591 RTRSSCGAQYYLTIVNDYSRAVWVYLLCNKTE 622
R ++ G+ Y+LTI++D+SR VW+Y+L NK+E
Sbjct: 9 RVKTHGGSSYFLTIIDDFSRRVWLYVLKNKSE 104
>BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial
(21%)
Length = 421
Score = 119 bits (297), Expect = 1e-26
Identities = 59/138 (42%), Positives = 92/138 (65%), Gaps = 1/138 (0%)
Frame = +2
Query: 1161 SYSDYSLFTLHRGEIQ-INVLVYVDDLIIAGNDIAALKIFKAYLGVCFHMKDLGVLKYFL 1219
S +D+S+F H + + ++VYVDD++I D + K +L F KDL LKYFL
Sbjct: 8 SEADHSVFYCHTSPGKCVYLMVYVDDIMITKKDATKIVQLKEHLFNHFQTKDLRYLKYFL 187
Query: 1220 GLEVARNHEGIYLCQRKYALEIIDETGLLGAKPADFPMEQHHKLALVSGKPLEDPEPYRR 1279
G+EVA++ +G+ + QRKYAL+I++ETG+ + D PM+ + KL + DPE YRR
Sbjct: 188 GIEVAQSGDGVVISQRKYALDILEETGMQNCRLVDSPMDPNLKLMAYQSEVYPDPERYRR 367
Query: 1280 LIGRLIYLSVTRPDLAYS 1297
L+G+LIYL++TRPD++++
Sbjct: 368 LVGKLIYLTITRPDISFA 421
>BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {Vitis
vinifera}, partial (19%)
Length = 437
Score = 70.5 bits (171), Expect(2) = 1e-26
Identities = 37/86 (43%), Positives = 53/86 (61%)
Frame = +2
Query: 1157 GFVQSYSDYSLFTLHRGEIQINVLVYVDDLIIAGNDIAALKIFKAYLGVCFHMKDLGVLK 1216
GF+ SYS + ++VYVDD++I GND + K +L F KDLG +
Sbjct: 2 GFLLSYSSSRC---------VYLMVYVDDIVITGNDQGKIAQLKGHLFSHFQTKDLGKFE 154
Query: 1217 YFLGLEVARNHEGIYLCQRKYALEII 1242
YFLG+EVA++ +GI + QRKYAL+I+
Sbjct: 155 YFLGIEVAQSKDGIIISQRKYALDIL 232
Score = 69.3 bits (168), Expect(2) = 1e-26
Identities = 32/71 (45%), Positives = 47/71 (66%)
Frame = +1
Query: 1242 IDETGLLGAKPADFPMEQHHKLALVSGKPLEDPEPYRRLIGRLIYLSVTRPDLAYSVHIL 1301
I TG+ +P D M+ + KL GKP D E YR L+G+LIYL++TRP++++ V ++
Sbjct: 220 IRHTGMSDCRPIDSLMDPNKKLLPNQGKPYSDSERYRILVGKLIYLTITRPNISFVVGVV 399
Query: 1302 SQFMQKPCEEH 1312
SQFMQ P +H
Sbjct: 400 SQFMQSPHNDH 432
>BM086359
Length = 427
Score = 118 bits (295), Expect = 2e-26
Identities = 58/140 (41%), Positives = 89/140 (63%)
Frame = +1
Query: 1219 LGLEVARNHEGIYLCQRKYALEIIDETGLLGAKPADFPMEQHHKLALVSGKPLEDPEPYR 1278
LG++VA++ GI + Q KYAL+I+ ETG+L P++ PM+ + KL G+ LEDP
Sbjct: 1 LGIDVAQSSYGIVISQWKYALDILTETGMLDCLPSNTPMDPNVKLLSGQGEALEDPGR*C 180
Query: 1279 RLIGRLIYLSVTRPDLAYSVHILSQFMQKPCEEHWEAALRVVRYLKKHPGQGILLRSDSE 1338
L+GRL YL+VTR D+ ++V +LSQF++ P + W A +R++RY+K PG G+L
Sbjct: 181 CLVGRLNYLTVTRLDITFAVGVLSQFLKDPTDSQWNATIRILRYIKNAPGPGLLYEDKGN 360
Query: 1339 LKLEGWCDSDWASCPLTRRS 1358
K+ + D+DW P + S
Sbjct: 361 GKVVCYFDADWPGSPSDKSS 420
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.320 0.137 0.423
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 73,255,110
Number of Sequences: 63676
Number of extensions: 1097175
Number of successful extensions: 6145
Number of sequences better than 10.0: 191
Number of HSP's better than 10.0 without gapping: 5827
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6012
length of query: 1519
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1409
effective length of database: 5,635,272
effective search space: 7940098248
effective search space used: 7940098248
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)
Medicago: description of AC137510.5