
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC135396.1 - phase: 0 /pseudo
(1436 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E value
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 415 e-116
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 411 e-114
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 195 1e-49
BE474381 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, pa... 135 1e-48
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 187 2e-47
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 167 3e-41
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 165 2e-40
BM307983 159 6e-39
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 157 2e-38
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 153 5e-37
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 152 1e-36
CO983154 148 1e-35
BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberos... 148 2e-35
BU764568 110 4e-34
BI427153 142 1e-33
BM086359 136 6e-32
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 136 8e-32
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 136 8e-32
BU548243 134 2e-31
BI969608 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, pa... 133 6e-31
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 415 bits (1067), Expect = e-116
Identities = 217/595 (36%), Positives = 349/595 (58%), Gaps = 2/595 (0%)
Frame = +1
Query: 844 EVQSDSPSEGPT--DNPSSSSSGNSSHSSNDLPDLSFPDINLPIAVRKNIPDLNIPIAER 901
+V+ D + G D S+ + +S S+ D P+++ PD I ++K P + I +
Sbjct: 2956 DVEEDVRTSGDNVADTAKSAENAENSDSATDEPNINQPDKRPSIRIQKMHPK-ELIIGDP 3132
Query: 902 KRTRTCTKHPMSNYLSYDKLSHSHKAYVSRISNLFVPRTIQEALGDPNWKLAVKEEMNAL 961
R T + S+ +VS+I P+ ++EAL D W A++EE+
Sbjct: 3133 NRGVTTRSREIEIV--------SNSCFVSKIE----PKNVKEALTDEFWINAMQEELEQF 3276
Query: 962 NKNNTWCITDLPYDKKAVGCKWVFTVKCKADGSVERYKARLVAKGFTQTHGIDYQETFAP 1021
+N W + P +G KW+F K +G + R KARLVA+G+TQ G+D+ ETFAP
Sbjct: 3277 KRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAP 3456
Query: 1022 VAKINSIRILLSLAVNFNWTLHQYDVKNAFLNGELHEEVYMRLPPGFEDKLGRGKVCRLK 1081
VA++ SIR+LL +A + L+Q DVK+AFLNG L+EE Y+ P GF D V RLK
Sbjct: 3457 VARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVYRLK 3636
Query: 1082 KSLYGLKQSPRAWFERFGSVVKGHGFTQSQADHTMFFKHSREGKIAILIVYVDDIIMTGD 1141
K+LYGLKQ+PRAW+ER + G+ + D T+F K E + I +YVDDI+ G
Sbjct: 3637 KALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAEN-LMIAQIYVDDIVFGGM 3813
Query: 1142 DIGEISDLKRRLEAEFDIKDLGKLKYFLGMEFARSKEGIFLNQRKYILDLLTETGMTGCK 1201
+ +++++EF++ +G+L YFLG++ + ++ IFL+Q KY +++ + GM
Sbjct: 3814 SNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGMENAS 3993
Query: 1202 AAETPMDPNVKLKSVAEDEIIDRERYQRLAGRLIYLSHTRPDIAFAVSVISQFMHAPGPA 1261
TP ++KL +D+ Y+ + G L+YL+ +RPDI +AV V +++ P +
Sbjct: 3994 HKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKIS 4173
Query: 1262 HFEAVFRILRYLKGTPGKGLMFRNRGHIQVEAYTDADWAGNINDRRSTSGYCTFVGGNLV 1321
H V RIL+Y+ GT G+M+ + + Y DADWAG+ +DR+STSG C ++G NL+
Sbjct: 4174 HLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADWAGSADDRKSTSGGCFYLGTNLI 4353
Query: 1322 TWRSKKQNVVARSSAEAEFRSVAHGFCEVLWIKKFMQELKIAGPTPMKVYCDNKAAISIA 1381
+W SKKQN V+ S+AEAE+ + +++W+K+ ++E + M +YCDN +AI+I+
Sbjct: 4354 SWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDV-MTLYCDNMSAINIS 4530
Query: 1382 HNPVLHDRTKHVEVDKHFIKEKIDSGEICMSYIPTKSQVADVLTKSLPKRQFDDM 1436
NPV H RTKH+++ H+I++ +D I + ++ T+ Q+AD+ TK+L QF+ +
Sbjct: 4531 KNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQIADIFTKALDANQFEKL 4695
Score = 214 bits (546), Expect = 2e-55
Identities = 146/538 (27%), Positives = 251/538 (46%), Gaps = 8/538 (1%)
Frame = +1
Query: 238 CDYCGRNRHTQETCFKLHGRPNNSKAGKFGDRPMPTTSNAASSPFTKEQMDHLLKLLKFN 297
C YCG+ H + C+ LHG P++ R M H + L +
Sbjct: 1513 CHYCGKYGHIKPFCYHLHGHPHHGTQSSSSGRKMMWVPK------------HKIVSLVVH 1656
Query: 298 SSPNTPIGTVAQTGKDSWALSVQNHSNPWIIDSGASEHMTNCSHLFSSYFPSSGSEKVKI 357
+S + + K+ W L DSG S HMT + P S S V
Sbjct: 1657 TS-------LRASAKEDWYL-----------DSGCSRHMTGVKEFLVNIEPCSTS-YVTF 1779
Query: 358 ADGSYSSIAGKGNIKISEQITLQSVLHVPKFACNLLSVHKLSEDSNCSVLFCRSTCVFQD 417
DGS I G G + +L VL V NL+S+ +L D +V F +S C+ +
Sbjct: 1780 GDGSKGKITGMGKLVHDGLPSLNKVLLVKGLTANLISISQLC-DEGFNVNFTKSECLVTN 1956
Query: 418 QNSGKTIGTAREINGLYYF--DETPLGNTAMFGSSRTSPPLVSNKIMLWHKRLGHPSFSY 475
+ S + +R + Y + ET +T +F +++ +WH+R GH
Sbjct: 1957 EKSEVLMKGSRSKDNCYLWTPQETSYSSTCLFSK--------EDEVKIWHQRFGHLHLRG 2112
Query: 476 LKHLFPEFS-KEISSSQFH----CDACHLAKDHRVSFNS-RGYSASKPFYLIHSDVWGPS 529
+K + + + + I + + C C + K ++S + + S+ L+H D+ GP
Sbjct: 2113 MKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPM 2292
Query: 530 KIKTLSRKKWFVTFIDDHTRVCWVYLMEKKSEVEQRFQDFFNMIKNQFHTTIGILRSDNG 589
++++L K++ +DD +R WV + +KS+ + F++ ++ + I +RSD+G
Sbjct: 2293 QVESLGGKRYAYVVVDDFSRFTWVNFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHG 2472
Query: 590 TEYFNKYLSTFLVTNGIIHQSTCRDTPQQNGIAERKNRHLLEVTRAIMFSMNVPKYLWGN 649
E+ N + F + GI H+ + TPQQNGI ERKNR L E R ++ + +P LW
Sbjct: 2473 REFENSKFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAE 2652
Query: 650 ALLTACHLINRMPSRVLQYETPVQVLQNNFPTSRIITNIPLKVFGCLCYVYIPNIFRSKL 709
A+ TAC++ NR+ R T ++ + PT + +FG CY+ R K+
Sbjct: 2653 AMNTACYIHNRVTLRRGTPTTLYEIWKGRKPTVK-----HFHIFGSPCYILADREQRRKM 2817
Query: 710 DPKAEKCVFLGYASNKKGYKCFNPVTKKFFESMDVHFVEDQPFFRENSLQGESQSSNE 767
DPK++ +FLGY++N + Y+ FN T+ ES++V V+D R+ ++ + ++S +
Sbjct: 2818 DPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINV-VVDDLTPARKKDVEEDVRTSGD 2988
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 411 bits (1056), Expect = e-114
Identities = 215/595 (36%), Positives = 348/595 (58%), Gaps = 2/595 (0%)
Frame = +1
Query: 844 EVQSDSPSEGPT--DNPSSSSSGNSSHSSNDLPDLSFPDINLPIAVRKNIPDLNIPIAER 901
+V+ D + G D S + +S S+ D +++ PD ++K P + I +
Sbjct: 2953 DVEEDVRTSGDNVADAAKSGENAENSDSATDESNINQPDKRSSTRIQKMHPK-ELIIGDP 3129
Query: 902 KRTRTCTKHPMSNYLSYDKLSHSHKAYVSRISNLFVPRTIQEALGDPNWKLAVKEEMNAL 961
R T + S+ +VS+I P+ ++EAL D W A++EE+
Sbjct: 3130 NRGVTTRSREVEIV--------SNSCFVSKIE----PKNVKEALTDEFWINAMQEELEQF 3273
Query: 962 NKNNTWCITDLPYDKKAVGCKWVFTVKCKADGSVERYKARLVAKGFTQTHGIDYQETFAP 1021
+N W + P +G KW+F K +G + R KARLVA+G+TQ G+D+ ETFAP
Sbjct: 3274 KRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAP 3453
Query: 1022 VAKINSIRILLSLAVNFNWTLHQYDVKNAFLNGELHEEVYMRLPPGFEDKLGRGKVCRLK 1081
VA++ SIR+LL +A + L+Q DVK+AFLNG L+EEVY+ P GF D V RLK
Sbjct: 3454 VARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVYRLK 3633
Query: 1082 KSLYGLKQSPRAWFERFGSVVKGHGFTQSQADHTMFFKHSREGKIAILIVYVDDIIMTGD 1141
K+LYGLKQ+PRAW+ER + G+ + D T+F K E + I +YVDDI+ G
Sbjct: 3634 KALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAE-NLMIAQIYVDDIVFGGM 3810
Query: 1142 DIGEISDLKRRLEAEFDIKDLGKLKYFLGMEFARSKEGIFLNQRKYILDLLTETGMTGCK 1201
+ +++++EF++ +G+L YFLG++ + ++ IFL+Q +Y +++ + GM
Sbjct: 3811 SNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKFGMENAS 3990
Query: 1202 AAETPMDPNVKLKSVAEDEIIDRERYQRLAGRLIYLSHTRPDIAFAVSVISQFMHAPGPA 1261
TP ++KL +D+ Y+ + G L+YL+ +RPDI +AV V +++ P +
Sbjct: 3991 HKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKIS 4170
Query: 1262 HFEAVFRILRYLKGTPGKGLMFRNRGHIQVEAYTDADWAGNINDRRSTSGYCTFVGGNLV 1321
H V RIL+Y+ GT G+M+ + + + Y DADWAG+ +DR+STSG C ++G NL+
Sbjct: 4171 HLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGGCFYLGNNLI 4350
Query: 1322 TWRSKKQNVVARSSAEAEFRSVAHGFCEVLWIKKFMQELKIAGPTPMKVYCDNKAAISIA 1381
+W SKKQN V+ S+AEAE+ + +++W+K+ ++E + M +YCDN +AI+I+
Sbjct: 4351 SWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDV-MTLYCDNMSAINIS 4527
Query: 1382 HNPVLHDRTKHVEVDKHFIKEKIDSGEICMSYIPTKSQVADVLTKSLPKRQFDDM 1436
NPV H RTKH+++ H+I++ +D I + ++ T+ Q+AD+ TK+L QF+ +
Sbjct: 4528 KNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDANQFEKL 4692
Score = 210 bits (534), Expect = 4e-54
Identities = 146/538 (27%), Positives = 255/538 (47%), Gaps = 8/538 (1%)
Frame = +1
Query: 238 CDYCGRNRHTQETCFKLHGRPNNSKAGKFGDRPMPTTSNAASSPFTKEQMDHLLKLLKFN 297
C YCG+ H + C+ LHG P++ S ++++M + K
Sbjct: 1510 CHYCGKYGHIKPFCYHLHGHPHHG----------------TQSSNSRKKMMWVPK----- 1626
Query: 298 SSPNTPIGTVAQTGKDSWALSVQNHSNPWIIDSGASEHMTNCSHLFSSYFPSSGSEKVKI 357
+ + V T + A W +DSG S HMT + P S S V
Sbjct: 1627 ---HKAVSLVVHTSLRASA------KEDWYLDSGCSRHMTGVKEFLLNIEPCSTSY-VTF 1776
Query: 358 ADGSYSSIAGKGNIKISEQITLQSVLHVPKFACNLLSVHKLSEDSNCSVLFCRSTCVFQD 417
DGS I G G + +L VL V NL+S+ +L D +V F +S C+ +
Sbjct: 1777 GDGSKGKIIGMGKLVHDGLPSLNKVLLVKGLTANLISISQLC-DEGFNVNFTKSECLVTN 1953
Query: 418 QNSGKTIGTAREINGLYYF--DETPLGNTAMFGSSRTSPPLVSNKIMLWHKRLGHPSFSY 475
+ S + +R + Y + ET +T + SS+ +++ +WH+R GH
Sbjct: 1954 EKSEVLMKGSRSKDNCYLWTPQETSYSSTCL--SSK------EDEVRIWHQRFGHLHLRG 2109
Query: 476 LKHLFPEFS-KEISSSQFH----CDACHLAKDHRVSFNSRGY-SASKPFYLIHSDVWGPS 529
+K + + + + I + + C C + K ++S + + S+ L+H D+ GP
Sbjct: 2110 MKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPM 2289
Query: 530 KIKTLSRKKWFVTFIDDHTRVCWVYLMEKKSEVEQRFQDFFNMIKNQFHTTIGILRSDNG 589
++++L K++ +DD +R WV + +KSE + F++ ++ + I +RSD+G
Sbjct: 2290 QVESLGGKRYAYVVVDDFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHG 2469
Query: 590 TEYFNKYLSTFLVTNGIIHQSTCRDTPQQNGIAERKNRHLLEVTRAIMFSMNVPKYLWGN 649
E+ N + F + GI H+ + TPQQNGI ERKNR L E R ++ + +P LW
Sbjct: 2470 REFENSRFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAE 2649
Query: 650 ALLTACHLINRMPSRVLQYETPVQVLQNNFPTSRIITNIPLKVFGCLCYVYIPNIFRSKL 709
A+ TAC++ NR+ R T ++ + P+ + +FG CY+ R K+
Sbjct: 2650 AMNTACYIHNRVTLRRGTPTTLYEIWKGRKPSVK-----HFHIFGSPCYILADREQRRKM 2814
Query: 710 DPKAEKCVFLGYASNKKGYKCFNPVTKKFFESMDVHFVEDQPFFRENSLQGESQSSNE 767
DPK++ +FLGY++N + Y+ FN T+ ES++V V+D R+ ++ + ++S +
Sbjct: 2815 DPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINV-VVDDLSPARKKDVEEDVRTSGD 2985
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 195 bits (496), Expect = 1e-49
Identities = 113/292 (38%), Positives = 164/292 (55%)
Frame = +2
Query: 355 VKIADGSYSSIAGKGNIKISEQITLQSVLHVPKFACNLLSVHKLSEDSNCSVLFCRSTCV 414
+ +ADGS G G++ + ++L SV+ + N+ S+ +L+ NCSV F ++ V
Sbjct: 20 ITLADGSRVVATGIGHVSPTSSLSLNSVVFILGCPFNITSLSQLTRFRNCSVTFDANSFV 199
Query: 415 FQDQNSGKTIGTAREINGLYYFDETPLGNTAMFGSSRTSPPLVSNKIMLWHKRLGHPSFS 474
Q+ +G TIG E +GLYY N + S+ TSP L+ H+RLGHP S
Sbjct: 200 IQECGTGWTIGVGIESHGLYYLKP----NLSWVCSAVTSPKLL-------HERLGHPHLS 346
Query: 475 YLKHLFPEFSKEISSSQFHCDACHLAKDHRVSFNSRGYSASKPFYLIHSDVWGPSKIKTL 534
LK + P K C++C L K R S PF +IH D+WGP+++ ++
Sbjct: 347 KLKIMVPSLEK---IKDLFCESCQLGKHVRSSXRHVESRVDSPFLVIHXDIWGPNRVSSM 517
Query: 535 SRKKWFVTFIDDHTRVCWVYLMEKKSEVEQRFQDFFNMIKNQFHTTIGILRSDNGTEYFN 594
S + +FVTFID+ ++ V+LM+++SE+ F N IK QF TI ILRSDN EYF+
Sbjct: 518 SYR-YFVTFIDEFSQCTRVFLMKERSEILS-FLTSVNKIKTQFGKTIKILRSDNAKEYFS 691
Query: 595 KYLSTFLVTNGIIHQSTCRDTPQQNGIAERKNRHLLEVTRAIMFSMNVPKYL 646
+S F GI+HQ +C TPQQN IAERKNRHL+E R ++ N P ++
Sbjct: 692 SVISPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANEPIHI 847
>BE474381 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial (20%)
Length = 406
Score = 135 bits (339), Expect(2) = 1e-48
Identities = 62/86 (72%), Positives = 75/86 (87%)
Frame = +3
Query: 1292 EAYTDADWAGNINDRRSTSGYCTFVGGNLVTWRSKKQNVVARSSAEAEFRSVAHGFCEVL 1351
+ YTDADWAG++ DRRSTSGYCTFVGGNLV+ SKKQ+VVARSSAEAEFR++AHG CE L
Sbjct: 150 KGYTDADWAGSVTDRRSTSGYCTFVGGNLVS*-SKKQSVVARSSAEAEFRALAHGICETL 326
Query: 1352 WIKKFMQELKIAGPTPMKVYCDNKAA 1377
W+KK +QELK+ P+K+YCDNK+A
Sbjct: 327 WVKKLLQELKVHSSPPIKLYCDNKSA 404
Score = 78.6 bits (192), Expect(2) = 1e-48
Identities = 37/52 (71%), Positives = 44/52 (84%)
Frame = +1
Query: 1244 IAFAVSVISQFMHAPGPAHFEAVFRILRYLKGTPGKGLMFRNRGHIQVEAYT 1295
IAFAVS++SQFMHAPG H EA FRILRYLKG+PG+GL ++N GH+QVE T
Sbjct: 7 IAFAVSMVSQFMHAPGHEHLEAAFRILRYLKGSPGRGL-YKNHGHLQVEKAT 159
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 187 bits (476), Expect = 2e-47
Identities = 90/149 (60%), Positives = 114/149 (76%), Gaps = 2/149 (1%)
Frame = +2
Query: 1286 RGHIQVEAYTDADWAGNINDRRSTSGYCTFVGGNLVTWRSKKQNVVARSSAEAEFRSVAH 1345
+G+ Q+ Y DADWAG DRRSTSGYC F+GGNLV+W+SKKQ VVARSSAEAE+RS+A
Sbjct: 2 KGNTQLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAM 181
Query: 1346 GFCEVLWIKKFMQELKIAGPTPMKVYCDNKAAISIAHNPVLHDRTKHVEVDKHFIKEKID 1405
CE++WIK+F+QEL+ MK+YCDN+AA+ IA NPV H+RTKH+E+D HFI+EK+
Sbjct: 182 VTCELMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLL 361
Query: 1406 SGEICMSYIPTKSQVADVLTKSL--PKRQ 1432
S EI +I + Q D+LTKSL PK Q
Sbjct: 362 SKEIVTEFIGSNDQPVDILTKSLRGPKIQ 448
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 167 bits (423), Expect = 3e-41
Identities = 81/215 (37%), Positives = 133/215 (61%), Gaps = 3/215 (1%)
Frame = +1
Query: 1222 IDRERYQRLAGRLIYLSHTRPDIAFAVSVISQFMHAPGPAHFEAVFRILRYLKGTPGKGL 1281
+D ++RL G L YL ++RP+I FAVS+IS+FM P +H +A R+LR +KGT G G+
Sbjct: 7 VDVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGV 186
Query: 1282 MF---RNRGHIQVEAYTDADWAGNINDRRSTSGYCTFVGGNLVTWRSKKQNVVARSSAEA 1338
+F G + YTD+DW + +ST GY V SKKQ+V+A S+ EA
Sbjct: 187 LFPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEA 366
Query: 1339 EFRSVAHGFCEVLWIKKFMQELKIAGPTPMKVYCDNKAAISIAHNPVLHDRTKHVEVDKH 1398
E+ + + G C+ +W+ ++ELK+ P+ + DNK+AI++A +P LH R+KH+E+ H
Sbjct: 367 EYVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFH 546
Query: 1399 FIKEKIDSGEICMSYIPTKSQVADVLTKSLPKRQF 1433
+I++++ G + + Y + Q+AD++TK + +F
Sbjct: 547 YIRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRF 651
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 165 bits (417), Expect = 2e-40
Identities = 76/131 (58%), Positives = 99/131 (75%)
Frame = -2
Query: 972 LPYDKKAVGCKWVFTVKCKADGSVERYKARLVAKGFTQTHGIDYQETFAPVAKINSIRIL 1031
LP K VGC+WV+TVK G V+R KARLVAKG+TQ +GIDY +TF+PVAK+ ++R+
Sbjct: 400 LPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVRLF 221
Query: 1032 LSLAVNFNWTLHQYDVKNAFLNGELHEEVYMRLPPGFEDKLGRGKVCRLKKSLYGLKQSP 1091
L++A +W LHQ D+KNAFL+G+L E++YM PPGF + G VC+L +SLYGLKQSP
Sbjct: 220 LAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQSP 41
Query: 1092 RAWFERFGSVV 1102
RAWF +F VV
Sbjct: 40 RAWFGKFSHVV 8
>BM307983
Length = 406
Score = 159 bits (403), Expect = 6e-39
Identities = 79/134 (58%), Positives = 96/134 (70%), Gaps = 1/134 (0%)
Frame = +2
Query: 979 VGCKWVFTVKCKADGSVERYKARLVAKGFTQTHGIDYQETFAPVAK-INSIRILLSLAVN 1037
VGC+W++TVK AD +++RYKARLVAKG+ QT+GIDY+ETFA K I S
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 1038 FNWTLHQYDVKNAFLNGELHEEVYMRLPPGFEDKLGRGKVCRLKKSLYGLKQSPRAWFER 1097
F W +HQ+DVKNAFL+G L EEVYM +PPG+ G KVCRLKK+LYGLKQSPRAWF R
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 1098 FGSVVKGHGFTQSQ 1111
F + G+ QSQ
Sbjct: 362 FTQAMLSLGYKQSQ 403
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 157 bits (398), Expect = 2e-38
Identities = 91/181 (50%), Positives = 120/181 (66%), Gaps = 2/181 (1%)
Frame = +1
Query: 993 GSVERYKARLVAKGFTQTHGIDYQETFAPVAKINSIRILLSLAVNFNWTLHQYDVKNAFL 1052
G+++++KARLVAK +TQ +G DY TF+PVAK+ + +L S+AV +W L D KNAFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 1053 NGELHEEVYMRLPPGF-EDKLGRGKVCRLKKSLYGLKQSPRAW-FERFGSVVKGHGFTQS 1110
+G L EEVYM P GF VC+L +S YGLKQSPRAW F G+ + +
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAWPFLYCGAAI---WYDSH 378
Query: 1111 QADHTMFFKHSREGKIAILIVYVDDIIMTGDDIGEISDLKRRLEAEFDIKDLGKLKYFLG 1170
+ADH++F+ HS +G I LIVYVDDI +TG D I+ LK L +F KDLGKL+YFLG
Sbjct: 379 EADHSVFYCHSPQGCI-YLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLG 555
Query: 1171 M 1171
+
Sbjct: 556 I 558
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 153 bits (387), Expect = 5e-37
Identities = 74/149 (49%), Positives = 102/149 (67%)
Frame = -3
Query: 952 LAVKEEMNALNKNNTWCITDLPYDKKAVGCKWVFTVKCKADGSVERYKARLVAKGFTQTH 1011
+A++EE+N +NN W + + P + +G KWVF K G + R KARLVAKG+ Q
Sbjct: 461 IAMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEE 282
Query: 1012 GIDYQETFAPVAKINSIRILLSLAVNFNWTLHQYDVKNAFLNGELHEEVYMRLPPGFEDK 1071
GIDY+ET+APVA++ IR+LL+ N+ L+Q DVK+AFLNG + EEVY+ PPGFE
Sbjct: 281 GIDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIP 102
Query: 1072 LGRGKVCRLKKSLYGLKQSPRAWFERFGS 1100
V +L+K+LYGLKQ+PRAW+ER +
Sbjct: 101 DKPTHVYKLQKALYGLKQAPRAWYERISN 15
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 152 bits (384), Expect = 1e-36
Identities = 69/134 (51%), Positives = 97/134 (71%)
Frame = +3
Query: 932 ISNLFVPRTIQEALGDPNWKLAVKEEMNALNKNNTWCITDLPYDKKAVGCKWVFTVKCKA 991
+S+L VP TI+EAL P W+ A+ +EM AL N TW + LP K VGC+WV+TVK
Sbjct: 3 LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182
Query: 992 DGSVERYKARLVAKGFTQTHGIDYQETFAPVAKINSIRILLSLAVNFNWTLHQYDVKNAF 1051
+G V+R KARLVAKG+TQ +GI+Y +TF+PV + ++R+ L++A +W LHQ D+KNAF
Sbjct: 183 NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362
Query: 1052 LNGELHEEVYMRLP 1065
L+G+L E++YM P
Sbjct: 363 LHGDLEEDIYMEQP 404
>CO983154
Length = 568
Score = 148 bits (374), Expect = 1e-35
Identities = 78/185 (42%), Positives = 114/185 (61%), Gaps = 1/185 (0%)
Frame = +3
Query: 615 TPQQNGIAERKNRHLLEVTRAIMFSMNVPKYLWGNALLTACHLINRMPSRVLQYETPVQV 674
TPQQNGIAERKNRHLLE R++M ++NVP + WG+A+LT+C LINRMPS L+ + P +
Sbjct: 6 TPQQNGIAERKNRHLLETARSLMLNLNVPIHHWGDAVLTSCFLINRMPSSSLENQIPHSL 185
Query: 675 LQNNFPTSRIITNIPLKVFGCLCYVYIPNIFRSKLDPKAEKCVFLGYASNKKGYKCFNPV 734
+ + P + ++ KVFGC C+V+ + KL ++ KCVFLGY+ +KGYKC++P
Sbjct: 186 VFPHDP----LFHVSPKVFGCTCFVHDLSPGLDKLSARSVKCVFLGYSRLQKGYKCYSPT 353
Query: 735 TKKFFESMDVHFVEDQPFFRENSLQGESQSSNEEDNFWEILPVLDDI-VTNDHLDAKITE 793
++++ S DV F ED PFF S S + + E+LP+ + N + I
Sbjct: 354 MRRYYMSADVTFFEDTPFF--------SPSVDHSSSLQEVLPIPSPYPLXNSGQNVSIVP 509
Query: 794 PRNPN 798
+PN
Sbjct: 510 SSSPN 524
>BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial
(21%)
Length = 421
Score = 148 bits (373), Expect = 2e-35
Identities = 72/138 (52%), Positives = 98/138 (70%)
Frame = +2
Query: 1110 SQADHTMFFKHSREGKIAILIVYVDDIIMTGDDIGEISDLKRRLEAEFDIKDLGKLKYFL 1169
S+ADH++F+ H+ GK L+VYVDDI++T D +I LK L F KDL LKYFL
Sbjct: 8 SEADHSVFYCHTSPGKCVYLMVYVDDIMITKKDATKIVQLKEHLFNHFQTKDLRYLKYFL 187
Query: 1170 GMEFARSKEGIFLNQRKYILDLLTETGMTGCKAAETPMDPNVKLKSVAEDEIIDRERYQR 1229
G+E A+S +G+ ++QRKY LD+L ETGM C+ ++PMDPN+KL + + D ERY+R
Sbjct: 188 GIEVAQSGDGVVISQRKYALDILEETGMQNCRLVDSPMDPNLKLMAYQSEVYPDPERYRR 367
Query: 1230 LAGRLIYLSHTRPDIAFA 1247
L G+LIYL+ TRPDI+FA
Sbjct: 368 LVGKLIYLTITRPDISFA 421
>BU764568
Length = 420
Score = 110 bits (275), Expect(2) = 4e-34
Identities = 48/85 (56%), Positives = 67/85 (78%)
Frame = +3
Query: 1309 TSGYCTFVGGNLVTWRSKKQNVVARSSAEAEFRSVAHGFCEVLWIKKFMQELKIAGPTPM 1368
TSGYC +GGNL++W+SKKQ+VVA+SSAEAE+R++A CE++W+K+ + ELK T M
Sbjct: 165 TSGYCVLIGGNLISWKSKKQSVVAKSSAEAEYRAMALVTCELIWLKQLL*ELKFEEDTQM 344
Query: 1369 KVYCDNKAAISIAHNPVLHDRTKHV 1393
+ CDN+AA+ IA NP+ H RTKH+
Sbjct: 345 TLICDNQAALHIASNPIFH*RTKHI 419
Score = 54.7 bits (130), Expect(2) = 4e-34
Identities = 26/61 (42%), Positives = 39/61 (63%)
Frame = +1
Query: 1252 SQFMHAPGPAHFEAVFRILRYLKGTPGKGLMFRNRGHIQVEAYTDADWAGNINDRRSTSG 1311
SQF+++P H+ AV IL+ K PGKGL++ ++GH Q+ Y+DAD G+ +DR
Sbjct: 1 SQFLNSPCQDHWNAVS*ILK*TKSAPGKGLIYEDKGHSQIIGYSDAD*VGSPSDRHQDIV 180
Query: 1312 Y 1312
Y
Sbjct: 181 Y 183
>BI427153
Length = 422
Score = 142 bits (357), Expect = 1e-33
Identities = 71/142 (50%), Positives = 95/142 (66%)
Frame = +1
Query: 608 HQSTCRDTPQQNGIAERKNRHLLEVTRAIMFSMNVPKYLWGNALLTACHLINRMPSRVLQ 667
HQSTC TPQQNGIAERKN HLLE R++M + NVP + WG+A+LTAC LINRMPS L+
Sbjct: 1 HQSTCPHTPQQNGIAERKNHHLLETARSLMLNSNVPTHHWGDAVLTACFLINRMPSSSLE 180
Query: 668 YETPVQVLQNNFPTSRIITNIPLKVFGCLCYVYIPNIFRSKLDPKAEKCVFLGYASNKKG 727
+ P ++ FP + P KVFGC C+V+ + KL ++ KCVFLGY+ +KG
Sbjct: 181 NQIPHSIV---FPNDLLFYVSP-KVFGCTCFVHDLSPGLDKLSARSVKCVFLGYSRLQKG 348
Query: 728 YKCFNPVTKKFFESMDVHFVED 749
Y C+ P ++++ S +V F ED
Sbjct: 349 YTCYFPNMRRYYMSANVTFFED 414
>BM086359
Length = 427
Score = 136 bits (343), Expect = 6e-32
Identities = 71/142 (50%), Positives = 95/142 (66%)
Frame = +1
Query: 1169 LGMEFARSKEGIFLNQRKYILDLLTETGMTGCKAAETPMDPNVKLKSVAEDEIIDRERYQ 1228
LG++ A+S GI ++Q KY LD+LTETGM C + TPMDPNVKL S + + D R
Sbjct: 1 LGIDVAQSSYGIVISQWKYALDILTETGMLDCLPSNTPMDPNVKLLSGQGEALEDPGR*C 180
Query: 1229 RLAGRLIYLSHTRPDIAFAVSVISQFMHAPGPAHFEAVFRILRYLKGTPGKGLMFRNRGH 1288
L GRL YL+ TR DI FAV V+SQF+ P + + A RILRY+K PG GL++ ++G+
Sbjct: 181 CLVGRLNYLTVTRLDITFAVGVLSQFLKDPTDSQWNATIRILRYIKNAPGPGLLYEDKGN 360
Query: 1289 IQVEAYTDADWAGNINDRRSTS 1310
+V Y DADW G+ +D+ STS
Sbjct: 361 GKVVCYFDADWPGSPSDKSSTS 426
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 136 bits (342), Expect = 8e-32
Identities = 61/109 (55%), Positives = 82/109 (74%)
Frame = +2
Query: 1296 DADWAGNINDRRSTSGYCTFVGGNLVTWRSKKQNVVARSSAEAEFRSVAHGFCEVLWIKK 1355
DA+WA + DR ST GYC +G NLV W+S K NVVARSSAEAE++++ CE++WIK+
Sbjct: 8 DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187
Query: 1356 FMQELKIAGPTPMKVYCDNKAAISIAHNPVLHDRTKHVEVDKHFIKEKI 1404
+QELK MK+ CDN+AA+ IA NPV H+RTKH+E+D HF++EK+
Sbjct: 188 LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 136 bits (342), Expect = 8e-32
Identities = 67/136 (49%), Positives = 95/136 (69%)
Frame = -2
Query: 1082 KSLYGLKQSPRAWFERFGSVVKGHGFTQSQADHTMFFKHSREGKIAILIVYVDDIIMTGD 1141
KSLYGLKQ+ R W+E+ +++ G+ QS +D+++F ++ L+VYVDDII+ GD
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTL-TKGNTFTALLVYVDDIILAGD 244
Query: 1142 DIGEISDLKRRLEAEFDIKDLGKLKYFLGMEFARSKEGIFLNQRKYILDLLTETGMTGCK 1201
I E +K L+ F IK+LGKLKYFLG+E A S+ GI ++QRKY LDLL ++G+ GCK
Sbjct: 243 SIDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCK 64
Query: 1202 AAETPMDPNVKLKSVA 1217
A TP+D ++KL S A
Sbjct: 63 PASTPLDTSIKLHSAA 16
>BU548243
Length = 599
Score = 134 bits (338), Expect = 2e-31
Identities = 66/141 (46%), Positives = 94/141 (65%)
Frame = -1
Query: 1293 AYTDADWAGNINDRRSTSGYCTFVGGNLVTWRSKKQNVVARSSAEAEFRSVAHGFCEVLW 1352
A DA WA +++D RST G F+G NL++W S+KQ V A+SS EAE+RS+A E+ W
Sbjct: 596 ALCDAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELTW 417
Query: 1353 IKKFMQELKIAGPTPMKVYCDNKAAISIAHNPVLHDRTKHVEVDKHFIKEKIDSGEICMS 1412
I+ + EL+I TP + CDNK+A++IAHN V H RTKH+E+D F+ EK+ S ++ +
Sbjct: 416 IQALLMELQIPF-TPPVILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQIF 240
Query: 1413 YIPTKSQVADVLTKSLPKRQF 1433
+IP Q A +LTK L +F
Sbjct: 239 HIPALDQWAGILTKPLSSARF 177
>BI969608 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial (19%)
Length = 454
Score = 133 bits (334), Expect = 6e-31
Identities = 62/116 (53%), Positives = 87/116 (74%), Gaps = 3/116 (2%)
Frame = -2
Query: 1322 TWR---SKKQNVVARSSAEAEFRSVAHGFCEVLWIKKFMQELKIAGPTPMKVYCDNKAAI 1378
TW +K + ARSSA+AE+R++A G CE+L +K+ ++EL++ PMK+YCDNKAAI
Sbjct: 444 TWXXGGTKNKMXXARSSAKAEYRAMAQGVCEIL*LKRILEELQLPMTLPMKLYCDNKAAI 265
Query: 1379 SIAHNPVLHDRTKHVEVDKHFIKEKIDSGEICMSYIPTKSQVADVLTKSLPKRQFD 1434
+I+ NPV H RTKHVE+D+ FIKEK+D+G+ICM +IP QVAD+ TK L + F+
Sbjct: 264 NISQNPVQHGRTKHVEIDRPFIKEKVDAGQICMPFIPFSQQVADIFTKGLFRPNFE 97
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.319 0.136 0.412
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 67,895,344
Number of Sequences: 63676
Number of extensions: 1018677
Number of successful extensions: 5624
Number of sequences better than 10.0: 168
Number of HSP's better than 10.0 without gapping: 5412
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5556
length of query: 1436
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1327
effective length of database: 5,698,948
effective search space: 7562503996
effective search space used: 7562503996
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)
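The figures in the statistics block above can be cross-checked with a short script. This is a minimal sketch, assuming the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2 and E = search_space * 2^-S') and assuming that TBLASTN counts the database length in translated residues (nucleotide letters divided by 3). The function names are hypothetical and are not part of BLAST; all constants are copied from this report. Because the report prints rounded values of Lambda and K, the recomputed bit score is only expected to land near the printed 415, not to match it exactly.

```python
import math

# Constants copied from the report above.
QUERY_LEN = 1436            # "length of query"
DB_LETTERS = 37_918_896     # nucleotide letters in GMGI
DB_SEQS = 63_676            # sequences in GMGI
HSP_LEN = 109               # "effective HSP length"
LAMBDA = 0.267              # gapped Lambda (rounded in the report)
K = 0.0410                  # gapped K (rounded in the report)


def effective_search_space():
    """Effective query length times effective database length."""
    db_len_aa = DB_LETTERS // 3                 # assumed: translated length
    eff_query = QUERY_LEN - HSP_LEN             # 1327 in the report
    eff_db = db_len_aa - DB_SEQS * HSP_LEN      # 5,698,948 in the report
    return eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to bits."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    space = effective_search_space()
    print("effective search space:", space)
    print("bits for raw score 1067:", round(bit_score(1067), 1))
    print("E for 415 bits: %.1e" % expect(415, space))
```

Under these assumptions the recomputed search space agrees with the "effective search space" line, and the E-value for the top hit (TC204438, 415 bits) falls in the same order of magnitude as the reported e-116.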
Medicago: description of AC135396.1