
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC124966.5 - phase: 0
(1309 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 431 e-120
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 428 e-120
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 184 2e-46
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 172 7e-43
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 170 3e-42
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 162 9e-40
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 157 4e-38
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 153 4e-37
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 150 4e-36
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 145 1e-34
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 129 3e-34
BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {... 79 8e-32
CO982036 134 2e-31
BM307983 133 4e-31
BU764568 104 1e-29
BQ081067 weakly similar to GP|23495377|dbj orf490 {Oryza sativa ... 90 6e-29
BM086359 121 2e-27
BU548243 121 2e-27
AI959950 121 2e-27
BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberos... 119 7e-27
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 431 bits (1107), Expect = e-120
Identities = 256/748 (34%), Positives = 406/748 (54%), Gaps = 4/748 (0%)
Frame = +1
Query: 557 ECVLTAAYLINKLPTPILKFKSPHQVLLGSPPSYSSLRVFGCLCFA-KNMNIQHKFDERS 615
E + TA Y+ N++ + +++ G PS +FG C+ + + K D +S
Sbjct: 2647 EAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKS 2826
Query: 616 KPGIFVGYPFNQKGYRIYDMKTRQIYVSRDVQFHETVFPYQDIQSPPFNNAISINTQILD 675
GIF+GY N + YR+++ +TR + S IN +
Sbjct: 2827 DAGIFLGYSTNSRAYRVFNSRTRTVMES-------------------------INVVV-- 2925
Query: 676 NEFDDLFTGSPTHPNIPPENSHNDNSNDTIVTISTPEDDASSNPPSFSAESLSNNPPSTE 735
DDL SP E+ N D A S +AE+ + +
Sbjct: 2926 ---DDL---SPARKKDVEEDVRTSGDNVA--------DAAKSGE---NAENSDSATDESN 3054
Query: 736 MNPPNHRHSQRIRNPPIHLKDYICKINN--VTSKINFPLENYLSLSNLSNSHRAFLINII 793
+N P+ R S RI+ +H K+ I N VT++ + +SNS F+ I
Sbjct: 3055 INQPDKRSSTRIQK--MHPKELIIGDPNRGVTTRSR-------EVEIVSNS--CFVSKI- 3198
Query: 794 ENKEPKSYSQAMKSVEWRDAMAKEIQALESNNTWILCPLPEGKSAIGCKWIYKIKYHSDG 853
EPK+ +A+ W +AM +E++ + N W L P PEG + IG KWI+K K + +G
Sbjct: 3199 ---EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEG 3369
Query: 854 SIDRYKARLVAKGYSQVQGIDYHDTFAPVAKLVTVRLLLSIAAIKNWPLYQFDVNNAFLQ 913
I R KARLVA+GY+Q++G+D+ +TFAPVA+L ++RLLL +A I + LYQ DV +AFL
Sbjct: 3370 VITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLN 3549
Query: 914 GDLSEEVYMKLPPGFSHKKKPC-VCKLNKSIYGLKQASRQWFSKFSTTLIQKGFRQSISD 972
G L+EEVY++ P GF+ P V +L K++YGLKQA R W+ + + L Q+G+R+ D
Sbjct: 3550 GYLNEEVYVEQPKGFADPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGID 3729
Query: 973 YSLFTYNCDQTTIFVLVYVDDIIITGNNENAISKIKKFLAQSFSIKDLGNLSYILGIEVS 1032
+LF + + +YVDDI+ G + + + + F + +G L+Y LG++V
Sbjct: 3730 KTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVK 3909
Query: 1033 RSKKGIFLCQRKYTLDILSDSGMTGCRPSDFPMEQHLRLRPNDGTPLSDPTVYRRHVGRL 1092
+ + IFL Q +Y +I+ GM P HL+L ++ D ++YR +G L
Sbjct: 3910 QMEDSIFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSL 4089
Query: 1093 LYLTVTRPDIQYAVNTLSQFMQSPYSSHFDAATQVLRYLKGSVGKGLFLSASSSINLVGY 1152
LYLT +RPDI YAV +++ +P SH ++L+Y+ G+ G+ S+ LVGY
Sbjct: 4090 LYLTASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGY 4269
Query: 1153 ADSDWAGCPTTRRSTTGYFTMLGSNPISWKTKKQPTISRSSAEAEYRSLATLSSELQWLK 1212
D+DWAG R+ST+G LG+N ISW +KKQ +S S+AEAEY + + S+L W+K
Sbjct: 4270 CDADWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMK 4449
Query: 1213 YLLSDLGIDHPQPITIYCDSQAAIHIAENPVFHERTKHIEIDCHFVREKIKSGLIAPSYI 1272
+L + ++ +T+YCD+ +AI+I++NPV H RTKHI+I H++R+ + +I ++
Sbjct: 4450 QMLKEYNVEQ-DVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHV 4626
Query: 1273 RSSDQLADIFTKPLGGDAYKRILGKLGV 1300
+ +Q+ADIFTK L + ++++ GKLG+
Sbjct: 4627 DTEEQIADIFTKALDANQFEKLRGKLGI 4710
Score = 57.8 bits (138), Expect = 3e-08
Identities = 53/244 (21%), Positives = 95/244 (38%)
Frame = +1
Query: 286 KRQRPFCEHCNRHGHTITTCYQIHGFPSNPSKPQKKTETSPSTSANQLSSAQYHKLLTLL 345
KR++ C +C ++GH CY +HG P + ++ + ++ S H L
Sbjct: 1492 KRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSNSRKKMMWVPKHKAVSLVVHTSLRAS 1671
Query: 346 AKEDNVGSSVNLAGTAFTCIPFSWIIDSGASNHICTSLSLFSSYYPVNNQISVQQPDGSQ 405
AKED W +DSG S H+ + P + V DGS+
Sbjct: 1672 AKED-------------------WYLDSGCSRHMTGVKEFLLNIEPCSTSY-VTFGDGSK 1791
Query: 406 ALVKHIGTINCSPSLILTNVYHVPTFKFNLMSVTQLTESLNCDAIFSSSGCVFQDQATKK 465
+ +G + L V V NL+S++QL + + F+ S C+ ++ ++
Sbjct: 1792 GKIIGMGKLVHDGLPSLNKVLLVKGLTANLISISQLCDE-GFNVNFTKSECLVTNEKSEV 1968
Query: 466 MIGRGSARNELYYLNQDLVSNKHDKDHYCLSHPLSSLFSSKHCNKFDLWHLRLVALQLRN 525
++ +++ Y S S+ SSK ++ +WH R L LR
Sbjct: 1969 LMKGSRSKDNCYLWTPQETSYS------------STCLSSKE-DEVRIWHQRFGHLHLRG 2109
Query: 526 KMEL 529
++
Sbjct: 2110 MKKI 2121
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 428 bits (1101), Expect = e-120
Identities = 254/748 (33%), Positives = 406/748 (53%), Gaps = 4/748 (0%)
Frame = +1
Query: 557 ECVLTAAYLINKLPTPILKFKSPHQVLLGSPPSYSSLRVFGCLCFA-KNMNIQHKFDERS 615
E + TA Y+ N++ + +++ G P+ +FG C+ + + K D +S
Sbjct: 2650 EAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKS 2829
Query: 616 KPGIFVGYPFNQKGYRIYDMKTRQIYVSRDVQFHETVFPYQDIQSPPFNNAISINTQILD 675
GIF+GY N + YR+++ +TR + S IN +
Sbjct: 2830 DAGIFLGYSTNSRAYRVFNSRTRTVMES-------------------------INVVV-- 2928
Query: 676 NEFDDLFTGSPTHPNIPPENSHNDNSNDTIVTISTPEDDASSNPPSFSAESLSNNPPSTE 735
DDL +P E+ N D A S + +++S ++ P
Sbjct: 2929 ---DDL---TPARKKDVEEDVRTSGDNVA--------DTAKSAENAENSDSATDEP---N 3057
Query: 736 MNPPNHRHSQRIRNPPIHLKDYICKINN--VTSKINFPLENYLSLSNLSNSHRAFLINII 793
+N P+ R S RI+ +H K+ I N VT++ + +SNS F+ I
Sbjct: 3058 INQPDKRPSIRIQK--MHPKELIIGDPNRGVTTRSR-------EIEIVSNS--CFVSKI- 3201
Query: 794 ENKEPKSYSQAMKSVEWRDAMAKEIQALESNNTWILCPLPEGKSAIGCKWIYKIKYHSDG 853
EPK+ +A+ W +AM +E++ + N W L P PEG + IG KWI+K K + +G
Sbjct: 3202 ---EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEG 3372
Query: 854 SIDRYKARLVAKGYSQVQGIDYHDTFAPVAKLVTVRLLLSIAAIKNWPLYQFDVNNAFLQ 913
I R KARLVA+GY+Q++G+D+ +TFAPVA+L ++RLLL +A I + LYQ DV +AFL
Sbjct: 3373 VITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLN 3552
Query: 914 GDLSEEVYMKLPPGFSHKKKPC-VCKLNKSIYGLKQASRQWFSKFSTTLIQKGFRQSISD 972
G L+EE Y++ P GF P V +L K++YGLKQA R W+ + + L Q+G+R+ D
Sbjct: 3553 GYLNEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGID 3732
Query: 973 YSLFTYNCDQTTIFVLVYVDDIIITGNNENAISKIKKFLAQSFSIKDLGNLSYILGIEVS 1032
+LF + + +YVDDI+ G + + + + F + +G L+Y LG++V
Sbjct: 3733 KTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVK 3912
Query: 1033 RSKKGIFLCQRKYTLDILSDSGMTGCRPSDFPMEQHLRLRPNDGTPLSDPTVYRRHVGRL 1092
+ + IFL Q KY +I+ GM P HL+L ++ D ++YR +G L
Sbjct: 3913 QMEDSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSL 4092
Query: 1093 LYLTVTRPDIQYAVNTLSQFMQSPYSSHFDAATQVLRYLKGSVGKGLFLSASSSINLVGY 1152
LYLT +RPDI YAV +++ +P SH + ++L+Y+ G+ G+ S LVGY
Sbjct: 4093 LYLTASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGY 4272
Query: 1153 ADSDWAGCPTTRRSTTGYFTMLGSNPISWKTKKQPTISRSSAEAEYRSLATLSSELQWLK 1212
D+DWAG R+ST+G LG+N ISW +KKQ +S S+AEAEY + + S+L W+K
Sbjct: 4273 CDADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMK 4452
Query: 1213 YLLSDLGIDHPQPITIYCDSQAAIHIAENPVFHERTKHIEIDCHFVREKIKSGLIAPSYI 1272
+L + ++ +T+YCD+ +AI+I++NPV H RTKHI+I H++R+ + +I ++
Sbjct: 4453 QMLKEYNVEQ-DVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHV 4629
Query: 1273 RSSDQLADIFTKPLGGDAYKRILGKLGV 1300
+ +Q+ADIFTK L + ++++ GKLG+
Sbjct: 4630 DTEEQIADIFTKALDANQFEKLRGKLGI 4713
Score = 60.1 bits (144), Expect = 6e-09
Identities = 59/270 (21%), Positives = 104/270 (37%), Gaps = 6/270 (2%)
Frame = +1
Query: 266 VPAVESAALQTSKAPYRTPG------KRQRPFCEHCNRHGHTITTCYQIHGFPSNPSKPQ 319
VPA S S+ R G KR++ C +C ++GH CY +HG P + ++
Sbjct: 1417 VPAKNSTGATMSQHRSRHHGTQQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSS 1596
Query: 320 KKTETSPSTSANQLSSAQYHKLLTLLAKEDNVGSSVNLAGTAFTCIPFSWIIDSGASNHI 379
+++ S H L AKED W +DSG S H+
Sbjct: 1597 SSGRKMMWVPKHKIVSLVVHTSLRASAKED-------------------WYLDSGCSRHM 1719
Query: 380 CTSLSLFSSYYPVNNQISVQQPDGSQALVKHIGTINCSPSLILTNVYHVPTFKFNLMSVT 439
+ P + V DGS+ + +G + L V V NL+S++
Sbjct: 1720 TGVKEFLVNIEPCSTSY-VTFGDGSKGKITGMGKLVHDGLPSLNKVLLVKGLTANLISIS 1896
Query: 440 QLTESLNCDAIFSSSGCVFQDQATKKMIGRGSARNELYYLNQDLVSNKHDKDHYCLSHPL 499
QL + + F+ S C+ ++ ++ ++ +++ Y S+
Sbjct: 1897 QLCDE-GFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQET-----------SYSS 2040
Query: 500 SSLFSSKHCNKFDLWHLRLVALQLRNKMEL 529
+ LFS + ++ +WH R L LR ++
Sbjct: 2041 TCLFSKE--DEVKIWHQRFGHLHLRGMKKI 2124
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 184 bits (468), Expect = 2e-46
Identities = 86/158 (54%), Positives = 110/158 (69%)
Frame = +2
Query: 1149 LVGYADSDWAGCPTTRRSTTGYFTMLGSNPISWKTKKQPTISRSSAEAEYRSLATLSSEL 1208
L GY D+DWAGCP RRST+GY +G N +SWK+KKQ ++RSSAEAEYRS+A ++ EL
Sbjct: 17 LSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCEL 196
Query: 1209 QWLKYLLSDLGIDHPQPITIYCDSQAAIHIAENPVFHERTKHIEIDCHFVREKIKSGLIA 1268
W+K L +L + +YCD+QAA+HIA NPVFHERTKHIEIDCHF+REK+ S I
Sbjct: 197 MWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEIV 376
Query: 1269 PSYIRSSDQLADIFTKPLGGDAYKRILGKLGVIEISIP 1306
+I S+DQ DI TK L G + + KLG ++ P
Sbjct: 377 TEFIGSNDQPVDILTKSLRGPKIQIVCSKLGAYDLYAP 490
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 172 bits (437), Expect = 7e-43
Identities = 79/129 (61%), Positives = 101/129 (78%), Gaps = 1/129 (0%)
Frame = -2
Query: 831 PLPEGKSAIGCKWIYKIKYHSDGSIDRYKARLVAKGYSQVQGIDYHDTFAPVAKLVTVRL 890
PLP GK+ +GC+W+Y +K G +DR KARLVAKGY+QV GIDY DTF+PVAKL TVRL
Sbjct: 403 PLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVRL 224
Query: 891 LLSIAAIKNWPLYQFDVNNAFLQGDLSEEVYMKLPPGF-SHKKKPCVCKLNKSIYGLKQA 949
L++AAI +WPL+Q D+ NAFL GDL E++YM+ PPGF + + VCKL++S+YGLKQ+
Sbjct: 223 FLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQS 44
Query: 950 SRQWFSKFS 958
R WF KFS
Sbjct: 43 PRAWFGKFS 17
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 170 bits (431), Expect = 3e-42
Identities = 83/221 (37%), Positives = 139/221 (62%), Gaps = 3/221 (1%)
Frame = +1
Query: 1081 DPTVYRRHVGRLLYLTVTRPDIQYAVNTLSQFMQSPYSSHFDAATQVLRYLKGSVGKGL- 1139
D T +RR +G L YL +RP+I +AV+ +S+FM+ P SH AA +VLR +KG++G G+
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVL 189
Query: 1140 --FLSASSSINLVGYADSDWAGCPTTRRSTTGYFTMLGSNPISWKTKKQPTISRSSAEAE 1197
F + S +L+GY DSDW P +ST GY M P++ +KKQ I+ S+ EAE
Sbjct: 190 FPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAE 369
Query: 1198 YRSLATLSSELQWLKYLLSDLGIDHPQPITIYCDSQAAIHIAENPVFHERTKHIEIDCHF 1257
Y + + + + W+ LL +L + +P+ + D+++AI++A++P H R+KHIE+ H+
Sbjct: 370 YVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHY 549
Query: 1258 VREKIKSGLIAPSYIRSSDQLADIFTKPLGGDAYKRILGKL 1298
+R+++ G + Y ++ +QLAD+ TKP+ +K+I +L
Sbjct: 550 IRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 162 bits (410), Expect = 9e-40
Identities = 72/128 (56%), Positives = 96/128 (74%)
Frame = +3
Query: 798 PKSYSQAMKSVEWRDAMAKEIQALESNNTWILCPLPEGKSAIGCKWIYKIKYHSDGSIDR 857
P + +A+ WR AM E+QALE+N TW L PLP GK+ +GC+W+Y +K +G +DR
Sbjct: 21 PSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGKVDR 200
Query: 858 YKARLVAKGYSQVQGIDYHDTFAPVAKLVTVRLLLSIAAIKNWPLYQFDVNNAFLQGDLS 917
KARLVAKGY+QV GI+Y DTF+PV L TVRL L++AAI++WPL+Q D+ NAFL GDL
Sbjct: 201 LKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHGDLE 380
Query: 918 EEVYMKLP 925
E++YM+ P
Sbjct: 381 EDIYMEQP 404
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 157 bits (396), Expect = 4e-38
Identities = 90/189 (47%), Positives = 123/189 (64%), Gaps = 4/189 (2%)
Frame = +3
Query: 1123 AATQVLRYLKGSVGKGLFLSASSSINLVGYADSDWAGCPTTRRSTTGYFTMLGSNPISWK 1182
AAT+VL+YLKG KGL S S I ++G++D+DWA C + +S T Y LGS+ ISWK
Sbjct: 18 AATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLISWK 197
Query: 1183 TKKQPTISR--SSAEAEYRSLATLSSELQWLKYLLSDLGIDHPQPITIYCDSQAAIH-IA 1239
KKQ T+SR SS+EA+YR+L + + ELQWL YLL DL + IYCD+Q+A+ +
Sbjct: 198 AKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLHV-----TLIYCDNQSALQ*LP 362
Query: 1240 ENPVFHERTKHIEIDCHFVREKIKSGLI-APSYIRSSDQLADIFTKPLGGDAYKRILGKL 1298
++H + +EIDCH VREK + GL+ + SS+QLADIFTK L + L KL
Sbjct: 363 IKVIYHGQ---LEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSNLSKL 533
Query: 1299 GVIEISIPP 1307
G+ +I +PP
Sbjct: 534 GLSDIFLPP 560
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 153 bits (387), Expect = 4e-37
Identities = 76/152 (50%), Positives = 101/152 (66%), Gaps = 1/152 (0%)
Frame = -3
Query: 813 AMAKEIQALESNNTWILCPLPEGKSAIGCKWIYKIKYHSDGSIDRYKARLVAKGYSQVQG 872
AM +E+ E NN W L PE IG KW+++ K G I R KARLVAKGY+Q +G
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 873 IDYHDTFAPVAKLVTVRLLLSIAAIKNWPLYQFDVNNAFLQGDLSEEVYMKLPPGFSHKK 932
IDY +T+APVA+L +R+LL+ +I N+ LYQ DV +AFL G + EEVY++ PPGF
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 933 KPC-VCKLNKSIYGLKQASRQWFSKFSTTLIQ 963
KP V KL K++YGLKQA R W+ + S L++
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFLLE 3
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 150 bits (379), Expect = 4e-36
Identities = 84/179 (46%), Positives = 114/179 (62%), Gaps = 2/179 (1%)
Frame = +1
Query: 853 GSIDRYKARLVAKGYSQVQGIDYHDTFAPVAKLVTVRLLLSIAAIKNWPLYQFDVNNAFL 912
G+ID++KARLVAK Y+QV G DY TF+PVAK+ V LL S+A + +WPL+ D NAFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 913 QGDLSEEVYMKLPPGF--SHKKKPCVCKLNKSIYGLKQASRQWFSKFSTTLIQKGFRQSI 970
G L EEVYM+ P GF + VC+L +S YGLKQ+ R W + I +
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAWPFLYCGAAI--WYDSHE 381
Query: 971 SDYSLFTYNCDQTTIFVLVYVDDIIITGNNENAISKIKKFLAQSFSIKDLGNLSYILGI 1029
+D+S+F + Q I+++VYVDDI ITG++++ I+ +K L F KDLG L Y LGI
Sbjct: 382 ADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 145 bits (366), Expect = 1e-34
Identities = 69/138 (50%), Positives = 97/138 (70%)
Frame = -2
Query: 941 KSIYGLKQASRQWFSKFSTTLIQKGFRQSISDYSLFTYNCDQTTIFVLVYVDDIIITGNN 1000
KS+YGLKQASR+W+ K + L+++G+ QSISDYSLFT T +LVYVDDII+ G++
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241
Query: 1001 ENAISKIKKFLAQSFSIKDLGNLSYILGIEVSRSKKGIFLCQRKYTLDILSDSGMTGCRP 1060
+ +IK L +F IK+LG L Y LG+EV+ S+ GI + QRKY LD+L DSG+ GC+P
Sbjct: 240 IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61
Query: 1061 SDFPMEQHLRLRPNDGTP 1078
+ P++ ++L GTP
Sbjct: 60 ASTPLDTSIKLHSAAGTP 7
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 129 bits (325), Expect(2) = 3e-34
Identities = 59/109 (54%), Positives = 77/109 (70%)
Frame = +2
Query: 1154 DSDWAGCPTTRRSTTGYFTMLGSNPISWKTKKQPTISRSSAEAEYRSLATLSSELQWLKY 1213
D++WA P R ST GY +G N + WK+ K ++RSSAEAEY+++ + EL W+K
Sbjct: 8 DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187
Query: 1214 LLSDLGIDHPQPITIYCDSQAAIHIAENPVFHERTKHIEIDCHFVREKI 1262
LL +L Q + + CD+QAA+HIA NPVFHERTKHIEIDCHFVREK+
Sbjct: 188 LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
Score = 35.4 bits (80), Expect(2) = 3e-34
Identities = 16/33 (48%), Positives = 22/33 (66%)
Frame = +3
Query: 1271 YIRSSDQLADIFTKPLGGDAYKRILGKLGVIEI 1303
++ S+DQLA+IFTK L G + I KLG E+
Sbjct: 360 FVSSNDQLANIFTKSLRGPRIQNICSKLGAFEL 458
>BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {Vitis
vinifera}, partial (19%)
Length = 437
Score = 79.3 bits (194), Expect(2) = 8e-32
Identities = 36/66 (54%), Positives = 49/66 (73%)
Frame = +2
Query: 985 IFVLVYVDDIIITGNNENAISKIKKFLAQSFSIKDLGNLSYILGIEVSRSKKGIFLCQRK 1044
++++VYVDDI+ITGN++ I+++K L F KDLG Y LGIEV++SK GI + QRK
Sbjct: 35 VYLMVYVDDIVITGNDQGKIAQLKGHLFSHFQTKDLGKFEYFLGIEVAQSKDGIIISQRK 214
Query: 1045 YTLDIL 1050
Y LDIL
Sbjct: 215 YALDIL 232
Score = 77.8 bits (190), Expect(2) = 8e-32
Identities = 35/68 (51%), Positives = 48/68 (70%)
Frame = +1
Query: 1053 SGMTGCRPSDFPMEQHLRLRPNDGTPLSDPTVYRRHVGRLLYLTVTRPDIQYAVNTLSQF 1112
+GM+ CRP D M+ + +L PN G P SD YR VG+L+YLT+TRP+I + V +SQF
Sbjct: 229 TGMSDCRPIDSLMDPNKKLLPNQGKPYSDSERYRILVGKLIYLTITRPNISFVVGVVSQF 408
Query: 1113 MQSPYSSH 1120
MQSP++ H
Sbjct: 409 MQSPHNDH 432
>CO982036
Length = 674
Score = 134 bits (338), Expect = 2e-31
Identities = 83/212 (39%), Positives = 112/212 (52%), Gaps = 3/212 (1%)
Frame = -2
Query: 978 YNCDQTTIFVLVYVDDIIITGNNENAISKIKKFLAQSFSIKDLGNLSYILGIEVSRSKKG 1037
Y T+++LVYVD IIITG++ I + L SF +K LG L Y + IEV +S
Sbjct: 673 YKTHILTVYLLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEV-KSMPD 500
Query: 1038 IFLCQRKYTLDILSDSGMTGCRPSDFPMEQHLRLRPNDGTPLSDPTVYRRHVGRLLYLTV 1097
+ R +I +P PM +L +D S PT YR VG L Y TV
Sbjct: 499 LLFSLRTSIFEIFCRKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTV 320
Query: 1098 TRPDIQYAVNTLSQFMQSPYSSHFDAATQVLRYLKGSVGKGLFLS---ASSSINLVGYAD 1154
RP+I +AVN + QFM +P SH+ ++LRYLKGS+ GL L +S + + G+ D
Sbjct: 319 IRPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCD 140
Query: 1155 SDWAGCPTTRRSTTGYFTMLGSNPISWKTKKQ 1186
+DWA +RST+G LG N ISW KQ
Sbjct: 139 ADWASAVDDKRSTSGAAVFLGPNLISWWXXKQ 44
>BM307983
Length = 406
Score = 133 bits (335), Expect = 4e-31
Identities = 65/133 (48%), Positives = 89/133 (66%), Gaps = 2/133 (1%)
Frame = +2
Query: 839 IGCKWIYKIKYHSDGSIDRYKARLVAKGYSQVQGIDYHDTFAPVAKLVTVRLLLS-IAAI 897
+GC+WIY +KY +D ++DRYKARLVAKGY Q GIDY +TFA K + A
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 898 KNWPLYQFDVNNAFLQGDLSEEVYMKLPPGF-SHKKKPCVCKLNKSIYGLKQASRQWFSK 956
W ++QFDV NAFL G L EEVYM++PPG+ + VC+L K++YGLKQ+ R WF +
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 957 FSTTLIQKGFRQS 969
F+ ++ G++QS
Sbjct: 362 FTQAMLSLGYKQS 400
>BU764568
Length = 420
Score = 104 bits (259), Expect(2) = 1e-29
Identities = 47/85 (55%), Positives = 65/85 (76%)
Frame = +3
Query: 1167 TTGYFTMLGSNPISWKTKKQPTISRSSAEAEYRSLATLSSELQWLKYLLSDLGIDHPQPI 1226
T+GY ++G N ISWK+KKQ +++SSAEAEYR++A ++ EL WLK LL +L + +
Sbjct: 165 TSGYCVLIGGNLISWKSKKQSVVAKSSAEAEYRAMALVTCELIWLKQLL*ELKFEEDTQM 344
Query: 1227 TIYCDSQAAIHIAENPVFHERTKHI 1251
T+ CD+QAA+HIA NP+FH RTKHI
Sbjct: 345 TLICDNQAALHIASNPIFH*RTKHI 419
Score = 45.8 bits (107), Expect(2) = 1e-29
Identities = 21/61 (34%), Positives = 33/61 (53%)
Frame = +1
Query: 1110 SQFMQSPYSSHFDAATQVLRYLKGSVGKGLFLSASSSINLVGYADSDWAGCPTTRRSTTG 1169
SQF+ SP H++A + +L+ K + GKGL ++GY+D+D G P+ R
Sbjct: 1 SQFLNSPCQDHWNAVS*ILK*TKSAPGKGLIYEDKGHSQIIGYSDAD*VGSPSDRHQDIV 180
Query: 1170 Y 1170
Y
Sbjct: 181 Y 183
>BQ081067 weakly similar to GP|23495377|dbj orf490 {Oryza sativa (japonica
cultivar-group)}, partial (18%)
Length = 430
Score = 90.1 bits (222), Expect(2) = 6e-29
Identities = 42/89 (47%), Positives = 59/89 (66%)
Frame = +1
Query: 797 EPKSYSQAMKSVEWRDAMAKEIQALESNNTWILCPLPEGKSAIGCKWIYKIKYHSDGSID 856
EP + QA+ S WR AM + AL N T L LP GK+AI CKW+++IK + G+++
Sbjct: 31 EPSTVKQALISPPWRQAMQADFDALMENKTLTLTSLPSGKAAIDCKWVFRIKENLYGTLN 210
Query: 857 RYKARLVAKGYSQVQGIDYHDTFAPVAKL 885
RY++RLVAKG+ G DY +TF+PV +L
Sbjct: 211 RYRSRLVAKGFHLKFGCDYSETFSPVIEL 297
Score = 57.4 bits (137), Expect(2) = 6e-29
Identities = 27/43 (62%), Positives = 32/43 (73%)
Frame = +3
Query: 886 VTVRLLLSIAAIKNWPLYQFDVNNAFLQGDLSEEVYMKLPPGF 928
VT+RL+L IA +WPL Q D+NNAFL G L+EEVYM PGF
Sbjct: 297 VTIRLILFIALTNHWPLQQVDINNAFLHGLLTEEVYMVQLPGF 425
>BM086359
Length = 427
Score = 121 bits (304), Expect = 2e-27
Identities = 63/142 (44%), Positives = 90/142 (63%)
Frame = +1
Query: 1027 LGIEVSRSKKGIFLCQRKYTLDILSDSGMTGCRPSDFPMEQHLRLRPNDGTPLSDPTVYR 1086
LGI+V++S GI + Q KY LDIL+++GM C PS+ PM+ +++L G L DP
Sbjct: 1 LGIDVAQSSYGIVISQWKYALDILTETGMLDCLPSNTPMDPNVKLLSGQGEALEDPGR*C 180
Query: 1087 RHVGRLLYLTVTRPDIQYAVNTLSQFMQSPYSSHFDAATQVLRYLKGSVGKGLFLSASSS 1146
VGRL YLTVTR DI +AV LSQF++ P S ++A ++LRY+K + G GL +
Sbjct: 181 CLVGRLNYLTVTRLDITFAVGVLSQFLKDPTDSQWNATIRILRYIKNAPGPGLLYEDKGN 360
Query: 1147 INLVGYADSDWAGCPTTRRSTT 1168
+V Y D+DW G P+ + ST+
Sbjct: 361 GKVVCYFDADWPGSPSDKSSTS 426
>BU548243
Length = 599
Score = 121 bits (303), Expect = 2e-27
Identities = 69/155 (44%), Positives = 91/155 (58%)
Frame = -1
Query: 1154 DSDWAGCPTTRRSTTGYFTMLGSNPISWKTKKQPTISRSSAEAEYRSLATLSSELQWLKY 1213
D+ WA RST G LG N ISW ++KQ ++SS EAEYRS+A S+EL W++
Sbjct: 587 DAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELTWIQA 408
Query: 1214 LLSDLGIDHPQPITIYCDSQAAIHIAENPVFHERTKHIEIDCHFVREKIKSGLIAPSYIR 1273
LL +L I P+ I CD+++A+ IA N VFH RTKH+EID FV EK+ S + +I
Sbjct: 407 LLMELQIPFTPPV-ILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQIFHIP 231
Query: 1274 SSDQLADIFTKPLGGDAYKRILGKLGVIEISIPPP 1308
+ DQ A I TKPL + + KL V S P
Sbjct: 230 ALDQWAGILTKPLSSARFTFLKSKLTVKGFSSEKP 126
>AI959950
Length = 466
Score = 121 bits (303), Expect = 2e-27
Identities = 65/130 (50%), Positives = 83/130 (63%), Gaps = 1/130 (0%)
Frame = -1
Query: 813 AMAKEIQALESNNTWILCPLPEGKSAIGCKWIYKIKYHSDGSIDRYKARLVAKGYSQVQG 872
AM +E+ + NN L LP+ K +G KWI+ K DG + RYKARLVAKGYSQ +G
Sbjct: 391 AMQEELDQFQKNNV*KLVKLPKRKKVVGVKWIFCNKLDEDGKVVRYKARLVAKGYSQQEG 212
Query: 873 IDYHDTFAPVAKLVTVRLLLSIAAIKNWPLYQFDVNNAFLQGDLSEEVYMKLPPGFSHKK 932
IDY TFA VA+L + +LLS A N LYQ DV +AFL G + +EVY++ PPGF ++
Sbjct: 211 IDYPKTFALVARLEVICILLSFATYSNMKLYQMDVKSAFLNGLIQKEVYVEQPPGFENET 32
Query: 933 -KPCVCKLNK 941
V KLNK
Sbjct: 31 LHQHVFKLNK 2
>BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial
(21%)
Length = 421
Score = 119 bits (299), Expect = 7e-27
Identities = 62/138 (44%), Positives = 88/138 (62%), Gaps = 1/138 (0%)
Frame = +2
Query: 969 SISDYSLF-TYNCDQTTIFVLVYVDDIIITGNNENAISKIKKFLAQSFSIKDLGNLSYIL 1027
S +D+S+F + ++++VYVDDI+IT + I ++K+ L F KDL L Y L
Sbjct: 8 SEADHSVFYCHTSPGKCVYLMVYVDDIMITKKDATKIVQLKEHLFNHFQTKDLRYLKYFL 187
Query: 1028 GIEVSRSKKGIFLCQRKYTLDILSDSGMTGCRPSDFPMEQHLRLRPNDGTPLSDPTVYRR 1087
GIEV++S G+ + QRKY LDIL ++GM CR D PM+ +L+L DP YRR
Sbjct: 188 GIEVAQSGDGVVISQRKYALDILEETGMQNCRLVDSPMDPNLKLMAYQSEVYPDPERYRR 367
Query: 1088 HVGRLLYLTVTRPDIQYA 1105
VG+L+YLT+TRPDI +A
Sbjct: 368 LVGKLIYLTITRPDISFA 421
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.319 0.134 0.403
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 65,323,974
Number of Sequences: 63676
Number of extensions: 1054552
Number of successful extensions: 7452
Number of sequences better than 10.0: 181
Number of HSP's better than 10.0 without gapping: 7054
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 7364
length of query: 1309
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1200
effective length of database: 5,698,948
effective search space: 6838737600
effective search space used: 6838737600
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)
Medicago: description of AC124966.5