
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147000.7 - phase: 0
(1185 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete             521   e-148
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co...   513   e-145
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi...   154   3e-37
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ...   151   1e-36
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V...   141   1e-33
TC224357                                                              102   6e-31
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti...   132   9e-31
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro...   127   2e-29
TC232910 similar to UP|Q6I8N0 (Q6I8N0) Pol polypeptide, partial ...   110   1e-28
BU549979                                                              124   2e-28
AI959950                                                              114   3e-25
TC232995                                                              113   6e-25
CO982036                                                              112   7e-25
CO981879                                                               80   1e-24
BM307983                                                              108   2e-23
CO983516                                                              107   4e-23
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara...   105   2e-22
AW185460                                                              102   1e-21
BI321712                                                              100   4e-21
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei...    98   2e-20
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 521 bits (1342), Expect = e-148
Identities = 348/1197 (29%), Positives = 583/1197 (48%), Gaps = 74/1197 (6%)
Frame = +1
Query: 57 KEFSLESLITRLRIEEEARKQEQNE------------------------------EVFVV 86
+E L+ +I L E+EA K+E +E EV ++
Sbjct: 1159 QEAQLKKVIADLEAEKEAHKEEISELKGEVGFLNSKLENMTKSIKMLNKGSDTLDEVLLL 1338
Query: 87 SNNNTKKKFVGAVLKPAGKPFKNQNRPMNKNSNRNKTGNNSRPQIQQPPKNDAAPPFNCY 146
N ++ +G K AG+ + P + + + SR Q K+ + C+
Sbjct: 1339 GKNAGNQRGLGFNPKSAGRTTMTEFVPAKNRTGATMSQHRSRHHGMQQKKSKRKK-WRCH 1515
Query: 147 NCGQADHMARKCRNRTNRPAQAHMATDA------APDEPYVAMITEINMIAGS-DGWWVD 199
CG+ H+ C + P ++++ P V+++ ++ A + + W++D
Sbjct: 1516 YCGKYGHIKPFCYHLHGHPHHGTQSSNSRKKMMWVPKHKAVSLVVHTSLRASAKEDWYLD 1695
Query: 200 TGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLHTP 259
+G SRH+ ++ C V GD ++G+G K + L VL
Sbjct: 1696 SGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMG----KLVHDGLPSLNKVLLVK 1863
Query: 260 KIRKNLVSGFLLNKAGFTQSIGAD--LYTITKNGIFVGKGYATDGMFKLNIDMNKISSSA 317
+ NL+S L GF + L T K+ + + + D + SS+
Sbjct: 1864 GLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQETSYSSTC 2043
Query: 318 YMLCD--FNIWHSRLCHVNKRIISNMSGLGL---IPKISLNDFEKCQFCSQAKINKESHK 372
+ IWH R H++ R + + G IP + + + C C K K SH+
Sbjct: 2044 LSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQ 2223
Query: 373 SVTRITEP--FELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTHVYLMRNKNEALDIFK 430
+ T EL+H DL + GKRY +DD S +T V +R K+E ++FK
Sbjct: 2224 KLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSETFEVFK 2403
Query: 431 QYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTAPYSPEMNGKAERKNR 490
+ ++ + + IKR RSD G E+ + F E+ GI HE +A +P+ NG ERKNR
Sbjct: 2404 ELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAITPQQNGIVERKNR 2583
Query: 491 TFTELVVATMLNSGAAPH-WWGEILLTVCYVLNRVP-KTKNKISPYEILKKRQPNLSYFR 548
T E ML++ P+ W E + T CY+ NRV + + YEI K R+P++ +F
Sbjct: 2584 TLQE-AARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPSVKHFH 2760
Query: 549 TWGCLAYVRKPDPKRVKLASRAYECAFIGYALNSKAYRFYDLKSKTIIESNDV------- 601
+G Y+ +R K+ ++ F+GY+ NS+AYR ++ +++T++ES +V
Sbjct: 2761 IFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDDLSP 2940
Query: 602 ----DFYEN--------KFPFKSGDSGGNS-GGTDNSVLDQP----SEIITSNENIERDV 644
D E+ KSG++ NS TD S ++QP S I E +
Sbjct: 2941 ARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDESNINQPDKRSSTRIQKMHPKELII 3120
Query: 645 IEPGRGKRARIAKEYGPEYVAYTIEEDPSSIKEALSSIDADLWQEAINDEMDSLMSNETW 704
+P RG R + + + +P ++KEAL+ + W A+ +E++ NE W
Sbjct: 3121 GDPNRGVTTRSREVEIVSNSCFVSKIEPKNVKEALTD---EFWINAMQEELEQFKRNEVW 3291
Query: 705 HLTDLPPGCKTIGCKWILKKKLKPDGSIDKYKARLVAKGFRQRENVDFFDTYSPVTRITS 764
L P G IG KWI K K +G I + KARLVA+G+ Q E VDF +T++PV R+ S
Sbjct: 3292 ELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLES 3471
Query: 765 IRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEGFVIHGQENKVCKLDKSLYGL 824
IR+L+ +A I ++QMDVK+AFLNG L EE+Y++QP+GF + V +L K+LYGL
Sbjct: 3472 IRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVYRLKKALYGL 3651
Query: 825 KQAPKQWHEKFDNLMIENEFKVNESDKCIYSKYENNTCTIICLYVDDLLIFGSNLNAIKD 884
KQAP+ W+E+ + + ++ DK ++ K + I +YVDD++ G + ++
Sbjct: 3652 KQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRH 3831
Query: 885 VKSLLCHNFDMKDLGKADVILGIKITRTDNGISLNQSHYVEKILRKYNYFYCKPASTPCD 944
+ F+M +G+ LG+++ + ++ I L+QS Y + I++K+ TP
Sbjct: 3832 FVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKFGMENASHKRTPAP 4011
Query: 945 PSVKLFKN-TGDSVRQTEYASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIE 1003
+KL K+ G SV Q+ Y S+IGSL Y T +RPDI+YAVG+ ++ + P + H ++
Sbjct: 4012 THLKLSKDEAGTSVDQSLYRSMIGSLLYLT-ASRPDITYAVGVCARYQANPKISHLTQVK 4188
Query: 1004 RVMRYLKKTMTLGLHYQRYP-AVLEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKK 1062
R+++Y+ T G+ Y +L GY DADW +DD K+TSG F + +SW SKK
Sbjct: 4189 RILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGGCFYLGNNLISWFSKK 4368
Query: 1063 QTILAQSTMESEMIALAAASEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRY 1122
Q ++ ST E+E IA ++ + W++ +L E + + + ++CD+ +AI +N
Sbjct: 4369 QNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQ---DVMTLYCDNMSAINISKNPV 4539
Query: 1123 YNGKRRQIRRKHSTIREYLSNGTVRVDFVRTNENLADPLTKGLNREKVANTSSRMGL 1179
+ + + I +H IR+ + + + + V T E +AD TK L+ + ++G+
Sbjct: 4540 QHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDANQFEKLRGKLGI 4710
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 513 bits (1321), Expect = e-145
Identities = 334/1144 (29%), Positives = 565/1144 (49%), Gaps = 45/1144 (3%)
Frame = +1
Query: 81 EEVFVVSNNNTKKKFVGAVLKPAGKPFKNQNRPMNKNSNRNKTGNNSRPQIQQPPKNDAA 140
+EV + N ++ +G K AG+ + P ++ + + SR Q K+
Sbjct: 1324 DEVLQLGKNVGNQRGLGFNHKSAGRTTMTEFVPAKNSTGATMSQHRSRHHGTQQKKSKRK 1503
Query: 141 PPFNCYNCGQADHMARKCRNRTNRPAQAHMATDAA------PDEPYVAMITEINMIAGS- 193
+ C+ CG+ H+ C + P ++ + P V+++ ++ A +
Sbjct: 1504 K-WRCHYCGKYGHIKPFCYHLHGHPHHGTQSSSSGRKMMWVPKHKIVSLVVHTSLRASAK 1680
Query: 194 DGWWVDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILK 253
+ W++D+G SRH+ ++ C V GD + G+G K + L
Sbjct: 1681 EDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMG----KLVHDGLPSLN 1848
Query: 254 DVLHTPKIRKNLVSGFLLNKAGFTQSIGAD--LYTITKNGIFVGKGYATDGMFKLNIDMN 311
VL + NL+S L GF + L T K+ + + + D +
Sbjct: 1849 KVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQET 2028
Query: 312 KISSSAYMLCD--FNIWHSRLCHVNKRIISNMSGLGL---IPKISLNDFEKCQFCSQAKI 366
SS+ + IWH R H++ R + + G IP + + + C C K
Sbjct: 2029 SYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQ 2208
Query: 367 NKESHKSVTRITEP--FELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTHVYLMRNKNE 424
K SH+ + T EL+H DL + GKRY +DD S +T V +R K++
Sbjct: 2209 VKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSD 2388
Query: 425 ALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTAPYSPEMNGK 484
++FK+ ++ + + IKR RSD G E+ + F E+ GI HE +A +P+ NG
Sbjct: 2389 TFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITPQQNGI 2568
Query: 485 AERKNRTFTELVVATMLNSGAAPH-WWGEILLTVCYVLNRVPKTKNKISP-YEILKKRQP 542
ERKNRT E ML++ P+ W E + T CY+ NRV + + YEI K R+P
Sbjct: 2569 VERKNRTLQE-AARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKP 2745
Query: 543 NLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYALNSKAYRFYDLKSKTIIESNDVD 602
+ +F +G Y+ +R K+ ++ F+GY+ NS+AYR ++ +++T++ES +V
Sbjct: 2746 TVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINV- 2922
Query: 603 FYENKFPFKSGD-------SGGNSGGTDNSVLD-QPSEIITSNENIERD----------- 643
++ P + D SG N T S + + S+ T NI +
Sbjct: 2923 VVDDLTPARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDEPNINQPDKRPSIRIQKM 3102
Query: 644 ------VIEPGRGKRARIAKEYGPEYVAYTIEEDPSSIKEALSSIDADLWQEAINDEMDS 697
+ +P RG R + + + +P ++KEAL+ + W A+ +E++
Sbjct: 3103 HPKELIIGDPNRGVTTRSREIEIVSNSCFVSKIEPKNVKEALTD---EFWINAMQEELEQ 3273
Query: 698 LMSNETWHLTDLPPGCKTIGCKWILKKKLKPDGSIDKYKARLVAKGFRQRENVDFFDTYS 757
NE W L P G IG KWI K K +G I + KARLVA+G+ Q E VDF +T++
Sbjct: 3274 FKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFA 3453
Query: 758 PVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEGFVIHGQENKVCKL 817
PV R+ SIR+L+ +A I ++QMDVK+AFLNG L EE Y++QP+GFV + V +L
Sbjct: 3454 PVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVYRL 3633
Query: 818 DKSLYGLKQAPKQWHEKFDNLMIENEFKVNESDKCIYSKYENNTCTIICLYVDDLLIFGS 877
K+LYGLKQAP+ W+E+ + + ++ DK ++ K + I +YVDD++ G
Sbjct: 3634 KKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGM 3813
Query: 878 NLNAIKDVKSLLCHNFDMKDLGKADVILGIKITRTDNGISLNQSHYVEKILRKYNYFYCK 937
+ ++ + F+M +G+ LG+++ + ++ I L+QS Y + I++K+
Sbjct: 3814 SNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGMENAS 3993
Query: 938 PASTPCDPSVKLFKN-TGDSVRQTEYASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSM 996
TP +KL K+ G SV Q+ Y S+IGSL Y T +RPDI+YAVG+ ++ + P +
Sbjct: 3994 HKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLT-ASRPDITYAVGVCARYQANPKI 4170
Query: 997 EHWQAIERVMRYLKKTMTLGLHY-QRYPAVLEGYSDADWNNLSDDSKATSGYIFSIAGGA 1055
H ++R+++Y+ T G+ Y ++L GY DADW +DD K+TSG F +
Sbjct: 4171 SHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADWAGSADDRKSTSGGCFYLGTNL 4350
Query: 1056 VSWKSKKQTILAQSTMESEMIALAAASEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAI 1115
+SW SKKQ ++ ST E+E IA ++ + W++ +L E + + + ++CD+ +AI
Sbjct: 4351 ISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQ---DVMTLYCDNMSAI 4521
Query: 1116 AKIENRYYNGKRRQIRRKHSTIREYLSNGTVRVDFVRTNENLADPLTKGLNREKVANTSS 1175
+N + + + I +H IR+ + + + ++ V T E +AD TK L+ +
Sbjct: 4522 NISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQIADIFTKALDANQFEKLRG 4701
Query: 1176 RMGL 1179
++G+
Sbjct: 4702 KLGI 4713
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 154 bits (388), Expect = 3e-37
Identities = 66/131 (50%), Positives = 99/131 (75%)
Frame = -2
Query: 709 LPPGCKTIGCKWILKKKLKPDGSIDKYKARLVAKGFRQRENVDFFDTYSPVTRITSIRVL 768
LPPG +GC+W+ K+ P G +D+ KARLVAKG+ Q +D+ DT+SPV ++T++R+
Sbjct: 400 LPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVRLF 221
Query: 769 ISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEGFVIHGQENKVCKLDKSLYGLKQAP 828
+++AAI + +HQ+D+K AFL+G+LEE+IYM+QP GFV G+ VCKL +SLYGLKQ+P
Sbjct: 220 LAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQSP 41
Query: 829 KQWHEKFDNLM 839
+ W KF +++
Sbjct: 40 RAWFGKFSHVV 8
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 151 bits (382), Expect = 1e-36
Identities = 69/152 (45%), Positives = 102/152 (66%)
Frame = -3
Query: 690 AINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKLKPDGSIDKYKARLVAKGFRQREN 749
A+ +E++ N W L + P IG KW+ + KL G I + KARLVAKG+ Q E
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 750 VDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEGFVIHG 809
+D+ +TY+PV R+ IR+L++ +I N ++QMDVK+AFLNG ++EE+Y++QP GF I
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 810 QENKVCKLDKSLYGLKQAPKQWHEKFDNLMIE 841
+ V KL K+LYGLKQAP+ W+E+ N ++E
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFLLE 3
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 141 bits (356), Expect = 1e-33
Identities = 64/131 (48%), Positives = 98/131 (73%)
Frame = +3
Query: 672 PSSIKEALSSIDADLWQEAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKLKPDGS 731
PS+I+EAL D W++A+ DEM +L +N TW L LPPG T+GC+W+ K+ P+G
Sbjct: 21 PSTIREAL---DHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGK 191
Query: 732 IDKYKARLVAKGFRQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNG 791
+D+ KARLVAKG+ Q +++ DT+SPV +T++R+ +++AAI + +HQ+D+K AFL+G
Sbjct: 192 VDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHG 371
Query: 792 ELEEEIYMDQP 802
+LEE+IYM+QP
Sbjct: 372 DLEEDIYMEQP 404
>TC224357
Length = 860
Score = 102 bits (255), Expect(2) = 6e-31
Identities = 48/66 (72%), Positives = 56/66 (84%)
Frame = -1
Query: 1117 KIENRYYNGKRRQIRRKHSTIREYLSNGTVRVDFVRTNENLADPLTKGLNREKVANTSSR 1176
KIEN YYNGK++QIRRKH T+RE LS G VRVD VRT++NLADPLTKGL +EKV NTS
Sbjct: 770 KIENHYYNGKKQQIRRKHDTVRELLSTGVVRVDHVRTDDNLADPLTKGLAKEKVHNTSKT 591
Query: 1177 MGLMPI 1182
MGL+P+
Sbjct: 590 MGLLPL 573
Score = 51.2 bits (121), Expect(2) = 6e-31
Identities = 23/27 (85%), Positives = 25/27 (92%)
Frame = -2
Query: 1091 LLSEIPLWERPLPAVLIHCDSTAAIAK 1117
LL+EI LWERP+P VLIHCDSTAAIAK
Sbjct: 850 LLAEILLWERPIPVVLIHCDSTAAIAK 770
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 132 bits (332), Expect = 9e-31
Identities = 73/179 (40%), Positives = 109/179 (60%), Gaps = 1/179 (0%)
Frame = +1
Query: 730 GSIDKYKARLVAKGFRQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFL 789
G+ID++KARLVAK + Q D+ T+SPV ++ + +L S+A + + + +D K AFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 790 NGELEEEIYMDQPEGFVIHGQ-ENKVCKLDKSLYGLKQAPKQWHEKFDNLMIENEFKVNE 848
+G LEEE+YM+QP GFV G+ N VC+L +S YGLKQ+P+ W + I + +E
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAWPFLYCGAAI--WYDSHE 381
Query: 849 SDKCIYSKYENNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKADVILGI 907
+D ++ + C + +YVDD+ I GS+ + I +K LC F KDLGK LGI
Sbjct: 382 ADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 127 bits (320), Expect = 2e-29
Identities = 77/223 (34%), Positives = 126/223 (55%), Gaps = 5/223 (2%)
Frame = +1
Query: 960 TEYASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHY 1019
TE+ +IGSLRY + +RP+I +AV L+ +F RP + H QA +RV+R +K T+ G+ +
Sbjct: 16 TEFRRLIGSLRYLCN-SRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVLF 192
Query: 1020 -----QRYPAVLEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESE 1074
P +L GY+D+DW + K+T GY+F V+ SKKQ ++A ST E+E
Sbjct: 193 PFKAKSGKPDLL-GYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAE 369
Query: 1075 MIALAAASEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKH 1134
+A + + +A W+ LL E+ L ER +LI D+ +AI ++ +G+ + I +
Sbjct: 370 YVAASLGACQAVWMMNLLEELKLRERKPVNLLI--DNKSAINLAKHPTLHGRSKHIELRF 543
Query: 1135 STIREYLSNGTVRVDFVRTNENLADPLTKGLNREKVANTSSRM 1177
IR+ +S G V V++ + E LAD +TK + + S +
Sbjct: 544 HYIRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>TC232910 similar to UP|Q6I8N0 (Q6I8N0) Pol polypeptide, partial (3%)
Length = 690
Score = 110 bits (274), Expect(3) = 1e-28
Identities = 64/163 (39%), Positives = 95/163 (58%), Gaps = 9/163 (5%)
Frame = +2
Query: 213 FKTYTACDDQKVL-LGDSHSTDVVGIGDIELKFTSEKTLILKDVLHTPKIRKNLVSGFLL 271
F + DD ++ +G+ + ++G+G + L FTS K+L L DVL P IRKNL+SG +L
Sbjct: 206 FMEFRPIDDGSIVNMGNVATEPILGLGCVNLVFTSGKSLYL-DVLFVPGIRKNLLSGMIL 382
Query: 272 NKAGFTQSIGADLYTITKNGIFVGKGYATDGMFKLNIDMNKISSSAYM--------LCDF 323
N GF Q + +D Y ++++G FVG GY + MFKLNID+ + S M +
Sbjct: 383 NNCGFKQVLESDKYILSRHGSFVGFGYRCNEMFKLNIDVPFVHESVCMASCSSITNMTKS 562
Query: 324 NIWHSRLCHVNKRIISNMSGLGLIPKISLNDFEKCQFCSQAKI 366
IWH+RL HV+ + + +MS +IP +N EKC+ C KI
Sbjct: 563 EIWHARLGHVHYKRLKDMSKTCMIPPFDMN-IEKCKTCMLTKI 688
Score = 33.1 bits (74), Expect(3) = 1e-28
Identities = 11/18 (61%), Positives = 14/18 (77%)
Frame = +1
Query: 196 WWVDTGASRHVCYDRDMF 213
WW D+GA+ HVC DR +F
Sbjct: 154 WWFDSGATSHVCKDRRLF 207
Score = 23.5 bits (49), Expect(3) = 1e-28
Identities = 7/16 (43%), Positives = 10/16 (61%)
Frame = +3
Query: 144 NCYNCGQADHMARKCR 159
+C CG+ H+ R CR
Sbjct: 30 SCSKCGKPGHLKRDCR 77
>BU549979
Length = 615
Score = 124 bits (312), Expect = 2e-28
Identities = 66/195 (33%), Positives = 110/195 (55%), Gaps = 1/195 (0%)
Frame = -1
Query: 986 LLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAV-LEGYSDADWNNLSDDSKAT 1044
+L ++ S P ++HW+ ++VMRYL+ T L Y++ + + GYSD+D+ D ++T
Sbjct: 615 VLGRYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRST 436
Query: 1045 SGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAASEEASWLRCLLSEIPLWERPLPA 1104
SGYIF +A G VSW+S KQT++A STME E + A+ WL+ +S + + +
Sbjct: 435 SGYIFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRP 256
Query: 1105 VLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIREYLSNGTVRVDFVRTNENLADPLTKG 1164
+ ++CD+ AA+ +N + + I K+ IRE + V ++ V T + DPLTKG
Sbjct: 255 LKLYCDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKG 76
Query: 1165 LNREKVANTSSRMGL 1179
+ + + RM L
Sbjct: 75 MTPKNFKDHVVRMEL 31
>AI959950
Length = 466
Score = 114 bits (285), Expect = 3e-25
Identities = 56/131 (42%), Positives = 83/131 (62%)
Frame = -1
Query: 689 EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKLKPDGSIDKYKARLVAKGFRQRE 748
+A+ +E+D N L LP K +G KWI KL DG + +YKARLVAKG+ Q+E
Sbjct: 394 KAMQEELDQFQKNNV*KLVKLPKRKKVVGVKWIFCNKLDEDGKVVRYKARLVAKGYSQQE 215
Query: 749 NVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEGFVIH 808
+D+ T++ V R+ I +L+S A N+ ++QMDVK+AFLNG +++E+Y++QP GF
Sbjct: 214 GIDYPKTFALVARLEVICILLSFATYSNMKLYQMDVKSAFLNGLIQKEVYVEQPPGFENE 35
Query: 809 GQENKVCKLDK 819
V KL+K
Sbjct: 34 TLHQHVFKLNK 2
>TC232995
Length = 1009
Score = 113 bits (282), Expect = 6e-25
Identities = 57/172 (33%), Positives = 100/172 (58%), Gaps = 1/172 (0%)
Frame = +2
Query: 799 MDQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIENEFKVNESDKCIYSKYE 858
++QP GF I + N V KL K+LYGLKQAP+ W+E+ N ++E EF + D ++ K +
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRK 181
Query: 859 NNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKADVILGIKITRTDNGISL 918
+N ++ +YVDD++ +N + K+ + F+M +G+ LG++I +T GI +
Sbjct: 182 HNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFI 361
Query: 919 NQSHYVEKILRKYNYFYCKPASTPCDPSVKLFKN-TGDSVRQTEYASIIGSL 969
NQS Y +++++++ K STP + L K+ +G S+ +Y IG +
Sbjct: 362 NQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEV 517
>CO982036
Length = 674
Score = 112 bits (281), Expect = 7e-25
Identities = 68/214 (31%), Positives = 113/214 (52%), Gaps = 5/214 (2%)
Frame = -2
Query: 857 YENNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKADVILGIKITRTDNGI 916
Y+ + T+ L D++I GS+ I+++ S L +F +K LGK D + I++ + +
Sbjct: 673 YKTHILTVYLLVYVDIIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPDLL 494
Query: 917 SLNQSHYVEKILRKYNYFYCKPASTPCDPSVKLFKNTGDSVR-QTEYASIIGSLRYATDC 975
++ E RK +P S+P + KL K+ D T Y S++G+L+Y T
Sbjct: 493 FSLRTSIFEIFCRKPR*-QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTT-V 320
Query: 976 TRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGL----HYQRYPAVLEGYSD 1031
RP+IS+AV +C+F S P HW ++R++RYLK +++ GL P + G+ D
Sbjct: 319 IRPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCD 140
Query: 1032 ADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTI 1065
ADW + DD ++TSG + +SW KQ +
Sbjct: 139 ADWASAVDDKRSTSGAAVFLGPNLISWWXXKQQV 38
>CO981879
Length = 576
Score = 79.7 bits (195), Expect(2) = 1e-24
Identities = 39/96 (40%), Positives = 59/96 (60%)
Frame = -1
Query: 428 IFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTAPYSPEMNGKAER 487
IFK + + I+ QF ++IK FRSD G EY + ++ E GIIH+++ +P+ NG AER
Sbjct: 570 IFKTFFQMIQTQFQVKIKVFRSDNGREYFNKHLSKXXLENGIIHQSSCVDTPQQNGVAER 391
Query: 488 KNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNR 523
KNR E+ A + + A + WGE +LT Y+ N+
Sbjct: 390 KNRHLXEVARALLFQNKAPKYXWGEAILTGTYLKNK 283
Score = 53.5 bits (127), Expect(2) = 1e-24
Identities = 33/94 (35%), Positives = 48/94 (50%), Gaps = 6/94 (6%)
Frame = -2
Query: 523 RVP-KTKNKISPYEILKKRQPNLSY-----FRTWGCLAYVRKPDPKRVKLASRAYECAFI 576
R+P K N +P ++ PN + +GC +V +P + KL RA +C F+
Sbjct: 284 RMPSKILNFRTPLDVFTSAFPNNRLSCTLPLKIFGCTVFVHIHEPNQGKLEPRAKKCVFV 105
Query: 577 GYALNSKAYRFYDLKSKTIIESNDVDFYENKFPF 610
GYA N K Y+ +D SK + DV F+E K PF
Sbjct: 104 GYAPNQKGYKCFDPTSKKTFVTIDVTFFE-KTPF 6
>BM307983
Length = 406
Score = 108 bits (269), Expect = 2e-23
Identities = 56/134 (41%), Positives = 82/134 (60%), Gaps = 1/134 (0%)
Frame = +2
Query: 716 IGCKWILKKKLKPDGSIDKYKARLVAKGFRQRENVDFFDTYSPVTR-ITSIRVLISLAAI 774
+GC+WI K D ++D+YKARLVAKG+ Q +D+ +T++ + I S A
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 775 HNLIVHQMDVKTAFLNGELEEEIYMDQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEK 834
+HQ DVK AFL+G LEEE+YM+ P G+ NKVC+L K+LYGLKQ+P+ W +
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 835 FDNLMIENEFKVNE 848
F M+ +K ++
Sbjct: 362 FTQAMLSLGYKQSQ 403
>CO983516
Length = 724
Score = 107 bits (266), Expect = 4e-23
Identities = 51/121 (42%), Positives = 80/121 (65%)
Frame = +2
Query: 756 YSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEGFVIHGQENKVC 815
+ PV R+ SIR+L+ +A I ++QMDVK+AFLNG L EE+Y++QP+GF+ + V
Sbjct: 365 FHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHPDHVY 544
Query: 816 KLDKSLYGLKQAPKQWHEKFDNLMIENEFKVNESDKCIYSKYENNTCTIICLYVDDLLIF 875
+L K+LYGLKQAP+ W+E+ L+ + ++ DK ++ K + I +YVDD ++F
Sbjct: 545 RLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDD-IVF 721
Query: 876 G 876
G
Sbjct: 722 G 724
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 105 bits (261), Expect = 2e-22
Identities = 50/136 (36%), Positives = 85/136 (61%)
Frame = -2
Query: 819 KSLYGLKQAPKQWHEKFDNLMIENEFKVNESDKCIYSKYENNTCTIICLYVDDLLIFGSN 878
KSLYGLKQA ++W+EK NL+++ + + SD +++ + NT T + +YVDD+++ G +
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241
Query: 879 LNAIKDVKSLLCHNFDMKDLGKADVILGIKITRTDNGISLNQSHYVEKILRKYNYFYCKP 938
++ +K++L F +K+LGK LG+++ + GI+++Q Y +L+ CKP
Sbjct: 240 IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61
Query: 939 ASTPCDPSVKLFKNTG 954
ASTP D S+KL G
Sbjct: 60 ASTPLDTSIKLHSAAG 13
>AW185460
Length = 411
Score = 102 bits (254), Expect = 1e-21
Identities = 52/113 (46%), Positives = 71/113 (62%), Gaps = 1/113 (0%)
Frame = +2
Query: 968 SLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYP-AVL 1026
+L + TRPDI YA LL +F PS H+ A +R++RYL+ T G+ Y + L
Sbjct: 68 TLYIKSQATRPDIMYATSLLSRFMQSPSQIHFGAGKRILRYLQGTKAFGIWYTTETNSEL 247
Query: 1027 EGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALA 1079
GY+D+DW +DD K+TSGY FS+ G SW SKKQ +AQST E+E +A+A
Sbjct: 248 LGYTDSDWAGSTDDMKSTSGYAFSLGSGMFSWASKKQATVAQSTAEAEYVAVA 406
>BI321712
Length = 399
Score = 100 bits (249), Expect = 4e-21
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 1/127 (0%)
Frame = -3
Query: 865 ICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKADVILGIKITRTDNGISLNQSHYV 924
+CLYVDDL+ G+N + ++ K + + F+M D+G LGI++ + D GI + Q Y
Sbjct: 385 LCLYVDDLIFTGNNPSMFEEFKKDMSNEFEMTDMGLMAYYLGIEVKQEDKGIFITQEGYA 206
Query: 925 EKILRKYNYFYCKPASTPCDPSVKLFKN-TGDSVRQTEYASIIGSLRYATDCTRPDISYA 983
+++L+K+ P TP + KL K+ G++V T Y S+IGSLRY T CTRPDI Y
Sbjct: 205 KEVLKKFKMDDANPVGTPMECGSKLSKHEKGENVDPTLYKSLIGSLRYLT-CTRPDILYV 29
Query: 984 VGLLCKF 990
VG++ ++
Sbjct: 28 VGVVSRY 8
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 98.2 bits (243), Expect = 2e-20
Identities = 65/219 (29%), Positives = 109/219 (49%), Gaps = 1/219 (0%)
Frame = +2
Query: 291 GIFVGKGYATDGMFKLNIDMNKISSSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKI 350
G +G G + G++ L +++ + S+ + + H RL H + + M +P +
Sbjct: 218 GWTIGVGIESHGLYYLKPNLSWVCSA---VTSPKLLHERLGHPHLSKLKIM-----VPSL 373
Query: 351 SLNDFEKCQFCSQAKINKESHKSV-TRITEPFELIHSDLCELDGNLTRNGKRYFITFIDD 409
C+ C K + S + V +R+ PF +IH D+ ++ RYF+TFID+
Sbjct: 374 EKIKDLFCESCQLGKHVRSSXRHVESRVDSPFLVIHXDIWG-PNRVSSMSYRYFVTFIDE 550
Query: 410 CSDYTHVYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGI 469
S T V+LM+ ++E L F V +I+ QF IK RSD EY S + + + GI
Sbjct: 551 FSQCTRVFLMKERSEILS-FLTSVNKIKTQFGKTIKILRSDNAKEYFSSVISPFXSAQGI 727
Query: 470 IHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPH 508
+H+ + P++P+ N AERKNR E +L++ H
Sbjct: 728 LHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANEPIH 844
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.318 0.134 0.400
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 50,642,567
Number of Sequences: 63676
Number of extensions: 707700
Number of successful extensions: 3841
Number of sequences better than 10.0: 154
Number of HSP's better than 10.0 without gapping: 3702
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3786
length of query: 1185
length of database: 12,639,632
effective HSP length: 108
effective length of query: 1077
effective length of database: 5,762,624
effective search space: 6206346048
effective search space used: 6206346048
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (29.3 bits)
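
Note on the statistics above: the effective lengths and the bit scores in
parentheses can be re-derived from the other figures in this report. The
sketch below is not BLAST's own code. It assumes the simple length-adjustment
rule (subtract the effective HSP length from the query and from every database
sequence) and the standard Karlin-Altschul conversion from raw score to bits.
The function names are hypothetical. The database length is taken as total
nucleotide letters divided by 3, which matches the "length of database" line
for this translated search.

```python
import math

def effective_search_space(query_len, db_len, num_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumed rule: the effective HSP length is subtracted once from the
    query and once from every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db

def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Figures copied from the report; 37,918,896 nucleotides / 3 = 12,639,632.
db_len = 37_918_896 // 3
eff_q, eff_db, space = effective_search_space(1185, db_len, 63_676, 108)
print(eff_q, eff_db, space)  # 1077 5762624 6206346048

# S1 uses the ungapped Lambda/K pair, S2 the gapped pair.
print(round(bit_score(41, 0.318, 0.134), 1))   # 21.7
print(round(bit_score(64, 0.267, 0.0410), 1))  # 29.3
```

Under these assumptions the results agree with the "effective search space",
"S1" and "S2" lines of the report.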