
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146790.9 + phase: 0
(1349 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 593 e-169
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 589 e-168
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 162 9e-40
CO982036 161 2e-39
BU548243 154 3e-37
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 154 3e-37
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 152 7e-37
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 142 1e-33
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 138 2e-32
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 115 9e-32
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 133 6e-31
TC221132 weakly similar to UP|O23529 (O23529) RETROTRANSPOSON li... 84 2e-30
BU549979 129 1e-29
BE211208 127 3e-29
TC232995 127 4e-29
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 124 2e-28
CO983516 122 8e-28
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 112 7e-27
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 119 1e-26
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 118 2e-26
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 593 bits (1529), Expect = e-169
Identities = 382/1154 (33%), Positives = 594/1154 (51%), Gaps = 26/1154 (2%)
Frame = +1
Query: 216 NATANVANKFDHRDNRFNSNNNWRGSNFRGWRGGRGRGRSSKAPCQVYGKTNHTAINCFH 275
+A +F NR + + S G + + + + K C GK H C+H
Sbjct: 1384 SAGRTTMTEFVPAKNRTGATMSQHRSRHHGMQ--QKKSKRKKWRCHYCGKYGHIKPFCYH 1557
Query: 276 RFDKNYSRSNYSADSDKQ---GSHNAFI----ASQNSVEDYDWYFDSGASNHVTHQTDKF 328
+ + S K H A S + DWY DSG S H+T +
Sbjct: 1558 LHGHPHHGTQSSNSRKKMMWVPKHKAVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFL 1737
Query: 329 QDLTEHHGKNSQVVGNGDKLEIVATGSSKLKSL-NLDDVLYVPNITKNLLSVSKLAADNN 387
++ E + G+G K +I+ G L +L+ VL V +T NL+S+S+L D
Sbjct: 1738 LNI-EPCSTSYVTFGDGSKGKIIGMGKLVHDGLPSLNKVLLVKGLTANLISISQLC-DEG 1911
Query: 388 IFVEFDKNCCFVKEKLTGKVILKGLLKNGLYQLSDTKGNPYAF--VSVKES----WHRRL 441
V F K+ C V + + +V++KG L + Y+ +S KE WH+R
Sbjct: 1912 FNVNFTKSECLVTNEKS-EVLMKGSRSKDNCYLWTPQETSYSSTCLSSKEDEVRIWHQRF 2088
Query: 442 GHPNNKVLDKVLKSCNVKVPPS---DNFSFCEACQYGKMHLLPF-KSSSSHAQEPLELVH 497
GH + + + K++ V+ P+ + C CQ GK + K LEL+H
Sbjct: 2089 GHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLH 2268
Query: 498 TDVWGPAPIMSSSGFKYYVHFIDDFSRFTWIYPLKQKSETVQAFTTENQFNKR-----IK 552
D+ GP + S G +Y +DDFSRFTW+ +++KSET + F + +R IK
Sbjct: 2269 MDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVIK 2448
Query: 553 VIQCDGGGEYKPVQ--KLAIDVGIQFRMSCPYTFQQNGRAERKHRHIAEFGLTLLAQAQM 610
I+ D G E++ + + GI S T QQNG ERK+R + E +L ++
Sbjct: 2449 RIRSDHGREFENSRFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKEL 2628
Query: 611 PLHYWWEAFSTAVYLINRLPSQVTQNESPYSLIFHKEPNYKLLKPFGCACYPCLKPYNQH 670
P + W EA +TA Y+ NR+ + + Y + ++P+ K FG CY +
Sbjct: 2629 PYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRR 2808
Query: 671 KLQFHTTRCVFLGYSNSHKGYKCLNSHGRTFISRHVIFNEDLFPFHEGFLNTRSPLKTTI 730
K+ + +FLGYS + + Y+ NS RT + + +DL P + K
Sbjct: 2809 KMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDDLSPARK---------KDVE 2961
Query: 731 NNPSTSFPLCSAGNSINDASIPIIEEENQDETNEEDSQGVTSDTEQTDNGSSEGDTTHEE 790
+ TS G+++ DA+ EN D +E S+ Q D SS
Sbjct: 2962 EDVRTS------GDNVADAAKSGENAENSDSATDE------SNINQPDKRSSTR------ 3087
Query: 791 TLDIVQQQNVGESSLDTNTSNAIHTRSKSGIHKPKLPYIGITETYKDTVEPANVKEALTR 850
+Q+ + E + + + + TRS+ I + +EP NVKEALT
Sbjct: 3088 ----IQKMHPKELIIG-DPNRGVTTRSRE-------VEIVSNSCFVSKIEPKNVKEALTD 3231
Query: 851 TLWKEAMQKEFQALMSNKTWILVPYQDQENIVDSKWVFKTKYKSDGSIERRKARLVAKGF 910
W AMQ+E + N+ W LVP + N++ +KW+FK K +G I R KARLVA+G+
Sbjct: 3232 EFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGY 3411
Query: 911 QQTAGIDYEETFSPVVKVSTVRVILSIAVHLNWEVRQLDINNAFLNGYLKETVFMHQPEG 970
Q G+D++ETF+PV ++ ++R++L +A L +++ Q+D+ +AFLNGYL E V++ QP+G
Sbjct: 3412 TQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKG 3591
Query: 971 FVDPTKPNHICKLSKAIYGLKQAPRAWFDSLKTALLNWGFQNTKSDPSLFLLKGKDHITF 1030
F DPT P+H+ +L KA+YGLKQAPRAW++ L L G++ D +LF+ + +++
Sbjct: 3592 FADPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMI 3771
Query: 1031 LLIYVDDIIVTGSSNNFLQAFIKQLNDVFSLKDLGRLHYFLGIEVQRDASGMYLKQSKYI 1090
IYVDDI+ G SN L+ F++Q+ F + +G L YFLG++V++ ++L QS+Y
Sbjct: 3772 AQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYA 3951
Query: 1091 GDLLKKFKMENASPCPTPMITGRHFTV-EGEKLKDPTVFRQAIGGLQYLTHTRPDIAFSV 1149
+++KKF MENAS TP T + E D +++R IG L YLT +RPDI ++V
Sbjct: 3952 KNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAV 4131
Query: 1150 NKLSQYMSSPTTDHWQGIKRILRYLQGTINYCLHIKPSTDLDITGFSDADWATSIDDRKS 1209
++Y ++P H +KRIL+Y+ GT +Y + ++ + G+ DADWA S DDRKS
Sbjct: 4132 GVCARYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKS 4311
Query: 1210 MAGQCVFLGETLISWSSRKQKVVSRSSTESEYRALVDLAAEIAWIHSLLFELKLPLPRKP 1269
+G C +LG LISW S+KQ VS S+ E+EY A +++ W+ +L E +
Sbjct: 4312 TSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVE-QDVM 4488
Query: 1270 ILWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNKVVVAYVPTTDQIADCLTKPL 1329
L+CDN+SA ++ NPV H+R+KHI+I HYIRD V + + +V T +QIAD TK L
Sbjct: 4489 TLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKAL 4668
Query: 1330 SHTRFSQLRDKLGV 1343
+F +LR KLG+
Sbjct: 4669 DANQFEKLRGKLGI 4710
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 589 bits (1518), Expect = e-168
Identities = 374/1123 (33%), Positives = 575/1123 (50%), Gaps = 31/1123 (2%)
Frame = +1
Query: 252 RGRSSKAPCQVYGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAFIASQNSVE---- 307
+ + K C GK H C+H + ++ S G ++ V
Sbjct: 1489 KSKRKKWRCHYCGKYGHIKPFCYHL----HGHPHHGTQSSSSGRKMMWVPKHKIVSLVVH 1656
Query: 308 -------DYDWYFDSGASNHVTHQTDKFQDLTEHHGKNSQVVGNGDKLEIVATGSSKLKS 360
DWY DSG S H+T + ++ E + G+G K +I G
Sbjct: 1657 TSLRASAKEDWYLDSGCSRHMTGVKEFLVNI-EPCSTSYVTFGDGSKGKITGMGKLVHDG 1833
Query: 361 L-NLDDVLYVPNITKNLLSVSKLAADNNIFVEFDKNCCFVKEKLTGKVILKGLLKNG--- 416
L +L+ VL V +T NL+S+S+L D V F K+ C V + + +V++KG
Sbjct: 1834 LPSLNKVLLVKGLTANLISISQLC-DEGFNVNFTKSECLVTNEKS-EVLMKGSRSKDNCY 2007
Query: 417 LYQLSDTKGNPYAFVSVKES---WHRRLGHPNNKVLDKVLKSCNVKVPPS---DNFSFCE 470
L+ +T + S ++ WH+R GH + + + K++ V+ P+ + C
Sbjct: 2008 LWTPQETSYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICG 2187
Query: 471 ACQYGKMHLLPF-KSSSSHAQEPLELVHTDVWGPAPIMSSSGFKYYVHFIDDFSRFTWIY 529
CQ GK + K LEL+H D+ GP + S G +Y +DDFSRFTW+
Sbjct: 2188 ECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVN 2367
Query: 530 PLKQKSETVQAFTTENQFNKR-----IKVIQCDGGGEYKPVQ--KLAIDVGIQFRMSCPY 582
+++KS+T + F + +R IK I+ D G E++ + + GI S
Sbjct: 2368 FIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAI 2547
Query: 583 TFQQNGRAERKHRHIAEFGLTLLAQAQMPLHYWWEAFSTAVYLINRLPSQVTQNESPYSL 642
T QQNG ERK+R + E +L ++P + W EA +TA Y+ NR+ + + Y +
Sbjct: 2548 TPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEI 2727
Query: 643 IFHKEPNYKLLKPFGCACYPCLKPYNQHKLQFHTTRCVFLGYSNSHKGYKCLNSHGRTFI 702
++P K FG CY + K+ + +FLGYS + + Y+ NS RT +
Sbjct: 2728 WKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVM 2907
Query: 703 SRHVIFNEDLFPFHEGFLNTRSPLKTTINNPSTSFPLCSAGNSINDASIPIIEEENQDET 762
+ +DL P + K + TS G+++ D + EN D
Sbjct: 2908 ESINVVVDDLTPARK---------KDVEEDVRTS------GDNVADTAKSAENAENSDSA 3042
Query: 763 NEEDSQGVTSDTEQTDNGSS-EGDTTHEETLDIVQQQNVGESSLDTNTSNAIHTRSKSGI 821
+E + Q D S H + L I+ N G T S I S S
Sbjct: 3043 TDE------PNINQPDKRPSIRIQKMHPKEL-IIGDPNRGV----TTRSREIEIVSNS-- 3183
Query: 822 HKPKLPYIGITETYKDTVEPANVKEALTRTLWKEAMQKEFQALMSNKTWILVPYQDQENI 881
+ +EP NVKEALT W AMQ+E + N+ W LVP + N+
Sbjct: 3184 ------------CFVSKIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNV 3327
Query: 882 VDSKWVFKTKYKSDGSIERRKARLVAKGFQQTAGIDYEETFSPVVKVSTVRVILSIAVHL 941
+ +KW+FK K +G I R KARLVA+G+ Q G+D++ETF+PV ++ ++R++L +A L
Sbjct: 3328 IGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACIL 3507
Query: 942 NWEVRQLDINNAFLNGYLKETVFMHQPEGFVDPTKPNHICKLSKAIYGLKQAPRAWFDSL 1001
+++ Q+D+ +AFLNGYL E ++ QP+GFVDPT P+H+ +L KA+YGLKQAPRAW++ L
Sbjct: 3508 KFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERL 3687
Query: 1002 KTALLNWGFQNTKSDPSLFLLKGKDHITFLLIYVDDIIVTGSSNNFLQAFIKQLNDVFSL 1061
L G++ D +LF+ + +++ IYVDDI+ G SN L+ F++Q+ F +
Sbjct: 3688 TEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEM 3867
Query: 1062 KDLGRLHYFLGIEVQRDASGMYLKQSKYIGDLLKKFKMENASPCPTPMITGRHFTV-EGE 1120
+G L YFLG++V++ ++L QSKY +++KKF MENAS TP T + E
Sbjct: 3868 SLVGELTYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAG 4047
Query: 1121 KLKDPTVFRQAIGGLQYLTHTRPDIAFSVNKLSQYMSSPTTDHWQGIKRILRYLQGTINY 1180
D +++R IG L YLT +RPDI ++V ++Y ++P H +KRIL+Y+ GT +Y
Sbjct: 4048 TSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDY 4227
Query: 1181 CLHIKPSTDLDITGFSDADWATSIDDRKSMAGQCVFLGETLISWSSRKQKVVSRSSTESE 1240
+ +D + G+ DADWA S DDRKS +G C +LG LISW S+KQ VS S+ E+E
Sbjct: 4228 GIMYCHCSDSMLVGYCDADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAE 4407
Query: 1241 YRALVDLAAEIAWIHSLLFELKLPLPRKPILWCDNLSAKALASNPVLHARSKHIEIDVHY 1300
Y A +++ W+ +L E + L+CDN+SA ++ NPV H+R+KHI+I HY
Sbjct: 4408 YIAAGSSCSQLVWMKQMLKEYNVE-QDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHY 4584
Query: 1301 IRDQVLQNKVVVAYVPTTDQIADCLTKPLSHTRFSQLRDKLGV 1343
IRD V + + +V T +QIAD TK L +F +LR KLG+
Sbjct: 4585 IRDLVDDKVITLEHVDTEEQIADIFTKALDANQFEKLRGKLGI 4713
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 162 bits (410), Expect = 9e-40
Identities = 92/222 (41%), Positives = 133/222 (59%), Gaps = 4/222 (1%)
Frame = +1
Query: 1124 DPTVFRQAIGGLQYLTHTRPDIAFSVNKLSQYMSSPTTDHWQGIKRILRYLQGTINYCLH 1183
D T FR+ IG L+YL ++RP+I F+V+ +S++M P H Q KR+LR ++GTI +
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVL 189
Query: 1184 IK---PSTDLDITGFSDADWATSIDDRKSMAGQCVFLGETLISWSSRKQKVVSRSSTESE 1240
S D+ G++D+DW + KS G + ++ SS+KQ V++ S+ E+E
Sbjct: 190 FPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAE 369
Query: 1241 YRALVDLAAEIAWIHSLLFELKLPLPRKPI-LWCDNLSAKALASNPVLHARSKHIEIDVH 1299
Y A A + W+ +LL ELKL RKP+ L DN SA LA +P LH RSKHIE+ H
Sbjct: 370 YVAASLGACQAVWMMNLLEELKLR-ERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFH 546
Query: 1300 YIRDQVLQNKVVVAYVPTTDQIADCLTKPLSHTRFSQLRDKL 1341
YIRDQV + V V Y +Q+AD +TKP+ +RF Q+ +L
Sbjct: 547 YIRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>CO982036
Length = 674
Score = 161 bits (408), Expect = 2e-39
Identities = 93/213 (43%), Positives = 131/213 (60%), Gaps = 6/213 (2%)
Frame = -2
Query: 1025 KDHI--TFLLIYVDDIIVTGSSNNFLQAFIKQLNDVFSLKDLGRLHYFLGIEVQRDASGM 1082
K HI +LL+YVD II+TGSS +Q +LN F LK LG+L YF+ IEV+ +
Sbjct: 670 KTHILTVYLLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPDLL 494
Query: 1083 YLKQSKYIGDLLKKFKMENASPCPTPMITGRHFTVEGEKL-KDPTVFRQAIGGLQYLTHT 1141
+ ++ +K + + A P +PM T + L PT +R +G LQY T
Sbjct: 493 FSLRTSIFEIFCRKPR*Q-AQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVI 317
Query: 1142 RPDIAFSVNKLSQYMSSPTTDHWQGIKRILRYLQGTINYCLHIKP---STDLDITGFSDA 1198
RP+I+F+VNK+ Q+MS+P HW +KRILRYL+G+++Y L +KP S L I GF DA
Sbjct: 316 RPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCDA 137
Query: 1199 DWATSIDDRKSMAGQCVFLGETLISWSSRKQKV 1231
DWA+++DD++S +G VFLG LISW KQ+V
Sbjct: 136 DWASAVDDKRSTSGAAVFLGPNLISWWXXKQQV 38
>BU548243
Length = 599
Score = 154 bits (388), Expect = 3e-37
Identities = 73/151 (48%), Positives = 105/151 (69%)
Frame = -1
Query: 1193 TGFSDADWATSIDDRKSMAGQCVFLGETLISWSSRKQKVVSRSSTESEYRALVDLAAEIA 1252
T DA WA+ +DD +S G +FLG LISW SRKQ+V ++SSTE+EYR++ +AE+
Sbjct: 599 TALCDAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELT 420
Query: 1253 WIHSLLFELKLPLPRKPILWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNKVVV 1312
WI +LL EL++P P++ CDN SA A+A N V H+R+KH+EIDV ++ ++VL ++ +
Sbjct: 419 WIQALLMELQIPF-TPPVILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQI 243
Query: 1313 AYVPTTDQIADCLTKPLSHTRFSQLRDKLGV 1343
++P DQ A LTKPLS RF+ L+ KL V
Sbjct: 242 FHIPALDQWAGILTKPLSSARFTFLKSKLTV 150
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 154 bits (388), Expect = 3e-37
Identities = 70/151 (46%), Positives = 101/151 (66%)
Frame = -3
Query: 856 AMQKEFQALMSNKTWILVPYQDQENIVDSKWVFKTKYKSDGSIERRKARLVAKGFQQTAG 915
AMQ+E N W LV + ++ +KWVF+ K G I R KARLVAKG+ Q G
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 916 IDYEETFSPVVKVSTVRVILSIAVHLNWEVRQLDINNAFLNGYLKETVFMHQPEGFVDPT 975
IDYEET++PV ++ +R++L+ +N+++ Q+D+ +AFLNG ++E V++ QP GF P
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 976 KPNHICKLSKAIYGLKQAPRAWFDSLKTALL 1006
KP H+ KL KA+YGLKQAPRAW++ + LL
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFLL 6
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 152 bits (385), Expect = 7e-37
Identities = 97/279 (34%), Positives = 143/279 (50%), Gaps = 11/279 (3%)
Frame = +2
Query: 346 DKLEIVATGS---SKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVEFDKNCCFVKEK 402
D +VATG S SL+L+ V+++ N+ S+S+L N V FD N ++E
Sbjct: 32 DGSRVVATGIGHVSPTSSLSLNSVVFILGCPFNITSLSQLTRFRNCSVTFDANSFVIQEC 211
Query: 403 LTGKVILKGLLKNGLYQLSDTKGNPYAFVSVKESWHRRLGHPNNKVLDKVLKSCNVKVPP 462
TG I G+ +GLY L + V+ + H RLGHP+ L + VP
Sbjct: 212 GTGWTIGVGIESHGLYYLKPNLSWVCSAVTSPKLLHERLGHPH-------LSKLKIMVPS 370
Query: 463 SDNFS--FCEACQYGKMHLLPFKSSSSHAQEPLELVHTDVWGPAPIMSSSGFKYYVHFID 520
+ FCE+CQ GK + S P ++H D+WGP + SS ++Y+V FID
Sbjct: 371 LEKIKDLFCESCQLGKHVRSSXRHVESRVDSPFLVIHXDIWGPNRV-SSMSYRYFVTFID 547
Query: 521 DFSRFTWIYPLKQKSETVQAFTTEN----QFNKRIKVIQCDGGGEY--KPVQKLAIDVGI 574
+FS+ T ++ +K++SE + T+ N QF K IK+++ D EY + GI
Sbjct: 548 EFSQCTRVFLMKERSEILSFLTSVNKIKTQFGKTIKILRSDNAKEYFSSVISPFXSAQGI 727
Query: 575 QFRMSCPYTFQQNGRAERKHRHIAEFGLTLLAQAQMPLH 613
+ SCP+T QQN AERK+RH+ E TLL A P+H
Sbjct: 728 LHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANEPIH 844
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 142 bits (357), Expect = 1e-33
Identities = 68/151 (45%), Positives = 99/151 (65%)
Frame = +2
Query: 1192 ITGFSDADWATSIDDRKSMAGQCVFLGETLISWSSRKQKVVSRSSTESEYRALVDLAAEI 1251
++G+ DADWA DR+S +G CVF+G L+SW S+KQ VV+RSS E+EYR++ + E+
Sbjct: 17 LSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCEL 196
Query: 1252 AWIHSLLFELKLPLPRKPILWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNKVV 1311
WI L EL+ + L+CDN +A +ASNPV H R+KHIEID H+IR+++L ++V
Sbjct: 197 MWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEIV 376
Query: 1312 VAYVPTTDQIADCLTKPLSHTRFSQLRDKLG 1342
++ + DQ D LTK L + + KLG
Sbjct: 377 TEFIGSNDQPVDILTKSLRGPKIQIVCSKLG 469
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 138 bits (347), Expect = 2e-32
Identities = 79/179 (44%), Positives = 110/179 (61%), Gaps = 1/179 (0%)
Frame = +1
Query: 896 GSIERRKARLVAKGFQQTAGIDYEETFSPVVKVSTVRVILSIAVHLNWEVRQLDINNAFL 955
G+I++ KARLVAK + Q G DY TFSPV K++ V ++ S+AV +W + LD NAFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 956 NGYLKETVFMHQPEGFV-DPTKPNHICKLSKAIYGLKQAPRAWFDSLKTALLNWGFQNTK 1014
+GYL+E V+M QP GFV N +C+L ++ YGLKQ+PRAW L W + + +
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAW-PFLYCGAAIW-YDSHE 381
Query: 1015 SDPSLFLLKGKDHITFLLIYVDDIIVTGSSNNFLQAFIKQLNDVFSLKDLGRLHYFLGI 1073
+D S+F +L++YVDDI +TGS + + L F KDLG+L YFLGI
Sbjct: 382 ADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 115 bits (287), Expect(2) = 9e-32
Identities = 67/186 (36%), Positives = 102/186 (54%), Gaps = 5/186 (2%)
Frame = +3
Query: 1029 TFLLI--YVDDIIVTGSSNNFLQAFIKQLNDVFSLKDLGRLHYFLGIEVQRDASGMYLKQ 1086
TFL+I YVDDII +S + F + + D F G L + LG+++ + G+++ Q
Sbjct: 483 TFLIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQ 662
Query: 1087 SKYIGDLLKKFKMENASPCPTPMITGRHFTVEGEKLKDPTVFRQAIG---GLQYLTHTRP 1143
KY LK+F+M+ A P TPM R ++ ++ + T ++ G L YLT +RP
Sbjct: 663 EKYTKSHLKRFRMDEAKPMATPM--HRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRP 836
Query: 1144 DIAFSVNKLSQYMSSPTTDHWQGIKRILRYLQGTINYCLHIKPSTDLDITGFSDADWATS 1203
DI F V +++ S P H +KRILRYL GT N+CL K ++ D+ G+ D +A
Sbjct: 837 DIVFVVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGD 1016
Query: 1204 IDDRKS 1209
+RKS
Sbjct: 1017 KVERKS 1034
Score = 42.0 bits (97), Expect(2) = 9e-32
Identities = 19/49 (38%), Positives = 31/49 (62%)
Frame = +2
Query: 982 KLSKAIYGLKQAPRAWFDSLKTALLNWGFQNTKSDPSLFLLKGKDHITF 1030
K +YGLKQA RAW++ L + L++ GF +DP+LF K ++++
Sbjct: 347 KTLSCVYGLKQALRAWYERLSSFLVSNGFTRGITDPALFRKAQKGNLSY 493
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 133 bits (334), Expect = 6e-31
Identities = 63/126 (50%), Positives = 87/126 (69%)
Frame = -2
Query: 873 VPYQDQENIVDSKWVFKTKYKSDGSIERRKARLVAKGFQQTAGIDYEETFSPVVKVSTVR 932
VP + V +WV+ K G ++R KARLVAKG+ Q GIDY +TFSPV K++TVR
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 933 VILSIAVHLNWEVRQLDINNAFLNGYLKETVFMHQPEGFVDPTKPNHICKLSKAIYGLKQ 992
+ L++A +W + QLDI NAFL+G L+E ++M QP GFV + +CKL +++YGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 993 APRAWF 998
+PRAWF
Sbjct: 46 SPRAWF 29
>TC221132 weakly similar to UP|O23529 (O23529) RETROTRANSPOSON like protein,
partial (5%)
Length = 799
Score = 84.0 bits (206), Expect(3) = 2e-30
Identities = 39/103 (37%), Positives = 61/103 (58%)
Frame = +1
Query: 1148 SVNKLSQYMSSPTTDHWQGIKRILRYLQGTINYCLHIKPSTDLDITGFSDADWATSIDDR 1207
S++ SQ+M PT Q KR+LRYL+GTI++ L ++ S D + F DA+W + D
Sbjct: 118 SLSTSSQFMKDPTKIRMQATKRVLRYLKGTIDFGLQLRSSPDQHLRAFYDANWVDNTSDI 297
Query: 1208 KSMAGQCVFLGETLISWSSRKQKVVSRSSTESEYRALVDLAAE 1250
+S V+ G ++ISWS +KQ ++ +SST+ EY + E
Sbjct: 298 RSTGAYVVYFGLSVISWSCKKQSIIDKSSTKVEYHKITTTIIE 426
Score = 53.1 bits (126), Expect(3) = 2e-30
Identities = 26/81 (32%), Positives = 45/81 (55%)
Frame = +2
Query: 1269 PILWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNKVVVAYVPTTDQIADCLTKP 1328
P ++ N+ A L +NPV H KH+ ID +++D V ++ V++VP+ D TK
Sbjct: 449 PTMYSYNIGAMYLCANPVFHLCMKHLTIDHLFVQDLVANKQLRVSHVPSCH*HVDLFTKA 628
Query: 1329 LSHTRFSQLRDKLGVIHSPPV 1349
L +R + DK+GV+ + +
Sbjct: 629 LVSSRHKFMMDKIGVVSTTTI 691
Score = 35.8 bits (81), Expect(3) = 2e-30
Identities = 17/30 (56%), Positives = 21/30 (69%)
Frame = +3
Query: 1124 DPTVFRQAIGGLQYLTHTRPDIAFSVNKLS 1153
D V+ Q + LQYL+ T PDIAF +NKLS
Sbjct: 48 DGIVYCQLVDSLQYLSLTCPDIAFPINKLS 137
>BU549979
Length = 615
Score = 129 bits (323), Expect = 1e-29
Identities = 67/185 (36%), Positives = 109/185 (58%), Gaps = 2/185 (1%)
Frame = -1
Query: 1152 LSQYMSSPTTDHWQGIKRILRYLQGTINYCLHIKPSTDLDITGFSDADWATSIDDRKSMA 1211
L +Y S+P DHW+ K+++RYLQGT +Y L K + L++ G+SD+D+A +D R+S +
Sbjct: 612 LGRYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTS 433
Query: 1212 GQCVFLGETLISWSSRKQKVVSRSSTESEYRALVDLAAEIAWIHSLLFELKL--PLPRKP 1269
G L + ++SW S KQ +++ S+ E E+ + + W+ S + L++ + R
Sbjct: 432 GYIFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPL 253
Query: 1270 ILWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNKVVVAYVPTTDQIADCLTKPL 1329
L+CDN +A +A N RSKHI+I IR++V + KVV+ +V T I D LTK +
Sbjct: 252 KLYCDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGM 73
Query: 1330 SHTRF 1334
+ F
Sbjct: 72 TPKNF 58
>BE211208
Length = 413
Score = 127 bits (319), Expect = 3e-29
Identities = 65/131 (49%), Positives = 88/131 (66%), Gaps = 2/131 (1%)
Frame = +2
Query: 1027 HITFLLIYVDDIIVTGSSNNFLQAFIKQLNDVFSLKDLGRLHYFLGIEVQRDASG-MYLK 1085
++ +LL+YVDDII+TG SN +Q+ + LN FSLK LG+L YFLGIEV +G + L
Sbjct: 20 NLVYLLVYVDDIIITGRSNYLIQSLVHHLNSNFSLKQLGQLDYFLGIEVHHTPTGSVLLT 199
Query: 1086 QSKYIGDLLKKFKMENASPCPTPMITGRHFTVEGEK-LKDPTVFRQAIGGLQYLTHTRPD 1144
QSKYI DLL K M A P +PM+T + G+ L DPT++R +G LQY T TRP+
Sbjct: 200 QSKYICDLLHKTDMAEAKPISSPMVTNLRLSKNGDDLLSDPTMYRSVVGALQYPTITRPE 379
Query: 1145 IAFSVNKLSQY 1155
I+F+ NK+ Q+
Sbjct: 380 ISFAANKVCQF 412
>TC232995
Length = 1009
Score = 127 bits (318), Expect = 4e-29
Identities = 69/184 (37%), Positives = 106/184 (57%), Gaps = 5/184 (2%)
Frame = +2
Query: 965 MHQPEGFVDPTKPNHICKLSKAIYGLKQAPRAWFDSLKTALLNWGFQNTKSDPSLFLLKG 1024
+ QP GF KPNH+ KL KA+YGLKQAPRAW++ L LL F K D +LF+ +
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRK 181
Query: 1025 KDHITFLLIYVDDIIVTGSSNNFLQAFIKQLNDVFSLKDLGRLHYFLGIEVQRDASGMYL 1084
+ I + IYVDDII ++++ + F + F + +G L YFLG+++++ G+++
Sbjct: 182 HNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFI 361
Query: 1085 KQSKYIGDLLKKFKMENASPCPTPMITGRHFTV-EGEKLKDPTVFRQAIGGL----QYLT 1139
QSKY +L+K+F M++A TPM T + E + D +R AIG + Q+
Sbjct: 362 NQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEVVEIGQWNL 541
Query: 1140 HTRP 1143
H +P
Sbjct: 542 HGKP 553
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 124 bits (312), Expect = 2e-28
Identities = 60/128 (46%), Positives = 86/128 (66%)
Frame = +3
Query: 841 PANVKEALTRTLWKEAMQKEFQALMSNKTWILVPYQDQENIVDSKWVFKTKYKSDGSIER 900
P+ ++EAL W++AM E QAL +N TW LVP + V +WV+ K +G ++R
Sbjct: 21 PSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGKVDR 200
Query: 901 RKARLVAKGFQQTAGIDYEETFSPVVKVSTVRVILSIAVHLNWEVRQLDINNAFLNGYLK 960
KARLVAKG+ Q GI+Y +TFSPV ++TVR+ L++A +W + QLDI NAFL+G L+
Sbjct: 201 LKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHGDLE 380
Query: 961 ETVFMHQP 968
E ++M QP
Sbjct: 381 EDIYMEQP 404
>CO983516
Length = 724
Score = 122 bits (307), Expect = 8e-28
Identities = 52/121 (42%), Positives = 86/121 (70%)
Frame = +2
Query: 919 EETFSPVVKVSTVRVILSIAVHLNWEVRQLDINNAFLNGYLKETVFMHQPEGFVDPTKPN 978
++ F PV ++ ++R++L +A L +++ Q+D+ +AFLNGYL E V++ QP+GF+DPT P+
Sbjct: 356 DKEFHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHPD 535
Query: 979 HICKLSKAIYGLKQAPRAWFDSLKTALLNWGFQNTKSDPSLFLLKGKDHITFLLIYVDDI 1038
H+ +L KA+YGLKQAPRAW++ L L G++ D +LF+ + +++ IYVDDI
Sbjct: 536 HVYRLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDI 715
Query: 1039 I 1039
+
Sbjct: 716 V 718
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 112 bits (279), Expect(2) = 7e-27
Identities = 55/109 (50%), Positives = 72/109 (65%)
Frame = +2
Query: 1197 DADWATSIDDRKSMAGQCVFLGETLISWSSRKQKVVSRSSTESEYRALVDLAAEIAWIHS 1256
DA+WA S DR S G CV +GE L+ W S K VV+RSS E+EY+A+ E+ WI
Sbjct: 8 DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187
Query: 1257 LLFELKLPLPRKPILWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQV 1305
LL ELK ++ L CDN +A +ASNPV H R+KHIEID H++R++V
Sbjct: 188 LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
Score = 28.5 bits (62), Expect(2) = 7e-27
Identities = 12/34 (35%), Positives = 20/34 (58%)
Frame = +3
Query: 1309 KVVVAYVPTTDQIADCLTKPLSHTRFSQLRDKLG 1342
+V++ +V + DQ+A+ TK L R + KLG
Sbjct: 345 EVIIEFVSSNDQLANIFTKSLRGPRIQNICSKLG 446
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 119 bits (297), Expect = 1e-26
Identities = 76/185 (41%), Positives = 104/185 (56%), Gaps = 5/185 (2%)
Frame = +3
Query: 1169 RILRYLQGTINYCLHIKPSTDLDITGFSDADWATSIDDRKSMAGQCVFLGETLISWSSRK 1228
R+L+YL+G L + + I GFSDADWAT ID KS+ C FLG +LISW ++K
Sbjct: 27 RVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLISWKAKK 206
Query: 1229 QKVVSRSSTESE--YRALVDLAAEIAWIHSLLFELKLPLPRKPILWCDNLSAKALASNPV 1286
Q VSRSS+ SE YRAL E+ W+ LL +L + L ++CDN S AL P+
Sbjct: 207 QNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLHVTL-----IYCDNQS--ALQ*LPI 365
Query: 1287 LHARSKHIEIDVHYIRDQVLQNKV-VVAYVPTTDQIADCLTKPLSHTRFSQLRDKLGV-- 1343
+EID H +R++ Q + + V +++Q+AD TK LS FS KLG+
Sbjct: 366 KVIYHGQLEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSNLSKLGLSD 545
Query: 1344 IHSPP 1348
I PP
Sbjct: 546 IFLPP 560
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 118 bits (296), Expect = 2e-26
Identities = 61/139 (43%), Positives = 83/139 (58%)
Frame = +3
Query: 1205 DDRKSMAGQCVFLGETLISWSSRKQKVVSRSSTESEYRALVDLAAEIAWIHSLLFELKLP 1264
DDRKS G F+G+T +W S+KQ +V+ S+ E+EY A W+ +LL ELK+P
Sbjct: 9 DDRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKMP 188
Query: 1265 LPRKPILWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNKVVVAYVPTTDQIADC 1324
+ DN SA ALA NPV H +SKHI+ H+IR+ + + +V + YV + DQ AD
Sbjct: 189 QEEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAADI 368
Query: 1325 LTKPLSHTRFSQLRDKLGV 1343
TKPL F +LR LGV
Sbjct: 369 FTKPLKLETFVKLRSMLGV 425
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.318 0.133 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 63,901,162
Number of Sequences: 63676
Number of extensions: 976771
Number of successful extensions: 5517
Number of sequences better than 10.0: 178
Number of HSP's better than 10.0 without gapping: 5352
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5448
length of query: 1349
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1240
effective length of database: 5,698,948
effective search space: 7066695520
effective search space used: 7066695520
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (29.6 bits)
Medicago: description of AC146790.9