
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146720.2 - phase: 0
(1333 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 548 e-156
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 546 e-155
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 151 2e-36
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 148 2e-35
CO982036 143 4e-34
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 143 6e-34
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 142 1e-33
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 130 4e-30
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 128 2e-29
BU548243 124 2e-28
CO983516 122 8e-28
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 122 8e-28
CO981879 78 1e-27
TC221132 weakly similar to UP|O23529 (O23529) RETROTRANSPOSON li... 78 7e-27
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 118 2e-26
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 116 7e-26
BU549979 116 7e-26
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 112 1e-24
BM307983 112 1e-24
CO981347 69 1e-23
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 548 bits (1412), Expect = e-156
Identities = 358/1089 (32%), Positives = 546/1089 (49%), Gaps = 45/1089 (4%)
Frame = +1
Query: 279 IQAAMHSLSLAPPDEQWYMDTGATSHMTANGGNLTSYSNISNN-ITVGSGHNIPVIGCGN 337
+ +H+ A E WY+D+G + HMT L + S + +T G G +IG G
Sbjct: 1636 VSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGK 1815
Query: 338 ALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSVKDFQTGMPLMRC 397
+ +LN VL L NL+S+ + D +V F V + ++ + LM+
Sbjct: 1816 LVHDGLP---SLNKVLLVKGLTANLISISQLC-DEGFNVNFTKSECLVTNEKSEV-LMKG 1980
Query: 398 NSSGDLYPLATRPYYPSTTPSTFAALSNE-----IWHNRLGHPGVSILNSLHRNNSI--L 450
+ S D L T P T + LS++ IWH R GH + + + ++ +
Sbjct: 1981 SRSKDNCYLWT----PQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGI 2148
Query: 451 CN-KFRNNFFCQSCQLGKQIKLPFYE-SLSSTLLPFDIVHSDIW-TSPILSSGGHHYYIL 507
N K C CQ+GKQ+K+ + +T +++H D+ + S GG Y +
Sbjct: 2149 PNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYV 2328
Query: 508 FLDDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSHFHQFCK 567
+DDF+ F W + KS+ +F + ++ + + IK + D+G+E++NS F +FC
Sbjct: 2329 VVDDFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCT 2508
Query: 568 QNGMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATYLLNILP 627
G+ FS T QNG VERK RT+ R +L LP + W A+ A Y+ N +
Sbjct: 2509 SEGITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVT 2688
Query: 628 TKKLALQTPTTILYQ----KSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPS 683
++ TPTT LY+ + PS H +FG C+ L R K+ +S +FLGY +
Sbjct: 2689 LRR---GTPTT-LYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYST 2856
Query: 684 NHRGYKCFELSSRKIIISRHVIFDENTFPFSNSNIPESSCYNFLDTSDTPFPYHLFQHTS 743
N R Y+ F +R ++ S +V+ D+ + P ++ E
Sbjct: 2857 NSRAYRVFNSRTRTVMESINVVVDDLS-PARKKDVEE----------------------- 2964
Query: 744 NLPTNEPNSHDQPPTTTATPLTTPYSINTTTHQPTVAPSPQIQTSPQLTITPNTASPHQI 803
++ T+ N D + + + + +QP S +IQ ++
Sbjct: 2965 DVRTSGDNVADAAKSGENAENSDSATDESNINQPDKRSSTRIQ---------------KM 3099
Query: 804 PPPNPLLANPPLSPQMTTRAQHGIFKPRQLLNLHTSSSNQISPL-PTNPINALQDHNWKM 862
P ++ +P + +TTR++ S+S +S + P N AL D W
Sbjct: 3100 HPKELIIGDP--NRGVTTRSREVEI---------VSNSCFVSKIEPKNVKEALTDEFWIN 3246
Query: 863 AMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQQTG 922
AM++E + N+ W+LVPRP N+I + WIF++K +G R KARLV G Q G
Sbjct: 3247 AMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEG 3426
Query: 923 VDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDPQ 982
VD ETF+PV + +IR +L +A + L+Q+DVK+AFL+G LNE VY+ QP GF DP
Sbjct: 3427 VDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPT 3606
Query: 983 HPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGDDTAYILLYV 1042
HPD+V LKK+LYGLKQAPRAWY+R T+++ G+ D +LF+ ++ +YV
Sbjct: 3607 HPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYV 3786
Query: 1043 DDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVTRHSDD-------------- 1088
DDI+ S+ + + + ++ SEF M +G L+YFLG+ V + D
Sbjct: 3787 DDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIVK 3966
Query: 1089 ---------------TKAKLSGTSGNPYHDPSEYRSLAGALQYLTFTRPDISYAVQQVCL 1133
T KLS D S YRS+ G+L YLT +RPDI+YAV
Sbjct: 3967 KFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCAR 4146
Query: 1134 FMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTSGYC 1193
+ +PK H+T +KRI++Y+ GTS +G+ + L Y DADW G D RKSTSG C
Sbjct: 4147 YQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGGC 4326
Query: 1194 VYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCPVTKATLVYCD 1253
YLG+NL+SW +K+Q +S S+AEAEY + S+ W++ +L E TL YCD
Sbjct: 4327 FYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTL-YCD 4503
Query: 1254 NVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLPLQLF 1313
N+SA+ +S NP+QH RTKHI++ H++R+ V + + HV + QIADIFTK L F
Sbjct: 4504 NMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDANQF 4683
Query: 1314 DDFRDSLHI 1322
+ R L I
Sbjct: 4684 EKLRGKLGI 4710
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 546 bits (1407), Expect = e-155
Identities = 352/1083 (32%), Positives = 539/1083 (49%), Gaps = 39/1083 (3%)
Frame = +1
Query: 279 IQAAMHSLSLAPPDEQWYMDTGATSHMTANGGNLTSYSNISNN-ITVGSGHNIPVIGCGN 337
+ +H+ A E WY+D+G + HMT L + S + +T G G + G G
Sbjct: 1639 VSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGK 1818
Query: 338 ALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSVKDFQTGMPLMRC 397
+ +LN VL L NL+S+ + D +V F V + ++ + +
Sbjct: 1819 LVHDGLP---SLNKVLLVKGLTANLISISQLC-DEGFNVNFTKSECLVTNEKSEVLMKGS 1986
Query: 398 NSSGDLYPLATRPYYPSTTPSTFAALSNEIWHNRLGHPGVSILNSLHRNNSI--LCN-KF 454
S + Y + S+T +IWH R GH + + + ++ + N K
Sbjct: 1987 RSKDNCYLWTPQETSYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKI 2166
Query: 455 RNNFFCQSCQLGKQIKLPFYE-SLSSTLLPFDIVHSDIW-TSPILSSGGHHYYILFLDDF 512
C CQ+GKQ+K+ + +T +++H D+ + S GG Y + +DDF
Sbjct: 2167 EEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDF 2346
Query: 513 TDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQNGMI 572
+ F W + KS +F + ++ + + IK + D+G+E++NS F +FC G+
Sbjct: 2347 SRFTWVNFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGIT 2526
Query: 573 FRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATYLLNILPTKKLA 632
FS T QNG VERK RT+ R +L LP + W A+ A Y+ N + ++
Sbjct: 2527 HEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRR-- 2700
Query: 633 LQTPTTILYQ----KSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPSNHRGY 688
TPTT LY+ + P+ H +FG C+ L R K+ +S +FLGY +N R Y
Sbjct: 2701 -GTPTT-LYEIWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAY 2874
Query: 689 KCFELSSRKIIISRHVIFDENTFPFSNSNIPESSCYNFLDTSDTPFPYHLFQHTSNLPTN 748
+ F +R ++ S +V+ D+ T P ++ E + + +DT ++ S+ T+
Sbjct: 2875 RVFNSRTRTVMESINVVVDDLT-PARKKDVEEDVRTSGDNVADTAKSAENAEN-SDSATD 3048
Query: 749 EPNSHDQPPTTTATPLTTPYSINTTTHQPTVAPSPQIQTSPQLTITPNTASPHQIPPPNP 808
EPN +QP PS +IQ ++ P
Sbjct: 3049 EPN----------------------INQPDKRPSIRIQ---------------KMHPKEL 3117
Query: 809 LLANPPLSPQMTTRAQHGIFKPRQLLNLHTSSSNQISPLPTNPINALQDHNWKMAMKDEY 868
++ +P + +TTR++ + + ++S P N AL D W AM++E
Sbjct: 3118 IIGDP--NRGVTTRSRE--------IEIVSNSCFVSKIEPKNVKEALTDEFWINAMQEEL 3267
Query: 869 DALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQQTGVDCGET 928
+ N+ W+LVPRP N+I + WIF++K +G R KARLV G Q GVD ET
Sbjct: 3268 EQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDET 3447
Query: 929 FSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDPQHPDYVC 988
F+PV + +IR +L +A + L+Q+DVK+AFL+G LNE Y+ QP GF DP HPD+V
Sbjct: 3448 FAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVY 3627
Query: 989 LLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGDDTAYILLYVDDIILT 1048
LKK+LYGLKQAPRAWY+R T+++ G+ D +LF+ ++ +YVDDI+
Sbjct: 3628 RLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFG 3807
Query: 1049 ASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVTRHSDD-------------------- 1088
S+ + + + ++ SEF M +G L+YFLG+ V + D
Sbjct: 3808 GMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGMEN 3987
Query: 1089 ---------TKAKLSGTSGNPYHDPSEYRSLAGALQYLTFTRPDISYAVQQVCLFMHDPK 1139
T KLS D S YRS+ G+L YLT +RPDI+YAV + +PK
Sbjct: 3988 ASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPK 4167
Query: 1140 TQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTSGYCVYLGDN 1199
H+ +KRI++Y+ GTS +G+ + L Y DADW G D RKSTSG C YLG N
Sbjct: 4168 ISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADWAGSADDRKSTSGGCFYLGTN 4347
Query: 1200 LVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCPVTKATLVYCDNVSAVY 1259
L+SW +K+Q +S S+AEAEY + S+ W++ +L E TL YCDN+SA+
Sbjct: 4348 LISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTL-YCDNMSAIN 4524
Query: 1260 LSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLPLQLFDDFRDS 1319
+S NP+QH RTKHI++ H++R+ V + + HV + QIADIFTK L F+ R
Sbjct: 4525 ISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQIADIFTKALDANQFEKLRGK 4704
Query: 1320 LHI 1322
L I
Sbjct: 4705 LGI 4713
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 151 bits (381), Expect = 2e-36
Identities = 83/215 (38%), Positives = 127/215 (58%), Gaps = 4/215 (1%)
Frame = +1
Query: 1103 DPSEYRSLAGALQYLTFTRPDISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLH 1162
D +E+R L G+L+YL +RP+I +AV + FM P+ HM A KR++R IKGT G+
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGV- 186
Query: 1163 LYP----STVDKLTTYTDADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEA 1218
L+P S L YTD+DW P+ KST GY D V+ S+K+Q ++ S+ EA
Sbjct: 187 LFPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEA 366
Query: 1219 EYRGVANVVSESCWLRNLLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIH 1278
EY + ++ W+ NLL EL+ K + DN SA+ L+ +P H R+KHIE+ H
Sbjct: 367 EYVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFH 546
Query: 1279 FVREKVARGQVRVMHVPSRYQIADIFTKGLPLQLF 1313
++R++V++G V V + + Q+AD+ TK + + F
Sbjct: 547 YIRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRF 651
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 148 bits (373), Expect = 2e-35
Identities = 68/139 (48%), Positives = 95/139 (67%)
Frame = +2
Query: 1170 KLTTYTDADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSE 1229
+L+ Y DADW GCP R+STSGYCV++G NLVSW +K+Q ++RSSAEAEYR +A V E
Sbjct: 14 QLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCE 193
Query: 1230 SCWLRNLLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQV 1289
W++ L EL+ +YCDN +A++++ NP+ H+RTKHIE+D HF+REK+ ++
Sbjct: 194 LMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEI 373
Query: 1290 RVMHVPSRYQIADIFTKGL 1308
+ S Q DI TK L
Sbjct: 374 VTEFIGSNDQPVDILTKSL 430
>CO982036
Length = 674
Score = 143 bits (361), Expect = 4e-34
Identities = 84/205 (40%), Positives = 115/205 (55%), Gaps = 31/205 (15%)
Frame = -2
Query: 1035 TAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVTRHSD------- 1087
T Y+L+YVD II+T SS TL Q++ SKLNS F +K LG L YF+ I V D
Sbjct: 655 TVYLLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPDLLFSLRT 479
Query: 1088 ---------------------DTKAKLSGTSGNPYHDPSEYRSLAGALQYLTFTRPDISY 1126
T KLS + + + P+ YRS+ GALQY T RP+IS+
Sbjct: 478 SIFEIFCRKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVIRPEISF 299
Query: 1127 AVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDK---LTTYTDADWGGCP 1183
AV +VC FM +P H T +KRI+RY+KG+ ++GL L P+ + + + DADW
Sbjct: 298 AVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCDADWASAV 119
Query: 1184 DTRKSTSGYCVYLGDNLVSWSAKRQ 1208
D ++STSG V+LG NL+SW +Q
Sbjct: 118 DDKRSTSGAAVFLGPNLISWWXXKQ 44
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 143 bits (360), Expect = 6e-34
Identities = 96/287 (33%), Positives = 142/287 (49%)
Frame = +2
Query: 322 ITVGSGHNIPVIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPF 381
IT+ G + G G+ +P L+LN+V+ N+ S+ + T SV FD
Sbjct: 20 ITLADGSRVVATGIGHV---SPTSSLSLNSVVFILGCPFNITSLSQLTRFRNCSVTFDAN 190
Query: 382 GFSVKDFQTGMPLMRCNSSGDLYPLATRPYYPSTTPSTFAALSNEIWHNRLGHPGVSILN 441
F +++ TG + S LY L P+ + A S ++ H RLGHP +S L
Sbjct: 191 SFVIQECGTGWTIGVGIESHGLYYLK-----PNLSWVCSAVTSPKLLHERLGHPHLSKLK 355
Query: 442 SLHRNNSILCNKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIWTSPILSSGG 501
+ + + + FC+SCQLGK ++ S PF ++H DIW +SS
Sbjct: 356 IMVPSLEKI-----KDLFCESCQLGKHVRSSXRHVESRVDSPFLVIHXDIWGPNRVSSMS 520
Query: 502 HHYYILFLDDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSH 561
+ Y++ F+D+F+ F + +S+ S FL N IKTQF + IK + DN KEY +S
Sbjct: 521 YRYFVTFIDEFSQCTRVFLMKERSEILS-FLTSVNKIKTQFGKTIKILRSDNAKEYFSSV 697
Query: 562 FHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLP 608
F G++ +FSCPHT QN ERK R + RTLL HA+ P
Sbjct: 698 ISPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANEP 838
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 142 bits (357), Expect = 1e-33
Identities = 68/151 (45%), Positives = 100/151 (66%)
Frame = -3
Query: 862 MAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQQT 921
+AM++E + N W LV +P N +I + W+FR+K G R KARLV G NQ+
Sbjct: 461 IAMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEE 282
Query: 922 GVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDP 981
G+D ET++PV + IR +L+ ++ L+Q+DVK+AFL+G + E VY+ QPPGF P
Sbjct: 281 GIDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIP 102
Query: 982 QHPDYVCLLKKSLYGLKQAPRAWYQRFTDYV 1012
P +V L+K+LYGLKQAPRAWY+R ++++
Sbjct: 101 DKPTHVYKLQKALYGLKQAPRAWYERISNFL 9
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 130 bits (327), Expect = 4e-30
Identities = 79/179 (44%), Positives = 100/179 (55%), Gaps = 1/179 (0%)
Frame = +1
Query: 903 GSFERYKARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFL 962
G+ +++KARLV Q G D TFSPV K A + + S+A+ W L LD KNAFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 963 HGNLNETVYMYQPPGF-RDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSV 1021
HG L E VYM QP GF + + VC L +S YGLKQ+PRAW F A + +
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAW--PFLYCGAAIWYDSHE 381
Query: 1022 CDHSLFIYHSGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGI 1080
DHS+F HS Y+++YVDDI +T S + L +F KDLG L YFLGI
Sbjct: 382 ADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 128 bits (321), Expect = 2e-29
Identities = 64/133 (48%), Positives = 81/133 (60%)
Frame = -2
Query: 880 VPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQQTGVDCGETFSPVVKPATIR 939
VP P + W++ K G +R KARLV G Q G+D +TFSPV K T+R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 940 TVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDPQHPDYVCLLKKSLYGLKQ 999
L++A W LHQLD+KNAFLHG+L E +YM QPPGF VC L +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 1000 APRAWYQRFTDYV 1012
+PRAW+ +F+ V
Sbjct: 46 SPRAWFGKFSHVV 8
>BU548243
Length = 599
Score = 124 bits (312), Expect = 2e-28
Identities = 62/152 (40%), Positives = 92/152 (59%)
Frame = -1
Query: 1172 TTYTDADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESC 1231
T DA W D +ST G ++LG NL+SW +++Q ++SS EAEYR +A +E
Sbjct: 599 TALCDAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELT 420
Query: 1232 WLRNLLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRV 1291
W++ LL+ELQ P T ++ CDN SAV ++ N + H RTKH+E+D+ FV EKV Q+++
Sbjct: 419 WIQALLMELQIPFT-PPVILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQI 243
Query: 1292 MHVPSRYQIADIFTKGLPLQLFDDFRDSLHIR 1323
H+P+ Q A I TK L F + L ++
Sbjct: 242 FHIPALDQWAGILTKPLSSARFTFLKSKLTVK 147
>CO983516
Length = 724
Score = 122 bits (307), Expect = 8e-28
Identities = 58/122 (47%), Positives = 81/122 (65%)
Frame = +2
Query: 925 CGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDPQHP 984
C + F PV + +IR +L +A + L+Q+DVK+AFL+G LNE VY+ QP GF DP HP
Sbjct: 353 CDKEFHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHP 532
Query: 985 DYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGDDTAYILLYVDD 1044
D+V LKK+LYGLKQAPRAWY+R T+ + G+ D +LF+ ++ +YVDD
Sbjct: 533 DHVYRLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDD 712
Query: 1045 II 1046
I+
Sbjct: 713 IV 718
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 122 bits (307), Expect = 8e-28
Identities = 72/177 (40%), Positives = 105/177 (58%), Gaps = 3/177 (1%)
Frame = +3
Query: 1140 TQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTSGYCVYLGDN 1199
T+ + A R+++Y+KG GL + ++ ++DADW C D+ KS + YC +LG +
Sbjct: 3 TRPLCAATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSS 182
Query: 1200 LVSWSAKRQPTLSR--SSAEAEYRGVANVVSESCWLRNLLLELQCPVTKATLVYCDNVSA 1257
L+SW AK+Q T+SR SS+EA+YR + + E WL LL +L TL+YCDN SA
Sbjct: 183 LISWKAKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLH-----VTLIYCDNQSA 347
Query: 1258 VYLSGNPIQHQRTKHIEMDIHFVREKVARGQVR-VMHVPSRYQIADIFTKGLPLQLF 1313
L PI+ +E+D H VREK +G + ++ V S Q+ADIFTK L +LF
Sbjct: 348 --LQ*LPIKVIYHGQLEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLF 512
>CO981879
Length = 576
Score = 77.8 bits (190), Expect(2) = 1e-27
Identities = 41/96 (42%), Positives = 52/96 (53%)
Frame = -1
Query: 529 SIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQNGMIFRFSCPHTSPQNGKVE 588
SIF F I+TQF+ IK F+ DNG+EY N H + +NG+I + SC T QNG E
Sbjct: 573 SIFKTFFQMIQTQFQVKIKVFRSDNGREYFNKHLSKXXLENGIIHQSSCVDTPQQNGVAE 394
Query: 589 RKIRTINNIIRTLLAHASLPPSFWHHALQMATYLLN 624
RK R + + R LL P W A+ TYL N
Sbjct: 393 RKNRHLXEVARALLFQNKAPKYXWGEAILTGTYLKN 286
Score = 65.1 bits (157), Expect(2) = 1e-27
Identities = 35/93 (37%), Positives = 51/93 (54%), Gaps = 5/93 (5%)
Frame = -2
Query: 626 LPTKKLALQTPTTILYQKSPSYS-----HLKVFGCLCFPLIPSTTRNKLQARSTPCVFLG 680
+P+K L +TP + P+ LK+FGC F I + KL+ R+ CVF+G
Sbjct: 281 MPSKILNFRTPLDVFTSAFPNNRLSCTLPLKIFGCTVFVHIHEPNQGKLEPRAKKCVFVG 102
Query: 681 YPSNHRGYKCFELSSRKIIISRHVIFDENTFPF 713
Y N +GYKCF+ +S+K ++ V F E T PF
Sbjct: 101 YAPNQKGYKCFDPTSKKTFVTIDVTFFEKT-PF 6
>TC221132 weakly similar to UP|O23529 (O23529) RETROTRANSPOSON like protein,
partial (5%)
Length = 799
Score = 77.8 bits (190), Expect(3) = 7e-27
Identities = 38/97 (39%), Positives = 53/97 (54%)
Frame = +1
Query: 1134 FMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTSGYC 1193
FM DP M A KR++RY+KGT GL L S L + DA+W +ST Y
Sbjct: 139 FMKDPTKIRMQATKRVLRYLKGTIDFGLQLRSSPDQHLRAFYDANWVDNTSDIRSTGAYV 318
Query: 1194 VYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSES 1230
VY G +++SWS K+Q + +SS + EY + + ES
Sbjct: 319 VYFGLSVISWSCKKQSIIDKSSTKVEYHKITTTIIES 429
Score = 55.8 bits (133), Expect(3) = 7e-27
Identities = 32/83 (38%), Positives = 46/83 (54%)
Frame = +2
Query: 1250 VYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLP 1309
+Y N+ A+YL NP+ H KH+ +D FV++ VA Q+RV HVPS + D+FTK L
Sbjct: 455 MYSYNIGAMYLCANPVFHLCMKHLTIDHLFVQDLVANKQLRVSHVPSCH*HVDLFTKALV 634
Query: 1310 LQLFDDFRDSLHIRQPPVSTTGV 1332
D + + VSTT +
Sbjct: 635 SSRHKFMMDKIGV----VSTTTI 691
Score = 26.9 bits (58), Expect(3) = 7e-27
Identities = 12/32 (37%), Positives = 19/32 (58%)
Frame = +3
Query: 1100 PYHDPSEYRSLAGALQYLTFTRPDISYAVQQV 1131
P D Y L +LQYL+ T PDI++ + ++
Sbjct: 39 PSCDGIVYCQLVDSLQYLSLTCPDIAFPINKL 134
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 118 bits (296), Expect = 2e-26
Identities = 57/139 (41%), Positives = 89/139 (64%)
Frame = +3
Query: 1184 DTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCP 1243
D RKST+G+ ++GD +W +K+QP ++ S+ EAEY + V + WLRNLL EL+ P
Sbjct: 9 DDRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKMP 188
Query: 1244 VTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADI 1303
+ + DN SA+ L+ NP+ H+++KHI+ HF+RE + + +V++ +V S+ Q ADI
Sbjct: 189 QEEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAADI 368
Query: 1304 FTKGLPLQLFDDFRDSLHI 1322
FTK L L+ F R L +
Sbjct: 369 FTKPLKLETFVKLRSMLGV 425
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 116 bits (290), Expect = 7e-26
Identities = 57/129 (44%), Positives = 77/129 (59%)
Frame = +3
Query: 847 LPTNPINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFE 906
+P+ AL W+ AM DE AL +N TW+LVP P + W++ K +G +
Sbjct: 18 VPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGKVD 197
Query: 907 RYKARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNL 966
R KARLV G Q G++ +TFSPV T+R L++A + W LHQLD+KNAFLHG+L
Sbjct: 198 RLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHGDL 377
Query: 967 NETVYMYQP 975
E +YM QP
Sbjct: 378 EEDIYMEQP 404
>BU549979
Length = 615
Score = 116 bits (290), Expect = 7e-26
Identities = 57/184 (30%), Positives = 102/184 (54%), Gaps = 2/184 (1%)
Frame = -1
Query: 1134 FMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTSGYC 1193
+ +P H K+++RY++GT + L + ++ Y+D+D+ GC D+R+STSGY
Sbjct: 603 YQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTSGYI 424
Query: 1194 VYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQC--PVTKATLVY 1251
L D +VSW + +Q ++ S+ E E+ S WL++ + L+ +++ +Y
Sbjct: 423 FMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPLKLY 244
Query: 1252 CDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLPLQ 1311
CDN +AV+++ N R+KHI++ +RE+V +V + HV + I D TKG+ +
Sbjct: 243 CDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGMTPK 64
Query: 1312 LFDD 1315
F D
Sbjct: 63 NFKD 52
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 112 bits (280), Expect = 1e-24
Identities = 52/109 (47%), Positives = 74/109 (67%)
Frame = +2
Query: 1176 DADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRN 1235
DA+W P R ST GYCV +G+NLV W + + ++RSSAEAEY+ + E W++
Sbjct: 8 DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187
Query: 1236 LLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKV 1284
LL EL+ T+ + CDN +A++++ NP+ H+RTKHIE+D HFVREKV
Sbjct: 188 LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
>BM307983
Length = 406
Score = 112 bits (280), Expect = 1e-24
Identities = 61/129 (47%), Positives = 79/129 (60%), Gaps = 1/129 (0%)
Frame = +2
Query: 893 WIFRHKKKADGSFERYKARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSK-SWC 951
WI+ K AD + +RYKARLV G Q G+D ETF+ K + ++ W
Sbjct: 14 WIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQFGWE 193
Query: 952 LHQLDVKNAFLHGNLNETVYMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDY 1011
+HQ DVKNAFLHG+L E VYM PPG+ + VC LKK+LYGLKQ+PRAW+ RFT
Sbjct: 194 MHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGRFTQA 373
Query: 1012 VATLGFSHS 1020
+ +LG+ S
Sbjct: 374 MLSLGYKQS 400
>CO981347
Length = 624
Score = 68.6 bits (166), Expect(3) = 1e-23
Identities = 36/99 (36%), Positives = 51/99 (51%)
Frame = +2
Query: 538 IKTQFERDIKCFQCDNGKEYDNSHFHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINNI 597
I Q +K + DNG E+ F++FC++ G+ PHT QNG ER TI
Sbjct: 137 IGNQLGTKLKVLRTDNGLEFVLEQFNEFCRKIGIKRHKIVPHTP*QNGLAERMNMTILER 316
Query: 598 IRTLLAHASLPPSFWHHALQMATYLLNILPTKKLALQTP 636
+R +L A LP +FW A +YL+N P+ L +TP
Sbjct: 317 VRCMLLSARLPKTFWGEAANTTSYLINRCPSSTLGFKTP 433
Score = 48.1 bits (113), Expect(3) = 1e-23
Identities = 24/51 (47%), Positives = 32/51 (62%)
Frame = +3
Query: 643 KSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPSNHRGYKCFEL 693
K P+YS LKVFG L F + + KL AR+ CVF+GYP + YK ++L
Sbjct: 453 KPPNYSGLKVFGSLAFDHVK---QGKLDARAVKCVFIGYPKGVKRYKLWKL 596
Score = 32.7 bits (73), Expect(3) = 1e-23
Identities = 12/34 (35%), Positives = 22/34 (64%)
Frame = +3
Query: 494 SPILSSGGHHYYILFLDDFTDFLWTFPLTNKSQA 527
S + + GG Y++ +DDF+ +W + L NKS++
Sbjct: 6 SRVKTHGGSSYFLTIIDDFSRRVWLYVLKNKSES 107
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.319 0.134 0.418
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 70,828,049
Number of Sequences: 63676
Number of extensions: 1290398
Number of successful extensions: 14752
Number of sequences better than 10.0: 818
Number of HSP's better than 10.0 without gapping: 11112
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 13476
length of query: 1333
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1224
effective length of database: 5,698,948
effective search space: 6975512352
effective search space used: 6975512352
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)
Medicago: description of AC146720.2