
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146632.4 - phase: 0
(1038 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 549 e-156
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 548 e-156
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 151 2e-36
CO982036 147 2e-35
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 147 2e-35
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 140 2e-33
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 138 1e-32
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 132 8e-31
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 132 1e-30
TC232995 131 1e-30
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 127 3e-29
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 123 5e-28
BU549979 122 1e-27
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 120 2e-27
BU548243 120 3e-27
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 100 7e-27
BE211208 116 6e-26
AI855899 similar to GP|2244960|emb| retrotransposon like protein... 116 6e-26
BM307983 114 2e-25
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 113 4e-25
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 549 bits (1415), Expect = e-156
Identities = 335/1025 (32%), Positives = 512/1025 (49%), Gaps = 14/1025 (1%)
Frame = +1
Query: 17 NRGIIVGNGHSIPIRGYSHTNLSFPNPPLTLKNVLHSPQLIKNLVSVRKFTTDNSVSVEF 76
++G I+G G + H L N L +K L NL+S+ + D +V F
Sbjct: 1786 SKGKIIGMGKLV------HDGLPSLNKVLLVKG------LTANLISISQLC-DEGFNVNF 1926
Query: 77 DPFGFSVKDFQTGMRLMRCESRGDLYPITTSQAISPSTFAALAPS---LWHARLGH---P 130
V + ++ + + S+ + Y T + ST + +WH R GH
Sbjct: 1927 TKSECLVTNEKSEVLMKGSRSKDNCYLWTPQETSYSSTCLSSKEDEVRIWHQRFGHLHLR 2106
Query: 131 GAPVVDSLRKNKFIECNRASGSHICHSCSLGKHIKLPFVS-SNSCTIMPFDIIHSDIW-T 188
G + + I + IC C +GK +K+ + T +++H D+
Sbjct: 2107 GMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGP 2286
Query: 189 SPVLSSSGHRYYVLFVDDYSNFLWTFPLSKKSQVFSIFLSFRTFIRTQFEREVKNIQCDN 248
V S G RY + VDD+S F W + +KS+ F +F ++ + + +K I+ D+
Sbjct: 2287 MQVESLGGKRYAYVVVDDFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDH 2466
Query: 249 GKEFDNRHFWEFCKENGVAFRLSCPHTSSQNGKAERKIRTINNIIRTLLVHASLPPSFWH 308
G+EF+N F EFC G+ S T QNG ERK RT+ R +L LP + W
Sbjct: 2467 GREFENSRFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWA 2646
Query: 309 HALQMATYLINILPNKQLAYQSPLKILYQKEPSYSHLRVFGCLCYPLFPSTTINKLQARS 368
A+ A Y+ N + ++ + +I ++PS H +FG CY L K+ +S
Sbjct: 2647 EAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKS 2826
Query: 369 TPCVFLGYPSNHRSYKCYDLSSRKIIISRHVIFDETQFPFAKLHNPQPYTYGFMDDGPSP 428
+FLGY +N R+Y+ ++ +R ++ S +V+ D+
Sbjct: 2827 DAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDDLS----------------------- 2937
Query: 429 YVIHHLTSQPSLGQPAQHDLPNTQPTTQPTTPEEQHAHSPPS----SSPNPSPSTTATPS 484
P+ + + D+ + ++A + S S+ N ++T
Sbjct: 2938 ---------PARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDESNINQPDKRSSTRI 3090
Query: 485 PPYQPTPISVPKPVTRSQHGIFKPKRQLNL--NTSVPRSPLPRNPVSALRDPNWKMAMDD 542
P + + P G+ R++ + N+ P+N AL D W AM +
Sbjct: 3091 QKMHPKELIIGDP----NRGVTTRSREVEIVSNSCFVSKIEPKNVKEALTDEFWINAMQE 3258
Query: 543 EFNALIKNKTWELVPRPPDVNVIRSMWIFTHKEKSDGVFERYKARLVGDGKTQQVGVDCG 602
E +N+ WELVPRP NVI + WIF +K +GV R KARLV G TQ GVD
Sbjct: 3259 ELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFD 3438
Query: 603 ETFSPVVKPATIRTVLSLALSKAWSIHQLDVKNAFLHGELKETVYMHQPMGFRDPNLPNH 662
ETF+PV + +IR +L +A + ++Q+DVK+AFL+G L E VY+ QP GF DP P+H
Sbjct: 3439 ETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDH 3618
Query: 663 VCLLKKSLYGLKQAPRAWYKRFADYVSTIGFSHSTSDHSLFIYRKGTAMAYILLYVDDII 722
V LKK+LYGLKQAPRAWY+R ++++ G+ D +LF+ + + +YVDDI+
Sbjct: 3619 VYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIV 3798
Query: 723 LTASSDALRVTIISLLSTEFAMKDLGSLHYFLGIAVTHHTGGLFLSQRKYAAEIIERAGM 782
S+ + + + +EF M +G L YFLG+ V +FLSQ +YA I+++ GM
Sbjct: 3799 FGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKFGM 3978
Query: 783 AACKSSSTPVDTKPKLSANSSAPHADPSHYRSLAGALQYLTFTRPDIAYAVQQVCLFMHD 842
TP T KLS + + D S YRS+ G+L YLT +RPDI YAV + +
Sbjct: 3979 ENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQAN 4158
Query: 843 PREEHMHALKRIVRYIQGTLDHGLHLYPSSTSTLISYTDADWGGCPDTRRSTSGYCVFLG 902
P+ H+ +KRI++Y+ GT D+G+ S L+ Y DADW G D R+STSG C +LG
Sbjct: 4159 PKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGGCFYLG 4338
Query: 903 DNLISWSAKRQATLSRSSAEAEYRGVANVVSESCWLRNLLLELHCPIRKASLVYCDNVSA 962
+NLISW +K+Q +S S+AEAEY + S+ W++ +L E + + +YCDN+SA
Sbjct: 4339 NNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVE-QDVMTLYCDNMSA 4515
Query: 963 IYLSGNPVQHQRTKHIEMDIHSVREKVARGEVRVLHVPSRYQIADIFTKGLPLVLFEDFR 1022
I +S NPVQH RTKHI++ H +R+ V + + HV + QIADIFTK L FE R
Sbjct: 4516 INISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDANQFEKLR 4695
Query: 1023 NNLSV 1027
L +
Sbjct: 4696 GKLGI 4710
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 548 bits (1412), Expect = e-156
Identities = 332/997 (33%), Positives = 506/997 (50%), Gaps = 15/997 (1%)
Frame = +1
Query: 46 TLKNVLHSPQLIKNLVSVRKFTTDNSVSVEFDPFGFSVKDFQTGMRLMRCESRGDLYPIT 105
+L VL L NL+S+ + D +V F V + ++ + + S+ + Y +
Sbjct: 1840 SLNKVLLVKGLTANLISISQLC-DEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCY-LW 2013
Query: 106 TSQAISPSTFAALAPS----LWHARLGH---PGAPVVDSLRKNKFIECNRASGSHICHSC 158
T Q S S+ + +WH R GH G + + I + IC C
Sbjct: 2014 TPQETSYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGEC 2193
Query: 159 SLGKHIKLPFVS-SNSCTIMPFDIIHSDIW-TSPVLSSSGHRYYVLFVDDYSNFLWTFPL 216
+GK +K+ + T +++H D+ V S G RY + VDD+S F W +
Sbjct: 2194 QIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFI 2373
Query: 217 SKKSQVFSIFLSFRTFIRTQFEREVKNIQCDNGKEFDNRHFWEFCKENGVAFRLSCPHTS 276
+KS F +F ++ + + +K I+ D+G+EF+N F EFC G+ S T
Sbjct: 2374 REKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITP 2553
Query: 277 SQNGKAERKIRTINNIIRTLLVHASLPPSFWHHALQMATYLINILPNKQLAYQSPLKILY 336
QNG ERK RT+ R +L LP + W A+ A Y+ N + ++ + +I
Sbjct: 2554 QQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWK 2733
Query: 337 QKEPSYSHLRVFGCLCYPLFPSTTINKLQARSTPCVFLGYPSNHRSYKCYDLSSRKIIIS 396
++P+ H +FG CY L K+ +S +FLGY +N R+Y+ ++ +R ++ S
Sbjct: 2734 GRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMES 2913
Query: 397 RHVIFDETQFPFAKLHNPQPYTYGFMDDGPSPYVIHHLTSQPSLGQPAQHDLPNTQPTTQ 456
+V+ D+ P+ + + D+ +
Sbjct: 2914 INVVVDDLT--------------------------------PARKKDVEEDVRTSGDNVA 2997
Query: 457 PTTPEEQHAHSPPSSSPNPSPSTT-ATPSPPYQ---PTPISVPKPVTRSQHGIFKPKRQL 512
T ++A + S++ P+ + PS Q P + + P G+ R++
Sbjct: 2998 DTAKSAENAENSDSATDEPNINQPDKRPSIRIQKMHPKELIIGDP----NRGVTTRSREI 3165
Query: 513 NL--NTSVPRSPLPRNPVSALRDPNWKMAMDDEFNALIKNKTWELVPRPPDVNVIRSMWI 570
+ N+ P+N AL D W AM +E +N+ WELVPRP NVI + WI
Sbjct: 3166 EIVSNSCFVSKIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWI 3345
Query: 571 FTHKEKSDGVFERYKARLVGDGKTQQVGVDCGETFSPVVKPATIRTVLSLALSKAWSIHQ 630
F +K +GV R KARLV G TQ GVD ETF+PV + +IR +L +A + ++Q
Sbjct: 3346 FKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQ 3525
Query: 631 LDVKNAFLHGELKETVYMHQPMGFRDPNLPNHVCLLKKSLYGLKQAPRAWYKRFADYVST 690
+DVK+AFL+G L E Y+ QP GF DP P+HV LKK+LYGLKQAPRAWY+R ++++
Sbjct: 3526 MDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQ 3705
Query: 691 IGFSHSTSDHSLFIYRKGTAMAYILLYVDDIILTASSDALRVTIISLLSTEFAMKDLGSL 750
G+ D +LF+ + + +YVDDI+ S+ + + + +EF M +G L
Sbjct: 3706 QGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGEL 3885
Query: 751 HYFLGIAVTHHTGGLFLSQRKYAAEIIERAGMAACKSSSTPVDTKPKLSANSSAPHADPS 810
YFLG+ V +FLSQ KYA I+++ GM TP T KLS + + D S
Sbjct: 3886 TYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQS 4065
Query: 811 HYRSLAGALQYLTFTRPDIAYAVQQVCLFMHDPREEHMHALKRIVRYIQGTLDHGLHLYP 870
YRS+ G+L YLT +RPDI YAV + +P+ H++ +KRI++Y+ GT D+G+
Sbjct: 4066 LYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYGIMYCH 4245
Query: 871 SSTSTLISYTDADWGGCPDTRRSTSGYCVFLGDNLISWSAKRQATLSRSSAEAEYRGVAN 930
S S L+ Y DADW G D R+STSG C +LG NLISW +K+Q +S S+AEAEY +
Sbjct: 4246 CSDSMLVGYCDADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGS 4425
Query: 931 VVSESCWLRNLLLELHCPIRKASLVYCDNVSAIYLSGNPVQHQRTKHIEMDIHSVREKVA 990
S+ W++ +L E + + +YCDN+SAI +S NPVQH RTKHI++ H +R+ V
Sbjct: 4426 SCSQLVWMKQMLKEYNVE-QDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVD 4602
Query: 991 RGEVRVLHVPSRYQIADIFTKGLPLVLFEDFRNNLSV 1027
+ + HV + QIADIFTK L FE R L +
Sbjct: 4603 DKVITLEHVDTEEQIADIFTKALDANQFEKLRGKLGI 4713
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 151 bits (381), Expect = 2e-36
Identities = 83/222 (37%), Positives = 130/222 (58%), Gaps = 4/222 (1%)
Frame = +1
Query: 808 DPSHYRSLAGALQYLTFTRPDIAYAVQQVCLFMHDPREEHMHALKRIVRYIQGTLDHGLH 867
D + +R L G+L+YL +RP+I +AV + FM PR HM A KR++R I+GT+ G+
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGV- 186
Query: 868 LYP----SSTSTLISYTDADWGGCPDTRRSTSGYCVFLGDNLISWSAKRQATLSRSSAEA 923
L+P S L+ YTD+DW P+ +ST GY D ++ S+K+Q ++ S+ EA
Sbjct: 187 LFPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEA 366
Query: 924 EYRGVANVVSESCWLRNLLLELHCPIRKASLVYCDNVSAIYLSGNPVQHQRTKHIEMDIH 983
EY + ++ W+ NLL EL RK + DN SAI L+ +P H R+KHIE+ H
Sbjct: 367 EYVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFH 546
Query: 984 SVREKVARGEVRVLHVPSRYQIADIFTKGLPLVLFEDFRNNL 1025
+R++V++G V V + + Q+AD+ TK + + F+ + L
Sbjct: 547 YIRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>CO982036
Length = 674
Score = 147 bits (372), Expect = 2e-35
Identities = 90/212 (42%), Positives = 124/212 (58%), Gaps = 3/212 (1%)
Frame = -2
Query: 705 YRKGTAMAYILLYVDDIILTASSDALRVTIISLLSTEFAMKDLGSLHYFLGIAVTHHTGG 764
Y+ Y+L+YVD II+T SS L + S L++ F +K LG L YF+ I V
Sbjct: 673 YKTHILTVYLLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPDL 497
Query: 765 LFLSQRKYAAEIIERAGMAACKSSSTPVDTKPKLSANSSAPHADPSHYRSLAGALQYLTF 824
LF S R EI R + S+P+ T KLS + S + P+ YRS+ GALQY T
Sbjct: 496 LF-SLRTSIFEIFCRKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTV 320
Query: 825 TRPDIAYAVQQVCLFMHDPREEHMHALKRIVRYIQGTLDHGLHLYPSSTS---TLISYTD 881
RP+I++AV +VC FM +P + H +KRI+RY++G+L +GL L P+ +S + + D
Sbjct: 319 IRPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCD 140
Query: 882 ADWGGCPDTRRSTSGYCVFLGDNLISWSAKRQ 913
ADW D +RSTSG VFLG NLISW +Q
Sbjct: 139 ADWASAVDDKRSTSGAAVFLGPNLISWWXXKQ 44
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 147 bits (372), Expect = 2e-35
Identities = 70/138 (50%), Positives = 91/138 (65%)
Frame = +2
Query: 876 LISYTDADWGGCPDTRRSTSGYCVFLGDNLISWSAKRQATLSRSSAEAEYRGVANVVSES 935
L Y DADW GCP RRSTSGYCVF+G NL+SW +K+Q ++RSSAEAEYR +A V E
Sbjct: 17 LSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCEL 196
Query: 936 CWLRNLLLELHCPIRKASLVYCDNVSAIYLSGNPVQHQRTKHIEMDIHSVREKVARGEVR 995
W++ L EL +YCDN +A++++ NPV H+RTKHIE+D H +REK+ E+
Sbjct: 197 MWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEIV 376
Query: 996 VLHVPSRYQIADIFTKGL 1013
+ S Q DI TK L
Sbjct: 377 TEFIGSNDQPVDILTKSL 430
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 140 bits (354), Expect = 2e-33
Identities = 66/151 (43%), Positives = 100/151 (65%)
Frame = -3
Query: 538 MAMDDEFNALIKNKTWELVPRPPDVNVIRSMWIFTHKEKSDGVFERYKARLVGDGKTQQV 597
+AM +E N +N W+LV +P + VI + W+F +K G+ R KARLV G Q+
Sbjct: 461 IAMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEE 282
Query: 598 GVDCGETFSPVVKPATIRTVLSLALSKAWSIHQLDVKNAFLHGELKETVYMHQPMGFRDP 657
G+D ET++PV + IR +L+ + ++Q+DVK+AFL+G ++E VY+ QP GF P
Sbjct: 281 GIDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIP 102
Query: 658 NLPNHVCLLKKSLYGLKQAPRAWYKRFADYV 688
+ P HV L+K+LYGLKQAPRAWY+R ++++
Sbjct: 101 DKPTHVYKLQKALYGLKQAPRAWYERISNFL 9
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 138 bits (347), Expect = 1e-32
Identities = 95/289 (32%), Positives = 142/289 (48%), Gaps = 5/289 (1%)
Frame = +2
Query: 20 IIVGNGHSIPIRGYSHTNLSFPNPPLTLKNVLHSPQLIKNLVSVRKFTTDNSVSVEFDPF 79
I + +G + G H + P L+L +V+ N+ S+ + T + SV FD
Sbjct: 20 ITLADGSRVVATGIGHVS---PTSSLSLNSVVFILGCPFNITSLSQLTRFRNCSVTFDAN 190
Query: 80 GFSVKDFQTGMRL-MRCESRGDLYPITTSQAISPSTFAALAPSLWHARLGHPGAP----V 134
F +++ TG + + ES G Y +S A +P L H RLGHP +
Sbjct: 191 SFVIQECGTGWTIGVGIESHGLYY---LKPNLSWVCSAVTSPKLLHERLGHPHLSKLKIM 361
Query: 135 VDSLRKNKFIECNRASGSHICHSCSLGKHIKLPFVSSNSCTIMPFDIIHSDIWTSPVLSS 194
V SL K K + C SC LGKH++ S PF +IH DIW +SS
Sbjct: 362 VPSLEKIKDL---------FCESCQLGKHVRSSXRHVESRVDSPFLVIHXDIWGPNRVSS 514
Query: 195 SGHRYYVLFVDDYSNFLWTFPLSKKSQVFSIFLSFRTFIRTQFEREVKNIQCDNGKEFDN 254
+RY+V F+D++S F + ++S++ S FL+ I+TQF + +K ++ DN KE+ +
Sbjct: 515 MSYRYFVTFIDEFSQCTRVFLMKERSEILS-FLTSVNKIKTQFGKTIKILRSDNAKEYFS 691
Query: 255 RHFWEFCKENGVAFRLSCPHTSSQNGKAERKIRTINNIIRTLLVHASLP 303
F G+ + SCPHT QN AERK R + RTLL+HA+ P
Sbjct: 692 SVISPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANEP 838
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 132 bits (332), Expect = 8e-31
Identities = 78/179 (43%), Positives = 100/179 (55%), Gaps = 1/179 (0%)
Frame = +1
Query: 579 GVFERYKARLVGDGKTQQVGVDCGETFSPVVKPATIRTVLSLALSKAWSIHQLDVKNAFL 638
G +++KARLV TQ G D TFSPV K A + + S+A+ W + LD KNAFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 639 HGELKETVYMHQPMGF-RDPNLPNHVCLLKKSLYGLKQAPRAWYKRFADYVSTIGFSHST 697
HG L+E VYM QP+GF N VC L +S YGLKQ+PRAW F + I +
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAW--PFLYCGAAIWYDSHE 381
Query: 698 SDHSLFIYRKGTAMAYILLYVDDIILTASSDALRVTIISLLSTEFAMKDLGSLHYFLGI 756
+DHS+F Y+++YVDDI +T S + L +F KDLG L YFLGI
Sbjct: 382 ADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 132 bits (331), Expect = 1e-30
Identities = 65/133 (48%), Positives = 84/133 (62%)
Frame = -2
Query: 556 VPRPPDVNVIRSMWIFTHKEKSDGVFERYKARLVGDGKTQQVGVDCGETFSPVVKPATIR 615
VP PP + W++T K G +R KARLV G TQ G+D +TFSPV K T+R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 616 TVLSLALSKAWSIHQLDVKNAFLHGELKETVYMHQPMGFRDPNLPNHVCLLKKSLYGLKQ 675
L++A W +HQLD+KNAFLHG+L+E +YM QP GF VC L +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 676 APRAWYKRFADYV 688
+PRAW+ +F+ V
Sbjct: 46 SPRAWFGKFSHVV 8
>TC232995
Length = 1009
Score = 131 bits (330), Expect = 1e-30
Identities = 69/170 (40%), Positives = 99/170 (57%)
Frame = +2
Query: 648 MHQPMGFRDPNLPNHVCLLKKSLYGLKQAPRAWYKRFADYVSTIGFSHSTSDHSLFIYRK 707
+ QP GF + PNHV L+K+LYGLKQAPRAWY+R ++++ FS D +LFI RK
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRK 181
Query: 708 GTAMAYILLYVDDIILTASSDALRVTIISLLSTEFAMKDLGSLHYFLGIAVTHHTGGLFL 767
+ + +YVDDII +++D+L + +EF M +G L YFLG+ + G+F+
Sbjct: 182 HNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFI 361
Query: 768 SQRKYAAEIIERAGMAACKSSSTPVDTKPKLSANSSAPHADPSHYRSLAG 817
+Q KY E+I+R GM + K STP+ T L + S D YR G
Sbjct: 362 NQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIG 511
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 127 bits (318), Expect = 3e-29
Identities = 61/129 (47%), Positives = 80/129 (61%)
Frame = +3
Query: 523 LPRNPVSALRDPNWKMAMDDEFNALIKNKTWELVPRPPDVNVIRSMWIFTHKEKSDGVFE 582
+P AL P W+ AM DE AL N TWELVP PP + W++T K +G +
Sbjct: 18 VPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGKVD 197
Query: 583 RYKARLVGDGKTQQVGVDCGETFSPVVKPATIRTVLSLALSKAWSIHQLDVKNAFLHGEL 642
R KARLV G TQ G++ +TFSPV T+R L++A + W +HQLD+KNAFLHG+L
Sbjct: 198 RLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHGDL 377
Query: 643 KETVYMHQP 651
+E +YM QP
Sbjct: 378 EEDIYMEQP 404
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 123 bits (308), Expect = 5e-28
Identities = 76/192 (39%), Positives = 109/192 (56%), Gaps = 3/192 (1%)
Frame = +3
Query: 850 ALKRIVRYIQGTLDHGLHLYPSSTSTLISYTDADWGGCPDTRRSTSGYCVFLGDNLISWS 909
A R+++Y++G GL S ++ ++DADW C D+ +S + YC FLG +LISW
Sbjct: 18 AATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLISWK 197
Query: 910 AKRQATLSR--SSAEAEYRGVANVVSESCWLRNLLLELHCPIRKASLVYCDNVSAIYLSG 967
AK+Q T+SR SS+EA+YR + + E WL LL +LH +L+YCDN SA L
Sbjct: 198 AKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLH-----VTLIYCDNQSA--LQ* 356
Query: 968 NPVQHQRTKHIEMDIHSVREKVARGEVR-VLHVPSRYQIADIFTKGLPLVLFEDFRNNLS 1026
P++ +E+D H VREK +G + +L V S Q+ADIFTK L LF + L
Sbjct: 357 LPIKVIYHGQLEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSNLSKLG 536
Query: 1027 VRQPPVSTAGVC 1038
+ + A VC
Sbjct: 537 LSDIFLPPACVC 572
>BU549979
Length = 615
Score = 122 bits (305), Expect = 1e-27
Identities = 60/184 (32%), Positives = 103/184 (55%), Gaps = 2/184 (1%)
Frame = -1
Query: 839 FMHDPREEHMHALKRIVRYIQGTLDHGLHLYPSSTSTLISYTDADWGGCPDTRRSTSGYC 898
+ +P +H K+++RY+QGT D+ L ++ +I Y+D+D+ GC D+RRSTSGY
Sbjct: 603 YQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTSGYI 424
Query: 899 VFLGDNLISWSAKRQATLSRSSAEAEYRGVANVVSESCWLRNLLLELHC--PIRKASLVY 956
L D ++SW + +Q ++ S+ E E+ S WL++ + L I + +Y
Sbjct: 423 FMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPLKLY 244
Query: 957 CDNVSAIYLSGNPVQHQRTKHIEMDIHSVREKVARGEVRVLHVPSRYQIADIFTKGLPLV 1016
CDN +A++++ N R+KHI++ +RE+V +V + HV + I D TKG+
Sbjct: 243 CDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGMTPK 64
Query: 1017 LFED 1020
F+D
Sbjct: 63 NFKD 52
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 120 bits (302), Expect = 2e-27
Identities = 61/140 (43%), Positives = 87/140 (61%)
Frame = -2
Query: 668 KSLYGLKQAPRAWYKRFADYVSTIGFSHSTSDHSLFIYRKGTAMAYILLYVDDIILTASS 727
KSLYGLKQA R WY++ + + G+ S SD+SLF KG +L+YVDDIIL S
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241
Query: 728 DALRVTIISLLSTEFAMKDLGSLHYFLGIAVTHHTGGLFLSQRKYAAEIIERAGMAACKS 787
I ++L F +K+LG L YFLG+ V H G+ +SQRKY ++++ +G+ CK
Sbjct: 240 IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61
Query: 788 SSTPVDTKPKLSANSSAPHA 807
+STP+DT KL + + P+A
Sbjct: 60 ASTPLDTSIKLHSAAGTPYA 1
>BU548243
Length = 599
Score = 120 bits (301), Expect = 3e-27
Identities = 61/148 (41%), Positives = 90/148 (60%)
Frame = -1
Query: 881 DADWGGCPDTRRSTSGYCVFLGDNLISWSAKRQATLSRSSAEAEYRGVANVVSESCWLRN 940
DA W D RST G +FLG NLISW +++Q ++SS EAEYR +A +E W++
Sbjct: 587 DAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELTWIQA 408
Query: 941 LLLELHCPIRKASLVYCDNVSAIYLSGNPVQHQRTKHIEMDIHSVREKVARGEVRVLHVP 1000
LL+EL P ++ CDN SA+ ++ N V H RTKH+E+D+ V EKV ++++ H+P
Sbjct: 407 LLMELQIPF-TPPVILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQIFHIP 231
Query: 1001 SRYQIADIFTKGLPLVLFEDFRNNLSVR 1028
+ Q A I TK L F ++ L+V+
Sbjct: 230 ALDQWAGILTKPLSSARFTFLKSKLTVK 147
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 100 bits (249), Expect(2) = 7e-27
Identities = 58/182 (31%), Positives = 86/182 (46%)
Frame = +3
Query: 714 ILLYVDDIILTASSDALRVTIISLLSTEFAMKDLGSLHYFLGIAVTHHTGGLFLSQRKYA 773
I +YVDDII A+S + L+ F G L + LG+ + G+F+ Q KY
Sbjct: 495 IHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQEKYT 674
Query: 774 AEIIERAGMAACKSSSTPVDTKPKLSANSSAPHADPSHYRSLAGALQYLTFTRPDIAYAV 833
++R M K +TP+ + + H Y + +L YLT +RPDI + V
Sbjct: 675 KSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRPDIVFVV 854
Query: 834 QQVCLFMHDPREEHMHALKRIVRYIQGTLDHGLHLYPSSTSTLISYTDADWGGCPDTRRS 893
F P+ H+ A+KRI+RY+ GT +H L S L+ Y D + G R+S
Sbjct: 855 CLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGDKVERKS 1034
Query: 894 TS 895
TS
Sbjct: 1035TS 1040
Score = 39.7 bits (91), Expect(2) = 7e-27
Identities = 16/34 (47%), Positives = 25/34 (73%)
Frame = +2
Query: 670 LYGLKQAPRAWYKRFADYVSTIGFSHSTSDHSLF 703
+YGLKQA RAWY+R + ++ + GF+ +D +LF
Sbjct: 362 VYGLKQALRAWYERLSSFLVSNGFTRGITDPALF 463
>BE211208
Length = 413
Score = 116 bits (290), Expect = 6e-26
Identities = 59/135 (43%), Positives = 88/135 (64%), Gaps = 1/135 (0%)
Frame = +2
Query: 706 RKGTAMAYILLYVDDIILTASSDALRVTIISLLSTEFAMKDLGSLHYFLGIAVTHH-TGG 764
+K + Y+L+YVDDII+T S+ L +++ L++ F++K LG L YFLGI V H TG
Sbjct: 8 KKDRNLVYLLVYVDDIIITGRSNYLIQSLVHHLNSNFSLKQLGQLDYFLGIEVHHTPTGS 187
Query: 765 LFLSQRKYAAEIIERAGMAACKSSSTPVDTKPKLSANSSAPHADPSHYRSLAGALQYLTF 824
+ L+Q KY +++ + MA K S+P+ T +LS N +DP+ YRS+ GALQY T
Sbjct: 188 VLLTQSKYICDLLHKTDMAEAKPISSPMVTNLRLSKNGDDLLSDPTMYRSVVGALQYPTI 367
Query: 825 TRPDIAYAVQQVCLF 839
TRP+I++A +VC F
Sbjct: 368 TRPEISFAANKVCQF 412
>AI855899 similar to GP|2244960|emb| retrotransposon like protein
{Arabidopsis thaliana}, partial (18%)
Length = 418
Score = 116 bits (290), Expect = 6e-26
Identities = 62/135 (45%), Positives = 80/135 (58%), Gaps = 3/135 (2%)
Frame = +1
Query: 782 MAACKSSSTPVDTKPKLSANSSAPHADPSHYRSLAGALQYLTFTRPDIAYAVQQVCLFMH 841
M C STP+ + KLS S + YR + GALQY+T TRP+IAY V +V FM
Sbjct: 13 MLDCNGISTPMVSSYKLSKFGSELLPNAHQYRDIVGALQYVTLTRPNIAYNVNKVSEFMS 192
Query: 842 DPREEHMHALKRIVRYIQGTLDHGLHLYPSSTSTLIS---YTDADWGGCPDTRRSTSGYC 898
P + + +KRI+RY+ GT+ GL L P+ IS Y D DWG P RSTSG C
Sbjct: 193 SPLQSY*LTVKRILRYLSGTVTQGLLLQPAHMDAKISLRAYNDLDWGSDPAEMRSTSGSC 372
Query: 899 VFLGDNLISWSAKRQ 913
+F G NLI+WS+K+Q
Sbjct: 373 IFSGSNLIAWSSKKQ 417
>BM307983
Length = 406
Score = 114 bits (285), Expect = 2e-25
Identities = 63/131 (48%), Positives = 81/131 (61%), Gaps = 3/131 (2%)
Frame = +2
Query: 569 WIFTHKEKSDGVFERYKARLVGDGKTQQVGVDCGETFSPVVKPATIRTVLSLALSKA--- 625
WI+T K +D +RYKARLV G Q G+D ETF+ K I++ S +A
Sbjct: 14 WIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*--IQSGSSSP*QQAQFG 187
Query: 626 WSIHQLDVKNAFLHGELKETVYMHQPMGFRDPNLPNHVCLLKKSLYGLKQAPRAWYKRFA 685
W +HQ DVKNAFLHG L+E VYM P G+ N N VC LKK+LYGLKQ+PRAW+ RF
Sbjct: 188 WEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGRFT 367
Query: 686 DYVSTIGFSHS 696
+ ++G+ S
Sbjct: 368 QAMLSLGYKQS 400
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 113 bits (283), Expect = 4e-25
Identities = 58/139 (41%), Positives = 86/139 (61%)
Frame = +3
Query: 889 DTRRSTSGYCVFLGDNLISWSAKRQATLSRSSAEAEYRGVANVVSESCWLRNLLLELHCP 948
D R+ST+G+ F+GD +W +K+Q ++ S+ EAEY + V + WLRNLL EL P
Sbjct: 9 DDRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKMP 188
Query: 949 IRKASLVYCDNVSAIYLSGNPVQHQRTKHIEMDIHSVREKVARGEVRVLHVPSRYQIADI 1008
+ + DN SA+ L+ NPV H+++KHI+ H +RE + + EV++ +V S+ Q ADI
Sbjct: 189 QEEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAADI 368
Query: 1009 FTKGLPLVLFEDFRNNLSV 1027
FTK L L F R+ L V
Sbjct: 369 FTKPLKLETFVKLRSMLGV 425
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.320 0.135 0.416
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 55,248,694
Number of Sequences: 63676
Number of extensions: 1020157
Number of successful extensions: 20246
Number of sequences better than 10.0: 1285
Number of HSP's better than 10.0 without gapping: 10516
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 14694
length of query: 1038
length of database: 12,639,632
effective HSP length: 107
effective length of query: 931
effective length of database: 5,826,300
effective search space: 5424285300
effective search space used: 5424285300
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (29.3 bits)
Medicago: description of AC146632.4