
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0029b.1
(1377 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
CF922488 203 5e-52
NP595172 polyprotein [Glycine max] 192 7e-49
TC223727 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part... 183 4e-46
NP395547 reverse transcriptase [Glycine max] 156 7e-38
NP334778 reverse transcriptase [Glycine max] 153 6e-37
NP395548 reverse transcriptase [Glycine max] 144 3e-34
TC233837 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part... 138 1e-32
BI316922 135 1e-31
TC211815 77 9e-28
TC212032 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part... 92 2e-24
BG725601 similar to PIR|H86337|H86 protein F5M15.26 [imported] -... 106 8e-23
AW184779 64 3e-19
TC224482 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part... 86 1e-16
TC219643 weakly similar to UP|Q6WAY7 (Q6WAY7) Gag/pol polyprotei... 84 3e-16
BI321666 82 2e-15
TC211067 similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial (9%) 57 6e-12
BQ628592 50 8e-12
BE801213 weakly similar to GP|6691193|gb| F7F22.17 {Arabidopsis ... 52 7e-11
BE802896 65 3e-10
BE660092 weakly similar to GP|9884624|dbj retroelement pol polyp... 62 2e-09
>CF922488
Length = 741
Score = 203 bits (516), Expect = 5e-52
Identities = 109/245 (44%), Positives = 148/245 (59%)
Frame = +3
Query: 431 VKKANGKWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHP 490
V K +GK MCVDY DLN A PKD +PLP I+ LVD + S MD +SGY+QI++ P
Sbjct: 3 VLKEDGKV*MCVDYRDLN*ASPKDKFPLPHINVLVDNTTSFSQFSFMDGFSGYNQIKIAP 182
Query: 491 ADEDKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVR 550
D +KT F+T +CY+ M FGLKN GATYQR M +F + + +EVY+DDMIVKS
Sbjct: 183 EDMEKTTFITLWGTFCYKAMSFGLKNVGATYQRAMVALF*DMMHKEIEVYMDDMIVKSRT 362
Query: 551 GLDHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMK 610
+H +L + F +RK+ +RLNP KC F V+ K L F+ + RGIE++ K K I +M
Sbjct: 363 EEEHLVNLRKLFRRLRKYRLRLNPAKCMFEVKSRKLLDFIDS*RGIEVDSNKVKVILEMA 542
Query: 611 SPSNVKEVQRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVVFEWTAECEEAFVRLKELLS 670
P K+VQ GR+ + RF+ P F L KN +W +C AF R+K+ L
Sbjct: 543 KPHTEKQVQGFLGRLNYIVRFIS*LIATCEPLFILLCKNQFVKWDHDC*VAFERIKQCLI 722
Query: 671 SPPIL 675
+P +L
Sbjct: 723 NPHVL 737
>NP595172 polyprotein [Glycine max]
Length = 4659
Score = 192 bits (489), Expect = 7e-49
Identities = 118/379 (31%), Positives = 188/379 (49%)
Frame = +1
Query: 377 HRLALNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANG 436
H + L PV R + +++ + ++L I+ P L +++VKK +G
Sbjct: 1759 HAIPLKQGSGPVKVRPYRYPHTQKDQIEKMIQEMLVQGIIQPSNSPFSLP-ILLVKKKDG 1935
Query: 437 KWRMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKT 496
WR C DY LN KDS+P+P++D L+D G + S +D SGYHQI + P D +KT
Sbjct: 1936 SWRFCTDYRALNAITVKDSFPMPTVDELLDELHGAQYFSKLDLRSGYHQILVQPEDREKT 2115
Query: 497 AFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQ 556
AF T +Y + MPFGL NA AT+Q LM+++F + + + V+ DD+++ S DH +
Sbjct: 2116 AFRTHHGHYEWLVMPFGLTNAPATFQCLMNKIFQFALRKFVLVFFDDILIYSASWKDHLK 2295
Query: 557 DLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVK 616
LE +++H + KCSFG +LG ++ G+ + K +A+ +P+NVK
Sbjct: 2296 HLESVLQTLKQHQLFARLSKCSFGDTEVDYLGHKVSGLGVSMENTKVQAVLDWPTPNNVK 2475
Query: 617 EVQRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSPPILS 676
+++ G RF+ + + P L+K+ F W E E AFV+LK+ ++ P+LS
Sbjct: 2476 QLRGFLGLTGYYRRFIKSYANIAGPLTDLLQKD-SFLWNNEAEAAFVKLKKAMTEAPVLS 2652
Query: 677 KPIQGHPLHLYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLV 736
P P L S + +V+ Q H I YF S L + + LA+
Sbjct: 2653 LPDFSQPFILETDASGIGVGAVLGQ---NGHPIAYF-SKKLAPRMQKQSAYTRELLAITE 2820
Query: 737 TARRLRPYFQSFPVKVRTD 755
+ R Y +RTD
Sbjct: 2821 ALSKFRHYLLGNKFIIRTD 2877
Score = 131 bits (330), Expect = 2e-30
Identities = 147/574 (25%), Positives = 243/574 (41%), Gaps = 20/574 (3%)
Frame = +1
Query: 823 SNSSGSGAGVALEGPGELVLEQSLKFEFKATNNQAEYEALIA---GLKLAREVKI-RSLL 878
+++SG G G L G + S K + A L+A L R + +
Sbjct: 2686 TDASGIGVGAVLGQNGHPIAYFSKKLAPRMQKQSAYTRELLAITEALSKFRHYLLGNKFI 2865
Query: 879 IRTDSQLVENQVKGTFQVKDPNLIKYLERVRYLMTLFQEVVVEYVPRAENQRADALAKLA 938
IRTD + +++ + + Q P +L + L + +EY P +NQ ADAL+++
Sbjct: 2866 IRTDQRSLKSLMDQSLQT--PEQQAWLHKF-----LGYDFKIEYKPGKDNQAADALSRMF 3024
Query: 939 STRKPGNNKSVIQETLAYPSIEGELMSCVNRGRTWMDPIISILAGDPAEVEQCTKEQQRE 998
+ ++E R R DP + L T +Q +
Sbjct: 3025 MLAWSEPHSIFLEEL---------------RARLISDPHLKQLME--------TYKQGAD 3135
Query: 999 ASHYTLIDGHLYRRGFSTPLLKCVSPEKYEA---IMSEVHEGVCASHIG-GRSLACKVLR 1054
ASHYT+ +G LY + + V P + E I+ E H H G R+LA L+
Sbjct: 3136 ASHYTVREGLLYWKD------RVVIPAEEEIVNKILQEYHSSPIGGHAGITRTLAR--LK 3291
Query: 1055 AGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKELVTMSAPWPFAMW---GVDLVGPFP 1111
A FYWP +++D ++++C CQ + P L + P P +W +D + P
Sbjct: 3292 AQFYWPKMQEDVKAYIQKCLICQQAKSNNTLPAGLLQPL--PIPQQVWEDVAMDFITGLP 3465
Query: 1112 TARAQMKFILVAVDYFTKWIEAEPLAKITSAKIV-NFYWKRIVCRFGIPRAIVSDNGTQF 1170
+ + I+V +D TK+ PL ++K+V + IV GIPR+IVSD F
Sbjct: 3466 NSFG-LSVIMVVIDRLTKYAHFIPLKADYNSKVVAEAFMSHIVKLHGIPRSIVSDRDRVF 3642
Query: 1171 SSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVILRGLRRRLAEAKGAWLDELPAVL 1230
+S+ + K G + +S HPQ++GQ E N+ + LR E W+ LP
Sbjct: 3643 TSTFWQHLFKLQGTTLAMSSAYHPQSDGQSEVLNKCLEMYLRCFTYEHPKGWVKALPWAE 3822
Query: 1231 WSYNTTEQSTTRETPFRMTYGVDAMLPVEIDNFTWRTRPGFEEENQANMAVELDLLSETR 1290
+ YNT + TPFR YG + P + TR ++ A + +L
Sbjct: 3823 FWYNTAYHMSLGMTPFRALYGRE---PPTL------TRQACSIDDPAEVREQLTDRDALL 3975
Query: 1291 DEAHIRETAMKQRVAAKFNSRVRVRDMQVGDLVL----KWRSGA----PGNKLTPNWEGP 1342
+ I T +Q + + + + Q+GD VL +R + KL+ + GP
Sbjct: 3976 AKLKINLTRAQQVMKRQADKKRLDVSFQIGDEVLVKLQPYRQHSAVLRKNQKLSMRYFGP 4155
Query: 1343 YRIVKVLGNGAYHLEELDGRRLPRSFNGLSLRYF 1376
++++ +G+ AY LE R+ F+ L+ F
Sbjct: 4156 FKVLAKIGDVAYKLELPSAARIHPVFHVSQLKPF 4257
>TC223727 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (9%)
Length = 843
Score = 183 bits (465), Expect = 4e-46
Identities = 87/209 (41%), Positives = 125/209 (59%), Gaps = 1/209 (0%)
Frame = +1
Query: 996 QREASHYTLIDGHLYRRGFSTPLLKCVSPEKYEAIMSEVHEGVCASHIGGRSLACKVLRA 1055
+R A+ + + LY+R L+CV + ++ EVHEG +H G ++A K+LRA
Sbjct: 217 RRLAAGFFMSGSILYKRNHDMKPLRCVDAREANHMIEEVHEGSFGTHANGHAMARKILRA 396
Query: 1056 GFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKELVTMSAPWPFAMWGVDLVGPF-PTAR 1114
G+YW T+ DC V++C +CQ FAD APP L MS+PWPF+MWG+D++G P A
Sbjct: 397 GYYWLTMESDCCVHVRKCHKCQAFADNVNAPPHPLNVMSSPWPFSMWGIDVIGAIEPKAS 576
Query: 1115 AQMKFILVAVDYFTKWIEAEPLAKITSAKIVNFYWKRIVCRFGIPRAIVSDNGTQFSSSQ 1174
+FILVA+DYFTKW+EA + +V F K I+CR+G+PR I++DNGT ++
Sbjct: 577 NGHRFILVAIDYFTKWVEAASYTDVMRGVVVRFIKKEIICRYGLPRKIITDNGTNLNNKM 756
Query: 1175 TREFCKEMGIQMRFASVEHPQTNGQVESA 1203
E C+E IQ + P+ N VE A
Sbjct: 757 MGEICEEFKIQHHNPTPYRPKMN*AVEVA 843
>NP395547 reverse transcriptase [Glycine max]
Length = 762
Score = 156 bits (394), Expect = 7e-38
Identities = 90/248 (36%), Positives = 128/248 (51%), Gaps = 18/248 (7%)
Frame = +1
Query: 403 VQQEVDKLLAAEFIREVKYPTWLANVVMVKKANG------------------KWRMCVDY 444
V++EV KLL A I + +W++ V +V K G +WRMC+DY
Sbjct: 1 VRKEVFKLLEAGLIYPISDSSWVSPVQVVPKKGGMTVVKNDRNELIPTRRVTRWRMCIDY 180
Query: 445 TDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVN 504
LN+A KD YPLP +D ++ + +D YSGY+QI + P D++KTAF
Sbjct: 181 RKLNEATRKDHYPLPFMDQMLKRLARQSFYRFLDGYSGYNQIAVDPQDQEKTAFTCPFSV 360
Query: 505 YCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGE 564
+ YR MPFGL NA T+QR M +F V + +EV++DD + +LE+
Sbjct: 361 FAYRRMPFGLCNASTTFQRCMMAIFDDMVEKCIEVFMDDFSFFGASFGNCLANLEKVLQR 540
Query: 565 IRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGR 624
K ++ LN EKC F VQ G LG I+ RGIE+ EK I ++ P NVK + G
Sbjct: 541 CEKSNLVLNWEKCHFMVQEGIVLGHKISKRGIEVVKEKLDVIDKLPPPVNVKGIHSFLGH 720
Query: 625 IAALSRFL 632
+ RF+
Sbjct: 721 VGFYRRFI 744
>NP334778 reverse transcriptase [Glycine max]
Length = 431
Score = 153 bits (386), Expect = 6e-37
Identities = 72/143 (50%), Positives = 97/143 (67%)
Frame = +3
Query: 439 RMCVDYTDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAF 498
RMCVDY DLN+A PKD++PLP ID L+ + L S MD +SGY+QI+M P D +KT F
Sbjct: 3 RMCVDYRDLNRASPKDNFPLPHIDILMANMASFALFSFMDGFSGYNQIKMAPEDMEKTTF 182
Query: 499 MTARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDL 558
+T +CY+ M FGLKN GATY R M +F + + +E YVD+MI KS +H +L
Sbjct: 183 ITLWGTFCYKVMSFGLKNFGATYHRAMVALFQDMMHKEIEAYVDEMIAKSRMEEEHLVNL 362
Query: 559 EEAFGEIRKHSMRLNPEKCSFGV 581
+ FG++RK+ +RLNP KC FG+
Sbjct: 363 QNLFGQLRKYRLRLNPRKCVFGL 431
>NP395548 reverse transcriptase [Glycine max]
Length = 762
Score = 144 bits (362), Expect = 3e-34
Identities = 84/248 (33%), Positives = 132/248 (52%), Gaps = 18/248 (7%)
Frame = +1
Query: 403 VQQEVDKLLAAEFIREVKYPTWLANVVMVKKANG------------------KWRMCVDY 444
V++EV KLL I + W++ V++V K G W++C+DY
Sbjct: 1 VRKEVLKLLEVGLIYPISDSAWVSPVLVVSKKEGMTVIRNEKNDLIPTRTVTSWKLCIDY 180
Query: 445 TDLNKACPKDSYPLPSIDSLVDGASGNELLSLMDAYSGYHQIRMHPADEDKTAFMTARVN 504
LN+A KD +PLP +D +++ +G+ +DAY GY+QI + P D++K AF
Sbjct: 181 RKLNEATRKDHFPLPFMDQMLERLAGHAYYCFLDAYFGYNQIVVDPKDQEKMAFTCPFGV 360
Query: 505 YCYRTMPFGLKNAGATYQRLMDRVFARQVGRNMEVYVDDMIVKSVRGLDHHQDLEEAFGE 564
+ YR +PFGL NA T+Q M +FA V +++EV++DD V + LE
Sbjct: 361 FAYRRIPFGLCNAPTTFQMCMLAIFADIVEKSIEVFMDDFSVFVPSLESCLKKLEMVLQR 540
Query: 565 IRKHSMRLNPEKCSFGVQGGKFLGFMITSRGIEINPEKCKAIQQMKSPSNVKEVQRLTGR 624
+ ++ LN EKC F V+ G LG I++RGIE++ K I+++ PSNVK ++ G+
Sbjct: 541 CVETNLVLNWEKCHFMVREGIVLGHKISTRGIEVDQTKIDVIEKLPPPSNVKGIRSFLGQ 720
Query: 625 IAALSRFL 632
RF+
Sbjct: 721 ARFYRRFI 744
>TC233837 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (6%)
Length = 402
Score = 138 bits (348), Expect = 1e-32
Identities = 67/132 (50%), Positives = 92/132 (68%)
Frame = +2
Query: 475 SLMDAYSGYHQIRMHPADEDKTAFMTARVNYCYRTMPFGLKNAGATYQRLMDRVFARQVG 534
S MD +SGY+QI M D +KT F+T + YR M FGLKN GATYQR M +F +
Sbjct: 5 SFMDGFSGYNQI*MAREDVEKTTFVTLWGTFSYRVMAFGLKNTGATYQRAMVALFHDMMH 184
Query: 535 RNMEVYVDDMIVKSVRGLDHHQDLEEAFGEIRKHSMRLNPEKCSFGVQGGKFLGFMITSR 594
+ +EVYVDDMI KS +H +L + FG ++K+ ++LNP KC+FGV+ GK LGF+++ +
Sbjct: 185 KEIEVYVDDMIAKSRTETEHLVNLCKLFGRLQKYQLKLNPTKCTFGVKSGKLLGFIVSQK 364
Query: 595 GIEINPEKCKAI 606
GIEI+PEK KA+
Sbjct: 365 GIEIDPEKVKAL 400
>BI316922
Length = 405
Score = 135 bits (340), Expect = 1e-31
Identities = 61/134 (45%), Positives = 91/134 (67%)
Frame = +3
Query: 1033 EVHEGVCASHIGGRSLACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKELVT 1092
E+H G+C H + + +VLR G+Y T+RK C ++VK+C+EC F ++S +EL
Sbjct: 3 EMHRGICGMHSKSQLMTTRVLRVGYY**TMRKYCTEYVKKCEEC*KFGNISHLLVEELHN 182
Query: 1093 MSAPWPFAMWGVDLVGPFPTARAQMKFILVAVDYFTKWIEAEPLAKITSAKIVNFYWKRI 1152
+ APWPFA+ GVD++ PFP ++ Q+K++LV +D FTKWIE E +A I+ A + F + I
Sbjct: 183 IVAPWPFAI*GVDILRPFPLSKRQVKYLLVGIDQFTKWIETEHIAIISIANVRKFV*RNI 362
Query: 1153 VCRFGIPRAIVSDN 1166
VC FGIP ++SDN
Sbjct: 363 VC*FGIPNTLISDN 404
>TC211815
Length = 704
Score = 77.4 bits (189), Expect(2) = 9e-28
Identities = 49/143 (34%), Positives = 76/143 (52%), Gaps = 3/143 (2%)
Frame = +2
Query: 1238 QSTTRETPFRMTYGVDAMLPVEIDNFTWRTRPGFEEENQANMAVELDLLSETRDEAHIRE 1297
QSTT ETPF + YG+ MLP+E+ E N+ + ++LDL+ + R++ I
Sbjct: 230 QSTTHETPF*LIYGISVMLPIEVGEVFL*RHYFAEV*NKEALQIDLDLIKQVREDTVIMT 409
Query: 1298 TAMKQRVAAKFNSRVRVRDMQVG---DLVLKWRSGAPGNKLTPNWEGPYRIVKVLGNGAY 1354
A KQR+ FNS++ G + + + +K T N EGP++I NGAY
Sbjct: 410 *AFKQRMTRCFNSKLPSTV*GRGPSMEGIQRSLEVLVRSKFTTN*EGPFKIRHNSKNGAY 589
Query: 1355 HLEELDGRRLPRSFNGLSLRYFY 1377
LEEL G+ + R +N + L+ +Y
Sbjct: 590 KLEELSGKVVLRIWNSMHLKVYY 658
Score = 66.2 bits (160), Expect(2) = 9e-28
Identities = 30/67 (44%), Positives = 45/67 (66%)
Frame = +3
Query: 1165 DNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESANRVILRGLRRRLAEAKGAWLD 1224
DNG QF++ + EF + I+ R SV+HPQTN + E+AN+VIL L++ L A G W++
Sbjct: 3 DNGLQFTNRKLNEFPSGLNIKHRVTSVKHPQTNRRAEAANKVILGDLKKLLDGANGRWVE 182
Query: 1225 ELPAVLW 1231
+L +LW
Sbjct: 183 DLVEILW 203
>TC212032 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (3%)
Length = 803
Score = 92.0 bits (227), Expect(2) = 2e-24
Identities = 60/199 (30%), Positives = 100/199 (50%), Gaps = 5/199 (2%)
Frame = +2
Query: 866 LKLAREVKIRSLLIRTDSQLVENQVKGTFQVKDPNLIKYLERVRYLMTLFQEVVVEYVPR 925
++ A + ++ L + DS LV +Q++G + +DPNLI Y ++ L F E+ +V
Sbjct: 206 VQAAIDSNVKLLKVYGDSALVIHQLRGECETRDPNLIPYQAYIKELAGFFDEISFHHVA* 385
Query: 926 AENQRADALAKLASTRK--PGNNKSVIQETLAYPSIEGELMSCVNRGRTWMDPIISILAG 983
ENQ ADALA L S + P + I+ L+ G+ W I +
Sbjct: 386 EENQMADALATLVSMFQLTPHGDLPYIEFRCRGRPAHCCLVEEERDGKPWYFDIKRYVES 565
Query: 984 DPAEVEQCTKEQQRE---ASHYTLIDGHLYRRGFSTPLLKCVSPEKYEAIMSEVHEGVCA 1040
+E +++R+ A+ + + LY+R LL CV+ ++ E ++ EVHEG
Sbjct: 566 KEYPLEASDNDKRRKRRLAAGFFMSGSILYKRNHDMVLLHCVNGKEVENMLGEVHEGSFG 745
Query: 1041 SHIGGRSLACKVLRAGFYW 1059
+H G ++A K+LRAG+YW
Sbjct: 746 THSNGHAMARKILRAGYYW 802
Score = 40.4 bits (93), Expect(2) = 2e-24
Identities = 19/62 (30%), Positives = 31/62 (49%)
Frame = +1
Query: 800 VVELTPDRFERVDTQWTLFVDGSSNSSGSGAGVALEGPGELVLEQSLKFEFKATNNQAEY 859
++ L ++ + +W ++ D +SN G G G L P + + + F TNN AEY
Sbjct: 7 IMALFEEKLDEDRDKWIVWFDRASNVLGHGVGAILVSPDNQCIPFTTRLGFDCTNNMAEY 186
Query: 860 EA 861
EA
Sbjct: 187 EA 192
>BG725601 similar to PIR|H86337|H86 protein F5M15.26 [imported] - Arabidopsis
thaliana, partial (1%)
Length = 285
Score = 106 bits (264), Expect = 8e-23
Identities = 51/91 (56%), Positives = 67/91 (73%)
Frame = -3
Query: 377 HRLALNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMVKKANG 436
H+LA+ VK V+Q +R++ ++ + V QEV KL A FIR++ Y T L +VVMVKK NG
Sbjct: 277 HKLAICNDVKLVTQRKRKIREERCQTV*QEVVKLAIASFIRDINYST*LFSVVMVKKPNG 98
Query: 437 KWRMCVDYTDLNKACPKDSYPLPSIDSLVDG 467
KWR+C DY DLN ACPKD+YPLP+ID + DG
Sbjct: 97 KWRICTDYIDLN*ACPKDAYPLPNIDHMTDG 5
>AW184779
Length = 432
Score = 64.3 bits (155), Expect(2) = 3e-19
Identities = 28/69 (40%), Positives = 41/69 (58%)
Frame = +1
Query: 1009 LYRRGFSTPLLKCVSPEKYEAIMSEVHEGVCASHIGGRSLACKVLRAGFYWPTLRKDCMD 1068
LY+R LL+CV + E ++ EVHEG +H ++A K+LR G+YW T+ DC
Sbjct: 16 LYKRNHDMVLLRCVDAREAEQMLVEVHEGSFGTHANIHAMAQKILRVGYYWLTMESDCCI 195
Query: 1069 FVKQCKECQ 1077
V +C +CQ
Sbjct: 196 HVWKCHKCQ 222
Score = 50.8 bits (120), Expect(2) = 3e-19
Identities = 26/70 (37%), Positives = 38/70 (54%), Gaps = 10/70 (14%)
Frame = +3
Query: 1086 PPKELVTMSAPWPFA---------MWGVDLVGPF-PTARAQMKFILVAVDYFTKWIEAEP 1135
P + + M P P+ MWG+D++G P A FILVA+DYFTKW+EA
Sbjct: 219 PVRPSLIMLTPHPYL*MSWQHLGHMWGIDVIGAIEPKASNGHHFILVAIDYFTKWVEAVS 398
Query: 1136 LAKITSAKIV 1145
A +T + ++
Sbjct: 399 YASVTRSVVI 428
Score = 30.0 bits (66), Expect = 7.3
Identities = 12/23 (52%), Positives = 15/23 (65%)
Frame = +2
Query: 1077 QVFADLSKAPPKELVTMSAPWPF 1099
Q FAD APP L ++APWP+
Sbjct: 224 QTFADNVNAPPIPLNVLAAPWPY 292
>TC224482 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (6%)
Length = 669
Score = 85.5 bits (210), Expect = 1e-16
Identities = 57/183 (31%), Positives = 93/183 (50%), Gaps = 6/183 (3%)
Frame = +1
Query: 1201 ESANRVILRGLRRRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVDAMLP--V 1258
E+AN+ I + +++ K W + LP L Y T+ +++T TPF + YG++A+LP V
Sbjct: 1 EAANKNIKKIIQKMTVSYKD-WHEMLPFALHGYRTSVRTSTGATPFSLVYGMEAVLPFEV 177
Query: 1259 EIDNFTWRTRPGFEEENQANMAV-ELDLLSETRDEAHIRETAMKQRVAAKFNSRVRVRDM 1317
E+ + G +E A +L+L+ R A +QR+ + F+ +V +R
Sbjct: 178 EVPSLRILAESGLKESEWAQTRYDQLNLIEGKRLTAMSHGRLYQQRMKSAFDKKVCLRKF 357
Query: 1318 QVGDLVLKWRSGAPGN---KLTPNWEGPYRIVKVLGNGAYHLEELDGRRLPRSFNGLSLR 1374
GDLVLK S A + K PN+EGP+ + + GA L +DG LP N ++
Sbjct: 358 HEGDLVLKKMSHAVKDHRGKWAPNYEGPFVVKRAFSGGALVLTNMDGEELPSPMNSDVVK 537
Query: 1375 YFY 1377
+Y
Sbjct: 538 RYY 546
>TC219643 weakly similar to UP|Q6WAY7 (Q6WAY7) Gag/pol polyprotein (Fragment),
partial (8%)
Length = 1320
Score = 84.3 bits (207), Expect = 3e-16
Identities = 44/113 (38%), Positives = 64/113 (55%), Gaps = 4/113 (3%)
Frame = +3
Query: 323 EETKALKFGD----RTLKIGTRLTEEQETRLTKLLGENLDLFAWSCKDMPGIDPNFICHR 378
EET+ + G R +KIGT +T L LL + D+FAWS +DMPG+ + + HR
Sbjct: 978 EETELVDLGSGSGKREVKIGTGITAPIREELIILLRDYQDIFAWSYQDMPGLSSDIVQHR 1157
Query: 379 LALNPSVKPVSQLRRRLGGDKGKAVQQEVDKLLAAEFIREVKYPTWLANVVMV 431
L LNP PV Q RR+ + +++EV K A F+ +YP W+AN+V +
Sbjct: 1158 LPLNPECSPVKQKLRRMKPETSLKIKEEVKK*FDAGFLAVARYPKWVANIVPI 1316
>BI321666
Length = 430
Score = 81.6 bits (200), Expect = 2e-15
Identities = 41/126 (32%), Positives = 69/126 (54%)
Frame = +2
Query: 1048 LACKVLRAGFYWPTLRKDCMDFVKQCKECQVFADLSKAPPKELVTMSAPWPFAMWGVDLV 1107
++ VL++ FY P++ KD + C +CQ +SK L T+ F WG+D V
Sbjct: 2 ISTNVLQSRFYLPSIFKDAYVHAQSCNKCQRTRSVSKRNELPLHTILEVEIFDYWGIDFV 181
Query: 1108 GPFPTARAQMKFILVAVDYFTKWIEAEPLAKITSAKIVNFYWKRIVCRFGIPRAIVSDNG 1167
GPFP + + ++ILV VDY +KW+EA K + ++ F K+I R G+P ++ + G
Sbjct: 182 GPFPPSFSN-EYILVVVDYVSKWVEAVACQKSDAKIVIKFLKKQIFSRLGVPWVLIDNGG 358
Query: 1168 TQFSSS 1173
+ ++
Sbjct: 359 SHLCNA 376
>TC211067 similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial (9%)
Length = 589
Score = 56.6 bits (135), Expect(2) = 6e-12
Identities = 42/162 (25%), Positives = 72/162 (43%), Gaps = 1/162 (0%)
Frame = +1
Query: 614 NVKEVQRLTGRIAALSRFLPKSGDRSFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSPP 673
+V +++ G + RF+P + P + ++KN+ F W + E+AF LKE L+ P
Sbjct: 109 SVGDIRSFHGLASFYRRFVPNFSTVASPLNELVKKNMAFTWGEKQEQAFALLKEKLTKAP 288
Query: 674 ILSKPIQGHPLHLYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALA 733
+L+ P L S + +V+LQ G H I YF S L A + Y +K A
Sbjct: 289 VLALPDFSKTFELECDASGVGVRAVLLQ---GGHPIAYF-SEKLHSATLNYPTYDKELYA 456
Query: 734 VLVTARRLRPYFQSFPVKVRTD-LPLRQVLQKPDLSGRLVAW 774
++ + + + +D L+ + K L+ R W
Sbjct: 457 LIRAPQTWEHFLVCKEFVIHSDHQSLKYIRGKSKLNKRHAKW 582
Score = 33.5 bits (75), Expect(2) = 6e-12
Identities = 14/33 (42%), Positives = 22/33 (66%)
Frame = +2
Query: 586 FLGFMITSRGIEINPEKCKAIQQMKSPSNVKEV 618
F GF++ G++++PEK KAIQ+ P V E+
Sbjct: 23 FSGFVVGRNGVQMDPEKIKAIQEWPPP*KVWEI 121
>BQ628592
Length = 423
Score = 49.7 bits (117), Expect(2) = 8e-12
Identities = 22/71 (30%), Positives = 41/71 (56%), Gaps = 2/71 (2%)
Frame = -1
Query: 646 LRKNVVFEWTAECEEAFVRLKELLSSPPILSKPIQGHPLHLYFAVSDSALSSVMLQEIDG 705
L KN W + +EAF ++K+ L++P +L P+ G P LY + D ++ V++Q D
Sbjct: 423 LPKNQAILWNSNYQEAFEKIKQSLANPSVLMPPVTGRPFLLYMTMLDESMGCVLVQHDDS 244
Query: 706 --EHRIVYFVS 714
+ + +Y++S
Sbjct: 243 GKKEQAIYYLS 211
Score = 40.0 bits (92), Expect(2) = 8e-12
Identities = 18/64 (28%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Frame = -3
Query: 721 EVRYQKIEKAALAVLVTARRLRPYFQSFPVKVRTDL-PLRQVLQKPDLSGRLVAWSVELS 779
++ Y +E+ ++ + RLR Y S + + + P++ + +KP L+GR+ W V LS
Sbjct: 193 KMNYSMLERTCCTLVWASHRLRQYMLSHTTWLISKMDPVKYIFEKPALTGRIARWQVLLS 14
Query: 780 EYGL 783
E+ +
Sbjct: 13 EFNI 2
>BE801213 weakly similar to GP|6691193|gb| F7F22.17 {Arabidopsis thaliana},
partial (3%)
Length = 416
Score = 52.4 bits (124), Expect(2) = 7e-11
Identities = 23/61 (37%), Positives = 36/61 (58%)
Frame = +3
Query: 1144 IVNFYWKRIVCRFGIPRAIVSDNGTQFSSSQTREFCKEMGIQMRFASVEHPQTNGQVESA 1203
++ F K I RFG+PR ++SD G+ F SQ + K ++ + + HPQTNGQ + +
Sbjct: 18 VIKFLKKNIFSRFGMPRILISDGGSHFYYSQLNKVLKHDSVRHKVETSYHPQTNGQAKVS 197
Query: 1204 N 1204
N
Sbjct: 198 N 200
Score = 34.3 bits (77), Expect(2) = 7e-11
Identities = 15/48 (31%), Positives = 26/48 (53%)
Frame = +2
Query: 1213 RRLAEAKGAWLDELPAVLWSYNTTEQSTTRETPFRMTYGVDAMLPVEI 1260
+ +A ++ W +L LW+ T +++ TPF+M Y LPVE+
Sbjct: 227 KNVASSRKDWSSKLEDALWACKTAKKTPIGLTPFQMVYRKACHLPVEL 370
>BE802896
Length = 416
Score = 64.7 bits (156), Expect = 3e-10
Identities = 40/133 (30%), Positives = 60/133 (45%)
Frame = -2
Query: 623 GRIAALSRFLPKSGDRSFPFFKCLRKNVVFEWTAECEEAFVRLKELLSSPPILSKPIQGH 682
G RF+ + P L+K V F++ +C+ AF LK L + PI+ P
Sbjct: 406 GHAGFYRRFIRDFRKVALPLSNLLQKEVEFDFNDKCK*AFDCLKRALITTPIIQAPDWTA 227
Query: 683 PLHLYFAVSDSALSSVMLQEIDGEHRIVYFVSHTLQGAEVRYQKIEKAALAVLVTARRLR 742
P L S+ AL V+ Q+ID R++Y+ S TL A+ Y EK LA++ +
Sbjct: 226 PFELMCDASNYALGVVLAQKIDKLPRVIYYSSRTLDAAQANYTTTEKELLAIVFALEKFH 47
Query: 743 PYFQSFPVKVRTD 755
Y + V D
Sbjct: 46 SYLLGTRIIVYID 8
>BE660092 weakly similar to GP|9884624|dbj retroelement pol polyprotein-like
{Arabidopsis thaliana}, partial (13%)
Length = 378
Score = 62.0 bits (149), Expect = 2e-09
Identities = 39/118 (33%), Positives = 62/118 (52%), Gaps = 3/118 (2%)
Frame = -3
Query: 1180 KEMGIQMRFASVEHPQTNGQVESANRVILRGLRRRLAEAKGAWLDELPAVLWSYNTTEQS 1239
K+ G+ R ++ HPQTNGQ E +NR I R L + + ++ W L LW++ T ++
Sbjct: 361 KKYGVVHRVSTPYHPQTNGQAEISNREIKRILEKIVQPSRKDWSTRLDDALWAHRTAYKA 182
Query: 1240 TTRETPFRMTYGVDAMLPVEIDNFT-WRTRPGFEEENQANMAVELDL--LSETRDEAH 1294
+P+R+ +G LPVEI++ W + +QA +L L L E R EA+
Sbjct: 181 PIGMSPYRVVFGKACHLPVEIEHKAYWAVKTCNFSMDQAGEERKLQLSELDEIRLEAY 8
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.320 0.136 0.406
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 56,458,559
Number of Sequences: 63676
Number of extensions: 740503
Number of successful extensions: 3391
Number of sequences better than 10.0: 70
Number of HSP's better than 10.0 without gapping: 3339
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3373
length of query: 1377
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1268
effective length of database: 5,698,948
effective search space: 7226266064
effective search space used: 7226266064
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0029b.1