
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144727.5 + phase: 0
(256 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum] 323 2e-87
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ... 177 3e-43
emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|49720... 157 2e-37
emb|CAB81170.1| retrotransposon like protein [Arabidopsis thalia... 153 5e-36
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha... 153 5e-36
ref|XP_475401.1| putative polyprotein [Oryza sativa (japonica cu... 150 3e-35
gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsi... 149 6e-35
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] 148 1e-34
gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsi... 147 3e-34
gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis ... 143 4e-33
emb|CAC95126.1| gag-pol polyprotein [Populus deltoides] 142 9e-33
gb|AAK51235.1| polyprotein [Arabidopsis thaliana] 141 2e-32
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas... 138 1e-31
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi... 138 2e-31
gb|AAF79879.1| T7N9.5 [Arabidopsis thaliana] 138 2e-31
ref|XP_476197.1| putative polyprotein [Oryza sativa (japonica cu... 136 6e-31
gb|AAT38747.1| putative polyprotein [Solanum demissum] 136 6e-31
gb|AAT85031.1| putative polyprotein [Oryza sativa (japonica cult... 136 6e-31
gb|AAU89728.1| putative retroelement pol polyprotein-like [Solan... 134 2e-30
ref|NP_909900.1| putative copia-like retrotransposon Hopscotch p... 133 5e-30
>gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]
Length = 1212
Score = 323 bits (829), Expect = 2e-87
Identities = 155/240 (64%), Positives = 185/240 (76%), Gaps = 1/240 (0%)
Query: 14 WFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINSDFRNVLVSPGLA 73
W +DSGASNHMT S+ L N+ Y G QIQIA+G+NL IT VGDI F+NV VSP L+
Sbjct: 320 WIVDSGASNHMTNSTSILKNVRKYQGPSQIQIANGSNLPITKVGDITPTFKNVFVSPKLS 379
Query: 74 SNLLSVGQLVDNNCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQF-ISNHLSLPCNN 132
++L+SVGQLVDNNC+VNFSR GC+VQ+QVSG +IAKGPKVGRLFP+ F I LS C +
Sbjct: 380 TSLISVGQLVDNNCDVNFSRNGCLVQDQVSGTIIAKGPKVGRLFPIHFSIPPVLSFACTS 439
Query: 133 VLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCLVCKLAKSKTLPFPSGAH 192
+ E WH++LGHPNS VLSH+ +GLLGNK ASI C CKL KSKTLPFP+
Sbjct: 440 TASKTEVWHKRLGHPNSVVLSHISNSGLLGNKNKFSVASIDCSTCKLGKSKTLPFPNFGS 499
Query: 193 RASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIYFLRSKSEVFSIVITISGY 252
RA+ CF++IHSDVWG+SPI SHA +KYF+TFIDDYSRFTW+YFLRSKSEVFS+ T Y
Sbjct: 500 RATKCFDVIHSDVWGISPIISHAHFKYFMTFIDDYSRFTWVYFLRSKSEVFSMFKTFLAY 559
>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37534632|ref|NP_921618.1| putative pol polyprotein
[Oryza sativa (japonica cultivar-group)]
Length = 1688
Score = 177 bits (449), Expect = 3e-43
Identities = 99/251 (39%), Positives = 137/251 (54%), Gaps = 19/251 (7%)
Query: 11 SRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINSD---FRNVL 67
S+PW LDSGAS HM+ +L + + A+G +T G I+S NV
Sbjct: 181 SQPWILDSGASFHMSFDDSWLTSCRLVKNGATVHTANGTLCKVTHQGSISSPQFTVPNVS 240
Query: 68 VSPGLASNLLSVGQLVDNNCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQFISNHLS 127
+ P L+ NL+SVGQL D NC V F C VQ++ +G VI G + R L +I + LS
Sbjct: 241 LVPKLSMNLISVGQLTDTNCFVGFDDTSCFVQDRHTGAVIGTGHRQKRSCGL-YILDSLS 299
Query: 128 LP-------------CNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISC 174
LP C+ S+ WH +LGH + L+ L G+LG+ V T C
Sbjct: 300 LPSSSTNTPSVYSPMCSTACKSFPQWHHRLGHLCGSRLATLINQGVLGSVPVDTT--FVC 357
Query: 175 LVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIY 234
CKL K LP+PS R+S F+++HSDVWG SP S + Y+V F+DDYSR+TWIY
Sbjct: 358 KGCKLGKQVQLPYPSSTSRSSRPFDLVHSDVWGKSPFPSKGGHNYYVIFVDDYSRYTWIY 417
Query: 235 FLRSKSEVFSI 245
F++ +S++ SI
Sbjct: 418 FMKHRSQLISI 428
>emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|4972079|emb|CAB43904.1|
putative protein [Arabidopsis thaliana]
gi|7444467|pir||T08945 hypothetical protein F25O24.20 -
Arabidopsis thaliana
Length = 1415
Score = 157 bits (398), Expect = 2e-37
Identities = 88/252 (34%), Positives = 133/252 (51%), Gaps = 15/252 (5%)
Query: 7 SSNVSRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDI------- 59
S S PW DSGA++H+T S+ L + Y G + + + + L IT +G
Sbjct: 286 SDQKSNPWVTDSGATSHITNSTSQLQSAQPYSGEDSVIVGNSDFLPITHIGSAVLTSNQG 345
Query: 60 NSDFRNVLVSPGLASNLLSVGQLV-DNNCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFP 118
N R+VLV P + +LLSV +L D C + F G +V+++++ +++ KG + L+
Sbjct: 346 NLPLRDVLVCPNITKSLLSVSKLTSDYPCVIEFDSDGVIVKDKLTKQLLTKGTRHNDLYL 405
Query: 119 LQFISNHLSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVC--TASISCLV 176
L+ S E WH +LGHPN VL L + NK +V T+ C
Sbjct: 406 LENPKFMACYSSRQQATSDEVWHMRLGHPNQDVLQQLLR-----NKAIVISKTSHSLCDA 460
Query: 177 CKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIYFL 236
C++ K LPF S +S E +H D+WG +P+ S ++Y+V FID+YSRFTW Y L
Sbjct: 461 CQMGKICKLPFASSDFVSSRLLERVHCDLWGPAPVVSSQGFRYYVIFIDNYSRFTWFYPL 520
Query: 237 RSKSEVFSIVIT 248
R KS+ FS+ +T
Sbjct: 521 RLKSDFFSVFLT 532
>emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana]
gi|4539447|emb|CAB40035.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444419|pir||T04204
hypothetical protein T4F9.150 - Arabidopsis thaliana
Length = 1515
Score = 153 bits (386), Expect = 5e-36
Identities = 85/246 (34%), Positives = 137/246 (55%), Gaps = 13/246 (5%)
Query: 11 SRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINSDF------- 63
S W DS A+ H+T +++ L N +Y G+ + + +G+ L IT +G I +
Sbjct: 321 SHEWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQGTLPL 380
Query: 64 RNVLVSPGLASNLLSVGQLVDNN-CNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQFI 122
+VLV PG+ +LLSV +L D+ C+ F V++++ + +++ +G K L+ L+ +
Sbjct: 381 EDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLKDV 440
Query: 123 SNHLSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKT-GLLGNKQVVCTASISCLVCKLAK 181
+ E WH++LGHPN VL HL KT ++ NK T+S C C++ K
Sbjct: 441 PFQTYYSTRQQSSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNK----TSSNMCEACQMGK 496
Query: 182 SKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIYFLRSKSE 241
LPF + +S E IH D+WG +P+ S ++Y+V FID+YSRFTW Y L+ KS+
Sbjct: 497 VCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSRFTWFYPLKLKSD 556
Query: 242 VFSIVI 247
FS+ +
Sbjct: 557 FFSVFV 562
>gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana]
gi|7444456|pir||T01908 hypothetical protein T12H20.12 -
Arabidopsis thaliana
Length = 1392
Score = 153 bits (386), Expect = 5e-36
Identities = 85/246 (34%), Positives = 137/246 (55%), Gaps = 13/246 (5%)
Query: 11 SRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINSDF------- 63
S W DS A+ H+T +++ L N +Y G+ + + +G+ L IT +G I +
Sbjct: 324 SHEWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQGTLPL 383
Query: 64 RNVLVSPGLASNLLSVGQLVDNN-CNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQFI 122
+VLV PG+ +LLSV +L D+ C+ F V++++ + +++ +G K L+ L+ +
Sbjct: 384 EDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLKDV 443
Query: 123 SNHLSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKT-GLLGNKQVVCTASISCLVCKLAK 181
+ E WH++LGHPN VL HL KT ++ NK T+S C C++ K
Sbjct: 444 PFQTYYSTRQQSSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNK----TSSNMCEACQMGK 499
Query: 182 SKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIYFLRSKSE 241
LPF + +S E IH D+WG +P+ S ++Y+V FID+YSRFTW Y L+ KS+
Sbjct: 500 VCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSRFTWFYPLKLKSD 559
Query: 242 VFSIVI 247
FS+ +
Sbjct: 560 FFSVFV 565
>ref|XP_475401.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|49328070|gb|AAT58770.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1419
Score = 150 bits (380), Expect = 3e-35
Identities = 89/266 (33%), Positives = 139/266 (51%), Gaps = 31/266 (11%)
Query: 7 SSNVSRP-WFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINSD--- 62
S+NV+ W LDSGAS HMT L + S +Q ADG L ++ G + +
Sbjct: 480 SANVTDSCWILDSGASFHMTPDISQLQS-CSLTKASSVQTADGTILPVSLQGTLQTKEYT 538
Query: 63 FRNVLVSPGLASNLLSVGQLVDNNCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQFI 122
+V P L+ L+SVGQL D C+V F A C V ++ +G ++ G ++ L ++
Sbjct: 539 IPDVFYVPNLSMKLISVGQLTDMKCHVVFDEAACYVLDRATGNLVGAGHRLNGPRGL-YV 597
Query: 123 SNHLSLPCN-----------------------NVLNSYEDWHRKLGHPNSTVLSHLFKTG 159
+HL LP + ++ S+ WH +LGH + LS L + G
Sbjct: 598 LDHLHLPTSTSSGFPGNSASATSITSNSSVYSSLSASFPQWHHRLGHLCGSRLSTLVQQG 657
Query: 160 LLGNKQVVCTASISCLVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKY 219
+LGN + C CKL K LP+ S R+++ F ++HSDVWG +P S ++Y
Sbjct: 658 VLGNVSI--ETDFVCKGCKLGKQVQLPYRSSMSRSTSPFALVHSDVWGPAPFHSKGGHRY 715
Query: 220 FVTFIDDYSRFTWIYFLRSKSEVFSI 245
+V F+DD+SR+TWIYF++ +SE++ +
Sbjct: 716 YVIFVDDFSRYTWIYFMKHRSELYQV 741
>gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301693|pir||F84480 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1402
Score = 149 bits (377), Expect = 6e-35
Identities = 85/243 (34%), Positives = 127/243 (51%), Gaps = 12/243 (4%)
Query: 14 WFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINS-------DFRNV 66
W DS A+ H+T S L YHG+ + +ADGN L IT G N +V
Sbjct: 332 WLPDSAATAHVTNSPRSLQQSQPYHGSDAVMVADGNFLPITHTGSTNLASSSGNVPLTDV 391
Query: 67 LVSPGLASNLLSVGQLV-DNNCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQFISNH 125
LV P + +LLSV +L D C V F G + ++ + K++ G L+ L+ S
Sbjct: 392 LVCPSITKSLLSVSKLTQDYPCTVEFDSDGVRINDKATKKLLIMGSTCDGLYCLKDDSQF 451
Query: 126 LSLPCNNVLNSYED-WHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCLVCKLAKSKT 184
+ ++ ++ WHR+LGHP+ VL L KT + + T+ C C+L KS
Sbjct: 452 KAFFSTRQQSASDEVWHRRLGHPHPQVLQQLVKTNSISINK---TSKSLCEACQLGKSTR 508
Query: 185 LPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIYFLRSKSEVFS 244
LPF S + ++ E +H D+WG SPI S ++Y+ FID YSRF+WIY L+ KS+ ++
Sbjct: 509 LPFVSSSFTSNRPLERVHCDLWGPSPITSVQGFRYYAVFIDHYSRFSWIYPLKLKSDFYN 568
Query: 245 IVI 247
I +
Sbjct: 569 IFV 571
>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 148 bits (374), Expect = 1e-34
Identities = 89/255 (34%), Positives = 134/255 (51%), Gaps = 22/255 (8%)
Query: 7 SSNVSRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGD--INSD-- 62
S + W+ DS A+ H+T S+ L N +Y GN + + DG L IT VG I+S
Sbjct: 317 SDETGKEWYPDSAATAHITASTSGLQNATTYEGNDAVLVGDGTYLPITHVGSTTISSSKG 376
Query: 63 ---FRNVLVSPGLASNLLSVGQLVDNN-CNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFP 118
VLV P + +LLSV +L D+ C V F + + + KV++KGP+ L+
Sbjct: 377 TIPLNEVLVCPAIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLTTQKVVSKGPRNNGLYM 436
Query: 119 LQ---FISNHLSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASIS-- 173
L+ F++ + + C S E WH +LGH NS +L L L K++ S +
Sbjct: 437 LENSEFVALYSNRQC---AASMETWHHRLGHSNSKILQQL-----LTRKEIQVNKSRTSP 488
Query: 174 -CLVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTW 232
C C++ KS L F S RA + +H D+WG SP+ S+ +KY+ F+DD+SRF+W
Sbjct: 489 VCEPCQMGKSTRLQFFSSDFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSRFSW 548
Query: 233 IYFLRSKSEVFSIVI 247
+ LR KS+ S+ I
Sbjct: 549 FFPLRMKSKFISVFI 563
>gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301708|pir||B84523 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1149
Score = 147 bits (371), Expect = 3e-34
Identities = 89/262 (33%), Positives = 134/262 (50%), Gaps = 18/262 (6%)
Query: 3 IHVKSSNVSRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINSD 62
+H+ + W DS A+ H+T +S L + Y GN + +DGN L IT +G N
Sbjct: 303 LHITDVSDDSGWVPDSAATAHITNNSSRLQQMQPYLGNDTVMASDGNFLPITHIGSANLP 362
Query: 63 -------FRNVLVSPGLASNLLSVGQLV-DNNCNVNFSRAGCVVQEQVSGKVIAKGPKVG 114
++VLV P +A +LLSV +L D C+ F G +V+++ + KV+ KG
Sbjct: 363 STSGNLPLKDVLVCPNIAKSLLSVSKLTKDYPCSFTFDADGVLVKDKATCKVLTKGSSTS 422
Query: 115 R-LFPLQFISNHLSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNK---QVVCTA 170
L+ L+ + V + E WH +LGHPN VL LL NK Q+ +
Sbjct: 423 EGLYKLENPKFQMFYSTRQVKATDEVWHMRLGHPNPQVLQ------LLANKKAIQINKST 476
Query: 171 SISCLVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRF 230
S C C+L KS LPF + AS E +H D+WG +P++S ++Y+V FID+ SRF
Sbjct: 477 SKMCESCRLGKSSRLPFIASDFIASRPLERVHCDLWGPAPVSSIQGFQYYVIFIDNRSRF 536
Query: 231 TWIYFLRSKSEVFSIVITISGY 252
W Y L+ KS+ S+ + +
Sbjct: 537 CWFYPLKHKSDFCSLFMKFQSF 558
>gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis thaliana]
gi|25301689|pir||C96578 hypothetical protein T18A20.5
[imported] - Arabidopsis thaliana
Length = 1522
Score = 143 bits (361), Expect = 4e-33
Identities = 91/244 (37%), Positives = 122/244 (49%), Gaps = 14/244 (5%)
Query: 14 WFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGD--INSD-----FRNV 66
W DS AS H+T + L YHG+ I +ADGN L IT G I S + V
Sbjct: 326 WIPDSAASAHVTNNRHVLQQSQPYHGSDSIMVADGNFLPITHTGSGSIASSSGKIPLKEV 385
Query: 67 LVSPGLASNLLSVGQLV-DNNCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQFISNH 125
LV P + +LLSV +L D C+V F + ++ + K++ G L+ L+
Sbjct: 386 LVCPDIVKSLLSVSKLTSDYPCSVEFDADSVRINDKATKKLLVMGRNRDGLYSLEEPKLQ 445
Query: 126 LSLPCNNVLNSYEDWHRKLGHPNSTVLSHLF--KTGLLGNKQVVCTASISCLVCKLAKSK 183
+ S E WHR+LGH N+ VL L K+ ++ NK V C C L KS
Sbjct: 446 VLYSTRQNSASSEVWHRRLGHANAEVLHQLASSKSIIIINKVVKTV----CEACHLGKST 501
Query: 184 TLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIYFLRSKSEVF 243
LPF AS E IH D+WG SP +S ++Y+V FID YSRFTW Y L+ KS+ F
Sbjct: 502 RLPFMLSTFNASRPLERIHCDLWGPSPTSSVQGFRYYVVFIDHYSRFTWFYPLKLKSDFF 561
Query: 244 SIVI 247
S +
Sbjct: 562 STFV 565
>emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]
Length = 1382
Score = 142 bits (358), Expect = 9e-33
Identities = 90/254 (35%), Positives = 138/254 (53%), Gaps = 18/254 (7%)
Query: 7 SSNVSRP-WFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDI---NSD 62
SS +S W LDSGAS+HM+ S +++ + + ADG + + VG + +
Sbjct: 334 SSGISHSEWVLDSGASHHMSPDSSSFTSVSPL-SSIPVMTADGTPMPLAGVGSVVTLHLS 392
Query: 63 FRNVLVSPGLASNLLSVGQLVDN-NCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQF 121
NV + P L NL S+GQ+ D+ + V FS + C VQ+ S K+I G + L+ L
Sbjct: 393 LPNVYLIPKLKLNLASIGQICDSGDYLVMFSGSFCCVQDLQSQKLIGTGRRENGLYILDE 452
Query: 122 ISNHLSLPCNNV----------LNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTAS 171
+ + + V +S+ WH +LGH +S+ L L TG LGN + C S
Sbjct: 453 LKVPVVVAATTVDLSFFRLSLSSSSFYLWHSRLGHVSSSRLRFLASTGALGNLKT-CDIS 511
Query: 172 ISCLVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFT 231
C CKLAK LPF +S+ F++IHSDVWG SP+++ +Y+V+FIDD++R+
Sbjct: 512 -DCSGCKLAKFSALPFNRSTSVSSSPFDLIHSDVWGPSPVSTKGGSRYYVSFIDDHTRYC 570
Query: 232 WIYFLRSKSEVFSI 245
W+Y ++ +SE F I
Sbjct: 571 WVYLMKHRSEFFEI 584
>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
Length = 1453
Score = 141 bits (356), Expect = 2e-32
Identities = 88/253 (34%), Positives = 136/253 (52%), Gaps = 18/253 (7%)
Query: 7 SSNVSRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGD--INSD-- 62
S + + W DS A+ H+T S+ L + Y+G+ + + DG L IT VG I+SD
Sbjct: 318 SDSSGKEWVPDSAATAHVTSSTNNLQAASPYNGSDTVLVGDGAYLPITHVGSTTISSDSG 377
Query: 63 ---FRNVLVSPGLASNLLSVGQLVDNN-CNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFP 118
VLV P + +LLSV +L D+ C V F + + + KV++KGP+ L+
Sbjct: 378 TLPLNEVLVCPDIQKSLLSVSKLCDDYPCGVYFDANKVCIIDINTQKVVSKGPRSNGLYV 437
Query: 119 LQ---FISNHLSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLG-NKQVVCTASISC 174
L+ F++ + + C S E WH +LGH NS +L L + + NK + S C
Sbjct: 438 LENQEFVAFYSNRQC---AASEEIWHHRLGHSNSRILQQLKSSKEISFNKSRM---SPVC 491
Query: 175 LVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIY 234
C++ KS L F S R + IH D+WG SP+ S +KY+V F+DDYSR++W Y
Sbjct: 492 EPCQMGKSSKLQFFSSNSRELDLLGRIHCDLWGPSPVVSKQGFKYYVVFVDDYSRYSWFY 551
Query: 235 FLRSKSEVFSIVI 247
L++KS+ F++ +
Sbjct: 552 PLKAKSDFFAVFV 564
>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
Arabidopsis thaliana BAC gb|AF080119 and is a member of
the reverse transcriptase family PF|00078
gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
Arabidopsis thaliana
Length = 1415
Score = 138 bits (348), Expect = 1e-31
Identities = 85/254 (33%), Positives = 127/254 (49%), Gaps = 18/254 (7%)
Query: 7 SSNVSRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINSDFRN- 65
S + + W DS A+ H+T S+ L + Y G+ + + DG L IT G N
Sbjct: 315 SDDTGKEWHPDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTYLPITHTGSTTIKSSNG 374
Query: 66 ------VLVSPGLASNLLSVGQLVDNN-CNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFP 118
VLV P + +LLSV +L D+ C V F + + + KV+ GP+ L+
Sbjct: 375 KIPLNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLQTQKVVTTGPRRNGLYV 434
Query: 119 LQ---FISNHLSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLG-NKQVVCTASISC 174
L+ F++ + + C + E WH +LGH NS L HL + + NK S C
Sbjct: 435 LENQEFVALYSNRQC---AATEEVWHHRLGHANSKALQHLQNSKAIQINKS---RTSPVC 488
Query: 175 LVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIY 234
C++ KS LPF R + + IH D+WG SP+ S+ KY+ F+DDYSR++W Y
Sbjct: 489 EPCQMGKSSRLPFLISDSRVLHPLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSRYSWFY 548
Query: 235 FLRSKSEVFSIVIT 248
L +KSE S+ I+
Sbjct: 549 PLHNKSEFLSVFIS 562
>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301701|pir||E84589 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1461
Score = 138 bits (347), Expect = 2e-31
Identities = 84/240 (35%), Positives = 131/240 (54%), Gaps = 11/240 (4%)
Query: 11 SRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGD--INSDF--RNV 66
S W +DSGA++H++ + L + + + + G N+ I+ VG IN D +NV
Sbjct: 439 SDTWVIDSGATHHVSHDRKLFQTLDTSIVSF-VNLPTGPNVRISGVGTVLINKDIILQNV 497
Query: 67 LVSPGLASNLLSVGQLV-DNNCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQFISNH 125
L P NL+S+ L D V F + C +Q+ G + +G ++G L+ L S
Sbjct: 498 LFIPEFRLNLISISSLTTDLGTRVIFDPSCCQIQDLTKGLTLGEGKRIGNLYVLDTQSPA 557
Query: 126 LSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCLVCKLAKSKTL 185
+S+ N + WH++LGHP+ + L L + +LG + S C VC LAK K L
Sbjct: 558 ISV---NAVVDVSVWHKRLGHPSFSRLDSLSE--VLGTTRHKNKKSAYCHVCHLAKQKKL 612
Query: 186 PFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIYFLRSKSEVFSI 245
FPS + ++ FE++H DVWG + + YKYF+T +DD+SR TWIY L+SKS+V ++
Sbjct: 613 SFPSANNICNSTFELLHIDVWGPFSVETVEGYKYFLTIVDDHSRATWIYLLKSKSDVLTV 672
>gb|AAF79879.1| T7N9.5 [Arabidopsis thaliana]
Length = 1436
Score = 138 bits (347), Expect = 2e-31
Identities = 82/246 (33%), Positives = 124/246 (50%), Gaps = 14/246 (5%)
Query: 12 RPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINS----DFRNVL 67
R W +DSGAS+H+T H + +++ +G+ + I G I NVL
Sbjct: 419 RAWVIDSGASHHVTHERNLYHTYKALD-RTFVRLPNGHTVKIEGTGFIQLTDALSLHNVL 477
Query: 68 VSPGLASNLLSVGQLVDN-NCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQFISNHL 126
P NLLSV L V+F+ C++Q ++ KG +VG L+ L + +
Sbjct: 478 FIPEFKFNLLSVSVLTKTLQSKVSFTSDECMIQALTKELMLGKGSQVGNLYILNLDKSLV 537
Query: 127 SLP-------CNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCLVCKL 179
+ C++V N E WH++LGHP+ + L +L KQ + S C VC L
Sbjct: 538 DVSSFPGKSVCSSVKNESEMWHKRLGHPSFAKIDTLSDVLMLP-KQKINKDSSHCHVCHL 596
Query: 180 AKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIYFLRSK 239
+K K LPF S H FE++H D WG + + Y+YF+T +DD+SR TWIY L+ K
Sbjct: 597 SKQKHLPFKSVNHIREKAFELVHIDTWGPFSVPTVDSYRYFLTIVDDFSRATWIYLLKQK 656
Query: 240 SEVFSI 245
S+V ++
Sbjct: 657 SDVLTV 662
>ref|XP_476197.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|46981313|gb|AAT07631.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|46981245|gb|AAT07563.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1431
Score = 136 bits (342), Expect = 6e-31
Identities = 89/249 (35%), Positives = 132/249 (52%), Gaps = 26/249 (10%)
Query: 10 VSRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDI-------NSD 62
V W++D+GA++H+TG + L Y G QI A G SI VG
Sbjct: 309 VDTNWYVDTGATDHITGQLDKLTTKERYKGTDQIHTASGEGTSIKHVGHAIVPTPSHPLH 368
Query: 63 FRNVLVSPGLASNLLSVGQLV-DNNCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQF 121
+NVL P A NL+SV +LV DN + +++++ + + I +GP L+PL
Sbjct: 369 LKNVLHVPEAAKNLVSVHKLVADNYAFLEIHGKYFLIKDKATRRTILEGPCRRGLYPLPA 428
Query: 122 ISN----HLSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASIS---C 174
S+ ++ P S+ WH +LGHP+ ++ + L NK + S++ C
Sbjct: 429 RSSLRQAFVATP------SFVRWHGRLGHPSKPIVLRI----LSQNKLPCLSNSVNESVC 478
Query: 175 LVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIY 234
C+ AK LPFP ++N E+IHSDVWG + + A+ KY+V+FIDDYS+F WIY
Sbjct: 479 DACQQAKCHQLPFPRSTSVSNNPLELIHSDVWGPASESVGAK-KYYVSFIDDYSKFVWIY 537
Query: 235 FLRSKSEVF 243
FL+ KSEVF
Sbjct: 538 FLKHKSEVF 546
>gb|AAT38747.1| putative polyprotein [Solanum demissum]
Length = 1336
Score = 136 bits (342), Expect = 6e-31
Identities = 83/240 (34%), Positives = 130/240 (53%), Gaps = 15/240 (6%)
Query: 11 SRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINSD----FRNV 66
S W +DSGA++HMTG+ ++ ++ + I DG++ +I G +N +V
Sbjct: 362 SSNWIIDSGATDHMTGNPKFFSKFQAHKVPSSVTIVDGSSYTIEGSGTVNHTSSITLSSV 421
Query: 67 LVSPGLASNLLSVGQLVDN-NCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFPLQFISNH 125
L P A NL+SV +L C V+ C+ Q+ ++ ++I K L+ L +
Sbjct: 422 LGLPSHAFNLISVSKLTKELKCFVSLYPDHCLFQDLMTKQIIGKRHVSDGLYILDEWTPP 481
Query: 126 LSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCLVCKLAKSKTL 185
S+ C+++++ +E H +LGHP+ VL L Q SI C C AK +
Sbjct: 482 -SVACSSIVSPFEA-HCRLGHPSLPVLKKLCP-------QFHNVPSIDCESCHFAKHHRI 532
Query: 186 PF-PSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIYFLRSKSEVFS 244
P RA+ FE++HSDVWG P+ S ++YFVTF+DD+SR TWIYF++++SEVFS
Sbjct: 533 SLSPRNNKRANFAFELVHSDVWGPCPVVSKVGFRYFVTFMDDFSRMTWIYFMKNRSEVFS 592
>gb|AAT85031.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1437
Score = 136 bits (342), Expect = 6e-31
Identities = 86/248 (34%), Positives = 127/248 (50%), Gaps = 13/248 (5%)
Query: 7 SSNVSRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDI------- 59
S V W+LD+GA++H+TG + L YHGN Q+ A G + I+ +G+
Sbjct: 312 SYGVDTNWYLDTGATDHVTGELDKLTVRDKYHGNDQVHTASGAGMEISHIGNSVVKTPSR 371
Query: 60 NSDFRNVLVSPGLASNLLSVGQLV-DNNCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFP 118
N ++VL P NL+S +L DN + R +++ + + +G L+
Sbjct: 372 NLHLKDVLYVPKANKNLVSAYKLTSDNLAFIELYRKFFFIKDLAMRRTLLRGRCHKGLYA 431
Query: 119 LQFISNH---LSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCL 175
L S+H + S+E WH +LGHP+ TV+ + K+ L V S+ C
Sbjct: 432 LPSPSSHHHQVKQVYGVTKPSFERWHSRLGHPSYTVVEKVIKSQNLPCLDVSEQVSV-CD 490
Query: 176 VCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIYF 235
C+ AKS L FP + E++ SDVWG +P S KY+V+FIDDYS+FTWIY
Sbjct: 491 ACQKAKSHQLSFPKSTSESKYPLELVFSDVWGPAP-QSVGNNKYYVSFIDDYSKFTWIYL 549
Query: 236 LRSKSEVF 243
L+ KSEVF
Sbjct: 550 LKYKSEVF 557
>gb|AAU89728.1| putative retroelement pol polyprotein-like [Solanum tuberosum]
Length = 1476
Score = 134 bits (338), Expect = 2e-30
Identities = 83/252 (32%), Positives = 132/252 (51%), Gaps = 14/252 (5%)
Query: 4 HVKSSNVSRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINSD- 62
H S+ S W +DSGA++HM ++ L++ S ++Q+ G++ +T G
Sbjct: 408 HCNSNTHSSAWIVDSGATDHMVSNTTLLNHGLSVSHPGKVQLPTGDSAVVTHSGSSQLTG 467
Query: 63 ---FRNVLVSPGLASNLLSVGQLVDN-NCNVNFSRAGCVVQEQVSGKVIAKGPKVGRLFP 118
+NVL P NLLSV +L NC V F ++Q+ +GKV G ++ L+
Sbjct: 468 GDVVKNVLCVPTFQFNLLSVSKLTKELNCCVIFFPDFFIIQDLFTGKVKEIGEEINGLYI 527
Query: 119 LQFISNH----LSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISC 174
+ +H +L E WH++LGH +VL K + + Q + S C
Sbjct: 528 TRPHQHHDTSKKTLAAIKGCEEAEMWHKRLGHIPMSVLR---KIKMFDSPQKLVLPS--C 582
Query: 175 LVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWIY 234
VC LA+ LPFP R+ NCF++IH DVWG A+H + +YF+T +DD+SR+TWI+
Sbjct: 583 DVCPLARQVRLPFPISQSRSENCFDLIHLDVWGPYKAATHNKMRYFLTVVDDHSRWTWIF 642
Query: 235 FLRSKSEVFSIV 246
+ KS+V +++
Sbjct: 643 LMHLKSDVSTVL 654
>ref|NP_909900.1| putative copia-like retrotransposon Hopscotch polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|12957712|gb|AAK09230.1| putative copia-like
retrotransposon Hopscotch polyprotein [Oryza sativa
(japonica cultivar-group)] gi|14718304|gb|AAK72882.1|
putative gag-pol protein [Oryza sativa]
Length = 1219
Score = 133 bits (334), Expect = 5e-30
Identities = 83/250 (33%), Positives = 130/250 (51%), Gaps = 12/250 (4%)
Query: 2 GIHVKSSNVSRPWFLDSGASNHMTGSSEYLHNLASYHGNQQIQIADGNNLSITDVGDINS 61
G + V W++D+ A++H+TG + L Y G QI A G + I +G
Sbjct: 302 GAATHAYGVDTNWYVDTEATDHITGQLDKLTTREKYKGTDQIHTASGEGMDIQHIGHSYV 361
Query: 62 D-------FRNVLVSPGLASNLLSVGQLV-DNNCNVNFSRAGCVVQEQVSGKVIAKGPKV 113
+N+L P + NL+SV +LV DN + + +++++V+ + I +GP
Sbjct: 362 PTSSRPLHLKNILHVPKASKNLISVHRLVADNYAFLEIHQKYFLIKDKVTRRTILEGPCR 421
Query: 114 GRLFPLQFISNHLSLPCNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASIS 173
L+PL + V+ S+E WH +LGH + ++ + L + S+
Sbjct: 422 RDLYPLP--AGDPIKQVFAVMPSFERWHGRLGHASKPIVLRVINQNKLPCSNESPSESV- 478
Query: 174 CLVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHARYKYFVTFIDDYSRFTWI 233
C C+ KS LPFP +SN E+IHSDVWG + + A+ +Y+V+FIDDYS+F WI
Sbjct: 479 CDACQQGKSHQLPFPKFFSVSSNPLELIHSDVWGPASDSVGAK-RYYVSFIDDYSKFVWI 537
Query: 234 YFLRSKSEVF 243
YFL+ KSEVF
Sbjct: 538 YFLKFKSEVF 547
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.320 0.135 0.413
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 430,384,211
Number of Sequences: 2540612
Number of extensions: 17253367
Number of successful extensions: 37626
Number of sequences better than 10.0: 795
Number of HSP's better than 10.0 without gapping: 579
Number of HSP's successfully gapped in prelim test: 216
Number of HSP's that attempted gapping in prelim test: 35849
Number of HSP's gapped (non-prelim): 907
length of query: 256
length of database: 863,360,394
effective HSP length: 125
effective length of query: 131
effective length of database: 545,783,894
effective search space: 71497690114
effective search space used: 71497690114
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 74 (33.1 bits)
Medicago: description of AC144727.5