
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC141862.21 - phase: 0 /pseudo
(659 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q6RZV4 Retrotransposon-like protein [Musa acuminata] 109 2e-22
UniRef100_Q84VI4 Gag-pol polyprotein [Glycine max] 91 1e-16
UniRef100_Q84VI2 Gag-pol polyprotein [Glycine max] 91 1e-16
UniRef100_Q84VH8 Gag-pol polyprotein [Glycine max] 91 1e-16
UniRef100_Q84VI0 Gag-pol polyprotein [Glycine max] 88 9e-16
UniRef100_O65147 Gag-pol polyprotein [Glycine max] 88 9e-16
UniRef100_Q84VH6 Gag-pol polyprotein [Glycine max] 87 1e-15
UniRef100_Q9SKW9 F5J5.1 [Arabidopsis thaliana] 84 1e-14
UniRef100_Q8H6I4 Putative gag-pol polyprotein [Zea mays] 83 3e-14
UniRef100_Q852C7 Putative gag-pol polyprotein [Oryza sativa] 81 1e-13
UniRef100_Q7XWL7 OSJNBb0052B05.12 protein [Oryza sativa] 80 1e-13
UniRef100_Q948B3 Putative polyprotein [Oryza sativa] 80 2e-13
UniRef100_Q8LMY8 Putative polyprotein [Oryza sativa] 79 6e-13
UniRef100_Q7XPD6 OSJNBa0042N22.2 protein [Oryza sativa] 79 6e-13
UniRef100_Q8H7T1 Putative Zea mays retrotransposon Opie-2 [Oryza... 79 6e-13
UniRef100_Q7XPI7 OSJNBb0004A17.2 protein [Oryza sativa] 79 6e-13
UniRef100_Q850U8 Putative polyprotein [Oryza sativa] 78 7e-13
UniRef100_Q688I4 Putative polyprotein [Oryza sativa] 78 9e-13
UniRef100_Q94I67 Putative retroelement [Oryza sativa] 77 1e-12
UniRef100_Q94H69 Putative polyprotein [Oryza sativa] 77 1e-12
>UniRef100_Q6RZV4 Retrotransposon-like protein [Musa acuminata]
Length = 215
Score = 109 bits (273), Expect = 2e-22
Identities = 56/88 (63%), Positives = 63/88 (70%), Gaps = 1/88 (1%)
Query: 567 AREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTIGN-SSISI 625
++ + + WYLDSGCSRHMTG+ S F LT D G V FG N GKIIG GTIGN S+ I
Sbjct: 124 SQARSKRWYLDSGCSRHMTGDPSQFSKLTSIDEGYVTFGDNNKGKIIGKGTIGNKSNFFI 183
Query: 626 NNVWLVDGLKHNLLSISQFCDNGYDVMF 653
+V LVDGLKHNLLSISQ CD GY V F
Sbjct: 184 EDVLLVDGLKHNLLSISQLCDKGYIVKF 211
>UniRef100_Q84VI4 Gag-pol polyprotein [Glycine max]
Length = 1574
Score = 90.9 bits (224), Expect = 1e-16
Identities = 49/101 (48%), Positives = 61/101 (59%), Gaps = 2/101 (1%)
Query: 559 LKLQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTI 618
L + LRA K+ WYLDSGCSRHMTG K L + V FG GKIIG G +
Sbjct: 546 LVVHTSLRASAKE-DWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKL 604
Query: 619 GNSSI-SINNVWLVDGLKHNLLSISQFCDNGYDVMFSKTNC 658
+ + S+N V LV GL NL+SISQ CD G++V F+K+ C
Sbjct: 605 VHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSEC 645
>UniRef100_Q84VI2 Gag-pol polyprotein [Glycine max]
Length = 1576
Score = 90.9 bits (224), Expect = 1e-16
Identities = 49/101 (48%), Positives = 61/101 (59%), Gaps = 2/101 (1%)
Query: 559 LKLQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTI 618
L + LRA K+ WYLDSGCSRHMTG K L + V FG GKIIG G +
Sbjct: 548 LVVHTSLRASAKE-DWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKL 606
Query: 619 GNSSI-SINNVWLVDGLKHNLLSISQFCDNGYDVMFSKTNC 658
+ + S+N V LV GL NL+SISQ CD G++V F+K+ C
Sbjct: 607 VHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSEC 647
>UniRef100_Q84VH8 Gag-pol polyprotein [Glycine max]
Length = 1576
Score = 90.9 bits (224), Expect = 1e-16
Identities = 49/101 (48%), Positives = 61/101 (59%), Gaps = 2/101 (1%)
Query: 559 LKLQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTI 618
L + LRA K+ WYLDSGCSRHMTG K L + V FG GKIIG G +
Sbjct: 548 LVVHTSLRASAKE-DWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKL 606
Query: 619 GNSSI-SINNVWLVDGLKHNLLSISQFCDNGYDVMFSKTNC 658
+ + S+N V LV GL NL+SISQ CD G++V F+K+ C
Sbjct: 607 VHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSEC 647
>UniRef100_Q84VI0 Gag-pol polyprotein [Glycine max]
Length = 1576
Score = 87.8 bits (216), Expect = 9e-16
Identities = 47/101 (46%), Positives = 60/101 (58%), Gaps = 2/101 (1%)
Query: 559 LKLQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTI 618
L + LRA K+ WYLDSGCSRHMTG K + + V FG GKI G G +
Sbjct: 548 LVVHTSLRASAKE-DWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKL 606
Query: 619 GNSSI-SINNVWLVDGLKHNLLSISQFCDNGYDVMFSKTNC 658
+ + S+N V LV GL NL+SISQ CD G++V F+K+ C
Sbjct: 607 VHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSEC 647
>UniRef100_O65147 Gag-pol polyprotein [Glycine max]
Length = 1550
Score = 87.8 bits (216), Expect = 9e-16
Identities = 47/101 (46%), Positives = 60/101 (58%), Gaps = 2/101 (1%)
Query: 559 LKLQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTI 618
L + LRA K+ WYLDSGCSRHMTG K + + V FG GKI G G +
Sbjct: 522 LVVHTSLRASAKE-DWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKL 580
Query: 619 GNSSI-SINNVWLVDGLKHNLLSISQFCDNGYDVMFSKTNC 658
+ + S+N V LV GL NL+SISQ CD G++V F+K+ C
Sbjct: 581 VHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSEC 621
>UniRef100_Q84VH6 Gag-pol polyprotein [Glycine max]
Length = 1577
Score = 87.4 bits (215), Expect = 1e-15
Identities = 47/101 (46%), Positives = 60/101 (58%), Gaps = 2/101 (1%)
Query: 559 LKLQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTI 618
L + LRA K+ WYLDSGCSRHMTG K + + V FG GKI G G +
Sbjct: 549 LVVHTSLRASAKE-DWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKL 607
Query: 619 GNSSI-SINNVWLVDGLKHNLLSISQFCDNGYDVMFSKTNC 658
+ + S+N V LV GL NL+SISQ CD G++V F+K+ C
Sbjct: 608 VHEGLPSLNKVLLVKGLTVNLISISQLCDEGFNVNFTKSEC 648
>UniRef100_Q9SKW9 F5J5.1 [Arabidopsis thaliana]
Length = 1463
Score = 84.3 bits (207), Expect = 1e-14
Identities = 41/91 (45%), Positives = 57/91 (62%), Gaps = 1/91 (1%)
Query: 569 EKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGEVKFGGNQTGKIIGTGTIG-NSSISINN 627
+ R+WY DSGCSRHMTG +S+ ++ + G+V FG G I G G I + ++N
Sbjct: 558 QNNRAWYFDSGCSRHMTGVQSILKDFSVINNGKVTFGDGGKGSIKGKGKIEIDDQPHLSN 617
Query: 628 VWLVDGLKHNLLSISQFCDNGYDVMFSKTNC 658
V+ V+GL NL+SISQ CD+ V F+KT C
Sbjct: 618 VYFVEGLTANLISISQLCDDDLTVTFTKTGC 648
>UniRef100_Q8H6I4 Putative gag-pol polyprotein [Zea mays]
Length = 2319
Score = 82.8 bits (203), Expect = 3e-14
Identities = 43/84 (51%), Positives = 57/84 (67%), Gaps = 2/84 (2%)
Query: 573 SWYLDSGCSRHMTGEKSLFLTLTM-KDGGEVKFGGNQTGKIIGTGTIGNSSI-SINNVWL 630
SW LDSGC+ HMTGEK +F TL + ++ E+ FG + K+IG G I S S++NV L
Sbjct: 734 SWVLDSGCTNHMTGEKDMFHTLQLTQEAQEIVFGDSGKSKVIGIGKIPISDQQSLSNVLL 793
Query: 631 VDGLKHNLLSISQFCDNGYDVMFS 654
VD L +NLLS+SQ C GY+ +FS
Sbjct: 794 VDSLSYNLLSVSQLCGMGYNCLFS 817
>UniRef100_Q852C7 Putative gag-pol polyprotein [Oryza sativa]
Length = 1969
Score = 80.9 bits (198), Expect = 1e-13
Identities = 41/88 (46%), Positives = 54/88 (60%), Gaps = 2/88 (2%)
Query: 570 KQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGE-VKFGGNQTGKIIGTGTIG-NSSISINN 627
+ SW +DSGCSRHMTGE F +LT G E + FG +G+++ GTI N + +
Sbjct: 750 RSNSWLVDSGCSRHMTGEAKWFTSLTRASGDETITFGDASSGRVMAKGTIKVNDKFMLKD 809
Query: 628 VWLVDGLKHNLLSISQFCDNGYDVMFSK 655
V LV LK+NLLS+SQ CD +V F K
Sbjct: 810 VALVSKLKYNLLSVSQLCDENLEVRFKK 837
>UniRef100_Q7XWL7 OSJNBb0052B05.12 protein [Oryza sativa]
Length = 1549
Score = 80.5 bits (197), Expect = 1e-13
Identities = 41/91 (45%), Positives = 60/91 (65%), Gaps = 3/91 (3%)
Query: 572 RSWYLDSGCSRHMTGEKSLFLTLTMKDGGE--VKFGGNQTGKIIGTGTIGNSS-ISINNV 628
RSW +DSGC+ HMTGE+S+F +L + + FG + GK+IG G I S+ +SI+NV
Sbjct: 696 RSWVIDSGCTNHMTGEESMFSSLDPNGTSQENIIFGDDGKGKVIGLGKIAISNDLSISNV 755
Query: 629 WLVDGLKHNLLSISQFCDNGYDVMFSKTNCT 659
LV L +NLLS+SQ C G++ +F+ + T
Sbjct: 756 LLVKSLNYNLLSVSQLCSMGFNCLFTDVDVT 786
>UniRef100_Q948B3 Putative polyprotein [Oryza sativa]
Length = 1407
Score = 79.7 bits (195), Expect = 2e-13
Identities = 40/91 (43%), Positives = 59/91 (63%), Gaps = 3/91 (3%)
Query: 572 RSWYLDSGCSRHMTGEKSLFLTLTMKDGGE--VKFGGNQTGKIIGTGTIG-NSSISINNV 628
RSW +DSGC+ HMTGE+S+F +L + + FG + GK++G G I + +SI+NV
Sbjct: 698 RSWVIDSGCTNHMTGEESMFCSLDPNGSLQENIVFGDDGKGKVMGLGKIAIYNDLSISNV 757
Query: 629 WLVDGLKHNLLSISQFCDNGYDVMFSKTNCT 659
LV L +NLLS+SQ C GY+ +F+ + T
Sbjct: 758 LLVKSLNYNLLSVSQLCSMGYNCLFTDVDVT 788
>UniRef100_Q8LMY8 Putative polyprotein [Oryza sativa]
Length = 1584
Score = 78.6 bits (192), Expect = 6e-13
Identities = 41/99 (41%), Positives = 57/99 (57%), Gaps = 3/99 (3%)
Query: 558 WLKLQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTM--KDGGEVKFGGNQTGKIIGT 615
W+ L + R W LDSGC++HMTG++++F T + + +V FG N GK+IG
Sbjct: 598 WVPLYSVVNYRTGGSHWVLDSGCTQHMTGDRTMFTTFEVGGNEQEKVTFGDNSNGKVIGL 657
Query: 616 GTIG-NSSISINNVWLVDGLKHNLLSISQFCDNGYDVMF 653
G I ++ +SI NV LV L NLLS+ Q CD G F
Sbjct: 658 GKIAISNDLSIENVSLVKSLNFNLLSVPQICDLGLSCAF 696
>UniRef100_Q7XPD6 OSJNBa0042N22.2 protein [Oryza sativa]
Length = 1486
Score = 78.6 bits (192), Expect = 6e-13
Identities = 38/90 (42%), Positives = 60/90 (66%), Gaps = 3/90 (3%)
Query: 573 SWYLDSGCSRHMTGEKSLFLTLTMKDGG--EVKFGGNQTGKIIGTGTIG-NSSISINNVW 629
SW +DSGC+ HMTGE+S+F +L + G + FG + GK++G G I + + S++NV
Sbjct: 583 SWLVDSGCTNHMTGERSMFTSLDEEGGSCENIMFGDDGKGKVMGLGKIAISKNQSLSNVL 642
Query: 630 LVDGLKHNLLSISQFCDNGYDVMFSKTNCT 659
LV+ L +NLLS+SQ C G++ +F+ + T
Sbjct: 643 LVNSLSYNLLSVSQLCKLGFNCLFTDVDVT 672
>UniRef100_Q8H7T1 Putative Zea mays retrotransposon Opie-2 [Oryza sativa]
Length = 2145
Score = 78.6 bits (192), Expect = 6e-13
Identities = 40/88 (45%), Positives = 53/88 (59%), Gaps = 2/88 (2%)
Query: 570 KQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGE-VKFGGNQTGKIIGTGTIG-NSSISINN 627
+ SW +DSGCSRHMTGE F +LT E + FG +G+++ GTI N + +
Sbjct: 750 RSNSWLVDSGCSRHMTGEAKWFTSLTRASSDETITFGDASSGRVMAKGTIKVNDKFMLKD 809
Query: 628 VWLVDGLKHNLLSISQFCDNGYDVMFSK 655
V LV LK+NLLS+SQ CD +V F K
Sbjct: 810 VALVSKLKYNLLSVSQLCDENLEVRFKK 837
>UniRef100_Q7XPI7 OSJNBb0004A17.2 protein [Oryza sativa]
Length = 1877
Score = 78.6 bits (192), Expect = 6e-13
Identities = 40/88 (45%), Positives = 53/88 (59%), Gaps = 2/88 (2%)
Query: 570 KQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGE-VKFGGNQTGKIIGTGTIG-NSSISINN 627
+ SW +DSGCSRHMTGE F +LT E + FG +G+++ GTI N + +
Sbjct: 832 RSNSWLVDSGCSRHMTGEAKWFTSLTRASSDETITFGDASSGRVMAKGTIKVNDKFMLKD 891
Query: 628 VWLVDGLKHNLLSISQFCDNGYDVMFSK 655
V LV LK+NLLS+SQ CD +V F K
Sbjct: 892 VALVSKLKYNLLSVSQLCDENLEVRFKK 919
>UniRef100_Q850U8 Putative polyprotein [Oryza sativa]
Length = 623
Score = 78.2 bits (191), Expect = 7e-13
Identities = 38/90 (42%), Positives = 60/90 (66%), Gaps = 3/90 (3%)
Query: 573 SWYLDSGCSRHMTGEKSLFLTLTMKDGG--EVKFGGNQTGKIIGTGTIG-NSSISINNVW 629
SW +DSGC+ HMTGE+S+F +L + G + FG + GK++G G I + + S++NV
Sbjct: 402 SWVVDSGCTNHMTGERSMFTSLDEEGGSRENIVFGDDGKGKVMGLGKIAISKNQSLSNVL 461
Query: 630 LVDGLKHNLLSISQFCDNGYDVMFSKTNCT 659
LV+ L +NLLS+SQ C G++ +F+ + T
Sbjct: 462 LVNSLSYNLLSVSQLCKLGFNCLFTDVDVT 491
>UniRef100_Q688I4 Putative polyprotein [Oryza sativa]
Length = 1282
Score = 77.8 bits (190), Expect = 9e-13
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 559 LKLQMCLRAREKQRSWYLDSGCSRHMTGEKSLFLTLTMKDGGE-VKFGGNQTGKIIGTGT 617
L L+M L AR K+ W +DSGCSRHMT +K+ F +L E + FG T ++ TG
Sbjct: 494 LVLEMALVAR-KENVWIVDSGCSRHMTSDKNWFSSLKKTSKTESIIFGDAATSAVLATGL 552
Query: 618 IG-NSSISINNVWLVDGLKHNLLSISQFCDNGYDVMFSKT 656
+ N + + NV LV+ LK+NLLS+SQ D ++V F KT
Sbjct: 553 VKVNENFELKNVALVEDLKYNLLSVSQIVDENFEVHFKKT 592
>UniRef100_Q94I67 Putative retroelement [Oryza sativa]
Length = 916
Score = 77.4 bits (189), Expect = 1e-12
Identities = 42/93 (45%), Positives = 62/93 (66%), Gaps = 7/93 (7%)
Query: 572 RSWYLDSGCSRHMTGEKSLFLTL----TMKDGGEVKFGGNQTGKIIGTGTIGNSS-ISIN 626
RSW +DSGC+ HMT E+S+F +L T+++ + FG + GK++G G I S+ +SI+
Sbjct: 218 RSWVIDSGCTNHMTREESMFSSLDPNGTLQEN--ILFGYDGKGKVMGLGKIAISNDLSIS 275
Query: 627 NVWLVDGLKHNLLSISQFCDNGYDVMFSKTNCT 659
NV LV L +NLLSISQ C GY+ +F+ + T
Sbjct: 276 NVLLVKSLNYNLLSISQLCSMGYNCLFTDVDVT 308
>UniRef100_Q94H69 Putative polyprotein [Oryza sativa]
Length = 454
Score = 77.4 bits (189), Expect = 1e-12
Identities = 42/93 (45%), Positives = 62/93 (66%), Gaps = 7/93 (7%)
Query: 572 RSWYLDSGCSRHMTGEKSLFLTL----TMKDGGEVKFGGNQTGKIIGTGTIGNSS-ISIN 626
RSW +DSGC+ HMT E+S+F +L T+++ + FG + GK++G G I S+ +SI+
Sbjct: 194 RSWVIDSGCTNHMTREESMFSSLDPNGTLQEN--ILFGYDGKGKVMGLGKIAISNDLSIS 251
Query: 627 NVWLVDGLKHNLLSISQFCDNGYDVMFSKTNCT 659
NV LV L +NLLSISQ C GY+ +F+ + T
Sbjct: 252 NVLLVKSLNYNLLSISQLCSMGYNCLFTDVDVT 284
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.351 0.154 0.514
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 904,329,451
Number of Sequences: 2790947
Number of extensions: 33014598
Number of successful extensions: 127079
Number of sequences better than 10.0: 383
Number of HSP's better than 10.0 without gapping: 217
Number of HSP's successfully gapped in prelim test: 166
Number of HSP's that attempted gapping in prelim test: 126607
Number of HSP's gapped (non-prelim): 449
length of query: 659
length of database: 848,049,833
effective HSP length: 134
effective length of query: 525
effective length of database: 474,062,935
effective search space: 248883040875
effective search space used: 248883040875
T: 11
A: 40
X1: 14 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.9 bits)
S2: 78 (34.7 bits)
Medicago: description of AC141862.21