
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC121245.12 + phase: 0 /pseudo
(136 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q9LNS0 F1L3.4 [Arabidopsis thaliana] 83 2e-15
UniRef100_Q9LQJ0 F28G4.15 protein [Arabidopsis thaliana] 83 2e-15
UniRef100_Q9SI81 F23N19.5 [Arabidopsis thaliana] 82 2e-15
UniRef100_Q9FG26 Non-LTR retroelement reverse transcriptase-like... 82 2e-15
UniRef100_Q9SJZ8 Putative non-LTR retroelement reverse transcrip... 82 3e-15
UniRef100_O82276 Putative non-LTR retroelement reverse transcrip... 78 4e-14
UniRef100_Q9SHY0 F1E22.12 [Arabidopsis thaliana] 75 4e-13
UniRef100_Q9FMM8 Non-LTR retroelement reverse transcriptase-like... 75 4e-13
UniRef100_Q9ZUY7 Putative reverse transcriptase [Arabidopsis tha... 71 6e-12
UniRef100_O80790 Putative non-LTR retroelement reverse transcrip... 70 1e-11
UniRef100_Q9LDL8 Hypothetical protein AT4g08830 [Arabidopsis tha... 69 2e-11
UniRef100_O65407 Hypothetical protein F18E5.40 [Arabidopsis thal... 67 7e-11
UniRef100_Q655G6 Hypothetical protein P0009H10.11 [Oryza sativa] 67 9e-11
UniRef100_Q6K204 Hypothetical protein B1469H02.35 [Oryza sativa] 67 1e-10
UniRef100_Q9FXC8 F12A21.24 [Arabidopsis thaliana] 63 2e-09
UniRef100_O23409 SIMILARITY to probable splicing factor CEPRP21 ... 59 2e-08
UniRef100_Q9LP00 F9C16.18 [Arabidopsis thaliana] 59 3e-08
UniRef100_Q5W657 Hypothetical protein OSJNBb0052F16.4 [Oryza sat... 56 2e-07
UniRef100_Q6L3X8 Hypothetical protein [Solanum demissum] 56 2e-07
UniRef100_Q8H8C0 Hypothetical protein OJ1134F05.9 [Oryza sativa] 55 3e-07
>UniRef100_Q9LNS0 F1L3.4 [Arabidopsis thaliana]
Length = 253
Score = 82.8 bits (203), Expect = 2e-15
Identities = 45/116 (38%), Positives = 65/116 (55%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+ W+PP + K+N DG+S GNP A GG++R+ GNW +GFS + G + AEL
Sbjct: 83 IAWSPPRVGWFKLNTDGASRGNPRLATAGGVVRDGDGNWCYGFSLNIGICSAPLAELWGA 142
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AW+ G + ME DS++ + L T D HP + L+ L L S W V
Sbjct: 143 YYGLNIAWERGVTQLEMEIDSEMVVG-FLRTGIDDSHPLSFLVRLCHGLLSKDWSV 197
>UniRef100_Q9LQJ0 F28G4.15 protein [Arabidopsis thaliana]
Length = 272
Score = 82.8 bits (203), Expect = 2e-15
Identities = 45/116 (38%), Positives = 65/116 (55%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+ W+PP + K+N DG+S GNP A GG++R+ GNW +GFS + G + AEL
Sbjct: 102 IAWSPPRVGWFKLNTDGASRGNPRLATAGGVVRDGDGNWCYGFSLNIGICSAPLAELWGA 161
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AW+ G + ME DS++ + L T D HP + L+ L L S W V
Sbjct: 162 YYGLNIAWERGVTQLEMEIDSEMVVG-FLRTGIDDSHPLSFLVRLCHGLLSKDWSV 216
>UniRef100_Q9SI81 F23N19.5 [Arabidopsis thaliana]
Length = 233
Score = 82.4 bits (202), Expect = 2e-15
Identities = 45/117 (38%), Positives = 63/117 (53%), Gaps = 1/117 (0%)
Query: 16 QVRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSA 75
QVRW+ P + K+N DG+S GNP A GG LRN G W GF+ + GR AEL
Sbjct: 62 QVRWSKPSLGWCKLNTDGASHGNPGLATAGGALRNEYGEWCFGFALNIGRCLAPLAELWG 121
Query: 76 IWKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
++ GL +AWD G + +E DS++ + L T HP + L+ + S W+V
Sbjct: 122 VYYGLFMAWDRGITRLELEVDSEMVVG-FLRTGIGSSHPLSFLVRMCHGFLSRDWIV 177
>UniRef100_Q9FG26 Non-LTR retroelement reverse transcriptase-like [Arabidopsis
thaliana]
Length = 676
Score = 82.4 bits (202), Expect = 2e-15
Identities = 41/116 (35%), Positives = 66/116 (56%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+ W P E ++ +N DG+S GNP A GG++R+ G+W+ GF+ + G + AEL +
Sbjct: 506 IAWRKPAEGWVTMNTDGASHGNPGQATAGGVIRDEHGSWLVGFALNIGVCSAPLAELWGV 565
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AW+ G+R + +E DS + + L + D HP A L+ L S W+V
Sbjct: 566 YYGLVVAWERGWRRVRLEVDSALVVG-FLQSGIGDSHPLAFLVRLCHGFISKDWIV 620
>UniRef100_Q9SJZ8 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 321
Score = 81.6 bits (200), Expect = 3e-15
Identities = 45/116 (38%), Positives = 66/116 (56%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
V W P + ++K+N DG+S GNP A GG+LR++ G WI GF+ + G + AEL +
Sbjct: 151 VSWVSPEDGWVKLNTDGASRGNPGFATAGGVLRDHNGAWIGGFAVNIGVCSAPLAELWGV 210
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AW G R + +E DS++ + L T D HP + LL L S W+V
Sbjct: 211 YYGLFIAWGRGARRVELEVDSKMVVG-FLTTGIADSHPLSFLLRLCYDFLSKGWIV 265
>UniRef100_O82276 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1231
Score = 78.2 bits (191), Expect = 4e-14
Identities = 39/116 (33%), Positives = 64/116 (54%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+RW P + ++KI DG+S GN A GG +RN +G W+ GF+ + G AEL
Sbjct: 1061 IRWQVPSDGWVKITTDGASRGNHGLAAAGGAIRNGQGEWLGGFALNIGSCAAPLAELWGA 1120
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AWD G+R + ++ D ++ + L T + HP + L+ L + + W+V
Sbjct: 1121 YYGLLIAWDKGFRRVELDLDCKLVVG-FLSTGVSNAHPLSFLVRLCQGFFTRDWLV 1175
>UniRef100_Q9SHY0 F1E22.12 [Arabidopsis thaliana]
Length = 1055
Score = 74.7 bits (182), Expect = 4e-13
Identities = 43/119 (36%), Positives = 63/119 (52%), Gaps = 1/119 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+ W P ++K+N DG+S GNP A GG+LR+ G W GFS + GR + AEL +
Sbjct: 559 IGWVSPCVGWVKVNTDGASRGNPGLASAGGVLRDCTGAWCGGFSLNIGRCSAPQAELWGV 618
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVVYLV 135
+ GL AW+ + +E DS+V + L T D HP + L+ L W+V +V
Sbjct: 619 YYGLYFAWEKKVPRVELEVDSEVIVG-FLKTGISDSHPLSFLVRLCHGFLQKDWLVRIV 676
>UniRef100_Q9FMM8 Non-LTR retroelement reverse transcriptase-like protein
[Arabidopsis thaliana]
Length = 308
Score = 74.7 bits (182), Expect = 4e-13
Identities = 43/119 (36%), Positives = 63/119 (52%), Gaps = 1/119 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+ W P ++K+N DG+S GNP A GG+LR+ G W GFS + GR + AEL +
Sbjct: 138 IGWVLPCVGWVKVNTDGASRGNPGLASAGGVLRDCEGAWCGGFSLNIGRCSAQHAELWGV 197
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVVYLV 135
+ GL AW+ + +E DS+ A+ L T D HP + L+ L W+V +V
Sbjct: 198 YYGLYFAWEKKVPRVELEVDSE-AIVGFLKTGISDSHPLSFLVRLCHNFLQKDWLVRIV 255
>UniRef100_Q9ZUY7 Putative reverse transcriptase [Arabidopsis thaliana]
Length = 314
Score = 70.9 bits (172), Expect = 6e-12
Identities = 39/116 (33%), Positives = 61/116 (51%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+ W+ P E + K+N DG+S GNP A GG+LR+ G W GF+ + G + AEL +
Sbjct: 144 IAWSKPEEGWWKLNTDGASRGNPGLASAGGVLRDEEGAWRGGFALNIGVCSAPLAELWGV 203
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AW+ + +E DS++ + L + HP + L+ L S W V
Sbjct: 204 YYGLYIAWERRVTRLEIEVDSEIVVG-FLKIGINEVHPLSFLVRLCHDFISRDWRV 258
>UniRef100_O80790 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 970
Score = 69.7 bits (169), Expect = 1e-11
Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+RW P + ++K+ DG+S G+ A G + N +G W+ GF+ + G AEL
Sbjct: 800 IRWKAPSDRWVKLTTDGASRGHQGLAAASGAILNLQGEWLGGFALNIGSCDAPLAELWGA 859
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AWD G+R + + DS++ + L T HP + L+ L + + W+V
Sbjct: 860 YYGLLIAWDKGFRRVELNLDSELVVG-FLSTGISKAHPLSFLVRLCQGFFTRDWLV 914
>UniRef100_Q9LDL8 Hypothetical protein AT4g08830 [Arabidopsis thaliana]
Length = 947
Score = 68.9 bits (167), Expect = 2e-11
Identities = 37/105 (35%), Positives = 59/105 (55%), Gaps = 1/105 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
V W PP +++K+N DG+S GN A GG+LR+ G+W GF+ G + AEL +
Sbjct: 839 VAWKPPDGEWVKLNTDGASRGNLGLATTGGVLRDGIGHWCGGFALDIGVCSAPLAELWGV 898
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSL 121
+ GL +AW+ + + +E DS++ + L T D H + L+ L
Sbjct: 899 YYGLYMAWERRFTRVELEVDSELVVG-FLTTGISDTHSLSFLVRL 942
>UniRef100_O65407 Hypothetical protein F18E5.40 [Arabidopsis thaliana]
Length = 229
Score = 67.4 bits (163), Expect = 7e-11
Identities = 33/89 (37%), Positives = 54/89 (60%), Gaps = 1/89 (1%)
Query: 16 QVRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSA 75
+V W P IK+N DGS G A GG+ RN++ ++ G+S S G AT+ AE +A
Sbjct: 60 KVVWKKPENGRIKLNFDGSR-GREGQASIGGVFRNHKAEFLLGYSESIGEATSTMAEFAA 118
Query: 76 IWKGLQLAWDLGYRSIIMESDSQVALDLI 104
+ +GL+LA + G + +E D+++ +D+I
Sbjct: 119 LKRGLELALENGLTDLWLEGDAKIIMDII 147
>UniRef100_Q655G6 Hypothetical protein P0009H10.11 [Oryza sativa]
Length = 240
Score = 67.0 bits (162), Expect = 9e-11
Identities = 33/88 (37%), Positives = 55/88 (62%)
Query: 19 WNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAIWK 78
W P E ++K+N DGSS + A GG+ R++ G ++ G++ G AT+ AEL+A+ +
Sbjct: 79 WRKPEEGWMKLNFDGSSKHSTGIASIGGVYRDHDGAFLLGYAERIGTATSSVAELAALRR 138
Query: 79 GLQLAWDLGYRSIIMESDSQVALDLILD 106
GL+LA G+R + E DS+ +D++ D
Sbjct: 139 GLELAVRNGWRRVWAEGDSKAVVDVVCD 166
>UniRef100_Q6K204 Hypothetical protein B1469H02.35 [Oryza sativa]
Length = 212
Score = 66.6 bits (161), Expect = 1e-10
Identities = 33/88 (37%), Positives = 53/88 (59%)
Query: 19 WNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAIWK 78
W P +IK+N DGSS A GG+ R++ G ++ G++ GRAT+ AEL+A+ +
Sbjct: 51 WRKPEAGWIKLNFDGSSKHATKIASIGGVYRDHEGAFVLGYAERIGRATSSVAELAALRR 110
Query: 79 GLQLAWDLGYRSIIMESDSQVALDLILD 106
GL+L G+R + E DS+ +D++ D
Sbjct: 111 GLELVVRNGWRRVWAEGDSKTVVDVVCD 138
>UniRef100_Q9FXC8 F12A21.24 [Arabidopsis thaliana]
Length = 803
Score = 62.8 bits (151), Expect = 2e-09
Identities = 36/99 (36%), Positives = 53/99 (53%), Gaps = 1/99 (1%)
Query: 34 SSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAIWKGLQLAWDLGYRSIIM 93
+S GNP A GG++R+ GNW GF+ + GR + AEL ++ G LAW + +
Sbjct: 650 ASRGNPGLATAGGVIRDGAGNWCGGFALNIGRCSAPLAELWGVYYGFYLAWTKALTRVEL 709
Query: 94 ESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
E DS++ + L T D HP + L+ L L S W+V
Sbjct: 710 EVDSELVVG-FLKTGIGDQHPLSFLVRLCHGLLSKDWIV 747
>UniRef100_O23409 SIMILARITY to probable splicing factor CEPRP21 [Arabidopsis
thaliana]
Length = 559
Score = 59.3 bits (142), Expect = 2e-08
Identities = 37/127 (29%), Positives = 62/127 (48%), Gaps = 15/127 (11%)
Query: 6 GPTNKGHVICQVRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGR 65
G + V + W PPE ++K++ DG+S GNP A GG++R+ G W+ GF+
Sbjct: 15 GSQSVARVERHIAWTKPPEGWVKVSTDGASRGNPGPAAAGGVIRDEDGLWVGGFA----- 69
Query: 66 ATNLFAELSAIWKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKL 125
+L+ + +L W + +E DS++ + L + + HP A L+ L L
Sbjct: 70 -----LQLAFV----RLRWQSCGGRVELEVDSELVVG-FLQSGISEAHPLAFLVRLCHGL 119
Query: 126 SSLPWVV 132
S W+V
Sbjct: 120 LSKDWLV 126
>UniRef100_Q9LP00 F9C16.18 [Arabidopsis thaliana]
Length = 196
Score = 58.5 bits (140), Expect = 3e-08
Identities = 28/63 (44%), Positives = 39/63 (61%)
Query: 11 GHVICQVRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLF 70
G + V W+ P E ++K+N DG+S GNP + GG+LR+ G W+ GFS + GR T
Sbjct: 50 GSIDILVGWSVPSEGWLKLNTDGASRGNPTLSTAGGVLRDREGEWLGGFSLNIGRRTAPV 109
Query: 71 AEL 73
AEL
Sbjct: 110 AEL 112
>UniRef100_Q5W657 Hypothetical protein OSJNBb0052F16.4 [Oryza sativa]
Length = 220
Score = 56.2 bits (134), Expect = 2e-07
Identities = 36/115 (31%), Positives = 58/115 (50%), Gaps = 4/115 (3%)
Query: 16 QVRWNPPPEDFIKINVDGSSFGNPD-NAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELS 74
++RW PPP + K+N DGS F + A GG++R G + F+ + T E
Sbjct: 46 RIRWAPPPVGWCKLNFDGSVFNDGSRRASIGGVIRGCDGGVVLAFAETTEHWTVGVVEAR 105
Query: 75 AIWKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHP---HASLLSLIRKLS 126
A+ KGL+LA I++E D V + L+ + + P H +LSL+R+ +
Sbjct: 106 ALIKGLKLALACFVERIVVEGDDLVLVQLLRGEETQTRIPAAMHDEILSLLRRFT 160
>UniRef100_Q6L3X8 Hypothetical protein [Solanum demissum]
Length = 1155
Score = 55.8 bits (133), Expect = 2e-07
Identities = 38/123 (30%), Positives = 55/123 (43%), Gaps = 20/123 (16%)
Query: 14 ICQVRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAEL 73
+ +V W PP F K+N DGS G GG+LRN G I F+ G T+ +AE
Sbjct: 987 VIRVNWIKPPTMFAKLNTDGSCVNG--RCGGGGILRNALGQVIMAFTIKLGEGTSSWAEA 1044
Query: 74 SAIWKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVVY 133
++ G+QL G II E+DS + L I + S+PW +Y
Sbjct: 1045 MSMLHGMQLCIQRGVNMIIGETDSIL------------------LAKAITENWSIPWRMY 1086
Query: 134 LVI 136
+ +
Sbjct: 1087 IPV 1089
>UniRef100_Q8H8C0 Hypothetical protein OJ1134F05.9 [Oryza sativa]
Length = 210
Score = 55.5 bits (132), Expect = 3e-07
Identities = 29/88 (32%), Positives = 50/88 (55%), Gaps = 2/88 (2%)
Query: 19 WNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSC--GRATNLFAELSAI 76
W P ++K+N DGS A GG R++ G ++ G++ G A++ AEL+A+
Sbjct: 47 WRKPEVGWMKLNFDGSRNDATGAASIGGAFRDHEGAFVAGYAERMIVGGASSFTAELAAL 106
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLI 104
+GL+LA G+R + E DS+ +D++
Sbjct: 107 RRGLELAARYGWRRVWAEGDSRAVVDVV 134
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.322 0.140 0.453
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 254,000,154
Number of Sequences: 2790947
Number of extensions: 10023293
Number of successful extensions: 20266
Number of sequences better than 10.0: 457
Number of HSP's better than 10.0 without gapping: 253
Number of HSP's successfully gapped in prelim test: 204
Number of HSP's that attempted gapping in prelim test: 19946
Number of HSP's gapped (non-prelim): 469
length of query: 136
length of database: 848,049,833
effective HSP length: 112
effective length of query: 24
effective length of database: 535,463,769
effective search space: 12851130456
effective search space used: 12851130456
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 67 (30.4 bits)
Medicago: description of AC121245.12