
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC121245.12 + phase: 0 /pseudo
(136 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At3g32110 non-LTR reverse transcriptase, putative 85 1e-17
At1g17390 hypothetical protein 83 5e-17
At5g33360 putative protein 82 7e-17
At2g22350 putative non-LTR retroelement reverse transcriptase 82 1e-16
At2g31080 putative non-LTR retroelement reverse transcriptase 78 1e-15
At1g47320 hypothetical protein 74 2e-14
At2g27870 putative non-LTR retroelement reverse transcriptase 71 2e-13
At2g07730 putative non-LTR retroelement reverse transcriptase 70 4e-13
At4g08830 putative protein 69 7e-13
At4g21420 hypothetical protein 67 2e-12
At4g15580 splicing factor like protein 59 6e-10
At5g61080 putative protein 55 1e-08
At3g25270 hypothetical protein 54 2e-08
At2g04420 putative non-LTR retrolelement reverse transcriptase 54 3e-08
At3g23315 unknown protein 52 7e-08
At2g45230 putative non-LTR retroelement reverse transcriptase 51 2e-07
At1g26950 hypothetical protein 50 4e-07
At4g29090 putative protein 50 5e-07
At3g09510 putative non-LTR reverse transcriptase 49 8e-07
At2g31520 putative non-LTR retroelement reverse transcriptase 49 8e-07
>At3g32110 non-LTR reverse transcriptase, putative
Length = 1911
Score = 85.1 bits (209), Expect = 1e-17
Identities = 46/116 (39%), Positives = 64/116 (54%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+RW PP E + KIN DG+S GNP A GG+LRN+ G W GF+ + GR + AEL +
Sbjct: 1741 IRWTPPMEGWYKINTDGASRGNPGLASAGGVLRNSAGAWCGGFAVNIGRCSAPLAELWGV 1800
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AW + +E DS+V + L T + HP + L+ L S W V
Sbjct: 1801 YYGLYMAWAKQLTHLELEVDSEVVVG-FLKTGIGETHPLSFLVRLCHNFLSKDWTV 1855
>At1g17390 hypothetical protein
Length = 322
Score = 82.8 bits (203), Expect = 5e-17
Identities = 45/116 (38%), Positives = 65/116 (55%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+ W+PP + K+N DG+S GNP A GG++R+ GNW +GFS + G + AEL
Sbjct: 102 IAWSPPRVGWFKLNTDGASRGNPRLATAGGVVRDGDGNWCYGFSLNIGICSAPLAELWGA 161
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AW+ G + ME DS++ + L T D HP + L+ L L S W V
Sbjct: 162 YYGLNIAWERGVTQLEMEIDSEMVVG-FLRTGIDDSHPLSFLVRLCHGLLSKDWSV 216
>At5g33360 putative protein
Length = 306
Score = 82.4 bits (202), Expect = 7e-17
Identities = 45/117 (38%), Positives = 64/117 (54%), Gaps = 1/117 (0%)
Query: 16 QVRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSA 75
QVRW+ P + K+N DG+S GNP A GG LRN G W GF+ + GR + AEL
Sbjct: 135 QVRWSKPSLGWCKLNTDGASHGNPGLAIAGGALRNEYGEWCFGFALNIGRCSAPLAELWG 194
Query: 76 IWKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
++ GL +AWD G + +E DS++ + L T HP + L+ + S W+V
Sbjct: 195 VYYGLFMAWDRGITRLELEVDSEMVVG-FLRTGIGSSHPLSFLVRMCHGFLSRDWIV 250
>At2g22350 putative non-LTR retroelement reverse transcriptase
Length = 321
Score = 81.6 bits (200), Expect = 1e-16
Identities = 45/116 (38%), Positives = 66/116 (56%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
V W P + ++K+N DG+S GNP A GG+LR++ G WI GF+ + G + AEL +
Sbjct: 151 VSWVSPEDGWVKLNTDGASRGNPGFATAGGVLRDHNGAWIGGFAVNIGVCSAPLAELWGV 210
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AW G R + +E DS++ + L T D HP + LL L S W+V
Sbjct: 211 YYGLFIAWGRGARRVELEVDSKMVVG-FLTTGIADSHPLSFLLRLCYDFLSKGWIV 265
>At2g31080 putative non-LTR retroelement reverse transcriptase
Length = 1231
Score = 78.2 bits (191), Expect = 1e-15
Identities = 39/116 (33%), Positives = 64/116 (54%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+RW P + ++KI DG+S GN A GG +RN +G W+ GF+ + G AEL
Sbjct: 1061 IRWQVPSDGWVKITTDGASRGNHGLAAAGGAIRNGQGEWLGGFALNIGSCAAPLAELWGA 1120
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AWD G+R + ++ D ++ + L T + HP + L+ L + + W+V
Sbjct: 1121 YYGLLIAWDKGFRRVELDLDCKLVVG-FLSTGVSNAHPLSFLVRLCQGFFTRDWLV 1175
>At1g47320 hypothetical protein
Length = 259
Score = 73.9 bits (180), Expect = 2e-14
Identities = 43/106 (40%), Positives = 59/106 (55%), Gaps = 1/106 (0%)
Query: 27 IKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAIWKGLQLAWDL 86
+KIN DG+S GNP A GG+L++N G W GFS + GR++ AEL + GL LAW+
Sbjct: 101 LKINTDGASRGNPGLATAGGVLQDNEGRWCGGFSLNIGRSSAPMAELWGAYYGLYLAWER 160
Query: 87 GYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
I +E DS++ + L T D HP + L+ L S W V
Sbjct: 161 KSSHIELEVDSEIVVG-FLKTGISDHHPLSFLVRLCHGFISKDWRV 205
>At2g27870 putative non-LTR retroelement reverse transcriptase
Length = 314
Score = 70.9 bits (172), Expect = 2e-13
Identities = 39/116 (33%), Positives = 61/116 (51%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+ W+ P E + K+N DG+S GNP A GG+LR+ G W GF+ + G + AEL +
Sbjct: 144 IAWSKPEEGWWKLNTDGASRGNPGLASAGGVLRDEEGAWRGGFALNIGVCSAPLAELWGV 203
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AW+ + +E DS++ + L + HP + L+ L S W V
Sbjct: 204 YYGLYIAWERRVTRLEIEVDSEIVVG-FLKIGINEVHPLSFLVRLCHDFISRDWRV 258
>At2g07730 putative non-LTR retroelement reverse transcriptase
Length = 970
Score = 69.7 bits (169), Expect = 4e-13
Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 1/116 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
+RW P + ++K+ DG+S G+ A G + N +G W+ GF+ + G AEL
Sbjct: 800 IRWKAPSDRWVKLTTDGASRGHQGLAAASGAILNLQGEWLGGFALNIGSCDAPLAELWGA 859
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPWVV 132
+ GL +AWD G+R + + DS++ + L T HP + L+ L + + W+V
Sbjct: 860 YYGLLIAWDKGFRRVELNLDSELVVG-FLSTGISKAHPLSFLVRLCQGFFTRDWLV 914
>At4g08830 putative protein
Length = 947
Score = 68.9 bits (167), Expect = 7e-13
Identities = 37/105 (35%), Positives = 59/105 (55%), Gaps = 1/105 (0%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAI 76
V W PP +++K+N DG+S GN A GG+LR+ G+W GF+ G + AEL +
Sbjct: 839 VAWKPPDGEWVKLNTDGASRGNLGLATTGGVLRDGIGHWCGGFALDIGVCSAPLAELWGV 898
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSL 121
+ GL +AW+ + + +E DS++ + L T D H + L+ L
Sbjct: 899 YYGLYMAWERRFTRVELEVDSELVVG-FLTTGISDTHSLSFLVRL 942
>At4g21420 hypothetical protein
Length = 229
Score = 67.4 bits (163), Expect = 2e-12
Identities = 33/89 (37%), Positives = 54/89 (60%), Gaps = 1/89 (1%)
Query: 16 QVRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSA 75
+V W P IK+N DGS G A GG+ RN++ ++ G+S S G AT+ AE +A
Sbjct: 60 KVVWKKPENGRIKLNFDGSR-GREGQASIGGVFRNHKAEFLLGYSESIGEATSTMAEFAA 118
Query: 76 IWKGLQLAWDLGYRSIIMESDSQVALDLI 104
+ +GL+LA + G + +E D+++ +D+I
Sbjct: 119 LKRGLELALENGLTDLWLEGDAKIIMDII 147
>At4g15580 splicing factor like protein
Length = 559
Score = 59.3 bits (142), Expect = 6e-10
Identities = 37/127 (29%), Positives = 62/127 (48%), Gaps = 15/127 (11%)
Query: 6 GPTNKGHVICQVRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGR 65
G + V + W PPE ++K++ DG+S GNP A GG++R+ G W+ GF+
Sbjct: 15 GSQSVARVERHIAWTKPPEGWVKVSTDGASRGNPGPAAAGGVIRDEDGLWVGGFA----- 69
Query: 66 ATNLFAELSAIWKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKL 125
+L+ + +L W + +E DS++ + L + + HP A L+ L L
Sbjct: 70 -----LQLAFV----RLRWQSCGGRVELEVDSELVVG-FLQSGISEAHPLAFLVRLCHGL 119
Query: 126 SSLPWVV 132
S W+V
Sbjct: 120 LSKDWLV 126
>At5g61080 putative protein
Length = 348
Score = 55.1 bits (131), Expect = 1e-08
Identities = 33/112 (29%), Positives = 54/112 (47%), Gaps = 4/112 (3%)
Query: 19 WNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATNLFAELSAIWK 78
W PP +K+N+ G+S D G++R+ G W+ G+ + A L AI++
Sbjct: 190 WTPP---LVKLNIQGTSNPLSDLTRSAGIVRDQSGKWVFGYIRCHKSIPEVVAGLLAIYQ 246
Query: 79 GLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHPHASLLSLIRKLSSLPW 130
GL+ WD G+R I +E+ S ++ L TK F +LL + + W
Sbjct: 247 GLKYLWDSGFRRIHLETTSFEIIN-ALTTKSSLFCKSKTLLGACKDMILKEW 297
>At3g25270 hypothetical protein
Length = 343
Score = 53.9 bits (128), Expect = 2e-08
Identities = 32/97 (32%), Positives = 51/97 (51%), Gaps = 3/97 (3%)
Query: 14 ICQVRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRAT--NLFA 71
+ + +W PP +IK N DG+ NA G L+R+ G ++ G + G T +L +
Sbjct: 181 MARTKWQRPPSTWIKYNYDGAFNHQTRNAKAGWLMRDENGVYM-GSGQAIGSTTSDSLES 239
Query: 72 ELSAIWKGLQLAWDLGYRSIIMESDSQVALDLILDTK 108
E A+ +Q AW GYR +I E DS+ +L+ + K
Sbjct: 240 EFQALIIAMQHAWSQGYRKVIFEGDSKQVEELMNNEK 276
>At2g04420 putative non-LTR retrolelement reverse transcriptase
Length = 221
Score = 53.5 bits (127), Expect = 3e-08
Identities = 33/97 (34%), Positives = 51/97 (52%), Gaps = 4/97 (4%)
Query: 18 RWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATN--LFAELSA 75
+W PP +IK N DGS G L+R+++G + G + + G N L +EL A
Sbjct: 62 KWEQPPMGWIKCNYDGSFNYRTQQTNSGWLIRDDKG-FYKGAAQAVGGTMNNALESELQA 120
Query: 76 IWKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDF 112
+ +Q W GYR +I E DS+ ++ +L+ KQ F
Sbjct: 121 LVMAMQHTWSQGYRKVIFEGDSK-QVEELLNRKQMHF 156
>At3g23315 unknown protein
Length = 191
Score = 52.4 bits (124), Expect = 7e-08
Identities = 31/98 (31%), Positives = 51/98 (51%), Gaps = 4/98 (4%)
Query: 18 RWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSC-GRATNLFAELSAI 76
+W P +++K N D S+ ++G ++RN++G + G GR T AE +A+
Sbjct: 29 KWQKPGAEWVKCNYDVSNHAGRQDSGLRWIIRNSQGTCLDCGCGKFQGRQTIKEAECTAL 88
Query: 77 WKGLQLAWDLGYRSIIMESDSQVALDLILDTKQKDFHP 114
+Q AWDLGYR + E D+ LI + K+ +P
Sbjct: 89 IWAIQCAWDLGYRRVEFEGDNITVNRLI---RNKETNP 123
>At2g45230 putative non-LTR retroelement reverse transcriptase
Length = 1374
Score = 51.2 bits (121), Expect = 2e-07
Identities = 33/91 (36%), Positives = 48/91 (52%), Gaps = 5/91 (5%)
Query: 17 VRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGN--WIHGFSGSCGRATNLFAELS 74
V+W PP ++K N DG+ + N G G +LRN+ G W+ G + + L E+
Sbjct: 1213 VKWQPPSHGWVKCNTDGAWSKDLGNCGVGWVLRNHTGRLLWL-GLRALPSQQSVLETEVE 1271
Query: 75 AI-WKGLQLAWDLGYRSIIMESDSQVALDLI 104
A+ W L L+ YR +I ESDSQ + LI
Sbjct: 1272 ALRWAVLSLS-RFNYRRVIFESDSQYLVSLI 1301
>At1g26950 hypothetical protein
Length = 158
Score = 50.1 bits (118), Expect = 4e-07
Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 1/87 (1%)
Query: 46 GLLRNNRGNWIHGFSGSCGRATNLFAELSAIWKGLQLAWDLGYRSIIMESDSQVALDLIL 105
G LR+ G+W F+ + GR T AEL ++ GL +AW+ G + +E DS++ L
Sbjct: 17 GALRDEYGDWRGSFALNLGRCTAPLAELWGVYYGLVIAWEKGITRLELEVDSKLVAG-FL 75
Query: 106 DTKQKDFHPHASLLSLIRKLSSLPWVV 132
T +D H + L+ L SS W+V
Sbjct: 76 TTGIEDSHLLSFLVRLCYGFSSKDWIV 102
>At4g29090 putative protein
Length = 575
Score = 49.7 bits (117), Expect = 5e-07
Identities = 32/90 (35%), Positives = 48/90 (52%), Gaps = 5/90 (5%)
Query: 18 RWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRG--NWIHGFSGSCGRATNLFAELSA 75
RW PPP ++K N D + + + G G +LRN +G W+ G + L AEL A
Sbjct: 419 RWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWM-GARALPKLKSVLEAELEA 477
Query: 76 I-WKGLQLAWDLGYRSIIMESDSQVALDLI 104
+ W L L+ Y +I ESDSQV ++++
Sbjct: 478 MRWAVLSLS-RFQYNYVIFESDSQVLIEIL 506
>At3g09510 putative non-LTR reverse transcriptase
Length = 484
Score = 48.9 bits (115), Expect = 8e-07
Identities = 29/90 (32%), Positives = 41/90 (45%), Gaps = 1/90 (1%)
Query: 16 QVRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATN-LFAELS 74
++ W PP ++K N D A G ++RN+ G I S +N L AE
Sbjct: 323 KIEWRNPPATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETK 382
Query: 75 AIWKGLQLAWDLGYRSIIMESDSQVALDLI 104
A+ LQ W GY + ME D Q ++LI
Sbjct: 383 ALLAALQQTWIRGYTQVFMEGDCQTLINLI 412
>At2g31520 putative non-LTR retroelement reverse transcriptase
Length = 1524
Score = 48.9 bits (115), Expect = 8e-07
Identities = 29/90 (32%), Positives = 41/90 (45%), Gaps = 1/90 (1%)
Query: 16 QVRWNPPPEDFIKINVDGSSFGNPDNAGFGGLLRNNRGNWIHGFSGSCGRATN-LFAELS 74
++ W PP ++K N D A G ++RN+ G I S +N L AE
Sbjct: 1363 KIEWRNPPATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETK 1422
Query: 75 AIWKGLQLAWDLGYRSIIMESDSQVALDLI 104
A+ LQ W GY + ME D Q ++LI
Sbjct: 1423 ALLAALQQTWIRGYTQVFMEGDCQTLINLI 1452
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.322 0.140 0.453
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,481,128
Number of Sequences: 26719
Number of extensions: 141589
Number of successful extensions: 394
Number of sequences better than 10.0: 67
Number of HSP's better than 10.0 without gapping: 53
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 322
Number of HSP's gapped (non-prelim): 68
length of query: 136
length of database: 11,318,596
effective HSP length: 89
effective length of query: 47
effective length of database: 8,940,605
effective search space: 420208435
effective search space used: 420208435
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 54 (25.4 bits)
Medicago: description of AC121245.12