Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0098b.3
         (287 letters)

Database: ara_mips 
           26,719 sequences; 11,318,596 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

At4g10990 putative retrotransposon polyprotein                         33  0.14
At4g11430 putative proline-rich protein                                32  0.39
At2g31080 putative non-LTR retroelement reverse transcriptase          32  0.39
At4g25160 putative Ser/Thr protein kinase                              31  0.88
At2g07730 putative non-LTR retroelement reverse transcriptase          31  0.88
At1g69810 unknown protein                                              31  0.88
At3g03770 unknown protein                                              30  1.1
At5g66730 zinc finger protein                                          30  1.5
At4g22360 unknown protein                                              30  1.5
At1g64790 hypothetical protein                                         30  1.5
At1g16610 arginine/serine-rich protein                                 30  1.5
At5g38690 unknown protein                                              30  2.0
At5g04810 membrane-associated salt-inducible protein-like              30  2.0
At4g36080 ATM-like protein                                             30  2.0
At2g31220 putative transcription factor BHLH10                         29  2.6
At2g22350 putative non-LTR retroelement reverse transcriptase          29  2.6
At1g62760 hypothetical protein                                         29  2.6
At5g21280 putative protein                                             29  3.3
At5g15390 rRNA methylase - like protein                                29  3.3
At1g07970 unknown protein                                              29  3.3

>At4g10990 putative retrotransposon polyprotein
          Length = 1203

 Score = 33.5 bits (75), Expect = 0.14
 Identities = 30/113 (26%), Positives = 51/113 (44%), Gaps = 13/113 (11%)

Query: 95  LAMPHPLYDVLNKPTINDMASQVDKALLSSSSSSLSLSHPPPPPPSFTIATIVVVAIATG 154
           L +P PL+ V + P  +D+ +  + A  S+S+S+ S S  PP P +        + I T 
Sbjct: 385 LPLPAPLHFVESMPLDDDLRADDNNA--STSNSASSASSIPPLPSTVNTQNTDALDIDTN 442

Query: 155 SITA------SNSTVVITNTHCHHHPPLMPLFIT-----EVTHTNIAPRKHVT 196
           S+        + +   ++  HC+  P L  L  T     E   ++I P+K  T
Sbjct: 443 SVPIARPKRNAKAPAYLSEYHCNSVPFLSSLSPTTSTSIETPSSSIPPKKITT 495


>At4g11430 putative proline-rich protein
          Length = 219

 Score = 32.0 bits (71), Expect = 0.39
 Identities = 26/67 (38%), Positives = 28/67 (40%), Gaps = 10/67 (14%)

Query: 132 SHPPPPPPSFTIATIV--VVAIATG------SITASNSTVVITNTHCHH--HPPLMPLFI 181
           S PPPPPP     TI   V    TG            +T  ITNT  HH  HPP     I
Sbjct: 150 SPPPPPPPPPPPPTITPPVTTTTTGHHHHRPPPPPPATTTPITNTSDHHQLHPPPPATTI 209

Query: 182 TEVTHTN 188
           T  T +N
Sbjct: 210 TTTTISN 216



 Score = 28.1 bits (61), Expect = 5.7
 Identities = 22/73 (30%), Positives = 27/73 (36%), Gaps = 19/73 (26%)

Query: 134 PPPPPPSFTIATIVVVAIA------------------TGSITASNSTVVITNTHCHHHPP 175
           PPPPPP  TI   V    A                    +IT   +T   T  H H  PP
Sbjct: 124 PPPPPPPPTITPPVTTTTAGHHHHRRSPPPPPPPPPPPPTITPPVTTTT-TGHHHHRPPP 182

Query: 176 LMPLFITEVTHTN 188
             P   T +T+T+
Sbjct: 183 PPPATTTPITNTS 195


>At2g31080 putative non-LTR retroelement reverse transcriptase
          Length = 1231

 Score = 32.0 bits (71), Expect = 0.39
 Identities = 21/74 (28%), Positives = 33/74 (44%), Gaps = 1/74 (1%)

Query: 29   LMSVDDIIRYSFSEWVASFSSFFGGSCSLYVELLAKKHGLLLDINLGYRWVICESDSTTV 88
            L +    IR    EW+  F+   G   +   EL    +GLL+  + G+R V  + D   V
Sbjct: 1085 LAAAGGAIRNGQGEWLGGFALNIGSCAAPLAELWGAYYGLLIAWDKGFRRVELDLDCKLV 1144

Query: 89   IHLLE-ALAMPHPL 101
            +  L   ++  HPL
Sbjct: 1145 VGFLSTGVSNAHPL 1158


>At4g25160 putative Ser/Thr protein kinase
          Length = 835

 Score = 30.8 bits (68), Expect = 0.88
 Identities = 17/34 (50%), Positives = 21/34 (61%), Gaps = 4/34 (11%)

Query: 122 LSSSSSSLSLSHPPPPPPSFTIATIVVVAIATGS 155
           +S S   L+L  PPPPPPS T    VVVA++  S
Sbjct: 1   MSRSPDKLALPPPPPPPPSRT----VVVALSGSS 30


>At2g07730 putative non-LTR retroelement reverse transcriptase
          Length = 970

 Score = 30.8 bits (68), Expect = 0.88
 Identities = 19/61 (31%), Positives = 29/61 (47%), Gaps = 1/61 (1%)

Query: 42  EWVASFSSFFGGSCSLYVELLAKKHGLLLDINLGYRWVICESDSTTVIHLLE-ALAMPHP 100
           EW+  F+   G   +   EL    +GLL+  + G+R V    DS  V+  L   ++  HP
Sbjct: 837 EWLGGFALNIGSCDAPLAELWGAYYGLLIAWDKGFRRVELNLDSELVVGFLSTGISKAHP 896

Query: 101 L 101
           L
Sbjct: 897 L 897


>At1g69810 unknown protein
          Length = 387

 Score = 30.8 bits (68), Expect = 0.88
 Identities = 14/19 (73%), Positives = 15/19 (78%)

Query: 122 LSSSSSSLSLSHPPPPPPS 140
           LSSSS SL+ S P PPPPS
Sbjct: 335 LSSSSFSLNFSSPDPPPPS 353


>At3g03770 unknown protein
          Length = 802

 Score = 30.4 bits (67), Expect = 1.1
 Identities = 20/51 (39%), Positives = 27/51 (52%), Gaps = 11/51 (21%)

Query: 101 LYDVLNKPTINDM------ASQVDKALLSSSSSSLSLSHPPP-----PPPS 140
           L D L +P+I D+      ASQV +  L +S+ S +L  P P     PPPS
Sbjct: 732 LKDPLERPSIEDVLWNLQFASQVQEGWLQNSNPSSNLGSPSPAASSLPPPS 782


>At5g66730 zinc finger protein
          Length = 500

 Score = 30.0 bits (66), Expect = 1.5
 Identities = 21/74 (28%), Positives = 35/74 (46%), Gaps = 2/74 (2%)

Query: 91  LLEALAMPHPLYDVLNKPTINDMASQVDKALLSSSSSSLSLSHPPPPPPSFTIATIVVVA 150
           L E  A  H     L   T+     ++++   ++  SS SL  PP  PPS  IA    ++
Sbjct: 188 LAEESAKNHTQSKKLYPETVTRKNPEIEQKSPAAVESSPSL--PPSSPPSVAIAPAPAIS 245

Query: 151 IATGSITASNSTVV 164
           + T S+   +S+V+
Sbjct: 246 VETESVKIISSSVL 259


>At4g22360 unknown protein
          Length = 385

 Score = 30.0 bits (66), Expect = 1.5
 Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 124 SSSSSLSLSHPPPPPPSF----TIATIVVVAIATGSITASNSTVVITNTHCHHHP 174
           +S+SS+  SHPPPPP S       + + V  +A G  T S+ +    ++    +P
Sbjct: 67  ASASSVQQSHPPPPPSSHQQQNLHSGVNVPPMAKGHFTLSHPSQFSVSSQSQQYP 121


>At1g64790 hypothetical protein
          Length = 2428

 Score = 30.0 bits (66), Expect = 1.5
 Identities = 15/69 (21%), Positives = 31/69 (44%), Gaps = 2/69 (2%)

Query: 111 NDMASQVDKALLSSSSSSLS--LSHPPPPPPSFTIATIVVVAIATGSITASNSTVVITNT 168
           +D  S     ++SS +S       H  P  PS  +    ++ I++ ++    S+ ++   
Sbjct: 474 SDFLSITGDQIVSSRTSDADNPADHQAPFVPSVEVLVKALIVISSAAVAGPPSSWIVRAI 533

Query: 169 HCHHHPPLM 177
            C HHP ++
Sbjct: 534 FCSHHPSIV 542


>At1g16610 arginine/serine-rich protein
          Length = 414

 Score = 30.0 bits (66), Expect = 1.5
 Identities = 14/20 (70%), Positives = 15/20 (75%)

Query: 123 SSSSSSLSLSHPPPPPPSFT 142
           SSS+SS S S PPPPPP  T
Sbjct: 395 SSSNSSSSSSPPPPPPPRKT 414


>At5g38690 unknown protein
          Length = 572

 Score = 29.6 bits (65), Expect = 2.0
 Identities = 21/93 (22%), Positives = 45/93 (47%), Gaps = 6/93 (6%)

Query: 193 KHVTSGADKERLVRGTEAGVPKVSLDDVVRWSVPFLMVERTMVQGL---TDREGKPEQEF 249
           K + +G   E +V   ++   K+ L D V  +    ++++ + +G    T  +G  E   
Sbjct: 175 KDINNGCSNENVVV-KKSNPKKIKLSDSVHATTE--VIKKVLAEGKKKKTTAKGIKENNV 231

Query: 250 IGRNKFVKEKVQEKSEDRIEVDISYSTIVIRLN 282
             + K +K  +++K ED++E+ I    I I ++
Sbjct: 232 AEKIKRIKPALKKKEEDQVEIKIPQGNISITVS 264


>At5g04810 membrane-associated salt-inducible protein-like
          Length = 952

 Score = 29.6 bits (65), Expect = 2.0
 Identities = 16/49 (32%), Positives = 26/49 (52%), Gaps = 8/49 (16%)

Query: 99  HPLYDVLNKPTINDMASQVDKALLSSSSSSLS--------LSHPPPPPP 139
           +PL  + N+ +++ +      + +SS  SSL+        LS PPPPPP
Sbjct: 81  NPLKGLTNRSSVSPLVQSEVSSKVSSFGSSLASKLRLSSKLSPPPPPPP 129


>At4g36080 ATM-like protein
          Length = 3738

 Score = 29.6 bits (65), Expect = 2.0
 Identities = 31/113 (27%), Positives = 44/113 (38%), Gaps = 10/113 (8%)

Query: 40   FSEWVASFSSFFGGSCSLYVE--LLAKKHGLLLDINLGYR-WVICESDSTTVIHLLEALA 96
            F   V   S F   +  L V   ++  +     D N+ Y  WV+        +H  E +A
Sbjct: 2428 FDSIVMKHSQFLSAASKLQVADVVIPLRELAHTDANVAYHLWVLVFPIVWATLHKEEQIA 2487

Query: 97   MPHPLYDVLNKPTINDMASQ---VDKALLSSSSSSLSLSHPPPPPPSFTIATI 146
            +  P+  +L+K            V +ALL      L LSHP P  PS  I  I
Sbjct: 2488 LAKPMISLLSKDYHKKQQGHRPNVVQALLEG----LQLSHPQPRMPSELIKYI 2536


>At2g31220 putative transcription factor BHLH10
          Length = 458

 Score = 29.3 bits (64), Expect = 2.6
 Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 9/49 (18%)

Query: 127 SSLSLSHPPPPPPSFTIATIVVVAIATGSITASNSTVVITNTHCHHHPP 175
           SS S + PPPPPP   +A         GS + SN +V +      H  P
Sbjct: 27  SSFSQAEPPPPPPQVLVA---------GSTSNSNCSVEVEELSEFHLSP 66


>At2g22350 putative non-LTR retroelement reverse transcriptase
          Length = 321

 Score = 29.3 bits (64), Expect = 2.6
 Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 8/81 (9%)

Query: 35  IIRYSFSEWVASFSSFFGGSCSLYVELLAKKHGLLLDINLGYRWVICESDSTTVI-HLLE 93
           ++R     W+  F+   G   +   EL    +GL +    G R V  E DS  V+  L  
Sbjct: 181 VLRDHNGAWIGGFAVNIGVCSAPLAELWGVYYGLFIAWGRGARRVELEVDSKMVVGFLTT 240

Query: 94  ALAMPHPL-------YDVLNK 107
            +A  HPL       YD L+K
Sbjct: 241 GIADSHPLSFLLRLCYDFLSK 261


>At1g62760 hypothetical protein
          Length = 312

 Score = 29.3 bits (64), Expect = 2.6
 Identities = 18/55 (32%), Positives = 28/55 (50%)

Query: 108 PTINDMASQVDKALLSSSSSSLSLSHPPPPPPSFTIATIVVVAIATGSITASNST 162
           P  +  +S    +L  SS   LSLS   PPPP  + + +  ++ ++ S T SN T
Sbjct: 89  PPSSSPSSAPPSSLSPSSPPPLSLSPSSPPPPPPSSSPLSSLSPSSSSSTYSNQT 143



 Score = 29.3 bits (64), Expect = 2.6
 Identities = 21/65 (32%), Positives = 33/65 (50%), Gaps = 3/65 (4%)

Query: 115 SQVDKALLSSSSS---SLSLSHPPPPPPSFTIATIVVVAIATGSITASNSTVVITNTHCH 171
           S    + LS SS    SLS S PPPPPPS +  + +  +++    ++S S+   ++    
Sbjct: 46  SSAPPSSLSPSSPPPLSLSPSSPPPPPPSSSPLSSLSPSLSPSPPSSSPSSAPPSSLSPS 105

Query: 172 HHPPL 176
             PPL
Sbjct: 106 SPPPL 110


>At5g21280 putative protein
          Length = 302

 Score = 28.9 bits (63), Expect = 3.3
 Identities = 24/65 (36%), Positives = 29/65 (43%), Gaps = 11/65 (16%)

Query: 92  LEALAMPHPLYDVLNKPTINDM------ASQVDKALLSSSSSSLSLSHP-----PPPPPS 140
           L  L    PL   LN    ND        S    +  SSSSSS+  ++P     P PPP+
Sbjct: 131 LNFLLPNQPLGLNLNFQDFNDFIQTSSTTSSSSSSSTSSSSSSIFPTNPHIYSSPSPPPT 190

Query: 141 FTIAT 145
           FT AT
Sbjct: 191 FTTAT 195


>At5g15390 rRNA methylase - like protein
          Length = 350

 Score = 28.9 bits (63), Expect = 3.3
 Identities = 22/66 (33%), Positives = 33/66 (49%), Gaps = 3/66 (4%)

Query: 119 KALLSSSSSSLSLSHPPPPPPSFTIATIVVVAIATGSITASNSTVVITNTHCHHHP-PLM 177
           K+L+ +S SS      P P P+F+I  I V A +  S+   NS+ ++T     H    L 
Sbjct: 5   KSLIRASISSFPFV--PLPNPNFSITFISVRAFSPLSVLHPNSSCIVTARRTFHGAVALS 62

Query: 178 PLFITE 183
           P  +TE
Sbjct: 63  PESLTE 68


>At1g07970 unknown protein
          Length = 693

 Score = 28.9 bits (63), Expect = 3.3
 Identities = 15/60 (25%), Positives = 31/60 (51%), Gaps = 3/60 (5%)

Query: 103 DVLNKPTINDMASQVDKALLSSSSSSLSLSHPPPPPPSFTIATIVVVAIATGSITASNST 162
           D+  +  +  + +++D+ +  S+     +  PPP   SF +A+   V  +TG+  A+ ST
Sbjct: 256 DITTEEQLEQLLAEIDEKITESAGK---MRTPPPTVGSFAMASPSTVGGSTGASGATRST 312


  Database: ara_mips
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,978,382
  Number of sequences in database:  6832
  
  Database: /data/blast2/ara_mips_chr2
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 1,737,135
  Number of sequences in database:  4184
  
  Database: /data/blast2/ara_mips_chr3
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,236,886
  Number of sequences in database:  5377
  
  Database: /data/blast2/ara_mips_chr4
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 1,748,816
  Number of sequences in database:  4030
  
  Database: /data/blast2/ara_mips_chr5
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,569,679
  Number of sequences in database:  6098
  
  Database: /data/blast2/ara_mips_chl
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 25,951
  Number of sequences in database:  85
  
  Database: /data/blast2/ara_mips_mit
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 21,747
  Number of sequences in database:  113
  
Lambda     K      H
   0.319    0.134    0.395 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,589,034
Number of Sequences: 26719
Number of extensions: 290790
Number of successful extensions: 2581
Number of sequences better than 10.0: 51
Number of HSP's better than 10.0 without gapping: 21
Number of HSP's successfully gapped in prelim test: 31
Number of HSP's that attempted gapping in prelim test: 2310
Number of HSP's gapped (non-prelim): 201
length of query: 287
length of database: 11,318,596
effective HSP length: 98
effective length of query: 189
effective length of database: 8,700,134
effective search space: 1644325326
effective search space used: 1644325326
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (27.3 bits)


Lotus: description of TM0098b.3