Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0588b.3
         (432 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_198319.1| aspartyl protease family protein [Arabidopsis t...   318  2e-85
gb|AAD38257.1| Hypothetical Protein [Arabidopsis thaliana] gi|15...   310  6e-83
gb|AAY78629.1| aspartyl protease family protein [Arabidopsis tha...   278  2e-73
ref|NP_850251.1| aspartyl protease family protein [Arabidopsis t...   269  1e-70
sp|Q766C3|NEP1_NEPGR Aspartic proteinase nepenthesin-1 precursor...   240  5e-62
ref|XP_550383.1| putative CDR1 [Oryza sativa (japonica cultivar-...   232  1e-59
ref|NP_914417.1| P0509B06.7 [Oryza sativa (japonica cultivar-gro...   232  1e-59
sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 precursor...   230  7e-59
ref|XP_462658.1| OSJNBa0064H22.10 [Oryza sativa (japonica cultiv...   225  2e-57
gb|AAP21262.1| At2g03200 [Arabidopsis thaliana] gi|7487145|pir||...   220  6e-56
ref|XP_482870.1| putative nucleoid DNA-binding protein [Oryza sa...   220  8e-56
gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein...   216  8e-55
gb|AAY78698.1| aspartyl protease family protein [Arabidopsis tha...   214  4e-54
gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein...   214  4e-54
gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein...   211  4e-53
gb|AAV85724.1| At2g28040 [Arabidopsis thaliana] gi|28392898|gb|A...   206  1e-51
gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein...   206  1e-51
gb|AAP31963.1| At1g01300 [Arabidopsis thaliana] gi|22135930|gb|A...   206  1e-51
gb|AAN15613.1| unknown protein [Arabidopsis thaliana] gi|2046651...   205  2e-51
dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryz...   205  2e-51

>ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
           gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
          Length = 437

 Score =  318 bits (816), Expect = 2e-85
 Identities = 176/430 (40%), Positives = 257/430 (58%), Gaps = 17/430 (3%)

Query: 10  LCFLVVLYLLSCQTPIEAQDAGFSVQLTRQNSPHSPFYKPDNLHRHKLPS-FHQVPKKAF 68
           LC L  L+L +     +    GF+  L  ++SP SPFY P      +L +  H+   + F
Sbjct: 12  LCLLSSLFLSNANAKPKL---GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVF 68

Query: 69  APNGPFSTR-----VTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQK 123
                 +T      +TSN+G+YLM +++G+PP  I  + DTGSDL+W QC+PC  CY Q 
Sbjct: 69  HFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQV 128

Query: 124 SPMFEPLSSKTFNPIPCDSEQCGSLFSH-SCSP-QKLCAYSYSYADSSVTKGVLARETIT 181
            P+F+P +S T+  + C S QC +L +  SCS     C+YS SY D+S TKG +A +T+T
Sbjct: 129 DPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLT 188

Query: 182 FSSPTNGDELVVGDIIFGCGHSNSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQC 241
             S ++   + + +II GCGH+N+G FN+   G++GLGGGP+SL+ Q+G      +FS C
Sbjct: 189 LGS-SDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG-KFSYC 246

Query: 242 LVPFHADSRTSGTISFGDASDVSGEGVVTTPLVSEEGQ-TPYLVTLEGISVGDTFVSFNS 300
           LVP  +    +  I+FG  + VSG GVV+TPL+++  Q T Y +TL+ ISVG   + ++ 
Sbjct: 247 LVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSG 306

Query: 301 SE-KLSKGNMMIDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCYRSETNL 359
           S+ + S+GN++IDSGT  T LP EFY  L E+    S       DP  G  LCY +  +L
Sbjct: 307 SDSESSEGNIIIDSGTTLTLLPTEFYSEL-EDAVASSIDAEKKQDPQSGLSLCYSATGDL 365

Query: 360 EGPILTAHFEGADVQLMPIQTFIPPKDGVFCFAMAGTADGDYIFGNFAQSNILIGFDLDR 419
           + P++T HF+GADV+L     F+   + + CFA  G+     I+GN AQ N L+G+D   
Sbjct: 366 KVPVITMHFDGADVKLDSSNAFVQVSEDLVCFAFRGSPSFS-IYGNVAQMNFLVGYDTVS 424

Query: 420 KTISFKPTDC 429
           KT+SFKPTDC
Sbjct: 425 KTVSFKPTDC 434


>gb|AAD38257.1| Hypothetical Protein [Arabidopsis thaliana]
           gi|15217764|ref|NP_176663.1| aspartyl protease family
           protein [Arabidopsis thaliana] gi|25404498|pir||E96671
           hypothetical protein F13O11.13 [imported] - Arabidopsis
           thaliana
          Length = 431

 Score =  310 bits (794), Expect = 6e-83
 Identities = 175/431 (40%), Positives = 247/431 (56%), Gaps = 15/431 (3%)

Query: 10  LCFLVVLYLLSCQTPIEAQDAGFSVQLTRQNSPHSPFY--------KPDNLHRHKLPSFH 61
           L F  +L LL           GF++ L  ++SP SPFY        +  N  R    S  
Sbjct: 4   LIFATLLSLLLLSNVNAYPKDGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTL 63

Query: 62  QVPKKAFAPNGPFSTRVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYK 121
           Q      +PN P S  +TSN G+YLM +++G+PPV I  + DTGSDL+W QC+PC  CY+
Sbjct: 64  QFSNDDASPNSPQSF-ITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQ 122

Query: 122 QKSPMFEPLSSKTFNPIPCDSEQCGSLFSHSCS-PQKLCAYSYSYADSSVTKGVLARETI 180
           Q SP+F+P  S T+  + C S QC +L   SCS  +  C+Y+ +Y D+S TKG +A +T+
Sbjct: 123 QTSPLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTV 182

Query: 181 TFSSPTNGDELVVGDIIFGCGHSNSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQ 240
           T  S +    + + ++I GCGH N+G F+    G+IGLGGG  SLVSQ+       +FS 
Sbjct: 183 TMGS-SGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSING-KFSY 240

Query: 241 CLVPFHADSRTSGTISFGDASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVSFNS 300
           CLVPF +++  +  I+FG    VSG+GVV+T +V ++  T Y + LE ISVG   + F S
Sbjct: 241 CLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTS 300

Query: 301 S-EKLSKGNMMIDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCYRSETNL 359
           +     +GN++IDSGT  T LP  FY  L E +   +       DPD    LCYR  ++ 
Sbjct: 301 TIFGTGEGNIVIDSGTTLTLLPSNFYYEL-ESVVASTIKAERVQDPDGILSLCYRDSSSF 359

Query: 360 EGPILTAHFEGADVQLMPIQTFIPPKDGVFCFAMAGTADGDYIFGNFAQSNILIGFDLDR 419
           + P +T HF+G DV+L  + TF+   + V CFA A   +   IFGN AQ N L+G+D   
Sbjct: 360 KVPDITVHFKGGDVKLGNLNTFVAVSEDVSCFAFAAN-EQLTIFGNLAQMNFLVGYDTVS 418

Query: 420 KTISFKPTDCT 430
            T+SFK TDC+
Sbjct: 419 GTVSFKKTDCS 429


>gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
           gi|15222357|ref|NP_174430.1| aspartyl protease family
           protein [Arabidopsis thaliana] gi|25513600|pir||E86440
           probable chloroplast nucleoid DNA binding protein
           T8E3.12 - Arabidopsis thaliana
           gi|12322538|gb|AAG51267.1| chloroplast nucleoid DNA
           binding protein, putative [Arabidopsis thaliana]
          Length = 445

 Score =  278 bits (712), Expect = 2e-73
 Identities = 159/444 (35%), Positives = 238/444 (52%), Gaps = 27/444 (6%)

Query: 7   FFHLCFLVVLYLLSCQTPIEAQDAGFSVQLTRQNSPHSPFYKP-----DNLHRHKLPSFH 61
           F +   L + +  +  +   A     +V+L  ++SPHSP Y P     D L+   L S  
Sbjct: 6   FLYCSLLAISFFFASNS--SANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSIS 63

Query: 62  QVPKKAFAPNGPFSTRVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYK 121
           +   + F       + + SN G+Y M +++G+PP  ++ + DTGSDL W QC PC  CYK
Sbjct: 64  R--SRRFTTKTDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYK 121

Query: 122 QKSPMFEPLSSKTFNPIPCDSEQCGSLFSH--SCSPQK-LCAYSYSYADSSVTKGVLARE 178
           Q SP+F+   S T+    CDS+ C +L  H   C   K +C Y YSY D+S TKG +A E
Sbjct: 122 QNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATE 181

Query: 179 TITFSSPTNGDELVVGDIIFGCGHSNSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRF 238
           TI+  S ++G  +     +FGCG++N G F E   G+IGLGGGPLSLVSQ+G+  G ++F
Sbjct: 182 TISIDS-SSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIG-KKF 239

Query: 239 SQCLVPFHADSRTSGTISFGDASDVSG----EGVVTTPLVSEEGQTPYLVTLEGISVGDT 294
           S CL    A +  +  I+ G  S  S        +TTPL+ ++ +T Y +TLE ++VG T
Sbjct: 240 SYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKT 299

Query: 295 FVSF-------NSSEKLSKGNMMIDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDL 347
            + +       N       GN++IDSGT  T L   FYD     ++   +     +DP  
Sbjct: 300 KLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQG 359

Query: 348 GTQLCYRS-ETNLEGPILTAHFEGADVQLMPIQTFIPPKDGVFCFAMAGTADGDYIFGNF 406
               C++S +  +  P +T HF  ADV+L PI  F+   +   C +M  T +   I+GN 
Sbjct: 360 LLTHCFKSGDKEIGLPAITMHFTNADVKLSPINAFVKLNEDTVCLSMIPTTE-VAIYGNM 418

Query: 407 AQSNILIGFDLDRKTISFKPTDCT 430
            Q + L+G+DL+ KT+SF+  DC+
Sbjct: 419 VQMDFLVGYDLETKTVSFQRMDCS 442


>ref|NP_850251.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 447

 Score =  269 bits (688), Expect = 1e-70
 Identities = 166/452 (36%), Positives = 245/452 (53%), Gaps = 38/452 (8%)

Query: 4   ILCFFHLCFLVVLYLLSCQTPIEAQDAGFSVQLTRQNSPHSPFYKPDNLHRHKL-PSFHQ 62
           +LCFF L F V L               FSV+L  ++SP SP Y P      +L  +F +
Sbjct: 6   LLCFF-LFFSVTL-------SSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLR 57

Query: 63  VPKKAFAPNGPFS-----TRVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCH 117
              ++   N   S     + +   +G++ M +T+G+PP+ ++ + DTGSDL W QC PC 
Sbjct: 58  SVSRSRRFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQ 117

Query: 118 GCYKQKSPMFEPLSSKTFNPIPCDSEQCGSLFS--HSC-SPQKLCAYSYSYADSSVTKGV 174
            CYK+  P+F+   S T+   PCDS  C +L S    C     +C Y YSY D S +KG 
Sbjct: 118 QCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGD 177

Query: 175 LARETITFSSPTNGDELVVGDIIFGCGHSNSGAFNENDMGVIGLGGGPLSLVSQMGALYG 234
           +A ET++  S  +G  +     +FGCG++N G F+E   G+IGLGGG LSL+SQ+G+   
Sbjct: 178 VATETVSIDS-ASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI- 235

Query: 235 SRRFSQCLVPFHADSRTSGT--ISFGDASDVSG----EGVVTTPLVSEEGQTPYLVTLEG 288
           S++FS CL   H  + T+GT  I+ G  S  S      GVV+TPLV +E  T Y +TLE 
Sbjct: 236 SKKFSYCL--SHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEA 293

Query: 289 ISVGDTFVSFNSSE---------KLSKGNMMIDSGTPATYLPQEFYDRLVEELKVQSSLL 339
           ISVG   + +  S            + GN++IDSGT  T L   F+D+    ++   +  
Sbjct: 294 ISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGA 353

Query: 340 PVDNDPDLGTQLCYRSETNLEG-PILTAHFEGADVQLMPIQTFIPPKDGVFCFAMAGTAD 398
              +DP      C++S +   G P +T HF GADV+L PI  F+   + + C +M  T +
Sbjct: 354 KRVSDPQGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVPTTE 413

Query: 399 GDYIFGNFAQSNILIGFDLDRKTISFKPTDCT 430
              I+GNFAQ + L+G+DL+ +T+SF+  DC+
Sbjct: 414 -VAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444


>sp|Q766C3|NEP1_NEPGR Aspartic proteinase nepenthesin-1 precursor (Nepenthesin-I)
           gi|41016421|dbj|BAD07474.1| aspartic proteinase
           nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  240 bits (613), Expect = 5e-62
 Identities = 145/415 (34%), Positives = 215/415 (50%), Gaps = 26/415 (6%)

Query: 26  EAQDAGFSVQLTRQNSPHSPFYKPDNLHRHKLPSFHQVPKKAFAPNGP--FSTRVTSNNG 83
           EA+  GF + L   +S  +   K   L R       ++ +     NGP    T V + +G
Sbjct: 35  EAKVTGFQIMLEHVDSGKN-LTKFQLLERAIERGSRRLQRLEAMLNGPSGVETSVYAGDG 93

Query: 84  DYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSSKTFNPIPCDSE 143
           +YLM L++G+P      ++DTGSDL+W QC PC  C+ Q +P+F P  S +F+ +PC S+
Sbjct: 94  EYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 153

Query: 144 QCGSLFSHSCSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNGDELVVGDIIFGCGHS 203
            C +L S +CS    C Y+Y Y D S T+G +  ET+TF S      + + +I FGCG +
Sbjct: 154 LCQALSSPTCS-NNFCQYTYGYGDGSETQGSMGTETLTFGS------VSIPNITFGCGEN 206

Query: 204 NSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTSGTISFGD-ASD 262
           N G    N  G++G+G GPLSL SQ+       +FS C+ P    S T   +  G  A+ 
Sbjct: 207 NQGFGQGNGAGLVGMGRGPLSLPSQLDV----TKFSYCMTPI--GSSTPSNLLLGSLANS 260

Query: 263 VSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFV-----SFNSSEKLSKGNMMIDSGTPA 317
           V+     TT + S +  T Y +TL G+SVG T +     +F  +     G ++IDSGT  
Sbjct: 261 VTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTL 320

Query: 318 TYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCYRS---ETNLEGPILTAHFEGADVQ 374
           TY     Y  + +E   Q + LPV N    G  LC+++    +NL+ P    HF+G D++
Sbjct: 321 TYFVNNAYQSVRQEFISQIN-LPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLE 379

Query: 375 LMPIQTFIPPKDGVFCFAMAGTADGDYIFGNFAQSNILIGFDLDRKTISFKPTDC 429
           L     FI P +G+ C AM  ++ G  IFGN  Q N+L+ +D     +SF    C
Sbjct: 380 LPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434


>ref|XP_550383.1| putative CDR1 [Oryza sativa (japonica cultivar-group)]
           gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa
           (japonica cultivar-group)] gi|55296112|dbj|BAD67831.1|
           putative CDR1 [Oryza sativa (japonica cultivar-group)]
          Length = 454

 Score =  232 bits (592), Expect = 1e-59
 Identities = 153/432 (35%), Positives = 225/432 (51%), Gaps = 39/432 (9%)

Query: 27  AQDAGFSVQLTRQNSPHSPFYKPD-NLHRHKLPSFHQVPKKAFAPNGPFST--------- 76
           A   GFSV+   ++SP SPF+ P    H   L +  +   +A A  G  S+         
Sbjct: 29  ASGGGFSVEFIHRDSPRSPFHDPAFTAHGRALAAARRSVARAAAIAGSASSSASGGGAAD 88

Query: 77  ----RVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSP---MFEP 129
               +V S + +YLM + LGSPP  +  + DTGSDLVW +C   +      +     F+P
Sbjct: 89  DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDP 148

Query: 130 LSSKTFNPIPCDSEQCGSLFSHSCSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNG- 188
             S T+  + C ++ C +L   +C     CAY Y+Y D S T GVL+ ET TF    +G 
Sbjct: 149 SRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGR 208

Query: 189 --DELVVGDIIFGCGHSNSGAFNENDMGVIGLGGGPLSLVSQM-GALYGSRRFSQCLVPF 245
              ++ VG + FGC  + +G+F  +  G++GLGGG +SLV+Q+ GA    RRFS CLVP 
Sbjct: 209 SPRQVRVGGVKFGCSTATAGSFPAD--GLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPH 266

Query: 246 HADSRTSGTISFGDASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEKLS 305
             ++  S  ++FG  +DV+  G  +TPLV+ +  T Y V L+ + VG+  V+  +S ++ 
Sbjct: 267 SVNA--SSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASSRI- 323

Query: 306 KGNMMIDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCY-----RSETNLE 360
               ++DSGT  T+L       +V+EL  + +L PV + PD   QLCY       E    
Sbjct: 324 ----IVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQS-PDGLLQLCYNVAGREVEAGES 378

Query: 361 GPILTAHF-EGADVQLMPIQTFIPPKDGVFCFAMAGTADGD--YIFGNFAQSNILIGFDL 417
            P LT  F  GA V L P   F+  ++G  C A+  T +     I GN AQ NI +G+DL
Sbjct: 379 IPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDL 438

Query: 418 DRKTISFKPTDC 429
           D  T++F   DC
Sbjct: 439 DAGTVTFAGADC 450


>ref|NP_914417.1| P0509B06.7 [Oryza sativa (japonica cultivar-group)]
          Length = 451

 Score =  232 bits (592), Expect = 1e-59
 Identities = 153/432 (35%), Positives = 225/432 (51%), Gaps = 39/432 (9%)

Query: 27  AQDAGFSVQLTRQNSPHSPFYKPD-NLHRHKLPSFHQVPKKAFAPNGPFST--------- 76
           A   GFSV+   ++SP SPF+ P    H   L +  +   +A A  G  S+         
Sbjct: 26  ASGGGFSVEFIHRDSPRSPFHDPAFTAHGRALAAARRSVARAAAIAGSASSSASGGGAAD 85

Query: 77  ----RVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSP---MFEP 129
               +V S + +YLM + LGSPP  +  + DTGSDLVW +C   +      +     F+P
Sbjct: 86  DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDP 145

Query: 130 LSSKTFNPIPCDSEQCGSLFSHSCSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNG- 188
             S T+  + C ++ C +L   +C     CAY Y+Y D S T GVL+ ET TF    +G 
Sbjct: 146 SRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGR 205

Query: 189 --DELVVGDIIFGCGHSNSGAFNENDMGVIGLGGGPLSLVSQM-GALYGSRRFSQCLVPF 245
              ++ VG + FGC  + +G+F  +  G++GLGGG +SLV+Q+ GA    RRFS CLVP 
Sbjct: 206 SPRQVRVGGVKFGCSTATAGSFPAD--GLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPH 263

Query: 246 HADSRTSGTISFGDASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEKLS 305
             ++  S  ++FG  +DV+  G  +TPLV+ +  T Y V L+ + VG+  V+  +S ++ 
Sbjct: 264 SVNA--SSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASSRI- 320

Query: 306 KGNMMIDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCY-----RSETNLE 360
               ++DSGT  T+L       +V+EL  + +L PV + PD   QLCY       E    
Sbjct: 321 ----IVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQS-PDGLLQLCYNVAGREVEAGES 375

Query: 361 GPILTAHF-EGADVQLMPIQTFIPPKDGVFCFAMAGTADGD--YIFGNFAQSNILIGFDL 417
            P LT  F  GA V L P   F+  ++G  C A+  T +     I GN AQ NI +G+DL
Sbjct: 376 IPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDL 435

Query: 418 DRKTISFKPTDC 429
           D  T++F   DC
Sbjct: 436 DAGTVTFAGADC 447


>sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 precursor (Nepenthesin-II)
           gi|41016423|dbj|BAD07475.1| aspartic proteinase
           nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  230 bits (586), Expect = 7e-59
 Identities = 132/363 (36%), Positives = 198/363 (54%), Gaps = 23/363 (6%)

Query: 76  TRVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSSKTF 135
           T V + +G+YLM + +G+P      ++DTGSDL+W QC PC  C+ Q +P+F P  S +F
Sbjct: 87  TPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSF 146

Query: 136 NPIPCDSEQCGSLFSHSCSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNGDELVVGD 195
           + +PC+S+ C  L S +C+  + C Y+Y Y D S T+G +A ET TF + +      V +
Sbjct: 147 STLPCESQYCQDLPSETCNNNE-CQYTYGYGDGSTTQGYMATETFTFETSS------VPN 199

Query: 196 IIFGCGHSNSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTSGTI 255
           I FGCG  N G    N  G+IG+G GPLSL SQ+G      +FS C+  +   S +  T+
Sbjct: 200 IAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGV----GQFSYCMTSY--GSSSPSTL 253

Query: 256 SFGDASDVSGEGVVTTPLV-SEEGQTPYLVTLEGISVGDTFVSFNSS----EKLSKGNMM 310
           + G A+    EG  +T L+ S    T Y +TL+GI+VG   +   SS    +    G M+
Sbjct: 254 ALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMI 313

Query: 311 IDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCYRSETN---LEGPILTAH 367
           IDSGT  TYLPQ+ Y+ + +    Q +L  VD +   G   C++  ++   ++ P ++  
Sbjct: 314 IDSGTTLTYLPQDAYNAVAQAFTDQINLPTVD-ESSSGLSTCFQQPSDGSTVQVPEISMQ 372

Query: 368 FEGADVQLMPIQTFIPPKDGVFCFAMAGTAD-GDYIFGNFAQSNILIGFDLDRKTISFKP 426
           F+G  + L      I P +GV C AM  ++  G  IFGN  Q    + +DL    +SF P
Sbjct: 373 FDGGVLNLGEQNILISPAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVP 432

Query: 427 TDC 429
           T C
Sbjct: 433 TQC 435


>ref|XP_462658.1| OSJNBa0064H22.10 [Oryza sativa (japonica cultivar-group)]
           gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza
           sativa (japonica cultivar-group)]
          Length = 444

 Score =  225 bits (573), Expect = 2e-57
 Identities = 134/377 (35%), Positives = 194/377 (50%), Gaps = 30/377 (7%)

Query: 69  APNGPFSTRVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFE 128
           A  G     V + NG++LM +++G+P +    +VDTGSDLVW QC PC  C+KQ +P+F+
Sbjct: 79  AGGGDLQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFD 138

Query: 129 PLSSKTFNPIPCDSEQCGSLFSHSCSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNG 188
           P SS T+  +PC S  C  L +  C+    C Y+Y+Y DSS T+GVLA ET T +     
Sbjct: 139 PSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-- 196

Query: 189 DELVVGDIIFGCGHSNSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHAD 248
               +  ++FGCG +N G       G++GLG GPLSLVSQ+G      +FS CL     D
Sbjct: 197 ----LPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGL----DKFSYCLTSL--D 246

Query: 249 SRTSGTISFGDASDVS-----GEGVVTTPLVSEEGQTP-YLVTLEGISVGDTFVSFNSS- 301
              +  +  G  + +S        V TTPL+    Q   Y V+L+ I+VG T +S  SS 
Sbjct: 247 DTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSA 306

Query: 302 ---EKLSKGNMMIDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCYRSET- 357
              +    G +++DSGT  TYL  + Y  L +    Q + LP  +   +G  LC+R+   
Sbjct: 307 FAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA-LPAADGSGVGLDLCFRAPAK 365

Query: 358 ---NLEGPILTAHFEGADVQLMPIQTF--IPPKDGVFCFAMAGTADGDYIFGNFAQSNIL 412
               +E P L  HF+G     +P + +  +    G  C  + G+  G  I GNF Q N  
Sbjct: 366 GVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSR-GLSIIGNFQQQNFQ 424

Query: 413 IGFDLDRKTISFKPTDC 429
             +D+   T+SF P  C
Sbjct: 425 FVYDVGHDTLSFAPVQC 441


>gb|AAP21262.1| At2g03200 [Arabidopsis thaliana] gi|7487145|pir||T02706
           hypothetical protein At2g03200 [imported] - Arabidopsis
           thaliana gi|30678047|ref|NP_565298.2| aspartyl protease
           family protein [Arabidopsis thaliana]
          Length = 461

 Score =  220 bits (561), Expect = 6e-56
 Identities = 137/368 (37%), Positives = 201/368 (54%), Gaps = 33/368 (8%)

Query: 82  NGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSSKTFNPIPCD 141
           +G++LM+L++G+P V    +VDTGSDL+W QC PC  C+ Q +P+F+P  S +++ + C 
Sbjct: 104 SGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCS 163

Query: 142 SEQCGSLFSHSCSPQK-LCAYSYSYADSSVTKGVLARETITFSSPTNGDELVVGDIIFGC 200
           S  C +L   +C+  K  C Y Y+Y D S T+G+LA ET TF      DE  +  I FGC
Sbjct: 164 SGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFE-----DENSISGIGFGC 218

Query: 201 GHSNSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTSGTISFGD- 259
           G  N G       G++GLG GPLSL+SQ+       +FS CL     DS  S ++  G  
Sbjct: 219 GVENEGDGFSQGSGLVGLGRGPLSLISQL----KETKFSYCLTSIE-DSEASSSLFIGSL 273

Query: 260 --------ASDVSGEGVVTTPLVSEEGQTP-YLVTLEGISVGDTFVSFNSS----EKLSK 306
                    + + GE   T  L+    Q   Y + L+GI+VG   +S   S     +   
Sbjct: 274 ASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGT 333

Query: 307 GNMMIDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCYR---SETNLEGPI 363
           G M+IDSGT  TYL +  +  L EE   + S LPVD+    G  LC++   +  N+  P 
Sbjct: 334 GGMIIDSGTTITYLEETAFKVLKEEFTSRMS-LPVDDSGSTGLDLCFKLPDAAKNIAVPK 392

Query: 364 LTAHFEGADVQLMPIQTFI--PPKDGVFCFAMAGTADGDYIFGNFAQSNILIGFDLDRKT 421
           +  HF+GAD++L P + ++      GV C AM G+++G  IFGN  Q N  +  DL+++T
Sbjct: 393 MIFHFKGADLEL-PGENYMVADSSTGVLCLAM-GSSNGMSIFGNVQQQNFNVLHDLEKET 450

Query: 422 ISFKPTDC 429
           +SF PT+C
Sbjct: 451 VSFVPTEC 458


>ref|XP_482870.1| putative nucleoid DNA-binding protein [Oryza sativa (japonica
           cultivar-group)] gi|42407407|dbj|BAD09565.1| putative
           nucleoid DNA-binding protein [Oryza sativa (japonica
           cultivar-group)]
          Length = 448

 Score =  220 bits (560), Expect = 8e-56
 Identities = 135/369 (36%), Positives = 195/369 (52%), Gaps = 25/369 (6%)

Query: 78  VTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSSKTFNP 137
           V ++ G+YLM L +G+PP+    +VDTGSDL+W QC+PC  C  Q +P F P  S T+  
Sbjct: 85  VAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRL 144

Query: 138 IPCDSEQCGSLFSHSCSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNGDELVVGDII 197
           +PC S  C +L   +C  + +C Y Y Y D + T GVLA ET TF +  N  +++V D+ 
Sbjct: 145 VPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGA-ANSSKVMVSDVA 203

Query: 198 FGCGHSNSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTS----- 252
           FGCG+ NSG    N  G++GLG GPLSLVSQ+    G  RFS CL  F +   +      
Sbjct: 204 FGCGNINSGQL-ANSSGMVGLGRGPLSLVSQL----GPSRFSYCLTSFLSPEPSRLNFGV 258

Query: 253 -GTISFGDASDVSGEGVVTTPLVSEEG-QTPYLVTLEGISVGDTFVSFN----SSEKLSK 306
             T++  +AS  SG  V +TPLV      + Y ++L+GIS+G   +  +    +      
Sbjct: 259 FATLNGTNASS-SGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGT 317

Query: 307 GNMMIDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCY----RSETNLEGP 362
           G + IDSGT  T+L Q+ YD +  EL      LP  ND ++G + C+         +  P
Sbjct: 318 GGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVP 377

Query: 363 ILTAHFEGADVQLMPIQTF--IPPKDGVFCFAMAGTADGDYIFGNFAQSNILIGFDLDRK 420
            +  HF+G     +P + +  I    G  C AM  + D   I GN+ Q N+ I +D+   
Sbjct: 378 DMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDAT-IIGNYQQQNMHILYDIANS 436

Query: 421 TISFKPTDC 429
            +SF P  C
Sbjct: 437 LLSFVPAPC 445


>gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana] gi|4063755|gb|AAC98463.1| putative chloroplast
           nucleoid DNA binding protein [Arabidopsis thaliana]
           gi|25407944|pir||H84679 hypothetical protein At2g28030
           [imported] - Arabidopsis thaliana
           gi|15226317|ref|NP_180370.1| aspartyl protease family
           protein [Arabidopsis thaliana]
          Length = 392

 Score =  216 bits (551), Expect = 8e-55
 Identities = 144/403 (35%), Positives = 207/403 (50%), Gaps = 31/403 (7%)

Query: 33  SVQLTRQNSPHSPFYKPDNLHRHKLPSFHQVPKKAFAPNGPFSTRVTSNNGDYLMKLTLG 92
           S+  T  +SPH   +  D + R    S  ++ K       P++  +   N  YLMKL +G
Sbjct: 12  SLFTTTASSPHG--FTIDLIQRRSNSSSSRLSKNQLQGASPYADTLFDYN-IYLMKLQVG 68

Query: 93  SPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSSKTFNPIPCDSEQCGSLFSHS 152
           +PP +I   +DTGSDL+W QC PC  CY Q +P+F+P +S TF    C+           
Sbjct: 69  TPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGNS-------- 120

Query: 153 CSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNGDELVVGDIIFGCGHSNSGAFNEND 212
                 C Y   YAD++ +KG LA ET+T  S T+G+  V+ +   GCGH NS  F    
Sbjct: 121 ------CHYKIIYADTTYSKGTLATETVTIHS-TSGEPFVMPETTIGCGH-NSSWFKPTF 172

Query: 213 MGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTSGTISFGDASDVSGEGVV-TT 271
            G++GL  GP SL++QMG  Y        L+ +   S+ +  I+FG  + V+G+GVV TT
Sbjct: 173 SGMVGLSWGPSSLITQMGGEYPG------LMSYCFASQGTSKINFGTNAIVAGDGVVSTT 226

Query: 272 PLVSEEGQTPYLVTLEGISVGDTFV-SFNSSEKLSKGNMMIDSGTPATYLPQEFYDRLVE 330
             ++      Y + L+ +SVGDT V +  ++    +GN++IDSGT  TY P   Y  LV 
Sbjct: 227 MFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFPVS-YCNLVR 285

Query: 331 ELKVQSSLLPVDNDPDLGTQLCYRSETNLEGPILTAHFE-GADVQLMPIQTFIPP-KDGV 388
           E            DP     LCY ++T    P++T HF  GAD+ L     +I     G 
Sbjct: 286 EAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYNMYIETITRGT 345

Query: 389 FCFA-MAGTADGDYIFGNFAQSNILIGFDLDRKTISFKPTDCT 430
           FC A +      D IFGN AQ+N L+G+D     +SF PT+C+
Sbjct: 346 FCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388


>gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score =  214 bits (545), Expect = 4e-54
 Identities = 143/403 (35%), Positives = 206/403 (50%), Gaps = 31/403 (7%)

Query: 33  SVQLTRQNSPHSPFYKPDNLHRHKLPSFHQVPKKAFAPNGPFSTRVTSNNGDYLMKLTLG 92
           S+  T  +SPH   +  D + R    S  ++ K       P++  +   N  YLMKL +G
Sbjct: 12  SLFTTTASSPHG--FTIDLIQRRSNSSSSRLSKNQLQGASPYADTLFDYN-IYLMKLQVG 68

Query: 93  SPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSSKTFNPIPCDSEQCGSLFSHS 152
           +PP +I   +DTGSDL+W QC PC  CY Q +P+F+P +S TF    C+           
Sbjct: 69  TPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGNS-------- 120

Query: 153 CSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNGDELVVGDIIFGCGHSNSGAFNEND 212
                 C Y   YAD++ +KG LA ET+T  S T+G+  V+ +   GCGH NS  F    
Sbjct: 121 ------CHYKIIYADTTYSKGTLATETVTIHS-TSGEPFVMPETTIGCGH-NSSWFKPTF 172

Query: 213 MGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTSGTISFGDASDVSGEGVV-TT 271
            G++GL  GP SL++QMG  Y        L+ +   S+ +  I+FG  + V+G+GVV TT
Sbjct: 173 SGMVGLSWGPSSLITQMGGEYPG------LMSYCFASQGTSKINFGTNAIVAGDGVVSTT 226

Query: 272 PLVSEEGQTPYLVTLEGISVGDTFV-SFNSSEKLSKGNMMIDSGTPATYLPQEFYDRLVE 330
             ++      Y + L+ +SVGDT V +  ++    +GN++IDSGT  TY P   Y  LV 
Sbjct: 227 MFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFPVS-YCNLVR 285

Query: 331 ELKVQSSLLPVDNDPDLGTQLCYRSETNLEGPILTAHFE-GADVQLMPIQTFIPP-KDGV 388
           E            DP     LCY ++T    P++T HF  GAD+ L     +I     G 
Sbjct: 286 EAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYNMYIETITRGT 345

Query: 389 FCFA-MAGTADGDYIFGNFAQSNILIGFDLDRKTISFKPTDCT 430
           FC A +      D IFGN AQ+N L+G+D     + F PT+C+
Sbjct: 346 FCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388


>gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  214 bits (545), Expect = 4e-54
 Identities = 135/363 (37%), Positives = 196/363 (53%), Gaps = 33/363 (9%)

Query: 87  MKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSSKTFNPIPCDSEQCG 146
           M+L++G+P V    +VDTGSDL+W QC PC  C+ Q +P+F+P  S +++ + C S  C 
Sbjct: 1   MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60

Query: 147 SLFSHSCSPQK-LCAYSYSYADSSVTKGVLARETITFSSPTNGDELVVGDIIFGCGHSNS 205
           +L   +C+  K  C Y Y+Y D S T+G+LA ET TF      DE  +  I FGCG  N 
Sbjct: 61  ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFE-----DENSISGIGFGCGVENE 115

Query: 206 GAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTSGTISFGD------ 259
           G       G++GLG GPLSL+SQ+       +FS CL     DS  S ++  G       
Sbjct: 116 GDGFSQGSGLVGLGRGPLSLISQL----KETKFSYCLTSIE-DSEASSSLFIGSLASGIV 170

Query: 260 ---ASDVSGEGVVTTPLVSEEGQTP-YLVTLEGISVGDTFVSFNSS----EKLSKGNMMI 311
               + + GE   T  L+    Q   Y + L+GI+VG   +S   S     +   G M+I
Sbjct: 171 NKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMII 230

Query: 312 DSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCYR---SETNLEGPILTAHF 368
           DSGT  TYL +  +  L EE   + S LPVD+    G  LC++   +  N+  P +  HF
Sbjct: 231 DSGTTITYLEETAFKVLKEEFTSRMS-LPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHF 289

Query: 369 EGADVQLMPIQTFI--PPKDGVFCFAMAGTADGDYIFGNFAQSNILIGFDLDRKTISFKP 426
           +GAD++L P + ++      GV C AM G+++G  IFGN  Q N  +  DL+++T+SF P
Sbjct: 290 KGADLEL-PGENYMVADSSTGVLCLAM-GSSNGMSIFGNVQQQNFNVLHDLEKETVSFVP 347

Query: 427 TDC 429
           T+C
Sbjct: 348 TEC 350


>gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana] gi|25407941|pir||F84679 hypothetical protein
           At2g28010 [imported] - Arabidopsis thaliana
           gi|15226315|ref|NP_180368.1| aspartyl protease family
           protein [Arabidopsis thaliana]
          Length = 396

 Score =  211 bits (537), Expect = 4e-53
 Identities = 139/399 (34%), Positives = 208/399 (51%), Gaps = 33/399 (8%)

Query: 37  TRQNSPHSPFYKPDNLHRHKLPSFHQVPKKAFAPNGPFSTRVTSNNGDYLMKLTLGSPPV 96
           T  + PH   +  D +HR    S         + + P++  V  N+  YLMKL +G+PP 
Sbjct: 22  TTASPPHG--FTMDLIHRRSNASSRV--SNTQSGSSPYANTVFDNSV-YLMKLQVGTPPF 76

Query: 97  DIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSSKTFNPIPCDSEQCGSLFSHSCSPQ 156
           +I  ++DTGS++ W QC PC  CY+Q +P+F+P  S TF    CD         HS    
Sbjct: 77  EIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCD--------GHS---- 124

Query: 157 KLCAYSYSYADSSVTKGVLARETITFSSPTNGDELVVGDIIFGCGHSNSGAFNENDMGVI 216
             C Y   Y D + T G LA ETIT  S T+G+  V+ + I GCGH+NS  F  +  G++
Sbjct: 125 --CPYEVDYFDHTYTMGTLATETITLHS-TSGEPFVMPETIIGCGHNNSW-FKPSFSGMV 180

Query: 217 GLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTSGTISFGDASDVSGEGVVTTPLVSE 276
           GL  GP SL++QMG  Y        L+ +    + +  I+FG  + V+G+GVV+T +   
Sbjct: 181 GLNWGPSSLITQMGGEYPG------LMSYCFSGQGTSKINFGANAIVAGDGVVSTTMFMT 234

Query: 277 EGQTP-YLVTLEGISVGDTFV-SFNSSEKLSKGNMMIDSGTPATYLPQEFYDRLVEELKV 334
             +   Y + L+ +SVG+T + +  ++    +GN++IDSGT  TY P   Y  LV +   
Sbjct: 235 TAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGTTLTYFPVS-YCNLVRQAVE 293

Query: 335 QSSLLPVDNDPDLGTQLCYRSETNLEGPILTAHFE-GADVQLMPIQTFIPPKD-GVFCFA 392
                    DP     LCY S+T    P++T HF  G D+ L     ++   + GVFC A
Sbjct: 294 HVVTAVRAADPTGNDMLCYNSDTIDIFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLA 353

Query: 393 -MAGTADGDYIFGNFAQSNILIGFDLDRKTISFKPTDCT 430
            +  +   + IFGN AQ+N L+G+D     +SF PT+C+
Sbjct: 354 IICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392


>gb|AAV85724.1| At2g28040 [Arabidopsis thaliana] gi|28392898|gb|AAO41885.1|
           putative chloroplast nucleoid DNA binding protein
           [Arabidopsis thaliana] gi|30683732|ref|NP_180371.2|
           aspartyl protease family protein [Arabidopsis thaliana]
          Length = 395

 Score =  206 bits (524), Expect = 1e-51
 Identities = 135/352 (38%), Positives = 190/352 (53%), Gaps = 29/352 (8%)

Query: 84  DYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSSKTFNPIPCDSE 143
           +YLMKL +G+PP +I  ++DTGS+ +W QC PC  CY Q +P+F+P  S TF  I CD+ 
Sbjct: 64  EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDTH 123

Query: 144 QCGSLFSHSCSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNGDELVVGDIIFGCGHS 203
                  HS      C Y   Y   S TKG L  ET+T  S T+G   V+ + I GCG +
Sbjct: 124 ------DHS------CPYELVYGGKSYTKGTLVTETVTIHS-TSGQPFVMPETIIGCGRN 170

Query: 204 NSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTSGTISFGDASDV 263
           NSG F     GV+GL  GP SL++QMG  Y        L+ +    + +  I+FG  + V
Sbjct: 171 NSG-FKPGFAGVVGLDRGPKSLITQMGGEYPG------LMSYCFAGKGTSKINFGANAIV 223

Query: 264 SGEGVV-TTPLVSEEGQTPYLVTLEGISVGDTFV-SFNSSEKLSKGNMMIDSGTPATYLP 321
           +G+GVV TT  V       Y + L+ +SVG+T + +  +     KGN++IDSG+  TY P
Sbjct: 224 AGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYFP 283

Query: 322 QEFYDRLVEELKVQSSLLPVDNDPDLGTQLCYRSETNLEGPILTAHFE-GADVQLMPIQT 380
            E Y  LV +   Q          D+   LCY S+T    P++T HF  GAD+ L     
Sbjct: 284 -ESYCNLVRKAVEQVVTAVRFPRSDI---LCYYSKTIDIFPVITMHFSGGADLVLDKYNM 339

Query: 381 FIPPK-DGVFCFA-MAGTADGDYIFGNFAQSNILIGFDLDRKTISFKPTDCT 430
           ++     GVFC A +  +   + IFGN AQ+N L+G+D     +SFKPT+C+
Sbjct: 340 YVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391


>gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana] gi|4063754|gb|AAC98462.1| putative chloroplast
           nucleoid DNA binding protein [Arabidopsis thaliana]
           gi|25407946|pir||A84680 hypothetical protein At2g28040
           [imported] - Arabidopsis thaliana
          Length = 389

 Score =  206 bits (524), Expect = 1e-51
 Identities = 135/352 (38%), Positives = 190/352 (53%), Gaps = 29/352 (8%)

Query: 84  DYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSSKTFNPIPCDSE 143
           +YLMKL +G+PP +I  ++DTGS+ +W QC PC  CY Q +P+F+P  S TF  I CD+ 
Sbjct: 58  EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDTH 117

Query: 144 QCGSLFSHSCSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNGDELVVGDIIFGCGHS 203
                  HS      C Y   Y   S TKG L  ET+T  S T+G   V+ + I GCG +
Sbjct: 118 ------DHS------CPYELVYGGKSYTKGTLVTETVTIHS-TSGQPFVMPETIIGCGRN 164

Query: 204 NSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTSGTISFGDASDV 263
           NSG F     GV+GL  GP SL++QMG  Y        L+ +    + +  I+FG  + V
Sbjct: 165 NSG-FKPGFAGVVGLDRGPKSLITQMGGEYPG------LMSYCFAGKGTSKINFGANAIV 217

Query: 264 SGEGVV-TTPLVSEEGQTPYLVTLEGISVGDTFV-SFNSSEKLSKGNMMIDSGTPATYLP 321
           +G+GVV TT  V       Y + L+ +SVG+T + +  +     KGN++IDSG+  TY P
Sbjct: 218 AGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYFP 277

Query: 322 QEFYDRLVEELKVQSSLLPVDNDPDLGTQLCYRSETNLEGPILTAHFE-GADVQLMPIQT 380
            E Y  LV +   Q          D+   LCY S+T    P++T HF  GAD+ L     
Sbjct: 278 -ESYCNLVRKAVEQVVTAVRFPRSDI---LCYYSKTIDIFPVITMHFSGGADLVLDKYNM 333

Query: 381 FIPPK-DGVFCFA-MAGTADGDYIFGNFAQSNILIGFDLDRKTISFKPTDCT 430
           ++     GVFC A +  +   + IFGN AQ+N L+G+D     +SFKPT+C+
Sbjct: 334 YVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 385


>gb|AAP31963.1| At1g01300 [Arabidopsis thaliana] gi|22135930|gb|AAM91547.1|
           chloroplast nucleoid DNA binding protein, putative
           [Arabidopsis thaliana] gi|15223368|ref|NP_171637.1|
           aspartyl protease family protein [Arabidopsis thaliana]
           gi|25518405|pir||C86143 hypothetical protein F6F3.10 -
           Arabidopsis thaliana gi|9665144|gb|AAF97328.1| Unknown
           protein [Arabidopsis thaliana]
          Length = 485

 Score =  206 bits (523), Expect = 1e-51
 Identities = 140/371 (37%), Positives = 189/371 (50%), Gaps = 24/371 (6%)

Query: 72  GPFSTRVTSN----NGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMF 127
           G FS+ V S     +G+Y  +L +G+P   +Y ++DTGSD+VW QC+PC  CY Q  P+F
Sbjct: 125 GGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIF 184

Query: 128 EPLSSKTFNPIPCDSEQCGSLFSHSCSP-QKLCAYSYSYADSSVTKGVLARETITFSSPT 186
           +P  SKT+  IPC S  C  L S  C+  +K C Y  SY D S T G  + ET+TF    
Sbjct: 185 DPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNR 244

Query: 187 NGDELVVGDIIFGCGHSNSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFH 246
                 V  +  GCGH N G F     G++GLG G LS   Q G  + +++FS CLV   
Sbjct: 245 ------VKGVALGCGHDNEGLF-VGAAGLLGLGKGKLSFPGQTGHRF-NQKFSYCLVDRS 296

Query: 247 ADSRTSGTISFGDASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFV-----SFNSS 301
           A S+ S  + FG+A+ VS     T  L + +  T Y V L GISVG T V     S    
Sbjct: 297 ASSKPSSVV-FGNAA-VSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKL 354

Query: 302 EKLSKGNMMIDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCYRSETN-LE 360
           +++  G ++IDSGT  T L +  Y  + +  +V +  L    D  L       S  N ++
Sbjct: 355 DQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVK 414

Query: 361 GPILTAHFEGADVQLMPIQTFIPPKD--GVFCFAMAGTADGDYIFGNFAQSNILIGFDLD 418
            P +  HF GADV L P   ++ P D  G FCFA AGT  G  I GN  Q    + +DL 
Sbjct: 415 VPTVVLHFRGADVSL-PATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLA 473

Query: 419 RKTISFKPTDC 429
              + F P  C
Sbjct: 474 SSRVGFAPGGC 484


>gb|AAN15613.1| unknown protein [Arabidopsis thaliana] gi|20466516|gb|AAM20575.1|
           unknown protein [Arabidopsis thaliana]
           gi|15222611|ref|NP_173922.1| aspartyl protease family
           protein [Arabidopsis thaliana] gi|25518510|pir||D86385
           hypothetical protein F2J7.6 - Arabidopsis thaliana
           gi|12321511|gb|AAG50814.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 483

 Score =  205 bits (522), Expect = 2e-51
 Identities = 131/368 (35%), Positives = 188/368 (50%), Gaps = 31/368 (8%)

Query: 73  PFSTRVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSS 132
           P  +  T  +G+Y  ++ +G P  ++Y ++DTGSD+ W QC+PC  CY Q  P+FEP SS
Sbjct: 136 PLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSS 195

Query: 133 KTFNPIPCDSEQCGSLFSHSCSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNGDELV 192
            ++ P+ CD+ QC +L    C     C Y  SY D S T G  A ET+T  S       +
Sbjct: 196 SSYEPLSCDTPQCNALEVSECR-NATCLYEVSYGDGSYTVGDFATETLTIGS------TL 248

Query: 193 VGDIIFGCGHSNSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTS 252
           V ++  GCGHSN G F     G++GLGGG L+L SQ+     +  FS CLV    DS ++
Sbjct: 249 VQNVAVGCGHSNEGLF-VGAAGLLGLGGGLLALPSQL----NTTSFSYCLV--DRDSDSA 301

Query: 253 GTISFGDASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVSFNSS----EKLSKGN 308
            T+ FG  + +S + VV   L + +  T Y + L GISVG   +    S    ++   G 
Sbjct: 302 STVDFG--TSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGG 359

Query: 309 MMIDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQL---CYR--SETNLEGPI 363
           ++IDSGT  T L  E Y+ L +   V+ +L   D +   G  +   CY   ++T +E P 
Sbjct: 360 IIIDSGTAVTRLQTEIYNSLRDSF-VKGTL---DLEKAAGVAMFDTCYNLSAKTTVEVPT 415

Query: 364 LTAHFEGADVQLMPIQTFIPPKD--GVFCFAMAGTADGDYIFGNFAQSNILIGFDLDRKT 421
           +  HF G  +  +P + ++ P D  G FC A A TA    I GN  Q    + FDL    
Sbjct: 416 VAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSL 475

Query: 422 ISFKPTDC 429
           I F    C
Sbjct: 476 IGFSSNKC 483


>dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa (japonica
           cultivar-group)]
          Length = 453

 Score =  205 bits (522), Expect = 2e-51
 Identities = 129/372 (34%), Positives = 188/372 (49%), Gaps = 33/372 (8%)

Query: 80  SNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCSPCHGCYKQKSPMFEPLSSKTFNPIP 139
           S + +Y++ L +G+PP  I  L+DTGSDL+W QC  C  C +Q  P+F P  S ++ P+ 
Sbjct: 93  SGDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMR 152

Query: 140 CDSEQCGSLFSHSCSPQKLCAYSYSYADSSVTKGVLARETITFSSPTNGDELVVGDIIFG 199
           C  + CG +  HSC     C Y YSY D + T G  A E  TF+S +   + V   + FG
Sbjct: 153 CAGQLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSV--PLGFG 210

Query: 200 CGHSNSGAFNENDMGVIGLGGGPLSLVSQMGALYGSRRFSQCLVPFHADSRTSGTISFGD 259
           CG  N G+ N N  G++G G  PLSLVSQ+      RRFS CL P+ A SR S T+ FG 
Sbjct: 211 CGTMNVGSLN-NASGIVGFGRDPLSLVSQLSI----RRFSYCLTPY-ASSRKS-TLQFGS 263

Query: 260 ASDV-----SGEGVVTTPLV-SEEGQTPYLVTLEGISVGDTFVSFNSS----EKLSKGNM 309
            +DV     +   V TTP++ S +  T Y V   G++VG   +   +S         G +
Sbjct: 264 LADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGV 323

Query: 310 MIDSGTPATYLPQEFYDRLVEELKVQSSLLPVDNDPDLGTQLCYRSETNLEG-------- 361
           +IDSGT  T  P      +V   + Q   LP  N       +C+ +     G        
Sbjct: 324 IIDSGTALTLFPVAVLAEVVRAFRSQLR-LPFANGSSPDDGVCFAAPAVAAGGGRMARQV 382

Query: 362 --PILTAHFEGADVQLMPIQTFI--PPKDGVFCFAMAGTADGDYIFGNFAQSNILIGFDL 417
             P +  HF+GAD+ L P + ++    + G  C  +  + D     GNF Q ++ + +DL
Sbjct: 383 AVPRMVFHFQGADLDL-PRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDL 441

Query: 418 DRKTISFKPTDC 429
           +R+T+SF P +C
Sbjct: 442 ERETLSFAPVEC 453


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.319    0.137    0.419 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 800,743,431
Number of Sequences: 2540612
Number of extensions: 37071836
Number of successful extensions: 79418
Number of sequences better than 10.0: 653
Number of HSP's better than 10.0 without gapping: 272
Number of HSP's successfully gapped in prelim test: 381
Number of HSP's that attempted gapping in prelim test: 78099
Number of HSP's gapped (non-prelim): 738
length of query: 432
length of database: 863,360,394
effective HSP length: 131
effective length of query: 301
effective length of database: 530,540,222
effective search space: 159692606822
effective search space used: 159692606822
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)

