Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC149547.3 + phase: 0 
         (492 letters)

Database: uniref100 
           2,790,947 sequences; 848,049,833 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

UniRef100_Q9SZT5 Hypothetical protein AT4g35880 [Arabidopsis tha...   516  e-145
UniRef100_Q8VYV9 Putative chloroplast nucleoid DNA-binding prote...   483  e-135
UniRef100_Q7X5X8 OSJNBb0071D01.4 protein [Oryza sativa]               469  e-130
UniRef100_Q7XJ13 Chloroplast nucleoid DNA-binding protein-like p...   429  e-119
UniRef100_O65625 Hypothetical protein T19K4.10 [Arabidopsis thal...   405  e-111
UniRef100_Q84WU7 Hypothetical protein At3g51330 [Arabidopsis tha...   397  e-109
UniRef100_Q651C2 Aspartic proteinase nepenthesin II-like [Oryza ...   390  e-107
UniRef100_Q8SB30 Putative chloroplast nucleoid DNA-binding prote...   389  e-106
UniRef100_Q9SD16 Hypothetical protein F24M12.370 [Arabidopsis th...   384  e-105
UniRef100_Q9SD15 Hypothetical protein F24M12.380 [Arabidopsis th...   338  2e-91
UniRef100_Q6Z8K1 Aspartyl protease-like [Oryza sativa]                308  3e-82
UniRef100_Q9LX20 Hypothetical protein T31P16_70 [Arabidopsis tha...   302  1e-80
UniRef100_Q9SD14 Hypothetical protein F24M12.390 [Arabidopsis th...   280  9e-74
UniRef100_Q9SD13 Hypothetical protein F24M12.400 [Arabidopsis th...   209  1e-52
UniRef100_Q5Z8N8 Aspartic proteinase nepenthesin II-like [Oryza ...   206  2e-51
UniRef100_Q94AV8 AT3g51330/F24M12_370 [Arabidopsis thaliana]          192  1e-47
UniRef100_Q9M8R6 Putative aspartyl protease [Arabidopsis thaliana]    132  3e-29
UniRef100_Q5ZAZ8 Putative aspartic proteinase nepenthesin II [Or...   129  1e-28
UniRef100_Q9FPD6 Hypothetical protein At1g08210 [Arabidopsis tha...   127  1e-27
UniRef100_Q9FG60 Aspartyl protease-like [Arabidopsis thaliana]        125  3e-27

>UniRef100_Q9SZT5 Hypothetical protein AT4g35880 [Arabidopsis thaliana]
          Length = 455

 Score =  516 bits (1330), Expect = e-145
 Identities = 255/373 (68%), Positives = 301/373 (80%), Gaps = 14/373 (3%)

Query: 4   FTKIIVIILIILHLSM-CCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYA 62
           F   + +I I++ LS   CN  IFTF MHHR+S+ VK+WS S    + ++P KGS EY+ 
Sbjct: 6   FKTTLFLIPILMLLSFGSCNGRIFTFEMHHRFSDEVKQWSDSTGRFA-KFPPKGSFEYFN 64

Query: 63  ELADRDRFLRGRRLSQFDA----GLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVAL 118
            L  RD  +RGRRLS+ ++     L FSDGNST RISSLGFLHYTT++LGTPG++FMVAL
Sbjct: 65  ALVLRDWLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVAL 124

Query: 119 DTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQ 178
           DTGSDLFWVPCDC +C+ T  + +AS    +F+LS+YNP  S+T+KKVTCNNSLC  RNQ
Sbjct: 125 DTGSDLFWVPCDCGKCAPTEGATYAS----EFELSIYNPKVSTTNKKVTCNNSLCAQRNQ 180

Query: 179 CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLD 238
           CLGTFS CPYMVSYVSA+TSTSGIL+EDV+HLT  D N + VEA V FGCGQVQSGSFLD
Sbjct: 181 CLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLD 240

Query: 239 VAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNV 298
           +AAPNGLFGLGMEKISVPS+L+REG  ADSFSMCFG DG+GRISFGDKGS DQ+ETPFN+
Sbjct: 241 IAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNL 300

Query: 299 NPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRP 358
           NPSHP YNIT+ +VRVGTTLID EFTALFD+GTSFTYLVDP Y+ +SES     +D+R  
Sbjct: 301 NPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQDKRHS 356

Query: 359 PDSRIPFDYCYDM 371
           PDSRIPF+YCYDM
Sbjct: 357 PDSRIPFEYCYDM 369


>UniRef100_Q8VYV9 Putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 513

 Score =  483 bits (1243), Expect = e-135
 Identities = 239/448 (53%), Positives = 311/448 (69%), Gaps = 11/448 (2%)

Query: 4   FTKIIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAE 63
           F  +++++     L  C     F F  HHR+S+ V         P    P + S +YY  
Sbjct: 11  FLGLLILLASSWVLDRCEGFGEFGFEFHHRFSDQVV-----GVLPGDGLPNRDSSKYYRV 65

Query: 64  LADRDRFLRGRRLSQFDAGLA-FSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGS 122
           +A RDR +RGRRL+  D  L  FSDGN T R+ +LGFLHY  + +GTP   FMVALDTGS
Sbjct: 66  MAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGS 125

Query: 123 DLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGT 182
           DLFW+PCDCT C     +   S+L    DL++Y+PN SSTS KV CN++LCT  ++C   
Sbjct: 126 DLFWLPCDCTNCVRELKAPGGSSL----DLNIYSPNASSTSTKVPCNSTLCTRGDRCASP 181

Query: 183 FSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAP 242
            S+CPY + Y+S  TS++G+LVEDVLHL   D +   + A V FGCGQVQ+G F D AAP
Sbjct: 182 ESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAP 241

Query: 243 NGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSH 302
           NGLFGLG+E ISVPS+L++EG  A+SFSMCFG DG GRISFGDKGS+DQ ETP N+   H
Sbjct: 242 NGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPH 301

Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRR-RPPDS 361
           PTYNIT+ ++ VG    D+EF A+FDSGTSFTYL D  Y+ +SESF+S   D+R +  DS
Sbjct: 302 PTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDS 361

Query: 362 RIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAELN 421
            +PF+YCY +SP+ ++   P+++LTM GGS + VY P+++I  +   VYCLA++K  +++
Sbjct: 362 ELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKIEDIS 421

Query: 422 IIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           IIGQNFMTGYRVVFDR KLILGWK+SDC
Sbjct: 422 IIGQNFMTGYRVVFDREKLILGWKESDC 449


>UniRef100_Q7X5X8 OSJNBb0071D01.4 protein [Oryza sativa]
          Length = 530

 Score =  469 bits (1206), Expect = e-130
 Identities = 238/432 (55%), Positives = 305/432 (70%), Gaps = 25/432 (5%)

Query: 31  HHRYSEPVKKWS----HSAPSPSHRWPEKGSVEYYAELADRDRFLRGRRLSQFDAG---- 82
           HHR+S PV++W+    H  P     WPE GS +Y A L   DR  R    +  D G    
Sbjct: 38  HHRFSSPVQRWAEARGHVLPGG---WPEHGSADYVAALNGHDR-RRALSAAGGDGGGGGD 93

Query: 83  ----LAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATR 138
               L FS+GN+T ++S+LGFLHY  + +GTPG  FMVALDTGSDLFW+PC C  C+   
Sbjct: 94  KPPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 153

Query: 139 SSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETS 198
           S+A  SA       S Y P+ SSTS+ V CN+  C  R +C  T S CPY + YVSA+TS
Sbjct: 154 SAASGSA-------SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTS 205

Query: 199 TSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSM 258
           +SG LVEDVL+L+  D    +++A ++FGCGQVQ+GSFLD AAPNGLFGLG++ IS+PS+
Sbjct: 206 SSGFLVEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSI 265

Query: 259 LSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTL 318
           L+++G T++SF+MCF RDGIGRISFGD+GS DQ+ETP +VNP HPTY I+I+++ VG +L
Sbjct: 266 LAQKGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSL 325

Query: 319 IDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTS 378
            D+EF+ +FD+GTSFTYL DP Y+ +++SFH+QV   R   DSRIPF+YCYD+S   +  
Sbjct: 326 TDLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRI 385

Query: 379 LIPSMSLTMGGGSRFVVYDPIIIISTQS-ELVYCLAVVKSAELNIIGQNFMTGYRVVFDR 437
             PS+SL   GGS F V D   +IS Q  E VYCLA+VKSA+LNIIGQNFMTG RVVFDR
Sbjct: 386 QTPSISLRTVGGSVFPVIDEGQVISIQQHEYVYCLAIVKSAKLNIIGQNFMTGLRVVFDR 445

Query: 438 GKLILGWKKSDC 449
            + ILGWKK +C
Sbjct: 446 ERKILGWKKFNC 457


>UniRef100_Q7XJ13 Chloroplast nucleoid DNA-binding protein-like protein [Oryza
           sativa]
          Length = 732

 Score =  429 bits (1104), Expect = e-119
 Identities = 222/449 (49%), Positives = 303/449 (67%), Gaps = 20/449 (4%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++V+  ++L L+    A   +  +HHRYS  V++W+ +A +P H     G+ EYYA LA 
Sbjct: 8   VVVVRAVLLLLAAVAAAEALSLDVHHRYSAAVRRWA-AAAAPPH-----GTAEYYAALAG 61

Query: 67  RDRFLRGRRLSQFDAG----LAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGS 122
            D  LR R L     G     AF+DGN T+R++  GFLHY  + LGTP V F+VALDTGS
Sbjct: 62  HDG-LRRRSLGVGGGGGGAEFAFADGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGS 120

Query: 123 DLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGT 182
           DLFWVPCDC +C+  +S  + S     FD  VY+P  S+TS+KV C+++LC  +N C   
Sbjct: 121 DLFWVPCDCLKCAPFQSPNYGSL---KFD--VYSPAQSTTSRKVPCSSNLCDLQNACRSK 175

Query: 183 FSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAP 242
            ++CPY + Y+S  TS+SG+LVEDVL+LT       +V A ++FGCGQVQ+GSFL  AAP
Sbjct: 176 SNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAP 235

Query: 243 NGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSH 302
           NGL GLGM+  SVPS+L+ +G  A+SFSMCFG DG GRI+FGD GS DQ ETP NV   +
Sbjct: 236 NGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQN 295

Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSR 362
           P YNITI  + VG+  I  EF+A+ DSGTSFT L DP Y++++ SF +Q+   R   DS 
Sbjct: 296 PYYNITITGITVGSKSISTEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSS 355

Query: 363 IPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQ--SELVYCLAVVKSAEL 420
           +PF++CY +S  +N  + P++SLT  GGS F V DPII I+    + + YCLA++KS  +
Sbjct: 356 MPFEFCYSVS--ANGIVHPNVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGV 413

Query: 421 NIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           N+IG+NFM+G +VVFDR +++LGWK  +C
Sbjct: 414 NLIGENFMSGLKVVFDRERMVLGWKNFNC 442


>UniRef100_O65625 Hypothetical protein T19K4.10 [Arabidopsis thaliana]
          Length = 335

 Score =  405 bits (1040), Expect = e-111
 Identities = 193/257 (75%), Positives = 220/257 (85%), Gaps = 8/257 (3%)

Query: 115 MVALDTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCT 174
           MVALDTGSDLFWVPCDC +C+ T  + +AS    +F+LS+YNP  S+T+KKVTCNNSLC 
Sbjct: 1   MVALDTGSDLFWVPCDCGKCAPTEGATYAS----EFELSIYNPKVSTTNKKVTCNNSLCA 56

Query: 175 HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSG 234
            RNQCLGTFS CPYMVSYVSA+TSTSGIL+EDV+HLT  D N + VEA V FGCGQVQSG
Sbjct: 57  QRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSG 116

Query: 235 SFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDET 294
           SFLD+AAPNGLFGLGMEKISVPS+L+REG  ADSFSMCFG DG+GRISFGDKGS DQ+ET
Sbjct: 117 SFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEET 176

Query: 295 PFNVNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVED 354
           PFN+NPSHP YNIT+ +VRVGTTLID EFTALFD+GTSFTYLVDP Y+ +SES     +D
Sbjct: 177 PFNLNPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQD 232

Query: 355 RRRPPDSRIPFDYCYDM 371
           +R  PDSRIPF+YCYDM
Sbjct: 233 KRHSPDSRIPFEYCYDM 249


>UniRef100_Q84WU7 Hypothetical protein At3g51330 [Arabidopsis thaliana]
          Length = 529

 Score =  397 bits (1019), Expect = e-109
 Identities = 199/450 (44%), Positives = 290/450 (64%), Gaps = 13/450 (2%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++ ++++   L  C  +  F+F +HH +S+ VK+    +       PEKGS+EY+  LA 
Sbjct: 10  LLSLLVVCWGLERCEASGKFSFEVHHMFSDRVKQ----SLGLDDLVPEKGSLEYFKVLAQ 65

Query: 67  RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
           RDR +RGR L+    +  + F  GN T  I  LGFLHY  + +GTP   F+VALDTGSDL
Sbjct: 66  RDRLIRGRGLASNNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDL 125

Query: 125 FWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFS 184
           FW+PC+C   S          L+    L++Y+PN SSTS  + C++  C   ++C    S
Sbjct: 126 FWLPCNCG--STCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPAS 183

Query: 185 NCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNG 244
           +CPY + Y+S +T T+G L EDVLHL   D+  + V+AN+  GCG+ Q+G     AA NG
Sbjct: 184 SCPYQIQYLSKDTFTTGTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNG 243

Query: 245 LFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPSH 302
           L GLG++  SVPS+L++   TA+SFSMCFG   D +GRISFGDKG  DQ ETP       
Sbjct: 244 LLGLGLKDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPS 303

Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSR 362
           PTY +++ +V VG   + V+  ALFD+GTSFT+L++P Y  ++++F   V D+RRP D  
Sbjct: 304 PTYAVSVTEVSVGGDAVGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPE 363

Query: 363 IPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIII-STQSELVYCLAVVKSAE-- 419
           +PF++CYD+SP+  T L P +++T  GGS+  + +P+ I+ +  +  +YCL ++KS +  
Sbjct: 364 LPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFK 423

Query: 420 LNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           +NIIGQNFM+GYR+VFDR ++ILGWK+SDC
Sbjct: 424 INIIGQNFMSGYRIVFDRERMILGWKRSDC 453


>UniRef100_Q651C2 Aspartic proteinase nepenthesin II-like [Oryza sativa]
          Length = 476

 Score =  390 bits (1002), Expect = e-107
 Identities = 202/445 (45%), Positives = 280/445 (62%), Gaps = 49/445 (11%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++V+  ++L L+    A   +  +HHRYS  V++W+ +A  P                  
Sbjct: 8   VVVVRAVLLLLAAVAAAEALSLDVHHRYSAAVRRWAAAAAPP------------------ 49

Query: 67  RDRFLRGRRLSQFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFW 126
                                 + T  ++  GFLHY  + LGTP V F+VALDTGSDLFW
Sbjct: 50  ----------------------HGTADLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFW 87

Query: 127 VPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNC 186
           VPCDC +C+  +S  + S     FD  VY+P  S+TS+KV C+++LC  +N C    ++C
Sbjct: 88  VPCDCLKCAPFQSPNYGSL---KFD--VYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSC 142

Query: 187 PYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLF 246
           PY + Y+S  TS+SG+LVEDVL+LT       +V A ++FGCGQVQ+GSFL  AAPNGL 
Sbjct: 143 PYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLL 202

Query: 247 GLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYN 306
           GLGM+  SVPS+L+ +G  A+SFSMCFG DG GRI+FGD GS DQ ETP NV   +P YN
Sbjct: 203 GLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYN 262

Query: 307 ITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFD 366
           ITI  + VG+  I  EF+A+ DSGTSFT L DP Y++++ SF +Q+   R   DS +PF+
Sbjct: 263 ITITGITVGSKSISTEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFE 322

Query: 367 YCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQ--SELVYCLAVVKSAELNIIG 424
           +CY +S  +N  + P++SLT  GGS F V DPII I+    + + YCLA++KS  +N+IG
Sbjct: 323 FCYSVS--ANGIVHPNVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIG 380

Query: 425 QNFMTGYRVVFDRGKLILGWKKSDC 449
           +NFM+G +VVFDR +++LGWK  +C
Sbjct: 381 ENFMSGLKVVFDRERMVLGWKNFNC 405


>UniRef100_Q8SB30 Putative chloroplast nucleoid DNA-binding protein [Oryza sativa]
          Length = 551

 Score =  389 bits (998), Expect = e-106
 Identities = 215/445 (48%), Positives = 278/445 (62%), Gaps = 26/445 (5%)

Query: 28  FTMHHRYSEPVKKWSHSAPSPSHRWPEK----GSVEYYAELADRDRFLRGRR-LSQFDAG 82
           F +HHRYS  V++W+         WP      GS EYY+ L+  D  L  RR L+Q D  
Sbjct: 29  FDLHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDGL 88

Query: 83  LAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAF 142
           + F+DGN T R+   G LHY  + +GTP   F+VALDTGSDLFWVPCDC +C A   +  
Sbjct: 89  VTFADGNITLRLD--GSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQC-APLGNLT 145

Query: 143 ASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGI 202
           A       +L  Y+P+ SSTSK VTC ++LC   N C    S+CPY V Y  A TS+SG 
Sbjct: 146 AVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGE 205

Query: 203 LVEDVLHLTQPDDNHDL-----VEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPS 257
           LVEDVL+LT+            V   V+FGCGQVQ+GSFLD AA +GL GLGMEK+SVPS
Sbjct: 206 LVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPS 265

Query: 258 MLSREGFT-ADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGT 316
           +L+  G   ++SFSMCF +DG+GRI+FGD GS DQ ETPF V  +H  YNI+I  + VG 
Sbjct: 266 ILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHSYYNISITSMSVGD 325

Query: 317 TLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRR-----RPPDSRIPFDYCYDM 371
             + + F A+ DSGTSFTYL DP Y+  + +F++Q+ +RR            PF+YCY +
Sbjct: 326 KNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSL 385

Query: 372 SPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSE------LVYCLAVVKS-AELNIIG 424
           SPD  T  +P +SLT  GG+ F V  P+  I+ Q        + YCLAV+KS   ++IIG
Sbjct: 386 SPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPIDIIG 445

Query: 425 QNFMTGYRVVFDRGKLILGWKKSDC 449
           QNFMTG +VVF+R K +LGW+K DC
Sbjct: 446 QNFMTGLKVVFNREKSVLGWQKFDC 470


>UniRef100_Q9SD16 Hypothetical protein F24M12.370 [Arabidopsis thaliana]
          Length = 519

 Score =  384 bits (987), Expect = e-105
 Identities = 198/449 (44%), Positives = 288/449 (64%), Gaps = 21/449 (4%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++ ++++   L  C  +  F+F +HH +S+ VK+    +       PEKGS+EY+  LA 
Sbjct: 10  LLSLLVVCWGLERCEASGKFSFEVHHMFSDRVKQ----SLGLDDLVPEKGSLEYFKVLAQ 65

Query: 67  RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
           RDR +RGR L+    +  + F  GN T  I  LGFLHY  + +GTP   F+VALDTGSDL
Sbjct: 66  RDRLIRGRGLASNNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDL 125

Query: 125 FWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFS 184
           FW+PC+C   S          L+    L++Y+PN SSTS  + C++  C   ++C    S
Sbjct: 126 FWLPCNCG--STCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPAS 183

Query: 185 NCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNG 244
           +CPY + Y+S +T T+G L EDVLHL   D+  + V+AN+  GCG+ Q+G     AA NG
Sbjct: 184 SCPYQIQYLSKDTFTTGTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNG 243

Query: 245 LFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPSH 302
           L GLG++  SVPS+L++   TA+SFSMCFG   D +GRISFGDKG  DQ ETP  + P+ 
Sbjct: 244 LLGLGLKDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETP--LLPTE 301

Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSR 362
           P    ++ +V VG   + V+  ALFD+GTSFT+L++P Y  ++++F   V D+RRP D  
Sbjct: 302 P----SVTEVSVGGDAVGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPE 357

Query: 363 IPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAE--L 420
           +PF++CYD+SP+  T L P +++T  GGS+  + +P+ I    +  +YCL ++KS +  +
Sbjct: 358 LPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFI---DNSAMYCLGILKSVDFKI 414

Query: 421 NIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           NIIGQNFM+GYR+VFDR ++ILGWK+SDC
Sbjct: 415 NIIGQNFMSGYRIVFDRERMILGWKRSDC 443


>UniRef100_Q9SD15 Hypothetical protein F24M12.380 [Arabidopsis thaliana]
          Length = 518

 Score =  338 bits (867), Expect = 2e-91
 Identities = 188/462 (40%), Positives = 268/462 (57%), Gaps = 36/462 (7%)

Query: 9   VIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRD 68
           +++LI   L  C  +  F+F +HH +S+ VK+            PE GS+EY+  LA RD
Sbjct: 1   MLVLIFWGLERCEASGKFSFEVHHMFSDVVKQ----TLGFDDLVPENGSLEYFKVLAHRD 56

Query: 69  RFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFW 126
           RF+RGR L+    +  L     N T  ++ LGFLHY  + LGTP   F+VALDTGSDLFW
Sbjct: 57  RFIRGRGLASNNEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFW 116

Query: 127 VPCDC-TRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSN 185
           +PC+C T C      A     +    L++Y PN S+TS  + C++  C    +C    S 
Sbjct: 117 LPCNCGTTCIHDLKDA---RFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESI 173

Query: 186 CPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGL 245
           CPY ++ +S+ T T+G L++DVLHL   D++   V ANV  GCGQ Q+G+F    A NG+
Sbjct: 174 CPYQIA-LSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGV 232

Query: 246 FGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPSHP 303
            GL M++ SVPS+L++   TA+SFSMCFGR    +GRISFGDKG  DQ+ETP     +  
Sbjct: 233 LGLSMKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETST 292

Query: 304 TYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRI 363
            Y + +  V VG   +DV   ALFD+G+SFT L++  Y   +++F   +ED+RRP D   
Sbjct: 293 AYGVNVTGVSVGGVPVDVPLFALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDF 352

Query: 364 PFDYCYDMSPDS-NTSLIPSMSLTMGGGSRFVVYDPI---------------IIISTQSE 407
           PF++CYD+  +  N+   P    +         Y+P                +  S +  
Sbjct: 353 PFEFCYDLREEHLNSDARPRHMQSK-------CYNPCRDDFRWRIQNDSQESVSYSNEGT 405

Query: 408 LVYCLAVVKSAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
            +YCL ++KS  LNIIGQN M+G+R+VFDR ++ILGWK+S+C
Sbjct: 406 KMYCLGILKSINLNIIGQNLMSGHRIVFDRERMILGWKQSNC 447


>UniRef100_Q6Z8K1 Aspartyl protease-like [Oryza sativa]
          Length = 520

 Score =  308 bits (788), Expect = 3e-82
 Identities = 180/449 (40%), Positives = 257/449 (57%), Gaps = 15/449 (3%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           +++++L          A   +  M HR S+  +    +      RWP +GS +Y+  L  
Sbjct: 9   LLLLLLAAAAAVAAAEAATLSARMVHRLSDEARL--AAGARGGRRWPRRGSGDYFRALVR 66

Query: 67  RDRFLRGRRLSQFDAGLAFSDGNSTFRISS-LGFLHYTTIELGTPGVKFMVALDTGSDLF 125
            D   + RR+      L+ S G S F   + LG+L+YT +++GTP   F+VALDTGSDLF
Sbjct: 67  SDLQRQKRRVGGKYQLLSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLF 126

Query: 126 WVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSN 185
           WVPCDC +C+    S++  +L  D DL +Y P+ S+TS+ + C++ LC+  + C      
Sbjct: 127 WVPCDCIQCAPL--SSYHGSL--DRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQP 182

Query: 186 CPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGL 245
           CPY + Y S  T++SG+L+ED+LHL   +  H  V A+VI GCG+ QSGS+L+  AP+GL
Sbjct: 183 CPYNIDYFSENTTSSGLLIEDMLHLDSRE-GHAPVNASVIIGCGKKQSGSYLEGIAPDGL 241

Query: 246 FGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPF-NVNPSHPT 304
            GLGM  ISVPS L+R G   +SFSMCF +D  GRI FGD+G   Q  TPF  +N    T
Sbjct: 242 LGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQT 301

Query: 305 YNITINQVRVGTTLID-VEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRI 363
           Y + +++  +G    +   F AL D+GTSFT L    Y  ++  F  Q+   R   D   
Sbjct: 302 YAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDD-Y 360

Query: 364 PFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIIS-TQSEL-VYCLAVVKSAE-L 420
            F+YCY   P      +P+++LT      F   +PI+  +  Q E  V+CLAV+ S E +
Sbjct: 361 SFEYCYSTGPLEMPD-VPTITLTFAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPV 419

Query: 421 NIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
            IIGQNFM GY VVFDR  + LGW +S+C
Sbjct: 420 GIIGQNFMVGYHVVFDRENMKLGWYRSEC 448


>UniRef100_Q9LX20 Hypothetical protein T31P16_70 [Arabidopsis thaliana]
          Length = 528

 Score =  302 bits (774), Expect = 1e-80
 Identities = 174/440 (39%), Positives = 252/440 (56%), Gaps = 19/440 (4%)

Query: 23  AHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRDRFLRGRRLSQFDAG 82
           A +F+  + HR+S+  +  S   PS S   P K S+EYY  LA+ D   +   L      
Sbjct: 22  ASLFSSRLIHRFSDEGRA-SIKTPSSSDSLPNKQSLEYYRLLAESDFRRQRMNLGAKVQS 80

Query: 83  LAFSDGNSTFRISS-LGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSA 141
           L  S+G+ T    +  G+LHYT I++GTP V F+VALDTGS+L W+PC+C +C A  +S 
Sbjct: 81  LVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQC-APLTST 139

Query: 142 FASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSG 201
           + S+LA+  DL+ YNP+ SSTSK   C++ LC   + C      CPY V+Y+S  TS+SG
Sbjct: 140 YYSSLATK-DLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSG 198

Query: 202 ILVEDVLHLTQPDDNHDL-----VEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVP 256
           +LVED+LHLT   +N  +     V+A V+ GCG+ QSG +LD  AP+GL GLG  +ISVP
Sbjct: 199 LLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVP 258

Query: 257 SMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNV--NPSHPTYNITINQVRV 314
           S LS+ G   +SFS+CF  +  GRI FGD G   Q  TPF    N  +  Y + +    +
Sbjct: 259 SFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCI 318

Query: 315 GTT-LIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSP 373
           G + L    FT   DSG SFTYL +  Y +++      +    +  +  + ++YCY+ S 
Sbjct: 319 GNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKNFEG-VSWEYCYESSA 377

Query: 374 DSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELV-YCLAVVKSAELNI--IGQNFMTG 430
           +     +P++ L     + FV++ P+ +      LV +CL +  S +  I  IGQN+M G
Sbjct: 378 EPK---VPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSGQEGIGSIGQNYMRG 434

Query: 431 YRVVFDRGKLILGWKKSDCK 450
           YR+VFDR  + LGW  S C+
Sbjct: 435 YRMVFDRENMKLGWSPSKCQ 454


>UniRef100_Q9SD14 Hypothetical protein F24M12.390 [Arabidopsis thaliana]
          Length = 475

 Score =  280 bits (715), Expect = 9e-74
 Identities = 165/453 (36%), Positives = 236/453 (51%), Gaps = 73/453 (16%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++ ++++      C     F F +HH +S+ VK+    +       PE+GS+EY+  LA 
Sbjct: 10  LLSVLVVCWGFERCEATGKFGFEVHHIFSDSVKQ----SLGLGDLVPEQGSLEYFKVLAH 65

Query: 67  RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
           RDR +RGR L+    +  + F  GN T  +  LG L+Y  + +GTP   F+VALDTGSDL
Sbjct: 66  RDRLIRGRGLASNNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDL 125

Query: 125 FWVPCDC-TRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTF 183
           FW+PC+C T C           +     L++Y PN S+TS  + C++  C    +C    
Sbjct: 126 FWLPCNCGTTCIRDLEDI---GVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPS 182

Query: 184 SNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPN 243
           S CPY +SY S  T T G L++DVLHL   D+N   V+ANV  GCGQ Q+G F    + N
Sbjct: 183 SICPYQISY-SNSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVN 241

Query: 244 GLFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPS 301
           G+ GLG++  SVPS+L++   TA+SFSMCFGR    +GRISFG                 
Sbjct: 242 GVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVGRISFG----------------- 284

Query: 302 HPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDS 361
                                               D  Y+   E+    V  RRRP D 
Sbjct: 285 ------------------------------------DRGYTDQEETPFISVAPRRRPVDP 308

Query: 362 RIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQS-----ELVYCLAVVK 416
            +PF++CYD+SP++ T   P + +T  GGS+ ++ +P     TQ+      ++YCL V+K
Sbjct: 309 ELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQARHGEGNVMYCLGVLK 368

Query: 417 SAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           S  L I   NF+ GYR+VFDR ++ILGWK+S C
Sbjct: 369 SVGLKI--NNFVAGYRIVFDRERMILGWKQSLC 399


>UniRef100_Q9SD13 Hypothetical protein F24M12.400 [Arabidopsis thaliana]
          Length = 426

 Score =  209 bits (532), Expect = 1e-52
 Identities = 110/258 (42%), Positives = 167/258 (64%), Gaps = 9/258 (3%)

Query: 178 QCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFL 237
           +C+   S+CPY + Y+S  + ++G+LVEDV+H++  +   +  +A + FG  + Q G F 
Sbjct: 127 RCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEG--EARDARITFG--ESQLGLFK 182

Query: 238 DVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFN 297
           +VA  NG+ GL +  I+VP+ML + G  +DSFSMCFG +G G ISFGDKGS DQ ETP +
Sbjct: 183 EVAV-NGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQLETPLS 241

Query: 298 VNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRR 357
              S   Y+++I + +VG   +D EFTA FDSGT+ T+L++P Y+ L+ +FH  V DRR 
Sbjct: 242 GTISPMFYDVSITKFKVGKVTVDTEFTATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRL 301

Query: 358 PPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQ--SELVYCLAVV 415
                 PF++CY ++  S+   +PS+S  M GG+ + V+ PI++  T   S  VYCLAV+
Sbjct: 302 SKSVDSPFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVL 361

Query: 416 K--SAELNIIGQNFMTGY 431
           K  +A+ +IIG+N   G+
Sbjct: 362 KQVNADFSIIGRNDTNGF 379


>UniRef100_Q5Z8N8 Aspartic proteinase nepenthesin II-like [Oryza sativa]
          Length = 307

 Score =  206 bits (523), Expect = 2e-51
 Identities = 108/218 (49%), Positives = 144/218 (65%), Gaps = 13/218 (5%)

Query: 245 LFGLGMEKISVPSMLSREGFT-ADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHP 303
           L GLGMEK+SVPS+L+  G   ++SFSMCF +DG+GRI+FGD GS DQ ETPF V  +H 
Sbjct: 9   LMGLGMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHS 68

Query: 304 TYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRR-----RP 358
            YNI+I  + VG   + + F A+ DSGTSFTYL DP Y+  + +F++Q+ +RR       
Sbjct: 69  YYNISITSMSVGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGST 128

Query: 359 PDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSE------LVYCL 412
                PF+YCY +SPD  T  +P +SLT  GG+ F V  P+  I+ Q        + YCL
Sbjct: 129 RSGPFPFEYCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCL 188

Query: 413 AVVKS-AELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           AV+KS   ++IIGQNFMTG +VVF+R K +LGW+K DC
Sbjct: 189 AVIKSDLPIDIIGQNFMTGLKVVFNREKSVLGWQKFDC 226


>UniRef100_Q94AV8 AT3g51330/F24M12_370 [Arabidopsis thaliana]
          Length = 260

 Score =  192 bits (489), Expect = 1e-47
 Identities = 89/184 (48%), Positives = 130/184 (70%), Gaps = 5/184 (2%)

Query: 271 MCFGR--DGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVEFTALFD 328
           MCFG   D +GRISFGDKG  DQ ETP       PTY +++ +V VG   + V+  ALFD
Sbjct: 1   MCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSVGGDAVGVQLLALFD 60

Query: 329 SGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSMSLTMG 388
           +GTSFT+L++P Y  ++++F   V D+RRP D  +PF++CYD+SP+  T L P +++T  
Sbjct: 61  TGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRVAMTFE 120

Query: 389 GGSRFVVYDPIIII-STQSELVYCLAVVKSAE--LNIIGQNFMTGYRVVFDRGKLILGWK 445
           GGS+  + +P+ I+ +  +  +YCL ++KS +  +NIIGQNFM+GYR+VFDR ++ILGWK
Sbjct: 121 GGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFDRERMILGWK 180

Query: 446 KSDC 449
           +SDC
Sbjct: 181 RSDC 184


>UniRef100_Q9M8R6 Putative aspartyl protease [Arabidopsis thaliana]
          Length = 488

 Score =  132 bits (331), Expect = 3e-29
 Identities = 109/375 (29%), Positives = 170/375 (45%), Gaps = 44/375 (11%)

Query: 100 LHYTTIELGTPGVKFMVALDTGSDLFWVPC-DCTRCSATRSSAFASALASDFDLSVYNPN 158
           L++  I LGTP   F V +DTGSD+ WV C  C RC   R S          +L+ Y+ +
Sbjct: 84  LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCP--RKSDLV-------ELTPYDVD 134

Query: 159 GSSTSKKVTCNNSLCTHRNQC--LGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDN 216
            SST+K V+C+++ C++ NQ     + S C Y++ Y    +ST+G LV+DV+HL     N
Sbjct: 135 ASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVIMYGDG-SSTNGYLVKDVVHLDLVTGN 193

Query: 217 HDLVEAN--VIFGCGQVQSGSFLDV-AAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF 273
                 N  +IFGCG  QSG   +  AA +G+ G G    S  S L+ +G    SF+ C 
Sbjct: 194 RQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCL 253

Query: 274 -GRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVEFTA------- 325
              +G G  + G+  S     TP     +H  Y++ +N + VG +++++   A       
Sbjct: 254 DNNNGGGIFAIGEVVSPKVKTTPMLSKSAH--YSVNLNAIEVGNSVLELSSNAFDSGDDK 311

Query: 326 --LFDSGTSFTYLVDPTYSRL-SESFHSQVE-DRRRPPDSRIPFDYCYDMSPDSNTSLIP 381
             + DSGT+  YL D  Y+ L +E   S  E       +S   F Y             P
Sbjct: 312 GVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESFTCFHYT------DKLDRFP 365

Query: 382 SMSLTMGGGSRFVVYDPIIIISTQSELVYC-------LAVVKSAELNIIGQNFMTGYRVV 434
           +++          VY    +   + +  +C       L     A L I+G   ++   VV
Sbjct: 366 TVTFQFDKSVSLAVYPREYLFQVRED-TWCFGWQNGGLQTKGGASLTILGDMALSNKLVV 424

Query: 435 FDRGKLILGWKKSDC 449
           +D    ++GW   +C
Sbjct: 425 YDIENQVIGWTNHNC 439


>UniRef100_Q5ZAZ8 Putative aspartic proteinase nepenthesin II [Oryza sativa]
          Length = 504

 Score =  129 bits (325), Expect = 1e-28
 Identities = 122/431 (28%), Positives = 193/431 (44%), Gaps = 54/431 (12%)

Query: 53  PEKG-SVEYYAELADRDRFLRGRR--LSQFDAGLAFSDGNSTFRISS-----LGFLHYTT 104
           P KG +VE+   L +RDR   GRR  L     G+A   G   F +       +  L++T 
Sbjct: 41  PHKGVAVEH---LRERDRARHGRRGLLGGGGGGVA---GVVDFPVEGSANPFMVGLYFTR 94

Query: 105 IELGTPGVKFMVALDTGSDLFWVPCD-CTRCSATRSSAFASALASDFDLSVYNPNGSSTS 163
           ++LG+P  ++ V +DTGSD+ WV C  CT C         S+   +  L  +NP+ SSTS
Sbjct: 95  VKLGSPPKEYFVQIDTGSDILWVACSPCTGC--------PSSSGLNIQLEFFNPDTSSTS 146

Query: 164 KKVTCNNSLCTHRNQC------LGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNH 217
            K+ C++  CT   Q           S C Y  +Y    + TSG  V D ++      N 
Sbjct: 147 SKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDG-SGTSGYYVSDTMYFDTVMGNE 205

Query: 218 DLV--EANVIFGCGQVQSGSFLDV-AAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF- 273
                 A+++FGC   QSG       A +G+FG G  ++SV S L+  G +   FS C  
Sbjct: 206 QTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLK 265

Query: 274 GRD-GIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVE---FT----- 324
           G D G G +  G+        TP  + PS P YN+ +  + V    + ++   FT     
Sbjct: 266 GSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQ 323

Query: 325 -ALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSM 383
             + DSGT+  YL D  Y     +  + V    R   S+   + C+ ++  S  S  P++
Sbjct: 324 GTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSK--GNQCF-VTSSSVDSSFPTV 380

Query: 384 SLTMGGGSRFVVYDPIIII---STQSELVYCLAVVKS--AELNIIGQNFMTGYRVVFDRG 438
           SL   GG    V     ++   S  + +++C+   ++   ++ I+G   +     V+D  
Sbjct: 381 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLA 440

Query: 439 KLILGWKKSDC 449
            + +GW   DC
Sbjct: 441 NMRMGWTDYDC 451


>UniRef100_Q9FPD6 Hypothetical protein At1g08210 [Arabidopsis thaliana]
          Length = 492

 Score =  127 bits (318), Expect = 1e-27
 Identities = 103/374 (27%), Positives = 171/374 (45%), Gaps = 40/374 (10%)

Query: 100 LHYTTIELGTPGVKFMVALDTGSDLFWVPC-DCTRCSATRSSAFASALASDFDLSVYNPN 158
           L+YT ++LGTP  +F V +DTGSD+ WV C  C  C  T              LS ++P 
Sbjct: 83  LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSE--------LQIQLSFFDPG 134

Query: 159 GSSTSKKVTCNNSLCTHRNQCLGTFSN---CPYMVSYVSAETSTSGILVEDVLHLTQPDD 215
            SS++  V+C++  C    Q     S    C Y   Y    + TSG  + D +       
Sbjct: 135 VSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDG-SGTSGYYISDFMSFDTVIT 193

Query: 216 NHDLVEANV--IFGCGQVQSGSF-LDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMC 272
           +   + ++   +FGC  +QSG       A +G+FGLG   +SV S L+ +G     FS C
Sbjct: 194 STLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHC 253

Query: 273 FGRD--GIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVE---FT--- 324
              D  G G +  G     D   TP  + PS P YN+ +  + V   ++ ++   FT   
Sbjct: 254 LKGDKSGGGIMVLGQIKRPDTVYTP--LVPSQPHYNVNLQSIAVNGQILPIDPSVFTIAT 311

Query: 325 ---ALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFD--YCYDMSPDSNTSL 379
               + D+GT+  YL D  YS   ++  + V    RP    I ++   C++++   +  +
Sbjct: 312 GDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRP----ITYESYQCFEITA-GDVDV 366

Query: 380 IPSMSLTMGGGSRFVVYDP--IIIISTQSELVYCLAVVKSA--ELNIIGQNFMTGYRVVF 435
            P +SL+  GG+  V+     + I S+    ++C+   + +   + I+G   +    VV+
Sbjct: 367 FPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVY 426

Query: 436 DRGKLILGWKKSDC 449
           D  +  +GW + DC
Sbjct: 427 DLVRQRIGWAEYDC 440


>UniRef100_Q9FG60 Aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  125 bits (314), Expect = 3e-27
 Identities = 112/417 (26%), Positives = 183/417 (43%), Gaps = 45/417 (10%)

Query: 56  GSVEYYAELADRDRFLRGRRLSQFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFM 115
           G  +  +EL   D F   R L+  D  L    G+S  R  S+G L++T I+LG+P  ++ 
Sbjct: 35  GKEKQLSELKSHDSFRHARMLANIDLPLG---GDS--RADSIG-LYFTKIKLGSPPKEYY 88

Query: 116 VALDTGSDLFWVPC-DCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCT 174
           V +DTGSD+ WV C  C +C                 LS+Y+   SSTSK V C +  C+
Sbjct: 89  VQVDTGSDILWVNCAPCPKCPVKTDLG--------IPLSLYDSKTSSTSKNVGCEDDFCS 140

Query: 175 H--RNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDL--VEANVIFGCGQ 230
              +++  G    C Y V Y    TS  G  ++D + L Q   N     +   V+FGCG+
Sbjct: 141 FIMQSETCGAKKPCSYHVVYGDGSTS-DGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGK 199

Query: 231 VQSGSFLDV-AAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF-GRDGIGRISFGDKGS 288
            QSG      +A +G+ G G    S+ S L+  G T   FS C    +G G  + G+  S
Sbjct: 200 NQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVGEVES 259

Query: 289 LDQDETPFNVNPSHPTYNITINQVRVGTTLIDV---------EFTALFDSGTSFTYLVDP 339
                TP   N  H  YN+ +  + V    ID+         +   + DSGT+  YL   
Sbjct: 260 PVVKTTPIVPNQVH--YNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQN 317

Query: 340 TYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPI 399
            Y+ L E   ++ + +            C+  + +++ +  P ++L      +  VY   
Sbjct: 318 LYNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKA-FPVVNLHFEDSLKLSVYPHD 373

Query: 400 IIISTQSELVYC-------LAVVKSAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
            + S + ++ YC       +     A++ ++G   ++   VV+D    ++GW   +C
Sbjct: 374 YLFSLREDM-YCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNC 429


  Database: uniref100
    Posted date:  Jan 5, 2005  1:24 AM
  Number of letters in database: 848,049,833
  Number of sequences in database:  2,790,947
  
Lambda     K      H
   0.323    0.137    0.425 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 825,545,956
Number of Sequences: 2790947
Number of extensions: 34691836
Number of successful extensions: 77910
Number of sequences better than 10.0: 606
Number of HSP's better than 10.0 without gapping: 213
Number of HSP's successfully gapped in prelim test: 393
Number of HSP's that attempted gapping in prelim test: 76828
Number of HSP's gapped (non-prelim): 879
length of query: 492
length of database: 848,049,833
effective HSP length: 131
effective length of query: 361
effective length of database: 482,435,776
effective search space: 174159315136
effective search space used: 174159315136
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 77 (34.3 bits)

