Miyakogusa Predicted Gene

Lj6g3v1880260.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v1880260.1 Non Characterized Hit- tr|K4AXN7|K4AXN7_SOLLC
Uncharacterized protein OS=Solanum lycopersicum
GN=Sol,41.73,0.000000000000009,seg,NULL; Acid proteases,Peptidase
aspartic; no description,Peptidase aspartic, catalytic; BASIC 7S
,NODE_43012_length_1315_cov_14.914829.path1.1
         (215 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G03220.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   142   1e-34
AT1G03230.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   136   9e-33
AT5G48430.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   115   2e-26
AT5G19100.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   115   2e-26
AT5G19110.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   113   1e-25
AT5G19120.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    80   1e-15
AT1G25510.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    55   2e-08
AT3G25700.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    55   5e-08
AT3G59080.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    52   4e-07
AT3G59080.2 | Symbols:  | Eukaryotic aspartyl protease family pr...    52   4e-07
AT3G18490.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    49   4e-06

>AT1G03220.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:787143-788444 FORWARD LENGTH=433
          Length = 433

 Score =  142 bits (359), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 78/183 (42%), Positives = 111/183 (60%), Gaps = 7/183 (3%)

Query: 12  EFQTTPLIVNPVATGAVTIPGGASQEYFIDVKSVLINGNVLNLKPSMLSID-KKGNGGTK 70
             QTTPL++NPV+T +    G  S EYFI V ++ I    + + P++L I+   G GGTK
Sbjct: 228 SLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIQIVEKTVPINPTLLKINASTGIGGTK 287

Query: 71  ISTISAFTELQSSVYRIFIREYLKAASDSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPT 130
           IS+++ +T L+SS+Y  F  E++K A+   +KRVA+V PF AC+ +  +  T  G  VP 
Sbjct: 288 ISSVNPYTVLESSIYNAFTSEFVKQAAARSIKRVASVKPFGACFSTKNVGVTRLGYAVPE 347

Query: 131 IDLVMQG-GAQGKILGANAMVMVKKNVACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLL 189
           I+LV+       +I GAN+MV V  +V CL  VDGG   R      S+V+G  QLEDNL+
Sbjct: 348 IELVLHSKDVVWRIFGANSMVSVSDDVICLGFVDGGVNAR-----TSVVIGGFQLEDNLI 402

Query: 190 VFD 192
            FD
Sbjct: 403 EFD 405


>AT1G03230.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:790110-791414 FORWARD LENGTH=434
          Length = 434

 Score =  136 bits (343), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 77/183 (42%), Positives = 108/183 (59%), Gaps = 7/183 (3%)

Query: 12  EFQTTPLIVNPVATGAVTIPGGASQEYFIDVKSVLINGNVLNLKPSMLSID-KKGNGGTK 70
             Q TPL++NP  T      G  S EYFI V ++ I    L + P++L I+   G GGTK
Sbjct: 229 RLQKTPLLINPGTTVFEFSKGEKSPEYFIGVTAIKIVEKTLPIDPTLLKINASTGIGGTK 288

Query: 71  ISTISAFTELQSSVYRIFIREYLKAASDSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPT 130
           IS+++ +T L+SS+Y+ F  E+++ A+   +KRVA+V PF AC+ +  +  T  G  VP 
Sbjct: 289 ISSVNPYTVLESSIYKAFTSEFIRQAAARSIKRVASVKPFGACFSTKNVGVTRLGYAVPE 348

Query: 131 IDLVMQG-GAQGKILGANAMVMVKKNVACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLL 189
           I LV+       +I GAN+MV V  +V CL  VDGG  P      AS+V+G  QLEDNL+
Sbjct: 349 IQLVLHSKDVVWRIFGANSMVSVSDDVICLGFVDGGVNP-----GASVVIGGFQLEDNLI 403

Query: 190 VFD 192
            FD
Sbjct: 404 EFD 406


>AT5G48430.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:19627892-19629112 REVERSE LENGTH=406
          Length = 406

 Score =  115 bits (289), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 61/157 (38%), Positives = 91/157 (57%), Gaps = 13/157 (8%)

Query: 36  QEYFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREYLKA 95
             YF+ +K + +NGN +   P+  + D+ G+GG  +STI  FT L+S +YR+FI  + +A
Sbjct: 237 NNYFLGLKGISVNGNRILFAPNAFAFDRNGDGGVTLSTIFPFTMLRSDIYRVFIEAFSQA 296

Query: 96  ASDSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGGAQGKILGANAMVMVKKN 155
            S   + RV++  PFE C  +TT F       VP IDL +  G   K+  ANAM  V  +
Sbjct: 297 TSG--IPRVSSTTPFEFCLSTTTNF------QVPRIDLELANGVIWKLSPANAMKKVSDD 348

Query: 156 VACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
           VACLA V+GG      A   ++++G HQ+E+ L+ FD
Sbjct: 349 VACLAFVNGG-----DAAAQAVMIGIHQMENTLVEFD 380


>AT5G19100.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:6408242-6409417 REVERSE LENGTH=391
          Length = 391

 Score =  115 bits (288), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 70/180 (38%), Positives = 97/180 (53%), Gaps = 35/180 (19%)

Query: 13  FQTTPLIVNPVATGAVTIPGGASQEYFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKIS 72
           F +TPLI N           G S EY IDVKS+ I    + +            G TKIS
Sbjct: 220 FASTPLIGN-----------GKSGEYLIDVKSIQIGAKTVPIP----------YGATKIS 258

Query: 73  TISAFTELQSSVYRIFIREYLKAASDSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTID 132
           T++ +T  Q+S+Y+  +  + +   + K+ +  AV PF AC+ S        G  VP ID
Sbjct: 259 TLAPYTVFQTSLYKALLTAFTE---NIKIAKAPAVKPFGACFYSN------GGRGVPVID 309

Query: 133 LVMQGGAQGKILGANAMVMVKKNVACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
           LV+ GGA+ +I G+N++V V KNV CL  VDGG +P     K  IV+G  Q+EDNL+ FD
Sbjct: 310 LVLSGGAKWRIYGSNSLVKVNKNVVCLGFVDGGVKP-----KYPIVIGGFQMEDNLVEFD 364


>AT5G19110.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:6411720-6413170 REVERSE LENGTH=405
          Length = 405

 Score =  113 bits (282), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/176 (37%), Positives = 96/176 (54%), Gaps = 15/176 (8%)

Query: 21  NPVATGAVTIPGGASQEYFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTEL 80
           NP+      I G  S +Y I VKS+ + G  L L P +L+      GG K+ST+  +T L
Sbjct: 215 NPIPRTLTPIKGTDSGDYLITVKSIYVGGTALKLNPDLLT------GGAKLSTVVHYTVL 268

Query: 81  QSSVYRIFIREYLKAASDSKLKRVAAVAPFEACYDSTTIFNTL-AGLNVPTIDLVMQGG- 138
           Q+ +Y    + +   A    + +V +VAPF+ C+DS T    L AG NVP I++ + G  
Sbjct: 269 QTDIYNALAQSFTLKAKAMGIAKVPSVAPFKHCFDSRTAGKNLTAGPNVPVIEIGLPGRI 328

Query: 139 --AQGKILGANAMVMVKKNVACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
              +    GAN +V VK+ V CLA +DGG  P     K  +V+G HQL+D++L FD
Sbjct: 329 GEVKWGFYGANTVVKVKETVMCLAFIDGGKTP-----KDLMVIGTHQLQDHMLEFD 379


>AT5G19120.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:6414585-6415745 FORWARD LENGTH=386
          Length = 386

 Score = 79.7 bits (195), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 58/161 (36%), Positives = 85/161 (52%), Gaps = 25/161 (15%)

Query: 33  GASQEYFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREY 92
           G+S  Y I+VKS+ +NG  L+++         G    ++ST+  +T L+SS+Y++F   Y
Sbjct: 231 GSSGNYVINVKSIRVNGEKLSVE---------GPLAVELSTVVPYTILESSIYKVFAEAY 281

Query: 93  LKAASDSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGG-AQGKILGANAMVM 151
            KAA ++    V  VAPF  C+ S   F        P +DL +Q    + +I G N MV 
Sbjct: 282 AKAAGEA--TSVPPVAPFGLCFTSDVDF--------PAVDLALQSEMVRWRIHGKNLMVD 331

Query: 152 VKKNVACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
           V   V C  IVDGG+  R++     IV+G  QLE  +L FD
Sbjct: 332 VGGGVRCSGIVDGGSS-RVNP----IVMGGLQLEGFILDFD 367


>AT1G25510.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:8959372-8960823 REVERSE LENGTH=483
          Length = 483

 Score = 55.5 bits (132), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/156 (25%), Positives = 71/156 (45%), Gaps = 15/156 (9%)

Query: 38  YFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREYLKAAS 97
           Y++ +  + + G +L +  S   +D+ G+GG  I + +A T LQ+ +Y      ++K   
Sbjct: 329 YYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTL 388

Query: 98  DSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGGAQGKILGANAMVMVKK-NV 156
           D  L++ A VA F+ CY+     +    + VPT+     GG    +   N M+ V     
Sbjct: 389 D--LEKAAGVAMFDTCYN----LSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGT 442

Query: 157 ACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
            CLA       P  S++    ++G  Q +   + FD
Sbjct: 443 FCLAFA-----PTASSLA---IIGNVQQQGTRVTFD 470


>AT3G25700.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:9358937-9360295 FORWARD LENGTH=452
          Length = 452

 Score = 54.7 bits (130), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 65/147 (44%), Gaps = 16/147 (10%)

Query: 16  TPLIVNPVATGAVTIPGGASQEYFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTIS 75
           TPL+ NP++             Y++ +KSV +NG  L + PS+  ID  GNGGT + + +
Sbjct: 280 TPLLTNPLSP----------TFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGT 329

Query: 76  AFTELQSSVYRIFIREYLKAASDSKLKRVAAVAP-FEACYDSTTIFNTLAGLNVPTIDLV 134
               L    YR  I    +     KL    A+ P F+ C + + +  T     +P +   
Sbjct: 330 TLAFLAEPAYRSVIAAVRRRV---KLPIADALTPGFDLCVNVSGV--TKPEKILPRLKFE 384

Query: 135 MQGGAQGKILGANAMVMVKKNVACLAI 161
             GGA       N  +  ++ + CLAI
Sbjct: 385 FSGGAVFVPPPRNYFIETEEQIQCLAI 411


>AT3G59080.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:21836812-21838419 FORWARD LENGTH=535
          Length = 535

 Score = 51.6 bits (122), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/155 (22%), Positives = 73/155 (47%), Gaps = 12/155 (7%)

Query: 38  YFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREYLKAAS 97
           Y++ +KS+L+ G VLN+     +I   G GGT I + +  +      Y  FI+  +   +
Sbjct: 377 YYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYE-FIKNKIAEKA 435

Query: 98  DSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGGAQGKILGANAMVMVKKNVA 157
             K          + C++ + I N    + +P + +    GA       N+ + + +++ 
Sbjct: 436 KGKYPVYRDFPILDPCFNVSGIHN----VQLPELGIAFADGAVWNFPTENSFIWLNEDLV 491

Query: 158 CLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
           CLA++  GT P+     A  ++G +Q ++  +++D
Sbjct: 492 CLAML--GT-PK----SAFSIIGNYQQQNFHILYD 519


>AT3G59080.2 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:21836812-21838419 FORWARD LENGTH=499
          Length = 499

 Score = 51.6 bits (122), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/155 (22%), Positives = 73/155 (47%), Gaps = 12/155 (7%)

Query: 38  YFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREYLKAAS 97
           Y++ +KS+L+ G VLN+     +I   G GGT I + +  +      Y  FI+  +   +
Sbjct: 341 YYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYE-FIKNKIAEKA 399

Query: 98  DSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGGAQGKILGANAMVMVKKNVA 157
             K          + C++ + I N    + +P + +    GA       N+ + + +++ 
Sbjct: 400 KGKYPVYRDFPILDPCFNVSGIHN----VQLPELGIAFADGAVWNFPTENSFIWLNEDLV 455

Query: 158 CLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
           CLA++  GT P+     A  ++G +Q ++  +++D
Sbjct: 456 CLAML--GT-PK----SAFSIIGNYQQQNFHILYD 483


>AT3G18490.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:6349090-6350592 REVERSE LENGTH=500
          Length = 500

 Score = 48.5 bits (114), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/156 (23%), Positives = 71/156 (45%), Gaps = 14/156 (8%)

Query: 38  YFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREYLKAAS 97
           Y++ +    + G  + L  ++  +D  G+GG  +   +A T LQ+  Y      +LK   
Sbjct: 345 YYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTV 404

Query: 98  DSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGGAQGKILGANAMVMVKKN-V 156
           + K K  ++++ F+ CYD    F++L+ + VPT+     GG    +   N ++ V  +  
Sbjct: 405 NLK-KGSSSISLFDTCYD----FSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGT 459

Query: 157 ACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
            C A       P  S++    ++G  Q +   + +D
Sbjct: 460 FCFAFA-----PTSSSLS---IIGNVQQQGTRITYD 487