Miyakogusa Predicted Gene

Lj3g3v2364200.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v2364200.1 Non Characterized Hit- tr|K3YDP0|K3YDP0_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si012345,28.57,1e-18,seg,NULL; DUF1191,Protein of unknown function
DUF1191,NODE_43185_length_1595_cov_18.417555.path1.1
         (241 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G23720.1 | Symbols:  | Protein of unknown function (DUF1191) ...   104   5e-23
AT4G01140.1 | Symbols:  | Protein of unknown function (DUF1191) ...    98   5e-21
AT3G08600.1 | Symbols:  | Protein of unknown function (DUF1191) ...    97   8e-21
AT4G22900.1 | Symbols:  | Protein of unknown function (DUF1191) ...    74   1e-13
AT1G62981.2 | Symbols:  | Protein of unknown function (DUF1191) ...    70   1e-12
AT1G62981.1 | Symbols:  | Protein of unknown function (DUF1191) ...    70   1e-12
AT4G11950.1 | Symbols:  | Protein of unknown function (DUF1191) ...    70   1e-12
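
The summary table above can be read programmatically. The following is a minimal sketch, assuming each hit row ends with two whitespace-separated fields (bit score, then E value) and begins with the subject identifier; the function name parse_hit_rows is hypothetical and not part of BLAST or any library.

```python
def parse_hit_rows(lines):
    """Extract (subject_id, bit_score, e_value) from BLAST summary rows.

    Assumption: a hit row contains '|' and its last two whitespace-separated
    fields are the bit score and the E value, as in the table above.
    Lines that do not fit this shape are skipped.
    """
    hits = []
    for line in lines:
        if "|" not in line:
            continue
        parts = line.rsplit(None, 2)  # split off the last two fields
        if len(parts) != 3:
            continue
        description, bits_text, evalue_text = parts
        # Some legacy reports abbreviate very small E values as "e-100";
        # prefixing "1" makes such a value parseable (an assumption here).
        if evalue_text.startswith("e"):
            evalue_text = "1" + evalue_text
        try:
            bits = float(bits_text)
            evalue = float(evalue_text)
        except ValueError:
            continue  # e.g. an alignment header line, not a summary row
        subject_id = description.split()[0]
        hits.append((subject_id, bits, evalue))
    return hits


example_rows = [
    "AT4G23720.1 | Symbols:  | Protein of unknown function (DUF1191) ...   104   5e-23",
    "AT4G01140.1 | Symbols:  | Protein of unknown function (DUF1191) ...    98   5e-21",
    ">AT4G23720.1 | Symbols:  | Protein of unknown function (DUF1191) |",
]
parsed_hits = parse_hit_rows(example_rows)
```

Splitting from the right avoids depending on the free-text description, which may itself contain digits or extra spaces.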

>AT4G23720.1 | Symbols:  | Protein of unknown function (DUF1191) |
           chr4:12358707-12359648 FORWARD LENGTH=313
          Length = 313

 Score =  104 bits (260), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 69/246 (28%), Positives = 117/246 (47%), Gaps = 27/246 (10%)

Query: 1   MEIAVIRLRTDSFWLRGVKHSFLNFPPRVVPQPNRERMAIVYENLGNWSSHYYNVPNHTM 60
           +E++V+RL   S W  G K S +  P R V  P   R+ IVY+NLGNWS+H+Y VP + +
Sbjct: 78  IEVSVVRLTGKSLWNSGAKFSNVLIPERSVSVPPARRVVIVYQNLGNWSNHWYTVPGYRL 137

Query: 61  VAPVFGFRAYTSSEKALISTEKMDLIIEGDPITIQFHHVGPHEKNNSPI----CAKFGAG 116
           +  V GF+    S++  +   K  ++   +P+ + F  + P E++   +    C  F A 
Sbjct: 138 ITSVLGFKVLDVSDQDNV---KEIILKMKNPVEVSFRDL-PKERDEEMLSRVRCVSFKAQ 193

Query: 117 GSVE----FNNMTKPYVCEAETPGHYTLVXXXXXXXXXXXXXXXXQSQSKGFNTW---WV 169
              E     + M  P VC   + G Y+++                +   + ++TW   W+
Sbjct: 194 TKDEEATHISRMVIPGVCYGSSHGDYSVI----------EPLENDKKNVESWSTWWWLWI 243

Query: 170 LGFVIGSXXXXXXXXXXXXXXKEAKRRRIRKMEKISEGGEPFDTFWIGDTKLPLAPMIRT 229
           +GFV+G               +  K + +  ME+ +  GE F++ W G +K+P A + RT
Sbjct: 244 VGFVLGFGLLGFLCTMGIRVSRAKKIQVM--MERDANDGEVFESRWFGGSKMPSAAVTRT 301

Query: 230 QPVLEN 235
            P LE+
Sbjct: 302 LPELES 307


>AT4G01140.1 | Symbols:  | Protein of unknown function (DUF1191) |
           chr4:491012-491932 REVERSE LENGTH=306
          Length = 306

 Score = 97.8 bits (242), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 73/241 (30%), Positives = 116/241 (48%), Gaps = 19/241 (7%)

Query: 1   MEIAVIRLRTDSFWLRGVKHSFLNFPPRVVPQPNRERMAIVYENLG-NWSSHYYNVP-NH 58
           ++ +V+ +R   FW +G   S +  PP V   P  +R+A V+E+ G N SS Y+ +  N+
Sbjct: 66  IKASVVTVRNSIFWRKGTNFSGVLIPPMVKTSPYAKRIAFVFESFGDNSSSVYFRLADNY 125

Query: 59  TMVAPVFGFRAYTSSEKALISTEKMDLIIEGD-PITIQFHHVGPH-EKNNSPI-CAKFGA 115
           + V+PV GF  Y ++       +K++L I+ D PI I+F    PH  ++ S + C  FG 
Sbjct: 126 SFVSPVIGFTGYDATNTN--DLKKLNLSIKRDKPILIKF---DPHASRDRSKVKCIVFGD 180

Query: 116 GGSV-EFNNMTKPYVCEA-ETPGHYTLVXXXXXXXXXXXXXXXXQSQSKGFNTWWVLGFV 173
            G +   +N  + Y C    + GHY LV                +      N WW++  +
Sbjct: 181 NGLLLNISNTIRNYECATTNSHGHYALVVLNQEKVKPKHEPVLVRR-----NWWWIV--L 233

Query: 174 IGSXXXXXXXXXXXXXXKEAKRRRIRKMEKISEGGEPFDTFWIGDTKLPLAPMIRTQPVL 233
            G               K  +++R+R ME+ SE  E     WIG +++P A M+RTQP L
Sbjct: 234 TGIGVSVIVVVVIIVSVKLVRKKRLRDMERESEKSETIGNVWIGRSRMPAATMVRTQPCL 293

Query: 234 E 234
           E
Sbjct: 294 E 294


>AT3G08600.1 | Symbols:  | Protein of unknown function (DUF1191) |
           chr3:2612646-2613596 FORWARD LENGTH=316
          Length = 316

 Score = 97.4 bits (241), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 120/247 (48%), Gaps = 8/247 (3%)

Query: 1   MEIAVIRLRTDSFWLRGVK-HSFLNFPPRVVPQPNRERMAIVYENLGNWSSHYYNVPNHT 59
           +++A +RLR+ SF  RGV   +  + P  V+ +P   R+ +VY+NL N+S  YY +  + 
Sbjct: 70  IKLAAMRLRSGSFRKRGVTPFNEFSIPSGVIVKPYVTRLVLVYQNLANFSHLYYPLSGYD 129

Query: 60  MVAPVFGFRAYTSSEKALISTEKMDLIIEGDPITIQFHHVGPHEKNNSPICAKFGAGGSV 119
            VAPV G  AY +   + ++  ++DL +  DPI I F  +    + +S  C +F + G  
Sbjct: 130 YVAPVLGLLAYDAKNLSALNLPQLDLRVSNDPIRIDFSDLERIPQGSSAKCVRFDSKGEA 189

Query: 120 EFNNMTKP-YVCEAETPGHYTLVXXXXXXXXXXX-----XXXXXQSQSKGFNTWWVLGFV 173
            F++  +P   CE E  GH+++V                     +S      TW ++G V
Sbjct: 190 SFSDSIQPGNTCETEHQGHFSVVVKSVASAPSLAPPGIESKKKKKSSDSNSKTWIIVGSV 249

Query: 174 IGSXXXXXXXXXXXXXXKEAKRR-RIRKMEKISEGGEPFDTFWIGDTKLPLAPMIRTQPV 232
           +G               +  K++ ++R+ME+  E GE      +G+T+ P A   RTQP+
Sbjct: 250 VGGLILLGLLLFLVLRCRNYKKQEKMREMERAGETGEALRMTQVGETRAPTATTTRTQPM 309

Query: 233 LENDDAV 239
           LE + A 
Sbjct: 310 LETEYAA 316


>AT4G22900.1 | Symbols:  | Protein of unknown function (DUF1191) |
           chr4:12010221-12011252 FORWARD LENGTH=343
          Length = 343

 Score = 73.6 bits (179), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 69/267 (25%), Positives = 111/267 (41%), Gaps = 31/267 (11%)

Query: 1   MEIAVIRLRTDSFWLRGVKHSFLNFPPRVVPQPNRERMAIVYENLG-NWSSHY---YNVP 56
           ++I  ++LR  S    G K    +    +  +P  ER+ ++ +N G NWSS Y   YN+ 
Sbjct: 64  IDIDTVKLRCGSLRRYGAKIGEFHIGSGLTVEPCPERVMLIRQNFGSNWSSIYSTGYNLS 123

Query: 57  --NHTMVAPVFGFRAYTSSEKALIST--EKMDLIIEGDPITIQF--------HHVGPHEK 104
             N+ +V+PV G  AY ++   +     E   +  + +PI I F            P +K
Sbjct: 124 GYNYKLVSPVLGLLAYNANPDGVARNPYEVNVVGTDQNPILIDFLINKATNNTSPNPTKK 183

Query: 105 NNSPICAKFGAGGSVEFNNMTKPYVCEAETPGHYTLVXXXXXXX------XXXXXXXXXQ 158
           N+S +CA F +  +  F+    PYVC+    GHY LV                       
Sbjct: 184 NSSVLCACFTSNSNTTFSEQVSPYVCKGTRQGHYALVMKTEAQKDDHEGGGSSGGVVASS 243

Query: 159 SQSKGFN------TWWV-LGFVIGS-XXXXXXXXXXXXXXKEAKRRRIR-KMEKISEGGE 209
           ++  G N       W V +G VIGS                + K++ +R +ME+ +   E
Sbjct: 244 TEVNGGNGGGKLSRWKVAVGSVIGSGIGAILLGMLVVAMLVKGKKKAMREEMERRAYEEE 303

Query: 210 PFDTFWIGDTKLPLAPMIRTQPVLEND 236
                 +G  + P AP  RT P + +D
Sbjct: 304 ALQVSMVGHVRAPTAPGTRTLPRISDD 330


>AT1G62981.2 | Symbols:  | Protein of unknown function (DUF1191) |
           chr1:23333793-23334824 FORWARD LENGTH=343
          Length = 343

 Score = 70.1 bits (170), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 78/157 (49%), Gaps = 16/157 (10%)

Query: 1   MEIAVIRLRTDSFWLRGVKHSFLNFPPRVVPQPNRERMAIVYENLGN-WSSHYY---NVP 56
           +++  +R R  S    G K    N     + +P  ER+ +V ++LG+ WS  YY   ++ 
Sbjct: 82  IKLDAVRFRCGSLRRYGAKIEEFNIGVGAILEPCGERLLVVRQSLGSKWSDIYYKNYDLS 141

Query: 57  NHTMVAPVFGFRAYTSSEKALI-----STEKMDLIIE--GDPITIQFHHV-GPH--EKN- 105
            + +V+PV G  AY +    ++     S+ ++ L++    DP  + F +V GP   E+  
Sbjct: 142 GYRLVSPVLGLLAYNALNDVVLGNNVSSSYQISLLLARTKDPSNVDFGNVSGPSVVERTF 201

Query: 106 -NSPICAKFGAGGSVEFNNMTKPYVCEAETPGHYTLV 141
            N P+CA F   G V      KP+VC  +T GH+ LV
Sbjct: 202 LNKPMCATFELDGKVTLAAEVKPFVCAVKTNGHFGLV 238


>AT1G62981.1 | Symbols:  | Protein of unknown function (DUF1191) |
           chr1:23333793-23334824 FORWARD LENGTH=343
          Length = 343

 Score = 70.1 bits (170), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 78/157 (49%), Gaps = 16/157 (10%)

Query: 1   MEIAVIRLRTDSFWLRGVKHSFLNFPPRVVPQPNRERMAIVYENLGN-WSSHYY---NVP 56
           +++  +R R  S    G K    N     + +P  ER+ +V ++LG+ WS  YY   ++ 
Sbjct: 82  IKLDAVRFRCGSLRRYGAKIEEFNIGVGAILEPCGERLLVVRQSLGSKWSDIYYKNYDLS 141

Query: 57  NHTMVAPVFGFRAYTSSEKALI-----STEKMDLIIE--GDPITIQFHHV-GPH--EKN- 105
            + +V+PV G  AY +    ++     S+ ++ L++    DP  + F +V GP   E+  
Sbjct: 142 GYRLVSPVLGLLAYNALNDVVLGNNVSSSYQISLLLARTKDPSNVDFGNVSGPSVVERTF 201

Query: 106 -NSPICAKFGAGGSVEFNNMTKPYVCEAETPGHYTLV 141
            N P+CA F   G V      KP+VC  +T GH+ LV
Sbjct: 202 LNKPMCATFELDGKVTLAAEVKPFVCAVKTNGHFGLV 238


>AT4G11950.1 | Symbols:  | Protein of unknown function (DUF1191) |
           chr4:7173276-7174259 REVERSE LENGTH=327
          Length = 327

 Score = 69.7 bits (169), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 78/157 (49%), Gaps = 16/157 (10%)

Query: 1   MEIAVIRLRTDSFWLRGVKHSFLNFPPRVVPQPNRERMAIVYENLG-NWSSHYYNVP--- 56
           ++IA  + R  S    G +    +  P +  +P  ER+ +V +NLG NWSS+ Y+     
Sbjct: 62  IDIATAKFRCGSLRRHGARIGEFHLGPGLTVEPCVERVILVRQNLGFNWSSYIYSTGYNL 121

Query: 57  ---NHTMVAPVFGFRAYTSS-EKALISTEKMDLI-IEGDPITIQFHHVGPH-------EK 104
               + +V+PV G  AY S+ +   ++  +++++  E +PI I+F             +K
Sbjct: 122 TGYKYRLVSPVLGLLAYNSNPDGVAVNPYEVNVMGTEQNPILIKFLSSEASGSPKPNTKK 181

Query: 105 NNSPICAKFGAGGSVEFNNMTKPYVCEAETPGHYTLV 141
           N+S +CA F + G++ F      YVC     GHY LV
Sbjct: 182 NSSVLCACFTSNGNITFREQVSAYVCLGTRQGHYALV 218