Miyakogusa Predicted Gene

Lj3g3v1855430.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v1855430.1 Non Chatacterized Hit- tr|I1KML0|I1KML0_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.41941 PE,25.65,0.000002,
,CUFF.43205.1
         (189 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G02550.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   196   7e-51
AT4G02550.3 | Symbols:  | unknown protein; BEST Arabidopsis thal...   196   1e-50
AT4G02550.4 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   170   6e-43
AT4G02550.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   170   6e-43
AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    49   3e-06
AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    49   3e-06

>AT4G02550.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes
           - 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
           LENGTH=307
          Length = 307

 Score =  196 bits (499), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 103/205 (50%), Positives = 139/205 (67%), Gaps = 17/205 (8%)

Query: 1   MIECDRVELWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDD--NHLVN 58
           MI+C+  ELW+RY+A +PDAK F GKQIEMY++L+ VCG+YQ PG++ KVK +  +HL +
Sbjct: 104 MIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQTPGKYNKVKKESSHHLND 163

Query: 59  MKNCDDESASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSS 118
           +K  +++S SF    SE  S+TDGTESY+   EY          P   PLR+  KR R+S
Sbjct: 164 VKQFEEDSVSFPLGSSEEHSDTDGTESYAGASEYMH-EESQDLPPPRDPLRRPSKRSRNS 222

Query: 119 DELQEALMTVASSIRRLADSMERSKCSIDAAELLQA--------------PFEYLNADPI 164
           D  QEA++ VASSIRRLAD++ +SK  I+  ELL+A               FEYLN DP+
Sbjct: 223 DPCQEAMLVVASSIRRLADAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPV 282

Query: 165 KARAFLTYNTRMRKIYMFKQFWWWR 189
           KARAF+ YN RMRK+++F+QFWWW+
Sbjct: 283 KARAFMAYNNRMRKMFLFRQFWWWK 307


>AT4G02550.3 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr4:1120622-1121674 REVERSE LENGTH=322
          Length = 322

 Score =  196 bits (497), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 103/205 (50%), Positives = 139/205 (67%), Gaps = 17/205 (8%)

Query: 1   MIECDRVELWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDD--NHLVN 58
           MI+C+  ELW+RY+A +PDAK F GKQIEMY++L+ VCG+YQ PG++ KVK +  +HL +
Sbjct: 119 MIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQTPGKYNKVKKESSHHLND 178

Query: 59  MKNCDDESASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSS 118
           +K  +++S SF    SE  S+TDGTESY+   EY          P   PLR+  KR R+S
Sbjct: 179 VKQFEEDSVSFPLGSSEEHSDTDGTESYAGASEYMH-EESQDLPPPRDPLRRPSKRSRNS 237

Query: 119 DELQEALMTVASSIRRLADSMERSKCSIDAAELLQA--------------PFEYLNADPI 164
           D  QEA++ VASSIRRLAD++ +SK  I+  ELL+A               FEYLN DP+
Sbjct: 238 DPCQEAMLVVASSIRRLADAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPV 297

Query: 165 KARAFLTYNTRMRKIYMFKQFWWWR 189
           KARAF+ YN RMRK+++F+QFWWW+
Sbjct: 298 KARAFMAYNNRMRKMFLFRQFWWWK 322


>AT4G02550.4 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 18 plant
           structures; EXPRESSED DURING: 7 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2). | chr4:1120622-1121629 REVERSE
           LENGTH=278
          Length = 278

 Score =  170 bits (430), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 94/203 (46%), Positives = 121/203 (59%), Gaps = 42/203 (20%)

Query: 1   MIECDRVELWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDDNHLVNMK 60
           MI+C+  ELW+RY+A +PDAK F GKQIEMY++L+ VCG+YQ PG               
Sbjct: 104 MIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQTPGS-------------- 149

Query: 61  NCDDESASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSSDE 120
                        SE  S+TDGTESY+   EY          P   PLR+  KR R+SD 
Sbjct: 150 -------------SEEHSDTDGTESYAGASEYMH-EESQDLPPPRDPLRRPSKRSRNSDP 195

Query: 121 LQEALMTVASSIRRLADSMERSKCSIDAAELLQA--------------PFEYLNADPIKA 166
            QEA++ VASSIRRLAD++ +SK  I+  ELL+A               FEYLN DP+KA
Sbjct: 196 CQEAMLVVASSIRRLADAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPVKA 255

Query: 167 RAFLTYNTRMRKIYMFKQFWWWR 189
           RAF+ YN RMRK+++F+QFWWW+
Sbjct: 256 RAFMAYNNRMRKMFLFRQFWWWK 278


>AT4G02550.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes
           - 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
           LENGTH=278
          Length = 278

 Score =  170 bits (430), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 94/203 (46%), Positives = 121/203 (59%), Gaps = 42/203 (20%)

Query: 1   MIECDRVELWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDDNHLVNMK 60
           MI+C+  ELW+RY+A +PDAK F GKQIEMY++L+ VCG+YQ PG               
Sbjct: 104 MIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQTPGS-------------- 149

Query: 61  NCDDESASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSSDE 120
                        SE  S+TDGTESY+   EY          P   PLR+  KR R+SD 
Sbjct: 150 -------------SEEHSDTDGTESYAGASEYMH-EESQDLPPPRDPLRRPSKRSRNSDP 195

Query: 121 LQEALMTVASSIRRLADSMERSKCSIDAAELLQA--------------PFEYLNADPIKA 166
            QEA++ VASSIRRLAD++ +SK  I+  ELL+A               FEYLN DP+KA
Sbjct: 196 CQEAMLVVASSIRRLADAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPVKA 255

Query: 167 RAFLTYNTRMRKIYMFKQFWWWR 189
           RAF+ YN RMRK+++F+QFWWW+
Sbjct: 256 RAFMAYNNRMRKMFLFRQFWWWK 278


>AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 48.5 bits (114), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/183 (21%), Positives = 81/183 (44%), Gaps = 29/183 (15%)

Query: 9   LWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDDNHLVNMKNCDDES-- 66
           +W+ Y+ AH DA+ F  + I  Y  L ++CG+       + ++++   V M   D E+  
Sbjct: 275 VWQDYIKAHRDARQFMTRPIPYYKDLCVLCGD-------SGIEENECFVAMDWFDPETEF 327

Query: 67  ----ASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSSDELQ 122
               +S  + +S +  E D      +P            +P+       PK+PR  +   
Sbjct: 328 QEFKSSGTTDLSISAEEEDSNSLLFDPKNKRDQLANTDTSPIN------PKKPRVDETQT 381

Query: 123 EALMTVASSIRRLADSMERSKCSIDAAELLQAPFEYLNADPIKARAFLTYNTRMRKIYMF 182
            ++     +I+ L D  +  +  +DA +LL+        D +KA+ FL  + ++RK ++ 
Sbjct: 382 MSIEDTVEAIQALPDMDD--ELILDACDLLE--------DKLKAKTFLALDVKLRKKWLL 431

Query: 183 KQF 185
           ++ 
Sbjct: 432 RKL 434


>AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
           - 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 48.5 bits (114), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/183 (21%), Positives = 81/183 (44%), Gaps = 29/183 (15%)

Query: 9   LWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDDNHLVNMKNCDDES-- 66
           +W+ Y+ AH DA+ F  + I  Y  L ++CG+       + ++++   V M   D E+  
Sbjct: 275 VWQDYIKAHRDARQFMTRPIPYYKDLCVLCGD-------SGIEENECFVAMDWFDPETEF 327

Query: 67  ----ASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSSDELQ 122
               +S  + +S +  E D      +P            +P+       PK+PR  +   
Sbjct: 328 QEFKSSGTTDLSISAEEEDSNSLLFDPKNKRDQLANTDTSPIN------PKKPRVDETQT 381

Query: 123 EALMTVASSIRRLADSMERSKCSIDAAELLQAPFEYLNADPIKARAFLTYNTRMRKIYMF 182
            ++     +I+ L D  +  +  +DA +LL+        D +KA+ FL  + ++RK ++ 
Sbjct: 382 MSIEDTVEAIQALPDMDD--ELILDACDLLE--------DKLKAKTFLALDVKLRKKWLL 431

Query: 183 KQF 185
           ++ 
Sbjct: 432 RKL 434