Miyakogusa Predicted Gene

Lj5g3v1326740.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v1326740.1 Non Chatacterized Hit- tr|I1MJ02|I1MJ02_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,64,3e-18,seg,NULL,CUFF.55533.1
         (446 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G04550.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...   350   9e-97
AT5G28500.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   340   9e-94
AT5G28500.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   221   1e-57

>AT3G04550.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast
           stroma, chloroplast; EXPRESSED IN: 22 plant structures;
           EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G28500.1); Has 110 Blast hits to 110 proteins
           in 51 species: Archae - 0; Bacteria - 67; Metazoa - 1;
           Fungi - 0; Plants - 41; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr3:1225961-1227310 FORWARD
           LENGTH=449
          Length = 449

 Score =  350 bits (899), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 198/384 (51%), Positives = 242/384 (63%), Gaps = 22/384 (5%)

Query: 70  TLDIPGQIDILANRLGVWHEYAPLISSHIREGFTPPTIEELTGITGVEQNRFIVAAQVRD 129
           +LD  G+I+ILA R+ +W EYAPLISS   +GFTPPTIEELTGI+ +EQNR IV AQVRD
Sbjct: 81  SLDSAGKIEILAGRMALWFEYAPLISSLYTDGFTPPTIEELTGISSIEQNRLIVGAQVRD 140

Query: 130 SLLQSNTDPDLVSFFDNGGAELLYQIXXXXXXXXXXXXXFIVENKSDGKGANELARSMKD 189
           S+LQS  +P+L+S FD GGAELLY+I             FI++   D KGA +LAR++KD
Sbjct: 141 SILQSIHEPELISAFDTGGAELLYEIRLLSTTQRVAAATFIIDRNIDSKGAQDLARAIKD 200

Query: 190 FPSRRGEKGWESFDYNLPGDCLSFMYYRLSREYNRNP-EQRDSVLEQALRVAESEKARDA 248
           +P+RRG+ GW  FDYNLPGDCLSF+YYR SRE N+NP +QR S+L QAL VAESEKA++ 
Sbjct: 201 YPNRRGDVGWLDFDYNLPGDCLSFLYYRQSRE-NKNPSDQRTSMLLQALGVAESEKAKNR 259

Query: 249 IQKEL----KXXXXXXXXXXXXXXXXXXXXXXRLKIGEXXXXXXXXXLPVCNADEGGDAI 304
           +  EL    +                      RLK GE         LPVC A+EG   I
Sbjct: 260 LNTELYGDKEAEKEKEKKKKEEEVKAIRIPVVRLKFGEVAEATSVVVLPVCKAEEGEKKI 319

Query: 305 LEAPSECRTXXXXXXXXXXXXXXXXXXXWEKWVVLPGWEPLVELGKGCVVVSFVDAR-VL 363
           LEAP E                      W++WVVLP W P+  +GKG V VSF D R VL
Sbjct: 320 LEAPMEI-------IAGGDFKVVEAEKGWKRWVVLPSWNPVAAIGKGGVAVSFRDDRKVL 372

Query: 364 PWKANRWYKEEPILVVADRSKREVGADDGFYLVKVEDDGVGLKVERGLGLKERGVSESLG 423
           PW      KEEP+LVVADR +  V ADDG+YLV  E+   GLK+E+G  LK R V ESLG
Sbjct: 373 PWDG----KEEPLLVVADRVRNVVEADDGYYLVVAEN---GLKLEKGSDLKAREVKESLG 425

Query: 424 SVILVVRPPREEDDG-QLSDEDWD 446
            V+LVVRPPRE+DD  Q S ++WD
Sbjct: 426 MVVLVVRPPREDDDDWQTSHQNWD 449


>AT5G28500.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast
           stroma, chloroplast; EXPRESSED IN: 23 plant structures;
           EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G04550.1); Has 109 Blast hits to 109 proteins
           in 49 species: Archae - 0; Bacteria - 67; Metazoa - 0;
           Fungi - 0; Plants - 41; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr5:10477810-10479114 FORWARD
           LENGTH=434
          Length = 434

 Score =  340 bits (873), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 190/379 (50%), Positives = 241/379 (63%), Gaps = 17/379 (4%)

Query: 70  TLDIPGQIDILANRLGVWHEYAPLISSHIREGFTPPTIEELTGITGVEQNRFIVAAQVRD 129
           +LD  G+I++LA+RLG+W EYAPLISS   EGFTPP+IEELTGI+GVEQN  IV AQVRD
Sbjct: 71  SLDTAGKIEVLADRLGLWFEYAPLISSLYTEGFTPPSIEELTGISGVEQNSLIVGAQVRD 130

Query: 130 SLLQSNTDPDLVSFFDNGGAELLYQIXXXXXXXXXXXXXFIVENKSDGKGANELARSMKD 189
           SL+QS   P+L++ FD  GAELLY+I             +IV++  D KGA +LAR++KD
Sbjct: 131 SLVQSGAKPELIAAFDTNGAELLYEIRLLNTTQRVAAAEYIVDHGFDTKGAGDLARAIKD 190

Query: 190 FPSRRGEKGWESFDYNLPGDCLSFMYYRLSREYNRNPEQRDSVLEQALRVAESEKARDAI 249
           FP RRG+ G   FDYNLPGDCLSFM YR SRE+    E R ++LEQAL  A +EKA+ A+
Sbjct: 191 FPHRRGDVGLGDFDYNLPGDCLSFMLYRKSREHRSPSEIRTTLLEQALETAVTEKAKKAV 250

Query: 250 QKELKXXXXXXXXXXXXXXXXXXXXXXRLKIGEXXXXXXXXXLPVCNADEGGDAILEAPS 309
            +EL                       RL+ GE         LPVC A+EG + +LEAP 
Sbjct: 251 LRELH-GESEEERVKEEEIKIIRVPVVRLRFGEVAGASSVVVLPVCKAEEGEEKLLEAPM 309

Query: 310 ECRTXXXXXXXXXXXXXXXXXXXWEKWVVLPGWEPLVELGKGCVVVSFVDAR-VLPWKAN 368
           E  +                   W +WVVLPGW+P+V + KG V VSF D R VLPW   
Sbjct: 310 EFES-------GGEFGVVEAEKDWSRWVVLPGWDPVVAVRKG-VAVSFSDDREVLPWNG- 360

Query: 369 RWYKEEPILVVADRSKREVGADDGFYLVKVEDDGVGLKVERGLGLKERGVSESLGSVILV 428
              K E I+VV DR K+ V AD+G+Y + V D   G+K++RGL LKE+GV+ESLG V+LV
Sbjct: 361 ---KGEAIMVVIDREKKTVEADNGYYYLVVADG--GMKLDRGLVLKEKGVNESLGMVVLV 415

Query: 429 VRPPREEDDG-QLSDEDWD 446
           VRPPR++DD  Q++DEDWD
Sbjct: 416 VRPPRDDDDEWQINDEDWD 434


>AT5G28500.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G04550.1). | chr5:10477810-10478945 FORWARD
           LENGTH=269
          Length = 269

 Score =  221 bits (562), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 107/185 (57%), Positives = 134/185 (72%)

Query: 70  TLDIPGQIDILANRLGVWHEYAPLISSHIREGFTPPTIEELTGITGVEQNRFIVAAQVRD 129
           +LD  G+I++LA+RLG+W EYAPLISS   EGFTPP+IEELTGI+GVEQN  IV AQVRD
Sbjct: 71  SLDTAGKIEVLADRLGLWFEYAPLISSLYTEGFTPPSIEELTGISGVEQNSLIVGAQVRD 130

Query: 130 SLLQSNTDPDLVSFFDNGGAELLYQIXXXXXXXXXXXXXFIVENKSDGKGANELARSMKD 189
           SL+QS   P+L++ FD  GAELLY+I             +IV++  D KGA +LAR++KD
Sbjct: 131 SLVQSGAKPELIAAFDTNGAELLYEIRLLNTTQRVAAAEYIVDHGFDTKGAGDLARAIKD 190

Query: 190 FPSRRGEKGWESFDYNLPGDCLSFMYYRLSREYNRNPEQRDSVLEQALRVAESEKARDAI 249
           FP RRG+ G   FDYNLPGDCLSFM YR SRE+    E R ++LEQAL  A +EKA+ A+
Sbjct: 191 FPHRRGDVGLGDFDYNLPGDCLSFMLYRKSREHRSPSEIRTTLLEQALETAVTEKAKKAV 250

Query: 250 QKELK 254
            +EL 
Sbjct: 251 LRELH 255