Miyakogusa Predicted Gene

Lj1g3v3300040.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v3300040.1 Non Chatacterized Hit- tr|I1N5B1|I1N5B1_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.41437
PE,69.18,0,DUF4378,Domain of unknown function DUF4378; VARLMGL,NULL;
seg,NULL,CUFF.30300.1
         (922 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G67040.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   262   7e-70
AT5G26910.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    67   5e-11
AT5G26910.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    67   5e-11
AT3G58650.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    59   1e-08

>AT1G67040.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 20 plant
           structures; EXPRESSED DURING: 11 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G26910.3); Has 89 Blast hits to 84 proteins in
           15 species: Archae - 0; Bacteria - 0; Metazoa - 5; Fungi
           - 2; Plants - 82; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr1:25019105-25021922 REVERSE
           LENGTH=826
          Length = 826

 Score =  262 bits (670), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 277/929 (29%), Positives = 412/929 (44%), Gaps = 144/929 (15%)

Query: 10  AITEKKVQKPGGCVGIFLQLIDWKRRLAKKKLFSQKLLTPAR--AKKFRGDEKMPNSKLH 67
           AITEK+  + GGCVG+F QL DW RR AKKKLFS+K L P +  +K+F G+EKM  SKL+
Sbjct: 14  AITEKRPNRLGGCVGVFFQLFDWNRRFAKKKLFSRKSLLPGKQVSKRFGGNEKMLKSKLN 73

Query: 68  LIANENSEGFPSAKKGGSHGVDVEQKTDMRAPSLVARLMGLEYIPAAQRNNSKEVLNSGS 127
           LI +EN   FP+  +     V   +K +MR+PSLVARLMGLE +P+  R+  K       
Sbjct: 74  LIDDENRGSFPNRNE-----VMEVKKHEMRSPSLVARLMGLESMPSNHRDKGKNKKKKPL 128

Query: 128 CGDGKESLANNCELHNRKGVDLDMGVVKHDSRPQKLQKTGAACERR-AVTRFGAEALHIR 186
               +++  + C+L + +  + D GV K   RPQK+Q+T   C+RR AV +FG+EAL I+
Sbjct: 129 FSQIQDT--DKCDLFDVEEEEEDSGVDK--LRPQKMQRTTGVCDRRVAVKKFGSEALQIK 184

Query: 187 SVLSRARKXXXXXXXXPKLASPLKXXXXXXXXXXXXXXXLIGAATRILEPGLQARKGSLT 246
           +VL+R RK         KLASP++               LI AA RILEPG +  KG++ 
Sbjct: 185 NVLTRVRKHHQYNHQHQKLASPVR-----SPRMNRRSSRLIDAAARILEPGKRNAKGAIA 239

Query: 247 YPACTYPHETNTVTKDAQDWSTVMQNQSCYDAGRSKNLMGQTSCKNCGNLLDVIECKQEV 306
           YP  T         K+      V+  +  +  G + ++    SCK+CG+L+DV       
Sbjct: 240 YPGSTGIRRFENAAKEP-----VVSPE--FQCGYNNSV---ASCKSCGSLVDVNGS---- 285

Query: 307 PVPISVVSDVATANSMVSSSQKEKSFTPSHGQKRDIVLLRSKEKLISLATEEEGKNNAQQ 366
              I VV D  T N+M   S+     TP    KR+ V  R+++  +S++    GK++  Q
Sbjct: 286 ---IQVVQD--TGNNMACVSES----TPFQRSKRN-VFWRNEDSSVSVS----GKDSTDQ 331

Query: 367 SWNEPATRRMAMSCEDNASSFPSKHKIQTQKQMLSTEKYSPGSTMSSNMQVKRGSSSA-- 424
              +   R      +D  S    +++ +  K++L  E+  P    S  +  KR  SS   
Sbjct: 332 MVKKALHR---AQFKDEMSLPGYRNRSEYHKKVLHREERFPPEARSFALPSKRSCSSPAN 388

Query: 425 ---SGTKDFVALNRSLSGRTRM-RSPTKLDSSKFDIEKKPCNRQLSPLSHERTLEKKRRI 480
              S  KDF+A+NR  + R+   +SP K ++S  ++++K         SH R  E   R 
Sbjct: 389 AINSKEKDFIAMNRGSTSRSHHSKSPVKFENSDLNLQRK---------SHTRVEESCNR- 438

Query: 481 LNVSQLEGTASVGLKQKDLCSDALGGKRRDFPPSSLNSFKVKNKRDGQGEEINKVIDNKI 540
                  G ++ G K++  C    G       P       V  + D    E +    N+ 
Sbjct: 439 ------SGLSTPGRKRRLACESGHGRGSSSMSP-------VSRRLDS---EYSCACSNET 482

Query: 541 DDVVSFTFNSPLKQKTGIPLEKEETSCNNE--RNAYCNRLSSPLKADALGAFLEQKLKEL 598
               S    S  +  +    E +E        R ++  R   PL        ++QKLKEL
Sbjct: 483 -AFSSLKLGSSNRHYSQCCRETKERRGVQRVPRPSFTKR---PLLDVGTLGLIQQKLKEL 538

Query: 599 TSQEDDEL-ASSALPKKPSSVILQELLS--ALNSEHLVCHDGHVFNDNCVAKQERLIGNT 655
            SQE+DE    S  P KP+S+IL ELLS  AL  +  V      +      +    IGN 
Sbjct: 539 ASQEEDEANGESGFPNKPASLILHELLSSLALQQQPYVRDIDMPYRRKGKTEFWSSIGNA 598

Query: 656 FNGNQLSPGXXXXXXXXXXXXXXXXGHGFHPYSMNYSYG-QPEQWDHDIELSDSATSFNN 714
            N    SPG                 + F   S       +P + D DI L D ATSF N
Sbjct: 599 -NSEYTSPG---SVLDASFSNESCFSNSFDNISGQMRLPLEPIEPDWDI-LEDYATSFKN 653

Query: 715 GM-------IGEILSQIPSALQCLHSFGRQFPRSKVNNVKDVLLNAELVLRIATDHNEGE 767
                    I  ++S + + L+CL + G    + +    ++V+++ EL++   T      
Sbjct: 654 STSDGNYQAIASLISHVSNVLRCLSNTGLILTQQRFTIAREVIIHTELLVGTTTTQENYL 713

Query: 768 VSXXXXXXXXXXXXDTMASDAMWTEFEGFMAYKGDSKQRNKLK----GFLFDCVVEYLES 823
           +                        F+  M Y   S     L     GFL D ++E+LE 
Sbjct: 714 IGPEL--------------------FDELMIYAARSDNLVNLPGLTGGFLVDAMIEHLE- 752

Query: 824 NCCKYFHSGFRAWTKLPLCVKGNVLAQEVKREMKKWECKAGMMPDEIIEWEMSHSLGKWT 883
                  +        PL  K + L + V  E+ KW   A +  DE+I  EM        
Sbjct: 753 ------ETNISCGLLKPLTAKQDELIRGVIEEVPKW---ARVNMDEVIGIEM-------- 795

Query: 884 DFDIEAFEAGVGIDGDILQILVGEIVEDL 912
           D +   F  G  I  +IL+ L+GE+  DL
Sbjct: 796 DLETHLFGVGSEIAYEILRCLIGELATDL 824


>AT5G26910.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: mitochondrion;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT3G58650.1). |
           chr5:9466169-9469523 REVERSE LENGTH=852
          Length = 852

 Score = 67.4 bits (163), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 67/137 (48%), Gaps = 12/137 (8%)

Query: 784 MASDAMWTEFEGFMAYKGDSKQRNKLKGFLFDCVVEYLESNCCKYFHSGFRAWTKLPLCV 843
           MA+D +       M  +G+       +  LFD V + L   C + F    R      L  
Sbjct: 694 MATDVLPASLFDEMEGRGEVTAAKIKRKTLFDFVNKCLALRCEQMFMGSCRG-----LLG 748

Query: 844 KGNVL-------AQEVKREMKKWECKAGMMPDEIIEWEMSHSLGKWTDFDIEAFEAGVGI 896
           KG  L       A+E+ RE+   +    MM DE+++ EMS   G+W DF+ E +E G+ I
Sbjct: 749 KGGFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMSSFEGRWLDFERETYEEGIDI 808

Query: 897 DGDILQILVGEIVEDLV 913
           +G+I+  LV ++V DLV
Sbjct: 809 EGEIVSTLVDDLVNDLV 825


>AT5G26910.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G58650.1); Has 1322 Blast hits to 684 proteins
           in 162 species: Archae - 4; Bacteria - 497; Metazoa -
           157; Fungi - 101; Plants - 155; Viruses - 0; Other
           Eukaryotes - 408 (source: NCBI BLink). |
           chr5:9466169-9469523 REVERSE LENGTH=853
          Length = 853

 Score = 67.4 bits (163), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 67/137 (48%), Gaps = 12/137 (8%)

Query: 784 MASDAMWTEFEGFMAYKGDSKQRNKLKGFLFDCVVEYLESNCCKYFHSGFRAWTKLPLCV 843
           MA+D +       M  +G+       +  LFD V + L   C + F    R      L  
Sbjct: 695 MATDVLPASLFDEMEGRGEVTAAKIKRKTLFDFVNKCLALRCEQMFMGSCRG-----LLG 749

Query: 844 KGNVL-------AQEVKREMKKWECKAGMMPDEIIEWEMSHSLGKWTDFDIEAFEAGVGI 896
           KG  L       A+E+ RE+   +    MM DE+++ EMS   G+W DF+ E +E G+ I
Sbjct: 750 KGGFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMSSFEGRWLDFERETYEEGIDI 809

Query: 897 DGDILQILVGEIVEDLV 913
           +G+I+  LV ++V DLV
Sbjct: 810 EGEIVSTLVDDLVNDLV 826


>AT3G58650.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 14 plant structures; EXPRESSED DURING: 8
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G26910.1); Has 2350 Blast
           hits to 1412 proteins in 248 species: Archae - 0;
           Bacteria - 487; Metazoa - 577; Fungi - 236; Plants -
           184; Viruses - 4; Other Eukaryotes - 862 (source: NCBI
           BLink). | chr3:21696349-21699219 REVERSE LENGTH=820
          Length = 820

 Score = 59.3 bits (142), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 62/112 (55%), Gaps = 14/112 (12%)

Query: 813 LFDCV-----VEY---LESNCCKYFHSGFRAWTKLPLCVKGNVLAQEVKREMKKWECKAG 864
           LFDCV     V++   L  +C     SG        L    ++LA+EV RE+K  +    
Sbjct: 708 LFDCVNQCLAVKFERMLIGSCKGMMMSGG------ILLEHRDLLAEEVNREVKGLKKMRE 761

Query: 865 MMPDEIIEWEMSHSLGKWTDFDIEAFEAGVGIDGDILQILVGEIVEDLVGSN 916
           MM DE+++ +MS   G+W  ++ E FE G+ ++G+I+  LV ++V D++ ++
Sbjct: 762 MMIDELVDHDMSCFEGRWIGYEREMFEEGIDMEGEIVSALVDDLVSDILSTS 813