Miyakogusa Predicted Gene

Lj1g3v5031290.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v5031290.1 Non Chatacterized Hit- tr|F6HHD5|F6HHD5_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,31.88,4e-19,coiled-coil,NULL; FAMILY NOT
NAMED,NULL,NODE_37671_length_1831_cov_14.535773.path2.1
         (220 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G41620.1 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   137   8e-33
AT1G64180.1 | Symbols:  | intracellular protein transport protei...   122   2e-28
AT3G11590.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...    98   4e-21
AT2G46250.1 | Symbols:  | myosin heavy chain-related | chr2:1899...    96   3e-20
AT1G50660.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...    90   9e-19
AT3G20350.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...    89   2e-18
AT5G22310.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    53   2e-07

>AT5G41620.1 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           chloroplast, plasma membrane; EXPRESSED IN: 9 plant
           structures; EXPRESSED DURING: 6 growth stages; BEST
           Arabidopsis thaliana protein match is: intracellular
           protein transport protein USO1-related
           (TAIR:AT1G64180.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:16646330-16648776 FORWARD LENGTH=623
          Length = 623

 Score =  137 bits (344), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 76/182 (41%), Positives = 124/182 (68%), Gaps = 5/182 (2%)

Query: 2   EVDLSRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDEK 61
           EV  SR R+K+LL+  + ++  ++ ++K+L E+KL+ K+ E  ++ +AVQS+++ ++DE+
Sbjct: 226 EVAHSRVRIKELLRYQQADRHELDSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALEDER 285

Query: 62  RLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEVH 121
           +LRK SESLHRK+ RELSEVKSS  + +++LE   K+  +++ LCD+FAKG + YE E+H
Sbjct: 286 KLRKRSESLHRKMARELSEVKSSLSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEEIH 345

Query: 122 SL----MHSVEKGQVKGD-CRLRISEAWLNERKKMKLVQASSDLTDRDSIIDKLGVDIET 176
            L    +     G+  GD   L I+E+WL+ER +M+L    +      S++DKL V+IET
Sbjct: 346 GLKKKNLDKDWAGRGGGDQLVLHIAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEIET 405

Query: 177 YL 178
           +L
Sbjct: 406 FL 407


>AT1G64180.1 | Symbols:  | intracellular protein transport protein
           USO1-related | chr1:23821640-23824193 FORWARD LENGTH=593
          Length = 593

 Score =  122 bits (306), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 72/181 (39%), Positives = 120/181 (66%), Gaps = 18/181 (9%)

Query: 2   EVDLSRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDEK 61
           E+  SRAR+KDLL+  + +K+ M+  +K+L E+KL + + EH ++ +AVQS+    +DE+
Sbjct: 218 ELAHSRARIKDLLRCKQADKRDMDDFVKQLAEEKLSKGTKEHDRLSSAVQSL----EDER 273

Query: 62  RLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEVH 121
           +LRK SESL+RKL +ELSEVKS+  + ++++E   +++ +L+ LCD+FAKG + YE E+H
Sbjct: 274 KLRKRSESLYRKLAQELSEVKSTLSNCVKEMERGTESKKILERLCDEFAKGIKSYEREIH 333

Query: 122 SLMHSVEKGQVKGDCR----LRISEAWLNERKKMKLVQASSDLTDRDSIIDKLGVDIETY 177
            L   ++K     D +    L I+E+WL+ER     +Q+ +      S ++KL  +IET+
Sbjct: 334 GLKQKLDKNWKGWDEQDHMILCIAESWLDER-----IQSGN-----GSALEKLEFEIETF 383

Query: 178 L 178
           L
Sbjct: 384 L 384


>AT3G11590.1 | Symbols:  | unknown protein; LOCATED IN: plasma
           membrane; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G22310.1);
           Has 22320 Blast hits to 15179 proteins in 1213 species:
           Archae - 372; Bacteria - 2307; Metazoa - 10906; Fungi -
           1700; Plants - 1146; Viruses - 65; Other Eukaryotes -
           5824 (source: NCBI BLink). | chr3:3660628-3663537
           FORWARD LENGTH=622
          Length = 622

 Score = 98.2 bits (243), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 60/182 (32%), Positives = 117/182 (64%), Gaps = 3/182 (1%)

Query: 2   EVDLSRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDEK 61
           E++ +R +V  L+ E+K     +  L+K+  E+K V KSNE   ++AA++S+  E++ E+
Sbjct: 263 ELERARLQVNQLIHEHKPENNDISYLMKRFAEEKAVWKSNEQEVVEAAIESVAGELEVER 322

Query: 62  RLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEVH 121
           +LR+  ESL++KL +EL+E KS+   +++++E+E++AR++++ +CD+ A+   + + EV 
Sbjct: 323 KLRRRFESLNKKLGKELAETKSALMKAVKEIENEKRARVMVEKVCDELARDISEDKAEVE 382

Query: 122 SL---MHSVEKGQVKGDCRLRISEAWLNERKKMKLVQASSDLTDRDSIIDKLGVDIETYL 178
            L      V++   K    L++++A   ER +MKL +A   L ++++ +DKL   ++TYL
Sbjct: 383 ELKRESFKVKEEVEKEREMLQLADALREERVQMKLSEAKHQLEEKNAAVDKLRNQLQTYL 442

Query: 179 LA 180
            A
Sbjct: 443 KA 444


>AT2G46250.1 | Symbols:  | myosin heavy chain-related |
           chr2:18991386-18993201 FORWARD LENGTH=468
          Length = 468

 Score = 95.5 bits (236), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 60/153 (39%), Positives = 93/153 (60%), Gaps = 16/153 (10%)

Query: 1   MEVDLSRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDE 60
           ME+D  RA +K++ Q  K++             D+ +RK  E  ++K   +SIK E+ DE
Sbjct: 191 MELDECRAEIKEVQQRKKLS-------------DRPLRKKKEEEEVKDVFRSIKRELDDE 237

Query: 61  KRLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEV 120
           +++RK SE+LHRKL REL E K     +L+DLE E + R+V++NLCD+FAK  +DYE +V
Sbjct: 238 RKVRKESETLHRKLTRELCEAKHCLSKALKDLEKETQERVVVENLCDEFAKAVKDYEDKV 297

Query: 121 HSLMHSVEKGQVKGDCRLRISEAWLNERKKMKL 153
             +    +K  V     ++I+E W ++R +MKL
Sbjct: 298 RRIG---KKSPVSDKVIVQIAEVWSDQRLQMKL 327


>AT1G50660.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT3G20350.1); Has 21445 Blast
           hits to 15134 proteins in 1325 species: Archae - 461;
           Bacteria - 2309; Metazoa - 11052; Fungi - 1737; Plants -
           1035; Viruses - 42; Other Eukaryotes - 4809 (source:
           NCBI BLink). | chr1:18771386-18774385 FORWARD LENGTH=725
          Length = 725

 Score = 90.1 bits (222), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 113/180 (62%), Gaps = 3/180 (1%)

Query: 2   EVDLSRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDEK 61
           E++ + AR++DL  E + +K+ +E+ ++K++E++   +S EH K++A +  +K ++  EK
Sbjct: 245 ELEEAHARIEDLESEKRSHKKKLEQFLRKVSEERAAWRSREHEKVRAIIDDMKTDMNREK 304

Query: 62  RLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEVH 121
           + R+  E ++ KLV EL++ K +    ++D E ERKAR +++ +CD+ AK   + + E+ 
Sbjct: 305 KTRQRLEIVNHKLVNELADSKLAVKRYMQDYEKERKARELIEEVCDELAKEIGEDKAEIE 364

Query: 122 SL-MHSVEKGQVKGDCR--LRISEAWLNERKKMKLVQASSDLTDRDSIIDKLGVDIETYL 178
           +L   S+   +   D R  L+++E W  ER +MKL+ A   L +R S ++KL  D+E++L
Sbjct: 365 ALKRESMSLREEVDDERRMLQMAEVWREERVQMKLIDAKVALEERYSQMNKLVGDLESFL 424


>AT3G20350.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: cotyledon; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT1G50660.1);
           Has 15095 Blast hits to 11224 proteins in 1051 species:
           Archae - 223; Bacteria - 1586; Metazoa - 7000; Fungi -
           1255; Plants - 746; Viruses - 40; Other Eukaryotes -
           4245 (source: NCBI BLink). | chr3:7096602-7099372
           FORWARD LENGTH=673
          Length = 673

 Score = 89.0 bits (219), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/185 (31%), Positives = 112/185 (60%), Gaps = 3/185 (1%)

Query: 6   SRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDEKRLRK 65
           +RA +KDL  E +  K+ +E+ +KK++E++   +S EH K++A +  +K ++  EK+ R+
Sbjct: 226 ARACIKDLESEKRSQKKKLEQFLKKVSEERAAWRSREHEKVRAIIDDMKADMNQEKKTRQ 285

Query: 66  HSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEVHSL-M 124
             E ++ KLV EL++ K +    + D + ERKAR +++ +CD+ AK   + + E+ +L  
Sbjct: 286 RLEIVNSKLVNELADSKLAVKRYMHDYQQERKARELIEEVCDELAKEIEEDKAEIEALKS 345

Query: 125 HSVEKGQVKGDCR--LRISEAWLNERKKMKLVQASSDLTDRDSIIDKLGVDIETYLLAMS 182
            S+   +   D R  L+++E W  ER +MKL+ A   L ++ S ++KL  D+E +L + +
Sbjct: 346 ESMNLREEVDDERRMLQMAEVWREERVQMKLIDAKVTLEEKYSQMNKLVGDMEAFLSSRN 405

Query: 183 SVNLR 187
           +  ++
Sbjct: 406 TTGVK 410


>AT5G22310.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11590.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:7383742-7385345 REVERSE LENGTH=481
          Length = 481

 Score = 52.8 bits (125), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 68/122 (55%), Gaps = 17/122 (13%)

Query: 52  SIKEEIQDEKRLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAK 111
           S++EE   E++LR+ +E ++R+L REL+E K +      +++ E++A+ VL+ +CD+  K
Sbjct: 248 SLQEEAMVERKLRRRTEKMNRRLGRELTEAKETERKMKEEMKREKRAKDVLEEVCDELTK 307

Query: 112 GTRDYEHEV---HSLMHSVEKGQVKGDCRLRISEAWLNERKKMKLVQASSDLTDRDSIID 168
           G  D + E+     +MH              I++    ER +MKL +A  +  D+ + ++
Sbjct: 308 GIGDDKKEMEKEREMMH--------------IADVLREERVQMKLTEAKFEFEDKYAAVE 353

Query: 169 KL 170
           +L
Sbjct: 354 RL 355