Miyakogusa Predicted Gene

Lj6g3v1092210.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v1092210.1 CUFF.59067.1
         (226 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G12400.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   333   4e-92
AT2G25270.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...   295   1e-80
AT1G71110.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   251   3e-67
AT1G80540.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   236   1e-62
AT5G67550.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    73   2e-13

>AT2G12400.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 25 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT2G25270.1);
           Has 177 Blast hits to 172 proteins in 23 species: Archae
           - 0; Bacteria - 2; Metazoa - 3; Fungi - 0; Plants - 164;
           Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).
           | chr2:5005144-5008140 REVERSE LENGTH=541
          Length = 541

 Score =  333 bits (855), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 152/226 (67%), Positives = 184/226 (81%)

Query: 1   MLFVAFLGFLFSIFGLQGLVYFLVIVGWILVAGTFILCGVFLFLHNVVADSCVAMDEWVL 60
           MLF+AF+GFL SIFGLQ LVY LVI+GWILV  TF+LCG FL LHNVV D+CVAMD+WV 
Sbjct: 264 MLFLAFIGFLLSIFGLQCLVYTLVILGWILVTVTFVLCGGFLLLHNVVGDTCVAMDQWVQ 323

Query: 61  NPTAHTALDEILPCVDNATAQQTLLQSRDVTHELVNLVDKIINTVTNRNLPPAAGSLYYN 120
           NPTAHTALD+ILPCVDNATA++TL +++ VT++LVNL+D  I+ +TNRN PP    LYYN
Sbjct: 324 NPTAHTALDDILPCVDNATARETLTRTKLVTYQLVNLLDNAISNMTNRNFPPQFRPLYYN 383

Query: 121 QSGPLMPVLCNPYNPNLTTRSCAPGEVPVDNAREVWKNYTCQVSPAGICATPGRMTPTIY 180
           QSGPLMP+LCNP+N +L+ R C PG+V ++NA EVWKN+TCQ+   G C+TPGR+TP +Y
Sbjct: 384 QSGPLMPLLCNPFNADLSDRQCQPGQVHLNNATEVWKNFTCQIVTPGTCSTPGRLTPKLY 443

Query: 181 GQFEAAVNISYGLYHYVPFLVDLQDCTFVRETFTDISNQYCPGLRR 226
            Q  AAVN+SYGLY Y PFL DLQ C FVR TFTDI   +CPGL+R
Sbjct: 444 SQMAAAVNVSYGLYKYGPFLADLQGCDFVRSTFTDIERDHCPGLKR 489


>AT2G25270.1 | Symbols:  | unknown protein; LOCATED IN: plasma
           membrane; EXPRESSED IN: 18 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT2G12400.1);
           Has 35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr2:10759779-10762358 FORWARD
           LENGTH=545
          Length = 545

 Score =  295 bits (756), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 133/226 (58%), Positives = 172/226 (76%)

Query: 1   MLFVAFLGFLFSIFGLQGLVYFLVIVGWILVAGTFILCGVFLFLHNVVADSCVAMDEWVL 60
           ML V FLG + SIFG+Q +VY LVI+GWILV GTFIL G FL LHN  AD+CVAM EWV 
Sbjct: 269 MLVVTFLGLVSSIFGMQVIVYTLVILGWILVTGTFILSGTFLVLHNATADTCVAMSEWVE 328

Query: 61  NPTAHTALDEILPCVDNATAQQTLLQSRDVTHELVNLVDKIINTVTNRNLPPAAGSLYYN 120
            P+++TALDEILPC DNATAQ+TL++SR+VT +LV L++ +I  V+N N  P    +YYN
Sbjct: 329 RPSSNTALDEILPCTDNATAQETLMRSREVTGQLVELINTVITNVSNINFSPVFVPMYYN 388

Query: 121 QSGPLMPVLCNPYNPNLTTRSCAPGEVPVDNAREVWKNYTCQVSPAGICATPGRMTPTIY 180
           QSGPL+P+LCNP+N +LT RSC+PG++ ++NA E W ++ CQVS  G C T GR+TP +Y
Sbjct: 389 QSGPLLPLLCNPFNHDLTDRSCSPGDLDLNNATEAWTSFVCQVSQNGTCTTTGRLTPALY 448

Query: 181 GQFEAAVNISYGLYHYVPFLVDLQDCTFVRETFTDISNQYCPGLRR 226
            Q  + VNIS GL    PFLV LQDC++ ++TF DI+N +CPGL+R
Sbjct: 449 SQMASGVNISTGLIRDAPFLVQLQDCSYAKQTFRDITNDHCPGLQR 494


>AT1G71110.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT2G12400.1); Has 173 Blast hits
           to 169 proteins in 21 species: Archae - 0; Bacteria - 0;
           Metazoa - 3; Fungi - 0; Plants - 165; Viruses - 0; Other
           Eukaryotes - 5 (source: NCBI BLink). |
           chr1:26818244-26820852 FORWARD LENGTH=557
          Length = 557

 Score =  251 bits (641), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 119/226 (52%), Positives = 154/226 (68%), Gaps = 1/226 (0%)

Query: 1   MLFVAFLGFLFSIFGLQGLVYFLVIVGWILVAGTFILCGVFLFLHNVVADSCVAMDEWVL 60
           ML ++F+G L S+   Q +V+  V+ GWILVA TF+LCGVFL L+N ++D+CVAM EWV 
Sbjct: 265 MLILSFVGLLLSVLRHQHVVHIFVVSGWILVAVTFVLCGVFLILNNAISDTCVAMKEWVD 324

Query: 61  NPTAHTALDEILPCVDNATAQQTLLQSRDVTHELVNLVDKIINTVTNRNLPPAAGSLYYN 120
           NP A TAL  ILPCVD  T  QTL QS+ V + +V +V+  +  V N N P      YYN
Sbjct: 325 NPHAETALSSILPCVDQQTTNQTLSQSKVVINSIVTVVNTFVYAVANTN-PAPGQDRYYN 383

Query: 121 QSGPLMPVLCNPYNPNLTTRSCAPGEVPVDNAREVWKNYTCQVSPAGICATPGRMTPTIY 180
           QSGP MP LC P++ N+  R C+P E+ ++NA  VW+NY C+V+P+GIC T GR+TP  +
Sbjct: 384 QSGPPMPPLCIPFDANMEDRQCSPWELSIENASSVWENYKCEVTPSGICTTVGRVTPDTF 443

Query: 181 GQFEAAVNISYGLYHYVPFLVDLQDCTFVRETFTDISNQYCPGLRR 226
           GQ  AAVN SY L HY P L+  +DC FVRETF  I++ YCP L R
Sbjct: 444 GQLVAAVNESYALEHYTPPLLSFRDCNFVRETFMSITSDYCPPLVR 489


>AT1G80540.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G12400.1); Has 175 Blast hits to 171 proteins
           in 20 species: Archae - 0; Bacteria - 0; Metazoa - 2;
           Fungi - 0; Plants - 171; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr1:30281638-30284258 REVERSE
           LENGTH=538
          Length = 538

 Score =  236 bits (601), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 114/224 (50%), Positives = 151/224 (67%)

Query: 1   MLFVAFLGFLFSIFGLQGLVYFLVIVGWILVAGTFILCGVFLFLHNVVADSCVAMDEWVL 60
           ML VAFLG LFS  GL+ LVY LVI+GWILV  T +L  VFL  HNVVAD+C+AMD+WV 
Sbjct: 267 MLAVAFLGLLFSFCGLRVLVYLLVILGWILVTATILLSAVFLVFHNVVADTCMAMDQWVH 326

Query: 61  NPTAHTALDEILPCVDNATAQQTLLQSRDVTHELVNLVDKIINTVTNRNLPPAAGSLYYN 120
           +P A +AL ++LPC+D  T  +TL  ++ +T   V++ +     V+N +  P     Y+N
Sbjct: 327 DPAADSALSQLLPCLDPKTIGETLDITKTMTATAVDMTNAYTVNVSNHDQFPPNAPFYHN 386

Query: 121 QSGPLMPVLCNPYNPNLTTRSCAPGEVPVDNAREVWKNYTCQVSPAGICATPGRMTPTIY 180
           QSGPL+P+LCNP + N   R CAP EV + NA +V+K Y CQV+  GIC T GR+T   Y
Sbjct: 387 QSGPLVPLLCNPLDQNHKPRPCAPDEVLLANASQVYKGYICQVNAEGICTTQGRLTQGSY 446

Query: 181 GQFEAAVNISYGLYHYVPFLVDLQDCTFVRETFTDISNQYCPGL 224
            Q   A+N+++ L HY PFL  + DCTFVR+TF DI+ + CPGL
Sbjct: 447 DQMMGAINVAFTLDHYGPFLASIADCTFVRDTFRDITTKNCPGL 490


>AT5G67550.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: flower; EXPRESSED DURING: 4
           anthesis; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT1G71110.1); Has 161 Blast hits
           to 154 proteins in 16 species: Archae - 0; Bacteria - 0;
           Metazoa - 0; Fungi - 0; Plants - 161; Viruses - 0; Other
           Eukaryotes - 0 (source: NCBI BLink). |
           chr5:26946908-26949112 REVERSE LENGTH=509
          Length = 509

 Score = 72.8 bits (177), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/210 (23%), Positives = 96/210 (45%), Gaps = 11/210 (5%)

Query: 23  LVIVGWILVAGTFILCGVFLFLHNVVADSCVAMDEWVLNPTAHTALDEILPCVDNATAQQ 82
           ++ + WI+    ++L G   F+H    D C A + +V NP   T L  + PC+D   + +
Sbjct: 250 VIFLCWIITTLCWVLTGFDFFIHTFAEDLCSAFNGFVQNPRNST-LTNLFPCMDPLHSDK 308

Query: 83  TLLQSRDVTHELV-NLVDKIINTVTNRNLPPAAGSLYYNQSGPLMPVLCNPYNP----NL 137
           TL++   + H  +  L  K+  ++ +  L   + ++ +    P   ++C+P+      + 
Sbjct: 309 TLIEISLMIHNFITQLNSKVAESMRSNALTDRSNTVSW---APESGIICDPFVGQQINSY 365

Query: 138 TTRSCAPGEVPVDNAREVWKNYTCQ-VSPAGICATPGRMTP-TIYGQFEAAVNISYGLYH 195
           T +SC+ G +P+     +   +TC    P   C   G+  P   Y +  A  N + G+  
Sbjct: 366 TPQSCSNGAIPIGEFPNILSRFTCHDKDPPETCRITGKFIPEAAYLKVYAYSNSAQGMLD 425

Query: 196 YVPFLVDLQDCTFVRETFTDISNQYCPGLR 225
            +P   +L +C  V++T + I +  C   R
Sbjct: 426 ILPSFQNLTECLAVKDTLSSIVSNQCDPFR 455