Miyakogusa Predicted Gene

Lj3g3v0349300.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v0349300.1 Non Chatacterized Hit- tr|K4BLN0|K4BLN0_SOLLC
Uncharacterized protein OS=Solanum lycopersicum
GN=Sol,23.85,8e-19,seg,NULL; FAMILY NOT NAMED,NULL,CUFF.40576.1
         (249 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G67550.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   145   2e-35
AT2G12400.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    94   1e-19
AT2G25270.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...    87   1e-17
AT1G80540.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    72   3e-13
AT1G71110.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    67   8e-12

>AT5G67550.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: flower; EXPRESSED DURING: 4
           anthesis; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT1G71110.1); Has 161 Blast hits
           to 154 proteins in 16 species: Archae - 0; Bacteria - 0;
           Metazoa - 0; Fungi - 0; Plants - 161; Viruses - 0; Other
           Eukaryotes - 0 (source: NCBI BLink). |
           chr5:26946908-26949112 REVERSE LENGTH=509
          Length = 509

 Score =  145 bits (367), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 77/261 (29%), Positives = 133/261 (50%), Gaps = 13/261 (4%)

Query: 1   MLVVLCWLTTVICWLFFGVYFFLEKFSNDACTALDNFQENPYNNSLSSILPCDELLSAKP 60
           M++ LCW+ T +CW+  G  FF+  F+ D C+A + F +NP N++L+++ PC + L +  
Sbjct: 249 MVIFLCWIITTLCWVLTGFDFFIHTFAEDLCSAFNGFVQNPRNSTLTNLFPCMDPLHSDK 308

Query: 61  VLSDVSAGIYDLVNKVNANIS------ALQATS-----YPDLVQVCNPFSEPPNYFYQPE 109
            L ++S  I++ + ++N+ ++      AL   S      P+   +C+PF       Y P+
Sbjct: 309 TLIEISLMIHNFITQLNSKVAESMRSNALTDRSNTVSWAPESGIICDPFVGQQINSYTPQ 368

Query: 110 NCPANTIRIGDIPKVLKAFTCLDAN-DGTCD-NGNLISSSEYVRVEAYTTSIQDLLNVYP 167
           +C    I IG+ P +L  FTC D +   TC   G  I  + Y++V AY+ S Q +L++ P
Sbjct: 369 SCSNGAIPIGEFPNILSRFTCHDKDPPETCRITGKFIPEAAYLKVYAYSNSAQGMLDILP 428

Query: 168 SMEHLLECQIVKDAFSQVLSKQCKPMKKYARMVWVGMXXXXXXXXXXXXXWTIKARREHR 227
           S ++L EC  VKD  S ++S QC P +     +W  +             +  KA +E  
Sbjct: 429 SFQNLTECLAVKDTLSSIVSNQCDPFRASMYRLWASILALSLIMVVLVLLFLAKAFQEKG 488

Query: 228 YHLSDSSVEPLESRPSKEIEI 248
              +  S+ P  S   +++ I
Sbjct: 489 KSFAWFSIHPTSSAEIRQVNI 509


>AT2G12400.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 25 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT2G25270.1);
           Has 177 Blast hits to 172 proteins in 23 species: Archae
           - 0; Bacteria - 2; Metazoa - 3; Fungi - 0; Plants - 164;
           Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).
           | chr2:5005144-5008140 REVERSE LENGTH=541
          Length = 541

 Score = 93.6 bits (231), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/258 (24%), Positives = 106/258 (41%), Gaps = 19/258 (7%)

Query: 2   LVVLCWLTTVICWLFFGVYFFLEKFSNDACTALDNFQENPY-NNSLSSILPCDELLSAKP 60
           LV+L W+   + ++  G +  L     D C A+D + +NP  + +L  ILPC +  +A+ 
Sbjct: 286 LVILGWILVTVTFVLCGGFLLLHNVVGDTCVAMDQWVQNPTAHTALDDILPCVDNATARE 345

Query: 61  VLSDVSAGIYDLVNKVNANISALQATSYPDLVQ-------------VCNPFSEPPNYFYQ 107
            L+      Y LVN ++  IS +   ++P   +             +CNPF    N    
Sbjct: 346 TLTRTKLVTYQLVNLLDNAISNMTNRNFPPQFRPLYYNQSGPLMPLLCNPF----NADLS 401

Query: 108 PENCPANTIRIGDIPKVLKAFTCLDANDGTCDNGNLISSSEYVRVEAYTTSIQDLLNVYP 167
              C    + + +  +V K FTC     GTC     ++   Y ++ A       L    P
Sbjct: 402 DRQCQPGQVHLNNATEVWKNFTCQIVTPGTCSTPGRLTPKLYSQMAAAVNVSYGLYKYGP 461

Query: 168 SMEHLLECQIVKDAFSQVLSKQCKPMKKYARMVWVGMXXXXXXXXXXXXXWTIKAR-REH 226
            +  L  C  V+  F+ +    C  +K+Y + ++VG+             W I AR R H
Sbjct: 462 FLADLQGCDFVRSTFTDIERDHCPGLKRYTQWIYVGLVVVSASVMSSLVFWVIYARERRH 521

Query: 227 RYHLSDSSVEPLESRPSK 244
           R +  D +    E   SK
Sbjct: 522 RVYTKDYNAMHSEDPRSK 539


>AT2G25270.1 | Symbols:  | unknown protein; LOCATED IN: plasma
           membrane; EXPRESSED IN: 18 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT2G12400.1);
           Has 35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr2:10759779-10762358 FORWARD
           LENGTH=545
          Length = 545

 Score = 86.7 bits (213), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/241 (23%), Positives = 105/241 (43%), Gaps = 19/241 (7%)

Query: 2   LVVLCWLTTVICWLFFGVYFFLEKFSNDACTALDNFQENPYNNS-LSSILPCDELLSAKP 60
           LV+L W+     ++  G +  L   + D C A+  + E P +N+ L  ILPC +  +A+ 
Sbjct: 291 LVILGWILVTGTFILSGTFLVLHNATADTCVAMSEWVERPSSNTALDEILPCTDNATAQE 350

Query: 61  VL---SDVSAGIYDLVNKVNANISALQAT----------SYPDLVQVCNPFSEPPNYFYQ 107
            L    +V+  + +L+N V  N+S +  +          S P L  +CNPF    N+   
Sbjct: 351 TLMRSREVTGQLVELINTVITNVSNINFSPVFVPMYYNQSGPLLPLLCNPF----NHDLT 406

Query: 108 PENCPANTIRIGDIPKVLKAFTCLDANDGTCDNGNLISSSEYVRVEAYTTSIQDLLNVYP 167
             +C    + + +  +   +F C  + +GTC     ++ + Y ++ +       L+   P
Sbjct: 407 DRSCSPGDLDLNNATEAWTSFVCQVSQNGTCTTTGRLTPALYSQMASGVNISTGLIRDAP 466

Query: 168 SMEHLLECQIVKDAFSQVLSKQCKPMKKYARMVWVGMXXXXXXXXXXXXXWTIKAR-REH 226
            +  L +C   K  F  + +  C  +++Y   V+VG+             W I +R R H
Sbjct: 467 FLVQLQDCSYAKQTFRDITNDHCPGLQRYGYWVYVGLAILATAVMLSLMFWIIYSRERRH 526

Query: 227 R 227
           R
Sbjct: 527 R 527


>AT1G80540.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G12400.1); Has 175 Blast hits to 171 proteins
           in 20 species: Archae - 0; Bacteria - 0; Metazoa - 2;
           Fungi - 0; Plants - 171; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr1:30281638-30284258 REVERSE
           LENGTH=538
          Length = 538

 Score = 72.4 bits (176), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/251 (20%), Positives = 97/251 (38%), Gaps = 19/251 (7%)

Query: 1   MLVVLCWLTTVICWLFFGVYFFLEKFSNDACTALDNFQENPYNNS-LSSILPC---DELL 56
           +LV+L W+      L   V+        D C A+D +  +P  +S LS +LPC     + 
Sbjct: 288 LLVILGWILVTATILLSAVFLVFHNVVADTCMAMDQWVHDPAADSALSQLLPCLDPKTIG 347

Query: 57  SAKPVLSDVSAGIYDLVNKVNANISA----------LQATSYPDLVQVCNPFSEPPNYFY 106
               +   ++A   D+ N    N+S               S P +  +CNP  +     +
Sbjct: 348 ETLDITKTMTATAVDMTNAYTVNVSNHDQFPPNAPFYHNQSGPLVPLLCNPLDQN----H 403

Query: 107 QPENCPANTIRIGDIPKVLKAFTCLDANDGTCDNGNLISSSEYVRVEAYTTSIQDLLNVY 166
           +P  C  + + + +  +V K + C    +G C     ++   Y ++         L +  
Sbjct: 404 KPRPCAPDEVLLANASQVYKGYICQVNAEGICTTQGRLTQGSYDQMMGAINVAFTLDHYG 463

Query: 167 PSMEHLLECQIVKDAFSQVLSKQCKPMKKYARMVWVGMXXXXXXXXXXXXXWTIKAR-RE 225
           P +  + +C  V+D F  + +K C  +   ++ ++ G+             W I  R R 
Sbjct: 464 PFLASIADCTFVRDTFRDITTKNCPGLSITSQWIYAGLASLSGAVMFSLIFWLIFVRERR 523

Query: 226 HRYHLSDSSVE 236
           HR     S ++
Sbjct: 524 HRSQTKKSMIQ 534


>AT1G71110.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT2G12400.1); Has 173 Blast hits
           to 169 proteins in 21 species: Archae - 0; Bacteria - 0;
           Metazoa - 3; Fungi - 0; Plants - 165; Viruses - 0; Other
           Eukaryotes - 5 (source: NCBI BLink). |
           chr1:26818244-26820852 FORWARD LENGTH=557
          Length = 557

 Score = 67.4 bits (163), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 53/240 (22%), Positives = 92/240 (38%), Gaps = 17/240 (7%)

Query: 1   MLVVLCWLTTVICWLFFGVYFFLEKFSNDACTALDNFQENPY-NNSLSSILPCDELLSAK 59
           + VV  W+   + ++  GV+  L    +D C A+  + +NP+   +LSSILPC +  +  
Sbjct: 286 IFVVSGWILVAVTFVLCGVFLILNNAISDTCVAMKEWVDNPHAETALSSILPCVDQQTTN 345

Query: 60  PVLSDVSAGIYDLVNKVNANISALQAT------------SYPDLVQVCNPFSEPPNYFYQ 107
             LS     I  +V  VN  + A+  T            S P +  +C PF        +
Sbjct: 346 QTLSQSKVVINSIVTVVNTFVYAVANTNPAPGQDRYYNQSGPPMPPLCIPFDAN----ME 401

Query: 108 PENCPANTIRIGDIPKVLKAFTCLDANDGTCDNGNLISSSEYVRVEAYTTSIQDLLNVYP 167
              C    + I +   V + + C     G C     ++   + ++ A       L +  P
Sbjct: 402 DRQCSPWELSIENASSVWENYKCEVTPSGICTTVGRVTPDTFGQLVAAVNESYALEHYTP 461

Query: 168 SMEHLLECQIVKDAFSQVLSKQCKPMKKYARMVWVGMXXXXXXXXXXXXXWTIKARREHR 227
            +    +C  V++ F  + S  C P+ +  R+V  G+             W   A R  R
Sbjct: 462 PLLSFRDCNFVRETFMSITSDYCPPLVRNLRIVNAGLGLISVGVLLCLVLWIFYANRPQR 521