Miyakogusa Predicted Gene

Lj0g3v0341679.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0341679.1 Non Chatacterized Hit- tr|D7T4R6|D7T4R6_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,27.6,2e-17,seg,NULL,CUFF.23408.1
         (373 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G40070.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...    75   5e-14
AT1G27850.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    66   3e-11
AT2G40070.2 | Symbols:  | FUNCTIONS IN: molecular_function unkno...    62   5e-10
AT3G09000.1 | Symbols:  | proline-rich family protein | chr3:274...    57   3e-08

>AT2G40070.1 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: proline-rich family protein (TAIR:AT3G09000.1); Has
           35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr2:16728378-16731160 REVERSE
           LENGTH=607
          Length = 607

 Score = 75.5 bits (184), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 49/121 (40%), Positives = 71/121 (58%), Gaps = 14/121 (11%)

Query: 24  DDDLSLFRELRKRQNDHVPSFLINGAASEEYECDTNIGGSVGKFSLYRIHSG-----KKE 78
           D++LSLF E+R+R+ +   + L+N    +E+E  T +G   G   ++ I SG     K  
Sbjct: 32  DEELSLFLEMRRREKEQD-NLLLNNNP-DEFE--TPLGSKHGTSPVFNISSGAPPSRKAA 87

Query: 79  HGEFM--ETNKNDYDWLKTPPATPLFPSLEMEPNAHLVTQKEIPISQP---ISRLAKSDM 133
             +F+  E +KNDY+WL TPP TPLFPSLEME +  +++Q     S+P    SRLA S  
Sbjct: 88  PDDFLNSEGDKNDYEWLLTPPGTPLFPSLEMESHRTMMSQTGDSKSRPATLTSRLANSST 147

Query: 134 E 134
           E
Sbjct: 148 E 148


>AT1G27850.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G40070.1); Has 9215 Blast
           hits to 5316 proteins in 473 species: Archae - 6;
           Bacteria - 773; Metazoa - 3392; Fungi - 1710; Plants -
           539; Viruses - 143; Other Eukaryotes - 2652 (source:
           NCBI BLink). | chr1:9699265-9703701 FORWARD LENGTH=1148
          Length = 1148

 Score = 66.2 bits (160), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 67/123 (54%), Gaps = 10/123 (8%)

Query: 24  DDDLSLFRELRKRQNDHVPSFLINGAASEEYECDTNIGGSVGKFSLYRI-HSGKKEHGEF 82
           DDDL+LF E++ ++ D   SFL+  +   E    T +      FS + I   G+      
Sbjct: 36  DDDLALFSEMQDKERD---SFLLQSSDDLEDVFSTKLK----HFSEFTIPVQGESSRLLT 88

Query: 83  METNKNDYDWLKTPPATPLFPSLEMEPNAHLVTQKEIPISQPISRLAKSDMEVLKPKSSD 142
            E +KNDYDWL TPP TPLFPSL+ +P A  V ++  P SQ IS    S ME  + +SS 
Sbjct: 89  AEGDKNDYDWLLTPPDTPLFPSLDDQPPAASVVRRGRPQSQ-ISLSRSSTMEKSR-RSSK 146

Query: 143 GRA 145
           G A
Sbjct: 147 GSA 149


>AT2G40070.2 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 17 plant
           structures; EXPRESSED DURING: 7 growth stages; BEST
           Arabidopsis thaliana protein match is: proline-rich
           family protein (TAIR:AT3G09000.1); Has 108635 Blast hits
           to 60786 proteins in 2176 species: Archae - 287;
           Bacteria - 15142; Metazoa - 39415; Fungi - 26849; Plants
           - 4416; Viruses - 2864; Other Eukaryotes - 19662
           (source: NCBI BLink). | chr2:16728378-16731040 REVERSE
           LENGTH=567
          Length = 567

 Score = 62.4 bits (150), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 63/112 (56%), Gaps = 14/112 (12%)

Query: 33  LRKRQNDHVPSFLINGAASEEYECDTNIGGSVGKFSLYRIHSG-----KKEHGEFM--ET 85
           +R+R+ +   + L+N    +E+E  T +G   G   ++ I SG     K    +F+  E 
Sbjct: 1   MRRREKEQD-NLLLNNNP-DEFE--TPLGSKHGTSPVFNISSGAPPSRKAAPDDFLNSEG 56

Query: 86  NKNDYDWLKTPPATPLFPSLEMEPNAHLVTQKEIPISQP---ISRLAKSDME 134
           +KNDY+WL TPP TPLFPSLEME +  +++Q     S+P    SRLA S  E
Sbjct: 57  DKNDYEWLLTPPGTPLFPSLEMESHRTMMSQTGDSKSRPATLTSRLANSSTE 108


>AT3G09000.1 | Symbols:  | proline-rich family protein |
           chr3:2746014-2748326 FORWARD LENGTH=541
          Length = 541

 Score = 56.6 bits (135), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 65/131 (49%), Gaps = 21/131 (16%)

Query: 20  HDRSDDDLSLFRELRKRQNDHVPSFLINGA-----------ASEEYECDTNIGGSVGKFS 68
           HDR D++LSLF E+R+R+ +H    L+ G+           A+       +   S  ++ 
Sbjct: 4   HDR-DEELSLFLEMRRREKEHRADSLLTGSDNVSINATLTAAAAAALSGVSETASSQRYP 62

Query: 69  LYRIHSGKKEHGEFMETNKNDYDWLKTPPATPLFPSLEMEPNAHLVTQKEIPISQPI--- 125
           L R  +   E+  + E  K+DYDWL TPP TP F   E E +  ++ Q + P S+P    
Sbjct: 63  LRRTAA---ENFLYSENEKSDYDWLLTPPGTPQF---EKESHRSVMNQHDAPNSRPTVLK 116

Query: 126 SRLAKSDMEVL 136
           SRL     +++
Sbjct: 117 SRLGNCREDIV 127