Miyakogusa Predicted Gene

Lj6g3v0519480.2
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v0519480.2 Non Chatacterized Hit- tr|B9RMM4|B9RMM4_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,46.36,3e-17,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.57971.2
         (362 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G40070.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...   160   1e-39
AT2G40070.2 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   114   1e-25
AT3G09000.1 | Symbols:  | proline-rich family protein | chr3:274...    94   2e-19
AT1G27850.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    67   3e-11
AT2G38160.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    62   7e-10
AT2G38160.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    62   7e-10
AT3G08670.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    49   8e-06

>AT2G40070.1 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: proline-rich family protein (TAIR:AT3G09000.1); Has
           35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr2:16728378-16731160 REVERSE
           LENGTH=607
          Length = 607

 Score =  160 bits (405), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 91/163 (55%), Positives = 109/163 (66%), Gaps = 9/163 (5%)

Query: 1   MNRSFRAQESQM-QAALKQREQQFGGLRGSVMKEKDEELALFLEMKKREKERXXXXXXXX 59
           MNRSFRA+ES +  +A +QR+Q    LR S+M EKDEEL+LFLEM++REKE+        
Sbjct: 1   MNRSFRAKESLLLDSAERQRQQ----LRASMMAEKDEELSLFLEMRRREKEQDNLLLNNN 56

Query: 60  XXX-XXXXGSNPSTSSIFNIPASAPIPVRKNGTDDFLNSENDKNDYDWLLTPPGTPLFPS 118
                   GS   TS +FNI + AP P RK   DDFLNSE DKNDY+WLLTPPGTPLFPS
Sbjct: 57  PDEFETPLGSKHGTSPVFNISSGAP-PSRKAAPDDFLNSEGDKNDYEWLLTPPGTPLFPS 115

Query: 119 LEMESRKTVMSQLGSGNPASRPVALKSRLGNHQSEPAGRTSLV 161
           LEMES +T+MSQ  +G+  SRP  L SRL N  +E A R  L 
Sbjct: 116 LEMESHRTMMSQ--TGDSKSRPATLTSRLANSSTESAARNHLT 156


>AT2G40070.2 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 17 plant
           structures; EXPRESSED DURING: 7 growth stages; BEST
           Arabidopsis thaliana protein match is: proline-rich
           family protein (TAIR:AT3G09000.1); Has 108635 Blast hits
           to 60786 proteins in 2176 species: Archae - 287;
           Bacteria - 15142; Metazoa - 39415; Fungi - 26849; Plants
           - 4416; Viruses - 2864; Other Eukaryotes - 19662
           (source: NCBI BLink). | chr2:16728378-16731040 REVERSE
           LENGTH=567
          Length = 567

 Score =  114 bits (284), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 59/95 (62%), Positives = 67/95 (70%), Gaps = 3/95 (3%)

Query: 67  GSNPSTSSIFNIPASAPIPVRKNGTDDFLNSENDKNDYDWLLTPPGTPLFPSLEMESRKT 126
           GS   TS +FNI + AP P RK   DDFLNSE DKNDY+WLLTPPGTPLFPSLEMES +T
Sbjct: 25  GSKHGTSPVFNISSGAP-PSRKAAPDDFLNSEGDKNDYEWLLTPPGTPLFPSLEMESHRT 83

Query: 127 VMSQLGSGNPASRPVALKSRLGNHQSEPAGRTSLV 161
           +MSQ  +G+  SRP  L SRL N  +E A R  L 
Sbjct: 84  MMSQ--TGDSKSRPATLTSRLANSSTESAARNHLT 116


>AT3G09000.1 | Symbols:  | proline-rich family protein |
           chr3:2746014-2748326 FORWARD LENGTH=541
          Length = 541

 Score = 94.0 bits (232), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 53/129 (41%), Positives = 75/129 (58%), Gaps = 10/129 (7%)

Query: 30  VMKEKDEELALFLEMKKREKERXXXX-----XXXXXXXXXXXGSNPSTSSIFNIPASAPI 84
           +  ++DEEL+LFLEM++REKE                      +  + S +    +S   
Sbjct: 2   LTHDRDEELSLFLEMRRREKEHRADSLLTGSDNVSINATLTAAAAAALSGVSETASSQRY 61

Query: 85  PVRKNGTDDFLNSENDKNDYDWLLTPPGTPLFPSLEMESRKTVMSQLGSGNPASRPVALK 144
           P+R+   ++FL SEN+K+DYDWLLTPPGTP F   E ES ++VM+Q  +  P SRP  LK
Sbjct: 62  PLRRTAAENFLYSENEKSDYDWLLTPPGTPQF---EKESHRSVMNQHDA--PNSRPTVLK 116

Query: 145 SRLGNHQSE 153
           SRLGN + +
Sbjct: 117 SRLGNCRED 125


>AT1G27850.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G40070.1); Has 9215 Blast
           hits to 5316 proteins in 473 species: Archae - 6;
           Bacteria - 773; Metazoa - 3392; Fungi - 1710; Plants -
           539; Viruses - 143; Other Eukaryotes - 2652 (source:
           NCBI BLink). | chr1:9699265-9703701 FORWARD LENGTH=1148
          Length = 1148

 Score = 66.6 bits (161), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 37/93 (39%), Positives = 49/93 (52%), Gaps = 9/93 (9%)

Query: 28  GSVMKEKDEELALFLEMKKREKERXXXXXXXXXXXXXXXGSNPSTSSIFNIPASAPIPVR 87
           G + ++KD++LALF EM+  +KER                +     S F IP        
Sbjct: 29  GILFRDKDDDLALFSEMQ--DKERDSFLLQSSDDLEDVFSTKLKHFSEFTIPV------- 79

Query: 88  KNGTDDFLNSENDKNDYDWLLTPPGTPLFPSLE 120
           +  +   L +E DKNDYDWLLTPP TPLFPSL+
Sbjct: 80  QGESSRLLTAEGDKNDYDWLLTPPDTPLFPSLD 112


>AT2G38160.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 9 plant
           structures; EXPRESSED DURING: 4 anthesis, F mature
           embryo stage, petal differentiation and expansion stage,
           E expanded cotyledon stage, D bilateral stage; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT2G40070.2). | chr2:15986643-15988464 REVERSE
           LENGTH=314
          Length = 314

 Score = 61.6 bits (148), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 4/87 (4%)

Query: 30  VMKEKDEELALFLEMKKREKERXXXXXXXXXXXXXXXGSNPSTSSIFNIPASAPIPVRKN 89
           V+++ DEEL+LFLEM++REK +                S  S      +  ++ + +R++
Sbjct: 6   VVRDGDEELSLFLEMRRREKLQGVSSLSQPGANSVEKTSTKS----LELLKTSCVKLRRS 61

Query: 90  GTDDFLNSENDKNDYDWLLTPPGTPLF 116
             + FL+SENDK+DY+WLL  P   L 
Sbjct: 62  SVEKFLDSENDKSDYEWLLAAPEITLL 88


>AT2G38160.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G40070.2); Has 972 Blast hits to 731 proteins
           in 211 species: Archae - 0; Bacteria - 236; Metazoa -
           194; Fungi - 201; Plants - 218; Viruses - 32; Other
           Eukaryotes - 91 (source: NCBI BLink). |
           chr2:15986643-15988464 REVERSE LENGTH=314
          Length = 314

 Score = 61.6 bits (148), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 4/87 (4%)

Query: 30  VMKEKDEELALFLEMKKREKERXXXXXXXXXXXXXXXGSNPSTSSIFNIPASAPIPVRKN 89
           V+++ DEEL+LFLEM++REK +                S  S      +  ++ + +R++
Sbjct: 6   VVRDGDEELSLFLEMRRREKLQGVSSLSQPGANSVEKTSTKS----LELLKTSCVKLRRS 61

Query: 90  GTDDFLNSENDKNDYDWLLTPPGTPLF 116
             + FL+SENDK+DY+WLL  P   L 
Sbjct: 62  SVEKFLDSENDKSDYEWLLAAPEITLL 88


>AT3G08670.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G51540.1); Has 48380 Blast hits to 29827
           proteins in 1356 species: Archae - 46; Bacteria - 5589;
           Metazoa - 17361; Fungi - 13192; Plants - 2237; Viruses -
           905; Other Eukaryotes - 9050 (source: NCBI BLink). |
           chr3:2633946-2636536 FORWARD LENGTH=567
          Length = 567

 Score = 48.5 bits (114), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 21/27 (77%), Positives = 22/27 (81%), Gaps = 1/27 (3%)

Query: 90  GTDDFLNS-ENDKNDYDWLLTPPGTPL 115
           G DD L+S E  KNDYDWLLTPPGTPL
Sbjct: 83  GKDDLLSSAEGGKNDYDWLLTPPGTPL 109