Miyakogusa Predicted Gene

Lj3g3v0839390.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v0839390.1 Non Chatacterized Hit- tr|B9T0Y0|B9T0Y0_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,65.62,4e-18,seg,NULL,CUFF.41576.1
         (359 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G08490.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...   194   9e-50
AT3G24600.1 | Symbols:  | Late embryogenesis abundant protein, g...    72   6e-13
AT5G42860.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    66   4e-11
AT1G45688.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    65   1e-10
AT4G35170.1 | Symbols:  | Late embryogenesis abundant (LEA) hydr...    54   2e-07
AT1G45688.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    53   4e-07
AT2G41990.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Late embry...    49   5e-06

>AT3G08490.1 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: Late embryogenesis abundant protein, group 2
           (TAIR:AT3G24600.1); Has 161 Blast hits to 158 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 161; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr3:2574105-2575125 REVERSE
           LENGTH=271
          Length = 271

 Score =  194 bits (492), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 98/206 (47%), Positives = 126/206 (61%), Gaps = 2/206 (0%)

Query: 155 KRYFSYSYADSCGWICLQMSWRLMVSFGIALLVFYIATKPPPPNFSLEISKIPEFKLGEG 214
           KR      ++S  WI LQ+ WR + S G+ALLVFYIAT+PP PN S  I +  +F L EG
Sbjct: 67  KRLVPLGTSNSSWWIVLQVGWRFLFSLGVALLVFYIATQPPHPNISFRIGRFNQFMLEEG 126

Query: 215 VDRTGVTTKILTCNCSMNLIIENKSRFFGLHIRPPLMDMKFSVLPFASSNGPKLYAES-G 273
           VD  GV+TK LT NCS  LII+NKS  FGLHI PP +   F  L FA + GPKLY  S  
Sbjct: 127 VDSHGVSTKFLTFNCSTKLIIDNKSNVFGLHIHPPSIKFFFGPLNFAKAQGPKLYGLSHE 186

Query: 274 LTIFTLQLGVKNKPMYGAGRSMQDMLDSGKGLPIAIRVILSSSFEVVPNLIKPRFHHRVE 333
            T F L +   N+ MYGAG  M DML S  GLP+ +R  + S + VV N+I P++HH+VE
Sbjct: 187 STTFQLYIATTNRAMYGAGTEMNDMLLSRAGLPLILRTSIISDYRVVWNIINPKYHHKVE 246

Query: 334 CIVVLKKDYDRKHRTQAFNSTCKVTS 359
           C+++L  D +R          C++ S
Sbjct: 247 CLLLL-ADKERHSHVTMIREKCRLVS 271


>AT3G24600.1 | Symbols:  | Late embryogenesis abundant protein,
           group 2 | chr3:8972195-8974867 REVERSE LENGTH=506
          Length = 506

 Score = 72.0 bits (175), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 73/148 (49%), Gaps = 2/148 (1%)

Query: 188 FYIATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKILTCNCSMNLIIENKSRFFGLHIR 247
            + A+ P  P  S++   I  F  GEG+DRTGV TKIL+ N S+ + I++ + +FG+H+ 
Sbjct: 335 LWGASHPFSPIVSVKSVDIHSFYYGEGIDRTGVATKILSFNSSVKVTIDSPAPYFGIHVS 394

Query: 248 PPLMDMKFSVLPFASSNGPKLYA-ESGLTIFTLQLGVKNKPMYGAGRSMQDMLDSGKGLP 306
                + FS L  A+      Y       I  ++L     P+YGAG  +      GK +P
Sbjct: 395 SSTFKLTFSALTLATGQLKSYYQPRKSKHISIVKLTGAEVPLYGAGPHLAASDKKGK-VP 453

Query: 307 IAIRVILSSSFEVVPNLIKPRFHHRVEC 334
           + +   + S   ++  L+K +  + V C
Sbjct: 454 VKLEFEIRSRGNLLGKLVKSKHENHVSC 481



 Score = 54.3 bits (129), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 68/141 (48%), Gaps = 10/141 (7%)

Query: 176 RLMVSFGIALLVFYI-------ATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKILTCN 228
           RL++     L +F++       A++  PP   ++   +  F  GEG D TGV TKI+   
Sbjct: 111 RLILGVVATLSIFFLLCSVLFGASQSSPPIVYIKGVNVRSFYYGEGSDNTGVPTKIMNVK 170

Query: 229 CSMNLIIENKSRFFGLHIRPPLMDMKFSVLPFASSNGPKLYAESGLTIFTLQLGV--KNK 286
           CS+ +   N S  FG+H+    + + +S     ++   K Y +   +  T ++ +     
Sbjct: 171 CSVVITTHNPSTLFGIHVSSTAVSLIYSRQFTLANARLKSYHQPKQSNHTSRINLIGSKV 230

Query: 287 PMYGAGRSMQDMLDSGKGLPI 307
           P+YGAG  +    +SG G+P+
Sbjct: 231 PLYGAGAELVASDNSG-GVPV 250


>AT5G42860.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G45688.1); Has 1807 Blast
           hits to 1807 proteins in 277 species: Archae - 0;
           Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
           Viruses - 0; Other Eukaryotes - 339 (source: NCBI
           BLink). | chr5:17183339-17184857 REVERSE LENGTH=320
          Length = 320

 Score = 65.9 bits (159), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/217 (23%), Positives = 94/217 (43%), Gaps = 28/217 (12%)

Query: 166 CGWICLQMSWRLMVSFGIALLVFYIATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKIL 225
           C  +   + + L+  F    L+ Y A KP  P  S++     + K+  G D  G+ T ++
Sbjct: 108 CYVLAFIVGFSLL--FAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGIGTDMI 165

Query: 226 TCNCSMNLIIENKSRFFGLHIRPPLMDMKFSVLPFASSNGPKLYA--ESGLTIFTLQLGV 283
           T N ++ ++  N   FFG+H+    +D+ FS +   S +  K Y   +S  T+    LG 
Sbjct: 166 TMNATLRMLYRNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVVNVLGD 225

Query: 284 KNKPMYGAGRSM---------------------QDMLDSGKGLPIAIRVILSSSFEVVPN 322
           K  P+YG+G ++                      +       +P+ +   + S   V+  
Sbjct: 226 K-IPLYGSGSTLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRSRAYVLGK 284

Query: 323 LIKPRFHHRVECIVVLKKDYDRKHRTQAFNSTCKVTS 359
           L++P+F+ R+ C++  +     KH      + C VTS
Sbjct: 285 LVQPKFYKRIVCLINFEHKKLSKH--IPITNNCTVTS 319


>AT1G45688.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G42860.1); Has 258 Blast
           hits to 242 proteins in 39 species: Archae - 0; Bacteria
           - 11; Metazoa - 10; Fungi - 14; Plants - 198; Viruses -
           17; Other Eukaryotes - 8 (source: NCBI BLink). |
           chr1:17191502-17192870 FORWARD LENGTH=342
          Length = 342

 Score = 64.7 bits (156), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/207 (24%), Positives = 90/207 (43%), Gaps = 27/207 (13%)

Query: 177 LMVSFGIALLVFYIATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKILTCNCSMNLIIE 236
             + FG   L+ Y A KP  P  +++       K+  G D  GV T ++T N ++ ++  
Sbjct: 138 FFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDAGGVGTDMITMNATLRMLYR 197

Query: 237 NKSRFFGLHIRPPLMDMKFSVLPFASSNGPKLYA--ESGLTIFTLQLGVKNKPMYGAGRS 294
           N   FFG+H+    +D+ FS +   S +  K Y   +S  T+    +G K  P+YG+G +
Sbjct: 198 NTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQGRKSERTVLVHVIGEKI-PLYGSGST 256

Query: 295 ----------------------MQDMLDSGKGLPIAIRVILSSSFEVVPNLIKPRFHHRV 332
                                 + D       +P+ +  ++ S   V+  L++P+F+ ++
Sbjct: 257 LLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLSFVVRSRAYVLGKLVQPKFYKKI 316

Query: 333 ECIVVLKKDYDRKHRTQAFNSTCKVTS 359
           EC +  +     KH     N  C VT+
Sbjct: 317 ECDINFEHKNLNKHIVITKN--CTVTT 341


>AT4G35170.1 | Symbols:  | Late embryogenesis abundant (LEA)
           hydroxyproline-rich glycoprotein family |
           chr4:16736839-16738186 FORWARD LENGTH=299
          Length = 299

 Score = 53.5 bits (127), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/201 (22%), Positives = 87/201 (43%), Gaps = 11/201 (5%)

Query: 144 DYYYGNGKGGWKRYFSYSYADSCGWICLQMSWRLMVSFGIALLVFYIATKPPPPNFSLEI 203
           DY   +G    +R  +  Y+      CL  +  L+++F +  L+ +  +K   P  +L+ 
Sbjct: 92  DYDEMDGPDEKRRRITRFYS------CLLFT--LVLAFTLFCLILWGVSKSFAPIATLKE 143

Query: 204 SKIPEFKLGEGVDRTGVTTKILTCNCSMNLIIENKSRFFGLHIRPPLMDMKFSVLPFASS 263
             +    +  G D++GV T +LT N ++ ++  N + FF +H+    + + +S L  AS 
Sbjct: 144 MVLENLNVQSGNDQSGVLTDMLTLNSTVRILYRNPATFFTVHVTSAPLQLSYSQLILASG 203

Query: 264 N-GPKLYAESGLTIFTLQLGVKNKPMYGAGRSM--QDMLDSGKGLPIAIRVILSSSFEVV 320
             G          I   ++     P+YG   ++  Q        LP+ +   L +   V+
Sbjct: 204 QMGEFSQRRKSERIIETKVFGDQIPLYGGVPALFGQRAEPDQVVLPLNLTFTLRARAYVL 263

Query: 321 PNLIKPRFHHRVECIVVLKKD 341
             L+K  FH  ++C +    D
Sbjct: 264 GRLVKTTFHSNIKCSITFYGD 284


>AT1G45688.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G42860.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr1:17191502-17192464 FORWARD LENGTH=248
          Length = 248

 Score = 52.8 bits (125), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 43/88 (48%)

Query: 177 LMVSFGIALLVFYIATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKILTCNCSMNLIIE 236
             + FG   L+ Y A KP  P  +++       K+  G D  GV T ++T N ++ ++  
Sbjct: 138 FFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDAGGVGTDMITMNATLRMLYR 197

Query: 237 NKSRFFGLHIRPPLMDMKFSVLPFASSN 264
           N   FFG+H+    +D+ FS +   S +
Sbjct: 198 NTGTFFGVHVTSTPIDLSFSQIKIGSGS 225


>AT2G41990.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Late
           embryogenesis abundant protein, group 2
           (InterPro:IPR004864); BEST Arabidopsis thaliana protein
           match is: Late embryogenesis abundant (LEA)
           hydroxyproline-rich glycoprotein family
           (TAIR:AT4G35170.1); Has 172 Blast hits to 168 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr2:17527396-17528527 FORWARD
           LENGTH=297
          Length = 297

 Score = 49.3 bits (116), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 41/177 (23%), Positives = 79/177 (44%), Gaps = 11/177 (6%)

Query: 168 WICLQMSWRLMVSFGIALLVFYIATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKILTC 227
           W+ L +    +  F +  L+ + A+K  PP  +++   + +  L  G D +GV T +L+ 
Sbjct: 117 WLLLSV----IFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPTDMLSL 172

Query: 228 NCSMNLIIENKSRFFGLHI--RPPLMDMKFSVLPFASSNGPKLYAESGLTIFTLQLGVKN 285
           N ++ +   N S FF +H+   P L+     +L     N   +       + T+  G   
Sbjct: 173 NSTVRIYYRNPSTFFAVHVTASPLLLHYSNLLLSSGEMNKFTVGRNGETNVVTVVQG-HQ 231

Query: 286 KPMYGAGRSMQDMLDSGKGLPIAIRVILSSSFEVVPNLIKPRFHHRVECIVVLKKDY 342
            P+YG      D L     LP+ + ++L S   ++  L+  +F+ R+ C   L  ++
Sbjct: 232 IPLYGGVSFHLDTLS----LPLNLTIVLHSKAYILGRLVTSKFYTRIICSFTLDANH 284