Miyakogusa Predicted Gene

Lj5g3v1697640.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v1697640.1 tr|C1EEU2|C1EEU2_MICSR Predicted protein
OS=Micromonas sp. (strain RCC299 / NOUM17)
GN=MICPUN_106293,24.46,1e-17,FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.55759.1
         (533 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G11760.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   405   e-113
AT5G04860.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   367   e-101
AT2G10560.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   194   1e-49
AT2G25460.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: C2 calcium...    94   3e-19
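
The hit table above prints some E-values in the legacy shorthand `e-113`, with no leading mantissa, which Python's `float()` rejects. The following is a minimal sketch of reading the table into (identifier, bit score, E-value) tuples. It assumes the shorthand means `1e-113`; the names `parse_evalue` and `parse_summary` are hypothetical helpers, not part of BLAST or any library.

```python
import re

# Summary lines copied verbatim from the hit table above.
SUMMARY = """\
AT3G11760.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   405   e-113
AT5G04860.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   367   e-101
AT2G10560.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   194   1e-49
AT2G25460.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: C2 calcium...    94   3e-19
"""

# Identifier, free-text description, then two trailing columns: score and E-value.
LINE = re.compile(r"^(\S+)\s.*?\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")


def parse_evalue(text):
    """Convert an E-value string to float, assuming 'e-113' means '1e-113'."""
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_summary(block):
    """Return a list of (identifier, bit_score, e_value) tuples."""
    hits = []
    for line in block.splitlines():
        match = LINE.match(line)
        if match:
            ident, score, evalue = match.groups()
            hits.append((ident, float(score), parse_evalue(evalue)))
    return hits


hits = parse_summary(SUMMARY)
for ident, score, evalue in hits:
    print(ident, score, evalue)
```

The regular expression relies on the score and E-value being the last two whitespace-separated columns, so it would need adjusting for reports with a different column layout.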

>AT3G11760.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 14 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G04860.1); Has 84 Blast hits to 73 proteins in
           13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 84; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr3:3718529-3721123 FORWARD
           LENGTH=702
          Length = 702

 Score =  405 bits (1042), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 231/507 (45%), Positives = 304/507 (59%), Gaps = 31/507 (6%)

Query: 34  EFSTLKAGLRRVKTFTDYVSTRRAKKASYKDEGSDGRSSTRSEDSEYKYTSDLDSQDNDD 93
           + S +KAGLR+VK FT++VSTR+AKKA  ++EG      +     +++  +D D      
Sbjct: 211 DVSAIKAGLRKVKIFTEFVSTRKAKKACREEEGRFSSFESSESLDDFE--TDFD------ 262

Query: 94  VNKSEERDEDSCVRHSMSYETLTCGNNVGESPYTSSTFNGKNEFKLNSGSQNSYFGDALV 153
               E ++E   +R S SY  L+  N VG S    +  + ++E  +    + S  G    
Sbjct: 263 ----EGKEELMSMRKSFSYGPLSYANGVGTSLNCGAKVSDEDEDWVYYSHRKSDVGAGCS 318

Query: 154 ENYNTCDQVEYHIS---KYRILSWRKRKLQFRSRSFKVKGELLLKKHXXXXXXXXXXXXR 210
           +  ++   + Y  S   +  IL WRKRKL FRS   K KGE LLKK             R
Sbjct: 319 DAEDSAAGLVYEASLLPRRSILPWRKRKLSFRSP--KSKGEPLLKKDNGEEGGDDIDFDR 376

Query: 211 RVLSSSDEYTCQKWHKTQENITTTQSSISGFGENNFTVGTWEHKEVISQDGLMKLHTEIF 270
           R LSS + +        +++    ++S S FGE++F +G+WE KEVIS+DG MKL T +F
Sbjct: 377 RQLSSDEAHPPFGSKIDEDSSANPRTSFSEFGEDSFAIGSWEEKEVISRDGHMKLQTSVF 436

Query: 271 FASIDQRSECAAGESACAVLVVLIVDWLKLNQAEIPIKCEFDSLIKDGSSEWRIICENKD 330
            ASIDQRSE AAGESAC  LV +I DW + N   +PIK +FDSLI++GS EWR +CEN+ 
Sbjct: 437 LASIDQRSERAAGESACTALVAVIADWFQKNGNLMPIKSQFDSLIREGSLEWRNLCENET 496

Query: 331 YMKNFPDKHFDLDTVFRAKTRRVSVVPEKSYVXXXXXXXXXXXXXXXXXXXAMSFDSIWE 390
           YM+ FPDKHFDLDTV +AK R ++V+P KS+V                   AMSFDSIW 
Sbjct: 497 YMQKFPDKHFDLDTVLQAKIRPLTVIPGKSFVGFFHPDGMINEGRFEFLQGAMSFDSIWA 556

Query: 391 EI------SHCASELHLFSEAIVYIVSWNDHFFVLKVQHDANYIIDTLGERLHEGCNQAY 444
           EI      S         S   VYIVSWNDHFFVLKV+ +A YIIDTLGERL+EGC+QAY
Sbjct: 557 EIISLEESSANGDSYDDDSPPHVYIVSWNDHFFVLKVEKEAYYIIDTLGERLYEGCDQAY 616

Query: 445 ILKFDANTRIEKLCNENQVLDAKPPSDEVNDVRKKEIICRGKESCKEYIKRFLASIPVRE 504
           +LKFD  T I K+ +  +      P        + EI+ RGKESCKEYIK FLA+IP+RE
Sbjct: 617 VLKFDHKTVIHKILHTEEAGSESEP--------ESEILSRGKESCKEYIKNFLAAIPIRE 668

Query: 505 LQDDVKRGLKASMPLHRRLQIEFHYTT 531
           LQ+D+K+GL ++ P+H RLQIEFHYTT
Sbjct: 669 LQEDIKKGLASTAPVHHRLQIEFHYTT 695


>AT5G04860.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11760.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:1411760-1414459
           REVERSE LENGTH=782
          Length = 782

 Score =  367 bits (943), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 232/550 (42%), Positives = 324/550 (58%), Gaps = 52/550 (9%)

Query: 11  WPPLSSKKFEASLVKPEEVRSLVEFSTLKAGLRRVKTFTDYVSTRRAKKASYKDEGSDGR 70
           W PLS++  +A              S +K GLR++KTF + +S+ +A +   + +GS G 
Sbjct: 200 WSPLSAEAEKAE-------------SVVKVGLRKMKTFNNCMSSTQASEKESEKDGSSGS 246

Query: 71  SST-----RSEDSEYKYTSDLDSQDNDDV--NKSEERDEDSCVRHSMSYETLTCGNNVGE 123
            S      R+ DS+  Y  D DS D  D      E ++ +S +   ++Y+TL   N    
Sbjct: 247 GSDGKSPERNLDSDSSYPFDTDSLDEGDAADESEENKENESSLADPVNYKTLRSANWARG 306

Query: 124 SPYTSSTFNGKNEFKLNSGS---QNSYFGDALVENYNTCDQVEYHISKYRILSWRKRKLQ 180
           S +T +    ++    +  S   +  +  D +  +  + +Q +  +SK R+LSW+KRKL 
Sbjct: 307 SFHTVTNPEDEDLIYYSHRSPLAETGHCSDEVSNDVVSLEQAKGQMSKKRMLSWKKRKLS 366

Query: 181 FRSRSFKVKGELLLKKHXXXXXXXXXXXXRRVLSSSDEYTCQKWHKTQENITTTQSSISG 240
           FRS   K KGE LLKK             RR LSSSDE +   W+++ + I      +S 
Sbjct: 367 FRSP--KQKGEPLLKKDCLEEGGDDIDFDRRQLSSSDE-SNSDWYRSDDAIMKP---LSQ 420

Query: 241 FGENNFTVGTWEHKEVISQDGLMKLHTEIFFASIDQRSECAAGESACAVLVVLIVDWLKL 300
           FG+++F VG+WE KE+IS+DGLMKL   +F ASIDQRSE AAGESAC  LV ++  WL  
Sbjct: 421 FGDDDFVVGSWETKEIISRDGLMKLTARVFLASIDQRSERAAGESACTALVAVMAHWLGS 480

Query: 301 NQAEIPIKCEFDSLIKDGSSEWRIICENKDYMKNFPDKHFDLDTVFRAKTRRVSVVPEKS 360
           N+  IP + EFDSLI++GSSEWR +CEN++Y + FPDKHFDL+TV +AK R + VVPE+S
Sbjct: 481 NRDIIPTRSEFDSLIREGSSEWRNMCENEEYRERFPDKHFDLETVLQAKVRPICVVPERS 540

Query: 361 YV-----XXXXXXXXXXXXXXXXXXXAMSFDSIWEEISHCASELHLFSEAIVYIVSWNDH 415
           ++                         MSFDSIWEE+     E    SE ++YIVSWNDH
Sbjct: 541 FIGFFHPEKSEEEEGKEDASLDFLKGVMSFDSIWEELMKQEPE-ESASEPVIYIVSWNDH 599

Query: 416 FFVLKVQHDANYIIDTLGERLHEGCNQAYILKFDANTRIEKLCN---------ENQVLDA 466
           FFVL V HDA YIIDTLGERL+EGCNQAY+LKFD +  I++L +          NQ    
Sbjct: 600 FFVLLVNHDAYYIIDTLGERLYEGCNQAYVLKFDKDAEIKRLPSVIKDNKADMGNQKQGG 659

Query: 467 KPPSDE------VNDVRKKEIICRGKESCKEYIKRFLASIPVRELQDDVKRGLKASMPLH 520
           K  S++        +  ++E++CRGKESC+EYIK FLA+IP+++++ D+K+GL +S  LH
Sbjct: 660 KNKSEQPERSKESEEQEEEEVVCRGKESCREYIKSFLAAIPIQQVKADMKKGLVSS--LH 717

Query: 521 RRLQIEFHYT 530
            RLQIE HYT
Sbjct: 718 HRLQIELHYT 727


>AT2G10560.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G04860.1); Has 70 Blast hits to 70 proteins in
           12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr2:4109862-4110698 REVERSE
           LENGTH=278
          Length = 278

 Score =  194 bits (493), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 103/226 (45%), Positives = 141/226 (62%), Gaps = 23/226 (10%)

Query: 325 ICENKDYMKNFPDKHFDLDTVFRAKTRRVSVVPEKSYV-----XXXXXXXXXXXXXXXXX 379
           +CEN++Y + FPDKHFDL+TV +AK R + VVPE++++                      
Sbjct: 1   MCENEEYRERFPDKHFDLETVLQAKVRPICVVPERTFIGFFHREKSKEEEEKEDVSLDFL 60

Query: 380 XXAMSFDSIWEEISHCASELHLFSEAIVYIVSWNDHFFVLKVQHDANYIIDTLGERLHEG 439
              MSFDSIWEEI     E    SE ++YIVSWNDH+FVL V HDA YIIDTLGER++EG
Sbjct: 61  KGVMSFDSIWEEIMKQEPE-ESASEHVIYIVSWNDHYFVLLVNHDAYYIIDTLGERVYEG 119

Query: 440 CNQAYILKFDANTRIEKL---CNENQV------------LDAKPPSDEVNDVRKKEIICR 484
           CNQAY+LKFD +  I++L     +N+              +    S E  +  ++ ++CR
Sbjct: 120 CNQAYVLKFDQDAEIKRLPSVIKDNKADMGSQKQGGKNKYEQPERSKESEEQGEEVVVCR 179

Query: 485 GKESCKEYIKRFLASIPVRELQDDVKRGLKASMPLHRRLQIEFHYT 530
           GKESC+EYIK FLA+IP+++++ D+K GL +S   H RLQIE +YT
Sbjct: 180 GKESCREYIKSFLAAIPIQQVKADMKEGLVSS--FHHRLQIELYYT 223


>AT2G25460.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: C2
           calcium-dependent membrane targeting
           (InterPro:IPR000008); BEST Arabidopsis thaliana protein
           match is: unknown protein (TAIR:AT5G04860.1); Has 108
           Blast hits to 69 proteins in 11 species: Archae - 0;
           Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 108;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr2:10833175-10835374 REVERSE LENGTH=423
          Length = 423

 Score = 93.6 bits (231), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 60/144 (41%), Positives = 85/144 (59%), Gaps = 7/144 (4%)

Query: 251 WEHKEVISQDGLMKLHTEIFFASIDQRSECAAGESACAVLVVLIVDWLKLNQAEI-PIKC 309
           W  K+++S+DG  KL +E++ ASIDQRSE AAGE+ACA + V++  W   N   I P   
Sbjct: 282 WVMKDLVSRDGKSKLKSEVYLASIDQRSEQAAGEAACAAVAVVVAHWFHANPKLINPSGT 341

Query: 310 EFDSLIKDGSSEWRIICENKDYMKNFPDKHFDLDTVFRAKTRRVSVVPEKSYVXXXXXXX 369
            FDSLI  GSS W+ +C+ + Y++ FP++HFDL+T+  A  R V V  +KS+        
Sbjct: 342 AFDSLITQGSSLWQSLCDKESYLRLFPNRHFDLETIVSANLRPVRVCTDKSFTGLFSPER 401

Query: 370 XXXXXXXXXXXXAMSFDSIWEEIS 393
                        MSFD IW+E+S
Sbjct: 402 FASLDGL------MSFDQIWDELS 419
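
The percentages in the four "Identities / Positives / Gaps" lines of this report can be recomputed from the printed fractions. The sketch below does this with integer division, on the assumption (drawn only from the numbers in this report, for example 304/507 shown as 59%) that the values are truncated rather than rounded. `check_stats` is a hypothetical helper name.

```python
import re

# Statistics lines copied verbatim from the four alignments in this report.
STATS = """\
 Identities = 231/507 (45%), Positives = 304/507 (59%), Gaps = 31/507 (6%)
 Identities = 232/550 (42%), Positives = 324/550 (58%), Gaps = 52/550 (9%)
 Identities = 103/226 (45%), Positives = 141/226 (62%), Gaps = 23/226 (10%)
 Identities = 60/144 (41%), Positives = 85/144 (59%), Gaps = 7/144 (4%)
"""

FIELD = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def check_stats(block):
    """Return (name, count, length, reported_pct, truncated_pct) per field."""
    rows = []
    for name, num, den, pct in FIELD.findall(block):
        num, den, pct = int(num), int(den), int(pct)
        # Integer division truncates toward zero for these positive values.
        rows.append((name, num, den, pct, (100 * num) // den))
    return rows


rows = check_stats(STATS)
for name, num, den, reported, truncated in rows:
    flag = "ok" if reported == truncated else "MISMATCH"
    print(f"{name:10s} {num:>3d}/{den:<3d} reported={reported:>2d}% "
          f"truncated={truncated:>2d}% {flag}")
```

Under that truncation assumption, all twelve reported percentages match the recomputed values, so the fractions rather than the percentages are the better source for any downstream filtering.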