Miyakogusa Predicted Gene

Lj1g3v3531580.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v3531580.1 CUFF.30826.1
         (559 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G71110.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   551   e-157
AT2G12400.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   347   2e-95
AT2G25270.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...   313   2e-85
AT1G80540.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   249   4e-66
AT5G67550.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    65   9e-11

>AT1G71110.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT2G12400.1); Has 173 Blast hits
           to 169 proteins in 21 species: Archae - 0; Bacteria - 0;
           Metazoa - 3; Fungi - 0; Plants - 165; Viruses - 0; Other
           Eukaryotes - 5 (source: NCBI BLink). |
           chr1:26818244-26820852 FORWARD LENGTH=557
          Length = 557

 Score =  551 bits (1420), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 272/504 (53%), Positives = 343/504 (68%), Gaps = 5/504 (0%)

Query: 29  SPSGSTRHAVNTILGEVNLDPWKTEVAQIALAPYPGVDSP-EGTLVLAANRTNRPDILQR 87
           S   +++  +  ILG  N   WK     I+LAP P  D      L+LAA+RT RPDIL+ 
Sbjct: 25  SSVSASQDPLRLILGSPNFGTWK---GGISLAPGPESDDVVSDYLLLAAHRTKRPDILRA 81

Query: 88  FRRYKGGWDIANRHYWASVGFTGAAGFILAVLWFISFGLALVIHSCCGWGINIKEEGSN- 146
           F+ Y GGW+I N HYWASVGFTGA GFILAV+W +SFG  LV++ C  W I  K +GS+ 
Sbjct: 82  FKPYHGGWNITNNHYWASVGFTGAPGFILAVIWLLSFGSLLVVYHCFKWRICDKAKGSSF 141

Query: 147 RLQRVCXXXXXXFTCTAVTGCVLLSFGQDKFHGEAIHTLHYVVNQSDYTVEILRNVTEYL 206
             +R+C      FTC A  GC+LLS GQDKFH EA+HTL YVVNQSDYTVEIL+NVT+YL
Sbjct: 142 DTRRICFILLIVFTCVAAVGCILLSVGQDKFHTEAMHTLKYVVNQSDYTVEILQNVTQYL 201

Query: 207 SLAKSITVAEMFLPSDIMNDIDNLNGDLKAAADTLYEKTHENSIKIRKVFDTVXXXXXXX 266
           SLAK+I V ++ +PSD+M +ID LN +L  AA TL E T +N+ KI++VF  V       
Sbjct: 202 SLAKTINVTQIVIPSDVMGEIDKLNVNLNTAAVTLGETTTDNAAKIKRVFYAVRSALITV 261

Query: 267 XXXXXXXXXXXXXXXXXXYQHAILIFVISGWLLVVTTFILCGVFMLLNNAISDTCLAMGE 326
                             +QH + IFV+SGW+LV  TF+LCGVF++LNNAISDTC+AM E
Sbjct: 262 ATVMLILSFVGLLLSVLRHQHVVHIFVVSGWILVAVTFVLCGVFLILNNAISDTCVAMKE 321

Query: 327 WEENPQAESTLRNILPCVDQGTTNRTLFQSKQVVTNIVSVVNRFIYSTADANPSQGSMNY 386
           W +NP AE+ L +ILPCVDQ TTN+TL QSK V+ +IV+VVN F+Y+ A+ NP+ G   Y
Sbjct: 322 WVDNPHAETALSSILPCVDQQTTNQTLSQSKVVINSIVTVVNTFVYAVANTNPAPGQDRY 381

Query: 387 YNQSGPAMPPLCYPFDSEFKERQCTTQEVSSFNASSVWKKYECEVSEYGICTSVGRVTPE 446
           YNQSGP MPPLC PFD+  ++RQC+  E+S  NASSVW+ Y+CEV+  GICT+VGRVTP+
Sbjct: 382 YNQSGPPMPPLCIPFDANMEDRQCSPWELSIENASSVWENYKCEVTPSGICTTVGRVTPD 441

Query: 447 IYLELVAAVNEIYALEHYTPLVLSLQNCNFVRDTFKEIISSYCPPLNHYINVINEXXXXX 506
            + +LVAAVNE YALEHYTP +LS ++CNFVR+TF  I S YCPPL   + ++N      
Sbjct: 442 TFGQLVAAVNESYALEHYTPPLLSFRDCNFVRETFMSITSDYCPPLVRNLRIVNAGLGLI 501

Query: 507 XXXXXXXXXXXXXYANRPQREEVF 530
                        YANRPQREEVF
Sbjct: 502 SVGVLLCLVLWIFYANRPQREEVF 525


>AT2G12400.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 25 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT2G25270.1);
           Has 177 Blast hits to 172 proteins in 23 species: Archae
           - 0; Bacteria - 2; Metazoa - 3; Fungi - 0; Plants - 164;
           Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).
           | chr2:5005144-5008140 REVERSE LENGTH=541
          Length = 541

 Score =  347 bits (889), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 179/448 (39%), Positives = 256/448 (57%), Gaps = 5/448 (1%)

Query: 50  WKTEVAQIALAPYPGVDSPEGTLVLAANRTNRPDILQRFRRYKGGWDIANRHYWASVGFT 109
           W+T V +  +A   G +S   +L+LAA RT R D    F+ Y GGW+I+N HY  SVG+T
Sbjct: 46  WRTSVIERVIAEESGENS---SLILAAKRTRRKDPADNFKLYTGGWNISNSHYLTSVGYT 102

Query: 110 GAAGFILAVLWFISFGLALVIHSCCGWGINIKEEGSNRLQ-RVCXXXXXXFTCTAVTGCV 168
            A   I+A++WF+ FGL+L +   C      +  G +R+   +       FT  A+ GCV
Sbjct: 103 AAPFIIIALVWFVFFGLSLSLICLCYCCCARQSYGYSRVAYALSLILLISFTIAAIIGCV 162

Query: 169 LLSFGQDKFHGEAIHTLHYVVNQSDYTVEILRNVTEYLSLAKSITVAEMFLPSDIMNDID 228
            L  GQ KFH     TL YVV+Q++ T E LRNV++YL+ AK + V    LP D+++ ID
Sbjct: 163 FLYTGQGKFHASTTDTLDYVVSQANLTSENLRNVSDYLNAAKKVDVQSSILPQDVLSSID 222

Query: 229 NLNGDLKAAADTLYEKTHENSIKIRKVFDTVXXXXXXXXXXXXXXXXXXXXXXXXXYQHA 288
           N+ G + ++A TL  KT EN  KI+ V D +                          Q  
Sbjct: 223 NIQGKINSSATTLSVKTMENQDKIQNVLDIMRLALVIIAAVMLFLAFIGFLLSIFGLQCL 282

Query: 289 ILIFVISGWLLVVTTFILCGVFMLLNNAISDTCLAMGEWEENPQAESTLRNILPCVDQGT 348
           +   VI GW+LV  TF+LCG F+LL+N + DTC+AM +W +NP A + L +ILPCVD  T
Sbjct: 283 VYTLVILGWILVTVTFVLCGGFLLLHNVVGDTCVAMDQWVQNPTAHTALDDILPCVDNAT 342

Query: 349 TNRTLFQSKQVVTNIVSVVNRFIYSTADAN-PSQGSMNYYNQSGPAMPPLCYPFDSEFKE 407
              TL ++K V   +V++++  I +  + N P Q    YYNQSGP MP LC PF+++  +
Sbjct: 343 ARETLTRTKLVTYQLVNLLDNAISNMTNRNFPPQFRPLYYNQSGPLMPLLCNPFNADLSD 402

Query: 408 RQCTTQEVSSFNASSVWKKYECEVSEYGICTSVGRVTPEIYLELVAAVNEIYALEHYTPL 467
           RQC   +V   NA+ VWK + C++   G C++ GR+TP++Y ++ AAVN  Y L  Y P 
Sbjct: 403 RQCQPGQVHLNNATEVWKNFTCQIVTPGTCSTPGRLTPKLYSQMAAAVNVSYGLYKYGPF 462

Query: 468 VLSLQNCNFVRDTFKEIISSYCPPLNHY 495
           +  LQ C+FVR TF +I   +CP L  Y
Sbjct: 463 LADLQGCDFVRSTFTDIERDHCPGLKRY 490


>AT2G25270.1 | Symbols:  | unknown protein; LOCATED IN: plasma
           membrane; EXPRESSED IN: 18 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT2G12400.1);
           Has 35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr2:10759779-10762358 FORWARD
           LENGTH=545
          Length = 545

 Score =  313 bits (803), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 159/451 (35%), Positives = 250/451 (55%), Gaps = 6/451 (1%)

Query: 51  KTEVAQIALAPYPGVDSPE----GTLVLAANRTNRPDILQRFRRYKGGWDIANRHYWASV 106
           K  VA+  L    G ++P+     ++ LAA RT R D L  F +Y GGW+I+N+HYWASV
Sbjct: 45  KFIVAEAPLLGPAGFNNPQVIEVASVALAAQRTYRKDPLNGFEKYTGGWNISNQHYWASV 104

Query: 107 GFTGAAGFILAVLWFISFGLALVIHSCCGWGINIKEEGSNRLQRVCXXX-XXXFTCTAVT 165
            +T    F+LA +WF+ FG+ L++   C         G +++  V        FT  A+ 
Sbjct: 105 SYTAVPLFVLAAVWFLGFGICLLVICMCHICHRTNSVGYSKVAYVVSLIFLLIFTVIAII 164

Query: 166 GCVLLSFGQDKFHGEAIHTLHYVVNQSDYTVEILRNVTEYLSLAKSITVAEMFLPSDIMN 225
           GCVLL  GQ +++     TL YV++Q+D T+  LR +++YL+ AK   V ++ LP+++  
Sbjct: 165 GCVLLYSGQIRYNKSTTETLEYVMSQADSTISQLRAISDYLASAKQAAVLQVLLPANVQT 224

Query: 226 DIDNLNGDLKAAADTLYEKTHENSIKIRKVFDTVXXXXXXXXXXXXXXXXXXXXXXXXXY 285
           +ID +   L ++  T+ EK+  +S  IR   D+V                          
Sbjct: 225 EIDQIGVKLDSSVATITEKSTNSSNHIRHFLDSVRVALIVVSIVMLVVTFLGLVSSIFGM 284

Query: 286 QHAILIFVISGWLLVVTTFILCGVFMLLNNAISDTCLAMGEWEENPQAESTLRNILPCVD 345
           Q  +   VI GW+LV  TFIL G F++L+NA +DTC+AM EW E P + + L  ILPC D
Sbjct: 285 QVIVYTLVILGWILVTGTFILSGTFLVLHNATADTCVAMSEWVERPSSNTALDEILPCTD 344

Query: 346 QGTTNRTLFQSKQVVTNIVSVVNRFIYSTADANPSQGSM-NYYNQSGPAMPPLCYPFDSE 404
             T   TL +S++V   +V ++N  I + ++ N S   +  YYNQSGP +P LC PF+ +
Sbjct: 345 NATAQETLMRSREVTGQLVELINTVITNVSNINFSPVFVPMYYNQSGPLLPLLCNPFNHD 404

Query: 405 FKERQCTTQEVSSFNASSVWKKYECEVSEYGICTSVGRVTPEIYLELVAAVNEIYALEHY 464
             +R C+  ++   NA+  W  + C+VS+ G CT+ GR+TP +Y ++ + VN    L   
Sbjct: 405 LTDRSCSPGDLDLNNATEAWTSFVCQVSQNGTCTTTGRLTPALYSQMASGVNISTGLIRD 464

Query: 465 TPLVLSLQNCNFVRDTFKEIISSYCPPLNHY 495
            P ++ LQ+C++ + TF++I + +CP L  Y
Sbjct: 465 APFLVQLQDCSYAKQTFRDITNDHCPGLQRY 495


>AT1G80540.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G12400.1); Has 175 Blast hits to 171 proteins
           in 20 species: Archae - 0; Bacteria - 0; Metazoa - 2;
           Fungi - 0; Plants - 171; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr1:30281638-30284258 REVERSE
           LENGTH=538
          Length = 538

 Score =  249 bits (636), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 143/430 (33%), Positives = 214/430 (49%), Gaps = 10/430 (2%)

Query: 72  LVLAANRTNRPDILQRFRRYKGGWDIANRHYWASVGFTGAAGFILAVLWFISFGLALVIH 131
           LVLAA RT RPD L  F  Y  GW++ N HY ASVGF+     ++A+ WF+  GL L+  
Sbjct: 64  LVLAAERTQRPDPLNHFNIYVDGWNVTNSHYIASVGFSAVPFIVIAIAWFVLLGLFLICS 123

Query: 132 SCCGWGINIKEEGSNRLQRVCXXXXXXF----TCTAVTGCVLLSFGQDKFHGEAIHTLHY 187
             C               RVC      F    T  AV G  +L  GQ++F+G    T  Y
Sbjct: 124 CLCCCCCGCGRRNYG-YSRVCYTLSLVFLLLFTIAAVIGSAMLYTGQNEFYGSVERTFMY 182

Query: 188 VVNQSDYTVEILRNVTEYLSLAKSITV-AEMFLPSDIMNDIDNLNGDLKAAADTLYEKTH 246
           +V Q+   +  L ++ + +  AK I +      P +   +ID+ N  +K +  T  ++  
Sbjct: 183 IVKQATGVLTKLTSLWDSIQSAKDIQLDGHNLFPPEFRGNIDHFNNMIKMSNITYPDRVA 242

Query: 247 ENSIK-IRKVFDTVXXXXXXXXXXXXXXXXXXXXXXXXXYQHAILIFVISGWLLVVTTFI 305
             +I+ +    + V                          +  + + VI GW+LV  T +
Sbjct: 243 NQTIRYLTGALNPVRYVLNVIAGVMLAVAFLGLLFSFCGLRVLVYLLVILGWILVTATIL 302

Query: 306 LCGVFMLLNNAISDTCLAMGEWEENPQAESTLRNILPCVDQGTTNRTLFQSKQVVTNIVS 365
           L  VF++ +N ++DTC+AM +W  +P A+S L  +LPC+D  T   TL  +K +    V 
Sbjct: 303 LSAVFLVFHNVVADTCMAMDQWVHDPAADSALSQLLPCLDPKTIGETLDITKTMTATAVD 362

Query: 366 VVNRFIY--STADANPSQGSMNYYNQSGPAMPPLCYPFDSEFKERQCTTQEVSSFNASSV 423
           + N +    S  D  P      Y+NQSGP +P LC P D   K R C   EV   NAS V
Sbjct: 363 MTNAYTVNVSNHDQFPPNAPF-YHNQSGPLVPLLCNPLDQNHKPRPCAPDEVLLANASQV 421

Query: 424 WKKYECEVSEYGICTSVGRVTPEIYLELVAAVNEIYALEHYTPLVLSLQNCNFVRDTFKE 483
           +K Y C+V+  GICT+ GR+T   Y +++ A+N  + L+HY P + S+ +C FVRDTF++
Sbjct: 422 YKGYICQVNAEGICTTQGRLTQGSYDQMMGAINVAFTLDHYGPFLASIADCTFVRDTFRD 481

Query: 484 IISSYCPPLN 493
           I +  CP L+
Sbjct: 482 ITTKNCPGLS 491


>AT5G67550.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: flower; EXPRESSED DURING: 4
           anthesis; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT1G71110.1); Has 161 Blast hits
           to 154 proteins in 16 species: Archae - 0; Bacteria - 0;
           Metazoa - 0; Fungi - 0; Plants - 161; Viruses - 0; Other
           Eukaryotes - 0 (source: NCBI BLink). |
           chr5:26946908-26949112 REVERSE LENGTH=509
          Length = 509

 Score = 65.5 bits (158), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 45/202 (22%), Positives = 89/202 (44%), Gaps = 8/202 (3%)

Query: 297 WLLVVTTFILCGVFMLLNNAISDTCLAMGEWEENPQAESTLRNILPCVDQGTTNRTLFQS 356
           W++    ++L G    ++    D C A   + +NP+  STL N+ PC+D   +++TL + 
Sbjct: 255 WIITTLCWVLTGFDFFIHTFAEDLCSAFNGFVQNPR-NSTLTNLFPCMDPLHSDKTLIEI 313

Query: 357 KQVVTNIVSVVNRFIYSTADANPSQGSMNYYNQSGPAMPPLCYPFDSE----FKERQCTT 412
             ++ N ++ +N  +  +  +N      N  + + P    +C PF  +    +  + C+ 
Sbjct: 314 SLMIHNFITQLNSKVAESMRSNALTDRSNTVSWA-PESGIICDPFVGQQINSYTPQSCSN 372

Query: 413 QEVSSFNASSVWKKYECEVSE-YGICTSVGRVTPE-IYLELVAAVNEIYALEHYTPLVLS 470
             +      ++  ++ C   +    C   G+  PE  YL++ A  N    +    P   +
Sbjct: 373 GAIPIGEFPNILSRFTCHDKDPPETCRITGKFIPEAAYLKVYAYSNSAQGMLDILPSFQN 432

Query: 471 LQNCNFVRDTFKEIISSYCPPL 492
           L  C  V+DT   I+S+ C P 
Sbjct: 433 LTECLAVKDTLSSIVSNQCDPF 454



 Score = 50.4 bits (119), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 7/58 (12%)

Query: 78  RTNRPDILQRFRRYKGGWDIANRHYWASVGFTGAAGFILAVLWFISFGLALVIHSCCG 135
           R  R D L  FR Y GG+++ N+HYWA+  FTG  G+ +A       G+ +++  C G
Sbjct: 36  RFKRRDPLNSFRYYDGGFNVRNKHYWAATAFTGIHGYAVA-------GVLIIVGICLG 86