Miyakogusa Predicted Gene

Lj5g3v2166200.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v2166200.1 Non Chatacterized Hit- tr|I1LEM9|I1LEM9_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.53973
PE,80.36,0,coiled-coil,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.56792.1
         (558 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G71110.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   584   e-167
AT2G12400.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   363   e-100
AT2G25270.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...   338   4e-93
AT1G80540.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   257   1e-68
AT5G67550.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    95   2e-19

>AT1G71110.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT2G12400.1); Has 173 Blast hits
           to 169 proteins in 21 species: Archae - 0; Bacteria - 0;
           Metazoa - 3; Fungi - 0; Plants - 165; Viruses - 0; Other
           Eukaryotes - 5 (source: NCBI BLink). |
           chr1:26818244-26820852 FORWARD LENGTH=557
          Length = 557

 Score =  584 bits (1505), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 287/498 (57%), Positives = 351/498 (70%), Gaps = 8/498 (1%)

Query: 35  EHPVKFIIGEENLGPWRNQLTQVAPAPGPNAEDT----LVLAANRTKRPDILQGFRHYRG 90
           + P++ I+G  N G W+  ++    APGP ++D     L+LAA+RTKRPDIL+ F+ Y G
Sbjct: 31  QDPLRLILGSPNFGTWKGGISL---APGPESDDVVSDYLLLAAHRTKRPDILRAFKPYHG 87

Query: 91  GWDITNQHYWASVGFTGGAGFILAVLWFVSFGLALAIHLCCGWGINIKDKESS-HSQRIC 149
           GW+ITN HYWASVGFTG  GFILAV+W +SFG  L ++ C  W I  K K SS  ++RIC
Sbjct: 88  GWNITNNHYWASVGFTGAPGFILAVIWLLSFGSLLVVYHCFKWRICDKAKGSSFDTRRIC 147

Query: 150 LMLLIVFTFAATTGCILLFVGQDKFHGEALDTLHYFVNQSDYSVQILRNVTQYLSLAKTI 209
            +LLIVFT  A  GCILL VGQDKFH EA+ TL Y VNQSDY+V+IL+NVTQYLSLAKTI
Sbjct: 148 FILLIVFTCVAAVGCILLSVGQDKFHTEAMHTLKYVVNQSDYTVEILQNVTQYLSLAKTI 207

Query: 210 HVTQILLPSDVMDDIDKLTVDLNSAADTLSEKTNENAVKFRRVFKDVRXXXXXXXXXXXX 269
           +VTQI++PSDVM +IDKL V+LN+AA TL E T +NA K +RVF  VR            
Sbjct: 208 NVTQIVIPSDVMGEIDKLNVNLNTAAVTLGETTTDNAAKIKRVFYAVRSALITVATVMLI 267

Query: 270 XXXXXXXXXXXGYQHAILIFVITGWLLVATTFILCGVFMILNNAISDTCMAMGEWVENPH 329
                       +QH + IFV++GW+LVA TF+LCGVF+ILNNAISDTC+AM EWV+NPH
Sbjct: 268 LSFVGLLLSVLRHQHVVHIFVVSGWILVAVTFVLCGVFLILNNAISDTCVAMKEWVDNPH 327

Query: 330 RESSLSDVLPCVDQRTTNQTLIQSKQVVTNIAGVVNRFIYETANINATQGTPGYYNQSGP 389
            E++LS +LPCVDQ+TTNQTL QSK V+ +I  VVN F+Y  AN N   G   YYNQSGP
Sbjct: 328 AETALSSILPCVDQQTTNQTLSQSKVVINSIVTVVNTFVYAVANTNPAPGQDRYYNQSGP 387

Query: 390 AMMPLCYPFDSQLQEHQCSDQAVSSANASMVWKNNECEVSESGICTTVGRVTPEIYAQLV 449
            M PLC PFD+ +++ QCS   +S  NAS VW+N +CEV+ SGICTTVGRVTP+ + QLV
Sbjct: 388 PMPPLCIPFDANMEDRQCSPWELSIENASSVWENYKCEVTPSGICTTVGRVTPDTFGQLV 447

Query: 450 AAVNASYALEHYTPLLLSLQNCNFVRDAFTGITSSHCPPLNHYLKITNXXXXXXXXXXXX 509
           AAVN SYALEHYTP LLS ++CNFVR+ F  ITS +CPPL   L+I N            
Sbjct: 448 AAVNESYALEHYTPPLLSFRDCNFVRETFMSITSDYCPPLVRNLRIVNAGLGLISVGVLL 507

Query: 510 XXXXWILYANRPQRGEVF 527
               WI YANRPQR EVF
Sbjct: 508 CLVLWIFYANRPQREEVF 525


>AT2G12400.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 25 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT2G25270.1);
           Has 177 Blast hits to 172 proteins in 23 species: Archae
           - 0; Bacteria - 2; Metazoa - 3; Fungi - 0; Plants - 164;
           Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).
           | chr2:5005144-5008140 REVERSE LENGTH=541
          Length = 541

 Score =  363 bits (932), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 192/489 (39%), Positives = 267/489 (54%), Gaps = 17/489 (3%)

Query: 50  WRNQLTQVAPAPGPNAEDTLVLAANRTKRPDILQGFRHYRGGWDITNQHYWASVGFTGGA 109
           WR  + +   A       +L+LAA RT+R D    F+ Y GGW+I+N HY  SVG+T   
Sbjct: 46  WRTSVIERVIAEESGENSSLILAAKRTRRKDPADNFKLYTGGWNISNSHYLTSVGYTAAP 105

Query: 110 GFILAVLWFVSFGLALAI----HLCCGWGINIKDKESSHSQRIC----LMLLIVFTFAAT 161
             I+A++WFV FGL+L++    + CC        ++S    R+     L+LLI FT AA 
Sbjct: 106 FIIIALVWFVFFGLSLSLICLCYCCCA-------RQSYGYSRVAYALSLILLISFTIAAI 158

Query: 162 TGCILLFVGQDKFHGEALDTLHYFVNQSDYSVQILRNVTQYLSLAKTIHVTQILLPSDVM 221
            GC+ L+ GQ KFH    DTL Y V+Q++ + + LRNV+ YL+ AK + V   +LP DV+
Sbjct: 159 IGCVFLYTGQGKFHASTTDTLDYVVSQANLTSENLRNVSDYLNAAKKVDVQSSILPQDVL 218

Query: 222 DDIDKLTVDLNSAADTLSEKTNENAVKFRRVFKDVRXXXXXXXXXXXXXXXXXXXXXXXG 281
             ID +   +NS+A TLS KT EN  K + V   +R                       G
Sbjct: 219 SSIDNIQGKINSSATTLSVKTMENQDKIQNVLDIMRLALVIIAAVMLFLAFIGFLLSIFG 278

Query: 282 YQHAILIFVITGWLLVATTFILCGVFMILNNAISDTCMAMGEWVENPHRESSLSDVLPCV 341
            Q  +   VI GW+LV  TF+LCG F++L+N + DTC+AM +WV+NP   ++L D+LPCV
Sbjct: 279 LQCLVYTLVILGWILVTVTFVLCGGFLLLHNVVGDTCVAMDQWVQNPTAHTALDDILPCV 338

Query: 342 DQRTTNQTLIQSKQVVTNIAGVVNRFIYETANIN-ATQGTPGYYNQSGPAMMPLCYPFDS 400
           D  T  +TL ++K V   +  +++  I    N N   Q  P YYNQSGP M  LC PF++
Sbjct: 339 DNATARETLTRTKLVTYQLVNLLDNAISNMTNRNFPPQFRPLYYNQSGPLMPLLCNPFNA 398

Query: 401 QLQEHQCSDQAVSSANASMVWKNNECEVSESGICTTVGRVTPEIYAQLVAAVNASYALEH 460
            L + QC    V   NA+ VWKN  C++   G C+T GR+TP++Y+Q+ AAVN SY L  
Sbjct: 399 DLSDRQCQPGQVHLNNATEVWKNFTCQIVTPGTCSTPGRLTPKLYSQMAAAVNVSYGLYK 458

Query: 461 YTPLLLSLQNCNFVRDAFTGITSSHCPPLNHYLKITNXXXXXXXXXXXXXXXXWILYANR 520
           Y P L  LQ C+FVR  FT I   HCP L  Y +                   W++YA R
Sbjct: 459 YGPFLADLQGCDFVRSTFTDIERDHCPGLKRYTQWIYVGLVVVSASVMSSLVFWVIYA-R 517

Query: 521 PQRGEVFVK 529
            +R  V+ K
Sbjct: 518 ERRHRVYTK 526


>AT2G25270.1 | Symbols:  | unknown protein; LOCATED IN: plasma
           membrane; EXPRESSED IN: 18 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT2G12400.1);
           Has 35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr2:10759779-10762358 FORWARD
           LENGTH=545
          Length = 545

 Score =  338 bits (868), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 186/518 (35%), Positives = 283/518 (54%), Gaps = 23/518 (4%)

Query: 31  TGSIEHPVKFIIGEENL-GP--WRN-QLTQVAPAPGPNAEDTLVLAANRTKRPDILQGFR 86
           TGS+   +KFI+ E  L GP  + N Q+ +VA         ++ LAA RT R D L GF 
Sbjct: 40  TGSV---MKFIVAEAPLLGPAGFNNPQVIEVA---------SVALAAQRTYRKDPLNGFE 87

Query: 87  HYRGGWDITNQHYWASVGFTGGAGFILAVLWFVSFGLALAIHLCCGWGINIKDKESSHSQ 146
            Y GGW+I+NQHYWASV +T    F+LA +WF+ FG+ L +   C   I  +     +S+
Sbjct: 88  KYTGGWNISNQHYWASVSYTAVPLFVLAAVWFLGFGICLLV--ICMCHICHRTNSVGYSK 145

Query: 147 R---ICLMLLIVFTFAATTGCILLFVGQDKFHGEALDTLHYFVNQSDYSVQILRNVTQYL 203
               + L+ L++FT  A  GC+LL+ GQ +++    +TL Y ++Q+D ++  LR ++ YL
Sbjct: 146 VAYVVSLIFLLIFTVIAIIGCVLLYSGQIRYNKSTTETLEYVMSQADSTISQLRAISDYL 205

Query: 204 SLAKTIHVTQILLPSDVMDDIDKLTVDLNSAADTLSEKTNENAVKFRRVFKDVRXXXXXX 263
           + AK   V Q+LLP++V  +ID++ V L+S+  T++EK+  ++   R     VR      
Sbjct: 206 ASAKQAAVLQVLLPANVQTEIDQIGVKLDSSVATITEKSTNSSNHIRHFLDSVRVALIVV 265

Query: 264 XXXXXXXXXXXXXXXXXGYQHAILIFVITGWLLVATTFILCGVFMILNNAISDTCMAMGE 323
                            G Q  +   VI GW+LV  TFIL G F++L+NA +DTC+AM E
Sbjct: 266 SIVMLVVTFLGLVSSIFGMQVIVYTLVILGWILVTGTFILSGTFLVLHNATADTCVAMSE 325

Query: 324 WVENPHRESSLSDVLPCVDQRTTNQTLIQSKQVVTNIAGVVNRFIYETANINATQ-GTPG 382
           WVE P   ++L ++LPC D  T  +TL++S++V   +  ++N  I   +NIN +    P 
Sbjct: 326 WVERPSSNTALDEILPCTDNATAQETLMRSREVTGQLVELINTVITNVSNINFSPVFVPM 385

Query: 383 YYNQSGPAMMPLCYPFDSQLQEHQCSDQAVSSANASMVWKNNECEVSESGICTTVGRVTP 442
           YYNQSGP +  LC PF+  L +  CS   +   NA+  W +  C+VS++G CTT GR+TP
Sbjct: 386 YYNQSGPLLPLLCNPFNHDLTDRSCSPGDLDLNNATEAWTSFVCQVSQNGTCTTTGRLTP 445

Query: 443 EIYAQLVAAVNASYALEHYTPLLLSLQNCNFVRDAFTGITSSHCPPLNHYLKITNXXXXX 502
            +Y+Q+ + VN S  L    P L+ LQ+C++ +  F  IT+ HCP L  Y          
Sbjct: 446 ALYSQMASGVNISTGLIRDAPFLVQLQDCSYAKQTFRDITNDHCPGLQRYGYWVYVGLAI 505

Query: 503 XXXXXXXXXXXWILYAN-RPQRGEVFVKLSLPEKIKNI 539
                      WI+Y+  R  R E   + S  ++I  +
Sbjct: 506 LATAVMLSLMFWIIYSRERRHRKEALPEFSESKEIVRV 543


>AT1G80540.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G12400.1); Has 175 Blast hits to 171 proteins
           in 20 species: Archae - 0; Bacteria - 0; Metazoa - 2;
           Fungi - 0; Plants - 171; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr1:30281638-30284258 REVERSE
           LENGTH=538
          Length = 538

 Score =  257 bits (657), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 146/429 (34%), Positives = 222/429 (51%), Gaps = 8/429 (1%)

Query: 69  LVLAANRTKRPDILQGFRHYRGGWDITNQHYWASVGFTGGAGFILAVLWFVSFGLALAIH 128
           LVLAA RT+RPD L  F  Y  GW++TN HY ASVGF+     ++A+ WFV  GL L   
Sbjct: 64  LVLAAERTQRPDPLNHFNIYVDGWNVTNSHYIASVGFSAVPFIVIAIAWFVLLGLFLICS 123

Query: 129 LCCGWGINIKDKESSHSQRIC----LMLLIVFTFAATTGCILLFVGQDKFHGEALDTLHY 184
             C        +   +S R+C    L+ L++FT AA  G  +L+ GQ++F+G    T  Y
Sbjct: 124 CLCCCCCGCGRRNYGYS-RVCYTLSLVFLLLFTIAAVIGSAMLYTGQNEFYGSVERTFMY 182

Query: 185 FVNQSDYSVQILRNVTQYLSLAKTIHVT-QILLPSDVMDDIDKLTVDLNSAADTLSEKTN 243
            V Q+   +  L ++   +  AK I +    L P +   +ID     +  +  T  ++  
Sbjct: 183 IVKQATGVLTKLTSLWDSIQSAKDIQLDGHNLFPPEFRGNIDHFNNMIKMSNITYPDRVA 242

Query: 244 ENAVKFRR-VFKDVRXXXXXXXXXXXXXXXXXXXXXXXGYQHAILIFVITGWLLVATTFI 302
              +++       VR                       G +  + + VI GW+LV  T +
Sbjct: 243 NQTIRYLTGALNPVRYVLNVIAGVMLAVAFLGLLFSFCGLRVLVYLLVILGWILVTATIL 302

Query: 303 LCGVFMILNNAISDTCMAMGEWVENPHRESSLSDVLPCVDQRTTNQTLIQSKQVVTNIAG 362
           L  VF++ +N ++DTCMAM +WV +P  +S+LS +LPC+D +T  +TL  +K +      
Sbjct: 303 LSAVFLVFHNVVADTCMAMDQWVHDPAADSALSQLLPCLDPKTIGETLDITKTMTATAVD 362

Query: 363 VVNRFIYETANINA-TQGTPGYYNQSGPAMMPLCYPFDSQLQEHQCSDQAVSSANASMVW 421
           + N +    +N +      P Y+NQSGP +  LC P D   +   C+   V  ANAS V+
Sbjct: 363 MTNAYTVNVSNHDQFPPNAPFYHNQSGPLVPLLCNPLDQNHKPRPCAPDEVLLANASQVY 422

Query: 422 KNNECEVSESGICTTVGRVTPEIYAQLVAAVNASYALEHYTPLLLSLQNCNFVRDAFTGI 481
           K   C+V+  GICTT GR+T   Y Q++ A+N ++ L+HY P L S+ +C FVRD F  I
Sbjct: 423 KGYICQVNAEGICTTQGRLTQGSYDQMMGAINVAFTLDHYGPFLASIADCTFVRDTFRDI 482

Query: 482 TSSHCPPLN 490
           T+ +CP L+
Sbjct: 483 TTKNCPGLS 491


>AT5G67550.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: flower; EXPRESSED DURING: 4
           anthesis; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT1G71110.1); Has 161 Blast hits
           to 154 proteins in 16 species: Archae - 0; Bacteria - 0;
           Metazoa - 0; Fungi - 0; Plants - 161; Viruses - 0; Other
           Eukaryotes - 0 (source: NCBI BLink). |
           chr5:26946908-26949112 REVERSE LENGTH=509
          Length = 509

 Score = 94.7 bits (234), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 96/440 (21%), Positives = 166/440 (37%), Gaps = 46/440 (10%)

Query: 75  RTKRPDILQGFRHYRGGWDITNQHYWASVGFTGGAGFILAVLWFVSFGLALAIHLCCGWG 134
           R KR D L  FR+Y GG+++ N+HYWA+  FTG  G+ +A       G+ + + +C G  
Sbjct: 36  RFKRRDPLNSFRYYDGGFNVRNKHYWAATAFTGIHGYAVA-------GVLIIVGICLGLY 88

Query: 135 INIKDKESSHSQRICLMLLIVFTF------------AATTGCILLFVGQDKFH----GEA 178
           +   DK    S      L   +                TTG ++    + K       E 
Sbjct: 89  VAFSDKRRRVSSTRRRYLDRYYLPLFLLLLLFMFLSVVTTGIVIAANQRSKNRTEEMKET 148

Query: 179 LDTLHYFVNQSDYSVQILRNVTQYLSLAKTIHVTQILLPSDVMDDIDKLTVDLNSAADTL 238
           +D     VNQ+  +V +     QYL L    + T +L         +  T  L   +  +
Sbjct: 149 IDKAGEDVNQNIRTVIVSLTKIQYLLLPYDQNTTHLL---------NVTTHRLGKGSRLI 199

Query: 239 SEKTNENAVKFRRVFKDVRXXXXXXXXXXXXXXXXXXXXXXXGYQHAILIFVITGWLLVA 298
               +          K                           +    ++ +   W++  
Sbjct: 200 QSFLHHKGRSIDLAIKISYVSHLMITSTNLFLLLLAFLPLLLHWHPGFIMVIFLCWIITT 259

Query: 299 TTFILCGVFMILNNAISDTCMAMGEWVENPHRESSLSDVLPCVDQRTTNQTLIQSKQVVT 358
             ++L G    ++    D C A   +V+NP R S+L+++ PC+D   +++TLI+   ++ 
Sbjct: 260 LCWVLTGFDFFIHTFAEDLCSAFNGFVQNP-RNSTLTNLFPCMDPLHSDKTLIEISLMIH 318

Query: 359 NIAGVVNRFIYETANINA---TQGTPGYYNQSGPAMMPLCYPFDSQ----LQEHQCSDQA 411
           N    +N  + E+   NA      T  +  +SG     +C PF  Q         CS+ A
Sbjct: 319 NFITQLNSKVAESMRSNALTDRSNTVSWAPESG----IICDPFVGQQINSYTPQSCSNGA 374

Query: 412 VSSANASMVWKNNECEVSE-SGICTTVGRVTPE-IYAQLVAAVNASYALEHYTPLLLSLQ 469
           +       +     C   +    C   G+  PE  Y ++ A  N++  +    P   +L 
Sbjct: 375 IPIGEFPNILSRFTCHDKDPPETCRITGKFIPEAAYLKVYAYSNSAQGMLDILPSFQNLT 434

Query: 470 NCNFVRDAFTGITSSHCPPL 489
            C  V+D  + I S+ C P 
Sbjct: 435 ECLAVKDTLSSIVSNQCDPF 454