Miyakogusa Predicted Gene

Lj0g3v0036819.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0036819.1 Non Characterized Hit- tr|I1K3W7|I1K3W7_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.9593 PE=,81.77,0,FAMILY
NOT NAMED,NULL; seg,NULL,NODE_22169_length_2288_cov_81.632866.path1.1
         (584 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G08810.1 | Symbols: SUB1 | calcium ion binding | chr4:5616204...   756   0.0  
AT2G04280.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   519   e-147
AT4G12700.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   501   e-142
AT2G41150.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    84   4e-16
AT3G56750.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    77   4e-14

>AT4G08810.1 | Symbols: SUB1 | calcium ion binding |
           chr4:5616204-5617862 REVERSE LENGTH=552
          Length = 552

 Score =  756 bits (1953), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 391/574 (68%), Positives = 439/574 (76%), Gaps = 35/574 (6%)

Query: 17  TEPIAQNLIKLISNLCFSLFVFSVLIFTVIAITYQPPDPWLESSPALTNLFTKTENATFH 76
           TEPIAQNLIKLISN+CFS+FVF+VLIFTVIA+TYQPPDPWLES+PALT L T+TENATF 
Sbjct: 8   TEPIAQNLIKLISNVCFSVFVFTVLIFTVIAVTYQPPDPWLESAPALTKLLTETENATFK 67

Query: 77  IDSSIIKTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFANXXXXXXXXXXXXXXX 136
           ID SI+KTGE                                                  
Sbjct: 68  IDGSILKTGEDLASSPSSSPPSNSTEQVTEATIEKSEAKIGNMT---------------- 111

Query: 137 XXXXXXXXCD---KTLNCSDPRILIAIQRFNLRAFKSIAFFDYQPPVNGSSLGECDVAWR 193
                   CD   K +NCSDPR+L+A++RFNL+ FKSI F +Y+ PVNGS L ECDV+WR
Sbjct: 112 --VKNSIDCDEDLKIVNCSDPRVLVAVERFNLKVFKSIVFLEYETPVNGSKLDECDVSWR 169

Query: 194 FRNKREKSWRKYRDFRRFKITVTDDCRYKVVHAGGWHSGGNARRS-PTRPSGN--ARGRT 250
           FRNK+EKSWR+YRDFRRF+    ++C YKV H  GWHSG NARR   +RPS +  ARG  
Sbjct: 170 FRNKKEKSWRRYRDFRRFRFGFGENCTYKVFHTSGWHSGVNARRPRISRPSSSRGARGG- 228

Query: 251 VPRVSTRDEEINDTIPSLGSESNFRNGKYLYYTRGGDYCKGMNHYMWSFLCGLGEAMYLN 310
                  D EINDTIP+LGS+++FR GKYLYY+RGGDYCKGMN YMWSFLCGLGEAMYLN
Sbjct: 229 -------DSEINDTIPTLGSQTSFRRGKYLYYSRGGDYCKGMNQYMWSFLCGLGEAMYLN 281

Query: 311 RTFVMDLSVCLSSSYNPSNKDEEGKDFRYYFDFEHLKEVSSIVEEAEFLRDWKKWDRTHL 370
           RTFVMDLS+CLSSSY+   KDEEGKDFRYYFDFEHLKE +SIVEE EFLRDWKKW+R H 
Sbjct: 282 RTFVMDLSLCLSSSYSSKGKDEEGKDFRYYFDFEHLKETASIVEEGEFLRDWKKWNRLH- 340

Query: 371 XXXXXXXXXXXXXXXTPMQLQKDKSTIIMRQFDDAPEPENYWYRVCEGQAAEYIQRPWHA 430
                          +P+QL KDKSTII RQFD  PEPENYWYRVCEGQA++Y++RPWHA
Sbjct: 341 -KRKVPVRKVKTHRVSPLQLSKDKSTIIWRQFD-TPEPENYWYRVCEGQASKYVERPWHA 398

Query: 431 LWKSKRLMNIVTEISGRMDWDFDAVHVVRGEKAQNKELWPHLDADTSPDALAEKLKGMVQ 490
           LWKSKRLMNIV+EISG+MDWDFDAVHVVRGEKA+NK+LWPHLDADT PDA+  KLKG+VQ
Sbjct: 399 LWKSKRLMNIVSEISGKMDWDFDAVHVVRGEKAKNKKLWPHLDADTWPDAILTKLKGLVQ 458

Query: 491 PSRNLYIATNEPFYNFFDKLRSNYKVHLLDDYNELWGNASEWYNETRLLNNGKPVEFDGY 550
             RNLY+ATNEPFYN+FDKLRS YKVHLLDDY+ LWGN SEWYNET LLNNGKPVEFDGY
Sbjct: 459 VWRNLYVATNEPFYNYFDKLRSQYKVHLLDDYSYLWGNKSEWYNETSLLNNGKPVEFDGY 518

Query: 551 MRVAVDTEVFYRGKTRVETFYNLTRDCKDGVNTC 584
           MRVAVDTEVFYRGKTRVETFYNLT DCKDG+NTC
Sbjct: 519 MRVAVDTEVFYRGKTRVETFYNLTTDCKDGINTC 552


>AT2G04280.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G12700.1); Has 130 Blast hits to 130 proteins
           in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 124; Viruses - 0; Other Eukaryotes -
           6 (source: NCBI BLink). | chr2:1480277-1481983 REVERSE
           LENGTH=568
          Length = 568

 Score =  519 bits (1337), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 263/577 (45%), Positives = 345/577 (59%), Gaps = 34/577 (5%)

Query: 16  RTEPIAQNLIKLISNLCFSLFVFSVLIFTVIAITYQPPDPWLESSPALTNLFTKTENATF 75
           R E + QN + LI N+ FSLFVF VLIFT+IA TY+P DP    S  +T   T T NAT 
Sbjct: 13  RAENLGQNALTLIGNIGFSLFVFGVLIFTIIAATYEPEDPLFHPSDKITTFLTSTSNATL 72

Query: 76  HIDSSIIKTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFANXXXXXXXXXXXXXX 135
             D S++KTGE                                F N              
Sbjct: 73  RSDDSVVKTGEDFMLANQTAFAE--------------------FININDVEASTNETTTE 112

Query: 136 XXXXXXXXXCDKTLNCSDPRILIAIQRFNLRAFKSIAFFDYQPPVNGS-SLGECDVAWRF 194
                     +  ++C D ++   + R  +  FK I F+ +  PV G   +  CD+AWR+
Sbjct: 113 EEGNKLECDVNTPIDCKDQQVFHLMMRATIDKFKDIHFYKFGKPVTGEEGVNSCDMAWRY 172

Query: 195 RNKREKSWRKYRDFRRFKITVTDDCRYKVVHAGGWHSGGNARRSPT-------RPSGNAR 247
           R +  KS   Y+D+RRF +  +++C   VV  G +HSG NAR+          +  G   
Sbjct: 173 RPRDGKSAAFYKDYRRFVVAKSENCSVSVVGIGEYHSGLNARKRKKNQKAGFEKTGGKKD 232

Query: 248 GRTVPRVSTRDEEINDTIPSLGSESNFRNGKYLYYTRGGDYCKGMNHYMWSFLCGLGEAM 307
             ++P V    E +ND++P + S+S F+ GKYL Y  GGD CK MNH++WSFLC LGEA 
Sbjct: 233 DFSLPVVG---ELVNDSLPMVESDSVFKTGKYLVYVGGGDRCKSMNHFLWSFLCALGEAQ 289

Query: 308 YLNRTFVMDLSVCLSSSYNPSNKDEEGKDFRYYFDFEHLKEVSSIVEEAEFLRDWKKWDR 367
           YLNRT VMDL++CLSS Y  S ++EEGKDFR+YFDFEHLKE +S+++EA+F   W K  +
Sbjct: 290 YLNRTLVMDLTLCLSSIYTSSGQNEEGKDFRFYFDFEHLKEAASVLDEAQFWAQWGKLRK 349

Query: 368 THLXXXXXXXXXXXXXXXTPMQLQKDKSTIIMRQFDDAPEPENYWYRVCEGQAAEYIQRP 427
                             TPM+L   K T+IMR+F  + EP+NYWYRVCEG A   ++RP
Sbjct: 350 KR--RNRLNLHLVEDFRVTPMKLAAVKDTLIMRKFG-SVEPDNYWYRVCEGDAESVVKRP 406

Query: 428 WHALWKSKRLMNIVTEISGRMDWDFDAVHVVRGEKAQNKELWPHLDADTSPDALAEKLKG 487
           WH LWKS+RLM IV+ I+ R++WD+DAVH+ RGEKA+NKE+WP+L+ADTSP AL   L+ 
Sbjct: 407 WHLLWKSRRLMEIVSAIASRLNWDYDAVHIERGEKARNKEVWPNLEADTSPSALLSTLQD 466

Query: 488 MVQPSRNLYIATNEPFYNFFDKLRSNYKVHLLDDYNELWGNASEWYNETRLLNNGKPVEF 547
            V+  R+LYIATNE   +FF+ L+  Y  H L DY +LW  +SEWY+ET  LN G PVEF
Sbjct: 467 KVEEGRHLYIATNEGELSFFNPLKDKYATHFLYDYKDLWDESSEWYSETTKLNGGNPVEF 526

Query: 548 DGYMRVAVDTEVFYRGKTRVETFYNLTRDCKDGVNTC 584
           DGYMR +VDTEVF RGK ++ETF +LT DCKDGV TC
Sbjct: 527 DGYMRASVDTEVFLRGKKQIETFNDLTNDCKDGVGTC 563


>AT4G12700.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G04280.1); Has 136 Blast hits to 136 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 125; Viruses - 0; Other Eukaryotes -
           11 (source: NCBI BLink). | chr4:7482643-7484328 REVERSE
           LENGTH=561
          Length = 561

 Score =  501 bits (1290), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 253/575 (44%), Positives = 344/575 (59%), Gaps = 38/575 (6%)

Query: 16  RTEPIAQNLIKLISNLCFSLFVFSVLIFTVIAITYQPPDPWLESSPALTNLFTKTENATF 75
           R E + QN + LI ++ FS+ V  V++FT+IA TY+P DP    S  +T   T   NAT 
Sbjct: 14  RPENLGQNAVSLIGSIGFSVLVIGVVVFTIIAATYEPEDPLFHPSDKITTFLTSNSNATL 73

Query: 76  HIDSSIIKTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFANXXXXXXXXXXXXXX 135
             D SI+KTGE                                F N              
Sbjct: 74  KSDDSIVKTGEDFMAANQTAFGG--------------------FINIADVETSENDSDGN 113

Query: 136 XXXXXXXXXCDKTL--NCSDPRILIAIQRFNLRAFKSIAFFDYQPPV--NGSSLGECDVA 191
                    CD  +  +C DP +   + +  +  FK   F+ +  PV   GSS   CD+A
Sbjct: 114 QLD------CDTNIPIDCKDPEVFHLMMKATMEKFKDSHFYKFGKPVIVEGSS-SSCDMA 166

Query: 192 WRFRNKREKSWRKYRDFRRFKITVTDDCRYKVVHAGGWHSGGNARRSPTRPSGNARGRTV 251
           WR+R K  K+   Y+D+RRF I  + +C   V+  G +HSG NAR+       N+ G  V
Sbjct: 167 WRYRPKDGKAAAFYKDYRRFVIEKSGNCSVSVMGIGEYHSGVNARKRKRPGFRNSSGGKV 226

Query: 252 P--RVSTRDEEINDTIPSLGSESNFRNGKYLYYTRGGDYCKGMNHYMWSFLCGLGEAMYL 309
               +    E +ND++P + SE+ F+ G YL Y+ GGD CK MNH++WSFLC LGEA YL
Sbjct: 227 DDFALPVVGEAVNDSLPVVESENVFKEGHYLVYSGGGDRCKSMNHFLWSFLCALGEAQYL 286

Query: 310 NRTFVMDLSVCLSSSYNPSNKDEEGKDFRYYFDFEHLKEVSSIVEEAEFLRDWKKWDRTH 369
           NRT VMDL++CLSS Y  S ++EEGKDFR+YFDFEHLKE +S++++ +F  DW KW + +
Sbjct: 287 NRTLVMDLTLCLSSVYTLSGQNEEGKDFRFYFDFEHLKEAASMLDQVQFWADWGKWYKKN 346

Query: 370 LXXXXXXXXXXXXXXXTPMQLQKDKSTIIMRQFDDAPEPENYWYRVCEGQAAEYIQRPWH 429
                           TPM+L   K T+IMR+F    EP+NYWYRVCEG+    +QRPW+
Sbjct: 347 ----GLKLHLVEDFRVTPMKLVDVKDTLIMRKFGTV-EPDNYWYRVCEGETESVVQRPWN 401

Query: 430 ALWKSKRLMNIVTEISGRMDWDFDAVHVVRGEKAQNKELWPHLDADTSPDALAEKLKGMV 489
            LWKSKRLM IV+ I+ R++WD+DA+H+ RG+KA+NKE+WP+L+ DTSP ++   L+  +
Sbjct: 402 LLWKSKRLMEIVSAIASRLNWDYDAIHIERGDKARNKEVWPNLEKDTSPSSILSTLQDKI 461

Query: 490 QPSRNLYIATNEPFYNFFDKLRSNYKVHLLDDYNELWGNASEWYNETRLLNNGKPVEFDG 549
           +  RNLYIATNEP  +FF+ L+  YK H LD++ +LW  +SEWY+ET  LN G PVEFDG
Sbjct: 462 EQGRNLYIATNEPELSFFNPLKDKYKPHFLDEFKDLWDESSEWYSETTKLNGGNPVEFDG 521

Query: 550 YMRVAVDTEVFYRGKTRVETFYNLTRDCKDGVNTC 584
           YMR +VDTEVF RGK ++ETF +LT DC+DG+ TC
Sbjct: 522 YMRASVDTEVFLRGKKQIETFNDLTNDCRDGIGTC 556


>AT2G41150.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT3G56750.1);
           Has 127 Blast hits to 127 proteins in 16 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117;
           Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink).
           | chr2:17153851-17155633 FORWARD LENGTH=404
          Length = 404

 Score = 83.6 bits (205), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 74/312 (23%), Positives = 131/312 (41%), Gaps = 37/312 (11%)

Query: 278 KYLYYT-------RGGDYCKGMNHYMWSFLCGLGEAMYLNRTFVMDLSVCLSSSYNP--- 327
           KYLY+        +  + C G+ H   S  C L EAM+LNRTFVM   +C++  +N    
Sbjct: 69  KYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPSRMCINPIHNKKGI 128

Query: 328 ---SNKDEEGKDFRYY-FDFEHLKEVSSIVEEAEFLRD----WKKWDRTHLXXXXXXXXX 379
              SN +   + +       E L ++  I E+   + D    W     T +         
Sbjct: 129 LNRSNNETREESWEVSSCAMESLYDIDLISEKIPVILDDSETWHIMLSTSMKLKERGSAH 188

Query: 380 XXXXXXTPMQLQKDKSTIIMRQFDDAPEPENYWYRVCEGQAAEY-IQRPWHAL--WKSKR 436
                   +    D + +++     +P     W+  C+ +     +  P+  L    + R
Sbjct: 189 VYGANRHELNDSSDFTNLLLINRTASPLA---WFVECKDRGNRSDVMLPYSFLQTMAASR 245

Query: 437 LMNIVTEISGRMDWDFDAVHVVRGEKAQNKE--------LWPHLDADTSPDALAEKLKGM 488
           L +   +I  ++  D+DA+HV RG+K + ++         +PHLD DT P+ +  +++  
Sbjct: 246 LRDAAEKIKAKLG-DYDAIHVRRGDKLKTRKDRFRVERSQFPHLDRDTRPEFIIGRIQKQ 304

Query: 489 VQPSRNLYIATNEPFYNFFDKLRSNYKVHLLDDYNELWG----NASEWYNETRLLNNGKP 544
           + P R L+I +NE   +FF  L   YKV    +++E+      N  + +   RL+  G  
Sbjct: 305 IPPGRTLFIGSNERTPDFFSPLAIRYKVAYSSNFSEILDPIIENNYQLFMVERLIMMGAK 364

Query: 545 VEFDGYMRVAVD 556
             F  +     D
Sbjct: 365 TFFKTFREYETD 376


>AT3G56750.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G41150.2); Has 128 Blast hits to 128 proteins
           in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes -
           11 (source: NCBI BLink). | chr3:21018326-21020192
           REVERSE LENGTH=403
          Length = 403

 Score = 76.6 bits (187), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 74/318 (23%), Positives = 127/318 (39%), Gaps = 32/318 (10%)

Query: 269 GSESNFRNGKYLYYT-------RGGDYCKGMNHYMWSFLCGLGEAMYLNRTFVMDLSVCL 321
           G   +  + KYLY+        +  + C G+ H   S  C L EAM+LNRTFVM   +C+
Sbjct: 60  GKRQHSLSEKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPSGMCI 119

Query: 322 SSSYNPS-------NKDEEGKDFRYYFDFEHLKEVSSIVEEAE-FLRDWKKWDRTHLXXX 373
           +  +N         NK  E          + L ++  I E+    L D K W        
Sbjct: 120 NPIHNKKGILNRSDNKTTEEGWLGSSCAMDSLYDIDLISEKIPVILDDSKTWHIVLSTSM 179

Query: 374 XXXXXXXXXXXXTPMQLQKDKSTIIMRQFDDAPEPENYWYRVCEGQA-AEYIQRPWHAL- 431
                             K+     +   +    P   W+  C+ ++    +  P+  L 
Sbjct: 180 KLGERGIAHVSGVTRHRLKESHYSNLLIINRTASPLA-WFVECKDRSNRSAVMLPYSFLP 238

Query: 432 -WKSKRLMNIVTEISGRMDWDFDAVHVVRGEKAQNKE--------LWPHLDADTSPDALA 482
              + +L N   +I  ++  D+DA+HV RG+K + ++         +PHLD DT P+ + 
Sbjct: 239 NMAAAKLRNAAEKIKAQLG-DYDAIHVRRGDKLKTRKDRFGVERIQFPHLDRDTRPEFIL 297

Query: 483 EKLKGMVQPSRNLYIATNEPFYNFFDKLRSNYKVHLLDDYNELWG----NASEWYNETRL 538
            +++  +   R L+I +NE    FF  L   YK+    +++E+      N  + +   RL
Sbjct: 298 RRIEKRIPRGRTLFIGSNERKPGFFSPLAVRYKLAYSSNFSEILDPIIENNYQLFMMERL 357

Query: 539 LNNGKPVEFDGYMRVAVD 556
           +  G    F  +     D
Sbjct: 358 VMMGAKTYFKTFKEYETD 375