Miyakogusa Predicted Gene

Lj2g3v2220780.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj2g3v2220780.1 tr|Q10KX9|Q10KX9_ORYSJ Expressed protein OS=Oryza
sativa subsp. japonica GN=LOC_Os03g25110 PE=2
SV=1,26.76,0.00000000000002,seg,NULL; FAMILY NOT
NAMED,NULL,CUFF.38741.1
         (536 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G04280.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   738   0.0  
AT4G12700.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   731   0.0  
AT4G08810.1 | Symbols: SUB1 | calcium ion binding | chr4:5616204...   440   e-123
AT3G56750.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    82   7e-16
AT2G41150.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    82   9e-16
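
The summary table above can be read programmatically. The following is a minimal sketch, assuming the last two whitespace-separated fields of each row are the bit score and the E-value, and that the hit identifier is the text before the first " | ". The helper names (`parse_evalue`, `parse_hit_line`) are hypothetical and not part of BLAST. Legacy BLAST abbreviates 1e-123 as "e-123", so the sketch restores the leading digit before converting.

```python
# Rows copied from the hit table above (descriptions are truncated by BLAST).
HIT_LINES = """\
AT2G04280.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   738   0.0
AT4G12700.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   731   0.0
AT4G08810.1 | Symbols: SUB1 | calcium ion binding | chr4:5616204...   440   e-123
AT3G56750.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    82   7e-16
AT2G41150.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    82   9e-16
"""


def parse_evalue(text):
    """Convert a legacy BLAST E-value string to float ("e-123" -> 1e-123)."""
    return float("1" + text if text.startswith("e") else text)


def parse_hit_line(line):
    """Return (hit_id, bit_score, e_value) for one summary-table row."""
    # Assumption: score and E-value are the last two whitespace-separated fields.
    description, score, evalue = line.rstrip().rsplit(None, 2)
    hit_id = description.split(" | ")[0]
    return hit_id, float(score), parse_evalue(evalue)


hits = [parse_hit_line(row) for row in HIT_LINES.splitlines() if row.strip()]
for hit_id, score, evalue in hits:
    print(hit_id, score, evalue)
```

Splitting from the right avoids having to interpret the free-text description, which itself contains spaces and "|" separators.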

>AT2G04280.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G12700.1); Has 130 Blast hits to 130 proteins
           in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 124; Viruses - 0; Other Eukaryotes -
           6 (source: NCBI BLink). | chr2:1480277-1481983 REVERSE
           LENGTH=568
          Length = 568

 Score =  738 bits (1906), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/445 (78%), Positives = 397/445 (89%), Gaps = 5/445 (1%)

Query: 96  PIDCRDPEVFHLMMRATIEKFQDIHFYRFGKPTPGSND-STCDMAWRFRPKEGKAAAFYK 154
           PIDC+D +VFHLMMRATI+KF+DIHFY+FGKP  G    ++CDMAWR+RP++GK+AAFYK
Sbjct: 125 PIDCKDQQVFHLMMRATIDKFKDIHFYKFGKPVTGEEGVNSCDMAWRYRPRDGKSAAFYK 184

Query: 155 DYRRFVIEKSENCTLSVVSIGEYHSGPNARKRKKNQKAGLEKSTQKLDEANALPVVGEFV 214
           DYRRFV+ KSENC++SVV IGEYHSG NARKRKKNQKAG EK+  K D+  +LPVVGE V
Sbjct: 185 DYRRFVVAKSENCSVSVVGIGEYHSGLNARKRKKNQKAGFEKTGGKKDDF-SLPVVGELV 243

Query: 215 NDSLP-VFSESSFGQGKYLVYHGGGDRCKSMNHFLWSFLCALGEAQYLNRTLVMDLSICL 273
           NDSLP V S+S F  GKYLVY GGGDRCKSMNHFLWSFLCALGEAQYLNRTLVMDL++CL
Sbjct: 244 NDSLPMVESDSVFKTGKYLVYVGGGDRCKSMNHFLWSFLCALGEAQYLNRTLVMDLTLCL 303

Query: 274 SSIYTSSKQDEEGKDFRFYFDFEHLKEAASVLDKEQFWADWNKWQQK--DGMTQHLVEDF 331
           SSIYTSS Q+EEGKDFRFYFDFEHLKEAASVLD+ QFWA W K ++K  + +  HLVEDF
Sbjct: 304 SSIYTSSGQNEEGKDFRFYFDFEHLKEAASVLDEAQFWAQWGKLRKKRRNRLNLHLVEDF 363

Query: 332 RVTPMKLMDVKDALIMRKFGSVEPDNYWYRVCEGETESVVQRPWHLLWKSRRLMDIVSAI 391
           RVTPMKL  VKD LIMRKFGSVEPDNYWYRVCEG+ ESVV+RPWHLLWKSRRLM+IVSAI
Sbjct: 364 RVTPMKLAAVKDTLIMRKFGSVEPDNYWYRVCEGDAESVVKRPWHLLWKSRRLMEIVSAI 423

Query: 392 ASRLNWDYDSVHVERGEKARNRELWPNLDADTSSDALLSTLRDKVDEGRNLYIATNEPDT 451
           ASRLNWDYD+VH+ERGEKARN+E+WPNL+ADTS  ALLSTL+DKV+EGR+LYIATNE + 
Sbjct: 424 ASRLNWDYDAVHIERGEKARNKEVWPNLEADTSPSALLSTLQDKVEEGRHLYIATNEGEL 483

Query: 452 SFFDPLKDKYTTHFLDEYKDLWEENSEWYSETTKLNNGVPVEFDGYMRVSIDTEVFLRGK 511
           SFF+PLKDKY THFL +YKDLW+E+SEWYSETTKLN G PVEFDGYMR S+DTEVFLRGK
Sbjct: 484 SFFNPLKDKYATHFLYDYKDLWDESSEWYSETTKLNGGNPVEFDGYMRASVDTEVFLRGK 543

Query: 512 KQLETFNDLTSDCKDGINTCNAAAN 536
           KQ+ETFNDLT+DCKDG+ TCNAA +
Sbjct: 544 KQIETFNDLTNDCKDGVGTCNAATS 568
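
The percentages on the "Identities / Positives / Gaps" line of the AT2G04280.1 hit above can be recomputed from the raw counts. This is a small sketch, assuming the line layout shown in this report; `parse_stats` is a hypothetical helper name. In this report the printed percentages match the integer part of the ratio (for example 350/445 is about 78.7% and is printed as 78%), so the sketch checks them with floor division.

```python
import re

# Statistics line copied from the AT2G04280.1 hit above.
STATS = " Identities = 350/445 (78%), Positives = 397/445 (89%), Gaps = 5/445 (1%)"


def parse_stats(line):
    """Return {name: (count, alignment_length, printed_percent)}."""
    pattern = r"(\w+) = (\d+)/(\d+) \((\d+)%\)"
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in re.findall(pattern, line)
    }


stats = parse_stats(STATS)
for name, (count, length, percent) in stats.items():
    exact = 100.0 * count / length
    # Floor division reproduces the truncated value printed in this report.
    truncated = 100 * count // length
    print(f"{name}: {count}/{length} = {exact:.2f}% (printed {percent}%, floor {truncated}%)")
```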


>AT4G12700.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G04280.1); Has 136 Blast hits to 136 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 125; Viruses - 0; Other Eukaryotes -
           11 (source: NCBI BLink). | chr4:7482643-7484328 REVERSE
           LENGTH=561
          Length = 561

 Score =  731 bits (1886), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/444 (77%), Positives = 396/444 (89%), Gaps = 7/444 (1%)

Query: 96  PIDCRDPEVFHLMMRATIEKFQDIHFYRFGKPT--PGSNDSTCDMAWRFRPKEGKAAAFY 153
           PIDC+DPEVFHLMM+AT+EKF+D HFY+FGKP    GS+ S+CDMAWR+RPK+GKAAAFY
Sbjct: 122 PIDCKDPEVFHLMMKATMEKFKDSHFYKFGKPVIVEGSS-SSCDMAWRYRPKDGKAAAFY 180

Query: 154 KDYRRFVIEKSENCTLSVVSIGEYHSGPNARKRKKNQKAGLEKSTQKLDEANALPVVGEF 213
           KDYRRFVIEKS NC++SV+ IGEYHSG NARKRK+    G   S+    +  ALPVVGE 
Sbjct: 181 KDYRRFVIEKSGNCSVSVMGIGEYHSGVNARKRKR---PGFRNSSGGKVDDFALPVVGEA 237

Query: 214 VNDSLPVF-SESSFGQGKYLVYHGGGDRCKSMNHFLWSFLCALGEAQYLNRTLVMDLSIC 272
           VNDSLPV  SE+ F +G YLVY GGGDRCKSMNHFLWSFLCALGEAQYLNRTLVMDL++C
Sbjct: 238 VNDSLPVVESENVFKEGHYLVYSGGGDRCKSMNHFLWSFLCALGEAQYLNRTLVMDLTLC 297

Query: 273 LSSIYTSSKQDEEGKDFRFYFDFEHLKEAASVLDKEQFWADWNKWQQKDGMTQHLVEDFR 332
           LSS+YT S Q+EEGKDFRFYFDFEHLKEAAS+LD+ QFWADW KW +K+G+  HLVEDFR
Sbjct: 298 LSSVYTLSGQNEEGKDFRFYFDFEHLKEAASMLDQVQFWADWGKWYKKNGLKLHLVEDFR 357

Query: 333 VTPMKLMDVKDALIMRKFGSVEPDNYWYRVCEGETESVVQRPWHLLWKSRRLMDIVSAIA 392
           VTPMKL+DVKD LIMRKFG+VEPDNYWYRVCEGETESVVQRPW+LLWKS+RLM+IVSAIA
Sbjct: 358 VTPMKLVDVKDTLIMRKFGTVEPDNYWYRVCEGETESVVQRPWNLLWKSKRLMEIVSAIA 417

Query: 393 SRLNWDYDSVHVERGEKARNRELWPNLDADTSSDALLSTLRDKVDEGRNLYIATNEPDTS 452
           SRLNWDYD++H+ERG+KARN+E+WPNL+ DTS  ++LSTL+DK+++GRNLYIATNEP+ S
Sbjct: 418 SRLNWDYDAIHIERGDKARNKEVWPNLEKDTSPSSILSTLQDKIEQGRNLYIATNEPELS 477

Query: 453 FFDPLKDKYTTHFLDEYKDLWEENSEWYSETTKLNNGVPVEFDGYMRVSIDTEVFLRGKK 512
           FF+PLKDKY  HFLDE+KDLW+E+SEWYSETTKLN G PVEFDGYMR S+DTEVFLRGKK
Sbjct: 478 FFNPLKDKYKPHFLDEFKDLWDESSEWYSETTKLNGGNPVEFDGYMRASVDTEVFLRGKK 537

Query: 513 QLETFNDLTSDCKDGINTCNAAAN 536
           Q+ETFNDLT+DC+DGI TCN AA+
Sbjct: 538 QIETFNDLTNDCRDGIGTCNVAAS 561


>AT4G08810.1 | Symbols: SUB1 | calcium ion binding |
           chr4:5616204-5617862 REVERSE LENGTH=552
          Length = 552

 Score =  440 bits (1132), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 227/445 (51%), Positives = 306/445 (68%), Gaps = 13/445 (2%)

Query: 90  CEGSSGPIDCRDPEVFHLMMRATIEKFQDIHFYRFGKPTPGSNDSTCDMAWRFRPKEGKA 149
           C+     ++C DP V   + R  ++ F+ I F  +  P  GS    CD++WRFR K+ K+
Sbjct: 118 CDEDLKIVNCSDPRVLVAVERFNLKVFKSIVFLEYETPVNGSKLDECDVSWRFRNKKEKS 177

Query: 150 AAFYKDYRRFVIEKSENCTLSVVSIGEYHSGPNARKRKKNQKAGLEKSTQKLDEANALPV 209
              Y+D+RRF     ENCT  V     +HSG NAR+ + ++ +    +     E      
Sbjct: 178 WRRYRDFRRFRFGFGENCTYKVFHTSGWHSGVNARRPRISRPSSSRGARGGDSE------ 231

Query: 210 VGEFVNDSLPVF-SESSFGQGKYLVYHGGGDRCKSMNHFLWSFLCALGEAQYLNRTLVMD 268
               +ND++P   S++SF +GKYL Y  GGD CK MN ++WSFLC LGEA YLNRT VMD
Sbjct: 232 ----INDTIPTLGSQTSFRRGKYLYYSRGGDYCKGMNQYMWSFLCGLGEAMYLNRTFVMD 287

Query: 269 LSICLSSIYTSSKQDEEGKDFRFYFDFEHLKEAASVLDKEQFWADWNKWQQ--KDGMTQH 326
           LS+CLSS Y+S  +DEEGKDFR+YFDFEHLKE AS++++ +F  DW KW +  K  +   
Sbjct: 288 LSLCLSSSYSSKGKDEEGKDFRYYFDFEHLKETASIVEEGEFLRDWKKWNRLHKRKVPVR 347

Query: 327 LVEDFRVTPMKLMDVKDALIMRKFGSVEPDNYWYRVCEGETESVVQRPWHLLWKSRRLMD 386
            V+  RV+P++L   K  +I R+F + EP+NYWYRVCEG+    V+RPWH LWKS+RLM+
Sbjct: 348 KVKTHRVSPLQLSKDKSTIIWRQFDTPEPENYWYRVCEGQASKYVERPWHALWKSKRLMN 407

Query: 387 IVSAIASRLNWDYDSVHVERGEKARNRELWPNLDADTSSDALLSTLRDKVDEGRNLYIAT 446
           IVS I+ +++WD+D+VHV RGEKA+N++LWP+LDADT  DA+L+ L+  V   RNLY+AT
Sbjct: 408 IVSEISGKMDWDFDAVHVVRGEKAKNKKLWPHLDADTWPDAILTKLKGLVQVWRNLYVAT 467

Query: 447 NEPDTSFFDPLKDKYTTHFLDEYKDLWEENSEWYSETTKLNNGVPVEFDGYMRVSIDTEV 506
           NEP  ++FD L+ +Y  H LD+Y  LW   SEWY+ET+ LNNG PVEFDGYMRV++DTEV
Sbjct: 468 NEPFYNYFDKLRSQYKVHLLDDYSYLWGNKSEWYNETSLLNNGKPVEFDGYMRVAVDTEV 527

Query: 507 FLRGKKQLETFNDLTSDCKDGINTC 531
           F RGK ++ETF +LT+DCKDGINTC
Sbjct: 528 FYRGKTRVETFYNLTTDCKDGINTC 552


>AT3G56750.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G41150.2); Has 128 Blast hits to 128 proteins
           in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes -
           11 (source: NCBI BLink). | chr3:21018326-21020192
           REVERSE LENGTH=403
          Length = 403

 Score = 82.4 bits (202), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 75/317 (23%), Positives = 133/317 (41%), Gaps = 62/317 (19%)

Query: 241 CKSMNHFLWSFLCALGEAQYLNRTLVMDLSICLSSIY--------TSSKQDEEG-----K 287
           C  + H   S  CAL EA +LNRT VM   +C++ I+        + +K  EEG      
Sbjct: 87  CAGLGHQESSLRCALEEAMFLNRTFVMPSGMCINPIHNKKGILNRSDNKTTEEGWLGSSC 146

Query: 288 DFRFYFDFEHLKEAASV-LDKEQFWADWNKWQQK---------DGMTQHLVEDFRVTPMK 337
                +D + + E   V LD  + W        K          G+T+H +++   +   
Sbjct: 147 AMDSLYDIDLISEKIPVILDDSKTWHIVLSTSMKLGERGIAHVSGVTRHRLKESHYS--- 203

Query: 338 LMDVKDALIMRKFGSVEPDNYWYRVCEGET-ESVVQRPWHLL--WKSRRLMDIVSAIASR 394
                + LI+ +  S      W+  C+  +  S V  P+  L    + +L +    I ++
Sbjct: 204 -----NLLIINRTASPLA---WFVECKDRSNRSAVMLPYSFLPNMAAAKLRNAAEKIKAQ 255

Query: 395 LNWDYDSVHVERGEKARNRE--------LWPNLDADTSSDALLSTLRDKVDEGRNLYIAT 446
           L  DYD++HV RG+K + R+         +P+LD DT  + +L  +  ++  GR L+I +
Sbjct: 256 LG-DYDAIHVRRGDKLKTRKDRFGVERIQFPHLDRDTRPEFILRRIEKRIPRGRTLFIGS 314

Query: 447 NEPDTSFFDPLKDKYTTHFLDEYKDLWEENSEWYSETTKLNNGVPVEFDGYMRVSIDTEV 506
           NE    FF PL  +Y   +   + ++ +                P+  + Y    ++  V
Sbjct: 315 NERKPGFFSPLAVRYKLAYSSNFSEILD----------------PIIENNYQLFMMERLV 358

Query: 507 FLRGKKQLETFNDLTSD 523
            +  K   +TF +  +D
Sbjct: 359 MMGAKTYFKTFKEYETD 375
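
The bit score and E-value reported for the AT3G56750.1 hit above are related by the standard approximation E ≈ m · n · 2^(−S'), where S' is the bit score, m the query length and n the database size in letters. The sketch below plugs in the raw values from this report (536 query letters, 14,482,855 database letters); `approx_evalue` is a hypothetical helper name. BLAST itself uses edge-corrected effective lengths, so the result (on the order of 1e-15) is expected to be close to, not identical to, the reported 7e-16.

```python
QUERY_LEN = 536            # "(536 letters)" in the Query block
DB_LETTERS = 14_482_855    # "14,482,855 total letters" in the Database block


def approx_evalue(bit_score, m=QUERY_LEN, n=DB_LETTERS):
    """Approximate E-value from a bit score using raw (uncorrected) lengths."""
    return m * n * 2.0 ** (-bit_score)


estimate = approx_evalue(82.4)
print(f"approximate E-value for 82.4 bits: {estimate:.1e}")
```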


>AT2G41150.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT3G56750.1);
           Has 127 Blast hits to 127 proteins in 16 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117;
           Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink).
           | chr2:17153851-17155633 FORWARD LENGTH=404
          Length = 404

 Score = 82.0 bits (201), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 71/310 (22%), Positives = 131/310 (42%), Gaps = 47/310 (15%)

Query: 241 CKSMNHFLWSFLCALGEAQYLNRTLVMDLSICLSSIYTS----SKQDEEGKD-------- 288
           C  + H   S  CAL EA +LNRT VM   +C++ I+      ++ + E ++        
Sbjct: 87  CAGLGHQESSLRCALEEAMFLNRTFVMPSRMCINPIHNKKGILNRSNNETREESWEVSSC 146

Query: 289 -FRFYFDFEHLKEAASV-LDKEQFWADW--NKWQQKDGMTQHLVEDFRVTPMKLMDVKDA 344
                +D + + E   V LD  + W        + K+  + H+    R       D  + 
Sbjct: 147 AMESLYDIDLISEKIPVILDDSETWHIMLSTSMKLKERGSAHVYGANRHELNDSSDFTNL 206

Query: 345 LIMRKFGSVEPDNYWYRVCEGE-TESVVQRPWHLL--WKSRRLMDIVSAIASRLNWDYDS 401
           L++ +  S      W+  C+     S V  P+  L    + RL D    I ++L  DYD+
Sbjct: 207 LLINRTASPLA---WFVECKDRGNRSDVMLPYSFLQTMAASRLRDAAEKIKAKLG-DYDA 262

Query: 402 VHVERGEKARNRE--------LWPNLDADTSSDALLSTLRDKVDEGRNLYIATNEPDTSF 453
           +HV RG+K + R+         +P+LD DT  + ++  ++ ++  GR L+I +NE    F
Sbjct: 263 IHVRRGDKLKTRKDRFRVERSQFPHLDRDTRPEFIIGRIQKQIPPGRTLFIGSNERTPDF 322

Query: 454 FDPLKDKYTTHFLDEYKDLWEENSEWYSETTKLNNGVPVEFDGYMRVSIDTEVFLRGKKQ 513
           F PL  +Y   +   + ++ +                P+  + Y    ++  + +  K  
Sbjct: 323 FSPLAIRYKVAYSSNFSEILD----------------PIIENNYQLFMVERLIMMGAKTF 366

Query: 514 LETFNDLTSD 523
            +TF +  +D
Sbjct: 367 FKTFREYETD 376