Miyakogusa Predicted Gene

Lj0g3v0258939.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0258939.1 Non Characterized Hit- tr|K3Z576|K3Z576_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si021694,33.72,2e-18,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.17174.1
         (414 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G23490.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   280   2e-75
AT5G08440.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   221   1e-57
AT5G08440.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   190   2e-48
AT3G03560.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...    82   9e-16

>AT5G23490.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G08440.1); Has 202 Blast hits to 197 proteins
           in 48 species: Archae - 0; Bacteria - 13; Metazoa - 25;
           Fungi - 9; Plants - 109; Viruses - 0; Other Eukaryotes -
           46 (source: NCBI BLink). | chr5:7919831-7926499 FORWARD
           LENGTH=729
          Length = 729

 Score =  280 bits (715), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 188/467 (40%), Positives = 244/467 (52%), Gaps = 94/467 (20%)

Query: 1   MENGHDGKLAEKFSGLAINQQHGQQGVHDQSNLSSNH--NESLYQVMKAVESAEVTIKQQ 58
           MENGH+ +LAE+FSGL         G  D S L  N   N++L+QV+KAVE+AE TIK  
Sbjct: 1   MENGHEERLAERFSGL---------GFEDSSLLPENEFKNDNLFQVIKAVEAAETTIK-- 49

Query: 59  RRNEQVDQNSHPWKEQ-----VYGSYEARQSIPSSAISNTSNYSGSSEIN---------- 103
              EQV++NS    E          Y++ +S+P +  SN  +++ S+ ++          
Sbjct: 50  ---EQVEENSRLKAELQRSALELAKYKSDESLPQT--SNIGDHTNSTTVSRLVHQPVDWK 104

Query: 104 ------------GTLRVQPNE--------------------------RLPVENTGNSQLS 125
                       G L V P+                           +  ++ TG SQ  
Sbjct: 105 PVVIKASDADSSGLLVVHPHVNANGEEATVSNRFESHSEETISNGTVKRAIDGTGPSQ-- 162

Query: 126 SPFTRSISPNRHLLGGDLDPQFNPPRQGLTPMAETNNSNTSLQQDLAIKVXXXXXXXXXX 185
             F  SISP R  L G+ D  F+    G  P+ E N+S  + +QDL  KV          
Sbjct: 163 --FDSSISPMRMRLEGEHDAHFSSSTHGSMPVGEVNHSGNAWKQDLIHKVQEQEQEISQL 220

Query: 186 XKHLADYAAKEAQIRNEKYVLDKRIAYMRVAFDQQQQDLVDAASKALSYRQDVIEENIRL 245
            ++L D + KEAQIRNEKYVL+KRIAYMR+AFDQQQQDLVDA+SKALSYRQ++IEENIRL
Sbjct: 221 RRYLTDCSVKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDASSKALSYRQEIIEENIRL 280

Query: 246 TYALQDAQQERSTFVSSLVPLLAEYSLQPNVLDAQSIVSNVKVLFKHXXXXXXXXXXXXX 305
           TYALQ  QQERSTFVS L+PLL+EYSLQP V DAQSIVSNVKVLFKH             
Sbjct: 281 TYALQATQQERSTFVSYLLPLLSEYSLQPQVSDAQSIVSNVKVLFKHLQEKLLLTETKLK 340

Query: 306 XXXYQLTPWRSDMNQNHATAATQSPSHSIGAPLATSNKNGLELVPRHIYSQVKTQVSVDT 365
              YQL PW+SD+  NH+  +  +PS S G  L  S K+ +       YS   T +    
Sbjct: 341 ESEYQLAPWQSDV--NHSNDSPLAPSRSAGVALTHSTKDSM-------YSHDHTAI---- 387

Query: 366 QAGTDWGMLGRHQSGLGGGVASNVDADDLERYSPL--ASRGILDLHI 410
               DW +  + Q   G     N   DD   +SPL  +     ++H+
Sbjct: 388 ----DWNLERQQQDEPGSSAVRNYHLDDSSTFSPLENSQSAAFEMHV 430


>AT5G08440.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G23490.1); Has 141 Blast
           hits to 139 proteins in 35 species: Archae - 0; Bacteria
           - 9; Metazoa - 21; Fungi - 6; Plants - 94; Viruses - 0;
           Other Eukaryotes - 11 (source: NCBI BLink). |
           chr5:2721037-2726970 FORWARD LENGTH=726
          Length = 726

 Score =  221 bits (562), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 129/264 (48%), Positives = 154/264 (58%), Gaps = 31/264 (11%)

Query: 136 RHLLGGDLDPQFNPPRQGLTPMAETNNSNTSLQQDLAIKVXXXXXXXXXXXKHLADYAAK 195
           R LL GD D   N     L P+ E NNS T+ +Q+L  KV           K+LADY+ K
Sbjct: 184 RPLLEGDHDLHINSSSHELMPVGEVNNSGTAWKQELIHKVQEQDQEILRLRKYLADYSTK 243

Query: 196 EAQIRNEKYVLDKRIAYMRVAFDQQQQDLVDAASKALSYRQDVIEENIRLTYALQDAQQE 255
           E QIRNEKYVL+KRIA+MR AFDQQQQDLVDAASKALSYRQ++IEENIRLTYALQ A+QE
Sbjct: 244 EVQIRNEKYVLEKRIAHMRSAFDQQQQDLVDAASKALSYRQEIIEENIRLTYALQAAEQE 303

Query: 256 RSTFVSSLVPLLAEYSLQPNVLDAQSIVSNVKVLFKHXXXXXXXXXXXXXXXXYQLTPWR 315
           RS FVS L+PLL+EYSL P + D+QSIVS+VKVLF+H                YQL PW+
Sbjct: 304 RSLFVSILLPLLSEYSLHPQISDSQSIVSSVKVLFRHLQEKLNVTETKLKETEYQLAPWQ 363

Query: 316 SDMNQNHATAATQSPSHSIGAPLATSNKNGLELVPRHIYSQVKTQVSVDTQAGTDWGMLG 375
           SD+N  H+ A+  SP   +G  L                     + S D++         
Sbjct: 364 SDVN--HSNASPLSPYQPVGVGL---------------------RYSTDSEH-------- 392

Query: 376 RHQSGLGGGVASNVDADDLERYSP 399
            HQ   GG  ASN   D  E  SP
Sbjct: 393 HHQDRRGGSAASNYHLDGPESRSP 416



 Score = 56.6 bits (135), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 63/106 (59%), Gaps = 10/106 (9%)

Query: 1   MENGHDGKLAEKFSGLAINQQHGQQGVHDQSNLSSNHNESLYQVMKAVESAEVTIKQQRR 60
           M+NGH+ +LAE+FSG+ + +  G       S+ +   N+SL+QV+KAVE+AE TIKQQ  
Sbjct: 1   MDNGHEERLAERFSGVGLGESSG-------SHENDVKNDSLFQVIKAVEAAEATIKQQVE 53

Query: 61  NEQVDQNSHPWKEQVYGSYEARQSIP-SSAISNTSNYS--GSSEIN 103
              + +     +      Y++ +S+P +S + N SN +  GSS ++
Sbjct: 54  ENNLLKAELQRRYLELAKYKSGESLPQTSDLGNHSNTTTGGSSPLH 99


>AT5G08440.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 21 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G23490.1). | chr5:2721037-2726970 FORWARD
           LENGTH=772
          Length = 772

 Score =  190 bits (482), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 98/153 (64%), Positives = 115/153 (75%)

Query: 136 RHLLGGDLDPQFNPPRQGLTPMAETNNSNTSLQQDLAIKVXXXXXXXXXXXKHLADYAAK 195
           R LL GD D   N     L P+ E NNS T+ +Q+L  KV           K+LADY+ K
Sbjct: 184 RPLLEGDHDLHINSSSHELMPVGEVNNSGTAWKQELIHKVQEQDQEILRLRKYLADYSTK 243

Query: 196 EAQIRNEKYVLDKRIAYMRVAFDQQQQDLVDAASKALSYRQDVIEENIRLTYALQDAQQE 255
           E QIRNEKYVL+KRIA+MR AFDQQQQDLVDAASKALSYRQ++IEENIRLTYALQ A+QE
Sbjct: 244 EVQIRNEKYVLEKRIAHMRSAFDQQQQDLVDAASKALSYRQEIIEENIRLTYALQAAEQE 303

Query: 256 RSTFVSSLVPLLAEYSLQPNVLDAQSIVSNVKV 288
           RS FVS L+PLL+EYSL P + D+QSIVS+VK+
Sbjct: 304 RSLFVSILLPLLSEYSLHPQISDSQSIVSSVKI 336



 Score = 56.6 bits (135), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 63/106 (59%), Gaps = 10/106 (9%)

Query: 1   MENGHDGKLAEKFSGLAINQQHGQQGVHDQSNLSSNHNESLYQVMKAVESAEVTIKQQRR 60
           M+NGH+ +LAE+FSG+ + +  G       S+ +   N+SL+QV+KAVE+AE TIKQQ  
Sbjct: 1   MDNGHEERLAERFSGVGLGESSG-------SHENDVKNDSLFQVIKAVEAAEATIKQQVE 53

Query: 61  NEQVDQNSHPWKEQVYGSYEARQSIP-SSAISNTSNYS--GSSEIN 103
              + +     +      Y++ +S+P +S + N SN +  GSS ++
Sbjct: 54  ENNLLKAELQRRYLELAKYKSGESLPQTSDLGNHSNTTTGGSSPLH 99


>AT3G03560.1 | Symbols:  | unknown protein; LOCATED IN: plasma
           membrane; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G23490.1);
           Has 157 Blast hits to 146 proteins in 38 species: Archae
           - 3; Bacteria - 14; Metazoa - 8; Fungi - 0; Plants -
           120; Viruses - 0; Other Eukaryotes - 12 (source: NCBI
           BLink). | chr3:853153-856486 REVERSE LENGTH=521
          Length = 521

 Score = 81.6 bits (200), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 48/133 (36%), Positives = 77/133 (57%), Gaps = 5/133 (3%)

Query: 162 NSNTSLQQD-----LAIKVXXXXXXXXXXXKHLADYAAKEAQIRNEKYVLDKRIAYMRVA 216
           ++NT L QD     L  KV           + +A    K+ Q+ NEKY L+++ A +RVA
Sbjct: 27  DTNTKLIQDPEEMALYAKVRSQEEEIHSLQERIAAACLKDMQLLNEKYGLERKCADLRVA 86

Query: 217 FDQQQQDLVDAASKALSYRQDVIEENIRLTYALQDAQQERSTFVSSLVPLLAEYSLQPNV 276
            D++Q + V +A   L+ R+  +EEN++L + L+  + ER  F++SL+ LLAEY + P V
Sbjct: 87  IDEKQNESVTSALNELARRKGDLEENLKLAHDLKVTEDERYIFMTSLLGLLAEYGVWPRV 146

Query: 277 LDAQSIVSNVKVL 289
            +A +I S +K L
Sbjct: 147 ANATAISSGIKHL 159