Miyakogusa Predicted Gene

Lj5g3v2045980.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v2045980.1 Non Chatacterized Hit- tr|B9FBT2|B9FBT2_ORYSJ
Putative uncharacterized protein OS=Oryza sativa
subsp,46.39,1e-18,coiled-coil,NULL; seg,NULL; DUF1423,Protein of
unknown function DUF1423, plant,gene.g62874.t1.1
         (445 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G05410.1 | Symbols:  | Protein of unknown function (DUF1423) ...   329   2e-90
AT1G05410.2 | Symbols:  | Protein of unknown function (DUF1423) ...   276   2e-74
AT4G14840.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   105   6e-23
AT3G22520.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...   100   3e-21

>AT1G05410.1 | Symbols:  | Protein of unknown function (DUF1423) |
           chr1:1585504-1587268 REVERSE LENGTH=471
          Length = 471

 Score =  329 bits (844), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 186/461 (40%), Positives = 263/461 (57%), Gaps = 30/461 (6%)

Query: 3   LPPVAPEQSGQGLPYAPENFPNPGDVWSWRTGLRVAVTGFFKDRYLYPPANICRAVNSGS 62
           L PV+P +SG+GLPYAPEN+PNPGD W W+ G R++  G+F DRYLYPP  +   +++  
Sbjct: 19  LRPVSPLESGEGLPYAPENWPNPGDTWHWKVGPRISGKGYFVDRYLYPPKYL-PGLDTEI 77

Query: 63  SRRRITFASKLAVQRFVKESFPDANVQDFFASFSWKIPAICPGNGHAYAAEPISAVPPHL 122
            R+   F S+L++QR+++  FP+A+VQ FFASFSW IP     +G     +    +P + 
Sbjct: 78  LRKNKVFRSRLSLQRYIRVHFPEADVQKFFASFSWSIPC---RDGQGVLPQKQVQLPVY- 133

Query: 123 LAXXXXXXXXXXXXXXXVGCKVGNQRCRSLILEEVEKYSPAMPCDICCANPRFXXXXXXX 182
                              CK GN++CRSL+ +   +  PAMPCDICC   +F       
Sbjct: 134 ---SSDEDPMRDDGSDTAVCKAGNEKCRSLMPQCEAETLPAMPCDICCGERKFCVDCCCI 190

Query: 183 XXXKALDLAYDGYSYFKCPAKVGD-YICGHAVHFDCALRSYLAGTVGGIIGLDVEYLCMR 241
              K + L + GYSY KC A V + +ICGH  H +CALR+YLAGT+GG +GLD EY C R
Sbjct: 191 LCCKLISLEHGGYSYIKCEAVVSEGHICGHVAHMNCALRAYLAGTIGGSMGLDTEYYCRR 250

Query: 242 CDGKTDLISHVNKLLQTCEAIDADDDNKEKILNLGVSLLGESEKASAKELMRRIELAISK 301
           CD K DL  HVNK L+ C+ ++   D  EKILNLG+ +L  +++ +AKEL+  IE  + K
Sbjct: 251 CDAKKDLFPHVNKFLEICQTVEYQGD-VEKILNLGICILRGAQRDNAKELLNCIESTVIK 309

Query: 302 LKGGTSTGDITNVVDNPTDXXXXXXXXXXXXKESYLS----------------DFGRXXX 345
           LK GTS  D+ N  D PT              ++  S                +  +   
Sbjct: 310 LKCGTSLEDLWN-DDTPTIWSDYSDSGEARENDTLQSLQDVTPIGPIPFNHEAEMHKLEE 368

Query: 346 XXXXXXXYLRKSQENEFNLAEERLNEHKRYLQSLYQQLAFEKSELANEAHSSRSDALFHD 405
                   LRK+QE E+ +AE +L+  K  L  LY+QL  EKSEL+     + +++L  +
Sbjct: 369 EIGEVLRALRKAQEFEYQIAEGKLHAQKECLSDLYRQLEKEKSELSRRVSGTDANSLMTN 428

Query: 406 AAVGERVEQIRREVKKFEEMKKVAQGFGSTPKHNL-EYFGL 445
             V +R++QIR+EV K +EM++VA+GFG TP+  L EYF L
Sbjct: 429 --VLKRLDQIRKEVTKLKEMEEVAKGFGRTPRGVLEEYFHL 467


>AT1G05410.2 | Symbols:  | Protein of unknown function (DUF1423) |
           chr1:1585504-1587268 REVERSE LENGTH=444
          Length = 444

 Score =  276 bits (706), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 165/431 (38%), Positives = 237/431 (54%), Gaps = 30/431 (6%)

Query: 33  TGLRVAVTGFFKDRYLYPPANICRAVNSGSSRRRITFASKLAVQRFVKESFPDANVQDFF 92
            G R++  G+F DRYLYPP  +   +++   R+   F S+L++QR+++  FP+A+VQ FF
Sbjct: 22  VGPRISGKGYFVDRYLYPPKYL-PGLDTEILRKNKVFRSRLSLQRYIRVHFPEADVQKFF 80

Query: 93  ASFSWKIPAICPGNGHAYAAEPISAVPPHLLAXXXXXXXXXXXXXXXVGCKVGNQRCRSL 152
           ASFSW IP     +G     +    +P +                    CK GN++CRSL
Sbjct: 81  ASFSWSIPC---RDGQGVLPQKQVQLPVY----SSDEDPMRDDGSDTAVCKAGNEKCRSL 133

Query: 153 ILEEVEKYSPAMPCDICCANPRFXXXXXXXXXXKALDLAYDGYSYFKCPAKVGD-YICGH 211
           + +   +  PAMPCDICC   +F          K + L + GYSY KC A V + +ICGH
Sbjct: 134 MPQCEAETLPAMPCDICCGERKFCVDCCCILCCKLISLEHGGYSYIKCEAVVSEGHICGH 193

Query: 212 AVHFDCALRSYLAGTVGGIIGLDVEYLCMRCDGKTDLISHVNKLLQTCEAIDADDDNKEK 271
             H +CALR+YLAGT+GG +GLD EY C RCD K DL  HVNK L+ C+ ++   D  EK
Sbjct: 194 VAHMNCALRAYLAGTIGGSMGLDTEYYCRRCDAKKDLFPHVNKFLEICQTVEYQGD-VEK 252

Query: 272 ILNLGVSLLGESEKASAKELMRRIELAISKLKGGTSTGDITNVVDNPTDXXXXXXXXXXX 331
           ILNLG+ +L  +++ +AKEL+  IE  + KLK GTS  D+ N  D PT            
Sbjct: 253 ILNLGICILRGAQRDNAKELLNCIESTVIKLKCGTSLEDLWN-DDTPTIWSDYSDSGEAR 311

Query: 332 XKESYLS----------------DFGRXXXXXXXXXXYLRKSQENEFNLAEERLNEHKRY 375
             ++  S                +  +           LRK+QE E+ +AE +L+  K  
Sbjct: 312 ENDTLQSLQDVTPIGPIPFNHEAEMHKLEEEIGEVLRALRKAQEFEYQIAEGKLHAQKEC 371

Query: 376 LQSLYQQLAFEKSELANEAHSSRSDALFHDAAVGERVEQIRREVKKFEEMKKVAQGFGST 435
           L  LY+QL  EKSEL+     + +++L  +  V +R++QIR+EV K +EM++VA+GFG T
Sbjct: 372 LSDLYRQLEKEKSELSRRVSGTDANSLMTN--VLKRLDQIRKEVTKLKEMEEVAKGFGRT 429

Query: 436 PKHNL-EYFGL 445
           P+  L EYF L
Sbjct: 430 PRGVLEEYFHL 440


>AT4G14840.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G22520.1); Has 1681 Blast hits to 1532 proteins
           in 283 species: Archae - 19; Bacteria - 179; Metazoa -
           579; Fungi - 131; Plants - 223; Viruses - 5; Other
           Eukaryotes - 545 (source: NCBI BLink). |
           chr4:8511587-8513532 REVERSE LENGTH=555
          Length = 555

 Score =  105 bits (262), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 50/100 (50%), Positives = 64/100 (64%), Gaps = 6/100 (6%)

Query: 3   LPPVAPEQSGQGLPYAPENFPNPGDVWSWRTGLRVAVTGFFKDRYLYPPANICRAVNSGS 62
           LP + P  SGQGLP+AP +FP+PGDVW+WR G RV   GF KDR L  P  + +  N   
Sbjct: 68  LPAIPPASSGQGLPFAPVDFPSPGDVWTWRVGRRVNNAGFHKDRLLILPERL-KGKNVPK 126

Query: 63  SRRRITFASKLAVQRFVKESFPDANVQDFFASFSWKIPAI 102
           S     FASK  + R+++ SFPD +   FFASF+W IPA+
Sbjct: 127 S-----FASKNTLSRYLETSFPDMDANAFFASFTWNIPAL 161


>AT3G22520.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast
           stroma, chloroplast, chloroplast envelope; EXPRESSED IN:
           24 plant structures; EXPRESSED DURING: 13 growth stages;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT4G14840.1); Has 717 Blast hits to 703
           proteins in 179 species: Archae - 14; Bacteria - 134;
           Metazoa - 141; Fungi - 74; Plants - 209; Viruses - 0;
           Other Eukaryotes - 145 (source: NCBI BLink). |
           chr3:7974984-7977406 FORWARD LENGTH=600
          Length = 600

 Score = 99.8 bits (247), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 65/105 (61%), Gaps = 7/105 (6%)

Query: 3   LPPVAPEQSGQGLPYAPENFPNPGDVWSWRTGLRVAVTGFFKDRYLYPPANICRAVNSGS 62
           LP + P  +GQGLPYAP ++P+PGDVW+WR G RV   G+ +DR+L  P  + +     S
Sbjct: 102 LPAIPPVSTGQGLPYAPVDWPSPGDVWTWRVGRRVTAMGYHQDRFLILPQRLQQKHVPKS 161

Query: 63  SRRRITFASKLAVQRFVKESFPDANVQDFFASFSWKIPAIC-PGN 106
                 FASK  + R+++  FP  +   FFASFSWK+PA+  P N
Sbjct: 162 ------FASKPQLARYLESDFPGMDADAFFASFSWKVPALFQPAN 200