Miyakogusa Predicted Gene

Lj1g3v0095210.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v0095210.1 Non Characterized Hit- tr|Q94GQ3|Q94GQ3_ORYSJ
Putative uncharacterized protein OJ1124_H03.11
OS=Oryz,46.39,1e-18,DUF1423,Protein of unknown function DUF1423,
plant; coiled-coil,NULL; seg,NULL,gene.g28806.t1.1
         (460 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G05410.1 | Symbols:  | Protein of unknown function (DUF1423) ...   331   5e-91
AT1G05410.2 | Symbols:  | Protein of unknown function (DUF1423) ...   277   1e-74
AT4G14840.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   105   5e-23
AT3G22520.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...   100   3e-21

>AT1G05410.1 | Symbols:  | Protein of unknown function (DUF1423) |
           chr1:1585504-1587268 REVERSE LENGTH=471
          Length = 471

 Score =  331 bits (849), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 190/479 (39%), Positives = 271/479 (56%), Gaps = 31/479 (6%)

Query: 1   MEDEGINASGNNAAP-MNLPPVAPEQSGQGLPYAPENFPNPGDVWSWRTGLRVAVTGFFK 59
           M+   I+ S    +P + L PV+P +SG+GLPYAPEN+PNPGD W W+ G R++  G+F 
Sbjct: 1   MDSMDIDQSNIGESPHLLLRPVSPLESGEGLPYAPENWPNPGDTWHWKVGPRISGKGYFV 60

Query: 60  DRYLYPPANICRAVNSGSSRRRITFASKLAVQRFVKESFPDANVQDFFASFSWKIPAICP 119
           DRYLYPP  +   +++   R+   F S+L++QR+++  FP+A+VQ FFASFSW IP    
Sbjct: 61  DRYLYPPKYL-PGLDTEILRKNKVFRSRLSLQRYIRVHFPEADVQKFFASFSWSIPC--- 116

Query: 120 GNGHAYAAEPISAVPPHLLAXXXXXXXXXXXXXXXVGCKVGNQRCRSLILEEVEKYSPAM 179
            +G     +    +P +                    CK GN++CRSL+ +   +  PAM
Sbjct: 117 RDGQGVLPQKQVQLPVY----SSDEDPMRDDGSDTAVCKAGNEKCRSLMPQCEAETLPAM 172

Query: 180 PCDICCANPRFXXXXXXXXXXKALDLAYDGYSYFKCPAKVGD-YICGHAVHFDCALRSYL 238
           PCDICC   +F          K + L + GYSY KC A V + +ICGH  H +CALR+YL
Sbjct: 173 PCDICCGERKFCVDCCCILCCKLISLEHGGYSYIKCEAVVSEGHICGHVAHMNCALRAYL 232

Query: 239 AGTVGGIIGLDVEYLCMRCDGKTDLISHVNKLLQTCEAIDADDDNKEKILNLGVSLLGES 298
           AGT+GG +GLD EY C RCD K DL  HVNK L+ C+ ++   D  EKILNLG+ +L  +
Sbjct: 233 AGTIGGSMGLDTEYYCRRCDAKKDLFPHVNKFLEICQTVEYQGD-VEKILNLGICILRGA 291

Query: 299 EKASAKELMRRIELAISKLKGGTSTGDITNVVDNPTDXXXXXXXXXXXXKESYLS----- 353
           ++ +AKEL+  IE  + KLK GTS  D+ N  D PT              ++  S     
Sbjct: 292 QRDNAKELLNCIESTVIKLKCGTSLEDLWN-DDTPTIWSDYSDSGEARENDTLQSLQDVT 350

Query: 354 -----------DFGRXXXXXXXXXXYLRKSQENEFNLAEERLNEHKRYLQSLYQQLAFEK 402
                      +  +           LRK+QE E+ +AE +L+  K  L  LY+QL  EK
Sbjct: 351 PIGPIPFNHEAEMHKLEEEIGEVLRALRKAQEFEYQIAEGKLHAQKECLSDLYRQLEKEK 410

Query: 403 SELANEAHSSRSDALFHDAAVGERVEQIRREVKKFEEMKKVAQGFGSTPKHNL-EYFGL 460
           SEL+     + +++L  +  V +R++QIR+EV K +EM++VA+GFG TP+  L EYF L
Sbjct: 411 SELSRRVSGTDANSLMTN--VLKRLDQIRKEVTKLKEMEEVAKGFGRTPRGVLEEYFHL 467


>AT1G05410.2 | Symbols:  | Protein of unknown function (DUF1423) |
           chr1:1585504-1587268 REVERSE LENGTH=444
          Length = 444

 Score =  277 bits (708), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 165/431 (38%), Positives = 237/431 (54%), Gaps = 30/431 (6%)

Query: 48  TGLRVAVTGFFKDRYLYPPANICRAVNSGSSRRRITFASKLAVQRFVKESFPDANVQDFF 107
            G R++  G+F DRYLYPP  +   +++   R+   F S+L++QR+++  FP+A+VQ FF
Sbjct: 22  VGPRISGKGYFVDRYLYPPKYL-PGLDTEILRKNKVFRSRLSLQRYIRVHFPEADVQKFF 80

Query: 108 ASFSWKIPAICPGNGHAYAAEPISAVPPHLLAXXXXXXXXXXXXXXXVGCKVGNQRCRSL 167
           ASFSW IP     +G     +    +P +                    CK GN++CRSL
Sbjct: 81  ASFSWSIPC---RDGQGVLPQKQVQLPVY----SSDEDPMRDDGSDTAVCKAGNEKCRSL 133

Query: 168 ILEEVEKYSPAMPCDICCANPRFXXXXXXXXXXKALDLAYDGYSYFKCPAKVGD-YICGH 226
           + +   +  PAMPCDICC   +F          K + L + GYSY KC A V + +ICGH
Sbjct: 134 MPQCEAETLPAMPCDICCGERKFCVDCCCILCCKLISLEHGGYSYIKCEAVVSEGHICGH 193

Query: 227 AVHFDCALRSYLAGTVGGIIGLDVEYLCMRCDGKTDLISHVNKLLQTCEAIDADDDNKEK 286
             H +CALR+YLAGT+GG +GLD EY C RCD K DL  HVNK L+ C+ ++   D  EK
Sbjct: 194 VAHMNCALRAYLAGTIGGSMGLDTEYYCRRCDAKKDLFPHVNKFLEICQTVEYQGD-VEK 252

Query: 287 ILNLGVSLLGESEKASAKELMRRIELAISKLKGGTSTGDITNVVDNPTDXXXXXXXXXXX 346
           ILNLG+ +L  +++ +AKEL+  IE  + KLK GTS  D+ N  D PT            
Sbjct: 253 ILNLGICILRGAQRDNAKELLNCIESTVIKLKCGTSLEDLWN-DDTPTIWSDYSDSGEAR 311

Query: 347 XKESYLS----------------DFGRXXXXXXXXXXYLRKSQENEFNLAEERLNEHKRY 390
             ++  S                +  +           LRK+QE E+ +AE +L+  K  
Sbjct: 312 ENDTLQSLQDVTPIGPIPFNHEAEMHKLEEEIGEVLRALRKAQEFEYQIAEGKLHAQKEC 371

Query: 391 LQSLYQQLAFEKSELANEAHSSRSDALFHDAAVGERVEQIRREVKKFEEMKKVAQGFGST 450
           L  LY+QL  EKSEL+     + +++L  +  V +R++QIR+EV K +EM++VA+GFG T
Sbjct: 372 LSDLYRQLEKEKSELSRRVSGTDANSLMTN--VLKRLDQIRKEVTKLKEMEEVAKGFGRT 429

Query: 451 PKHNL-EYFGL 460
           P+  L EYF L
Sbjct: 430 PRGVLEEYFHL 440


>AT4G14840.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G22520.1); Has 1681 Blast hits to 1532 proteins
           in 283 species: Archae - 19; Bacteria - 179; Metazoa -
           579; Fungi - 131; Plants - 223; Viruses - 5; Other
           Eukaryotes - 545 (source: NCBI BLink). |
           chr4:8511587-8513532 REVERSE LENGTH=555
          Length = 555

 Score =  105 bits (263), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 53/117 (45%), Positives = 72/117 (61%), Gaps = 7/117 (5%)

Query: 2   EDEGINASGNNAAPMN-LPPVAPEQSGQGLPYAPENFPNPGDVWSWRTGLRVAVTGFFKD 60
           +D  +N   ++   +N LP + P  SGQGLP+AP +FP+PGDVW+WR G RV   GF KD
Sbjct: 51  DDVKVNGDRSSFTDLNQLPAIPPASSGQGLPFAPVDFPSPGDVWTWRVGRRVNNAGFHKD 110

Query: 61  RYLYPPANICRAVNSGSSRRRITFASKLAVQRFVKESFPDANVQDFFASFSWKIPAI 117
           R L  P  + +  N   S     FASK  + R+++ SFPD +   FFASF+W IPA+
Sbjct: 111 RLLILPERL-KGKNVPKS-----FASKNTLSRYLETSFPDMDANAFFASFTWNIPAL 161


>AT3G22520.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast
           stroma, chloroplast, chloroplast envelope; EXPRESSED IN:
           24 plant structures; EXPRESSED DURING: 13 growth stages;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT4G14840.1); Has 717 Blast hits to 703
           proteins in 179 species: Archae - 14; Bacteria - 134;
           Metazoa - 141; Fungi - 74; Plants - 209; Viruses - 0;
           Other Eukaryotes - 145 (source: NCBI BLink). |
           chr3:7974984-7977406 FORWARD LENGTH=600
          Length = 600

 Score =  100 bits (248), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 65/105 (61%), Gaps = 7/105 (6%)

Query: 18  LPPVAPEQSGQGLPYAPENFPNPGDVWSWRTGLRVAVTGFFKDRYLYPPANICRAVNSGS 77
           LP + P  +GQGLPYAP ++P+PGDVW+WR G RV   G+ +DR+L  P  + +     S
Sbjct: 102 LPAIPPVSTGQGLPYAPVDWPSPGDVWTWRVGRRVTAMGYHQDRFLILPQRLQQKHVPKS 161

Query: 78  SRRRITFASKLAVQRFVKESFPDANVQDFFASFSWKIPAIC-PGN 121
                 FASK  + R+++  FP  +   FFASFSWK+PA+  P N
Sbjct: 162 ------FASKPQLARYLESDFPGMDADAFFASFSWKVPALFQPAN 200
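
The per-alignment statistics lines in this report ("Score = ...", "Identities = ...") follow a regular layout, so they can be pulled out with a small script. The following is a minimal sketch, not part of the BLAST output itself: the names `parse_stats`, `truncated_percent`, `SCORE_RE` and `FRACTION_RE` are hypothetical, and the regular expressions are assumed to match only the layout seen in this report (other BLAST versions or output formats may differ). The sample text is copied from the first alignment (AT1G05410.1).

```python
import re

# Statistics lines copied verbatim from the AT1G05410.1 alignment in this report.
STATS = """\
 Score =  331 bits (849), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 190/479 (39%), Positives = 271/479 (56%), Gaps = 31/479 (6%)
"""

# Assumed layout: "Score = <bits> bits (<raw>), Expect = <evalue>,"
SCORE_RE = re.compile(r"Score\s*=\s*([\d.]+) bits \((\d+)\), Expect\s*=\s*(\S+?),")
# Assumed layout: "<Name> = <count>/<alignment length>"
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")


def parse_stats(text):
    """Return bit score, raw score, E-value and the three count/length pairs."""
    match = SCORE_RE.search(text)
    if match is None:
        raise ValueError("no 'Score = ...' line found")
    stats = {
        "bits": float(match.group(1)),
        "raw": int(match.group(2)),
        "evalue": float(match.group(3)),
    }
    for name, count, length in FRACTION_RE.findall(text):
        stats[name.lower()] = (int(count), int(length))
    return stats


def truncated_percent(count, length):
    """Integer percentage rounded down, which reproduces the values printed here."""
    return int(100 * count / length)


if __name__ == "__main__":
    stats = parse_stats(STATS)
    print(stats)
    for key in ("identities", "positives", "gaps"):
        print(key, truncated_percent(*stats[key]))
```

For this alignment, 190/479 is about 39.7%, while the report prints 39%. The percentages shown in this report are therefore consistent with rounding down rather than rounding to nearest, which is why the sketch truncates.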