Miyakogusa Predicted Gene

Lj1g3v2280630.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v2280630.1 Non Characterized Hit- tr|D7U522|D7U522_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,75.47,0.00000000000006,seg,NULL,CUFF.28784.1
         (452 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G42430.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   617   e-177
AT1G42430.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   592   e-169
AT3G55760.3 | Symbols:  | unknown protein; EXPRESSED IN: 16 plan...   201   1e-51
AT3G55760.2 | Symbols:  | unknown protein; LOCATED IN: chloropla...   201   1e-51
AT3G55760.1 | Symbols:  | unknown protein; LOCATED IN: chloropla...   201   1e-51

>AT1G42430.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G55760.3); Has 186 Blast hits to 143 proteins
           in 47 species: Archae - 0; Bacteria - 23; Metazoa - 14;
           Fungi - 6; Plants - 87; Viruses - 0; Other Eukaryotes -
           56 (source: NCBI BLink). | chr1:15891512-15894322
           FORWARD LENGTH=426
          Length = 426

 Score =  617 bits (1592), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 320/436 (73%), Positives = 350/436 (80%), Gaps = 32/436 (7%)

Query: 1   MAASSRGFSTAQFDFKLRARRLNVVAASLLLPSNHVLVESSCSXXXXXXTCFFDVDRA-- 58
           MAASS   + +  D KLR  R  V A      SNH L            T +F  D+A  
Sbjct: 4   MAASS---AISLLDIKLR--RFGVGA------SNHEL----------RLTKWFKGDQAGA 42

Query: 59  --RLRCCCSDSVTPIRRASGAGNGGDRSEEWRLDPKKNPHIHRMRVQXXXXXXXXXXXXX 116
             R   C +D + PIRR+       ++SEE R D K + H   ++               
Sbjct: 43  PTRRFTCFADMLAPIRRS-------EKSEERRFDQKMSAHGAGIKTSSSAVPFASPKSRF 95

Query: 117 XLKQEKFYPRCTPRNSGPQSRDTPPKRDTGIANEKDWGISLLNENVNESGTNEDGSTWYR 176
             KQEKFYPRCTPR +GPQSRDTPPKRDTGIANEKDWGI LLNENVNE+GTNEDGS+W+R
Sbjct: 96  LSKQEKFYPRCTPRLTGPQSRDTPPKRDTGIANEKDWGIDLLNENVNEAGTNEDGSSWFR 155

Query: 177 ENGEELGENGYRCRWTRMGGQSHDGSSEWKETWWEKSDWTGYKELGVEKSGRNSEGDSWW 236
           E+G +LG+NGYRCRW+RMGG+SHDGSSEW ETWWEKSDWTGYKELGVEKSG+NSEGDSWW
Sbjct: 156 ESGHDLGDNGYRCRWSRMGGRSHDGSSEWTETWWEKSDWTGYKELGVEKSGKNSEGDSWW 215

Query: 237 ETWQENLQQDEWSNIARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWTEKGAHKYGRLN 296
           ETWQE L QDEWSN+ARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWTEKGAHKYGRLN
Sbjct: 216 ETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWTEKGAHKYGRLN 275

Query: 297 EQSWWEKWGEHYDGRESVLKWTDKWAETELGTKWGDKWEERFFKGIGSRHGETWHVSPSS 356
           EQSWWEKWGEHYDGR SVLKWTDKWAETELGTKWGDKWEE+FF GIGSR GETWHVSP+S
Sbjct: 276 EQSWWEKWGEHYDGRGSVLKWTDKWAETELGTKWGDKWEEKFFSGIGSRQGETWHVSPNS 335

Query: 357 ERWSRTWGEEHFGNGKVHKYGNSTTGESWDIVVDEETYYEAEPHYGWADVVGDSSQLLSI 416
           +RWSRTWGEEHFGNGKVHKYG STTGESWDIVVDEETYYEAEPHYGWADVVGDS+QLLSI
Sbjct: 336 DRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEETYYEAEPHYGWADVVGDSTQLLSI 395

Query: 417 EPRERPPGVFPNLDFG 432
           +PRERPPGV+PNL+FG
Sbjct: 396 QPRERPPGVYPNLEFG 411


>AT1G42430.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT3G55760.3). |
           chr1:15891512-15894322 FORWARD LENGTH=409
          Length = 409

 Score =  592 bits (1525), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 310/436 (71%), Positives = 342/436 (78%), Gaps = 49/436 (11%)

Query: 1   MAASSRGFSTAQFDFKLRARRLNVVAASLLLPSNHVLVESSCSXXXXXXTCFFDVDRA-- 58
           MAASS   + +  D KLR  R  V A      SNH L            T +F  D+A  
Sbjct: 4   MAASS---AISLLDIKLR--RFGVGA------SNHEL----------RLTKWFKGDQAGA 42

Query: 59  --RLRCCCSDSVTPIRRASGAGNGGDRSEEWRLDPKKNPHIHRMRVQXXXXXXXXXXXXX 116
             R   C +D + PIRR+       ++SEE R D K + H   ++               
Sbjct: 43  PTRRFTCFADMLAPIRRS-------EKSEERRFDQKMSAHGAGIKTSSSA---------- 85

Query: 117 XLKQEKFYPRCTPRNSGPQSRDTPPKRDTGIANEKDWGISLLNENVNESGTNEDGSTWYR 176
                   P  +P+ +GPQSRDTPPKRDTGIANEKDWGI LLNENVNE+GTNEDGS+W+R
Sbjct: 86  -------VPFASPKLTGPQSRDTPPKRDTGIANEKDWGIDLLNENVNEAGTNEDGSSWFR 138

Query: 177 ENGEELGENGYRCRWTRMGGQSHDGSSEWKETWWEKSDWTGYKELGVEKSGRNSEGDSWW 236
           E+G +LG+NGYRCRW+RMGG+SHDGSSEW ETWWEKSDWTGYKELGVEKSG+NSEGDSWW
Sbjct: 139 ESGHDLGDNGYRCRWSRMGGRSHDGSSEWTETWWEKSDWTGYKELGVEKSGKNSEGDSWW 198

Query: 237 ETWQENLQQDEWSNIARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWTEKGAHKYGRLN 296
           ETWQE L QDEWSN+ARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWTEKGAHKYGRLN
Sbjct: 199 ETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWTEKGAHKYGRLN 258

Query: 297 EQSWWEKWGEHYDGRESVLKWTDKWAETELGTKWGDKWEERFFKGIGSRHGETWHVSPSS 356
           EQSWWEKWGEHYDGR SVLKWTDKWAETELGTKWGDKWEE+FF GIGSR GETWHVSP+S
Sbjct: 259 EQSWWEKWGEHYDGRGSVLKWTDKWAETELGTKWGDKWEEKFFSGIGSRQGETWHVSPNS 318

Query: 357 ERWSRTWGEEHFGNGKVHKYGNSTTGESWDIVVDEETYYEAEPHYGWADVVGDSSQLLSI 416
           +RWSRTWGEEHFGNGKVHKYG STTGESWDIVVDEETYYEAEPHYGWADVVGDS+QLLSI
Sbjct: 319 DRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEETYYEAEPHYGWADVVGDSTQLLSI 378

Query: 417 EPRERPPGVFPNLDFG 432
           +PRERPPGV+PNL+FG
Sbjct: 379 QPRERPPGVYPNLEFG 394


>AT3G55760.3 | Symbols:  | unknown protein; EXPRESSED IN: 16 plant
           structures; EXPRESSED DURING: 10 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G42430.2). | chr3:20700648-20702886 FORWARD
           LENGTH=578
          Length = 578

 Score =  201 bits (510), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 119/279 (42%), Positives = 175/279 (62%), Gaps = 19/279 (6%)

Query: 155 ISLLNENVNES---GTNEDGSTWYRENGEELGENGYRCRWTRMGGQSHDGSSEWKETWWE 211
           ++ + ++++ES   G +EDG  W+++ G E   +G  CRWT + G + DG  EW++ +WE
Sbjct: 297 VARVLDSLDESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWE 356

Query: 212 KSDWTGYKELGVEKSGRNSEGDSWWETWQENLQQDEWSNIARIERSAQKQAKSGTENAGW 271
            SD  G+KELG EKSGR++ G+ W E W+E++ Q+  + +  +E++A K  KSG  +  W
Sbjct: 357 ASDDFGFKELGSEKSGRDATGNVWREFWRESMSQE--NGVVHMEKTADKWGKSGQGDE-W 413

Query: 272 YEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRESVLKWTDKWA 322
            EKWWE YDA G +EK AHK+  ++  +         W E+WGE YDG+    K+TDKWA
Sbjct: 414 QEKWWEHYDATGKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWA 473

Query: 323 ETELG---TKWGDKWEERFFKGI-GSRHGETWHVSPSSERWSRTWGEEHFGNGKVHKYGN 378
           E  +G    KWGDKW+E F     G + GETW      +RW+R+WGE H G+G VHKYG 
Sbjct: 474 ERWVGDGWDKWGDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGK 533

Query: 379 STTGESWDIVVDEETYYEAEPHYGWADVVGDSSQLLSIE 417
           S++GE WD  V +ET+YE  PH+G+     +S QL +++
Sbjct: 534 SSSGEHWDTHVPQETWYEKFPHFGFFHCFDNSVQLRAVK 572


>AT3G55760.2 | Symbols:  | unknown protein; LOCATED IN: chloroplast;
           EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 10
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G42430.2); Has 176 Blast
           hits to 125 proteins in 40 species: Archae - 0; Bacteria
           - 3; Metazoa - 19; Fungi - 9; Plants - 81; Viruses - 0;
           Other Eukaryotes - 64 (source: NCBI BLink). |
           chr3:20700648-20702886 FORWARD LENGTH=578
          Length = 578

 Score =  201 bits (510), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 119/279 (42%), Positives = 175/279 (62%), Gaps = 19/279 (6%)

Query: 155 ISLLNENVNES---GTNEDGSTWYRENGEELGENGYRCRWTRMGGQSHDGSSEWKETWWE 211
           ++ + ++++ES   G +EDG  W+++ G E   +G  CRWT + G + DG  EW++ +WE
Sbjct: 297 VARVLDSLDESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWE 356

Query: 212 KSDWTGYKELGVEKSGRNSEGDSWWETWQENLQQDEWSNIARIERSAQKQAKSGTENAGW 271
            SD  G+KELG EKSGR++ G+ W E W+E++ Q+  + +  +E++A K  KSG  +  W
Sbjct: 357 ASDDFGFKELGSEKSGRDATGNVWREFWRESMSQE--NGVVHMEKTADKWGKSGQGDE-W 413

Query: 272 YEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRESVLKWTDKWA 322
            EKWWE YDA G +EK AHK+  ++  +         W E+WGE YDG+    K+TDKWA
Sbjct: 414 QEKWWEHYDATGKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWA 473

Query: 323 ETELG---TKWGDKWEERFFKGI-GSRHGETWHVSPSSERWSRTWGEEHFGNGKVHKYGN 378
           E  +G    KWGDKW+E F     G + GETW      +RW+R+WGE H G+G VHKYG 
Sbjct: 474 ERWVGDGWDKWGDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGK 533

Query: 379 STTGESWDIVVDEETYYEAEPHYGWADVVGDSSQLLSIE 417
           S++GE WD  V +ET+YE  PH+G+     +S QL +++
Sbjct: 534 SSSGEHWDTHVPQETWYEKFPHFGFFHCFDNSVQLRAVK 572


>AT3G55760.1 | Symbols:  | unknown protein; LOCATED IN: chloroplast
           stroma, chloroplast; EXPRESSED IN: 16 plant structures;
           EXPRESSED DURING: 10 growth stages; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G42430.2); Has 176 Blast hits to 125 proteins
           in 40 species: Archae - 0; Bacteria - 3; Metazoa - 19;
           Fungi - 9; Plants - 81; Viruses - 0; Other Eukaryotes -
           64 (source: NCBI BLink). | chr3:20700648-20702886
           FORWARD LENGTH=578
          Length = 578

 Score =  201 bits (510), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 119/279 (42%), Positives = 175/279 (62%), Gaps = 19/279 (6%)

Query: 155 ISLLNENVNES---GTNEDGSTWYRENGEELGENGYRCRWTRMGGQSHDGSSEWKETWWE 211
           ++ + ++++ES   G +EDG  W+++ G E   +G  CRWT + G + DG  EW++ +WE
Sbjct: 297 VARVLDSLDESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWE 356

Query: 212 KSDWTGYKELGVEKSGRNSEGDSWWETWQENLQQDEWSNIARIERSAQKQAKSGTENAGW 271
            SD  G+KELG EKSGR++ G+ W E W+E++ Q+  + +  +E++A K  KSG  +  W
Sbjct: 357 ASDDFGFKELGSEKSGRDATGNVWREFWRESMSQE--NGVVHMEKTADKWGKSGQGDE-W 413

Query: 272 YEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRESVLKWTDKWA 322
            EKWWE YDA G +EK AHK+  ++  +         W E+WGE YDG+    K+TDKWA
Sbjct: 414 QEKWWEHYDATGKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWA 473

Query: 323 ETELG---TKWGDKWEERFFKGI-GSRHGETWHVSPSSERWSRTWGEEHFGNGKVHKYGN 378
           E  +G    KWGDKW+E F     G + GETW      +RW+R+WGE H G+G VHKYG 
Sbjct: 474 ERWVGDGWDKWGDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGK 533

Query: 379 STTGESWDIVVDEETYYEAEPHYGWADVVGDSSQLLSIE 417
           S++GE WD  V +ET+YE  PH+G+     +S QL +++
Sbjct: 534 SSSGEHWDTHVPQETWYEKFPHFGFFHCFDNSVQLRAVK 572