Miyakogusa Predicted Gene

Lj5g3v2112060.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v2112060.1 Non Chatacterized Hit- tr|F6H0B1|F6H0B1_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,26.83,1e-18,coiled-coil,NULL; seg,NULL; DUF760,Protein of unknown
function DUF760,CUFF.56696.1
         (378 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G14910.1 | Symbols:  | unknown protein; LOCATED IN: chloropla...   422   e-118
AT2G14910.2 | Symbols:  | unknown protein; LOCATED IN: chloropla...   370   e-103
AT5G14970.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   199   2e-51
AT1G63610.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   103   2e-22
AT1G63610.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   100   2e-21

>AT2G14910.1 | Symbols:  | unknown protein; LOCATED IN: chloroplast;
           EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G14970.1); Has 605 Blast
           hits to 425 proteins in 102 species: Archae - 0;
           Bacteria - 300; Metazoa - 25; Fungi - 0; Plants - 89;
           Viruses - 0; Other Eukaryotes - 191 (source: NCBI
           BLink). | chr2:6406957-6409015 REVERSE LENGTH=386
          Length = 386

 Score =  422 bits (1086), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 227/333 (68%), Positives = 262/333 (78%), Gaps = 16/333 (4%)

Query: 53  SFGDFT-----SKPKKSVLRELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSD 107
           S  DFT       PKK VL +LIQEIEPLDVS IQKDVP TT DAMKRTISGMLGLLPSD
Sbjct: 63  SLDDFTLHSDSRSPKKCVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGMLGLLPSD 122

Query: 108 QFHVVIEALWEPLSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQ 167
           +F V IE+LWEPLSKLL+SSMMTGYTLRNAEYRL LEKNLDM    L+   +E+T  D++
Sbjct: 123 RFQVHIESLWEPLSKLLVSSMMTGYTLRNAEYRLFLEKNLDMSGGGLDSHASENTEYDME 182

Query: 168 GLLHDSVNVIDFGRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELH 227
           G   D  +V      S   S+ + + E +D + LG +SSEAQ+YIL LQS+LSS+KKEL 
Sbjct: 183 GTFPDEDHV-----SSKRDSRTQNLSETIDEEGLGRVSSEAQEYILRLQSQLSSVKKELQ 237

Query: 228 EVKRKNAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATL 287
           E++RKNAALQMQQFVGEEKNDLLDYLRSLQPE+VA+LSE  +PE+KE I SVVHGLLATL
Sbjct: 238 EMRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLLATL 297

Query: 288 SPKMHSK-PSTMSENATIGTANAGS-EDCAEVVENSSIQFHPVISLTRDYLARLLFWCML 345
           SPKMHSK P+  SE     T  A S EDCAE+VEN+S+QF P+ISLTRDYLARLLFWCML
Sbjct: 298 SPKMHSKFPA--SEVPPTETVKAKSDEDCAELVENTSLQFQPLISLTRDYLARLLFWCML 355

Query: 346 LGHYLRGLEYRVDLTELLSLTSDAENNGNEQIA 378
           LGHYLRGLEYR++L E+LSLT DA  NG+E +A
Sbjct: 356 LGHYLRGLEYRMELMEVLSLTCDA--NGSENVA 386


>AT2G14910.2 | Symbols:  | unknown protein; LOCATED IN: chloroplast;
           EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G14970.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr2:6407195-6409015 REVERSE LENGTH=366
          Length = 366

 Score =  370 bits (951), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 201/297 (67%), Positives = 231/297 (77%), Gaps = 14/297 (4%)

Query: 53  SFGDFT-----SKPKKSVLRELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSD 107
           S  DFT       PKK VL +LIQEIEPLDVS IQKDVP TT DAMKRTISGMLGLLPSD
Sbjct: 63  SLDDFTLHSDSRSPKKCVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGMLGLLPSD 122

Query: 108 QFHVVIEALWEPLSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQ 167
           +F V IE+LWEPLSKLL+SSMMTGYTLRNAEYRL LEKNLDM    L+   +E+T  D++
Sbjct: 123 RFQVHIESLWEPLSKLLVSSMMTGYTLRNAEYRLFLEKNLDMSGGGLDSHASENTEYDME 182

Query: 168 GLLHDSVNVIDFGRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELH 227
           G   D  +V      S   S+ + + E +D + LG +SSEAQ+YIL LQS+LSS+KKEL 
Sbjct: 183 GTFPDEDHV-----SSKRDSRTQNLSETIDEEGLGRVSSEAQEYILRLQSQLSSVKKELQ 237

Query: 228 EVKRKNAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATL 287
           E++RKNAALQMQQFVGEEKNDLLDYLRSLQPE+VA+LSE  +PE+KE I SVVHGLLATL
Sbjct: 238 EMRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLLATL 297

Query: 288 SPKMHSK-PSTMSENATIGTANAGS-EDCAEVVENSSIQFHPVISLTRDYLARLLFW 342
           SPKMHSK P+  SE     T  A S EDCAE+VEN+S+QF P+ISLTRDYLARLLFW
Sbjct: 298 SPKMHSKFPA--SEVPPTETVKAKSDEDCAELVENTSLQFQPLISLTRDYLARLLFW 352


>AT5G14970.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14910.1); Has 579 Blast hits to 397 proteins
           in 95 species: Archae - 0; Bacteria - 294; Metazoa - 0;
           Fungi - 0; Plants - 86; Viruses - 0; Other Eukaryotes -
           199 (source: NCBI BLink). | chr5:4847368-4848760 FORWARD
           LENGTH=355
          Length = 355

 Score =  199 bits (506), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 128/307 (41%), Positives = 179/307 (58%), Gaps = 35/307 (11%)

Query: 64  SVLRELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKL 123
           +V+  L+  I+PLD S I K +  +  D+MK+TIS MLGLLPSDQF V +    +PL +L
Sbjct: 78  AVIANLVNRIQPLDTSVISKGLSDSAKDSMKQTISSMLGLLPSDQFSVSVTISEQPLYRL 137

Query: 124 LISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLHD--SVNVIDFGR 181
           LISS++TGYTL NAEYR+ L +N D+             P+D +    D  S + + FG 
Sbjct: 138 LISSIITGYTLWNAEYRVSLRRNFDI-------------PIDPRKEEEDQSSKDNVRFGS 184

Query: 182 KSNLSSKVEKVHEDVDI---QDLGEISSEAQQYILNLQSRLSSMKKELHEVKRKNAALQM 238
           +  +S  +    E+ +    Q  G++S EA  YI  LQS LSSMK+EL   K+K  AL++
Sbjct: 185 EKGMSEDLGNCVEEFERLSPQVFGDLSPEALSYIQLLQSELSSMKEELDSQKKK--ALRI 242

Query: 239 QQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTM 298
           +   G  +NDLLDYLRSL PE V +LS+ +SPE++EI+  +V  +L  L     +  S  
Sbjct: 243 ECEKG-NRNDLLDYLRSLDPEMVTELSQLSSPEVEEIVNQLVQNVLERLFEDQTT--SNF 299

Query: 299 SENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVD 358
            +N  I T   G     +V              +RDYLA+LLFWCMLLGH+LRGLE R+ 
Sbjct: 300 MQNPGIRTTEGGDGTGRKV------------DTSRDYLAKLLFWCMLLGHHLRGLENRLH 347

Query: 359 LTELLSL 365
           L+ ++ L
Sbjct: 348 LSCVVGL 354


>AT1G63610.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G14910.1); Has 537 Blast
           hits to 411 proteins in 100 species: Archae - 0;
           Bacteria - 231; Metazoa - 0; Fungi - 0; Plants - 94;
           Viruses - 0; Other Eukaryotes - 212 (source: NCBI
           BLink). | chr1:23583675-23585599 REVERSE LENGTH=340
          Length = 340

 Score =  103 bits (257), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 83/314 (26%), Positives = 153/314 (48%), Gaps = 61/314 (19%)

Query: 51  TPSFGDFTSKPKKSVLRELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFH 110
           TP  G    K ++ +L E +Q ++P  +    K  P    +AM++T++ M+G LP   F 
Sbjct: 74  TPPNGTRQPKSRRDILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQTVTNMIGTLPPQFFA 133

Query: 111 VVIEALWEPLSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLL 170
           V + ++ E L++L++S +MTGY  RNA+YRL L+++L+          A   P D +G  
Sbjct: 134 VTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLEQV--------ALPEPRDQKGGD 185

Query: 171 HDSVNVIDFGRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELHEVK 230
            D       G + N+S +V + +     + +     +A++YI  L++ +  + +   +V 
Sbjct: 186 EDYAP----GTQKNVSGEVIRWNNVSGPEKI-----DAKKYIELLEAEIEELNR---QVG 233

Query: 231 RKNAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPK 290
           RK+A          ++N++L+YL+SL+P+ + +L+     ++   + + V  LLA     
Sbjct: 234 RKSA---------NQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAV---- 280

Query: 291 MHSKPSTMSENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYL 350
             S P+ M  N T               E S+             LA+LL+W M++G+ +
Sbjct: 281 --SDPNQMKTNVT---------------ETSAAD-----------LAKLLYWLMVVGYSI 312

Query: 351 RGLEYRVDLTELLS 364
           R +E R D+  +L 
Sbjct: 313 RNIEVRFDMERVLG 326


>AT1G63610.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G14910.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr1:23583675-23585599 REVERSE LENGTH=341
          Length = 341

 Score =  100 bits (248), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 80/305 (26%), Positives = 150/305 (49%), Gaps = 61/305 (20%)

Query: 60  KPKKSVLRELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEP 119
           K ++ +L E +Q ++P  +    K  P    +AM++T++ M+G LP   F V + ++ E 
Sbjct: 84  KSRRDILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQTVTNMIGTLPPQFFAVTVTSVAEN 143

Query: 120 LSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLHDSVNVIDF 179
           L++L++S +MTGY  RNA+YRL L+++L+          A   P D +G   D       
Sbjct: 144 LAQLMMSVLMTGYMFRNAQYRLELQQSLEQV--------ALPEPRDQKGGDEDYAP---- 191

Query: 180 GRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELHEVKRKNAALQMQ 239
           G + N+S +V + +     + +     +A++YI  L++ +  + +   +V RK+A     
Sbjct: 192 GTQKNVSGEVIRWNNVSGPEKI-----DAKKYIELLEAEIEELNR---QVGRKSA----- 238

Query: 240 QFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMS 299
                ++N++L+YL+SL+P+ + +L+     ++   + + V  LLA       S P+ M 
Sbjct: 239 ----NQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAV------SDPNQMK 288

Query: 300 ENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVDL 359
            N T               E S+             LA+LL+W M++G+ +R +E R D+
Sbjct: 289 TNVT---------------ETSAAD-----------LAKLLYWLMVVGYSIRNIEVRFDM 322

Query: 360 TELLS 364
             +L 
Sbjct: 323 ERVLG 327