Miyakogusa Predicted Gene

Lj4g3v1464060.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v1464060.1 Non Chatacterized Hit- tr|D7M2X3|D7M2X3_ARALL
Putative uncharacterized protein OS=Arabidopsis
lyrata,30,6e-18,SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.49293.1
         (533 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G48310.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   196   3e-50
AT5G48310.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   186   2e-47
AT4G24610.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    95   1e-19
AT4G24610.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    95   1e-19
AT5G65440.3 | Symbols:  | unknown protein; INVOLVED IN: biologic...    69   6e-12
AT5G65440.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...    69   7e-12
AT5G65440.2 | Symbols:  | unknown protein; INVOLVED IN: biologic...    69   9e-12

>AT5G48310.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 18 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G24610.1). | chr5:19574961-19580362 REVERSE
           LENGTH=1129
          Length = 1129

 Score =  196 bits (499), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 143/400 (35%), Positives = 201/400 (50%), Gaps = 51/400 (12%)

Query: 120 DEDQLFGCKQPQQNPKPSGRNNGILRKGLVNENLSVQVPNTVRRFTDGGDLGFKK---KI 176
           DE+++FG K        S  N G+L+    ++NL ++VP   RR TD  +L  ++   K 
Sbjct: 87  DEEEVFGDKSN------SKLNRGMLK----DKNLRIEVPFMNRRVTDC-ELELRRFALKN 135

Query: 177 MTPXXXXXXXXXXXIQLQKHVHLHNQNLNCLDDPAELATPSAPPAPITDAD--FSLENEP 234
            TP              ++  H  +   +   D  ++ TPSAPP   +  +   SLE E 
Sbjct: 136 STPAS------------ERRPHTLSSKGSVYWDLEDIRTPSAPPIMESGQEDSISLEIEK 183

Query: 235 DHHGIGSSVDCDGRRSESSVEQTPSAVA------------KDPDIVQRQDTTFTQDMERQ 282
           D   I   + C     ESS +++  + +            KD   V+  D+  ++    +
Sbjct: 184 DIQKIEDEI-CGEAGVESSKQESMRSSSHLYRVEEFGESVKDSKTVE--DSKISEICSDE 240

Query: 283 PPHLQCYNTSRCNSQYAWQTLITYDACIRLCLQSWAKGCTEAPEFLKDECLALRAAFGLH 342
               +C++ S    QYAWQ+L+ YDACIRLCL  W+KG TEA EFL+DEC  LR AFGLH
Sbjct: 241 LE--ECHSIS---GQYAWQSLLAYDACIRLCLYEWSKGSTEASEFLRDECRILRGAFGLH 295

Query: 343 EFLLQPRGVKPPEGISTRPSEQTIPLKMNKAVGKIRVEVXXXXXXXXXXXXSANSQQGGS 402
           +FLLQPRGV+  E  +   +E    LK    V K+RVEV              +S +   
Sbjct: 296 KFLLQPRGVRSSEKNNNVKAEPKPSLKSKNVVRKLRVEVKRLRLIPQRKLRGTDSLR-SL 354

Query: 403 IYMQAGM--DYVRQVSSIVKXXXXXXXXXXXXXXXEEPLYCLLQLKSATEENESESCSAI 460
           + MQ GM  +Y RQVSS+VK               EE   C LQ+KS  E  + E  S++
Sbjct: 355 MNMQIGMGAEYCRQVSSLVKTGMTSIKQATLSAVSEEQFSCYLQMKSTAEGGQIEQGSSV 414

Query: 461 FLRPGNRDYHDFFPLSQGDALLVEVQDSKKTVHGEARIPI 500
            L+ G   YH FFP S+GDAL++EVQD KK+V G+A I I
Sbjct: 415 CLQSGTGSYHVFFPESEGDALMIEVQDKKKSVQGKAMISI 454


>AT5G48310.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G24610.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:19574961-19580362
           REVERSE LENGTH=1156
          Length = 1156

 Score =  186 bits (473), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 142/420 (33%), Positives = 200/420 (47%), Gaps = 64/420 (15%)

Query: 120 DEDQLFGCKQPQQNPKPSGRNNGILRKGLVNENLSVQVPNTVRRFTDGGDLGFKK---KI 176
           DE+++FG K        S  N G+L+    ++NL ++VP   RR TD  +L  ++   K 
Sbjct: 87  DEEEVFGDKSN------SKLNRGMLK----DKNLRIEVPFMNRRVTDC-ELELRRFALKN 135

Query: 177 MTPXXXXXXXXXXXIQLQKHVHLHNQNLNCLDDPAELATPSAPPAPIT--DADFSLENEP 234
            TP              ++  H  +   +   D  ++ TPSAPP   +  +   SLE E 
Sbjct: 136 STPAS------------ERRPHTLSSKGSVYWDLEDIRTPSAPPIMESGQEDSISLEIEK 183

Query: 235 DHHGIGSSVDCDGRRSESSVEQTPSAVAKDPDIVQRQDTTF------------------- 275
           D   I   + C     ESS +++  + +    + +  +  F                   
Sbjct: 184 DIQKIEDEI-CGEAGVESSKQESMRSSSHLYRVEEFGERYFPNLTRFFVISFCGLVLMCL 242

Query: 276 ---------TQDMERQPPHLQCYN-TSRCNS---QYAWQTLITYDACIRLCLQSWAKGCT 322
                    ++ +E       C +    C+S   QYAWQ+L+ YDACIRLCL  W+KG T
Sbjct: 243 IMVWCSVKDSKTVEDSKISEICSDELEECHSISGQYAWQSLLAYDACIRLCLYEWSKGST 302

Query: 323 EAPEFLKDECLALRAAFGLHEFLLQPRGVKPPEGISTRPSEQTIPLKMNKAVGKIRVEVX 382
           EA EFL+DEC  LR AFGLH+FLLQPRGV+  E  +   +E    LK    V K+RVEV 
Sbjct: 303 EASEFLRDECRILRGAFGLHKFLLQPRGVRSSEKNNNVKAEPKPSLKSKNVVRKLRVEVK 362

Query: 383 XXXXXXXXXXXSANSQQGGSIYMQAGM--DYVRQVSSIVKXXXXXXXXXXXXXXXEEPLY 440
                        +S +   + MQ GM  +Y RQVSS+VK               EE   
Sbjct: 363 RLRLIPQRKLRGTDSLR-SLMNMQIGMGAEYCRQVSSLVKTGMTSIKQATLSAVSEEQFS 421

Query: 441 CLLQLKSATEENESESCSAIFLRPGNRDYHDFFPLSQGDALLVEVQDSKKTVHGEARIPI 500
           C LQ+KS  E  + E  S++ L+ G   YH FFP S+GDAL++EVQD KK+V G+A I I
Sbjct: 422 CYLQMKSTAEGGQIEQGSSVCLQSGTGSYHVFFPESEGDALMIEVQDKKKSVQGKAMISI 481


>AT4G24610.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 12
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G65440.1); Has 820 Blast
           hits to 264 proteins in 74 species: Archae - 0; Bacteria
           - 15; Metazoa - 77; Fungi - 83; Plants - 96; Viruses -
           0; Other Eukaryotes - 549 (source: NCBI BLink). |
           chr4:12700837-12707899 REVERSE LENGTH=1150
          Length = 1150

 Score = 94.7 bits (234), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 73/252 (28%), Positives = 111/252 (44%), Gaps = 53/252 (21%)

Query: 280 ERQPPHLQCYNTSRCNSQYAWQTLITYDACIRLCLQSWAKGCTEAPEFLKDECLALRAAF 339
           ++ P  L  ++ S   S+  W  +++YDAC+RLCL +W+ GC EAP FL++EC  LR AF
Sbjct: 219 DQHPARLPTFHAS---SRGPWHAVVSYDACVRLCLHAWSTGCMEAPMFLENECALLREAF 275

Query: 340 GLHEFLL--------QPRGVKPPEGISTRPSEQTIPLK---------MNKAVG------- 375
           GL + LL        +     P EG++ +P +    +K         M+   G       
Sbjct: 276 GLQQLLLQSEEELLAKRSSQAPHEGVAPKPKKNIGKMKVQVRRVKTVMDGPTGCSISSLK 335

Query: 376 -------KIRVEVXXXXXXXXXXXXSANS------QQGGSI------YMQAGMDYVRQVS 416
                  KIR+              +           G S+      Y+ A   Y++QVS
Sbjct: 336 PSLIKFEKIRIHFSNMSTRLFSGWRALRKIHVRVPANGSSLPRQSLAYVHASTQYLKQVS 395

Query: 417 SIVKXXXXXXXXXXXXXXXEEPLY-CLLQLKSATEENESESCSAIFLRPGNRDYHDFFPL 475
            ++K                +  Y C L+LKS  E+N      AI ++PG+ + H FFP 
Sbjct: 396 GLLKTGVTSLRNNSTSYDIVQETYSCKLRLKSLAEDN------AIMMQPGSGESHVFFPD 449

Query: 476 SQGDALLVEVQD 487
           S GD L+VE+ D
Sbjct: 450 SHGDDLIVEILD 461


>AT4G24610.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 20 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G65440.1). | chr4:12700837-12707899 REVERSE
           LENGTH=1155
          Length = 1155

 Score = 94.7 bits (234), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 73/252 (28%), Positives = 111/252 (44%), Gaps = 53/252 (21%)

Query: 280 ERQPPHLQCYNTSRCNSQYAWQTLITYDACIRLCLQSWAKGCTEAPEFLKDECLALRAAF 339
           ++ P  L  ++ S   S+  W  +++YDAC+RLCL +W+ GC EAP FL++EC  LR AF
Sbjct: 223 DQHPARLPTFHAS---SRGPWHAVVSYDACVRLCLHAWSTGCMEAPMFLENECALLREAF 279

Query: 340 GLHEFLL--------QPRGVKPPEGISTRPSEQTIPLK---------MNKAVG------- 375
           GL + LL        +     P EG++ +P +    +K         M+   G       
Sbjct: 280 GLQQLLLQSEEELLAKRSSQAPHEGVAPKPKKNIGKMKVQVRRVKTVMDGPTGCSISSLK 339

Query: 376 -------KIRVEVXXXXXXXXXXXXSANS------QQGGSI------YMQAGMDYVRQVS 416
                  KIR+              +           G S+      Y+ A   Y++QVS
Sbjct: 340 PSLIKFEKIRIHFSNMSTRLFSGWRALRKIHVRVPANGSSLPRQSLAYVHASTQYLKQVS 399

Query: 417 SIVKXXXXXXXXXXXXXXXEEPLY-CLLQLKSATEENESESCSAIFLRPGNRDYHDFFPL 475
            ++K                +  Y C L+LKS  E+N      AI ++PG+ + H FFP 
Sbjct: 400 GLLKTGVTSLRNNSTSYDIVQETYSCKLRLKSLAEDN------AIMMQPGSGESHVFFPD 453

Query: 476 SQGDALLVEVQD 487
           S GD L+VE+ D
Sbjct: 454 SHGDDLIVEILD 465


>AT5G65440.3 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G24610.1). | chr5:26152015-26156896 FORWARD
           LENGTH=1125
          Length = 1125

 Score = 69.3 bits (168), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 51/92 (55%), Gaps = 2/92 (2%)

Query: 291 TSRCNSQYAWQTLITYDACIRLCLQSWAK-GCTEAPEFLKDECLALRAAFGLHEFLLQPR 349
           T   + Q  W  +I Y+AC+RLCL SW+    +EA  FL +EC  +R AF L  F L   
Sbjct: 172 TFHASEQGPWSAMIAYEACVRLCLHSWSTDSVSEASYFLNNECTIMRNAFSLQRFFLHSE 231

Query: 350 GVKPPEGISTRPSEQTIPLKMNKAVGKIRVEV 381
                +G S   +E ++P K  K +GKI+++V
Sbjct: 232 EELLGKGPSELVTETSVP-KSKKTIGKIKLQV 262


>AT5G65440.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT4G24610.1); Has 30201 Blast
           hits to 17322 proteins in 780 species: Archae - 12;
           Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
           5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr5:26152015-26156896 FORWARD LENGTH=1050
          Length = 1050

 Score = 68.9 bits (167), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 51/92 (55%), Gaps = 2/92 (2%)

Query: 291 TSRCNSQYAWQTLITYDACIRLCLQSWAK-GCTEAPEFLKDECLALRAAFGLHEFLLQPR 349
           T   + Q  W  +I Y+AC+RLCL SW+    +EA  FL +EC  +R AF L  F L   
Sbjct: 131 TFHASEQGPWSAMIAYEACVRLCLHSWSTDSVSEASYFLNNECTIMRNAFSLQRFFLHSE 190

Query: 350 GVKPPEGISTRPSEQTIPLKMNKAVGKIRVEV 381
                +G S   +E ++P K  K +GKI+++V
Sbjct: 191 EELLGKGPSELVTETSVP-KSKKTIGKIKLQV 221


>AT5G65440.2 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT4G24610.1); Has 30201 Blast
           hits to 17322 proteins in 780 species: Archae - 12;
           Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
           5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr5:26152015-26156623 FORWARD LENGTH=1016
          Length = 1016

 Score = 68.9 bits (167), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 51/92 (55%), Gaps = 2/92 (2%)

Query: 291 TSRCNSQYAWQTLITYDACIRLCLQSWAK-GCTEAPEFLKDECLALRAAFGLHEFLLQPR 349
           T   + Q  W  +I Y+AC+RLCL SW+    +EA  FL +EC  +R AF L  F L   
Sbjct: 131 TFHASEQGPWSAMIAYEACVRLCLHSWSTDSVSEASYFLNNECTIMRNAFSLQRFFLHSE 190

Query: 350 GVKPPEGISTRPSEQTIPLKMNKAVGKIRVEV 381
                +G S   +E ++P K  K +GKI+++V
Sbjct: 191 EELLGKGPSELVTETSVP-KSKKTIGKIKLQV 221