Miyakogusa Predicted Gene

Lj5g3v1913900.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v1913900.1 CUFF.56178.1
         (135 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G02550.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    70   5e-13
AT4G02550.3 | Symbols:  | unknown protein; BEST Arabidopsis thal...    70   5e-13
AT4G02550.4 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    70   5e-13
AT4G02550.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    70   5e-13
AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    63   6e-11
AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    63   6e-11
AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    62   8e-11
AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    62   1e-10

>AT4G02550.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes
           - 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
           LENGTH=307
          Length = 307

 Score = 69.7 bits (169), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 54/98 (55%)

Query: 30  WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
           W++ MD+ L + L  Q   GNK D  +   AY  A   +++RFNL L  +   NR+K  +
Sbjct: 22  WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81

Query: 90  SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
             Y ++ DILS+ GF W+ +  MI   +++ W  Y+ V
Sbjct: 82  KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAV 119


>AT4G02550.3 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr4:1120622-1121674 REVERSE LENGTH=322
          Length = 322

 Score = 69.7 bits (169), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 54/98 (55%)

Query: 30  WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
           W++ MD+ L + L  Q   GNK D  +   AY  A   +++RFNL L  +   NR+K  +
Sbjct: 37  WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 96

Query: 90  SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
             Y ++ DILS+ GF W+ +  MI   +++ W  Y+ V
Sbjct: 97  KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAV 134


>AT4G02550.4 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 18 plant
           structures; EXPRESSED DURING: 7 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2). | chr4:1120622-1121629 REVERSE
           LENGTH=278
          Length = 278

 Score = 69.7 bits (169), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 54/98 (55%)

Query: 30  WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
           W++ MD+ L + L  Q   GNK D  +   AY  A   +++RFNL L  +   NR+K  +
Sbjct: 22  WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81

Query: 90  SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
             Y ++ DILS+ GF W+ +  MI   +++ W  Y+ V
Sbjct: 82  KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAV 119


>AT4G02550.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes
           - 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
           LENGTH=278
          Length = 278

 Score = 69.7 bits (169), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 54/98 (55%)

Query: 30  WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
           W++ MD+ L + L  Q   GNK D  +   AY  A   +++RFNL L  +   NR+K  +
Sbjct: 22  WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81

Query: 90  SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
             Y ++ DILS+ GF W+ +  MI   +++ W  Y+ V
Sbjct: 82  KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAV 119


>AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 63.2 bits (152), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 1/104 (0%)

Query: 24  SRAYFTWNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKN 83
           +R   TW+  MDR   D++ DQ   GN+ +G ++  A+     + +++F      + +KN
Sbjct: 180 TRCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKN 239

Query: 84  RIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
           R K  R  +  +  IL   GF WD  + M++  N + W +Y+K 
Sbjct: 240 RYKSLRRQFNAIKSILRSDGFAWDNERQMVTADN-NVWQDYIKA 282



 Score = 52.0 bits (123), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 59/105 (56%), Gaps = 4/105 (3%)

Query: 25  RAYFTWNLEMDRALADILRDQRSMGNK-SDGAWKGVAYNTAAQILSSRFNLQLIGENV-K 82
           R    W  EMD+   +++ +Q   GN+  D  +   A+   +   +++F   L G++V K
Sbjct: 9   RLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKF-LYGKDVLK 67

Query: 83  NRIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
           NR K  R+ +  V+++L + GF WD T+ M+ V++   W+EY+K+
Sbjct: 68  NRHKTLRNLFKSVNNLLIEDGFSWDDTRQMV-VADNCVWDEYLKI 111


>AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
           - 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 63.2 bits (152), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 1/104 (0%)

Query: 24  SRAYFTWNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKN 83
           +R   TW+  MDR   D++ DQ   GN+ +G ++  A+     + +++F      + +KN
Sbjct: 180 TRCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKN 239

Query: 84  RIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
           R K  R  +  +  IL   GF WD  + M++  N + W +Y+K 
Sbjct: 240 RYKSLRRQFNAIKSILRSDGFAWDNERQMVTADN-NVWQDYIKA 282



 Score = 52.0 bits (123), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 59/105 (56%), Gaps = 4/105 (3%)

Query: 25  RAYFTWNLEMDRALADILRDQRSMGNK-SDGAWKGVAYNTAAQILSSRFNLQLIGENV-K 82
           R    W  EMD+   +++ +Q   GN+  D  +   A+   +   +++F   L G++V K
Sbjct: 9   RLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKF-LYGKDVLK 67

Query: 83  NRIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
           NR K  R+ +  V+++L + GF WD T+ M+ V++   W+EY+K+
Sbjct: 68  NRHKTLRNLFKSVNNLLIEDGFSWDDTRQMV-VADNCVWDEYLKI 111


>AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
           in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
           - 50 (source: NCBI BLink). | chr2:10617263-10620034
           FORWARD LENGTH=797
          Length = 797

 Score = 62.4 bits (150), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 3/116 (2%)

Query: 13  TEASNEDKKDDSRAYFTWNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRF 72
           T+AS E   D +R +  W   MD  L D+L +Q + GN+    +   A+N      +++F
Sbjct: 312 TKASQEQNSDRTRIF--WTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKF 369

Query: 73  NLQLIGENVKNRIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKVM 128
             Q   + +KNR K  R  Y  +  +L Q+GF WD  + M+ ++++D WN Y++  
Sbjct: 370 GSQHNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMV-IADDDIWNTYIQAC 424



 Score = 55.5 bits (132), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 1/97 (1%)

Query: 30  WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
           W LEMD+   +I+ DQ   GNK+  A+   A+     + ++RF+ Q     +++R     
Sbjct: 172 WTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLL 231

Query: 90  SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVK 126
            +Y  +  IL + GF WD T+ MIS +++  W+ Y+K
Sbjct: 232 KYYKDMEAILKEDGFSWDETRLMIS-ADDAVWDSYIK 267


>AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 21 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10617263-10620034 FORWARD LENGTH=774
          Length = 774

 Score = 62.0 bits (149), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 61/115 (53%), Gaps = 3/115 (2%)

Query: 13  TEASNEDKKDDSRAYFTWNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRF 72
           T+AS E   D +R +  W   MD  L D+L +Q + GN+    +   A+N      +++F
Sbjct: 312 TKASQEQNSDRTRIF--WTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKF 369

Query: 73  NLQLIGENVKNRIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
             Q   + +KNR K  R  Y  +  +L Q+GF WD  + M+ ++++D WN Y++ 
Sbjct: 370 GSQHNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMV-IADDDIWNTYIQA 423



 Score = 55.5 bits (132), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 1/97 (1%)

Query: 30  WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
           W LEMD+   +I+ DQ   GNK+  A+   A+     + ++RF+ Q     +++R     
Sbjct: 172 WTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLL 231

Query: 90  SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVK 126
            +Y  +  IL + GF WD T+ MIS +++  W+ Y+K
Sbjct: 232 KYYKDMEAILKEDGFSWDETRLMIS-ADDAVWDSYIK 267