Miyakogusa Predicted Gene

Lj0g3v0032719.1
Show Alignment: 
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0032719.1 Non Chatacterized Hit- tr|J3M3H4|J3M3H4_ORYBR
Uncharacterized protein OS=Oryza brachyantha
GN=OB05G1,40.16,0.000000000000001, ,CUFF.1477.1
         (265 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G13950.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   247   6e-66
AT5G13950.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   247   6e-66
AT5G13950.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   245   2e-65
AT3G45830.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    72   5e-13
AT1G02290.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    71   7e-13

>AT5G13950.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G02290.1); Has 147 Blast hits to 145 proteins
           in 44 species: Archae - 0; Bacteria - 2; Metazoa - 56;
           Fungi - 6; Plants - 81; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr5:4496196-4500206 REVERSE
           LENGTH=939
          Length = 939

 Score =  247 bits (630), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 124/264 (46%), Positives = 169/264 (64%), Gaps = 8/264 (3%)

Query: 1   MAADQRRKRINGASIVGYGSREHHRTKRKNSGQGQNDLNLRSHISVEWDSNQKRVVAKRE 60
           MAADQRRKR+N A+++G  SREH+R KRK +      L    HI++EWD N+ +VV+K+E
Sbjct: 1   MAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGALRSGDHITLEWDRNRSKVVSKKE 60

Query: 61  QIGISRRQMKPFVNFVSNDHKVLADVFTMPQEIFSLDNLSEVLSYEVWKTLLSENERNLL 120
           Q+G+S R ++ FV+ V     VLA V  +P E F L+NLSEVLS EVW++ LS+ ERN L
Sbjct: 61  QVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYL 120

Query: 121 MQFLPSGLEPHHAVEELLSGSNFQFGNPLLKWGASLCLGDLHPDMIVDRELHLKAENRAY 180
            QFLP G++    V+ LL G NF FGNP L WG ++C G  HPD IV RE  L+A+ R Y
Sbjct: 121 RQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRY 180

Query: 181 YSQLYNYHNDMLGFLSKLKERWKSCKDPEKEFFQKI-RRSKHDEKKMPSNLNESRIFDRD 239
           YS L  YH D++ +L  LKE+W+SCKDPEK+  + +  RS+    ++  +        + 
Sbjct: 181 YSNLEKYHQDIIDYLQTLKEKWESCKDPEKDIVKMMWGRSRGGNAQVNGSC-------QG 233

Query: 240 ENVTSDSCSWDAEEKACSSDNQIS 263
               S S SW+ ++K  SSDN IS
Sbjct: 234 LTAASGSSSWNEDDKPDSSDNMIS 257


>AT5G13950.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G02290.1); Has 147 Blast hits to 145 proteins
           in 44 species: Archae - 0; Bacteria - 2; Metazoa - 56;
           Fungi - 6; Plants - 81; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr5:4496196-4500206 REVERSE
           LENGTH=939
          Length = 939

 Score =  247 bits (630), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 124/264 (46%), Positives = 169/264 (64%), Gaps = 8/264 (3%)

Query: 1   MAADQRRKRINGASIVGYGSREHHRTKRKNSGQGQNDLNLRSHISVEWDSNQKRVVAKRE 60
           MAADQRRKR+N A+++G  SREH+R KRK +      L    HI++EWD N+ +VV+K+E
Sbjct: 1   MAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGALRSGDHITLEWDRNRSKVVSKKE 60

Query: 61  QIGISRRQMKPFVNFVSNDHKVLADVFTMPQEIFSLDNLSEVLSYEVWKTLLSENERNLL 120
           Q+G+S R ++ FV+ V     VLA V  +P E F L+NLSEVLS EVW++ LS+ ERN L
Sbjct: 61  QVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYL 120

Query: 121 MQFLPSGLEPHHAVEELLSGSNFQFGNPLLKWGASLCLGDLHPDMIVDRELHLKAENRAY 180
            QFLP G++    V+ LL G NF FGNP L WG ++C G  HPD IV RE  L+A+ R Y
Sbjct: 121 RQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRY 180

Query: 181 YSQLYNYHNDMLGFLSKLKERWKSCKDPEKEFFQKI-RRSKHDEKKMPSNLNESRIFDRD 239
           YS L  YH D++ +L  LKE+W+SCKDPEK+  + +  RS+    ++  +        + 
Sbjct: 181 YSNLEKYHQDIIDYLQTLKEKWESCKDPEKDIVKMMWGRSRGGNAQVNGSC-------QG 233

Query: 240 ENVTSDSCSWDAEEKACSSDNQIS 263
               S S SW+ ++K  SSDN IS
Sbjct: 234 LTAASGSSSWNEDDKPDSSDNMIS 257


>AT5G13950.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G02290.1). | chr5:4496196-4500206 REVERSE
           LENGTH=954
          Length = 954

 Score =  245 bits (626), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 126/272 (46%), Positives = 173/272 (63%), Gaps = 9/272 (3%)

Query: 1   MAADQRRKRINGASIVGYGSREHHRTKRKNSGQGQNDLNLRSHISVEWDSNQKRVVAKRE 60
           MAADQRRKR+N A+++G  SREH+R KRK +      L    HI++EWD N+ +VV+K+E
Sbjct: 1   MAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGALRSGDHITLEWDRNRSKVVSKKE 60

Query: 61  QIGISRRQMKPFVNFVSNDHKVLADVFTMPQEIFSLDNLSEVLSYEVWKTLLSENERNLL 120
           Q+G+S R ++ FV+ V     VLA V  +P E F L+NLSEVLS EVW++ LS+ ERN L
Sbjct: 61  QVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYL 120

Query: 121 MQFLPSGLEPHHAVEELLSGSNFQFGNPLLKWGASLCLGDLHPDMIVDRELHLKAENRAY 180
            QFLP G++    V+ LL G NF FGNP L WG ++C G  HPD IV RE  L+A+ R Y
Sbjct: 121 RQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRY 180

Query: 181 YSQLYNYHNDMLGFLSKLKERWKSCKDPEKEFFQKIRRS-------KHDEKKMPSNLNES 233
           YS L  YH D++ +L  LKE+W+SCKDPEK+  + +  S       K    ++ S    +
Sbjct: 181 YSNLEKYHQDIIDYLQTLKEKWESCKDPEKDIVKMMWGSVLYNFLFKRRTVEVRSRGGNA 240

Query: 234 RIFDRDENVT--SDSCSWDAEEKACSSDNQIS 263
           ++    + +T  S S SW+ ++K  SSDN IS
Sbjct: 241 QVNGSCQGLTAASGSSSWNEDDKPDSSDNMIS 272


>AT3G45830.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G02290.1); Has 499 Blast hits to 438 proteins
           in 100 species: Archae - 0; Bacteria - 7; Metazoa - 236;
           Fungi - 15; Plants - 108; Viruses - 2; Other Eukaryotes
           - 131 (source: NCBI BLink). | chr3:16841277-16845173
           FORWARD LENGTH=1298
          Length = 1298

 Score = 71.6 bits (174), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 65/124 (52%), Gaps = 8/124 (6%)

Query: 88  TMPQEIFSLDNLSEVLSYEVWKTLLSENERNLLMQFLP--SGLEPHHAVEELLSGSNFQF 145
           ++P E++ L +L ++LS +VW   L+E ER  L  +LP    L     ++EL  G NF F
Sbjct: 82  SIPFELYDLPSLEDILSVDVWNECLTEKERFSLSSYLPDVDQLTFMRTLKELFEGCNFHF 141

Query: 146 GNPLLKWGASLCLGDLHPD---MIVDRELHLKAENRAYYSQLYNYHNDMLGFLSKLKERW 202
           G+P+ K    L  G   P     +  R L L+ +   +Y  L  YHNDM+  L + ++ W
Sbjct: 142 GSPVKKLFDMLKGGQCEPRNTLYLEGRSLFLRTK---HYHSLRKYHNDMVVNLCQTRDAW 198

Query: 203 KSCK 206
            SCK
Sbjct: 199 TSCK 202


>AT1G02290.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G45830.1); Has 134 Blast hits to 134 proteins
           in 37 species: Archae - 0; Bacteria - 0; Metazoa - 54;
           Fungi - 0; Plants - 78; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr1:450646-451977 REVERSE
           LENGTH=443
          Length = 443

 Score = 71.2 bits (173), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 64/117 (54%), Gaps = 4/117 (3%)

Query: 89  MPQEIFSLDNLSEVLSYEVWKTLLSENERNLLMQFLPSGLEPHH---AVEELLSGSNFQF 145
           +P E++ L +L+ +LS E W +LL+E ER  L  FLP  ++P      ++ELL G+N  F
Sbjct: 38  IPYELYDLPDLTGILSVETWNSLLTEEERFFLSCFLPD-MDPQTFSLTMQELLDGANLYF 96

Query: 146 GNPLLKWGASLCLGDLHPDMIVDRELHLKAENRAYYSQLYNYHNDMLGFLSKLKERW 202
           GNP  K+  +L  G   P +   +E  +  + R YY  L  YH  ++   ++++  W
Sbjct: 97  GNPEDKFYKNLLGGLFTPKVACFKEGVMFVKRRKYYYSLKFYHEKLIRTFTEMQRVW 153