Miyakogusa Predicted Gene

Lj3g3v0030810.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v0030810.1 Non Chatacterized Hit- tr|D7LP21|D7LP21_ARALL
Putative uncharacterized protein OS=Arabidopsis
lyrata,37.44,1e-17,seg,NULL,
gene.Ljchr3_pseudomol_20120830.path1.gene68.1
         (458 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G15780.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   234   1e-61
AT1G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   213   3e-55
AT2G10440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    93   4e-19
AT2G10440.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    84   2e-16
AT1G15772.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    50   4e-06

>AT1G15780.1 | Symbols:  | unknown protein; BEST Arabidopsis thaliana
            protein match is: unknown protein (TAIR:AT2G10440.1); Has
            103701 Blast hits to 43153 proteins in 1828 species:
            Archae - 30; Bacteria - 7385; Metazoa - 38639; Fungi -
            11531; Plants - 7727; Viruses - 307; Other Eukaryotes -
            38082 (source: NCBI BLink). | chr1:5430446-5435921
            REVERSE LENGTH=1335
          Length = 1335

 Score =  234 bits (597), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 127/216 (58%), Positives = 152/216 (70%), Gaps = 6/216 (2%)

Query: 234  ANIGHQHTGCAAVAAQSLAIGTSGISAPPESAEFTGPDGVHGNSFPPTLGKSTVSEQPLD 293
             NI  Q         QSLAIGT GISA P   EFT PDG   NS   T GK + +E P++
Sbjct: 989  GNIARQQATGMQGVVQSLAIGTPGISASPLLQEFTSPDGNILNSSTITSGKPSATELPIE 1048

Query: 294  RLIKAVKSLTPNALSTAVSDIGSVVSMNDRIVGSAPGNGSKAAAGADLVAMTNCRLQARN 353
            RLI+AVKS++P ALS+AVSDIGSVVSM DRI GSAPGNGS+A+ G DLVAMT CRLQARN
Sbjct: 1049 RLIRAVKSISPQALSSAVSDIGSVVSMVDRIAGSAPGNGSRASVGEDLVAMTKCRLQARN 1108

Query: 354  FINQDGANGTRRMKRCTHATPLNDVSYAGCLNNSIKQL---EASDLDSTATCI-KKPRIQ 409
            F+ Q+G   T++MKR T A PL+  S  G + ++ KQ    E SDL+STAT   KK R +
Sbjct: 1109 FMTQEGMMATKKMKRHTTAMPLSVASLGGSVGDNYKQFAGSETSDLESTATSDGKKARTE 1168

Query: 410  TNHALLEEIREVNQRLIDTVVDISDEE--IDPTAAA 443
            T HALLEEI+E+NQRLIDTVV+ISD+E   DP+  A
Sbjct: 1169 TEHALLEEIKEINQRLIDTVVEISDDEDAADPSEVA 1204



 Score =  208 bits (530), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 119/203 (58%), Positives = 145/203 (71%), Gaps = 12/203 (5%)

Query: 21  DMWRRLQASG----SLTLPQSVLDQQKQLCQSQRPLPKISSRKKTSLDSTTQTGQPNGGD 76
           D+ +RLQASG    SL  PQ+V+DQQ+QL QSQR LP++ S   +SLDST QT   NGGD
Sbjct: 536 DVQQRLQASGQVTGSLLPPQNVVDQQRQLYQSQRTLPEMPS---SSLDSTAQTESANGGD 592

Query: 77  WQEEVYQKIKTMKENYLPDLNELYQKIGTKLQQHDSLAQQSKSDKLEKYKLFKMMLERMI 136
           WQEEVYQKIK+MKE YLPDLNE+YQ++  KLQQ DS+ QQ +SD+LEK + FK MLERMI
Sbjct: 593 WQEEVYQKIKSMKETYLPDLNEIYQRVAAKLQQ-DSMPQQQRSDQLEKLRQFKTMLERMI 651

Query: 137 SFLQVSKSNIAPSFKEKLGSYEKQIINFININRPRKGMSSLQHPGHLPPQHMHSMSQSHP 196
            FL VSKSNI P+ K+K+  YEKQII F+N++RPRK +      G LP   M  M Q   
Sbjct: 652 QFLSVSKSNIMPALKDKVAYYEKQIIGFLNMHRPRKPV----QQGQLPQSQMQPMQQPQS 707

Query: 197 QVTQVQPLENQINSQMQTTNMQG 219
           Q  Q Q  +NQ N QMQ+ +MQG
Sbjct: 708 QTVQDQSHDNQTNPQMQSMSMQG 730


>AT1G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G15780.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr1:5426892-5428215 REVERSE LENGTH=293
          Length = 293

 Score =  213 bits (541), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 123/266 (46%), Positives = 159/266 (59%), Gaps = 21/266 (7%)

Query: 95  DLNELYQKIGTKLQQHDSLA-QQSKSDKLEKYKLFKMMLERMISFLQVSKSNIAPSFKEK 153
           DLNE+YQ++  KLQQ DSL+ Q+ +SD+ EK K  K +LE M+ FL +SKSNI P  K+ 
Sbjct: 15  DLNEIYQRVAAKLQQEDSLSHQKQRSDQFEKLKRGKTVLEGMLRFLSLSKSNIKPDLKDS 74

Query: 154 LGSYEKQIINFININRPRKGMSSLQHPGHLPPQHMHSMSQSHPQVTQVQPLENQINSQMQ 213
           +   +  I+NF+N+   RK +  LQ    L    +  M Q   Q  Q Q  ++Q   QMQ
Sbjct: 75  MDYRKNNIMNFLNMQSLRKTVQKLQ----LTKSEIQPMQQPLSQTVQDQSHDDQTTLQMQ 130

Query: 214 TTNMQGXXXXXXXXXXXXHGANIGHQHTGCAAVAAQSLAIGTSGISAPPESAEFTGPDGV 273
           + +MQG             G+ +     G      QSL IGT GISA P   E T PDG 
Sbjct: 131 SMSMQGA------------GSRVQQIRQG----VLQSLEIGTPGISASPLLPELTSPDGN 174

Query: 274 HGNSFPPTLGKSTVSEQPLDRLIKAVKSLTPNALSTAVSDIGSVVSMNDRIVGSAPGNGS 333
             N    T GKS+ +E P++RLI+A+KS++P ALS+AV DI SVVSM DRI GS PG GS
Sbjct: 175 IINPLTSTCGKSSATELPIERLIRAMKSISPQALSSAVCDIRSVVSMVDRIAGSVPGKGS 234

Query: 334 KAAAGADLVAMTNCRLQARNFINQDG 359
           +A+ G DLVAMT C LQ RNF+ QDG
Sbjct: 235 RASFGVDLVAMTKCHLQERNFMTQDG 260


>AT2G10440.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G15780.1); Has 8319 Blast hits to 5104 proteins
           in 317 species: Archae - 0; Bacteria - 285; Metazoa -
           1706; Fungi - 535; Plants - 320; Viruses - 18; Other
           Eukaryotes - 5455 (source: NCBI BLink). |
           chr2:4013752-4018046 REVERSE LENGTH=935
          Length = 935

 Score = 93.2 bits (230), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 67/187 (35%), Positives = 100/187 (53%), Gaps = 13/187 (6%)

Query: 25  RLQASGSLTLPQSVLDQQKQLCQSQRPLPKISSRK--KTSLDSTTQTGQPNGGDWQEEVY 82
           + QA+ SL   Q++ DQQ Q  Q +R  P I        S DST +T   N G+WQEE Y
Sbjct: 256 QFQAASSLRQTQNITDQQNQPQQLERANPSILIMNIIVASQDSTGKTVNVNAGNWQEETY 315

Query: 83  QKIKTMKENYLPDLNELYQKIGTKLQQHDSLAQQS-KSDKLEKYKLFKMMLERMISFLQV 141
           QKIK +KE  LP L+ ++Q++  KL++ +SL  Q  ++  +EK K  K+ +E ++ FL V
Sbjct: 316 QKIKKLKEMCLPVLSLMHQRVAEKLRETESLPPQPMQAQWIEKLKAGKLSMEHLMFFLNV 375

Query: 142 SKSNIAPSFKEKLGSYEKQIINFIN----INRPRKGMSSLQHPGHLPPQHMHSMSQSHPQ 197
            +S+++   ++K   YE  I+ F      + RP +     Q  G  PP      +QS PQ
Sbjct: 376 HRSSVSEKHRDKFSQYEYHILKFTKSQTMVLRPTQ-----QQQGQFPPSQTAMQTQS-PQ 429

Query: 198 VTQVQPL 204
           V   Q L
Sbjct: 430 VHVSQSL 436



 Score = 76.3 bits (186), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 57/151 (37%), Positives = 89/151 (58%), Gaps = 8/151 (5%)

Query: 287 VSEQPLDRLIKAVKSLTPNALSTAVSDIGSVVSMNDRI-VGSAPGNGSKAAAGADLVAMT 345
           ++E+P+DRLIKA ++ +P +L+ +VS+I SV+SM D I        GS+A  G DL   T
Sbjct: 655 ITERPIDRLIKAFQAASPKSLAESVSEISSVISMVDMIGGSFPSSGGSRAGLGEDLSERT 714

Query: 346 NCRLQARNFINQDGANGTRRMKRCTHATPLNDVSYAGCLNNSIKQLEASDLDSTATCIKK 405
                 RNF   +  N ++RMKR  +  P  D+S        +  LE+  + +T++ +K 
Sbjct: 715 ------RNFTTHEETNLSKRMKRSINIVP-PDMSSQIDSYEQLSSLESEVVSTTSSGLKV 767

Query: 406 PRIQTNHALLEEIREVNQRLIDTVVDISDEE 436
             I   +ALL+EI+E N RL++TVV+I DE+
Sbjct: 768 NNIAPGYALLQEIKETNGRLVETVVEICDED 798


>AT2G10440.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT1G15780.1);
           Has 1628 Blast hits to 1350 proteins in 149 species:
           Archae - 0; Bacteria - 39; Metazoa - 480; Fungi - 159;
           Plants - 187; Viruses - 2; Other Eukaryotes - 761
           (source: NCBI BLink). | chr2:4013752-4018046 REVERSE
           LENGTH=845
          Length = 845

 Score = 84.3 bits (207), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/148 (36%), Positives = 82/148 (55%), Gaps = 11/148 (7%)

Query: 62  SLDSTTQTGQPNGGDWQEEVYQKIKTMKENYLPDLNELYQKIGTKLQQHDSL-AQQSKSD 120
           S DST +T   N G+WQEE YQKIK +KE  LP L+ ++Q++  KL++ +SL  Q  ++ 
Sbjct: 215 SQDSTGKTVNVNAGNWQEETYQKIKKLKEMCLPVLSLMHQRVAEKLRETESLPPQPMQAQ 274

Query: 121 KLEKYKLFKMMLERMISFLQVSKSNIAPSFKEKLGSYEKQIINFIN----INRPRKGMSS 176
            +EK K  K+ +E ++ FL V +S+++   ++K   YE  I+ F      + RP +    
Sbjct: 275 WIEKLKAGKLSMEHLMFFLNVHRSSVSEKHRDKFSQYEYHILKFTKSQTMVLRPTQ---- 330

Query: 177 LQHPGHLPPQHMHSMSQSHPQVTQVQPL 204
            Q  G  PP      +QS PQV   Q L
Sbjct: 331 -QQQGQFPPSQTAMQTQS-PQVHVSQSL 356



 Score = 75.9 bits (185), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 57/151 (37%), Positives = 89/151 (58%), Gaps = 8/151 (5%)

Query: 287 VSEQPLDRLIKAVKSLTPNALSTAVSDIGSVVSMNDRI-VGSAPGNGSKAAAGADLVAMT 345
           ++E+P+DRLIKA ++ +P +L+ +VS+I SV+SM D I        GS+A  G DL   T
Sbjct: 575 ITERPIDRLIKAFQAASPKSLAESVSEISSVISMVDMIGGSFPSSGGSRAGLGEDLSERT 634

Query: 346 NCRLQARNFINQDGANGTRRMKRCTHATPLNDVSYAGCLNNSIKQLEASDLDSTATCIKK 405
                 RNF   +  N ++RMKR  +  P  D+S        +  LE+  + +T++ +K 
Sbjct: 635 ------RNFTTHEETNLSKRMKRSINIVP-PDMSSQIDSYEQLSSLESEVVSTTSSGLKV 687

Query: 406 PRIQTNHALLEEIREVNQRLIDTVVDISDEE 436
             I   +ALL+EI+E N RL++TVV+I DE+
Sbjct: 688 NNIAPGYALLQEIKETNGRLVETVVEICDED 718


>AT1G15772.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT1G15780.1);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr1:5428877-5429232
           REVERSE LENGTH=87
          Length = 87

 Score = 49.7 bits (117), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 18/37 (48%), Positives = 29/37 (78%)

Query: 73  NGGDWQEEVYQKIKTMKENYLPDLNELYQKIGTKLQQ 109
           NG DW+EEV+QK+++M+E YLP + ++YQ++  KL  
Sbjct: 5   NGDDWREEVFQKLRSMRERYLPHVTDVYQRLADKLNH 41