Miyakogusa Predicted Gene

Lj3g3v2542370.2
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v2542370.2 tr|Q9MAH6|Q9MAH6_ARATH F12M16.15 OS=Arabidopsis
thaliana GN=At1g53250 PE=4 SV=1,42.19,3e-17,coiled-coil,NULL;
seg,NULL,CUFF.44157.2
         (526 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G53800.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   266   2e-71
AT1G53800.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   266   3e-71
AT1G53250.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    87   2e-17

>AT1G53800.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G53250.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr1:20081888-20084320 FORWARD LENGTH=572
          Length = 572

 Score =  266 bits (680), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 203/576 (35%), Positives = 290/576 (50%), Gaps = 77/576 (13%)

Query: 8   IANAHPSFQYALCSPRLQIL---SSVSCDWKLLDKFNNGNFDVGGVEMRRCGRFLVKACA 64
           IA   PSFQ  L     Q +    S+   W+      N  F  G   +RR G+ L+ A A
Sbjct: 10  IATIQPSFQAHLVPLGAQSIIHAKSLPNPWRQSCFSKNLKFYTGHSHVRR-GKVLITAVA 68

Query: 65  TTTLEPKRVAGEEGE------VLGSKMLLESCYEDSEGLDEREKLRRMRISKANKGNTPW 118
           T  LE K  A +E E         SK    S  +  E +D+REKLRRMRISKAN+GNTPW
Sbjct: 69  T--LETKYPAQKENERSSSLSSASSKSSNGSADDGEEQVDDREKLRRMRISKANRGNTPW 126

Query: 119 NKGRKLSKHSAETLRKIKERTRLAMQSPKVKMKLVNPRNAHSAERKQKIAAGVKMRWQRR 178
           NKGRK   HS ETL+KI+ERT++AMQ PK+KMKL N  +A + E + KI  GV+MRW RR
Sbjct: 127 NKGRK---HSPETLQKIRERTKIAMQDPKIKMKLANLGHAQNKETRMKIGEGVRMRWARR 183

Query: 179 REKMAVQETCCLEWQNLIAQASREGYVDQKELQWNXXXXXXXXXXXXXXXXXXQRKQMPR 238
           +E+  VQETC  EWQNL+A+A+++GY D++ELQW+                  QRK +  
Sbjct: 184 KERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNILDQQNQLEWLESVEQRKAIKG 243

Query: 239 APGSKTTPWTSGQRRKVAEAISV---DLEYRRKGYTTKAKYHDIE-GAEMKPRRRPSDDG 294
           A  ++  P +  QRR++AEAI+    D  YR +  +  AKYH I  G E + RR  SD  
Sbjct: 244 AKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLAKYHGIPVGVERRRRRPRSDAE 303

Query: 295 QSTRSNTVMEKDPANDISVESRTRILNLISSGNAEFPAFKDHLESPKLDMMKSVKAQRGV 354
              ++ T   K    D   E ++++  ++     + PA+KD L S KL+M+KS++A+R  
Sbjct: 304 PRKKTPT---KKSTRDSEFERQSQV-QVVKVRKRKTPAYKDPLASSKLEMIKSIRAKRVA 359

Query: 355 AETKLNKXXXXXXXXXXXXXXXXXXXXXTEMKSPFAQASLMESRKLIAEAIQSLQSIDTK 414
            E+K                          +KSP AQASL+ES+KLIAEA Q ++S++ +
Sbjct: 360 EESKKMDAVERARLLISEAEKAAKVLEIAALKSPVAQASLLESKKLIAEATQLIKSLEMR 419

Query: 415 GITASNVPSVALAMANEENDTEFEV-----------LSQSHMLPINGKKM---LSSSDYN 460
            I +    +    ++ + ND+E E            ++ +H L ING+ +   + S+D  
Sbjct: 420 QIASDEDGTYPFLLSPQPNDSESETKDTNDQERPGEINGTHTLQINGESLHMNMRSNDLP 479

Query: 461 KFSED--LDKCASDQQMETEQDQSSEYKTDPSPTVLGVQSIKNETQLKLPAV-------- 510
            F  +   ++  SD +  T Q    + K       LG+    N T++  PA         
Sbjct: 480 TFVIEGTTNQFVSDMESNTSQGGREDIK-------LGIVGQPNGTRVHPPAESNGAISLA 532

Query: 511 -----------------------VSKKWVRGRLVEL 523
                                  V+KKWVRGRLVE+
Sbjct: 533 ENHPLPNGYHGIDEKAASLESGNVTKKWVRGRLVEV 568


>AT1G53800.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G53250.1); Has 1136 Blast hits to 882 proteins
           in 242 species: Archae - 2; Bacteria - 216; Metazoa -
           257; Fungi - 77; Plants - 87; Viruses - 4; Other
           Eukaryotes - 493 (source: NCBI BLink). |
           chr1:20081888-20084320 FORWARD LENGTH=568
          Length = 568

 Score =  266 bits (680), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 203/579 (35%), Positives = 291/579 (50%), Gaps = 77/579 (13%)

Query: 5   AFGIANAHPSFQYALCSPRLQIL---SSVSCDWKLLDKFNNGNFDVGGVEMRRCGRFLVK 61
           +  IA   PSFQ  L     Q +    S+   W+      N  F  G   +RR G+ L+ 
Sbjct: 3   SLDIATIQPSFQAHLVPLGAQSIIHAKSLPNPWRQSCFSKNLKFYTGHSHVRR-GKVLIT 61

Query: 62  ACATTTLEPKRVAGEEGE------VLGSKMLLESCYEDSEGLDEREKLRRMRISKANKGN 115
           A AT  LE K  A +E E         SK    S  +  E +D+REKLRRMRISKAN+GN
Sbjct: 62  AVAT--LETKYPAQKENERSSSLSSASSKSSNGSADDGEEQVDDREKLRRMRISKANRGN 119

Query: 116 TPWNKGRKLSKHSAETLRKIKERTRLAMQSPKVKMKLVNPRNAHSAERKQKIAAGVKMRW 175
           TPWNKGRK   HS ETL+KI+ERT++AMQ PK+KMKL N  +A + E + KI  GV+MRW
Sbjct: 120 TPWNKGRK---HSPETLQKIRERTKIAMQDPKIKMKLANLGHAQNKETRMKIGEGVRMRW 176

Query: 176 QRRREKMAVQETCCLEWQNLIAQASREGYVDQKELQWNXXXXXXXXXXXXXXXXXXQRKQ 235
            RR+E+  VQETC  EWQNL+A+A+++GY D++ELQW+                  QRK 
Sbjct: 177 ARRKERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNILDQQNQLEWLESVEQRKA 236

Query: 236 MPRAPGSKTTPWTSGQRRKVAEAISV---DLEYRRKGYTTKAKYHDIE-GAEMKPRRRPS 291
           +  A  ++  P +  QRR++AEAI+    D  YR +  +  AKYH I  G E + RR  S
Sbjct: 237 IKGAKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLAKYHGIPVGVERRRRRPRS 296

Query: 292 DDGQSTRSNTVMEKDPANDISVESRTRILNLISSGNAEFPAFKDHLESPKLDMMKSVKAQ 351
           D     ++ T   K    D   E ++++  ++     + PA+KD L S KL+M+KS++A+
Sbjct: 297 DAEPRKKTPT---KKSTRDSEFERQSQV-QVVKVRKRKTPAYKDPLASSKLEMIKSIRAK 352

Query: 352 RGVAETKLNKXXXXXXXXXXXXXXXXXXXXXTEMKSPFAQASLMESRKLIAEAIQSLQSI 411
           R   E+K                          +KSP AQASL+ES+KLIAEA Q ++S+
Sbjct: 353 RVAEESKKMDAVERARLLISEAEKAAKVLEIAALKSPVAQASLLESKKLIAEATQLIKSL 412

Query: 412 DTKGITASNVPSVALAMANEENDTEFEV-----------LSQSHMLPINGKKM---LSSS 457
           + + I +    +    ++ + ND+E E            ++ +H L ING+ +   + S+
Sbjct: 413 EMRQIASDEDGTYPFLLSPQPNDSESETKDTNDQERPGEINGTHTLQINGESLHMNMRSN 472

Query: 458 DYNKFSED--LDKCASDQQMETEQDQSSEYKTDPSPTVLGVQSIKNETQLKLPAV----- 510
           D   F  +   ++  SD +  T Q    + K       LG+    N T++  PA      
Sbjct: 473 DLPTFVIEGTTNQFVSDMESNTSQGGREDIK-------LGIVGQPNGTRVHPPAESNGAI 525

Query: 511 --------------------------VSKKWVRGRLVEL 523
                                     V+KKWVRGRLVE+
Sbjct: 526 SLAENHPLPNGYHGIDEKAASLESGNVTKKWVRGRLVEV 564


>AT1G53250.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G53800.1); Has 11909 Blast hits to 7704
           proteins in 757 species: Archae - 51; Bacteria - 1338;
           Metazoa - 4550; Fungi - 987; Plants - 464; Viruses - 24;
           Other Eukaryotes - 4495 (source: NCBI BLink). |
           chr1:19857468-19859156 FORWARD LENGTH=371
          Length = 371

 Score = 87.4 bits (215), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/114 (42%), Positives = 69/114 (60%), Gaps = 3/114 (2%)

Query: 100 REKLRRMRISKANKGNTPWNKGRKLSKHSAETLRKIKERTRLAMQSPKVKMKLVNPRNAH 159
           +E+ RR +I  ANKG  PWNKGRK   HS +T R+IK+RT  A+ +PKV+ K+ + +  H
Sbjct: 101 KEEERRRKIGLANKGKVPWNKGRK---HSEDTRRRIKQRTIEALTNPKVRKKMSDHQQPH 157

Query: 160 SAERKQKIAAGVKMRWQRRREKMAVQETCCLEWQNLIAQASREGYVDQKELQWN 213
           S E K+KI A VK  W  R     ++E     W   IA+A+R+G   + EL W+
Sbjct: 158 SNETKEKIRASVKQVWAERSRSKRLKEKFMSSWSENIAEAARKGGSGEAELDWD 211