Miyakogusa Predicted Gene

Lj5g3v2288720.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v2288720.1 Non Characterized Hit- tr|D7LJS1|D7LJS1_ARALL
Putative uncharacterized protein (Fragment)
OS=Arabido,38.66,3e-17,seg,NULL,CUFF.57189.1
         (356 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G39370.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   157   9e-39
AT2G37380.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    69   5e-12
AT5G26230.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    52   5e-07

>AT2G39370.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G37380.1); Has 184 Blast hits to 178 proteins
           in 53 species: Archae - 0; Bacteria - 58; Metazoa - 9;
           Fungi - 0; Plants - 103; Viruses - 0; Other Eukaryotes -
           14 (source: NCBI BLink). | chr2:16444280-16445266
           REVERSE LENGTH=328
          Length = 328

 Score =  157 bits (398), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 149/369 (40%), Positives = 197/369 (53%), Gaps = 54/369 (14%)

Query: 1   MAATLLTCDVADDDYIDMEVSSFSNLCHSVTSHHLQHGEFEFHMSSIVP-EKEAITSPAD 59
           MAA L  CD  ++DYIDMEV+SF+NL     S++    EFEF MS + P E +  TSPAD
Sbjct: 1   MAAYLERCDSVEEDYIDMEVTSFTNLVRKTLSNNYPR-EFEFQMSHLCPLEIDKTTSPAD 59

Query: 60  ELFYKGKLLPLHLPPRLQMVEKLLQNSNSPFEEEKNVFEEXXXXXXXXXXXXXXXXXXXF 119
           ELFYKGKLLPLHLPPRLQMV+K+L++    F++E                         F
Sbjct: 60  ELFYKGKLLPLHLPPRLQMVQKILEDYT--FDDEF-----YSTPLATGTVTTPVTSNTPF 112

Query: 120 ESCNISPSDSCQVSRELKPEEYYSLDYLEDTTSGFVVENQKKSWTXX---XXXXXXXXXX 176
           ESC +SP++SCQVS+EL PE+Y    +LE + S    + +KKSWT               
Sbjct: 113 ESCTVSPAESCQVSKELNPEDY----FLEYSDSLEEDDEKKKSWTTKLRLMKQSSLGTKI 168

Query: 177 XASRAYLKSWFGKSGCSYETYATST--KVADEGSVSKAREILNKQAQVVKKNPYGQIQRQ 234
            ASRAYL+S+FGK+ CS E+   S+  +VADE SV +   +           P+GQI+ +
Sbjct: 169 KASRAYLRSFFGKTSCSDESSCASSAARVADEDSVLRYSRV----------KPFGQIKTE 218

Query: 235 RYQPSNSNMRSYKEKTSEDRSNHHRRSFSVGIKXXXXXXXXXXXXXXXXXXXXXXXXXXY 294
           R  P        K++++   S  HRRSFSV ++                           
Sbjct: 219 R--P--------KKQSNGSVSGSHRRSFSVSMRRQAAKSSNNKSSNSLGFRP-------- 260

Query: 295 GCQSLKRCSSVNSEIENSIQGAIAHCKKSQQKK-------NASEVGLYSLPESRNSVCED 347
             Q LKR +S +SEIENSIQGAI HCK+SQQ+K         +EVG  SL  SR +  +D
Sbjct: 261 -LQFLKRSTSSSSEIENSIQGAILHCKQSQQQKQKQKQYSTVNEVGFCSLSASRIAARDD 319

Query: 348 QERVVLCRG 356
           QE   + RG
Sbjct: 320 QEWAQMFRG 328


>AT2G37380.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G39370.1); Has 1284 Blast hits to 422 proteins
           in 114 species: Archae - 0; Bacteria - 90; Metazoa -
           125; Fungi - 151; Plants - 136; Viruses - 0; Other
           Eukaryotes - 782 (source: NCBI BLink). |
           chr2:15686828-15687793 FORWARD LENGTH=321
          Length = 321

 Score = 68.9 bits (167), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 114/371 (30%), Positives = 150/371 (40%), Gaps = 87/371 (23%)

Query: 5   LLTCDVADDDYIDMEVSSFSNLCHSVTSHHL----------QHGEFEFHM-SSIVPEKEA 53
           +L+ D  DD YIDMEV+  S+   S +S             Q  EFEF M SS V   E+
Sbjct: 12  VLSTD-GDDGYIDMEVNLSSSSSSSTSSSSFFSFPVTSSPPQSREFEFQMCSSAVASGES 70

Query: 54  ITSPADELFYKGKLLPLHLPPRLQMVEKLLQNSNSPFEEEKNVFEEXXXXXXXXXXXXXX 113
            TSPADELFYKG+LLPLHLPPRL+MV+KLL  S+S     +                   
Sbjct: 71  TTSPADELFYKGQLLPLHLPPRLKMVQKLLLASSSSTAATETPI--------SPRAAADV 122

Query: 114 XXXXXFESCNISPSDSC--QVSRELKPEEYYSLDYLEDTTSGFVVENQK---KSWTXXXX 168
                F SC I   ++C  ++S ELK                F+  N+     SW+    
Sbjct: 123 LSPRRFSSCEIGQDENCFFEISTELK---------------RFIESNENHLGNSWSKKIK 167

Query: 169 XXXXXXXXXASRAYLKSWFGKSGCSYETYATSTKVADEGSVSKAREILNKQAQVVKKNPY 228
                    ASRAY+K+ F K  CS  +        +   VS+            KKNP+
Sbjct: 168 HSSITQKLKASRAYIKALFSKQACSDSSEINPRFKIEPSKVSR------------KKNPF 215

Query: 229 GQIQRQRYQPSNSNMRSYKEKTSEDRSNHHRRSFSVGIKXXXXXX-----XXXXXXXXXX 283
                                 SE+    HRRSFS  I+                     
Sbjct: 216 --------------------VNSENPLLIHRRSFSGVIQRHSQAKCSTSSSSSSSASSLS 255

Query: 284 XXXXXXXXXXYGCQSLKRCSSVNSEIENSIQGAIAHCKKS--QQKKNASEVGLYSLPESR 341
                        Q+L R S+ +   +NSI+GAI HCK+S   +K N +E  L S   SR
Sbjct: 256 SSFSFGSNGSLDLQTLMRSSNAS---DNSIEGAIEHCKQSFTTRKSNVTESELCS---SR 309

Query: 342 NSV--CEDQER 350
            SV  C D ++
Sbjct: 310 TSVSTCGDLDK 320


>AT5G26230.1 | Symbols:  | unknown protein; FUNCTIONS IN:
          molecular_function unknown; INVOLVED IN:
          biological_process unknown; LOCATED IN: chloroplast;
          EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
          growth stages; Has 1807 Blast hits to 1807 proteins in
          277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
          Fungi - 347; Plants - 385; Viruses - 0; Other
          Eukaryotes - 339 (source: NCBI BLink). |
          chr5:9173517-9174542 REVERSE LENGTH=341
          Length = 341

 Score = 52.4 bits (124), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 27/45 (60%), Positives = 33/45 (73%), Gaps = 2/45 (4%)

Query: 39 EFEFHMSSIVPEKEAIT-SPADELFYKGKLLPLHLPPRLQMVEKL 82
          EFEF++S I P K + +  PADELFYKG+LLPL L PRL +V  L
Sbjct: 25 EFEFNIS-ISPRKASSSLCPADELFYKGQLLPLQLSPRLSLVRTL 68