Miyakogusa Predicted Gene

Lj0g3v0238219.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0238219.1 CUFF.15613.1
         (455 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G03670.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   164   2e-40
AT2G36420.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    80   4e-15

>AT5G03670.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G36420.1); Has 700 Blast hits to 624 proteins
           in 104 species: Archae - 0; Bacteria - 18; Metazoa -
           333; Fungi - 60; Plants - 73; Viruses - 24; Other
           Eukaryotes - 192 (source: NCBI BLink). |
           chr5:947311-949898 FORWARD LENGTH=516
          Length = 516

 Score =  164 bits (414), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 166/491 (33%), Positives = 214/491 (43%), Gaps = 70/491 (14%)

Query: 7   QHLLHELLKEDQEPFLLKNYISHRRNQLKRPSQKPTHXXXXXXXXXXXXXXXXXXXXXHK 66
           Q  L +LL+EDQEPF L++YIS RR Q+   +   TH                       
Sbjct: 4   QRHLKDLLEEDQEPFQLQSYISDRRCQI---NAHVTHLQVKKRRPISQNAGLPSR----- 55

Query: 67  XXXXXXXFLCRNACFLGA-----TTKSPLFELVKSPCRSPSNAIFLQIPAKTASRLLDAA 121
                    CRNACF          KSPLFEL KSP RS  NAIF+ IPA+TAS LL+AA
Sbjct: 56  --------FCRNACFFSLRESPDPKKSPLFEL-KSPNRS-QNAIFVNIPARTASILLEAA 105

Query: 122 LRIQKNQS---KTKLPSNKNSFALLGSFFKXXXXXXXXXXXXXXXXXNNVKVSVKDLLRW 178
           +RIQK  S   KT+  +  N+F + GS  K                      SVKD+LRW
Sbjct: 106 VRIQKQSSEVSKTRTRNAGNAFGIFGSVLKKLTNRKKREISGGKEAGRVSSSSVKDMLRW 165

Query: 179 DSSNNRK------------SSTSVEEKLKKENGFVVECSCDGRASSAVWXXXXXXXXXXX 226
           +S   RK            +++S   K+  E  F    S  G  S +V            
Sbjct: 166 ESPVVRKIVTRKSKRNEEENASSQTHKIASETHFSRRSSSSGVWSESVTNGERSWDVDFE 225

Query: 227 XXXXXXXXCGGHSCEKENAVXXXXXXXXXXXXQSPFRFVLQKSPSASSGHRTPEFSSAAA 286
                     G S E    +            +SPF FVLQ  PS + G RTP FSS AA
Sbjct: 226 TSISTSSRSNG-SDEFAMMMNGQDLSEDKRFCESPFHFVLQTMPS-NGGFRTPNFSSPAA 283

Query: 287 SSSR-CGTQDKEN---NVANGVNEFQSEEEKEQCSPVSILETPFXXXXXXXXXXXXXXXX 342
           S    C   +KE+        +   + EEEKEQ SPVS+L+ PF                
Sbjct: 284 SPRHDCHEMEKESYEVEKLKKLEMEEEEEEKEQSSPVSVLDPPFQDDDEDIHMDDNN--- 340

Query: 343 XXXLECSYANVQRTKQQLLDRLSRFEKLAELDAIELEKRMLDQED--------------- 387
              +  S+ +VQ+ K  LL +L RFE+LA LD +ELEKRM DQE                
Sbjct: 341 ---IPSSFRSVQKAKHLLLQKLCRFEQLAGLDPMELEKRMSDQETEEEEEEEEEEMKSLY 397

Query: 388 --EFVTYSEEDDDGETSCEEKTPEDLKRLVHDLIMEE-ERELGSSEDRNMVMRRVCRKLE 444
             E +T        E   E   PE ++ L+ DL  EE   ++    +  +V +RVC +L 
Sbjct: 398 HCEIITQRVLKTYFEEMVE--VPEGVEALISDLAAEELPSDIDGEAEAAIVAKRVCERLR 455

Query: 445 LWREVEPNTID 455
            WR+VE NTID
Sbjct: 456 SWRDVESNTID 466


>AT2G36420.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G03670.1); Has 10588 Blast hits to 6606
           proteins in 440 species: Archae - 8; Bacteria - 365;
           Metazoa - 4146; Fungi - 1198; Plants - 483; Viruses -
           212; Other Eukaryotes - 4176 (source: NCBI BLink). |
           chr2:15286498-15288990 FORWARD LENGTH=439
          Length = 439

 Score = 79.7 bits (195), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/160 (36%), Positives = 77/160 (48%), Gaps = 31/160 (19%)

Query: 1   MMAQNKQHLLHELLKEDQEPFLLKNYISHRRNQLKRPSQKPTHXXXXXXXXXXXXXXXXX 60
           M  + K+  LHE L++DQEPF L +YI + R+Q+     +                    
Sbjct: 1   MAEKEKKKHLHEFLEDDQEPFHLNHYIGNLRSQMGCSDMR-----------------VKK 43

Query: 61  XXXXHKXXXXXXXFLCRNACFLGA-----TTKSPLFELVKSPCRSP--SNAIFLQIPAKT 113
               +        F C N+CF  A       KSPLFEL +SP +       +FLQIPA+T
Sbjct: 44  RKSDNVATFPPGLFSCENSCFFAAHKSPDPRKSPLFEL-RSPGKKKIRDGRVFLQIPART 102

Query: 114 ASRLLDAALRIQKNQSKTKLPSNK-----NSFALLGSFFK 148
           A+ LLDAA RIQK QS+ K  +NK     N F + GS  K
Sbjct: 103 AAILLDAAARIQKQQSE-KAKTNKARTRGNGFGMFGSVLK 141



 Score = 75.5 bits (184), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 110/220 (50%), Gaps = 43/220 (19%)

Query: 259 QSPFRFVLQKSPSASSGHRTPEFSSAAASSSRCGTQDKENNVANGVNEFQSEEEKE---- 314
           +SPF FVLQ +PS SSGH+TP F+S A S +R  T+D++++    + + + +EE++    
Sbjct: 187 ESPFHFVLQTTPS-SSGHQTPHFTSTATSPARRSTEDEDSDETESLEKVRGQEEEDKEEE 245

Query: 315 ---QCSPVSILETPFXXXXXXXXXXXXXXXXXXXLECSYANVQRTKQQLLDRLSRFEKLA 371
              QCSPVS+L+ P                    L CS+  VQR K++LL +L RFEKLA
Sbjct: 246 DKEQCSPVSVLD-PLEEEEEDEDHHQHEPDPPNNLSCSFEIVQRAKRRLLKKLRRFEKLA 304

Query: 372 ELDAIELEKRM----------------------LDQEDEFVTYSEEDDDGETSCEEKTPE 409
            LD +ELE +M                       D ++E+     ED D   + E +  E
Sbjct: 305 GLDPVELEGKMSEEEDEEEEEYEESEEDDNIRIYDSDEEY-----EDVDEAMARESRCAE 359

Query: 410 DLKRLVHDLIMEEER-------ELGSSEDRNMVMRRVCRK 442
           D KR  +D   ++ R        LG+ ED + V+R+  R+
Sbjct: 360 DEKRKKNDERQKKWRMMNAWRVGLGAEEDVDAVVRKDLRE 399
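

Note on reading the per-alignment statistics: the "Score"/"Expect" and "Identities"/"Positives"/"Gaps" lines above can be extracted with a small script. The following is only a minimal sketch and is not part of the BLAST output. The function name parse_hsp_stats and the regular expressions are illustrative assumptions, written against the line layout seen in this report rather than any official format specification. The sample text is copied from the three alignment blocks above. In this report the printed percentages are consistent with integer truncation (for example, 166/491 is 33.8% but is shown as 33%), which the final loop checks.

```python
import re

# Hypothetical patterns, written against the line layout seen in this report.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
COUNT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_hsp_stats(text):
    """Return one dict per alignment block (HSP) found in `text`."""
    hsps = []
    for line in text.splitlines():
        m = SCORE_RE.search(line)
        if m:
            # A "Score = ..." line starts a new alignment block.
            hsps.append({
                "bits": float(m.group(1)),
                "raw_score": int(m.group(2)),
                "expect": float(m.group(3)),
            })
            continue
        # Count lines belong to the most recently opened block.
        for name, num, den, pct in COUNT_RE.findall(line):
            if hsps:
                hsps[-1][name.lower()] = (int(num), int(den), int(pct))
    return hsps


# Statistics lines copied from the report above.
SAMPLE = """\
 Score =  164 bits (414), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 166/491 (33%), Positives = 214/491 (43%), Gaps = 70/491 (14%)
 Score = 79.7 bits (195), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/160 (36%), Positives = 77/160 (48%), Gaps = 31/160 (19%)
 Score = 75.5 bits (184), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 110/220 (50%), Gaps = 43/220 (19%)
"""

hsps = parse_hsp_stats(SAMPLE)

# Check that each printed percentage equals the truncated integer ratio.
for hsp in hsps:
    for key in ("identities", "positives", "gaps"):
        num, den, pct = hsp[key]
        assert pct == num * 100 // den
```

Each returned dict holds the bit score, the raw score, the E-value, and a (count, alignment length, percent) tuple for identities, positives, and gaps.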