Miyakogusa Predicted Gene

Lj5g3v1206650.3

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v1206650.3 CUFF.55004.3
         (395 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G43745.1 | Symbols:  | Protein of unknown function (DUF1012) ...   380   e-105
AT5G02940.1 | Symbols:  | Protein of unknown function (DUF1012) ...   337   1e-92
AT5G49960.1 | Symbols:  | unknown protein; CONTAINS InterPro DOM...    53   3e-07

>AT5G43745.1 | Symbols:  | Protein of unknown function (DUF1012) |
           chr5:17569435-17574954 REVERSE LENGTH=817
          Length = 817

 Score =  380 bits (976), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 188/296 (63%), Positives = 218/296 (73%), Gaps = 12/296 (4%)

Query: 100 KRMIQYMSLYFILRLTRTKFVDLMIKVVQPMLQDMLQTLSAASLPLACVSNTLNKPTPLK 159
           K +I  + LY + R+ +        K+ Q  L  ++Q    A LP AC SN+L  PTPLK
Sbjct: 84  KVVIGCIPLYAVFRIAQ--------KICQE-LPRLVQNSVGAGLPFACASNSL--PTPLK 132

Query: 160 LDVSLPSLYDIRWSLARLLYLFNIQLERNVATFFVVLLIACFSFVVIGGLLFFKLRGNKQ 219
           LDVS PS  DIRW LAR LYLFNIQLE+N+ TF V L+IAC SFV+IGGLLFFK R +  
Sbjct: 133 LDVSFPSFQDIRWGLARFLYLFNIQLEKNIGTFLVALMIACVSFVIIGGLLFFKFRKD-L 191

Query: 220 SLEDCVWEAWACLCSSSTHLKQPTRIERVIGFLLAIWGILFYSRLLSTMTEQFRNNMQRL 279
            LEDC+WEAWACL SSSTHLKQ TRIERVIGF+LAIWGILFYSRLLSTMTEQFR NM +L
Sbjct: 192 PLEDCLWEAWACLISSSTHLKQKTRIERVIGFVLAIWGILFYSRLLSTMTEQFRYNMTKL 251

Query: 280 REGAQLQVLETDHIIVCGMNSHLPFILKQLNKYHEFSVRLGTATARKQRILLMSDLPRKQ 339
           REGAQ+QVLE DHII+CG+NSHLPFILKQLN YHE +VRLGTATARKQR+LLMSD PRKQ
Sbjct: 252 REGAQMQVLEADHIIICGINSHLPFILKQLNSYHEHAVRLGTATARKQRLLLMSDTPRKQ 311

Query: 340 IDRIADNIAKDLNHIDVXXXXXXXXXXXXFEXXXXXXXXXXXXLPTKGERFEVDTD 395
           +D++A+  +KD NHID+            FE            LPTKG+R+EVDTD
Sbjct: 312 MDKLAEAYSKDFNHIDILTKSCSLNLTKSFERAAASMARAIIILPTKGDRYEVDTD 367


>AT5G02940.1 | Symbols:  | Protein of unknown function (DUF1012) |
           chr5:684671-689674 REVERSE LENGTH=813
          Length = 813

 Score =  337 bits (863), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 178/367 (48%), Positives = 237/367 (64%), Gaps = 31/367 (8%)

Query: 34  RFMPWTKSSALHEY---GIRAHS-EGRWEVDSHRS-DVKSNSKTNKHVEENLGTESIWME 88
           R   + +S +LH     GI++ S  G ++V S R+ D +  +K  K +            
Sbjct: 23  RLASFHRSLSLHSLPLGGIKSSSFRGTFKVKSQRTGDTEPPNKNFKDL------------ 70

Query: 89  KNKSSSQGPQAKRMIQYMSLYFILRLTRTKFVDLMIKVVQPMLQDMLQTLSAASLPLACV 148
            N    +    K +I  + LY +LR+ +  F +L          +++Q    A LP AC 
Sbjct: 71  -NSKFYKSLPYKLVIGCIPLYAVLRIAQKIFQEL---------PNLIQNSVKAGLPFACA 120

Query: 149 SNTLNKPTPLKLDVSLPSLYDIRWSLARLLYLFNIQLERNVATFFVVLLIACFSFVVIGG 208
           SN ++K   LK   ++PS +DI+W LAR  YLFN QLE+N+ T FVVLLI CFSFV+IGG
Sbjct: 121 SNAIDKHPLLK---AIPSSHDIKWGLARSSYLFNTQLEKNLGTVFVVLLITCFSFVIIGG 177

Query: 209 LLFFKLRGNKQSLEDCVWEAWACLCSSSTHLKQPTRIERVIGFLLAIWGILFYSRLLSTM 268
           L FFK R +  SLEDC+WEAWACL ++ THL+Q TR ER+IGF+LAIWGI+FYSRLLSTM
Sbjct: 178 LFFFKFRKD-TSLEDCLWEAWACLVNADTHLEQKTRFERLIGFVLAIWGIVFYSRLLSTM 236

Query: 269 TEQFRNNMQRLREGAQLQVLETDHIIVCGMNSHLPFILKQLNKYHEFSVRLGTATARKQR 328
           TEQFR +M+++REGA +QVLE+DHII+CG+NSHLPFILKQLN Y + +VRLGT TARKQ 
Sbjct: 237 TEQFRYHMKKVREGAHMQVLESDHIIICGINSHLPFILKQLNSYQQHAVRLGTTTARKQT 296

Query: 329 ILLMSDLPRKQIDRIADNIAKDLNHIDVXXXXXXXXXXXXFEXXXXXXXXXXXXLPTKGE 388
           +LLMSD PRK++D++A+  AKD + +D+            FE            LPTKG+
Sbjct: 297 LLLMSDTPRKEMDKLAEAYAKDFDQLDILTKSCSLNMTKSFERAAACMARAIIILPTKGD 356

Query: 389 RFEVDTD 395
           R+EVDTD
Sbjct: 357 RYEVDTD 363


>AT5G49960.1 | Symbols:  | unknown protein; CONTAINS InterPro
           DOMAIN/s: Protein of unknown function DUF1012
           (InterPro:IPR010420); BEST Arabidopsis thaliana protein
           match is: Protein of unknown function (DUF1012)
           (TAIR:AT5G02940.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:20324173-20327687
           REVERSE LENGTH=824
          Length = 824

 Score = 53.1 bits (126), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 26/106 (24%), Positives = 58/106 (54%), Gaps = 3/106 (2%)

Query: 204 VVIGGLLFFKLRGNKQSLEDCVWEAWACLCSSSTHLKQPTRIERVIGFLLAIWGILFYSR 263
           +V GGL  + +  +   +++ +W +W  +  S +H  +     R++   ++  G+L ++ 
Sbjct: 208 IVYGGLALYAV--SDCGVDEALWLSWTFVADSGSHADRVGVGARIVSVAISAGGMLIFAT 265

Query: 264 LLSTMTEQFRNNMQRLREGAQLQVLETDHIIVCGMNSHLPFILKQL 309
           +L  +++     +  LR+G   +VLE++HI++ G +  L  +LKQL
Sbjct: 266 MLGLISDAISKMVDSLRKGKS-EVLESNHILILGWSDKLGSLLKQL 310