Miyakogusa Predicted Gene

Lj6g3v1077810.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v1077810.1 Non Characterized Hit- tr|H9MAB2|H9MAB2_PINLA
Uncharacterized protein (Fragment) OS=Pinus
lambertian,40.87,1e-16,APO,APO domain; seg,NULL; coiled-coil,NULL;
UNCHARACTERIZED,NULL; EUKARYOTIC TRANSLATION INITIATION ,CUFF.58949.1
         (442 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G57930.1 | Symbols: APO2, emb1629 | Arabidopsis thaliana prot...   489   e-138
AT5G57930.2 | Symbols: APO2 | Arabidopsis thaliana protein of un...   489   e-138
AT1G64810.2 | Symbols: APO1 | Arabidopsis thaliana protein of un...   287   8e-78
AT1G64810.1 | Symbols: APO1 | Arabidopsis thaliana protein of un...   286   2e-77
AT5G61930.2 | Symbols: APO3 | Arabidopsis thaliana protein of un...   252   3e-67
AT5G61930.1 | Symbols: APO3 | Arabidopsis thaliana protein of un...   252   3e-67
AT3G21740.1 | Symbols: APO4 | Arabidopsis thaliana protein of un...   184   7e-47

>AT5G57930.1 | Symbols: APO2, emb1629 | Arabidopsis thaliana protein
           of unknown function (DUF794) | chr5:23454690-23456354
           FORWARD LENGTH=440
          Length = 440

 Score =  489 bits (1258), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 243/377 (64%), Positives = 287/377 (76%), Gaps = 3/377 (0%)

Query: 66  RQHALTIRNEVPQNADFPRRYSKKEKKPFPVPXXXXXXXXXXXXXXKMKNEPQKPLSLSA 125
           R   L +RN+ PQN D P++Y+++EKKPFPVP                 N+ +    L  
Sbjct: 67  RSLPLVVRNDRPQNEDLPKQYTRREKKPFPVPIVDLRRAARERVK---NNKDKPKRPLPP 123

Query: 126 PKNGLLVKKLIPTAYKVYNSRITLINNXXXXXXXXXXHACGYCSEIHVGPVGHPFKSCRG 185
           PKNG++VK L+P AYKVYN+RI LINN          +ACG+C+EIHVGP GHPFKSC+G
Sbjct: 124 PKNGMVVKSLVPLAYKVYNARIRLINNLHRLMKVVRVNACGWCNEIHVGPYGHPFKSCKG 183

Query: 186 TQANIRKGLHEWTNAHFEDILTPVEAYHLSDRLGKRITHEERFSIPRIPAVVELCIQAGV 245
              + RKGLHEWTN+  ED++ P+EAYHL DRLGKRI H+ERFSIPR+PAVVELCIQ GV
Sbjct: 184 PNTSQRKGLHEWTNSVIEDVIVPLEAYHLFDRLGKRIRHDERFSIPRVPAVVELCIQGGV 243

Query: 246 EIPEYPTXXXXXXXXXXXXXEYVDADESELPDQMPENPPKLLLTEIPDSEIVAPVDKEEI 305
           EIPE+P              E+VDADE+ELPD  P+ PP  LLTE+P SEI  P  +EE 
Sbjct: 244 EIPEFPAKRRRKPIIRIGKSEFVDADETELPDPEPQPPPVPLLTELPVSEITPPSSEEET 303

Query: 306 VQLAEETLQAWERMRKGAKRLMGMYRVRVCGYCPEIHVGPQGHKAQNCGAHKHQQRNGQH 365
           V LAEETLQAWE MR GAK+LM MYRVRVCGYCPE+HVGP GHKAQNCGA KHQQRNGQH
Sbjct: 304 VSLAEETLQAWEEMRAGAKKLMRMYRVRVCGYCPEVHVGPTGHKAQNCGAFKHQQRNGQH 363

Query: 366 GWQSSVLNDLIPPRFVWHVPDVNGPPLQRELREFYGQAPAVVEMCIQAGAALPEQYKSTM 425
           GWQS+VL+DLIPPR+VWHVPDVNGPP+QRELR FYGQAPAVVE+C QAGA +PE Y++TM
Sbjct: 364 GWQSAVLDDLIPPRYVWHVPDVNGPPMQRELRSFYGQAPAVVEICAQAGAVVPEHYRATM 423

Query: 426 RLDVGIPSTMQEAEMVV 442
           RL+VGIPS+++EAEMVV
Sbjct: 424 RLEVGIPSSVKEAEMVV 440


>AT5G57930.2 | Symbols: APO2 | Arabidopsis thaliana protein of
           unknown function (DUF794) | chr5:23454690-23456354
           FORWARD LENGTH=443
          Length = 443

 Score =  489 bits (1258), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 243/377 (64%), Positives = 287/377 (76%), Gaps = 3/377 (0%)

Query: 66  RQHALTIRNEVPQNADFPRRYSKKEKKPFPVPXXXXXXXXXXXXXXKMKNEPQKPLSLSA 125
           R   L +RN+ PQN D P++Y+++EKKPFPVP                 N+ +    L  
Sbjct: 70  RSLPLVVRNDRPQNEDLPKQYTRREKKPFPVPIVDLRRAARERVK---NNKDKPKRPLPP 126

Query: 126 PKNGLLVKKLIPTAYKVYNSRITLINNXXXXXXXXXXHACGYCSEIHVGPVGHPFKSCRG 185
           PKNG++VK L+P AYKVYN+RI LINN          +ACG+C+EIHVGP GHPFKSC+G
Sbjct: 127 PKNGMVVKSLVPLAYKVYNARIRLINNLHRLMKVVRVNACGWCNEIHVGPYGHPFKSCKG 186

Query: 186 TQANIRKGLHEWTNAHFEDILTPVEAYHLSDRLGKRITHEERFSIPRIPAVVELCIQAGV 245
              + RKGLHEWTN+  ED++ P+EAYHL DRLGKRI H+ERFSIPR+PAVVELCIQ GV
Sbjct: 187 PNTSQRKGLHEWTNSVIEDVIVPLEAYHLFDRLGKRIRHDERFSIPRVPAVVELCIQGGV 246

Query: 246 EIPEYPTXXXXXXXXXXXXXEYVDADESELPDQMPENPPKLLLTEIPDSEIVAPVDKEEI 305
           EIPE+P              E+VDADE+ELPD  P+ PP  LLTE+P SEI  P  +EE 
Sbjct: 247 EIPEFPAKRRRKPIIRIGKSEFVDADETELPDPEPQPPPVPLLTELPVSEITPPSSEEET 306

Query: 306 VQLAEETLQAWERMRKGAKRLMGMYRVRVCGYCPEIHVGPQGHKAQNCGAHKHQQRNGQH 365
           V LAEETLQAWE MR GAK+LM MYRVRVCGYCPE+HVGP GHKAQNCGA KHQQRNGQH
Sbjct: 307 VSLAEETLQAWEEMRAGAKKLMRMYRVRVCGYCPEVHVGPTGHKAQNCGAFKHQQRNGQH 366

Query: 366 GWQSSVLNDLIPPRFVWHVPDVNGPPLQRELREFYGQAPAVVEMCIQAGAALPEQYKSTM 425
           GWQS+VL+DLIPPR+VWHVPDVNGPP+QRELR FYGQAPAVVE+C QAGA +PE Y++TM
Sbjct: 367 GWQSAVLDDLIPPRYVWHVPDVNGPPMQRELRSFYGQAPAVVEICAQAGAVVPEHYRATM 426

Query: 426 RLDVGIPSTMQEAEMVV 442
           RL+VGIPS+++EAEMVV
Sbjct: 427 RLEVGIPSSVKEAEMVV 443


>AT1G64810.2 | Symbols: APO1 | Arabidopsis thaliana protein of
           unknown function (DUF794) | chr1:24086810-24088276
           FORWARD LENGTH=460
          Length = 460

 Score =  287 bits (735), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 153/381 (40%), Positives = 211/381 (55%), Gaps = 12/381 (3%)

Query: 67  QHALTIRNEVPQNADFPRRYSKKEKKPFPVPXXX-XXXXXXXXXXXKMKNEPQKPLSLSA 125
           Q +   R    QN D P    K +KKP+P+P               +M  E Q    L  
Sbjct: 85  QTSFKKRYVSTQNVDLPPILPKNKKKPYPIPFKQIQEEARKDKKLAQMGIEKQ----LDP 140

Query: 126 PKNGLLVKKLIPTAYKVYNSRITLINNXXXXXXXXXXHACGYCSEIHVGPVGHPFKSCRG 185
           PKNGLLV  L+P A +V ++   LI             AC  C  +HV  VGH  + C G
Sbjct: 141 PKNGLLVPNLVPVADQVIDNWKLLIKGLAQLLHVVPVFACSECGAVHVANVGHNIRDCNG 200

Query: 186 TQANIRKGLHEWTNAHFEDILTPVEAYHLSDRLGKRITHEERFSIPRIPAVVELCIQAGV 245
              + R+G H W      D+L PVE+YH+ D  G+RI HE RF   RIPA+VELCIQAGV
Sbjct: 201 PTNSQRRGSHSWVKGTINDVLIPVESYHMYDPFGRRIKHETRFEYERIPALVELCIQAGV 260

Query: 246 EIPEYPTXXXXXXXXXXXXXEYVDADESELPDQMPENPPKLL-----LTEIPDSEIVAPV 300
           EIPEYP                +D        + P+    L      L  +   E   P 
Sbjct: 261 EIPEYPCRRRTQPIRMMGK-RVIDRGGYHKEPEKPQTSSSLSSPLAELDTLGVFERYPPP 319

Query: 301 DKEEIVQLAEETLQAWERMRKGAKRLMGMYRVRVCGYCPEIHVGPQGHKAQNCGAHKHQQ 360
             E+I ++A+ET+ A+E++R G  +LM  + V+ CGYC E+HVGP GH  + CG  KHQ 
Sbjct: 320 TPEDIPKIAQETMDAYEKVRLGVTKLMRKFTVKACGYCSEVHVGPWGHSVKLCGEFKHQW 379

Query: 361 RNGQHGWQSSVLNDLIPPRFVWHVPDVNGPPLQRELREFYGQAPAVVEMCIQAGAALPEQ 420
           R+G+HGWQ ++++++ PP +VWHV D+ G PL   LR FYG+APA+VE+C+ +GA +P++
Sbjct: 380 RDGKHGWQDALVDEVFPPNYVWHVRDLKGNPLTGNLRRFYGKAPALVEICMHSGARVPQR 439

Query: 421 YKSTMRLDVGIPSTMQEAEMV 441
           YK+ MRLD+ +P + QEA+MV
Sbjct: 440 YKAMMRLDIIVPDS-QEADMV 459


>AT1G64810.1 | Symbols: APO1 | Arabidopsis thaliana protein of
           unknown function (DUF794) | chr1:24086882-24088276
           FORWARD LENGTH=436
          Length = 436

 Score =  286 bits (733), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 153/381 (40%), Positives = 211/381 (55%), Gaps = 12/381 (3%)

Query: 67  QHALTIRNEVPQNADFPRRYSKKEKKPFPVPXXX-XXXXXXXXXXXKMKNEPQKPLSLSA 125
           Q +   R    QN D P    K +KKP+P+P               +M  E Q    L  
Sbjct: 61  QTSFKKRYVSTQNVDLPPILPKNKKKPYPIPFKQIQEEARKDKKLAQMGIEKQ----LDP 116

Query: 126 PKNGLLVKKLIPTAYKVYNSRITLINNXXXXXXXXXXHACGYCSEIHVGPVGHPFKSCRG 185
           PKNGLLV  L+P A +V ++   LI             AC  C  +HV  VGH  + C G
Sbjct: 117 PKNGLLVPNLVPVADQVIDNWKLLIKGLAQLLHVVPVFACSECGAVHVANVGHNIRDCNG 176

Query: 186 TQANIRKGLHEWTNAHFEDILTPVEAYHLSDRLGKRITHEERFSIPRIPAVVELCIQAGV 245
              + R+G H W      D+L PVE+YH+ D  G+RI HE RF   RIPA+VELCIQAGV
Sbjct: 177 PTNSQRRGSHSWVKGTINDVLIPVESYHMYDPFGRRIKHETRFEYERIPALVELCIQAGV 236

Query: 246 EIPEYPTXXXXXXXXXXXXXEYVDADESELPDQMPENPPKLL-----LTEIPDSEIVAPV 300
           EIPEYP                +D        + P+    L      L  +   E   P 
Sbjct: 237 EIPEYPCRRRTQPIRMMGK-RVIDRGGYHKEPEKPQTSSSLSSPLAELDTLGVFERYPPP 295

Query: 301 DKEEIVQLAEETLQAWERMRKGAKRLMGMYRVRVCGYCPEIHVGPQGHKAQNCGAHKHQQ 360
             E+I ++A+ET+ A+E++R G  +LM  + V+ CGYC E+HVGP GH  + CG  KHQ 
Sbjct: 296 TPEDIPKIAQETMDAYEKVRLGVTKLMRKFTVKACGYCSEVHVGPWGHSVKLCGEFKHQW 355

Query: 361 RNGQHGWQSSVLNDLIPPRFVWHVPDVNGPPLQRELREFYGQAPAVVEMCIQAGAALPEQ 420
           R+G+HGWQ ++++++ PP +VWHV D+ G PL   LR FYG+APA+VE+C+ +GA +P++
Sbjct: 356 RDGKHGWQDALVDEVFPPNYVWHVRDLKGNPLTGNLRRFYGKAPALVEICMHSGARVPQR 415

Query: 421 YKSTMRLDVGIPSTMQEAEMV 441
           YK+ MRLD+ +P + QEA+MV
Sbjct: 416 YKAMMRLDIIVPDS-QEADMV 435


>AT5G61930.2 | Symbols: APO3 | Arabidopsis thaliana protein of
           unknown function (DUF794) | chr5:24866230-24867665
           REVERSE LENGTH=402
          Length = 402

 Score =  252 bits (644), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 136/362 (37%), Positives = 200/362 (55%), Gaps = 9/362 (2%)

Query: 74  NEVPQNADFPRR-YSKKEKKPFPVPXXXXXXXXXXXXXXKMKNEPQKPLSLSAPKNGLLV 132
           +E P  AD P+    K E+KP+P P              + K +P + L    P NGLLV
Sbjct: 38  DEDPLYADVPKPPKDKSERKPYPTPMKELIRRAKEEKQLR-KLQPCRVLE-DPPDNGLLV 95

Query: 133 KKLIPTAYKVYNSRITLINNXXXXXXXXXXHACGYCSEIHVGPVGHPFKSCRGTQANIRK 192
            +L+  A+ V+  R  L++           H C  C+E+H+G  GH  ++C G  +  R 
Sbjct: 96  PELVDVAHCVHRCRNMLLSGLSKIIHHVPVHRCRLCAEVHIGKQGHEIRTCTGPGSGSRS 155

Query: 193 GLHEWTNAHFEDILTPVEAYHLSDRLGK-RITHEERFSIPRIPAVVELCIQAGVEIPEYP 251
             H W      D++   + +HL DR  K R+ H+ERF++P+I AV+ELCIQAGV++ ++P
Sbjct: 156 ATHVWKRGRVSDVVLFPKCFHLYDRAVKPRVIHDERFTVPKISAVLELCIQAGVDLEKFP 215

Query: 252 TXXXXXXXXXXXXXEYVDADESELPDQMPENPPKLLLTEIPDSEIVAPVDKEEIVQLAEE 311
           +             E    D  ++ D   E       T I + +     +K+ + +L+ E
Sbjct: 216 SKRRSKPVYSI---EGRIVDFEDVNDGNSELAVTSTTTLIQEDDRCKE-EKKSLKELSFE 271

Query: 312 TLQAWERMRKGAKRLMGMYRVRVCGYCPEIHVGPQGHKAQNCGAHKHQQRNGQHGWQSSV 371
           T+++W  M  G ++LM  YRV  CGYCPEI VGP+GHK + C A KHQ R+G H WQ + 
Sbjct: 272 TMESWFEMVLGVRKLMERYRVWTCGYCPEIQVGPKGHKVRMCKATKHQMRDGMHAWQEAT 331

Query: 372 LNDLIPPRFVWHVPD-VNGPPLQRELREFYGQAPAVVEMCIQAGAALPEQYKSTMRLDVG 430
           ++D++ P +VWHV D  +G  L   L+ FYG+APAV+EMC+Q GA +P+QY S MRLDV 
Sbjct: 332 IDDVVGPTYVWHVRDPTDGSVLDNSLKRFYGKAPAVIEMCVQGGAPVPDQYNSMMRLDVV 391

Query: 431 IP 432
            P
Sbjct: 392 YP 393


>AT5G61930.1 | Symbols: APO3 | Arabidopsis thaliana protein of
           unknown function (DUF794) | chr5:24866230-24867665
           REVERSE LENGTH=402
          Length = 402

 Score =  252 bits (644), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 136/362 (37%), Positives = 200/362 (55%), Gaps = 9/362 (2%)

Query: 74  NEVPQNADFPRR-YSKKEKKPFPVPXXXXXXXXXXXXXXKMKNEPQKPLSLSAPKNGLLV 132
           +E P  AD P+    K E+KP+P P              + K +P + L    P NGLLV
Sbjct: 38  DEDPLYADVPKPPKDKSERKPYPTPMKELIRRAKEEKQLR-KLQPCRVLE-DPPDNGLLV 95

Query: 133 KKLIPTAYKVYNSRITLINNXXXXXXXXXXHACGYCSEIHVGPVGHPFKSCRGTQANIRK 192
            +L+  A+ V+  R  L++           H C  C+E+H+G  GH  ++C G  +  R 
Sbjct: 96  PELVDVAHCVHRCRNMLLSGLSKIIHHVPVHRCRLCAEVHIGKQGHEIRTCTGPGSGSRS 155

Query: 193 GLHEWTNAHFEDILTPVEAYHLSDRLGK-RITHEERFSIPRIPAVVELCIQAGVEIPEYP 251
             H W      D++   + +HL DR  K R+ H+ERF++P+I AV+ELCIQAGV++ ++P
Sbjct: 156 ATHVWKRGRVSDVVLFPKCFHLYDRAVKPRVIHDERFTVPKISAVLELCIQAGVDLEKFP 215

Query: 252 TXXXXXXXXXXXXXEYVDADESELPDQMPENPPKLLLTEIPDSEIVAPVDKEEIVQLAEE 311
           +             E    D  ++ D   E       T I + +     +K+ + +L+ E
Sbjct: 216 SKRRSKPVYSI---EGRIVDFEDVNDGNSELAVTSTTTLIQEDDRCKE-EKKSLKELSFE 271

Query: 312 TLQAWERMRKGAKRLMGMYRVRVCGYCPEIHVGPQGHKAQNCGAHKHQQRNGQHGWQSSV 371
           T+++W  M  G ++LM  YRV  CGYCPEI VGP+GHK + C A KHQ R+G H WQ + 
Sbjct: 272 TMESWFEMVLGVRKLMERYRVWTCGYCPEIQVGPKGHKVRMCKATKHQMRDGMHAWQEAT 331

Query: 372 LNDLIPPRFVWHVPD-VNGPPLQRELREFYGQAPAVVEMCIQAGAALPEQYKSTMRLDVG 430
           ++D++ P +VWHV D  +G  L   L+ FYG+APAV+EMC+Q GA +P+QY S MRLDV 
Sbjct: 332 IDDVVGPTYVWHVRDPTDGSVLDNSLKRFYGKAPAVIEMCVQGGAPVPDQYNSMMRLDVV 391

Query: 431 IP 432
            P
Sbjct: 392 YP 393


>AT3G21740.1 | Symbols: APO4 | Arabidopsis thaliana protein of
           unknown function (DUF794) | chr3:7662542-7663638 REVERSE
           LENGTH=337
          Length = 337

 Score =  184 bits (468), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 107/298 (35%), Positives = 158/298 (53%), Gaps = 35/298 (11%)

Query: 132 VKKLIPTAYKVYNSRITLINNXXXXXXXXXXHACGYCSEIHVGPVGHPFKSCRGTQANIR 191
           VK+++P A ++  +R  LI+N            C +CSE+ VG  GH  ++CR   + IR
Sbjct: 55  VKEIVPVAEEILIARKNLISNIAALLKVFPVLTCKFCSEVFVGKEGHLIETCR---SYIR 111

Query: 192 KG---LHEWTNAHFEDILTPVEAYHLSDRLGKRITHEERFSIPRIPAVVELCIQAGVEIP 248
           +G   LHEW      DIL PVE+YHL +     I H+ERF   R+PA++ELC QAG   P
Sbjct: 112 RGNNRLHEWVPGSINDILVPVESYHLHNISQGVIRHQERFDYDRVPAILELCCQAGAIHP 171

Query: 249 EYPTXXXXXXXXXXXXXEYVDADESELPDQMPENPPKLLLTEIPDSEIVAPVDKEEIVQL 308
           E                +Y     SE+ D    NP      +I + +I + +   ++  +
Sbjct: 172 E-------------EILQY-----SEIHD----NP------QISEEDIRS-LPAGDLKYV 202

Query: 309 AEETLQAWERMRKGAKRLMGMYRVRVCGYCPEIHVGPQGHKAQNCGAHKHQQRNGQHGWQ 368
               L AWE++R G K+L+ +Y  +VC  C E+HVGP GHKA+ CG  K++   G H W+
Sbjct: 203 GANALMAWEKVRAGVKKLLLVYPSKVCKRCKEVHVGPSGHKARLCGVFKYESWRGTHYWE 262

Query: 369 SSVLNDLIPPRFVWHVPDVNGPPLQRELREFYGQAPAVVEMCIQAGAALPEQYKSTMR 426
            + +NDL+P + VWH    +   L  E R +YG APA+V +C   GA +P +Y   M+
Sbjct: 263 KAGVNDLVPEKMVWHRRPQDPVVLVDEGRSYYGHAPAIVSLCSHTGAIVPVKYACKMK 320