Miyakogusa Predicted Gene
- Lj0g3v0324209.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0324209.1 Non Characterized Hit- tr|D8SBN2|D8SBN2_SELML
Putative uncharacterized protein OS=Selaginella
moelle,32.14,5e-18,FAMILY NOT NAMED,NULL,CUFF.22059.1
         (284 letters)
Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
AT4G18975.4 | Symbols:  | Pentatricopeptide repeat (PPR) superfa...   323   7e-89
AT4G18975.3 | Symbols:  | Pentatricopeptide repeat (PPR) superfa...   323   7e-89
AT4G18975.1 | Symbols:  | Pentatricopeptide repeat (PPR) superfa...   323   7e-89
AT4G18975.2 | Symbols:  | Pentatricopeptide repeat (PPR) superfa...   320   1e-87
AT4G21190.1 | Symbols: emb1417 | Pentatricopeptide repeat (PPR) ...   178   4e-45
AT1G04590.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...   135   2e-32
AT1G04590.2 | Symbols:  | BEST Arabidopsis thaliana protein matc...   132   4e-31
>AT4G18975.4 | Symbols:  | Pentatricopeptide repeat (PPR)
           superfamily protein | chr4:10392170-10393665 REVERSE
           LENGTH=287
          Length = 287
 Score =  323 bits (829), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 165/279 (59%), Positives = 198/279 (70%), Gaps = 20/279 (7%)
Query: 9   ILFPLLPSRTCQANTDTKT---ALLWGTKFSTATITISPKTRCIYCMFVQSKLSQNVGDP 65
           I F LL S  C + +  KT        +KFS      + K    Y   V SK  + VG  
Sbjct: 25  ICFSLLQSPRCGSYSSLKTKRFGFCIRSKFSEKE---AGKLDRGYVATVNSKEIKKVG-- 79
Query: 66  XXXXXXXXXXXXXXXHHLWKKRDSAQSGQKALALVRTVSELPNEKEAVYGALDKWTAWET 125
                          HHLWKK DSA SGQKAL LVR +S LPNEKEAVYGAL+KW AWE 
Sbjct: 80  ------------KKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEV 127
Query: 126 EFPLIAVVKALKILRRKRQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAES 185
           EFP+IA  KAL+ILR++ QW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAES
Sbjct: 128 EFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAES 187
Query: 186 LWNMIIHAHMRSVSKRLFSRMISVYDHHNMPDKIVEVFADMEELQVKPDEDTVRKVASAF 245
           LWNMI+H H RS+ +RLF+RMI++Y HH++ DK++EVFADMEEL+V PDED+ R+VA AF
Sbjct: 188 LWNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAF 247
Query: 246 RNLGQEEKRKLVIKRYGLKWKYIHFNGERVRVRRETWED 284
           R L QEE RKL+++RY  ++KYI+FNGERVRV+R   ED
Sbjct: 248 RELNQEENRKLILRRYLSEYKYIYFNGERVRVKRYFSED 286
>AT4G18975.3 | Symbols:  | Pentatricopeptide repeat (PPR)
           superfamily protein | chr4:10392170-10393665 REVERSE
           LENGTH=287
          Length = 287
 Score =  323 bits (829), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 165/279 (59%), Positives = 198/279 (70%), Gaps = 20/279 (7%)
Query: 9   ILFPLLPSRTCQANTDTKT---ALLWGTKFSTATITISPKTRCIYCMFVQSKLSQNVGDP 65
           I F LL S  C + +  KT        +KFS      + K    Y   V SK  + VG  
Sbjct: 25  ICFSLLQSPRCGSYSSLKTKRFGFCIRSKFSEKE---AGKLDRGYVATVNSKEIKKVG-- 79
Query: 66  XXXXXXXXXXXXXXXHHLWKKRDSAQSGQKALALVRTVSELPNEKEAVYGALDKWTAWET 125
                          HHLWKK DSA SGQKAL LVR +S LPNEKEAVYGAL+KW AWE 
Sbjct: 80  ------------KKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEV 127
Query: 126 EFPLIAVVKALKILRRKRQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAES 185
           EFP+IA  KAL+ILR++ QW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAES
Sbjct: 128 EFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAES 187
Query: 186 LWNMIIHAHMRSVSKRLFSRMISVYDHHNMPDKIVEVFADMEELQVKPDEDTVRKVASAF 245
           LWNMI+H H RS+ +RLF+RMI++Y HH++ DK++EVFADMEEL+V PDED+ R+VA AF
Sbjct: 188 LWNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAF 247
Query: 246 RNLGQEEKRKLVIKRYGLKWKYIHFNGERVRVRRETWED 284
           R L QEE RKL+++RY  ++KYI+FNGERVRV+R   ED
Sbjct: 248 RELNQEENRKLILRRYLSEYKYIYFNGERVRVKRYFSED 286
>AT4G18975.1 | Symbols:  | Pentatricopeptide repeat (PPR)
           superfamily protein | chr4:10392170-10393665 REVERSE
           LENGTH=287
          Length = 287
 Score =  323 bits (829), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 165/279 (59%), Positives = 198/279 (70%), Gaps = 20/279 (7%)
Query: 9   ILFPLLPSRTCQANTDTKT---ALLWGTKFSTATITISPKTRCIYCMFVQSKLSQNVGDP 65
           I F LL S  C + +  KT        +KFS      + K    Y   V SK  + VG  
Sbjct: 25  ICFSLLQSPRCGSYSSLKTKRFGFCIRSKFSEKE---AGKLDRGYVATVNSKEIKKVG-- 79
Query: 66  XXXXXXXXXXXXXXXHHLWKKRDSAQSGQKALALVRTVSELPNEKEAVYGALDKWTAWET 125
                          HHLWKK DSA SGQKAL LVR +S LPNEKEAVYGAL+KW AWE 
Sbjct: 80  ------------KKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEV 127
Query: 126 EFPLIAVVKALKILRRKRQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAES 185
           EFP+IA  KAL+ILR++ QW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAES
Sbjct: 128 EFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAES 187
Query: 186 LWNMIIHAHMRSVSKRLFSRMISVYDHHNMPDKIVEVFADMEELQVKPDEDTVRKVASAF 245
           LWNMI+H H RS+ +RLF+RMI++Y HH++ DK++EVFADMEEL+V PDED+ R+VA AF
Sbjct: 188 LWNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAF 247
Query: 246 RNLGQEEKRKLVIKRYGLKWKYIHFNGERVRVRRETWED 284
           R L QEE RKL+++RY  ++KYI+FNGERVRV+R   ED
Sbjct: 248 RELNQEENRKLILRRYLSEYKYIYFNGERVRVKRYFSED 286
>AT4G18975.2 | Symbols:  | Pentatricopeptide repeat (PPR)
           superfamily protein | chr4:10392170-10393503 REVERSE
           LENGTH=260
          Length = 260
 Score =  320 bits (819), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 147/204 (72%), Positives = 175/204 (85%)
Query: 81  HHLWKKRDSAQSGQKALALVRTVSELPNEKEAVYGALDKWTAWETEFPLIAVVKALKILR 140
           HHLWKK DSA SGQKAL LVR +S LPNEKEAVYGAL+KW AWE EFP+IA  KAL+ILR
Sbjct: 56  HHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKALQILR 115
Query: 141 RKRQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLWNMIIHAHMRSVSK 200
           ++ QW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMI+H H RS+ +
Sbjct: 116 KRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHTHTRSIPR 175
Query: 201 RLFSRMISVYDHHNMPDKIVEVFADMEELQVKPDEDTVRKVASAFRNLGQEEKRKLVIKR 260
           RLF+RMI++Y HH++ DK++EVFADMEEL+V PDED+ R+VA AFR L QEE RKL+++R
Sbjct: 176 RLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEENRKLILRR 235
Query: 261 YGLKWKYIHFNGERVRVRRETWED 284
           Y  ++KYI+FNGERVRV+R   ED
Sbjct: 236 YLSEYKYIYFNGERVRVKRYFSED 259
>AT4G21190.1 | Symbols: emb1417 | Pentatricopeptide repeat (PPR)
           superfamily protein | chr4:11292493-11293763 REVERSE
           LENGTH=307
          Length = 307
 Score =  178 bits (452), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 90/197 (45%), Positives = 126/197 (63%), Gaps = 1/197 (0%)
Query: 83  LWKKRDSAQSGQKALALVRTVSELPNEKEAVYGALDKWTAWETEFPLIAVVKALKILRRK 142
           +WK R    +  KA  ++  +  L N KE VYGALD + AWE EFPL+ V KAL IL  +
Sbjct: 45  VWKTRKRIGTISKAAKMIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDE 104
Query: 143 RQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLWNMIIHAHMRSVSKRL 202
           ++W ++IQV KWMLSKGQG TMGTY +LL A   D R+DEAE LWN +   H+    ++ 
Sbjct: 105 KEWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKF 164
Query: 203 FSRMISVYDHHNMPDKIVEVFADMEELQVKPDEDTVRKVASAFRNLGQEEKRKLVIKRY- 261
           F++MIS+Y   +M  K+ EVFADMEEL VKP+   V  V   F  L  ++K + ++K+Y 
Sbjct: 165 FNKMISIYYKRDMHQKLFEVFADMEELGVKPNVAIVSMVGKVFVKLEMKDKYEKLMKKYP 224
Query: 262 GLKWKYIHFNGERVRVR 278
             +W++ +  G RV+V+
Sbjct: 225 PPQWEFRYIKGRRVKVK 241
>AT1G04590.1 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: Pentatricopeptide repeat (PPR) superfamily protein
           (TAIR:AT4G21190.1); Has 111 Blast hits to 111 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 109; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr1:1258760-1261411 REVERSE
           LENGTH=381
          Length = 381
 Score =  135 bits (341), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 68/164 (41%), Positives = 106/164 (64%), Gaps = 1/164 (0%)
Query: 99  LVRTVSELPNEKEAVYGALDKWTAWETEFPLIAVVKALKILRRKRQWVRVIQVAKWMLSK 158
           LV T+ ++ + KEAVYGALD W AWE  FP+ ++   +  L ++ QW R++QV KW+LSK
Sbjct: 149 LVNTLLDIEDNKEAVYGALDAWVAWERNFPIASLKIVIASLEKEHQWHRMVQVIKWILSK 208
Query: 159 GQGATMGTYDTLLLAFDMDQRVDEAESLWNMIIHAHMRSVSKRLFSRMISVYDHHNMPDK 218
           GQG TMGTY  L+ A DMD+R +EA  +W   +   + SV  +L  +M+ +Y  +NM  +
Sbjct: 209 GQGNTMGTYGQLIRALDMDRRAEEAHVIWRKKVGNDLHSVPWQLCLQMMRIYFRNNMLQE 268
Query: 219 IVEVFADMEELQVK-PDEDTVRKVASAFRNLGQEEKRKLVIKRY 261
           +V++F D+E    K PD+  V+ VA A+  LG  ++++ V+ +Y
Sbjct: 269 LVKLFKDLESYDRKPPDKHIVQTVADAYELLGMLDEKERVVTKY 312
>AT1G04590.2 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: Pentatricopeptide repeat (PPR) superfamily protein
           (TAIR:AT4G18975.4); Has 111 Blast hits to 111 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 109; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr1:1258760-1261411 REVERSE
           LENGTH=384
          Length = 384
 Score =  132 bits (331), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 69/167 (41%), Positives = 106/167 (63%), Gaps = 4/167 (2%)
Query: 99  LVRTVSELPNEKEAVYGALDKWTAWETEFPLIAVVKALKILRRKRQWVRVIQVAKWMLSK 158
           LV T+ ++ + KEAVYGALD W AWE  FP+ ++   +  L ++ QW R++QV KW+LSK
Sbjct: 149 LVNTLLDIEDNKEAVYGALDAWVAWERNFPIASLKIVIASLEKEHQWHRMVQVIKWILSK 208
Query: 159 GQGATMGTYDTLLLAFDMDQRVDEAESLWNMIIHAHMRSVSKRLFSRMISVYDHHNMPDK 218
           GQG TMGTY  L+ A DMD+R +EA  +W   +   + SV  +L  +M+ +Y  +NM  +
Sbjct: 209 GQGNTMGTYGQLIRALDMDRRAEEAHVIWRKKVGNDLHSVPWQLCLQMMRIYFRNNMLQE 268
Query: 219 IVEV---FADMEELQVK-PDEDTVRKVASAFRNLGQEEKRKLVIKRY 261
           +V+V   F D+E    K PD+  V+ VA A+  LG  ++++ V+ +Y
Sbjct: 269 LVKVMKLFKDLESYDRKPPDKHIVQTVADAYELLGMLDEKERVVTKY 315