Miyakogusa Predicted Gene
- Lj0g3v0324209.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0324209.1 Non Characterized Hit- tr|D8SBN2|D8SBN2_SELML
Putative uncharacterized protein OS=Selaginella
moelle,32.14,5e-18,FAMILY NOT NAMED,NULL,CUFF.22059.1
(284 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
AT4G18975.4 | Symbols: | Pentatricopeptide repeat (PPR) superfa...   323   7e-89
AT4G18975.3 | Symbols: | Pentatricopeptide repeat (PPR) superfa...   323   7e-89
AT4G18975.1 | Symbols: | Pentatricopeptide repeat (PPR) superfa...   323   7e-89
AT4G18975.2 | Symbols: | Pentatricopeptide repeat (PPR) superfa...   320   1e-87
AT4G21190.1 | Symbols: emb1417 | Pentatricopeptide repeat (PPR) ...  178   4e-45
AT1G04590.1 | Symbols: | BEST Arabidopsis thaliana protein matc...   135   2e-32
AT1G04590.2 | Symbols: | BEST Arabidopsis thaliana protein matc...   132   4e-31
>AT4G18975.4 | Symbols: | Pentatricopeptide repeat (PPR)
superfamily protein | chr4:10392170-10393665 REVERSE
LENGTH=287
Length = 287
Score = 323 bits (829), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 165/279 (59%), Positives = 198/279 (70%), Gaps = 20/279 (7%)
Query: 9 ILFPLLPSRTCQANTDTKT---ALLWGTKFSTATITISPKTRCIYCMFVQSKLSQNVGDP 65
I F LL S C + + KT +KFS + K Y V SK + VG
Sbjct: 25 ICFSLLQSPRCGSYSSLKTKRFGFCIRSKFSEKE---AGKLDRGYVATVNSKEIKKVG-- 79
Query: 66 XXXXXXXXXXXXXXXHHLWKKRDSAQSGQKALALVRTVSELPNEKEAVYGALDKWTAWET 125
HHLWKK DSA SGQKAL LVR +S LPNEKEAVYGAL+KW AWE
Sbjct: 80 ------------KKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEV 127
Query: 126 EFPLIAVVKALKILRRKRQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAES 185
EFP+IA KAL+ILR++ QW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAES
Sbjct: 128 EFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAES 187
Query: 186 LWNMIIHAHMRSVSKRLFSRMISVYDHHNMPDKIVEVFADMEELQVKPDEDTVRKVASAF 245
LWNMI+H H RS+ +RLF+RMI++Y HH++ DK++EVFADMEEL+V PDED+ R+VA AF
Sbjct: 188 LWNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAF 247
Query: 246 RNLGQEEKRKLVIKRYGLKWKYIHFNGERVRVRRETWED 284
R L QEE RKL+++RY ++KYI+FNGERVRV+R ED
Sbjct: 248 RELNQEENRKLILRRYLSEYKYIYFNGERVRVKRYFSED 286
>AT4G18975.3 | Symbols: | Pentatricopeptide repeat (PPR)
superfamily protein | chr4:10392170-10393665 REVERSE
LENGTH=287
Length = 287
Score = 323 bits (829), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 165/279 (59%), Positives = 198/279 (70%), Gaps = 20/279 (7%)
Query: 9 ILFPLLPSRTCQANTDTKT---ALLWGTKFSTATITISPKTRCIYCMFVQSKLSQNVGDP 65
I F LL S C + + KT +KFS + K Y V SK + VG
Sbjct: 25 ICFSLLQSPRCGSYSSLKTKRFGFCIRSKFSEKE---AGKLDRGYVATVNSKEIKKVG-- 79
Query: 66 XXXXXXXXXXXXXXXHHLWKKRDSAQSGQKALALVRTVSELPNEKEAVYGALDKWTAWET 125
HHLWKK DSA SGQKAL LVR +S LPNEKEAVYGAL+KW AWE
Sbjct: 80 ------------KKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEV 127
Query: 126 EFPLIAVVKALKILRRKRQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAES 185
EFP+IA KAL+ILR++ QW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAES
Sbjct: 128 EFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAES 187
Query: 186 LWNMIIHAHMRSVSKRLFSRMISVYDHHNMPDKIVEVFADMEELQVKPDEDTVRKVASAF 245
LWNMI+H H RS+ +RLF+RMI++Y HH++ DK++EVFADMEEL+V PDED+ R+VA AF
Sbjct: 188 LWNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAF 247
Query: 246 RNLGQEEKRKLVIKRYGLKWKYIHFNGERVRVRRETWED 284
R L QEE RKL+++RY ++KYI+FNGERVRV+R ED
Sbjct: 248 RELNQEENRKLILRRYLSEYKYIYFNGERVRVKRYFSED 286
>AT4G18975.1 | Symbols: | Pentatricopeptide repeat (PPR)
superfamily protein | chr4:10392170-10393665 REVERSE
LENGTH=287
Length = 287
Score = 323 bits (829), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 165/279 (59%), Positives = 198/279 (70%), Gaps = 20/279 (7%)
Query: 9 ILFPLLPSRTCQANTDTKT---ALLWGTKFSTATITISPKTRCIYCMFVQSKLSQNVGDP 65
I F LL S C + + KT +KFS + K Y V SK + VG
Sbjct: 25 ICFSLLQSPRCGSYSSLKTKRFGFCIRSKFSEKE---AGKLDRGYVATVNSKEIKKVG-- 79
Query: 66 XXXXXXXXXXXXXXXHHLWKKRDSAQSGQKALALVRTVSELPNEKEAVYGALDKWTAWET 125
HHLWKK DSA SGQKAL LVR +S LPNEKEAVYGAL+KW AWE
Sbjct: 80 ------------KKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEV 127
Query: 126 EFPLIAVVKALKILRRKRQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAES 185
EFP+IA KAL+ILR++ QW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAES
Sbjct: 128 EFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAES 187
Query: 186 LWNMIIHAHMRSVSKRLFSRMISVYDHHNMPDKIVEVFADMEELQVKPDEDTVRKVASAF 245
LWNMI+H H RS+ +RLF+RMI++Y HH++ DK++EVFADMEEL+V PDED+ R+VA AF
Sbjct: 188 LWNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAF 247
Query: 246 RNLGQEEKRKLVIKRYGLKWKYIHFNGERVRVRRETWED 284
R L QEE RKL+++RY ++KYI+FNGERVRV+R ED
Sbjct: 248 RELNQEENRKLILRRYLSEYKYIYFNGERVRVKRYFSED 286
>AT4G18975.2 | Symbols: | Pentatricopeptide repeat (PPR)
superfamily protein | chr4:10392170-10393503 REVERSE
LENGTH=260
Length = 260
Score = 320 bits (819), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 147/204 (72%), Positives = 175/204 (85%)
Query: 81 HHLWKKRDSAQSGQKALALVRTVSELPNEKEAVYGALDKWTAWETEFPLIAVVKALKILR 140
HHLWKK DSA SGQKAL LVR +S LPNEKEAVYGAL+KW AWE EFP+IA KAL+ILR
Sbjct: 56 HHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKALQILR 115
Query: 141 RKRQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLWNMIIHAHMRSVSK 200
++ QW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMI+H H RS+ +
Sbjct: 116 KRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHTHTRSIPR 175
Query: 201 RLFSRMISVYDHHNMPDKIVEVFADMEELQVKPDEDTVRKVASAFRNLGQEEKRKLVIKR 260
RLF+RMI++Y HH++ DK++EVFADMEEL+V PDED+ R+VA AFR L QEE RKL+++R
Sbjct: 176 RLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEENRKLILRR 235
Query: 261 YGLKWKYIHFNGERVRVRRETWED 284
Y ++KYI+FNGERVRV+R ED
Sbjct: 236 YLSEYKYIYFNGERVRVKRYFSED 259
>AT4G21190.1 | Symbols: emb1417 | Pentatricopeptide repeat (PPR)
superfamily protein | chr4:11292493-11293763 REVERSE
LENGTH=307
Length = 307
Score = 178 bits (452), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 90/197 (45%), Positives = 126/197 (63%), Gaps = 1/197 (0%)
Query: 83 LWKKRDSAQSGQKALALVRTVSELPNEKEAVYGALDKWTAWETEFPLIAVVKALKILRRK 142
+WK R + KA ++ + L N KE VYGALD + AWE EFPL+ V KAL IL +
Sbjct: 45 VWKTRKRIGTISKAAKMIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDE 104
Query: 143 RQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLWNMIIHAHMRSVSKRL 202
++W ++IQV KWMLSKGQG TMGTY +LL A D R+DEAE LWN + H+ ++
Sbjct: 105 KEWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKF 164
Query: 203 FSRMISVYDHHNMPDKIVEVFADMEELQVKPDEDTVRKVASAFRNLGQEEKRKLVIKRY- 261
F++MIS+Y +M K+ EVFADMEEL VKP+ V V F L ++K + ++K+Y
Sbjct: 165 FNKMISIYYKRDMHQKLFEVFADMEELGVKPNVAIVSMVGKVFVKLEMKDKYEKLMKKYP 224
Query: 262 GLKWKYIHFNGERVRVR 278
+W++ + G RV+V+
Sbjct: 225 PPQWEFRYIKGRRVKVK 241
>AT1G04590.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: Pentatricopeptide repeat (PPR) superfamily protein
(TAIR:AT4G21190.1); Has 111 Blast hits to 111 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 109; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:1258760-1261411 REVERSE
LENGTH=381
Length = 381
Score = 135 bits (341), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 68/164 (41%), Positives = 106/164 (64%), Gaps = 1/164 (0%)
Query: 99 LVRTVSELPNEKEAVYGALDKWTAWETEFPLIAVVKALKILRRKRQWVRVIQVAKWMLSK 158
LV T+ ++ + KEAVYGALD W AWE FP+ ++ + L ++ QW R++QV KW+LSK
Sbjct: 149 LVNTLLDIEDNKEAVYGALDAWVAWERNFPIASLKIVIASLEKEHQWHRMVQVIKWILSK 208
Query: 159 GQGATMGTYDTLLLAFDMDQRVDEAESLWNMIIHAHMRSVSKRLFSRMISVYDHHNMPDK 218
GQG TMGTY L+ A DMD+R +EA +W + + SV +L +M+ +Y +NM +
Sbjct: 209 GQGNTMGTYGQLIRALDMDRRAEEAHVIWRKKVGNDLHSVPWQLCLQMMRIYFRNNMLQE 268
Query: 219 IVEVFADMEELQVK-PDEDTVRKVASAFRNLGQEEKRKLVIKRY 261
+V++F D+E K PD+ V+ VA A+ LG ++++ V+ +Y
Sbjct: 269 LVKLFKDLESYDRKPPDKHIVQTVADAYELLGMLDEKERVVTKY 312
>AT1G04590.2 | Symbols: | BEST Arabidopsis thaliana protein match
is: Pentatricopeptide repeat (PPR) superfamily protein
(TAIR:AT4G18975.4); Has 111 Blast hits to 111 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 109; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:1258760-1261411 REVERSE
LENGTH=384
Length = 384
Score = 132 bits (331), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 69/167 (41%), Positives = 106/167 (63%), Gaps = 4/167 (2%)
Query: 99 LVRTVSELPNEKEAVYGALDKWTAWETEFPLIAVVKALKILRRKRQWVRVIQVAKWMLSK 158
LV T+ ++ + KEAVYGALD W AWE FP+ ++ + L ++ QW R++QV KW+LSK
Sbjct: 149 LVNTLLDIEDNKEAVYGALDAWVAWERNFPIASLKIVIASLEKEHQWHRMVQVIKWILSK 208
Query: 159 GQGATMGTYDTLLLAFDMDQRVDEAESLWNMIIHAHMRSVSKRLFSRMISVYDHHNMPDK 218
GQG TMGTY L+ A DMD+R +EA +W + + SV +L +M+ +Y +NM +
Sbjct: 209 GQGNTMGTYGQLIRALDMDRRAEEAHVIWRKKVGNDLHSVPWQLCLQMMRIYFRNNMLQE 268
Query: 219 IVEV---FADMEELQVK-PDEDTVRKVASAFRNLGQEEKRKLVIKRY 261
+V+V F D+E K PD+ V+ VA A+ LG ++++ V+ +Y
Sbjct: 269 LVKVMKLFKDLESYDRKPPDKHIVQTVADAYELLGMLDEKERVVTKY 315