Miyakogusa Predicted Gene - Lj3g3v2476560.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v2476560.1 tr|G8JGY3|G8JGY3_ARAHA At2g46550-like protein
(Fragment) OS=Arabidopsis halleri PE=4 SV=1,38.69,1e-18,seg,NULL;
coiled-coil,NULL,CUFF.44049.1
(422 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
AT2G46550.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   148   6e-36
AT2G46550.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   122   3e-28
AT1G01240.3 | Symbols: | unknown protein; BEST Arabidopsis thal...   109   4e-24
AT1G01240.2 | Symbols: | unknown protein; BEST Arabidopsis thal...   109   4e-24
AT1G01240.1 | Symbols: | unknown protein; INVOLVED IN: N-termin...   109   4e-24
>AT2G46550.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G01240.3); Has 72 Blast hits to 68 proteins in
13 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi
- 0; Plants - 71; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr2:19112264-19113457 REVERSE
LENGTH=397
Length = 397
Score = 148 bits (374), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 139/447 (31%), Positives = 197/447 (44%), Gaps = 84/447 (18%)
Query: 1 MAAAEVRAARQKTADHCVVQEDAKRAPKFACCQSSCATSKL------VDAGSASAADEP- 53
MAAAE RA Q+T + C VQEDAKRAPK CQSS +++ V S + EP
Sbjct: 1 MAAAEARAVWQRTVNRCFVQEDAKRAPKLTYCQSSSSSTASSTKQVEVSGSSPRVSVEPR 60
Query: 54 -DQACVNDTHVNQKSSFSNRIL-DSRWWLNLHPNCGFXXXXXX-------XXXXXXXXXX 104
+C + + +F + + ++R W HP+ F
Sbjct: 61 TQSSCAGFMPLPRNPNFPDLLPHNTRLWS--HPHHQFQVNKKQPLEDEVNNQGVSEKKSE 118
Query: 105 XXXXXXXXXTCKGENFQEGIKQKTCMKGMQDEMMEI-DSVGCSN---------QTLDFSL 154
+ E+FQE I E+ME +S G + L F
Sbjct: 119 LGAGEKQGKSFNSESFQEFI-----------ELMETRESYGSTGYDESSEKKLSELSFDP 167
Query: 155 ISDYSWIEGEKPHPWWQTTDRNELASFVSQKSLNNIENCDLPPPRKNYLGGQSNAYISDE 214
S ++ + EK PWW+TTD++ELAS V+Q+SL+ +ENCDLP P+K +Y
Sbjct: 168 SSPWNLLSSEKAAPWWRTTDKDELASLVAQRSLDYVENCDLPTPQK-----MKRSYYGSP 222
Query: 215 KINTIGFDWEAKSSVFSNLTDQAQGSLDSGFMQGNLRHSHFACDKSPSYTSTIHEDVTEQ 274
+ GFD + L D S+ ++G + S +C P +S
Sbjct: 223 R----GFDSDG-------LRDY---SVSGQTIKGTSKGS--SCKNRPEASS--------- 257
Query: 275 AFEGDQSIAQLMEALCHSQTRARAAEEVAKQAYAEKEHIFTQFLMQASQLLAYEQWFRLL 334
E D S ++L+EAL SQTRAR AE +AK+AYAEKEH+ L QA++L Y+QW +LL
Sbjct: 258 --ESDLSKSELLEALRRSQTRAREAENMAKEAYAEKEHLVKILLKQAAELFGYKQWLQLL 315
Query: 335 QLETDNTSQIKNK--DQQVSTKFPETLP-WILFEGRKLQKRKKLLVNAKKEMLGKLKSDR 391
QLE QIKNK D + + ++P W + RK +K + K +
Sbjct: 316 QLEALYL-QIKNKEIDNKNNDDPGVSIPCWSNGKARK---------EGRKRRSKRGKPNG 365
Query: 392 RTYXXXXXXXXXXXXXXXXXXWTVGWM 418
Y WTVGWM
Sbjct: 366 AKYAVGLALGMSLVGAGLLLGWTVGWM 392
>AT2G46550.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G01240.3). | chr2:19112264-19113037 REVERSE
LENGTH=257
Length = 257
Score = 122 bits (307), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 94/272 (34%), Positives = 132/272 (48%), Gaps = 45/272 (16%)
Query: 150 LDFSLISDYSWIEGEKPHPWWQTTDRNELASFVSQKSLNNIENCDLPPPRKNYLGGQSNA 209
L F S ++ + EK PWW+TTD++ELAS V+Q+SL+ +ENCDLP P+K +
Sbjct: 23 LSFDPSSPWNLLSSEKAAPWWRTTDKDELASLVAQRSLDYVENCDLPTPQK-----MKRS 77
Query: 210 YISDEKINTIGFDWEAKSSVFSNLTDQAQGSLDSGFMQGNLRHSHFACDKSPSYTSTIHE 269
Y + GFD + L D S+ ++G + S +C P
Sbjct: 78 YYGSPR----GFDSDG-------LRDY---SVSGQTIKGTSKGS--SCKNRP-------- 113
Query: 270 DVTEQAFEGDQSIAQLMEALCHSQTRARAAEEVAKQAYAEKEHIFTQFLMQASQLLAYEQ 329
E + E D S ++L+EAL SQTRAR AE +AK+AYAEKEH+ L QA++L Y+Q
Sbjct: 114 ---EASSESDLSKSELLEALRRSQTRAREAENMAKEAYAEKEHLVKILLKQAAELFGYKQ 170
Query: 330 WFRLLQLETDNTSQIKNK--DQQVSTKFPETLP-WILFEGRKLQKRKKLLVNAKKEMLGK 386
W +LLQLE QIKNK D + + ++P W + RK +K +
Sbjct: 171 WLQLLQLEALYL-QIKNKEIDNKNNDDPGVSIPCWSNGKARK---------EGRKRRSKR 220
Query: 387 LKSDRRTYXXXXXXXXXXXXXXXXXXWTVGWM 418
K + Y WTVGWM
Sbjct: 221 GKPNGAKYAVGLALGMSLVGAGLLLGWTVGWM 252
>AT1G01240.3 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G46550.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr1:100683-101678 FORWARD LENGTH=331
Length = 331
Score = 109 bits (272), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 131/423 (30%), Positives = 182/423 (43%), Gaps = 96/423 (22%)
Query: 1 MAAAEVRAARQKTADHC-VVQEDAKRAPKFACCQSSCATSKLVDAGSASAADEPDQACVN 59
M AAE RA Q+TA C VV EDAK AP+ ACCQ ++S + N
Sbjct: 1 MGAAEARALWQRTASRCFVVHEDAKMAPRLACCQHQQSSSGNTEK--------------N 46
Query: 60 DTHVNQKSSFSNRILDSRWWLNLHPNCGFXXXXXXXXXXXXXXXXXXXXXXXXXTCKGEN 119
S+ D++WWL + GF E
Sbjct: 47 SFSSGSFGDSSDFSCDTKWWLK--GSTGFD----------------------------EE 76
Query: 120 FQEGIKQKTCMKGMQDEMMEIDSVGCSNQTLDFSLISDYSWIEGEKPHPWWQ-TTDRNEL 178
+ T K + + +D +G + D+S IS + + PWW+ TTD++EL
Sbjct: 77 VTNSFLEDTKCKKLHEF---VDLIGIREEE-DYSFISK----KADATTPWWRSTTDKDEL 128
Query: 179 ASFVSQKSLN-NIENCDLPPPRKNYLGGQSNAYISDEKINTIGFDWEAKSSVFSNLTDQA 237
A V+ KS++ NI+NCDLPPP+K + S+ S EK GF KS
Sbjct: 129 ALMVATKSVDHNIQNCDLPPPQKLHKSIHSS---SGEK----GFKTAVKSP-------WK 174
Query: 238 QGSLDSGFMQGNLRHSHFACDKSPSYTSTIHEDVTEQAFEGDQSIAQLMEALCHSQTRAR 297
QG F + +L ++ K+ S S+ D D S QL+EAL HSQTRAR
Sbjct: 175 QGVWKDRF-ERSLSYNGSTESKNTSPMSSPRSD--------DLSKGQLLEALRHSQTRAR 225
Query: 298 AAEEVAKQAYAEKEHIFTQFLMQASQLLAYEQWFRLLQLETDNTSQIKNKDQQVSTKFPE 357
AE A++A AEK+ + T L QASQ+LAY+QW +LL++E K ++Q+ K
Sbjct: 226 EAERAAREACAEKDRVITILLKQASQMLAYKQWLKLLEMEALYLQMKKEEEQEEQVK--- 282
Query: 358 TLPWILFEGRKLQKRKKLLVNAKKEMLGKLKSDRRTYXXXXXXXXXXXXXXXXXXWTVGW 417
G L+KRK+ KK G+ Y WTVGW
Sbjct: 283 --------GMNLKKRKQRGEKKKKGETGR-------YMMAFALGFSLIGAGLLLGWTVGW 327
Query: 418 MFP 420
+ P
Sbjct: 328 LLP 330
>AT1G01240.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G46550.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr1:100683-101678 FORWARD LENGTH=331
Length = 331
Score = 109 bits (272), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 131/423 (30%), Positives = 182/423 (43%), Gaps = 96/423 (22%)
Query: 1 MAAAEVRAARQKTADHC-VVQEDAKRAPKFACCQSSCATSKLVDAGSASAADEPDQACVN 59
M AAE RA Q+TA C VV EDAK AP+ ACCQ ++S + N
Sbjct: 1 MGAAEARALWQRTASRCFVVHEDAKMAPRLACCQHQQSSSGNTEK--------------N 46
Query: 60 DTHVNQKSSFSNRILDSRWWLNLHPNCGFXXXXXXXXXXXXXXXXXXXXXXXXXTCKGEN 119
S+ D++WWL + GF E
Sbjct: 47 SFSSGSFGDSSDFSCDTKWWLK--GSTGFD----------------------------EE 76
Query: 120 FQEGIKQKTCMKGMQDEMMEIDSVGCSNQTLDFSLISDYSWIEGEKPHPWWQ-TTDRNEL 178
+ T K + + +D +G + D+S IS + + PWW+ TTD++EL
Sbjct: 77 VTNSFLEDTKCKKLHEF---VDLIGIREEE-DYSFISK----KADATTPWWRSTTDKDEL 128
Query: 179 ASFVSQKSLN-NIENCDLPPPRKNYLGGQSNAYISDEKINTIGFDWEAKSSVFSNLTDQA 237
A V+ KS++ NI+NCDLPPP+K + S+ S EK GF KS
Sbjct: 129 ALMVATKSVDHNIQNCDLPPPQKLHKSIHSS---SGEK----GFKTAVKSP-------WK 174
Query: 238 QGSLDSGFMQGNLRHSHFACDKSPSYTSTIHEDVTEQAFEGDQSIAQLMEALCHSQTRAR 297
QG F + +L ++ K+ S S+ D D S QL+EAL HSQTRAR
Sbjct: 175 QGVWKDRF-ERSLSYNGSTESKNTSPMSSPRSD--------DLSKGQLLEALRHSQTRAR 225
Query: 298 AAEEVAKQAYAEKEHIFTQFLMQASQLLAYEQWFRLLQLETDNTSQIKNKDQQVSTKFPE 357
AE A++A AEK+ + T L QASQ+LAY+QW +LL++E K ++Q+ K
Sbjct: 226 EAERAAREACAEKDRVITILLKQASQMLAYKQWLKLLEMEALYLQMKKEEEQEEQVK--- 282
Query: 358 TLPWILFEGRKLQKRKKLLVNAKKEMLGKLKSDRRTYXXXXXXXXXXXXXXXXXXWTVGW 417
G L+KRK+ KK G+ Y WTVGW
Sbjct: 283 --------GMNLKKRKQRGEKKKKGETGR-------YMMAFALGFSLIGAGLLLGWTVGW 327
Query: 418 MFP 420
+ P
Sbjct: 328 LLP 330
>AT1G01240.1 | Symbols: | unknown protein; INVOLVED IN: N-terminal
protein myristoylation; EXPRESSED IN: 17 plant
structures; EXPRESSED DURING: 11 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G46550.1); Has 95 Blast hits to 78 proteins in
16 species: Archae - 0; Bacteria - 2; Metazoa - 11;
Fungi - 0; Plants - 80; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:100683-101678 FORWARD
LENGTH=331
Length = 331
Score = 109 bits (272), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 131/423 (30%), Positives = 182/423 (43%), Gaps = 96/423 (22%)
Query: 1 MAAAEVRAARQKTADHC-VVQEDAKRAPKFACCQSSCATSKLVDAGSASAADEPDQACVN 59
M AAE RA Q+TA C VV EDAK AP+ ACCQ ++S + N
Sbjct: 1 MGAAEARALWQRTASRCFVVHEDAKMAPRLACCQHQQSSSGNTEK--------------N 46
Query: 60 DTHVNQKSSFSNRILDSRWWLNLHPNCGFXXXXXXXXXXXXXXXXXXXXXXXXXTCKGEN 119
S+ D++WWL + GF E
Sbjct: 47 SFSSGSFGDSSDFSCDTKWWLK--GSTGFD----------------------------EE 76
Query: 120 FQEGIKQKTCMKGMQDEMMEIDSVGCSNQTLDFSLISDYSWIEGEKPHPWWQ-TTDRNEL 178
+ T K + + +D +G + D+S IS + + PWW+ TTD++EL
Sbjct: 77 VTNSFLEDTKCKKLHEF---VDLIGIREEE-DYSFISK----KADATTPWWRSTTDKDEL 128
Query: 179 ASFVSQKSLN-NIENCDLPPPRKNYLGGQSNAYISDEKINTIGFDWEAKSSVFSNLTDQA 237
A V+ KS++ NI+NCDLPPP+K + S+ S EK GF KS
Sbjct: 129 ALMVATKSVDHNIQNCDLPPPQKLHKSIHSS---SGEK----GFKTAVKSP-------WK 174
Query: 238 QGSLDSGFMQGNLRHSHFACDKSPSYTSTIHEDVTEQAFEGDQSIAQLMEALCHSQTRAR 297
QG F + +L ++ K+ S S+ D D S QL+EAL HSQTRAR
Sbjct: 175 QGVWKDRF-ERSLSYNGSTESKNTSPMSSPRSD--------DLSKGQLLEALRHSQTRAR 225
Query: 298 AAEEVAKQAYAEKEHIFTQFLMQASQLLAYEQWFRLLQLETDNTSQIKNKDQQVSTKFPE 357
AE A++A AEK+ + T L QASQ+LAY+QW +LL++E K ++Q+ K
Sbjct: 226 EAERAAREACAEKDRVITILLKQASQMLAYKQWLKLLEMEALYLQMKKEEEQEEQVK--- 282
Query: 358 TLPWILFEGRKLQKRKKLLVNAKKEMLGKLKSDRRTYXXXXXXXXXXXXXXXXXXWTVGW 417
G L+KRK+ KK G+ Y WTVGW
Sbjct: 283 --------GMNLKKRKQRGEKKKKGETGR-------YMMAFALGFSLIGAGLLLGWTVGW 327
Query: 418 MFP 420
+ P
Sbjct: 328 LLP 330
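
The "Identities / Positives / Gaps" lines in the report above give each count as a fraction of the alignment length, followed by a whole-number percentage. The following is a minimal sketch, not part of the BLAST output, that parses those lines and recomputes the percentages. The dictionary keys, the helper names (parse_stats, truncated_percent) and the regular expression are illustrative assumptions. The statistics strings are copied from this report. The three AT1G01240 isoforms report identical statistics, so only one of them is listed. For the values in this report, the printed percentages agree with integer truncation of 100 * count / length rather than rounding to nearest (for example, 84/447 is about 18.8% and is printed as 18%).

```python
import re

# Statistics lines copied verbatim from the report; the hit IDs are used
# here only as dictionary keys for labeling.
STATS = {
    "AT2G46550.1": "Identities = 139/447 (31%), Positives = 197/447 (44%), Gaps = 84/447 (18%)",
    "AT2G46550.2": "Identities = 94/272 (34%), Positives = 132/272 (48%), Gaps = 45/272 (16%)",
    "AT1G01240.1": "Identities = 131/423 (30%), Positives = 182/423 (43%), Gaps = 96/423 (22%)",
}

# One match per field: name, count, alignment length, reported percent.
FIELD = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(pct))
        for name, count, length, pct in FIELD.findall(line)
    }


def truncated_percent(count, length):
    """Whole-number percentage obtained by integer truncation."""
    return 100 * count // length


if __name__ == "__main__":
    for hit, line in STATS.items():
        for name, (count, length, reported) in parse_stats(line).items():
            exact = 100.0 * count / length
            print(f"{hit} {name}: {count}/{length} = {exact:.2f}% "
                  f"(reported {reported}%)")
```

Because the alignment length includes gap columns, the identity percentage is relative to the aligned region (447 columns for the top hit), not to the 422-letter query or the 397-residue subject.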