Miyakogusa Predicted Gene
- Lj0g3v0072019.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0072019.1 Non Chatacterized Hit- tr|D8SJ29|D8SJ29_SELML
Putative uncharacterized protein OS=Selaginella
moelle,61.73,1e-17,Histone chaperone domain CHZ,Histone chaperone
domain CHZ; seg,NULL; CHZ,Histone chaperone domain CH,CUFF.3543.1
(486 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G08310.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 265 6e-71
AT1G44780.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Histone ch... 213 2e-55
AT1G44780.2 | Symbols: | INVOLVED IN: biological_process unknow... 212 4e-55
>AT4G08310.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 25 plant
structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Histone chaperone domain CHZ
(InterPro:IPR019098); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT1G44780.2); Has 53711
Blast hits to 33687 proteins in 1618 species: Archae -
142; Bacteria - 4400; Metazoa - 24303; Fungi - 6688;
Plants - 2484; Viruses - 449; Other Eukaryotes - 15245
(source: NCBI BLink). | chr4:5249047-5252139 REVERSE
LENGTH=504
Length = 504
Score = 265 bits (676), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 175/420 (41%), Positives = 229/420 (54%), Gaps = 38/420 (9%)
Query: 17 NDVESQIQTAMSSRVPYFKEQSDSLTFEGVRRVLEKDLGLEEFALDVHKRFIKQCLIKCL 76
D+ESQI AM SRV Y ++++D+ TFEGVRR+LE+DL LE+ ALDVHK F+KQ L++CL
Sbjct: 32 TDIESQILAAMQSRVTYLRDKADNFTFEGVRRLLEEDLKLEKHALDVHKSFVKQHLVQCL 91
Query: 77 EGVGEEDGPKMPGEEAAEKGAST--RESVRP--------KEEGQSEDVKDLCPEDEEKME 126
G +E +E T ++ V P KE +D K+ D+EK +
Sbjct: 92 AGA--------ENDETSENSLETEKKDDVTPVKEAAELSKEHTTKKDGKEDMTGDDEKTK 143
Query: 127 DSPVLGLLKEQKGVKVETFEQEYNGKKIVLSEAQIKKAVRKRSSYIKSNAEEITMAGLRR 186
DSPV+GLL E+ K EQ + K VL ++ IKKA+RKRSSYIK+N+E+ITM LRR
Sbjct: 144 DSPVMGLLTEENTSK-SVAEQTKDEDKEVL-QSDIKKALRKRSSYIKANSEKITMGLLRR 201
Query: 187 LLEADLKLEPYTLDPFKKFISQQLDEVLASSEVLXXXXXXXXXXXXXXXXXXXXXXXSEE 246
LLE DLKLE Y+LDP+KKFI+ +LDE+L + E +
Sbjct: 202 LLEQDLKLEKYSLDPYKKFINGELDEILQAHEATQSSTKAQRKPVSKKVKSTPA-----K 256
Query: 247 NSDTSD--KVSXXXXXXXXXVKPRKKIVPKGKTQTSLGPKKRKGEETNXXXXXXXXXXXX 304
NSD+ + V +KK+ K K S G KRK E+
Sbjct: 257 NSDSEEMFDSDGEDEEEDKEVAVKKKMAEKRKLSKSEGTGKRKREKEKPASAKKTKQT-- 314
Query: 305 XXEDNSDTEDNEKNSEDDEPHSSPEKLTKKKEVPTPVYSKRVERLKSVIKECGMSVPPVI 364
D++ +S+ E S EK KK E PT Y KRVE LKS+IK CGMS+ P +
Sbjct: 315 ---------DSQSDSDAGEKAPSSEKSVKKPETPTTGYGKRVEHLKSIIKSCGMSISPSV 365
Query: 365 YKKVKQVPENKREGQLIKELEEILSREGLSSNPSXXXXXXXXXXXXXXXXXXGIDMSNIV 424
Y+K KQ PE KRE LIKEL+E+L++EGLS+NPS GID SNIV
Sbjct: 366 YRKAKQAPEEKREEILIKELKELLAKEGLSANPSEKEIKEVKKRKERTKELEGIDTSNIV 425
>AT1G44780.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Histone
chaperone domain CHZ (InterPro:IPR019098); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G08310.1); Has 18105 Blast hits to 11200
proteins in 808 species: Archae - 37; Bacteria - 1195;
Metazoa - 7724; Fungi - 1727; Plants - 674; Viruses -
183; Other Eukaryotes - 6565 (source: NCBI BLink). |
chr1:16909753-16912060 FORWARD LENGTH=463
Length = 463
Score = 213 bits (542), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 159/409 (38%), Positives = 208/409 (50%), Gaps = 37/409 (9%)
Query: 18 DVESQIQTAMSSRVPYFKEQSDSLTFEGVRRVLEKDLGLEEFALDVHKRFIKQCLIKCLE 77
++E +I A+ SRV Y + ++D T VRR+LE+D+GLE+ LDV+K F+K+ L+KCLE
Sbjct: 22 EIEFKILAALRSRVTYLRNEADCFTLVSVRRMLEEDIGLEKCDLDVYKSFVKEHLVKCLE 81
Query: 78 GVGEEDGPKMPGE-EAAEKGASTRESVRPKEEGQSEDVKDLCPEDEEKMEDSPVLGLLKE 136
G D + E E + T+E EE E + D E+ K E V G
Sbjct: 82 EAGNNDTSENSQETEREDDEIPTKEVAEQSEE--HEPMNDAGEENTSKREAKDVKG---- 135
Query: 137 QKGVKVETFEQEYNGKKIVLSEAQIKKAVRKRSSYIKSNAEEITMAGLRRLLEADLKLEP 196
KG K ET +++ IK+A+RKR+SYIK+N+E ITMA LRRLLE DLKLE
Sbjct: 136 -KGNK-ETLQRD------------IKRALRKRASYIKANSETITMASLRRLLEEDLKLEK 181
Query: 197 YTLDPFKKFISQQLDEVLASSEVLXXXXXXXXXXXXXXXXXXXXXXXSEENSDTSDKVSX 256
+LD FKKFI+++LDEVL + S E + SD
Sbjct: 182 ESLDLFKKFINKELDEVLQLPDAPKCSTESIVKNVKKKVKSTPSKMVSSEYNSDSD---T 238
Query: 257 XXXXXXXXVKPRKKIVPKGKTQTSLGPKKRKGEETNXXX-XXXXXXXXXXXEDNSDTEDN 315
V +K + K K KRK E E++SD+ D+
Sbjct: 239 EGNVDNEEVAVKKTMARKVKLSKPEMMGKRKSENGKQVSGRKKAKHTEIDSENDSDSGDS 298
Query: 316 EKNSEDDEPHSSPEKLTKKKEVPTPVYSKRVERLKSVIKECGMSVPPVIYKKVKQVPENK 375
EK+ L + KE T VY KRVE LKSVIK CGMSVPP IYKK KQ P+ K
Sbjct: 299 EKS------------LKQTKETATDVYGKRVEHLKSVIKSCGMSVPPNIYKKAKQAPQEK 346
Query: 376 REGQLIKELEEILSREGLSSNPSXXXXXXXXXXXXXXXXXXGIDMSNIV 424
RE LI+ELE+IL++EGLSS+PS GID +NIV
Sbjct: 347 REAMLIEELEQILAKEGLSSDPSALEIKEVKKRKNISRELEGIDTNNIV 395
>AT1G44780.2 | Symbols: | INVOLVED IN: biological_process unknown;
LOCATED IN: cellular_component unknown; EXPRESSED IN: 20
plant structures; EXPRESSED DURING: 9 growth stages;
CONTAINS InterPro DOMAIN/s: Histone chaperone domain CHZ
(InterPro:IPR019098); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT4G08310.1); Has 35333
Blast hits to 34131 proteins in 2444 species: Archae -
798; Bacteria - 22429; Metazoa - 974; Fungi - 991;
Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr1:16909753-16912060 FORWARD
LENGTH=462
Length = 462
Score = 212 bits (540), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 159/409 (38%), Positives = 207/409 (50%), Gaps = 38/409 (9%)
Query: 18 DVESQIQTAMSSRVPYFKEQSDSLTFEGVRRVLEKDLGLEEFALDVHKRFIKQCLIKCLE 77
++E +I A+ SRV Y + ++D T VRR+LE+D+GLE+ LDV+K F+K+ L+KCLE
Sbjct: 22 EIEFKILAALRSRVTYLRNEADCFTLVSVRRMLEEDIGLEKCDLDVYKSFVKEHLVKCLE 81
Query: 78 GVGEEDGPKMPGE-EAAEKGASTRESVRPKEEGQSEDVKDLCPEDEEKMEDSPVLGLLKE 136
G D + E E + T+E EE E + D E+ K E V G
Sbjct: 82 EAGNNDTSENSQETEREDDEIPTKEVAEQSEE--HEPMNDAGEENTSKREAKDVKG---- 135
Query: 137 QKGVKVETFEQEYNGKKIVLSEAQIKKAVRKRSSYIKSNAEEITMAGLRRLLEADLKLEP 196
KG K ET +++ IK+A+RKR+SYIK+N+E ITMA LRRLLE DLKLE
Sbjct: 136 -KGNK-ETLQRD------------IKRALRKRASYIKANSETITMASLRRLLEEDLKLEK 181
Query: 197 YTLDPFKKFISQQLDEVLASSEVLXXXXXXXXXXXXXXXXXXXXXXXSEENSDTSDKVSX 256
+LD FKKFI+++LDEVL + S E + SD
Sbjct: 182 ESLDLFKKFINKELDEVLQLPDAPKCSTESIVKNVKKKVKSTPSKMVSSEYNSDSD---T 238
Query: 257 XXXXXXXXVKPRKKIVPKGKTQTSLGPKKRKGEETNXXX-XXXXXXXXXXXEDNSDTEDN 315
V +K + K K KRK E E++SD+ D+
Sbjct: 239 EGNVDNEEVAVKKTMARKVKLSKPEMMGKRKSENGKQVSGRKKAKHTEIDSENDSDSGDS 298
Query: 316 EKNSEDDEPHSSPEKLTKKKEVPTPVYSKRVERLKSVIKECGMSVPPVIYKKVKQVPENK 375
EK+ K KE T VY KRVE LKSVIK CGMSVPP IYKK KQ P+ K
Sbjct: 299 EKS-------------LKTKETATDVYGKRVEHLKSVIKSCGMSVPPNIYKKAKQAPQEK 345
Query: 376 REGQLIKELEEILSREGLSSNPSXXXXXXXXXXXXXXXXXXGIDMSNIV 424
RE LI+ELE+IL++EGLSS+PS GID +NIV
Sbjct: 346 REAMLIEELEQILAKEGLSSDPSALEIKEVKKRKNISRELEGIDTNNIV 394