Miyakogusa Predicted Gene
- Lj5g3v1531230.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1531230.2 Non Characterized Hit- tr|I1NME4|I1NME4_ORYGL
Uncharacterized protein OS=Oryza glaberrima PE=4
SV=1,36.24,1e-16,coiled-coil,NULL; seg,NULL,CUFF.55430.2
(348 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
AT5G26770.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 395 e-110
AT5G26770.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 395 e-110
AT1G09470.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 350 1e-96
AT5G26770.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 347 9e-96
AT1G09470.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 333 1e-91
AT3G05830.1 | Symbols: | Encodes alpha-helical IF (intermediate... 330 1e-90
AT3G05830.2 | Symbols: | Encodes alpha-helical IF (intermediate... 322 3e-88
AT1G09483.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 115 4e-26
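The summary rows above can be read programmatically. The following is a minimal sketch, assuming each row has the shape "identifier | description ... bit-score E-value" as printed in this report; `HIT_ROW`, `parse_evalue`, and `parse_hit` are hypothetical helper names and are not part of BLAST. Note that this BLAST version abbreviates 1e-110 as "e-110", which Python's `float()` does not accept without a leading digit.

```python
import re

# Hypothetical pattern for the summary rows as they appear in this report:
# identifier, " | ", free-text description, bit score, E-value.
HIT_ROW = re.compile(r"^(\S+)\s+\|.*\s(\d+)\s+(\S+)$")


def parse_evalue(text):
    """Convert an E-value string to float, accepting the 'e-110' shorthand."""
    # Legacy BLAST output drops the leading '1' for values such as 1e-110.
    return float("1" + text if text.startswith("e") else text)


def parse_hit(row):
    """Return (identifier, bit_score, e_value), or None if the row does not match."""
    match = HIT_ROW.match(row.strip())
    if match is None:
        return None
    identifier, bits, evalue = match.groups()
    return identifier, int(bits), parse_evalue(evalue)


rows = [
    "AT5G26770.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 395 e-110",
    "AT1G09483.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 115 4e-26",
]
hits = [parse_hit(row) for row in rows]
```

The regular expression relies on the score and E-value being the last two whitespace-separated fields, so it would need adjusting for other output formats.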
>AT5G26770.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G05830.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:9407981-9409735 REVERSE LENGTH=335
Length = 335
Score = 395 bits (1015), Expect = e-110, Method: Compositional matrix adjust.
Identities = 203/329 (61%), Positives = 251/329 (76%), Gaps = 2/329 (0%)
Query: 20 VDPXXXXXXXXXQNFRRNVVSLAAELKELRGRLAAQEKSYAIETLTRQEAETNAKSLELE 79
VDP ++FRRNVVS+AAELK++RGRL +QE+ + E+ R+EAE AK++E+E
Sbjct: 9 VDPLLKDLDGKKESFRRNVVSMAAELKQVRGRLVSQEQFFVKESFCRKEAEKKAKNMEME 68
Query: 80 IGRLQKNLDERNEQLQASASSAEKYLMELDDLKTQLVXXXXXXXXXXXXXXXXQLHCVEL 139
I +LQK L++RN +L AS S+AEK+L E+DDL++QL QL C L
Sbjct: 69 ICKLQKKLEDRNCELVASTSAAEKFLEEVDDLRSQLALTKDIAETSAASAQSAQLQCSVL 128
Query: 140 VKELDEKNSSLREHEDRVTRLAEQLENLQKDLQARESSQKHLKDEVFRIEHDIMEALTKA 199
++LD+K SLREHEDRVT L QL+NLQ+DL+ RE SQK L++EV RIE +I EA+ K+
Sbjct: 129 TEQLDDKTRSLREHEDRVTHLGHQLDNLQRDLKTRECSQKQLREEVMRIEREITEAVAKS 188
Query: 200 GDNKDRELRKILDEVSPRNFEKMNKLLVVKDEEIVRMKDEIKIMSAHWKLKTKELESQLE 259
G + ELRK+L+EVSP+NFE+MN LL VKDEEI ++KD++K+MSAHWKLKTKELESQLE
Sbjct: 189 GKGTECELRKLLEEVSPKNFERMNMLLAVKDEEIAKLKDDVKLMSAHWKLKTKELESQLE 248
Query: 260 KQRRADQELKKKVLKLEFCLQETRSQTRKLQRMGERRDKAIKELRDQLAAKRKRGVAAEE 319
+QRRADQELKKKVLKLEFCLQE RSQTRKLQR GERRDKAIKEL DQ+ K+ + E
Sbjct: 249 RQRRADQELKKKVLKLEFCLQEARSQTRKLQRAGERRDKAIKELSDQITGKQLNESVSGE 308
Query: 320 KQQQNFWDTSGFKIVVSMSMLVLVAFSKR 348
K QNFWDTSGFKIVVSMSML+LV SKR
Sbjct: 309 K--QNFWDTSGFKIVVSMSMLILVIISKR 335
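The "Identities / Positives / Gaps" line of the alignment above can be checked with simple arithmetic. The sketch below assumes the line format shown in this report; `STAT`, `parse_stats`, and `floor_percent` are hypothetical names introduced for illustration. As far as this report shows, the printed percentages are rounded down rather than to the nearest integer (203/329 is about 61.7%, printed as 61%).

```python
import re

# Matches e.g. "Identities = 203/329 (61%)"; the same shape is used for
# Positives and Gaps in this report.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, alignment_length, printed_percent)}."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT.findall(line)
    }


def floor_percent(count, length):
    """Integer percentage rounded down, matching the values printed here."""
    return 100 * count // length


line = "Identities = 203/329 (61%), Positives = 251/329 (76%), Gaps = 2/329 (0%)"
stats = parse_stats(line)

# True when every printed percentage equals the floored ratio.
consistent = all(floor_percent(c, n) == p for c, n, p in stats.values())
```

The denominator is the alignment length in columns (329 here), which counts gap positions, so it can differ from the length of either sequence.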
>AT5G26770.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G05830.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr5:9407981-9409735 REVERSE LENGTH=335
Length = 335
Score = 395 bits (1015), Expect = e-110, Method: Compositional matrix adjust.
Identities = 203/329 (61%), Positives = 251/329 (76%), Gaps = 2/329 (0%)
Query: 20 VDPXXXXXXXXXQNFRRNVVSLAAELKELRGRLAAQEKSYAIETLTRQEAETNAKSLELE 79
VDP ++FRRNVVS+AAELK++RGRL +QE+ + E+ R+EAE AK++E+E
Sbjct: 9 VDPLLKDLDGKKESFRRNVVSMAAELKQVRGRLVSQEQFFVKESFCRKEAEKKAKNMEME 68
Query: 80 IGRLQKNLDERNEQLQASASSAEKYLMELDDLKTQLVXXXXXXXXXXXXXXXXQLHCVEL 139
I +LQK L++RN +L AS S+AEK+L E+DDL++QL QL C L
Sbjct: 69 ICKLQKKLEDRNCELVASTSAAEKFLEEVDDLRSQLALTKDIAETSAASAQSAQLQCSVL 128
Query: 140 VKELDEKNSSLREHEDRVTRLAEQLENLQKDLQARESSQKHLKDEVFRIEHDIMEALTKA 199
++LD+K SLREHEDRVT L QL+NLQ+DL+ RE SQK L++EV RIE +I EA+ K+
Sbjct: 129 TEQLDDKTRSLREHEDRVTHLGHQLDNLQRDLKTRECSQKQLREEVMRIEREITEAVAKS 188
Query: 200 GDNKDRELRKILDEVSPRNFEKMNKLLVVKDEEIVRMKDEIKIMSAHWKLKTKELESQLE 259
G + ELRK+L+EVSP+NFE+MN LL VKDEEI ++KD++K+MSAHWKLKTKELESQLE
Sbjct: 189 GKGTECELRKLLEEVSPKNFERMNMLLAVKDEEIAKLKDDVKLMSAHWKLKTKELESQLE 248
Query: 260 KQRRADQELKKKVLKLEFCLQETRSQTRKLQRMGERRDKAIKELRDQLAAKRKRGVAAEE 319
+QRRADQELKKKVLKLEFCLQE RSQTRKLQR GERRDKAIKEL DQ+ K+ + E
Sbjct: 249 RQRRADQELKKKVLKLEFCLQEARSQTRKLQRAGERRDKAIKELSDQITGKQLNESVSGE 308
Query: 320 KQQQNFWDTSGFKIVVSMSMLVLVAFSKR 348
K QNFWDTSGFKIVVSMSML+LV SKR
Sbjct: 309 K--QNFWDTSGFKIVVSMSMLILVIISKR 335
>AT1G09470.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; EXPRESSED IN: cotyledon;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT5G26770.1); Has 55019 Blast hits to
30094 proteins in 2088 species: Archae - 730; Bacteria -
6553; Metazoa - 28961; Fungi - 4800; Plants - 2559;
Viruses - 111; Other Eukaryotes - 11305 (source: NCBI
BLink). | chr1:3055391-3056931 REVERSE LENGTH=336
Length = 336
Score = 350 bits (897), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 185/334 (55%), Positives = 241/334 (72%), Gaps = 2/334 (0%)
Query: 15 VPARDVDPXXXXXXXXXQNFRRNVVSLAAELKELRGRLAAQEKSYAIETLTRQEAETNAK 74
V R+ DP Q+FRRNVVSLA ELKE R RLA QE+S + E ++RQEAET K
Sbjct: 5 VSLREDDPLLKDLSEKKQSFRRNVVSLATELKEARTRLAEQERSCSKEAMSRQEAETRVK 64
Query: 75 SLELEIGRLQKNLDERNEQLQASASSAEKYLMELDDLKTQLVXXXXXXXXXXXXXXXXQL 134
+E E+ L K L+E+ EQ++AS + EK++ EL D+K+QL
Sbjct: 65 RMEDEMHELAKELNEKVEQIRASDVATEKFVKELADIKSQLAATHATAEASALSAESAHS 124
Query: 135 HCVELVKELDEKNSSLREHEDRVTRLAEQLENLQKDLQARESSQKHLKDEVFRIEHDIME 194
HC L K+L E+ SL+EHED+VTRL EQLENL+K+L+ RESSQK L+DE+ ++E DIM
Sbjct: 125 HCRVLSKQLHERTGSLKEHEDQVTRLGEQLENLRKELRVRESSQKQLRDELLKVEGDIMR 184
Query: 195 ALTKAGDNKDRELRKILDEVSPRNFEKMNKLLVVKDEEIVRMKDEIKIMSAHWKLKTKEL 254
A++ ++ E+R +L+E +P+N E++NKLL KD+EI R++DE+KI+SAHW+ KTKEL
Sbjct: 185 AVSVVKTKENSEVRNMLNEDTPKNSERINKLLTAKDDEIARLRDELKIISAHWRFKTKEL 244
Query: 255 ESQLEKQRRADQELKKKVLKLEFCLQETRSQTRKLQRMGERRDKAIKELRDQLAAKRKRG 314
E Q+E QRR DQELKKKVLKLEFCL+ETR QTRKLQ+MGER D AI+EL++QLAAK++
Sbjct: 245 EDQVENQRRIDQELKKKVLKLEFCLRETRIQTRKLQKMGERNDVAIQELKEQLAAKKQH- 303
Query: 315 VAAEEKQQQNFWDTSGFKIVVSMSMLVLVAFSKR 348
A+ QN WD SGFKIVVSMSML+LVAFS+R
Sbjct: 304 -EADHSSNQNLWDKSGFKIVVSMSMLILVAFSRR 336
>AT5G26770.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 16 plant
structures; EXPRESSED DURING: 8 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G05830.1); Has 26484 Blast hits to 16065
proteins in 1382 species: Archae - 343; Bacteria - 2653;
Metazoa - 15273; Fungi - 2108; Plants - 1148; Viruses -
36; Other Eukaryotes - 4923 (source: NCBI BLink). |
chr5:9407981-9409735 REVERSE LENGTH=315
Length = 315
Score = 347 bits (889), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 184/329 (55%), Positives = 232/329 (70%), Gaps = 22/329 (6%)
Query: 20 VDPXXXXXXXXXQNFRRNVVSLAAELKELRGRLAAQEKSYAIETLTRQEAETNAKSLELE 79
VDP ++FRRNVVS+AAELK++RGRL +QE+ + E+ R+EAE AK++E+E
Sbjct: 9 VDPLLKDLDGKKESFRRNVVSMAAELKQVRGRLVSQEQFFVKESFCRKEAEKKAKNMEME 68
Query: 80 IGRLQKNLDERNEQLQASASSAEKYLMELDDLKTQLVXXXXXXXXXXXXXXXXQLHCVEL 139
I +LQK L++RN +L AS S+AEK+L E+DDL++QL QL C L
Sbjct: 69 ICKLQKKLEDRNCELVASTSAAEKFLEEVDDLRSQLALTKDIAETSAASAQSAQLQCSVL 128
Query: 140 VKELDEKNSSLREHEDRVTRLAEQLENLQKDLQARESSQKHLKDEVFRIEHDIMEALTKA 199
++LD+K SLREHEDRVT L QL+NLQ+DL+ RE SQK L++EV RIE +I EA+ K+
Sbjct: 129 TEQLDDKTRSLREHEDRVTHLGHQLDNLQRDLKTRECSQKQLREEVMRIEREITEAVAKS 188
Query: 200 GDNKDRELRKILDEVSPRNFEKMNKLLVVKDEEIVRMKDEIKIMSAHWKLKTKELESQLE 259
G + ELRK+L+EVSP+NFE+MN LL VKDEEI ++KD++K+MSAHWKLKTKELESQLE
Sbjct: 189 GKGTECELRKLLEEVSPKNFERMNMLLAVKDEEIAKLKDDVKLMSAHWKLKTKELESQLE 248
Query: 260 KQRRADQELKKKVLKLEFCLQETRSQTRKLQRMGERRDKAIKELRDQLAAKRKRGVAAEE 319
+QRRADQELKKK GERRDKAIKEL DQ+ K+ + E
Sbjct: 249 RQRRADQELKKKA--------------------GERRDKAIKELSDQITGKQLNESVSGE 288
Query: 320 KQQQNFWDTSGFKIVVSMSMLVLVAFSKR 348
K QNFWDTSGFKIVVSMSML+LV SKR
Sbjct: 289 K--QNFWDTSGFKIVVSMSMLILVIISKR 315
>AT1G09470.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G26770.1). |
chr1:3055391-3056931 REVERSE LENGTH=335
Length = 335
Score = 333 bits (853), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 181/334 (54%), Positives = 236/334 (70%), Gaps = 3/334 (0%)
Query: 15 VPARDVDPXXXXXXXXXQNFRRNVVSLAAELKELRGRLAAQEKSYAIETLTRQEAETNAK 74
V R+ DP Q+FRRNVVSLA ELKE R RLA QE+S + E ++RQEAET K
Sbjct: 5 VSLREDDPLLKDLSEKKQSFRRNVVSLATELKEARTRLAEQERSCSKEAMSRQEAETRVK 64
Query: 75 SLELEIGRLQKNLDERNEQLQASASSAEKYLMELDDLKTQLVXXXXXXXXXXXXXXXXQL 134
+E E+ L K L+E+ EQ++AS + EK++ EL D+K+QL
Sbjct: 65 RMEDEMHELAKELNEKVEQIRASDVATEKFVKELADIKSQLAATHATAEASALSAESAHS 124
Query: 135 HCVELVKELDEKNSSLREHEDRVTRLAEQLENLQKDLQARESSQKHLKDEVFRIEHDIME 194
HC L K+L E+ SL+EHED+VTRL EQLENL+K+L+ RESSQK L+DE+ ++E DIM
Sbjct: 125 HCRVLSKQLHERTGSLKEHEDQVTRLGEQLENLRKELRVRESSQKQLRDELLKVEGDIMR 184
Query: 195 ALTKAGDNKDRELRKILDEVSPRNFEKMNKLLVVKDEEIVRMKDEIKIMSAHWKLKTKEL 254
A++ ++ E+R +L+E +P+N E++NKLL KD+EI R++DE+KI+SAHW K L
Sbjct: 185 AVSVVKTKENSEVRNMLNEDTPKNSERINKLLTAKDDEIARLRDELKIISAHWS-KAFVL 243
Query: 255 ESQLEKQRRADQELKKKVLKLEFCLQETRSQTRKLQRMGERRDKAIKELRDQLAAKRKRG 314
Q+E QRR DQELKKKVLKLEFCL+ETR QTRKLQ+MGER D AI+EL++QLAAK++
Sbjct: 244 LDQVENQRRIDQELKKKVLKLEFCLRETRIQTRKLQKMGERNDVAIQELKEQLAAKKQH- 302
Query: 315 VAAEEKQQQNFWDTSGFKIVVSMSMLVLVAFSKR 348
A+ QN WD SGFKIVVSMSML+LVAFS+R
Sbjct: 303 -EADHSSNQNLWDKSGFKIVVSMSMLILVAFSRR 335
>AT3G05830.1 | Symbols: | Encodes alpha-helical IF (intermediate
filament)-like protein. | chr3:1736796-1738565 FORWARD
LENGTH=336
Length = 336
Score = 330 bits (846), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 184/330 (55%), Positives = 242/330 (73%), Gaps = 3/330 (0%)
Query: 20 VDPXXXXXXXXXQNFRRNVVSLAAELKELRGRLAAQEKSYAIETLTRQEAETNAKSLELE 79
VDP ++FRRNVVSLA ELK++RGRL +QE+S+ ET+TR+EAE K++E+E
Sbjct: 9 VDPLLRDLDEKKESFRRNVVSLATELKQVRGRLVSQEQSFLKETITRKEAEKRGKNMEME 68
Query: 80 IGRLQKNLDERNEQLQASASSAEKYLMELDDLKTQLVXXXXXXXXXXXXXXXXQLHCVEL 139
I +LQK L+ERN QL+ASAS+A+K++ EL++ + +L ++ C L
Sbjct: 69 ICKLQKRLEERNCQLEASASAADKFIKELEEFRLKLDTTKQTAEASADSAQSTKIQCSML 128
Query: 140 VKELDEKNSSLREHEDRVTRLAEQLENLQKDLQARESSQKHLKDEVFRIEHDIMEALTKA 199
++LD+K SLRE EDR+T+L QL++LQ+ L RE S+K L++EV RIE ++ EA+ KA
Sbjct: 129 KQQLDDKTRSLREQEDRMTQLGHQLDDLQRGLSLRECSEKQLREEVRRIEREVTEAIAKA 188
Query: 200 G-DNKDRELRKILDEVSPRNFEKMNKLLVVKDEEIVRMKDEIKIMSAHWKLKTKELESQL 258
G D EL+K+L++VSP FE+MN+L+ VKDEEI ++KDEI++MS WK KTKELESQL
Sbjct: 189 GIGGMDSELQKLLEDVSPMKFERMNRLVEVKDEEITKLKDEIRLMSGQWKHKTKELESQL 248
Query: 259 EKQRRADQELKKKVLKLEFCLQETRSQTRKLQRMGERRDKAIKELRDQLAAKRKRGVAAE 318
EKQRR DQ+LKKKVLKLEFCLQE RSQTRKLQR GERRD IKE+RD ++ K+ + E
Sbjct: 249 EKQRRTDQDLKKKVLKLEFCLQEARSQTRKLQRKGERRDMEIKEIRDLISE--KQNLNNE 306
Query: 319 EKQQQNFWDTSGFKIVVSMSMLVLVAFSKR 348
+Q FWD SGFKIVVSMSML+LV SKR
Sbjct: 307 SWDKQKFWDNSGFKIVVSMSMLMLVVVSKR 336
>AT3G05830.2 | Symbols: | Encodes alpha-helical IF (intermediate
filament)-like protein. | chr3:1736796-1738565 FORWARD
LENGTH=349
Length = 349
Score = 322 bits (824), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 184/343 (53%), Positives = 242/343 (70%), Gaps = 16/343 (4%)
Query: 20 VDPXXXXXXXXXQNFRRNVVSLAAELKELRGRLAAQEKSYAIETLTRQEAETNAKSLELE 79
VDP ++FRRNVVSLA ELK++RGRL +QE+S+ ET+TR+EAE K++E+E
Sbjct: 9 VDPLLRDLDEKKESFRRNVVSLATELKQVRGRLVSQEQSFLKETITRKEAEKRGKNMEME 68
Query: 80 IGRLQKNLDERNEQLQASASSAEKYLMELDDLKTQLVXXXXXXXXXXXXXXXXQLHCVEL 139
I +LQK L+ERN QL+ASAS+A+K++ EL++ + +L ++ C L
Sbjct: 69 ICKLQKRLEERNCQLEASASAADKFIKELEEFRLKLDTTKQTAEASADSAQSTKIQCSML 128
Query: 140 VKELDEKNSSLREHEDRVTRLAEQLENLQKDLQARESSQKHLKDEVFRIEHDIMEALTKA 199
++LD+K SLRE EDR+T+L QL++LQ+ L RE S+K L++EV RIE ++ EA+ KA
Sbjct: 129 KQQLDDKTRSLREQEDRMTQLGHQLDDLQRGLSLRECSEKQLREEVRRIEREVTEAIAKA 188
Query: 200 G-DNKDRELRKILDEVSPRNFEKMNKLLVVKDEEIVRMKDEIKIMSAHWKLKTKELESQL 258
G D EL+K+L++VSP FE+MN+L+ VKDEEI ++KDEI++MS WK KTKELESQL
Sbjct: 189 GIGGMDSELQKLLEDVSPMKFERMNRLVEVKDEEITKLKDEIRLMSGQWKHKTKELESQL 248
Query: 259 EKQRRADQELKKKVLKLEFCLQETRSQTRKLQRM-------------GERRDKAIKELRD 305
EKQRR DQ+LKKKVLKLEFCLQE RSQTRKLQR GERRD IKE+RD
Sbjct: 249 EKQRRTDQDLKKKVLKLEFCLQEARSQTRKLQRFYCCCCFVMNGAQKGERRDMEIKEIRD 308
Query: 306 QLAAKRKRGVAAEEKQQQNFWDTSGFKIVVSMSMLVLVAFSKR 348
++ K+ + E +Q FWD SGFKIVVSMSML+LV SKR
Sbjct: 309 LIS--EKQNLNNESWDKQKFWDNSGFKIVVSMSMLMLVVVSKR 349
>AT1G09483.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G09470.1); Has 48 Blast hits to 48 proteins in
9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 48; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr1:3060992-3061729 REVERSE
LENGTH=112
Length = 112
Score = 115 bits (288), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 64/106 (60%), Positives = 77/106 (72%), Gaps = 7/106 (6%)
Query: 243 MSAHWKLKTKELESQLEKQRRADQELKKKVLKLEFCLQETRSQTRKLQRMGERRDKAIKE 302
MSAHW KTKELE Q+E QRR DQELKKKVLKLEFCL+ETR QTRKLQ+MGER D AI+E
Sbjct: 1 MSAHWTFKTKELEDQVENQRRIDQELKKKVLKLEFCLRETRIQTRKLQKMGERNDMAIQE 60
Query: 303 -LRDQLAAKRKRGVAAEEKQQQNFWDTSGFKIVVSMSMLVLVAFSK 347
L +QLAAK++ A+ QN WD S S+ ++V ++F K
Sbjct: 61 VLNEQLAAKKQH--EADLSSNQNLWDKSA----SSVPLVVFMSFYK 100