Miyakogusa Predicted Gene
- Lj0g3v0258179.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0258179.1 Non Chatacterized Hit- tr|I1NCX2|I1NCX2_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,77.02,0,seg,NULL;
DUF241,Protein of unknown function DUF241, plant;
coiled-coil,NULL,CUFF.16969.1
(248 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G76210.1 | Symbols: | Arabidopsis protein of unknown functio... 141 4e-34
AT1G20520.1 | Symbols: | Arabidopsis protein of unknown functio... 119 3e-27
AT4G35690.1 | Symbols: | Arabidopsis protein of unknown functio... 76 2e-14
AT4G35660.1 | Symbols: | Arabidopsis protein of unknown functio... 65 4e-11
AT2G17070.1 | Symbols: | Arabidopsis protein of unknown functio... 62 2e-10
AT2G17080.1 | Symbols: | Arabidopsis protein of unknown functio... 62 3e-10
AT4G35710.1 | Symbols: | Arabidopsis protein of unknown functio... 56 2e-08
AT4G35680.1 | Symbols: | Arabidopsis protein of unknown functio... 56 2e-08
AT4G35200.1 | Symbols: | Arabidopsis protein of unknown functio... 51 6e-07
AT1G76220.1 | Symbols: | Arabidopsis protein of unknown functio... 49 3e-06
>AT1G76210.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr1:28595202-28595882 REVERSE LENGTH=226
Length = 226
Score = 141 bits (355), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 129/232 (55%), Gaps = 9/232 (3%)
Query: 1 MTPLDFDALKDMQNSANDLLH-SPLVQQALVHWHEEKWVSDVSESSLRILEVCGISKDVL 59
M ++F+ L+++ + N LLH P +++L +EKW VSE+SLR+L++C +SKDV+
Sbjct: 1 MAQINFEGLRELHDCVNYLLHHCPKTRESLSQQGQEKWTEQVSEASLRMLDICNVSKDVM 60
Query: 60 LLVKEHLQELQFTLRRASIGEPGIEGKITAFNCYRXXXXXXXXXXXXXXXGMKTEKIITS 119
LVK LQ+LQ TLR + + KI A+N Y+ MK + +
Sbjct: 61 TLVKHSLQDLQLTLRGNESSD--VNEKIAAYNRYKNKLKKETLKCLNCLKNMKGNEGRVA 118
Query: 120 MK-DQKLTLVVDVLREVRMTSISTVESLLSLISSPWLDYSRTGKRSFMSKLVHVNLQNNS 178
M +Q L V +VL+EVR ++ VESL SL PWL+ KRS L + +S
Sbjct: 119 MPIEQNLLFVTEVLKEVRRVVVTMVESLFSLGCIPWLE-----KRSSKGSLSSIFSIRSS 173
Query: 179 SDHDMYYDAMVLQSANKRLAGVQMAIEDLEVELECMFRRLIHTRVLLLNIYT 230
D +D +QSA RL ++ + +LE+ELE +FRRLI TRV LLNI T
Sbjct: 174 YLLDDEWDETAVQSATTRLEAAEITVVELEIELESIFRRLIQTRVSLLNILT 225
>AT1G20520.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr1:7106922-7107617 REVERSE LENGTH=231
Length = 231
Score = 119 bits (297), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 90/248 (36%), Positives = 133/248 (53%), Gaps = 37/248 (14%)
Query: 1 MTPLDFDALKDMQNSANDLL-HSPLVQQALVHWHEEKWVSDVSESSLRILEVCGISKDVL 59
M LD + L+++Q+ AN LL H P +++L +E W+ VSE+SL +L+VC +SKDV+
Sbjct: 1 MALLDLEGLRELQDCANYLLDHCPEARESLCQQGKENWIEQVSEASLIMLDVCNVSKDVM 60
Query: 60 LLVKEHLQELQFTLRRASIGEPGIEGKITAFNCYRXXXXXXXXXXXXXXXGMKTEKIITS 119
LV+ LQ+LQ TLR + K+ A+N YR +T K + S
Sbjct: 61 ALVRHGLQDLQLTLR---CNGSNLSEKVAAYNRYRNKLKK------------ETLKCLNS 105
Query: 120 MK---------------DQKLTLVVDVLREVRMTSISTVESLLSLISSPWLDYSRTGKRS 164
+K +Q L V +VL+EVR ++ VESL SL+ PWL+ + S
Sbjct: 106 LKSIEGGGRGMMEMQSIEQNLLFVAEVLKEVRRAVVTMVESLFSLVCVPWLE-RKPSIGS 164
Query: 165 FMSKLVHVNLQNNSSDHDMYYDAMVLQSANKRLAGVQMAIEDLEVELECMFRRLIHTRVL 224
F S D +D + ++SA+ RL ++ +E+LE+ELEC+FRRLI TRV
Sbjct: 165 FSSIFTM-----QFCCFDDAWDEVAMRSASTRLEAAEITVEELEIELECIFRRLIQTRVS 219
Query: 225 LLNIYTYQ 232
LLNI T +
Sbjct: 220 LLNILTAK 227
>AT4G35690.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16921886-16922740 FORWARD LENGTH=284
Length = 284
Score = 76.3 bits (186), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/240 (23%), Positives = 117/240 (48%), Gaps = 15/240 (6%)
Query: 4 LDFDALKDMQNSANDLLHSPLVQQALVHWHEEKWVSDVSESSLRILEVCGISKDVLLLVK 63
+ + L+++ N D L Q+ + +++ ++ + SLR++++C +S+D+++ +
Sbjct: 48 MGLEGLEELYNCTEDFLKMGSTQRVMSSSDGSEFMEEMLDGSLRLMDICSVSRDLMVETQ 107
Query: 64 EHLQELQFTLRRASI--GEPGIEGKITAFNCYRXXXXXXXXXXXXXXXGM-----KTEKI 116
EH++ +Q +RR + GE ++ + + +R + + +
Sbjct: 108 EHVRGVQSCVRRKKVVGGEDQLDVAVAGYVGFRKNMRKEAKRLLGSLKNIDGGLSSSSSV 167
Query: 117 ITSMKDQKLTLVVDVLREVRMTSISTVESLLSLISSPWLDYSRTGKRSFM--SKLVHVNL 174
+++ L +VVD +R+V S++ + S L +S ++ S + K+ HV
Sbjct: 168 NNGEQEEHLVVVVDAMRQVVSVSVAVLRSFLEFLSGRRQSNIKSKLASVLKKKKVHHVEE 227
Query: 175 QNNSSDHDMYYDAMVLQSAN---KRLAGVQMAIEDLEVELECMFRRLIHTRVLLLNIYTY 231
N ++ D + S N K+L V+M+I+ E +LE +FRRLI TR LLNI ++
Sbjct: 228 TKNELEN---LDLEIFCSRNDLQKKLEEVEMSIDGFEKKLEGLFRRLIRTRASLLNIISH 284
>AT4G35660.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16912792-16913658 FORWARD LENGTH=288
Length = 288
Score = 65.1 bits (157), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 59/227 (25%), Positives = 105/227 (46%), Gaps = 25/227 (11%)
Query: 19 LLHSPLVQQALVHWHEEKWVSDVSESSLRILEVCGISKDVLLLVKEHLQELQFTLRRASI 78
++ SP QQAL K V D + S+ +L+V ++D++ + EH+QELQ LRR
Sbjct: 71 VISSPQGQQALRLCRNRKLVEDALDESIVLLDVSDFTRDLIGTLMEHIQELQSALRRRRG 130
Query: 79 GEPGIEGKITAFNCYRXXXXXXXXXXXXXXXGMKTEKIITSMK-----DQKLTLVVDVLR 133
++ +I ++ + +T+K +K D+ ++V ++LR
Sbjct: 131 NLSSVQSEIRSYISFHKKSKTEAARQVKSLARRQTKKKAWVIKQSGGLDEHSSMVSNILR 190
Query: 134 EVRMTSISTVESLLSLISSPWLDYSRTGK----------RSFMSKLVHVNLQNNSSDHDM 183
+ ++IS ++SLL +S+ + + RSF +++ +
Sbjct: 191 QSNASTISILQSLLQFLSTSGENNEKKNGEIGCVDNSMIRSFFGRIIGRKMVKE------ 244
Query: 184 YYDAMVLQSANKRLAGVQMAIEDLEVELECMFRRLIHTRVLLLNIYT 230
DA Q+ RLA V +++E ++ EL + RRLI R LLNI T
Sbjct: 245 -IDA---QTILGRLAMVNVSLEAIKDELSYLSRRLIQHRASLLNIVT 287
>AT2G17070.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr2:7430863-7431654 REVERSE LENGTH=263
Length = 263
Score = 62.4 bits (150), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 104/232 (44%), Gaps = 22/232 (9%)
Query: 6 FDALKDMQNSANDLLHSPLVQQALVHWHEEKWVSDVSESSLRILEVCGISKDVLLLVKEH 65
D L+++ S + L+ P+ QQAL +K V + + SL+IL+VC ISKD L +KE
Sbjct: 48 LDNLQELHESLDKLIRLPVTQQALGQEKNKKDVEQLLDGSLKILDVCNISKDALSQMKEG 107
Query: 66 LQELQFTLRRASIGEPGIEGKITAFNCYRXXXXXXXXXXXXXXXGMKTEKIITSMKDQKL 125
L E+Q LRR + G++ + R + E KD+ L
Sbjct: 108 LMEIQSILRRK---RGDLSGEVKKYLASRKSFKKTFQKVQKSLKAAQAE----DNKDKSL 160
Query: 126 TLVVDVLREVRMTSISTVESLLSLISSPWLDYSRT-GKRSFMSKLVHVNLQNNSSDHDMY 184
V E +I+ +SL S +S S+T K S +SKL++ + + +
Sbjct: 161 A----VFGEAEAVTIAMFDSLFSYMSG-----SKTCSKWSVVSKLMNKKKITCEAQENEF 211
Query: 185 --YDAMVLQSANKRLAGVQM---AIEDLEVELECMFRRLIHTRVLLLNIYTY 231
D+ ++ VQ+ I+D E LE + + LI RV +LN + +
Sbjct: 212 TKVDSEFQSEKTLKMEDVQILESCIQDFEDGLESLSKSLIKYRVSILNSFGH 263
>AT2G17080.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr2:7433326-7434117 REVERSE LENGTH=263
Length = 263
Score = 62.4 bits (150), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 70/234 (29%), Positives = 109/234 (46%), Gaps = 26/234 (11%)
Query: 6 FDALKDMQNSANDLLHSPLVQQALVHWHEEKWVSDVSESSLRILEVCGISKDVLLLVKEH 65
D L+++ S + L+ P+ QQAL H +K V + + SLRIL++C ISKD L +KE
Sbjct: 48 LDNLQELHESLDKLISRPVTQQALSQEHNKKAVEQLLDGSLRILDLCNISKDALSEMKEG 107
Query: 66 LQELQFTLRRASIGEPGIEGK--ITAFNCYRXXXXXXXXXXXXXXXGMKTEKIITSMKDQ 123
L E+Q LRR G+ E K +T+ + K+ K+ + +
Sbjct: 108 LMEIQSILRRKR-GDLSEEVKKYLTSRKSLKKSFQKV----------QKSLKVTQAEDNN 156
Query: 124 KLTLVVDVLREVRMTSISTVESLLSLISSPWLDYSRT-GKRSFMSKLVHVNLQNNSSDHD 182
TL V E ++S +SLLS +S S+T K S +SKL++ + +
Sbjct: 157 DDTLAV--FGEAEAITLSLFDSLLSYMSG-----SKTCSKWSVVSKLMNKKKVTCEAQEN 209
Query: 183 MY--YDAMVLQSANKRLAGVQ---MAIEDLEVELECMFRRLIHTRVLLLNIYTY 231
+ D+ ++ VQ I+DLE LE + + LI RV LNI +
Sbjct: 210 EFTKVDSEFQSEKTLKMDDVQNLESCIQDLEDGLESLSKSLIKYRVSFLNILGH 263
>AT4G35710.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16925301-16926152 FORWARD LENGTH=283
Length = 283
Score = 56.2 bits (134), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 100/215 (46%), Gaps = 9/215 (4%)
Query: 26 QQALVHWHEEKWVSDVSESSLRILEVCGISKDVLLLVKEHLQELQFTLRRASIGEPGIEG 85
+Q ++ +++ ++ + SLR++++C +S+D+++ EH++ +Q +RR + G
Sbjct: 69 KQRVMSSGGSEFMEEMLDGSLRLMDICSVSRDLMVETHEHVRGVQSYVRRKKVSGGGGGD 128
Query: 86 KI-TAFNCYRXXXXXXXXXXXXXXXGMK-----TEKIITSMKDQKLTLVVDVLREVRMTS 139
KI A + Y +K T +D++L V+D +R V S
Sbjct: 129 KIDVAVSDYVGFRKNMRKEAKKLLGSLKKVDGGTRSCDNDHEDEQLVAVIDRVRRVVSVS 188
Query: 140 ISTVESLLSLISSPWLDY-SRTGKRSFMSKLVHVNLQN--NSSDHDMYYDAMVLQSANKR 196
+ ++S L L+S + S+ M K H +N + D ++ D +
Sbjct: 189 VVVLKSFLELLSRRKSNIKSKLASVLKMKKDNHAPAKNVLETLDSAIFGDFLSHDDLQNE 248
Query: 197 LAGVQMAIEDLEVELECMFRRLIHTRVLLLNIYTY 231
L V+M I E LE +FRRLI TR +LNI ++
Sbjct: 249 LEEVEMCIGGFERNLEGLFRRLIRTRASILNIISH 283
>AT4G35680.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16917938-16919749 FORWARD LENGTH=503
Length = 503
Score = 56.2 bits (134), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 100/243 (41%), Gaps = 27/243 (11%)
Query: 9 LKDMQNSANDLLHSPLVQQALVHWHEEKWVSDVSESSLRILEVCGISKDVLLLVKEHLQE 68
L ++ ++LL SP V+ L+H E K + + + S+ +L+V +++V++ ++EH+
Sbjct: 69 LTELYGCVHELLESPYVKHTLLHHQEGKLLDESLDGSVLLLDVYEGTREVIVAMREHVTN 128
Query: 69 LQFTLRRASIGEPGIEGKITAFNCYRXXXXXXXXXXXXXXXGMKTEKIITSMKDQKLTLV 128
L+ LRR +E + A+ R M+T I T+
Sbjct: 129 LKSALRRKG----SLEKEAKAYFNLRKKAKKEISKQINALKKMETRDISTNTDQDSAIAS 184
Query: 129 VDVLREVRMTSISTVESLL-------------------SLISSPWLDYSRTGKR----SF 165
VLRE ++S LL L+S P++ S + K
Sbjct: 185 TSVLRETIQITVSMFRHLLLFLSTIPPPPSPAIFKTTIGLLSIPFVSPSLSDKSLILIKE 244
Query: 166 MSKLVHVNLQNNSSDHDMYYDAMVLQSANKRLAGVQMAIEDLEVELECMFRRLIHTRVLL 225
M L V L + ++ +++ R V+ DLE EL+ + + L+ RVL
Sbjct: 245 MKSLDDVFLGSILDSRKTLFEVETMENEKMRRDVVEDGFRDLEAELDSVSKCLVKNRVLF 304
Query: 226 LNI 228
LNI
Sbjct: 305 LNI 307
>AT4G35200.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16749142-16749903 REVERSE LENGTH=253
Length = 253
Score = 51.2 bits (121), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 103/226 (45%), Gaps = 26/226 (11%)
Query: 9 LKDMQNSANDLLHSPLVQQALVHWHEEKWVSDVSESSLRILEVCGISKDVLLLVKEHLQE 68
L+D+ +S ++ + AL EK + + SLRIL++C I+KD + +KE L E
Sbjct: 48 LQDLHDSLEKMIRLSVTNLALSQDQIEKLL----DGSLRILDLCNIAKDAISQMKEGLME 103
Query: 69 LQFTLRRASIGEPG-IEGKITAFNCYRXXXXXXXXXXXXXXXGMKTEKIITSMKDQKLTL 127
+Q LRR +PG + G++ + R +K+ K+ S +L
Sbjct: 104 IQSILRR----KPGDLSGEVKKYLVSRKFLKKSLQKV------IKSLKVCQSKDSTNASL 153
Query: 128 VVDVLREVRMTSISTVESLLSLISSPWLDYSRTGKRSFMSKLVHVNLQNNSSDHDMY--Y 185
V V +++ ESL S +S GK S +SK++ N ++ + +
Sbjct: 154 V--VFGRAEAVTMALFESLFSFMSGS----KACGKWSLVSKMMSQNKVTCEAEANEFTRI 207
Query: 186 DAMVLQSANKRLAGVQ---MAIEDLEVELECMFRRLIHTRVLLLNI 228
D+ + ++ VQ I+DLE +E + + LI RV +LNI
Sbjct: 208 DSEFQSEKSLQMEDVQNLESCIQDLEDGIESLSKSLIKYRVSILNI 253
>AT1G76220.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr1:28597530-28598300 REVERSE LENGTH=256
Length = 256
Score = 48.9 bits (115), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 93/194 (47%), Gaps = 16/194 (8%)
Query: 40 DVSESSLRILEVCGISKDVLLLVKEHLQELQFTLRRASIGEPGIEGKITAFNCYRXXXXX 99
DVS+ + +L++C +DVL+ +KE ++EL+ +LRR I E + AF
Sbjct: 75 DVSDGFIEVLDICSTIRDVLMEIKEQVRELESSLRRRLIRSKSGEDQ-EAFVARETDAYV 133
Query: 100 XXXXXXXXXXGMKTEKIITSMKDQKLTL--VVDVLREVRMTSISTVESLL-SLISSPWLD 156
+ +K M+ +K V++V++ V TS + SLL +++ D
Sbjct: 134 FKRRALSRTIVKQLKKTEEKMRKRKRDCGDVINVMKRVEKTSFDVLVSLLIEVVTKDQRD 193
Query: 157 YSRTGKRSFMSKLVHVNLQNNSSDHDMYYDAMVLQSANKRLAGVQMAIEDLEVELECMFR 216
+ +R +S++ N +N D D K+L + IE+ E ELEC+++
Sbjct: 194 QKKGSRRGIVSRI--FNKKNQEVDVDEL----------KKLRETEHEIEETEKELECVYK 241
Query: 217 RLIHTRVLLLNIYT 230
+L+ TRV LN+ T
Sbjct: 242 KLLKTRVSFLNMLT 255