Miyakogusa Predicted Gene
- Lj1g3v3382500.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3382500.1 Non Chatacterized Hit- tr|F6GZJ1|F6GZJ1_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,39.07,2e-19,seg,NULL; DUF241,Protein of unknown function DUF241,
plant,CUFF.30581.1
(218 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G17070.1 | Symbols: | Arabidopsis protein of unknown functio... 139 2e-33
AT2G17080.1 | Symbols: | Arabidopsis protein of unknown functio... 129 1e-30
AT4G35210.1 | Symbols: | Arabidopsis protein of unknown functio... 112 1e-25
AT4G35200.1 | Symbols: | Arabidopsis protein of unknown functio... 108 4e-24
AT4G35690.1 | Symbols: | Arabidopsis protein of unknown functio... 78 4e-15
AT2G17680.1 | Symbols: | Arabidopsis protein of unknown functio... 72 2e-13
AT1G76210.1 | Symbols: | Arabidopsis protein of unknown functio... 65 3e-11
AT4G35680.1 | Symbols: | Arabidopsis protein of unknown functio... 64 7e-11
AT4G35710.1 | Symbols: | Arabidopsis protein of unknown functio... 63 1e-10
AT1G76220.1 | Symbols: | Arabidopsis protein of unknown functio... 62 4e-10
AT1G20520.1 | Symbols: | Arabidopsis protein of unknown functio... 58 5e-09
AT4G35720.1 | Symbols: | Arabidopsis protein of unknown functio... 57 8e-09
AT4G35660.1 | Symbols: | Arabidopsis protein of unknown functio... 57 1e-08
>AT2G17070.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr2:7430863-7431654 REVERSE LENGTH=263
Length = 263
Score = 139 bits (349), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 127/208 (61%), Gaps = 7/208 (3%)
Query: 11 SLHIRSNSLPSSPHSLVSECEEHLQRLKNTEATXXXXXXXXXXQKLNGLLDLHDCIDRLL 70
S H+RS+S PS PH + +E L RL+++E T Q+L+ L +LH+ +D+L+
Sbjct: 4 SFHVRSHSYPSIPHPQAAHVDEQLARLRSSEETSTSSSSSIC-QRLDNLQELHESLDKLI 62
Query: 71 QLPIKQQELARECNDKSVDDTLEGSLKLLDICSELKDFLLMSKEIMYELQSVIRRRRSNE 130
+LP+ QQ L +E N K V+ L+GSLK+LD+C+ KD L KE + E+QS++RR+R +
Sbjct: 63 RLPVTQQALGQEKNKKDVEQLLDGSLKILDVCNISKDALSQMKEGLMEIQSILRRKRGDL 122
Query: 131 TGLKVEGAKYLASRKKLNKAIRKSLGNLKAMKHECVEDRTSTIISILKEAEQVTVSSLES 190
+G E KYLASRK K +K +LKA + E +D++ +++ EAE VT++ +S
Sbjct: 123 SG---EVKKYLASRKSFKKTFQKVQKSLKAAQAEDNKDKS---LAVFGEAEAVTIAMFDS 176
Query: 191 MLMLICDQPKHRRWSTISKMLQPKRIAC 218
+ + +WS +SK++ K+I C
Sbjct: 177 LFSYMSGSKTCSKWSVVSKLMNKKKITC 204
>AT2G17080.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr2:7433326-7434117 REVERSE LENGTH=263
Length = 263
Score = 129 bits (325), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/208 (35%), Positives = 121/208 (58%), Gaps = 7/208 (3%)
Query: 11 SLHIRSNSLPSSPHSLVSECEEHLQRLKNTEATXXXXXXXXXXQKLNGLLDLHDCIDRLL 70
S H+RSNS PS H + +E L RL+++ Q+L+ L +LH+ +D+L+
Sbjct: 4 SFHVRSNSFPSRSHPQAAHVDEQLARLRSS-EQASSSSSSSICQRLDNLQELHESLDKLI 62
Query: 71 QLPIKQQELARECNDKSVDDTLEGSLKLLDICSELKDFLLMSKEIMYELQSVIRRRRSNE 130
P+ QQ L++E N K+V+ L+GSL++LD+C+ KD L KE + E+QS++RR+R +
Sbjct: 63 SRPVTQQALSQEHNKKAVEQLLDGSLRILDLCNISKDALSEMKEGLMEIQSILRRKRGD- 121
Query: 131 TGLKVEGAKYLASRKKLNKAIRKSLGNLKAMKHECVEDRTSTIISILKEAEQVTVSSLES 190
L E KYL SRK L K+ +K +LK + ED +++ EAE +T+S +S
Sbjct: 122 --LSEEVKKYLTSRKSLKKSFQKVQKSLKVTQ---AEDNNDDTLAVFGEAEAITLSLFDS 176
Query: 191 MLMLICDQPKHRRWSTISKMLQPKRIAC 218
+L + +WS +SK++ K++ C
Sbjct: 177 LLSYMSGSKTCSKWSVVSKLMNKKKVTC 204
>AT4G35210.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16751428-16752180 FORWARD LENGTH=250
Length = 250
Score = 112 bits (281), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 117/208 (56%), Gaps = 17/208 (8%)
Query: 11 SLHIRSNSLPSSPHSLVSECEEHLQRLKNTEATXXXXXXXXXXQKLNGLLDLHDCIDRLL 70
S H+RS+S PS H + +E L RL+++ Q+L+ L DLHD +++++
Sbjct: 4 SFHVRSSSYPSRQHPQAAHVDEQLTRLRSSGTASSSSIC----QRLSNLQDLHDSLEKMI 59
Query: 71 QLPIKQQELARECNDKSVDDTLEGSLKLLDICSELKDFLLMSKEIMYELQSVIRRRRSNE 130
+L + Q L+++ ++ L+GS+K+LD+CS KD L KE + E+QS++RR+R +
Sbjct: 60 RLSVTNQALSQD----QIEKLLDGSIKILDLCSISKDGLSQMKESLKEIQSIVRRKRGD- 114
Query: 131 TGLKVEGAKYLASRKKLNKAIRKSLGNLKAMKHECVEDRTSTIISILKEAEQVTVSSLES 190
L E KYLASRK L K+ K L +LK +++ + +++ EAE VT++ ES
Sbjct: 115 --LSAEVKKYLASRKFLKKSFEKVLKSLKTSQNK------NDALAVFGEAETVTIALFES 166
Query: 191 MLMLICDQPKHRRWSTISKMLQPKRIAC 218
+ + +WS +SKM+ + C
Sbjct: 167 LFSFMSGSKACGKWSLVSKMMSQSKGTC 194
>AT4G35200.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16749142-16749903 REVERSE LENGTH=253
Length = 253
Score = 108 bits (269), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 119/208 (57%), Gaps = 14/208 (6%)
Query: 11 SLHIRSNSLPSSPHSLVSECEEHLQRLKNTEATXXXXXXXXXXQKLNGLLDLHDCIDRLL 70
S H+RSNS PS H + +E L RL+++++ Q+L+ L DLHD +++++
Sbjct: 4 SFHVRSNSYPSRQHPQAAHVDEQLTRLRSSDSASSSSIC----QRLSNLQDLHDSLEKMI 59
Query: 71 QLPIKQQELARECNDKSVDDTLEGSLKLLDICSELKDFLLMSKEIMYELQSVIRRRRSNE 130
+L + L+++ ++ L+GSL++LD+C+ KD + KE + E+QS++RR+ +
Sbjct: 60 RLSVTNLALSQD----QIEKLLDGSLRILDLCNIAKDAISQMKEGLMEIQSILRRKPGDL 115
Query: 131 TGLKVEGAKYLASRKKLNKAIRKSLGNLKAMKHECVEDRTSTIISILKEAEQVTVSSLES 190
+G E KYL SRK L K+++K + +LK + +D T+ + + AE VT++ ES
Sbjct: 116 SG---EVKKYLVSRKFLKKSLQKVIKSLKVCQS---KDSTNASLVVFGRAEAVTMALFES 169
Query: 191 MLMLICDQPKHRRWSTISKMLQPKRIAC 218
+ + +WS +SKM+ ++ C
Sbjct: 170 LFSFMSGSKACGKWSLVSKMMSQNKVTC 197
>AT4G35690.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16921886-16922740 FORWARD LENGTH=284
Length = 284
Score = 78.2 bits (191), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 55/216 (25%), Positives = 110/216 (50%), Gaps = 14/216 (6%)
Query: 13 HIRSNSLPSSPHSLVSECEEHLQRLKNTEATXXXXXXXXXXQKLNGLLDLHDCIDRLLQL 72
+RS SLPSS H + EE L ++K L GL +L++C + L++
Sbjct: 9 QLRSISLPSSSHPSTTGIEESLNKVKTI--NTMTGSSESVLMGLEGLEELYNCTEDFLKM 66
Query: 73 PIKQQELARECNDKSVDDTLEGSLKLLDICSELKDFLLMSKEIMYELQSVIRRRR--SNE 130
Q+ ++ + +++ L+GSL+L+DICS +D ++ ++E + +QS +RR++ E
Sbjct: 67 GSTQRVMSSSDGSEFMEEMLDGSLRLMDICSVSRDLMVETQEHVRGVQSCVRRKKVVGGE 126
Query: 131 TGLKVEGAKYLASRKKLNKAIRKSLGNLK----------AMKHECVEDRTSTIISILKEA 180
L V A Y+ RK + K ++ LG+LK ++ + E+ ++ +++
Sbjct: 127 DQLDVAVAGYVGFRKNMRKEAKRLLGSLKNIDGGLSSSSSVNNGEQEEHLVVVVDAMRQV 186
Query: 181 EQVTVSSLESMLMLICDQPKHRRWSTISKMLQPKRI 216
V+V+ L S L + + + S ++ +L+ K++
Sbjct: 187 VSVSVAVLRSFLEFLSGRRQSNIKSKLASVLKKKKV 222
>AT2G17680.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr2:7679241-7680119 FORWARD LENGTH=292
Length = 292
Score = 72.4 bits (176), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/199 (29%), Positives = 99/199 (49%), Gaps = 25/199 (12%)
Query: 13 HIRSNSLPSSPHSLVSECEEHLQRLKNTEATXXXXXXXXXXQKLNGLLDLHDCIDRLLQL 72
H+RS SL S H + EE L + T T L+GL DL+DC + LL++
Sbjct: 9 HVRSISLQSRSHPSTAAIEESLDKFLITMNTSTMASSESVHSGLSGLEDLYDCSEDLLKM 68
Query: 73 PIKQQELA---------RECNDKSVDDTLEGSLKLLDICSELKDFLLMSKEIMYELQSVI 123
Q+ L+ R+ + +++ L+GSL+L+DIC+ +D ++ + E + LQS +
Sbjct: 69 GSTQRVLSFSDEKKKKKRKVKGEFMEEMLDGSLRLMDICNVSRDLMVETHEHVLGLQSCV 128
Query: 124 RRRRSNETGLKVEGAKYLASRKKLNKAIRKSLGNLKAMK-------HECVEDRTSTIISI 176
RRR+ V+ + Y+ RK + K ++K LG+LK + H +D +++
Sbjct: 129 RRRKD------VDVSGYVGFRKNMRKEVKKLLGSLKNINVGLVMRDHGYDQDGDIHFLAV 182
Query: 177 LKEAEQV---TVSSLESML 192
+ +V TVS L+S
Sbjct: 183 IHAMRRVVYMTVSVLKSFF 201
>AT1G76210.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr1:28595202-28595882 REVERSE LENGTH=226
Length = 226
Score = 65.5 bits (158), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 76/148 (51%), Gaps = 9/148 (6%)
Query: 56 LNGLLDLHDCIDRLLQ-LPIKQQELARECNDKSVDDTLEGSLKLLDICSELKDFLLMSKE 114
GL +LHDC++ LL P ++ L+++ +K + E SL++LDIC+ KD + + K
Sbjct: 6 FEGLRELHDCVNYLLHHCPKTRESLSQQGQEKWTEQVSEASLRMLDICNVSKDVMTLVKH 65
Query: 115 IMYELQSVIRRRRSNETGLKVEGAKYLASRKKLNKAIRKSLGNLKAMKHE------CVED 168
+ +LQ +R S++ K+ A Y + KL K K L LK MK +E
Sbjct: 66 SLQDLQLTLRGNESSDVNEKI--AAYNRYKNKLKKETLKCLNCLKNMKGNEGRVAMPIEQ 123
Query: 169 RTSTIISILKEAEQVTVSSLESMLMLIC 196
+ +LKE +V V+ +ES+ L C
Sbjct: 124 NLLFVTEVLKEVRRVVVTMVESLFSLGC 151
>AT4G35680.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16917938-16919749 FORWARD LENGTH=503
Length = 503
Score = 64.3 bits (155), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 77/149 (51%), Gaps = 9/149 (6%)
Query: 56 LNGLLDLHDCIDRLLQLPIKQQELARECNDKSVDDTLEGSLKLLDICSELKDFLLMSKEI 115
L L +L+ C+ LL+ P + L K +D++L+GS+ LLD+ ++ ++ +E
Sbjct: 66 LVNLTELYGCVHELLESPYVKHTLLHHQEGKLLDESLDGSVLLLDVYEGTREVIVAMREH 125
Query: 116 MYELQSVIRRRRSNETGLKVEGAKYLASRKKLNKAIRKSLGNLKAMKHECVE---DRTST 172
+ L+S +RR+ S L+ E Y RKK K I K + LK M+ + D+ S
Sbjct: 126 VTNLKSALRRKGS----LEKEAKAYFNLRKKAKKEISKQINALKKMETRDISTNTDQDSA 181
Query: 173 I--ISILKEAEQVTVSSLESMLMLICDQP 199
I S+L+E Q+TVS +L+ + P
Sbjct: 182 IASTSVLRETIQITVSMFRHLLLFLSTIP 210
>AT4G35710.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16925301-16926152 FORWARD LENGTH=283
Length = 283
Score = 63.2 bits (152), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 80/151 (52%), Gaps = 7/151 (4%)
Query: 13 HIRSNSLPSSPHSLVSECEEHLQRLKNTEATXXXXXXXXXXQKLNGLLDLHDCIDRLLQL 72
+RS SLPS S EE L ++K T L GL +L+ ++ L++
Sbjct: 9 QLRSISLPSRSQPSTSGLEESLNKIKTINTTTGSSESIL--MGLAGLEELYIFLEEFLKM 66
Query: 73 PIKQQELARECNDKSVDDTLEGSLKLLDICSELKDFLLMSKEIMYELQSVIRRRRSNETG 132
KQ+ ++ + +++ L+GSL+L+DICS +D ++ + E + +QS +RR++ + G
Sbjct: 67 GSKQRVMS-SGGSEFMEEMLDGSLRLMDICSVSRDLMVETHEHVRGVQSYVRRKKVSGGG 125
Query: 133 ----LKVEGAKYLASRKKLNKAIRKSLGNLK 159
+ V + Y+ RK + K +K LG+LK
Sbjct: 126 GGDKIDVAVSDYVGFRKNMRKEAKKLLGSLK 156
>AT1G76220.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr1:28597530-28598300 REVERSE LENGTH=256
Length = 256
Score = 61.6 bits (148), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 102/221 (46%), Gaps = 33/221 (14%)
Query: 10 SSLHIRSNSLPSSPHSLVSECEEHLQRLKNTEATXXXXXXXXXXQKLNGLLDLHDCIDRL 69
S H+RS S + L E+HL LK + +KL L +L++ ++
Sbjct: 7 SGAHVRSTSWSEDVNPLSRAIEDHLLLLKKRPESAR--------RKLGVLKNLYEVVEVF 58
Query: 70 LQLPIKQQELARECNDKSVDDTLEGSLKLLDICSELKDFLLMSKEIMYELQSVIRRR--R 127
L+ Q + + +D +G +++LDICS ++D L+ KE + EL+S +RRR R
Sbjct: 59 LRF----QTTKTQKSFTGFEDVSDGFIEVLDICSTIRDVLMEIKEQVRELESSLRRRLIR 114
Query: 128 S-----NETGLKVEGAKYLASRKKLNKAIRKSLG----NLKAMKHECVEDRTSTIISILK 178
S E + E Y+ R+ L++ I K L ++ K +C +I+++K
Sbjct: 115 SKSGEDQEAFVARETDAYVFKRRALSRTIVKQLKKTEEKMRKRKRDC-----GDVINVMK 169
Query: 179 EAEQVTVSSLESMLMLIC-----DQPKHRRWSTISKMLQPK 214
E+ + L S+L+ + DQ K R +S++ K
Sbjct: 170 RVEKTSFDVLVSLLIEVVTKDQRDQKKGSRRGIVSRIFNKK 210
>AT1G20520.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr1:7106922-7107617 REVERSE LENGTH=231
Length = 231
Score = 57.8 bits (138), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 75/150 (50%), Gaps = 12/150 (8%)
Query: 56 LNGLLDLHDCIDRLL-QLPIKQQELARECNDKSVDDTLEGSLKLLDICSELKDFLLMSKE 114
L GL +L DC + LL P ++ L ++ + ++ E SL +LD+C+ KD + + +
Sbjct: 6 LEGLRELQDCANYLLDHCPEARESLCQQGKENWIEQVSEASLIMLDVCNVSKDVMALVRH 65
Query: 115 IMYELQSVIRRRRSNETGLKVEGAKYLASRKKLNKAIRKSLGNLKA--------MKHECV 166
+ +LQ + R N + L + A Y R KL K K L +LK+ M+ + +
Sbjct: 66 GLQDLQLTL---RCNGSNLSEKVAAYNRYRNKLKKETLKCLNSLKSIEGGGRGMMEMQSI 122
Query: 167 EDRTSTIISILKEAEQVTVSSLESMLMLIC 196
E + +LKE + V+ +ES+ L+C
Sbjct: 123 EQNLLFVAEVLKEVRRAVVTMVESLFSLVC 152
>AT4G35720.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16927972-16928949 FORWARD LENGTH=325
Length = 325
Score = 57.0 bits (136), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 101/215 (46%), Gaps = 19/215 (8%)
Query: 15 RSNSLPSSPHSLVSECEEHLQRLKNTEATXXXXXXXXXXQKLNGLLDLHDCIDRLLQLPI 74
R SLP H V +E + +++ + L+GL +L+ C+ L
Sbjct: 17 RCVSLPVRSHPSVRRIQEVVSKVRAL-GSSSLDSRTIVRDSLSGLTELYRCLSEDLFKSS 75
Query: 75 KQQELARECNDKSVDDTLEGSLKLLDICSELKDFLLMSKEIMYELQSVIRR-RRSNETGL 133
+ + A D +++ LE SLK L++C KD K+I+ ELQS +RR ++ E L
Sbjct: 76 SETQQALLNGDGLMEELLEVSLKYLEVCGGAKDAASRIKKIVVELQSALRRSKKGGEFSL 135
Query: 134 KVEGAKYLASRKKLNKAIRKSL-------GNLKAMKHECVEDRTSTIISILKEAEQVTVS 186
+ + Y+ASRK++ + I+K + +L+++ + + S ++ +++E +T
Sbjct: 136 ESDVDAYVASRKEIKQEIKKYMVMSKETDASLESVWCDGDDQEMSALVRVMQETSVMTCF 195
Query: 187 SLESMLMLICDQPK---------HRRWSTISKMLQ 212
L S+ + PK H+ W + K+++
Sbjct: 196 VLRSVFSFL-SSPKGLKTKNHHHHKGWGIVMKLVK 229
>AT4G35660.1 | Symbols: | Arabidopsis protein of unknown function
(DUF241) | chr4:16912792-16913658 FORWARD LENGTH=288
Length = 288
Score = 57.0 bits (136), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 109/211 (51%), Gaps = 18/211 (8%)
Query: 1 MAFPETNTKSSLHI--RSNSLPSS-PHSLVSECEEHLQRLKNTEATXXXXXXXXXXQKLN 57
MA +++ ++ H+ RS SLP+ H EE L++++ ++ L
Sbjct: 1 MAESSSSSVATTHVPARSISLPTRLIHPKAQRVEEELKKIQALNSSSSASSRIQL--GLA 58
Query: 58 GLLDLHDCID-RLLQLPIKQQELAREC-NDKSVDDTLEGSLKLLDICSELKDFLLMSKEI 115
L++L+D ++ +++ P QQ L R C N K V+D L+ S+ LLD+ +D + E
Sbjct: 59 KLVELYDFVNEQVISSPQGQQAL-RLCRNRKLVEDALDESIVLLDVSDFTRDLIGTLMEH 117
Query: 116 MYELQSVIRRRRSNETGLKVEGAKYLASRKKL-NKAIR--KSLGNLKAMKHECV------ 166
+ ELQS +RRRR N + ++ E Y++ KK +A R KSL + K V
Sbjct: 118 IQELQSALRRRRGNLSSVQSEIRSYISFHKKSKTEAARQVKSLARRQTKKKAWVIKQSGG 177
Query: 167 -EDRTSTIISILKEAEQVTVSSLESMLMLIC 196
++ +S + +IL+++ T+S L+S+L +
Sbjct: 178 LDEHSSMVSNILRQSNASTISILQSLLQFLS 208