Miyakogusa Predicted Gene
- Lj1g3v0099440.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v0099440.1 Non Characterized Hit- tr|C5X1I3|C5X1I3_SORBI
Putative uncharacterized protein Sb01g021730
OS=Sorghu,42.06,1e-18,DUF581,Protein of unknown function DUF581;
seg,NULL,NODE_16964_length_1430_cov_156.974823.path2.1
(295 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Sequences producing significant alignments:                      Score (bits)   E value
AT3G22550.1 | Symbols: | Protein of unknown function (DUF581) |... 175 4e-44
AT3G63210.1 | Symbols: MARD1 | Protein of unknown function (DUF5... 144 5e-35
AT5G11460.1 | Symbols: | Protein of unknown function (DUF581) |... 103 1e-22
AT2G25690.2 | Symbols: | Protein of unknown function (DUF581) |... 81 7e-16
AT2G25690.1 | Symbols: | Protein of unknown function (DUF581) |... 81 7e-16
AT1G22160.1 | Symbols: | Protein of unknown function (DUF581) |... 66 3e-11
AT4G39795.1 | Symbols: | Protein of unknown function (DUF581) |... 61 7e-10
AT5G49120.1 | Symbols: | Protein of unknown function (DUF581) |... 60 2e-09
AT5G47060.1 | Symbols: | Protein of unknown function (DUF581) |... 56 2e-08
AT1G78020.1 | Symbols: | Protein of unknown function (DUF581) |... 56 3e-08
AT4G17670.1 | Symbols: | Protein of unknown function (DUF581) |... 56 4e-08
AT1G74940.1 | Symbols: | Protein of unknown function (DUF581) |... 55 6e-08
AT1G53903.1 | Symbols: | Protein of unknown function (DUF581) |... 55 6e-08
AT1G53885.1 | Symbols: | Protein of unknown function (DUF581) |... 55 6e-08
AT2G44670.1 | Symbols: | Protein of unknown function (DUF581) |... 52 5e-07
AT5G65040.1 | Symbols: | Protein of unknown function (DUF581) |... 52 5e-07
AT5G20700.1 | Symbols: | Protein of unknown function (DUF581) |... 52 6e-07
AT1G19200.1 | Symbols: | Protein of unknown function (DUF581) |... 52 7e-07
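
The hit summary above can be read programmatically. The following is a minimal sketch, assuming each summary line ends with the bit score followed by the E-value and starts with the subject identifier. The names `HIT_LINE`, `parse_hit_line` and `parse_hit_table` are hypothetical helpers introduced for illustration and are not part of BLAST. The handling of E-values written without a leading digit (such as `e-100`) is an assumption about legacy BLAST output; no such value occurs in this report.

```python
import re

# Subject ID, optional description, then bit score and E-value as the last two tokens.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+(?:.*\s)?(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_hit_line(line):
    """Return (subject_id, bit_score, e_value), or None if the line is not a hit."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    evalue = match.group("evalue")
    # Assumption: legacy BLAST may print very small E-values as "e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    try:
        return match.group("id"), float(match.group("bits")), float(evalue)
    except ValueError:
        return None


def parse_hit_table(lines):
    """Keep only the lines that parse as hits, in their original order."""
    hits = (parse_hit_line(line) for line in lines)
    return [hit for hit in hits if hit is not None]


if __name__ == "__main__":
    sample = [
        "Sequences producing significant alignments: (bits) Value",
        "AT3G22550.1 | Symbols: | Protein of unknown function (DUF581) |... 175 4e-44",
        "AT3G63210.1 | Symbols: MARD1 | Protein of unknown function (DUF5... 144 5e-35",
    ]
    for subject_id, bits, evalue in parse_hit_table(sample):
        print(subject_id, bits, evalue)
```

Header lines are skipped because their last two tokens are not numeric, so the same function can be applied to every line of the summary block without pre-filtering.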
>AT3G22550.1 | Symbols: | Protein of unknown function (DUF581) |
chr3:7991827-7992805 REVERSE LENGTH=267
Length = 267
Score = 175 bits (443), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 115/275 (41%), Positives = 151/275 (54%), Gaps = 34/275 (12%)
Query: 11 AENHRKSGSSFFNSPRLFTNLSP-KSFNEAETMMSPTSILDSKPFSGLKNPFWCESNSPR 69
++N ++S ++ F PRLFT S KSF E + + SPTSILD+KPFS LKNPF S++P+
Sbjct: 19 SQNQKQSKTTPF--PRLFTAFSSFKSFTENDAVASPTSILDTKPFSVLKNPFG--SDNPK 74
Query: 70 TPGGGEVKRCWENLDSKGVGLGLVDALVDDHKHGEVNSKSESRMVLFGSQLKIQIPPLTP 129
T + L+ K +GL +VD+L+ D S +LFGSQL+I++P
Sbjct: 75 T----QEPETRLKLEPKRIGLAIVDSLIQDETP---EPGPRSGTILFGSQLRIRVP---- 123
Query: 130 TPTFSSADFAIKTRNXXXXXXXXXXXXXPMGKYP--YGCANTHQVFTGCLAASEMELSED 187
SS+DF IKTRN P K P + ++ +G AS+MELSED
Sbjct: 124 DSPISSSDFGIKTRNSQ-----------PETKKPGSESGLGSPRIISGYFPASDMELSED 172
Query: 188 YTRVTSHGPNPRTTHIFDNXXXXXXXXXXXXXXXXXXMENGCFPHHTSYYPSESFLSNCF 247
YT VT HGPNPRT HIFDN E+ + Y P +SFLS C
Sbjct: 173 YTCVTCHGPNPRTIHIFDNCIVESQPGVVFFRSSDPVNES-----DSDYSPPDSFLSCCC 227
Query: 248 YCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEE 282
CKK+LG DI+MYRG+RAFCS ECR M++ E
Sbjct: 228 NCKKSLGPRDDIFMYRGDRAFCSSECRSIEMMMSE 262
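
The statistics line of each alignment gives identities, positives and gaps as counts over the alignment length, followed by a percentage. In this report the percentages are consistent with truncation rather than rounding (for example, 115/275 is about 41.8% but is shown as 41%). The sketch below parses such a line and checks that observation. `STATS`, `parse_stats` and `truncated_percent` are hypothetical names used only for illustration, and the truncation rule is inferred from the values in this report, not taken from BLAST documentation.

```python
import re

# Matches e.g. "Identities = 115/275 (41%)"; the Gaps field is absent for ungapped hits.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {label: (count, alignment_length, reported_percent)}."""
    return {
        label: (int(count), int(length), int(percent))
        for label, count, length, percent in STATS.findall(line)
    }


def truncated_percent(count, length):
    """Integer percentage, dropping the fractional part (assumed BLAST behaviour)."""
    return count * 100 // length


if __name__ == "__main__":
    line = "Identities = 115/275 (41%), Positives = 151/275 (54%), Gaps = 34/275 (12%)"
    for label, (count, length, reported) in parse_stats(line).items():
        print(label, count, length, reported, truncated_percent(count, length))
```

For hits without gaps, such as the short C-terminal matches later in the report, the returned dictionary simply has no `Gaps` entry.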
>AT3G63210.1 | Symbols: MARD1 | Protein of unknown function (DUF581)
| chr3:23354019-23354906 REVERSE LENGTH=263
Length = 263
Score = 144 bits (364), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 109/280 (38%), Positives = 140/280 (50%), Gaps = 53/280 (18%)
Query: 10 PAENHRKSGSSFFNSP--RLFTN---LSPKSFNEAETMMSPTSILDSKP--FSGLKNPFW 62
P N S F+SP R FT+ ++P F+ +++SPTSIL++ P FS KNP
Sbjct: 27 PKPNTCHCSPSLFSSPKFRFFTSKMMMTP--FDSDFSLVSPTSILEANPSIFSS-KNPKP 83
Query: 63 CESNSPRTPGGGEVKRCWENLDSKGVGLGLVDALVDDHKHGEVNSKSESRMVLFGSQLKI 122
P P + S V GL D + D + + + K ++MVLFGS+L++
Sbjct: 84 VSYFEPTIPNP-------QRFHSPDV-FGLADLVKDGDSNRDHSRKPVNKMVLFGSKLRV 135
Query: 123 QIPPLTPTPTFSSADFAIKTRNXXXXXXXXXXXXXPMGKYPYGCANTHQVFTGCLAASEM 182
QIP SSADF KT +YP C + V T LA SE+
Sbjct: 136 QIP--------SSADFGTKTG----------------IRYP-PCQLSPCVQTKVLAVSEI 170
Query: 183 ELSEDYTRVTSHGPNPRTTHIFDNXXXXXXXXXXXXXXXXXXMENGCFPHHTSYYPSESF 242
+ +EDYTRV SHGPNP THIFDN ME +ESF
Sbjct: 171 DQTEDYTRVISHGPNPTITHIFDN-SVFVEATPCSVPLPQPAMETKS---------TESF 220
Query: 243 LSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEE 282
LS CF CKKNL Q +DIY+YRGE+ FCS ECRYQ MLL++
Sbjct: 221 LSRCFTCKKNLDQKQDIYIYRGEKGFCSSECRYQEMLLDQ 260
>AT5G11460.1 | Symbols: | Protein of unknown function (DUF581) |
chr5:3657064-3658388 REVERSE LENGTH=344
Length = 344
Score = 103 bits (258), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/270 (32%), Positives = 117/270 (43%), Gaps = 39/270 (14%)
Query: 37 NEAETMMSPTSILDSKPFSGLKNPFWCESNSPRTPGGGEVKRCWENLDSKGVGLGLVDAL 96
++ E+ SPTS LD + FS L NPF ++S R+ G+ +R W DS VGL +V +L
Sbjct: 41 SDYESAWSPTSPLDFRLFSTLGNPF--AASSSRSIWRGK-QRSW---DSGKVGLSIVHSL 94
Query: 97 VDDHKHGE----VNSKSESRMVLFGSQLKIQIPP----------LTPTPTFSSADFAI-- 140
VDDH V +S+ ++FGS ++ P L P +A F I
Sbjct: 95 VDDHHTDSSATIVLPSPDSKNIIFGSLMRSGQKPHLLSQPFTKALMPKDVIPNAVFEIGH 154
Query: 141 ------KTRNXXXXXXXXXXXXXPMGKYPYGCANTHQVFTGCL---AASEMELSEDYTRV 191
+ R C T Q G L S+ME+SEDYT V
Sbjct: 155 DVIDVLELRKSGSVDAAYCSGAENFSVNNNACQVTKQD-PGSLNGGTESDMEISEDYTCV 213
Query: 192 TSHGPNPRTTHIFDNXXXXXXXXXXXXXXXXXXMENGCFPHH-------TSYYPSESFLS 244
SHGPNP+TTH + + + F P + FLS
Sbjct: 214 ISHGPNPKTTHFYGDQVMESVEREELKNRCCKNEKESIFAVAPLDLTTPVDVLPPKDFLS 273
Query: 245 NCFYCKKNLGQGKDIYMYRGERAFCSHECR 274
C+ C K LG G+DIYMY G +AFCS ECR
Sbjct: 274 FCYGCSKKLGMGEDIYMYSGYKAFCSSECR 303
>AT2G25690.2 | Symbols: | Protein of unknown function (DUF581) |
chr2:10940530-10941649 REVERSE LENGTH=324
Length = 324
Score = 81.3 bits (199), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 114/276 (41%), Gaps = 48/276 (17%)
Query: 34 KSFNEAETMMSPTSILDSKPFSGLKNPFWCESNSPRTPGGGEVKRCWENLDSKGVGLGLV 93
K ++++ + SP S L+ + S + + F+ S PR+ + C + VGL +V
Sbjct: 52 KCISDSDFVRSPKSPLEFRVLSTMADSFFLRS--PRSSLTAHLNCCCG--PAAKVGLSIV 107
Query: 94 DALVDDHKHGEVNSKSESRMVLFGSQLKIQIPPLT---PTPTFSSADFAIKTRNXXXXXX 150
D+L DD + ++FG L+I+ + P F A+ + K N
Sbjct: 108 DSLGDD--------RCLLPDIVFGPALRIKCSEVMDKHPKLLFPVANKSKKIENERSGVV 159
Query: 151 XXXXXXXPMGKYPYGCANTHQVFTGCL------------------------AASEMELSE 186
+ P G N CL A S + E
Sbjct: 160 FEIGDNSSETE-PVGLRNRSFSANDCLRKTRVLSRSKLGQEGDFPGSGSDNAFSSEDDME 218
Query: 187 DYTRVTSHGPNPRTTHIFDNXXXXXXXXXXXXXXXXXXMENGCFPHHTSYYPSESFLSNC 246
DYT + +HGPNP+TTHI+ + S +PS++FL C
Sbjct: 219 DYTCIIAHGPNPKTTHIYGDRVLECHKNELKGDEDNKE-------KFGSVFPSDNFLGIC 271
Query: 247 FYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEE 282
+C K LG G DIYMYR E++FCS ECR + M+++E
Sbjct: 272 NFCNKKLGGGDDIYMYR-EKSFCSEECRSEEMMIDE 306
>AT2G25690.1 | Symbols: | Protein of unknown function (DUF581) |
chr2:10940530-10941649 REVERSE LENGTH=324
Length = 324
Score = 81.3 bits (199), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 114/276 (41%), Gaps = 48/276 (17%)
Query: 34 KSFNEAETMMSPTSILDSKPFSGLKNPFWCESNSPRTPGGGEVKRCWENLDSKGVGLGLV 93
K ++++ + SP S L+ + S + + F+ S PR+ + C + VGL +V
Sbjct: 52 KCISDSDFVRSPKSPLEFRVLSTMADSFFLRS--PRSSLTAHLNCCCG--PAAKVGLSIV 107
Query: 94 DALVDDHKHGEVNSKSESRMVLFGSQLKIQIPPLT---PTPTFSSADFAIKTRNXXXXXX 150
D+L DD + ++FG L+I+ + P F A+ + K N
Sbjct: 108 DSLGDD--------RCLLPDIVFGPALRIKCSEVMDKHPKLLFPVANKSKKIENERSGVV 159
Query: 151 XXXXXXXPMGKYPYGCANTHQVFTGCL------------------------AASEMELSE 186
+ P G N CL A S + E
Sbjct: 160 FEIGDNSSETE-PVGLRNRSFSANDCLRKTRVLSRSKLGQEGDFPGSGSDNAFSSEDDME 218
Query: 187 DYTRVTSHGPNPRTTHIFDNXXXXXXXXXXXXXXXXXXMENGCFPHHTSYYPSESFLSNC 246
DYT + +HGPNP+TTHI+ + S +PS++FL C
Sbjct: 219 DYTCIIAHGPNPKTTHIYGDRVLECHKNELKGDEDNKE-------KFGSVFPSDNFLGIC 271
Query: 247 FYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEE 282
+C K LG G DIYMYR E++FCS ECR + M+++E
Sbjct: 272 NFCNKKLGGGDDIYMYR-EKSFCSEECRSEEMMIDE 306
>AT1G22160.1 | Symbols: | Protein of unknown function (DUF581) |
chr1:7823238-7823774 FORWARD LENGTH=147
Length = 147
Score = 66.2 bits (160), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 39/60 (65%), Gaps = 1/60 (1%)
Query: 227 NGCFPHHTSYYPSESFLSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEEGMSK 286
G H+S Y SE FL +C CK+ L G+DIYMYRG+RAFCS ECR Q + ++E K
Sbjct: 64 RGTQRRHSSDY-SEDFLRSCSLCKRLLVHGRDIYMYRGDRAFCSLECRQQQITVDERKEK 122
>AT4G39795.1 | Symbols: | Protein of unknown function (DUF581) |
chr4:18466621-18467325 FORWARD LENGTH=126
Length = 126
Score = 61.2 bits (147), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 34/46 (73%)
Query: 241 SFLSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEEGMSK 286
SFL NC +CK+ L G+DIYMY+G+ AFCS ECR Q M +EG ++
Sbjct: 72 SFLVNCGFCKRGLAPGRDIYMYKGDAAFCSIECREQQMEHDEGKTR 117
>AT5G49120.1 | Symbols: | Protein of unknown function (DUF581) |
chr5:19908800-19909332 REVERSE LENGTH=150
Length = 150
Score = 60.5 bits (145), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 33/46 (71%)
Query: 237 YPSESFLSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEE 282
Y FL +CF C++ L KDIYMY+G+RAFCS ECR + M+++E
Sbjct: 63 YQDSGFLEHCFLCRRKLLPAKDIYMYKGDRAFCSVECRSKQMIMDE 108
>AT5G47060.1 | Symbols: | Protein of unknown function (DUF581) |
chr5:19116843-19117639 FORWARD LENGTH=177
Length = 177
Score = 56.2 bits (134), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 30/45 (66%)
Query: 242 FLSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEEGMSK 286
FL +CF CKK LG +DIYMYRG+ FCS ECR + + +E K
Sbjct: 97 FLDSCFLCKKPLGDNRDIYMYRGDTPFCSEECRQEQIERDEAKEK 141
>AT1G78020.1 | Symbols: | Protein of unknown function (DUF581) |
chr1:29338787-29339491 FORWARD LENGTH=162
Length = 162
Score = 56.2 bits (134), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 34/55 (61%)
Query: 232 HHTSYYPSESFLSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEEGMSK 286
H + + FL +C C++ L G+DIYMYRG++AFCS ECR + M +E K
Sbjct: 79 HSGDFSDAGHFLRSCALCERLLVPGRDIYMYRGDKAFCSSECRQEQMAQDERKEK 133
>AT4G17670.1 | Symbols: | Protein of unknown function (DUF581) |
chr4:9833948-9834663 REVERSE LENGTH=159
Length = 159
Score = 55.8 bits (133), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 32/48 (66%)
Query: 242 FLSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEEGMSKLEA 289
FL +CF CKK LG +DI+MYRG+ FCS ECR + + +E K ++
Sbjct: 76 FLDSCFLCKKRLGDNRDIFMYRGDTPFCSEECREEQIERDEAKEKKQS 123
>AT1G74940.1 | Symbols: | Protein of unknown function (DUF581) |
chr1:28146284-28147065 FORWARD LENGTH=222
Length = 222
Score = 55.1 bits (131), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 54/112 (48%), Gaps = 5/112 (4%)
Query: 173 FTGCLAASEMELS-EDYTRVTSHGPN-PRTTHIFDNXXXXXXXXXXXXXXXXXXMENGCF 230
++G E++LS E+YT VTS PN P + D+ ++
Sbjct: 81 YSGRFRCPEIDLSDEEYTYVTS--PNGPTKVYYNDDGFELSENDYRRVHKPMVTVDEPPV 138
Query: 231 PHHTSYYPSESFLSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEE 282
S FLS+C CKK L QGKDIYMY+GE FCS ECR ++ +E
Sbjct: 139 IERQSVRGPTEFLSSCCLCKKKL-QGKDIYMYKGEMGFCSAECRSVQIMNDE 189
>AT1G53903.1 | Symbols: | Protein of unknown function (DUF581) |
chr1:20132363-20132842 FORWARD LENGTH=126
Length = 126
Score = 55.1 bits (131), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 31/48 (64%), Gaps = 1/48 (2%)
Query: 242 FLSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEEGMSKLEA 289
FL C C K L Q KD+YMYRG+ FCS ECR ML+++ +LEA
Sbjct: 42 FLKTCHLCNKQLHQDKDVYMYRGDLGFCSRECRESQMLIDDR-KELEA 88
>AT1G53885.1 | Symbols: | Protein of unknown function (DUF581) |
chr1:20119798-20120277 FORWARD LENGTH=126
Length = 126
Score = 55.1 bits (131), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 31/48 (64%), Gaps = 1/48 (2%)
Query: 242 FLSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEEGMSKLEA 289
FL C C K L Q KD+YMYRG+ FCS ECR ML+++ +LEA
Sbjct: 42 FLKTCHLCNKQLHQDKDVYMYRGDLGFCSRECRESQMLIDDR-KELEA 88
>AT2G44670.1 | Symbols: | Protein of unknown function (DUF581) |
chr2:18425279-18425673 FORWARD LENGTH=93
Length = 93
Score = 52.0 bits (123), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 32/45 (71%)
Query: 242 FLSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEEGMSK 286
FL +C C+K+LG DI+MYRG++AFCS+ECR + + +E +
Sbjct: 16 FLESCSLCRKHLGLNSDIFMYRGDKAFCSNECREEQIESDEAKER 60
>AT5G65040.1 | Symbols: | Protein of unknown function (DUF581) |
chr5:25977864-25978350 REVERSE LENGTH=113
Length = 113
Score = 52.0 bits (123), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 30/47 (63%)
Query: 240 ESFLSNCFYCKKNLGQGKDIYMYRGERAFCSHECRYQGMLLEEGMSK 286
+ FL C C ++L +DIYMYRG AFCS ECR + + L+E +K
Sbjct: 55 DDFLKTCSLCNRSLCHHRDIYMYRGNNAFCSLECREKQIKLDEKKAK 101
>AT5G20700.1 | Symbols: | Protein of unknown function (DUF581) |
chr5:7006178-7007003 REVERSE LENGTH=248
Length = 248
Score = 51.6 bits (122), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 32/44 (72%), Gaps = 1/44 (2%)
Query: 231 PHHTSYYPSESFLSNCFYCKKNLGQGKDIYMYRGERAFCSHECR 274
P ++ + FL++C+ C+K L G+DI++YRGE+AFCS ECR
Sbjct: 171 PENSPEFQGLGFLNSCYLCRKKL-HGQDIFIYRGEKAFCSTECR 213
>AT1G19200.1 | Symbols: | Protein of unknown function (DUF581) |
chr1:6625104-6625856 REVERSE LENGTH=215
Length = 215
Score = 51.6 bits (122), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 22/33 (66%), Positives = 26/33 (78%), Gaps = 1/33 (3%)
Query: 242 FLSNCFYCKKNLGQGKDIYMYRGERAFCSHECR 274
FL++C CKK L QGKDIYMY+G+ FCS ECR
Sbjct: 150 FLTSCCLCKKKL-QGKDIYMYKGDEGFCSKECR 181