Miyakogusa Predicted Gene
- Lj0g3v0195199.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0195199.1 Non Chatacterized Hit- tr|K4A5P0|K4A5P0_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si034194,41.03,2e-18,seg,NULL; FAMILY NOT NAMED,NULL,CUFF.12340.1
(520 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G08800.2 | Symbols: | Protein of unknown function, DUF593 | ... 180 2e-45
AT1G08800.1 | Symbols: | Protein of unknown function, DUF593 | ... 180 2e-45
AT2G30690.1 | Symbols: | Protein of unknown function, DUF593 | ... 117 1e-26
AT1G74830.1 | Symbols: | Protein of unknown function, DUF593 | ... 96 8e-20
AT1G18990.1 | Symbols: | Protein of unknown function, DUF593 | ... 95 1e-19
AT5G16720.1 | Symbols: | Protein of unknown function, DUF593 | ... 65 1e-10
AT1G70750.1 | Symbols: | Protein of unknown function, DUF593 | ... 55 2e-07
>AT1G08800.2 | Symbols: | Protein of unknown function, DUF593 |
chr1:2813578-2817078 REVERSE LENGTH=1113
Length = 1113
Score = 180 bits (457), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 140/375 (37%), Positives = 192/375 (51%), Gaps = 68/375 (18%)
Query: 17 SRRVTKALASAXXXXXXXXXXXXDAVLSYVITKFAGYCKLQTPCLFCSRLDHVLGKEK-- 74
SR T+ALA A +++ SYVI +FA Y +LQ+PCL CS LDH+L + K
Sbjct: 3 SRSFTRALALAFNEWLLMFMLFVNSIFSYVIARFADYSELQSPCLMCSNLDHILRRTKDL 62
Query: 75 REYYRDLICSGHKSEISSLALCPAHDKLVSLQGMCETCFLSYATSNKSNAEAYRLLTGKL 134
++ + D+ICS HKSEISSL C AH KLV ++GMCETC S+AT+NKSNAE YRLL GKL
Sbjct: 63 KKTHWDIICSKHKSEISSLVYCHAHGKLVDVRGMCETCLFSFATTNKSNAETYRLLVGKL 122
Query: 135 GEES--GSRFDH--VPLLGEHTIAKRCSCCGEQWVLNSYDQKVVFNKSIGSGPADFDASD 190
GE+S GS+ D P + T C+CC + W + +V A+ +
Sbjct: 123 GEDSHFGSKSDRSKYPNCSKLT---DCTCCNQLWTPQTAATQV----------AEREILP 169
Query: 191 SVGNSFHEKRSSKPFVSVRDAPLRNNHVDPLSRVGYTELNITSDTESESEVPLSDD---- 246
+G + R+ K + AP ++ + L VGYTEL I SDTESE+ SD
Sbjct: 170 KIG-LLGKIRTGK-----QSAPKKSVSFNHLPDVGYTELKIHSDTESEA--VFSDTEPKQ 221
Query: 247 ---------------------DKRCSKPFVSVKDAPLRNNHVNPLSRVGYTELKITSDTE 285
D R KP P ++ N L VGYTELKI SDTE
Sbjct: 222 ESSLNHLPPVGYNEPKIGLVGDVRTGKP-----STPKKSVSFNHLPDVGYTELKIHSDTE 276
Query: 286 SESEVPLSDDDGISIPIHGTDHTKEDIEVPCSHTEPSPIDLHEDLASGKSKTSASVLEPL 345
SE+ S+D+ + + DH + +++ T P I L DLA+ K LEP
Sbjct: 277 SEA--VFSEDE--CVVLKDEDHKYQIVDL---QTHPI-ITLPYDLATDKLLNFDFPLEPF 328
Query: 346 LSKS---RTQLENTD 357
++++ QL+ T+
Sbjct: 329 VTRNDREEVQLQETN 343
Score = 56.6 bits (135), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 28/53 (52%), Positives = 35/53 (66%), Gaps = 2/53 (3%)
Query: 470 LLDLGDAYKLAVSNRGRQLSGMLAEV--WTGKDSTRVSEDLKTLLSQFSATRG 520
LLDL DAY + V N G S ++ W KD++RVSEDLK LL+Q SA+RG
Sbjct: 777 LLDLADAYNIVVGNEGHYDSNGRQQIENWMKKDTSRVSEDLKALLTQISASRG 829
>AT1G08800.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:2813578-2817078 REVERSE LENGTH=1113
Length = 1113
Score = 180 bits (457), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 140/375 (37%), Positives = 192/375 (51%), Gaps = 68/375 (18%)
Query: 17 SRRVTKALASAXXXXXXXXXXXXDAVLSYVITKFAGYCKLQTPCLFCSRLDHVLGKEK-- 74
SR T+ALA A +++ SYVI +FA Y +LQ+PCL CS LDH+L + K
Sbjct: 3 SRSFTRALALAFNEWLLMFMLFVNSIFSYVIARFADYSELQSPCLMCSNLDHILRRTKDL 62
Query: 75 REYYRDLICSGHKSEISSLALCPAHDKLVSLQGMCETCFLSYATSNKSNAEAYRLLTGKL 134
++ + D+ICS HKSEISSL C AH KLV ++GMCETC S+AT+NKSNAE YRLL GKL
Sbjct: 63 KKTHWDIICSKHKSEISSLVYCHAHGKLVDVRGMCETCLFSFATTNKSNAETYRLLVGKL 122
Query: 135 GEES--GSRFDH--VPLLGEHTIAKRCSCCGEQWVLNSYDQKVVFNKSIGSGPADFDASD 190
GE+S GS+ D P + T C+CC + W + +V A+ +
Sbjct: 123 GEDSHFGSKSDRSKYPNCSKLT---DCTCCNQLWTPQTAATQV----------AEREILP 169
Query: 191 SVGNSFHEKRSSKPFVSVRDAPLRNNHVDPLSRVGYTELNITSDTESESEVPLSDD---- 246
+G + R+ K + AP ++ + L VGYTEL I SDTESE+ SD
Sbjct: 170 KIG-LLGKIRTGK-----QSAPKKSVSFNHLPDVGYTELKIHSDTESEA--VFSDTEPKQ 221
Query: 247 ---------------------DKRCSKPFVSVKDAPLRNNHVNPLSRVGYTELKITSDTE 285
D R KP P ++ N L VGYTELKI SDTE
Sbjct: 222 ESSLNHLPPVGYNEPKIGLVGDVRTGKP-----STPKKSVSFNHLPDVGYTELKIHSDTE 276
Query: 286 SESEVPLSDDDGISIPIHGTDHTKEDIEVPCSHTEPSPIDLHEDLASGKSKTSASVLEPL 345
SE+ S+D+ + + DH + +++ T P I L DLA+ K LEP
Sbjct: 277 SEA--VFSEDE--CVVLKDEDHKYQIVDL---QTHPI-ITLPYDLATDKLLNFDFPLEPF 328
Query: 346 LSKS---RTQLENTD 357
++++ QL+ T+
Sbjct: 329 VTRNDREEVQLQETN 343
Score = 56.6 bits (135), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 28/53 (52%), Positives = 35/53 (66%), Gaps = 2/53 (3%)
Query: 470 LLDLGDAYKLAVSNRGRQLSGMLAEV--WTGKDSTRVSEDLKTLLSQFSATRG 520
LLDL DAY + V N G S ++ W KD++RVSEDLK LL+Q SA+RG
Sbjct: 777 LLDLADAYNIVVGNEGHYDSNGRQQIENWMKKDTSRVSEDLKALLTQISASRG 829
>AT2G30690.1 | Symbols: | Protein of unknown function, DUF593 |
chr2:13076229-13078595 FORWARD LENGTH=760
Length = 760
Score = 117 bits (294), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 82/220 (37%), Positives = 112/220 (50%), Gaps = 32/220 (14%)
Query: 40 DAVLSYVITKFAGYCKLQTPCLFCSRLDHVLGKEKREYYRDLICSGHKSEISSLALCPAH 99
DA+LSY++ FA YC+LQ PC CS+L H L ++R L+C H+SE+SS C H
Sbjct: 35 DALLSYLLVWFARYCRLQMPCFLCSKLLHPL------HWRFLLCRNHRSEVSSYMSCQNH 88
Query: 100 -DKLVSLQGMCETCFLSYATSNKSNAEAYRLLTGKLGEE--SGSRFDHVPLLGEHTIAKR 156
+ L +GMC+ C LS+ N + RLL GKLG + S S F H +
Sbjct: 89 GNNLADCRGMCDDCLLSFTKMTGPNPDMNRLLLGKLGYDLLSRSHFAH---------PRS 139
Query: 157 CSCCGEQWVLNSYDQKVVFNKSIGSGPADFDASDSVGNSFHEKR--SSKPFVSVRD--AP 212
CSCC + W + Q+++ +GS + + ++ H R S +RD A
Sbjct: 140 CSCCNKPWRTRHHTQRLI---RLGSRGRNSSSKPNIPAPRHLTRRGSGGSLKKMRDHIAT 196
Query: 213 LRNNHVDPLSR-----VGYTELNITSDTESESEVPLSDDD 247
+ +VD SR VGYTEL I SD SESE SDDD
Sbjct: 197 SGSEYVDVGSRDGMAHVGYTELKIHSD--SESEFLFSDDD 234
>AT1G74830.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:28113469-28115193 REVERSE LENGTH=542
Length = 542
Score = 95.5 bits (236), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 77/149 (51%), Gaps = 26/149 (17%)
Query: 40 DAVLSYVITKFAGYCKLQTPCLFCSRLDHVL-GKEKREYYRDLICSGHKSEISSLALCPA 98
D V++++ +FA + L PCL C+R+DH+L ++ + YY + IC HK ++SSLA C
Sbjct: 56 DGVIAFLSNQFAKFFDLNIPCLLCTRIDHILVPRDPQFYYNESICDSHKKKVSSLAYCHV 115
Query: 99 HDKLVSLQGMCETCFLSYATSNKSNAEAYRLLTGKLGEESGSRFD---HVPL-------- 147
H KL ++ MCE C LS+AT S+ + Y+ L G L ++ D +PL
Sbjct: 116 HKKLSEIKHMCEGCLLSFATEKDSDCDTYKSLIGILHKDLELLIDDERDLPLAFKKDDNL 175
Query: 148 --------------LGEHTIAKRCSCCGE 162
+ ++ + CSCCGE
Sbjct: 176 VQTTKNLVDYKTNNIKNDSLKQHCSCCGE 204
>AT1G18990.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:6558778-6560432 REVERSE LENGTH=524
Length = 524
Score = 95.1 bits (235), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 83/172 (48%), Gaps = 28/172 (16%)
Query: 40 DAVLSYVITKFAGYCKLQTPCLFCSRLDHVL-GKEKREYYRDLICSGHKSEISSLALCPA 98
D L++ + A + L+ PCL C+RLDHVL + YY D IC HK +SSLA C
Sbjct: 37 DGFLAFFSNQIAKFFDLKIPCLLCTRLDHVLVSRNPDFYYNDSICDAHKKNVSSLAYCHV 96
Query: 99 HDKLVSLQGMCETCFLSYATSNKSNAEAYRLLTGKLGEE-----------------SGSR 141
H KL ++ MCE C LS+AT +++ + Y+ L G L ++ +GS+
Sbjct: 97 HKKLSEIKRMCEGCLLSFATEKETDVDTYKSLIGILHKDLELLIDDERELQLAFPVAGSK 156
Query: 142 FDHVPLLGEHTI---------AKRCSCCGEQWVLNSYDQKVVFNKSIGSGPA 184
D E+ +RCSCCG+ L S D+ N+S P+
Sbjct: 157 KDENFYQVENRTNNSNDRFQRQQRCSCCGQIMKLKS-DKPKSNNQSFFGAPS 207
>AT5G16720.1 | Symbols: | Protein of unknown function, DUF593 |
chr5:5494309-5496562 FORWARD LENGTH=675
Length = 675
Score = 65.1 bits (157), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 2/98 (2%)
Query: 17 SRRVTKALASAXXXXXXXXXXXXDAVLSYVITKFAGYCKLQTPCLFCSRLDHVLGK--EK 74
+ R+T L A ++ +Y I KFA + L+ CL C +LD + + E
Sbjct: 14 TNRITVILVYAFLEWLLMFFIFLNSFFTYFIVKFASFFGLKQVCLLCPKLDRIFERKPEN 73
Query: 75 REYYRDLICSGHKSEISSLALCPAHDKLVSLQGMCETC 112
R Y++L+C H +E++SL+ C H KL +C C
Sbjct: 74 RFTYKELLCQNHIAELASLSFCRTHGKLSESANLCSDC 111
>AT1G70750.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:26680728-26683147 REVERSE LENGTH=749
Length = 749
Score = 54.7 bits (130), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
Query: 14 HSLSRRVTKALASAXXXXXXXXXXXXDAVLSYVITKFAGYCKLQTPCLFCSRLDHVL-GK 72
H + R+T L A +++ SY I +FA Y L+ PCLFCSRLD
Sbjct: 11 HRKTNRITLILVYAFLEWSLIFFILLNSLFSYFILRFADYFGLKRPCLFCSRLDRFFDAS 70
Query: 73 EKREYYRDLICSGHKSEISS 92
K +RDL+C H ++ S
Sbjct: 71 GKSPSHRDLLCDDHALQLHS 90