Miyakogusa Predicted Gene
- Lj1g3v4998300.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4998300.1 Non Chatacterized Hit- tr|I1JRQ3|I1JRQ3_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,66,0.00000000004,seg,NULL; coiled-coil,NULL;
Zein-binding,Zein-binding domain; FAMILY NOT NAMED,NULL,CUFF.33784.1
(272 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G08800.2 | Symbols: | Protein of unknown function, DUF593 | ... 162 2e-40
AT1G08800.1 | Symbols: | Protein of unknown function, DUF593 | ... 162 2e-40
AT2G30690.1 | Symbols: | Protein of unknown function, DUF593 | ... 122 3e-28
AT1G18990.1 | Symbols: | Protein of unknown function, DUF593 | ... 92 4e-19
AT5G16720.1 | Symbols: | Protein of unknown function, DUF593 | ... 86 4e-17
AT1G70750.1 | Symbols: | Protein of unknown function, DUF593 | ... 85 6e-17
AT1G74830.1 | Symbols: | Protein of unknown function, DUF593 | ... 75 7e-14
AT5G06560.1 | Symbols: | Protein of unknown function, DUF593 | ... 57 1e-08
AT4G13630.2 | Symbols: | Protein of unknown function, DUF593 | ... 56 3e-08
AT4G13630.1 | Symbols: | Protein of unknown function, DUF593 | ... 56 3e-08
AT3G54740.1 | Symbols: | Protein of unknown function, DUF593 | ... 55 6e-08
AT3G54740.2 | Symbols: | Protein of unknown function, DUF593 | ... 55 6e-08
>AT1G08800.2 | Symbols: | Protein of unknown function, DUF593 |
chr1:2813578-2817078 REVERSE LENGTH=1113
Length = 1113
Score = 162 bits (411), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 105/293 (35%), Positives = 150/293 (51%), Gaps = 64/293 (21%)
Query: 1 MQMLQKMISLERNESGLSLXXXXXXXXXXXXXXXRLRRQVDHDRKLLNALYKELEEERNX 60
MQ+L + LERNES LSL RL+RQVD+DRKLL LYKELEEER+
Sbjct: 858 MQLLLQKRMLERNESNLSLEGVSVTEIEGESEGDRLKRQVDYDRKLLTGLYKELEEERSA 917
Query: 61 XXXXXXXXXXMITRLQEEKATLHMEALQYLRVMDEESEYETEALQKANDLLAEKEKEIED 120
MITRLQEEKA+ MEALQ LR+M+E++EY+ EA+Q+ NDLL E+EK I+D
Sbjct: 918 SAVATNQAMAMITRLQEEKASFQMEALQNLRMMEEQAEYDMEAIQRLNDLLVEREKLIQD 977
Query: 121 LEAKLKFYRMRFPDESVSENIVETNSEMKLKDIGLDHSQFACIENDESVIGKSVTEIPNI 180
LEA+++++R + P + ++ E +EM G+ + +C+ +G
Sbjct: 978 LEAEIEYFRDQTPQKKNKLDVAEKVTEMDSPSEGMSNKIQSCL------VG--------- 1022
Query: 181 SDKSDILPTSLEENNVQSIKSSPLEFQDEKLYISQCLEKLEKQV---------------- 224
F +E+LYI+ CLEK+E +V
Sbjct: 1023 -------------------------FDEERLYITSCLEKIENRVNGKAHDDNLPAQESVS 1057
Query: 225 --------LEEDQSFFEHTVNLLSKGDKGLQLLQEIADHLHQLRRIGMREINQ 269
L+ D F E +N L G++G+Q ++EIA HL LR + M+ +Q
Sbjct: 1058 ELHERVERLKGDLYFLEQVMNSLGHGNEGVQFVKEIASHLQTLRSLSMKRQDQ 1110
>AT1G08800.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:2813578-2817078 REVERSE LENGTH=1113
Length = 1113
Score = 162 bits (411), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 105/293 (35%), Positives = 150/293 (51%), Gaps = 64/293 (21%)
Query: 1 MQMLQKMISLERNESGLSLXXXXXXXXXXXXXXXRLRRQVDHDRKLLNALYKELEEERNX 60
MQ+L + LERNES LSL RL+RQVD+DRKLL LYKELEEER+
Sbjct: 858 MQLLLQKRMLERNESNLSLEGVSVTEIEGESEGDRLKRQVDYDRKLLTGLYKELEEERSA 917
Query: 61 XXXXXXXXXXMITRLQEEKATLHMEALQYLRVMDEESEYETEALQKANDLLAEKEKEIED 120
MITRLQEEKA+ MEALQ LR+M+E++EY+ EA+Q+ NDLL E+EK I+D
Sbjct: 918 SAVATNQAMAMITRLQEEKASFQMEALQNLRMMEEQAEYDMEAIQRLNDLLVEREKLIQD 977
Query: 121 LEAKLKFYRMRFPDESVSENIVETNSEMKLKDIGLDHSQFACIENDESVIGKSVTEIPNI 180
LEA+++++R + P + ++ E +EM G+ + +C+ +G
Sbjct: 978 LEAEIEYFRDQTPQKKNKLDVAEKVTEMDSPSEGMSNKIQSCL------VG--------- 1022
Query: 181 SDKSDILPTSLEENNVQSIKSSPLEFQDEKLYISQCLEKLEKQV---------------- 224
F +E+LYI+ CLEK+E +V
Sbjct: 1023 -------------------------FDEERLYITSCLEKIENRVNGKAHDDNLPAQESVS 1057
Query: 225 --------LEEDQSFFEHTVNLLSKGDKGLQLLQEIADHLHQLRRIGMREINQ 269
L+ D F E +N L G++G+Q ++EIA HL LR + M+ +Q
Sbjct: 1058 ELHERVERLKGDLYFLEQVMNSLGHGNEGVQFVKEIASHLQTLRSLSMKRQDQ 1110
>AT2G30690.1 | Symbols: | Protein of unknown function, DUF593 |
chr2:13076229-13078595 FORWARD LENGTH=760
Length = 760
Score = 122 bits (305), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 58/100 (58%), Positives = 80/100 (80%)
Query: 36 LRRQVDHDRKLLNALYKELEEERNXXXXXXXXXXXMITRLQEEKATLHMEALQYLRVMDE 95
L++Q++H RK L L KE EEERN MITRLQEEKA LHMEALQYLR+MDE
Sbjct: 616 LKQQLEHGRKSLRDLNKEFEEERNASAIATNQAMAMITRLQEEKAALHMEALQYLRMMDE 675
Query: 96 ESEYETEALQKANDLLAEKEKEIEDLEAKLKFYRMRFPDE 135
++E++ +AL++AND+LA++EKEI+DLE +L++YR+++PDE
Sbjct: 676 QAEHDMDALERANDVLADREKEIQDLEMELEYYRVKYPDE 715
>AT1G18990.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:6558778-6560432 REVERSE LENGTH=524
Length = 524
Score = 92.0 bits (227), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/97 (52%), Positives = 68/97 (70%)
Query: 36 LRRQVDHDRKLLNALYKELEEERNXXXXXXXXXXXMITRLQEEKATLHMEALQYLRVMDE 95
L RQV DRK L LY EL+EER+ MITRLQ EKA + MEALQY R+MDE
Sbjct: 304 LNRQVRLDRKSLMDLYMELDEERSASAVAANNAMAMITRLQAEKAAVQMEALQYQRMMDE 363
Query: 96 ESEYETEALQKANDLLAEKEKEIEDLEAKLKFYRMRF 132
++EY+ EALQ N LL ++E+E+++LEA ++ YR+R+
Sbjct: 364 QAEYDQEALQSMNGLLVKREEEMKELEAGIEVYRLRY 400
>AT5G16720.1 | Symbols: | Protein of unknown function, DUF593 |
chr5:5494309-5496562 FORWARD LENGTH=675
Length = 675
Score = 85.5 bits (210), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/97 (49%), Positives = 65/97 (67%)
Query: 35 RLRRQVDHDRKLLNALYKELEEERNXXXXXXXXXXXMITRLQEEKATLHMEALQYLRVMD 94
RLR V +++ L LY ELEEER+ MITRLQEEKA + MEALQY R+M+
Sbjct: 359 RLRETVRAEQEALRDLYAELEEERSASAISANQTMAMITRLQEEKAKVQMEALQYQRMME 418
Query: 95 EESEYETEALQKANDLLAEKEKEIEDLEAKLKFYRMR 131
E++EY+ EALQ N L+ ++EKE E L+ +L+ YR +
Sbjct: 419 EQAEYDQEALQLLNHLMVKREKEKEQLQRELEVYRAK 455
>AT1G70750.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:26680728-26683147 REVERSE LENGTH=749
Length = 749
Score = 84.7 bits (208), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 66/97 (68%)
Query: 35 RLRRQVDHDRKLLNALYKELEEERNXXXXXXXXXXXMITRLQEEKATLHMEALQYLRVMD 94
+L+ ++ +RK L+ALY+ELE ERN MI RL EEKA + MEALQY R+M+
Sbjct: 415 KLKFELQEERKALHALYEELEVERNASAVAASETMAMINRLHEEKAAMQMEALQYQRMME 474
Query: 95 EESEYETEALQKANDLLAEKEKEIEDLEAKLKFYRMR 131
E++E++ EALQ N+L+ +EKE +LE +L+ YR R
Sbjct: 475 EQAEFDQEALQLLNELMVNREKENAELEKELEVYRKR 511
>AT1G74830.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:28113469-28115193 REVERSE LENGTH=542
Length = 542
Score = 74.7 bits (182), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 69/98 (70%)
Query: 35 RLRRQVDHDRKLLNALYKELEEERNXXXXXXXXXXXMITRLQEEKATLHMEALQYLRVMD 94
+L+++V D+K L LY EL+EER+ MITRLQ EKA + MEALQY R+MD
Sbjct: 304 QLKKEVRLDKKSLIDLYMELDEERSASAVAANEAMAMITRLQAEKAAVQMEALQYQRMMD 363
Query: 95 EESEYETEALQKANDLLAEKEKEIEDLEAKLKFYRMRF 132
E++EY+ EALQ + LA++E+E+++LEA+ + YR ++
Sbjct: 364 EQAEYDQEALQSMSSELAKREEEMKELEAEFEVYREKY 401
>AT5G06560.1 | Symbols: | Protein of unknown function, DUF593 |
chr5:2003678-2005543 REVERSE LENGTH=518
Length = 518
Score = 57.0 bits (136), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 72/146 (49%), Gaps = 2/146 (1%)
Query: 36 LRRQVDHDRKLLNALYKELEEERNXXXXXXXXXXXMITRLQEEKATLHMEALQYLRVMDE 95
LR V ++ + LY+EL+EERN MI RLQ +KA L ME Q+ R +E
Sbjct: 74 LRETVSSQQQSIQDLYEELDEERNAASTAASEAMSMILRLQRDKAELQMELRQFKRFAEE 133
Query: 96 ESEYETEALQKANDLLAEKEKEIEDLEAKLKFYRMRFPDESVSENIVETNSEMKLKDIGL 155
+ E++ + L DL+ ++E+ I+ L + + Y+ R +E VET M ++ +
Sbjct: 134 KMEHDQQELLDLEDLIYKREQTIQALTFEAQAYKHRMMSFGFTEAEVETEKNMLSRNPSM 193
Query: 156 --DHSQFACIENDESVIGKSVTEIPN 179
+ Q+ +D I +V E P
Sbjct: 194 IENDYQYDLPTSDYPPIKCNVNENPG 219
>AT4G13630.2 | Symbols: | Protein of unknown function, DUF593 |
chr4:7934093-7936029 REVERSE LENGTH=569
Length = 569
Score = 56.2 bits (134), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 52/90 (57%)
Query: 43 DRKLLNALYKELEEERNXXXXXXXXXXXMITRLQEEKATLHMEALQYLRVMDEESEYETE 102
+R +L ELE+ERN MI RLQEEKA++ MEA QY R+++E+S ++ E
Sbjct: 232 ERAARASLALELEKERNAAASAADEALGMILRLQEEKASIEMEARQYQRMIEEKSAFDAE 291
Query: 103 ALQKANDLLAEKEKEIEDLEAKLKFYRMRF 132
+ ++L +E+E LE ++ YR F
Sbjct: 292 EMSILKEILLRREREKHFLEKEVDTYRQMF 321
>AT4G13630.1 | Symbols: | Protein of unknown function, DUF593 |
chr4:7934093-7936029 REVERSE LENGTH=569
Length = 569
Score = 56.2 bits (134), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 52/90 (57%)
Query: 43 DRKLLNALYKELEEERNXXXXXXXXXXXMITRLQEEKATLHMEALQYLRVMDEESEYETE 102
+R +L ELE+ERN MI RLQEEKA++ MEA QY R+++E+S ++ E
Sbjct: 232 ERAARASLALELEKERNAAASAADEALGMILRLQEEKASIEMEARQYQRMIEEKSAFDAE 291
Query: 103 ALQKANDLLAEKEKEIEDLEAKLKFYRMRF 132
+ ++L +E+E LE ++ YR F
Sbjct: 292 EMSILKEILLRREREKHFLEKEVDTYRQMF 321
>AT3G54740.1 | Symbols: | Protein of unknown function, DUF593 |
chr3:20263093-20264466 FORWARD LENGTH=390
Length = 390
Score = 54.7 bits (130), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 59/131 (45%), Gaps = 14/131 (10%)
Query: 44 RKLLNALYKELEEERNXXXXXXXXXXXMITRLQEEKATLHMEALQYLRVMDEESEYETEA 103
R+ + L+ ELEEERN MI RLQ EKA + MEA Q+ E+ ++ E
Sbjct: 34 RETVKDLHLELEEERNAAASAANETMSMILRLQREKAEIQMEARQFKMFAQEKMTHDQEK 93
Query: 104 LQKANDLLAEKEKEIEDLEAKLKFYRMRFPDESVSENIVETNSEMKLKDIGLDHSQFACI 163
L +LL EKE+ IE L +++ Y+ R VSE + H Q
Sbjct: 94 LSVLENLLYEKEQAIEALTYEVEAYKHRLLSYGVSEAEI--------------HDQILGF 139
Query: 164 ENDESVIGKSV 174
D S +G V
Sbjct: 140 GRDSSTVGFDV 150
>AT3G54740.2 | Symbols: | Protein of unknown function, DUF593 |
chr3:20262949-20264466 FORWARD LENGTH=438
Length = 438
Score = 54.7 bits (130), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 51/96 (53%)
Query: 44 RKLLNALYKELEEERNXXXXXXXXXXXMITRLQEEKATLHMEALQYLRVMDEESEYETEA 103
R+ + L+ ELEEERN MI RLQ EKA + MEA Q+ E+ ++ E
Sbjct: 82 RETVKDLHLELEEERNAAASAANETMSMILRLQREKAEIQMEARQFKMFAQEKMTHDQEK 141
Query: 104 LQKANDLLAEKEKEIEDLEAKLKFYRMRFPDESVSE 139
L +LL EKE+ IE L +++ Y+ R VSE
Sbjct: 142 LSVLENLLYEKEQAIEALTYEVEAYKHRLLSYGVSE 177