Miyakogusa Predicted Gene
- Lj4g3v1598220.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v1598220.1 tr|Q9FRQ9|Q9FRQ9_ARATH F22O13.29 OS=Arabidopsis
thaliana PE=4 SV=1,32.41,2e-18,FAMILY NOT NAMED,NULL;
coiled-coil,NULL; seg,NULL; Zein-binding,Zein-binding
domain,gene.g55291.t1.1
(482 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G74830.1 | Symbols: | Protein of unknown function, DUF593 | ... 332 4e-91
AT1G18990.1 | Symbols: | Protein of unknown function, DUF593 | ... 316 3e-86
AT1G08800.2 | Symbols: | Protein of unknown function, DUF593 | ... 99 7e-21
AT1G08800.1 | Symbols: | Protein of unknown function, DUF593 | ... 99 7e-21
AT5G16720.1 | Symbols: | Protein of unknown function, DUF593 | ... 85 1e-16
AT1G70750.1 | Symbols: | Protein of unknown function, DUF593 | ... 83 4e-16
AT2G30690.1 | Symbols: | Protein of unknown function, DUF593 | ... 80 3e-15
AT5G06560.1 | Symbols: | Protein of unknown function, DUF593 | ... 55 1e-07
AT4G13630.2 | Symbols: | Protein of unknown function, DUF593 | ... 50 4e-06
AT4G13630.1 | Symbols: | Protein of unknown function, DUF593 | ... 50 4e-06
>AT1G74830.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:28113469-28115193 REVERSE LENGTH=542
Length = 542
Score = 332 bits (851), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 199/518 (38%), Positives = 274/518 (52%), Gaps = 91/518 (17%)
Query: 1 MATGSLIRFVEHQLGRIPHFLLCMVLEWVLIIVLFLDGFLAFFAIELAKYFELEIPCWLC 60
M+ S +FVE +LG +PHFL+ VLEW LI+ LF+DG +AF + + AK+F+L IPC LC
Sbjct: 20 MSKRSFKKFVEQELGSLPHFLIYTVLEWSLIVFLFIDGVIAFLSNQFAKFFDLNIPCLLC 79
Query: 61 ARMNHVLACKTPNFYYNNSICEAHKKDVSSLAFCHNHKKLSEISKLCDACLLSFATGKES 120
R++H+L + P FYYN SIC++HKK VSSLA+CH HKKLSEI +C+ CLLSFAT K+S
Sbjct: 80 TRIDHILVPRDPQFYYNESICDSHKKKVSSLAYCHVHKKLSEIKHMCEGCLLSFATEKDS 139
Query: 121 DCDTYKSLVGVLNKNLECSVDNGQNVQLSL-KDDGVLQAEKNSA-------------QRC 166
DCDTYKSL+G+L+K+LE +D+ +++ L+ KDD ++Q KN Q C
Sbjct: 140 DCDTYKSLIGILHKDLELLIDDERDLPLAFKKDDNLVQTTKNLVDYKTNNIKNDSLKQHC 199
Query: 167 SCCGAPLKIK---------AFLSRAPTASPRAYPCLTPKHEESRSLECPLLRYSKLKSMS 217
SCCG LKIK +FL AP SPR
Sbjct: 200 SCCGELLKIKSEKLPKNNNSFL--APAPSPR----------------------------- 228
Query: 218 QIDLELPQNKEGEKEVNQNNTLREEDKGASLPLLTQAEDSNIESPKTPLPWSNRFFGIPP 277
+ NK E E ++ +D +++ + + N+FFGIP
Sbjct: 229 -----VSHNKLSENE-------------------SEFKDMDVDRTPSFVRGGNKFFGIPL 264
Query: 278 ADSPNSSPRWXXXXXXXXXXXXT--GVDEADD-----VNNLKRQVHLDRKSLMALYMELD 330
+DS +SPRW T D D +N LK++V LD+KSL+ LYMELD
Sbjct: 265 SDSAQNSPRWSVRSLKKSVLNKTENASDTTDPTGESILNQLKKEVRLDKKSLIDLYMELD 324
Query: 331 EERXXXXXXXXXXXXXITRLQEEKAALQMDALQYQRMMEEQIEYDEEALQEYNDMIXXXX 390
EER ITRLQ EKAA+QM+ALQYQRMM+EQ EYD+EALQ + +
Sbjct: 325 EERSASAVAANEAMAMITRLQAEKAAVQMEALQYQRMMDEQAEYDQEALQSMSSELAKRE 384
Query: 391 XXXXXXXXXXGIYREKYGCLT--EDDFNEAHSSHGG----NEDNEEKDLSLHQAVSSKAE 444
+YREKYGCLT ED E H +G ++ E K +S SS +
Sbjct: 385 EEMKELEAEFEVYREKYGCLTDQEDAREEFHKQNGNASAYDDCQETKPVSDLAVSSSNQQ 444
Query: 445 NGGGKLNESLKDFRMEKTYVLGRMKKSESRTLLAESGI 482
G ++++ + R E++ + E + ++ GI
Sbjct: 445 ENGENIDQNGQSKRSEESTAENVVSADEEKGSESKEGI 482
>AT1G18990.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:6558778-6560432 REVERSE LENGTH=524
Length = 524
Score = 316 bits (809), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 187/436 (42%), Positives = 260/436 (59%), Gaps = 53/436 (12%)
Query: 1 MATGSLIRFVEHQLGRIPHFLLCMVLEWVLIIVLFLDGFLAFFAIELAKYFELEIPCWLC 60
M+ S F+E +LG P FL+ +LEW+LII+LF+DGFLAFF+ ++AK+F+L+IPC LC
Sbjct: 1 MSNRSFKNFIEEELGTFPQFLIYALLEWILIIILFIDGFLAFFSNQIAKFFDLKIPCLLC 60
Query: 61 ARMNHVLACKTPNFYYNNSICEAHKKDVSSLAFCHNHKKLSEISKLCDACLLSFATGKES 120
R++HVL + P+FYYN+SIC+AHKK+VSSLA+CH HKKLSEI ++C+ CLLSFAT KE+
Sbjct: 61 TRLDHVLVSRNPDFYYNDSICDAHKKNVSSLAYCHVHKKLSEIKRMCEGCLLSFATEKET 120
Query: 121 DCDTYKSLVGVLNKNLECSVDNGQNVQLSL------KDDGVLQAEKNS---------AQR 165
D DTYKSL+G+L+K+LE +D+ + +QL+ KD+ Q E + QR
Sbjct: 121 DVDTYKSLIGILHKDLELLIDDERELQLAFPVAGSKKDENFYQVENRTNNSNDRFQRQQR 180
Query: 166 CSCCGAPLKIKAFLSRAPTASPRAYPCLTPKHE-ESRSLECPLLRYSKLKSMSQIDLELP 224
CSCCG +K+K+ ++ S P +P+ R+L+ ++Y+ +LP
Sbjct: 181 CSCCGQIMKLKSDKPKSNNQSFFGAPSPSPRVSFNQRTLDLSNIKYT----------DLP 230
Query: 225 QNKEGEKEVNQNNTLREEDKGASLPLLTQAEDSNIESPKTP--LPWSNRFFGIPPADSPN 282
++ + NT KGASL + +TP + N+FFGIP +DS
Sbjct: 231 EDDDAL------NT-----KGASLDAV---------DDRTPSFVKGGNKFFGIPLSDSAQ 270
Query: 283 SSPRWXXXXXXXXXXXXTGVD----EADDV-NNLKRQVHLDRKSLMALYMELDEERXXXX 337
+SPRW G++ + D + +L RQV LDRKSLM LYMELDEER
Sbjct: 271 NSPRWSVRSMKKSFVDQNGLESEVLDGDSILQHLNRQVRLDRKSLMDLYMELDEERSASA 330
Query: 338 XXXXXXXXXITRLQEEKAALQMDALQYQRMMEEQIEYDEEALQEYNDMIXXXXXXXXXXX 397
ITRLQ EKAA+QM+ALQYQRMM+EQ EYD+EALQ N ++
Sbjct: 331 VAANNAMAMITRLQAEKAAVQMEALQYQRMMDEQAEYDQEALQSMNGLLVKREEEMKELE 390
Query: 398 XXXGIYREKYGCLTED 413
+YR +YG L E+
Sbjct: 391 AGIEVYRLRYGLLREE 406
>AT1G08800.2 | Symbols: | Protein of unknown function, DUF593 |
chr1:2813578-2817078 REVERSE LENGTH=1113
Length = 1113
Score = 99.0 bits (245), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 77/145 (53%), Gaps = 11/145 (7%)
Query: 27 EWVLIIVLFLDGFLAFFAIELAKYFELEIPCWLCARMNHVLA-CKTPNFYYNNSICEAHK 85
EW+L+ +LF++ ++ A Y EL+ PC +C+ ++H+L K + + IC HK
Sbjct: 16 EWLLMFMLFVNSIFSYVIARFADYSELQSPCLMCSNLDHILRRTKDLKKTHWDIICSKHK 75
Query: 86 KDVSSLAFCHNHKKLSEISKLCDACLLSFATGKESDCDTYKSLVGVLNKNLECSVDNGQN 145
++SSL +CH H KL ++ +C+ CL SFAT +S+ +TY+ LVG L G++
Sbjct: 76 SEISSLVYCHAHGKLVDVRGMCETCLFSFATTNKSNAETYRLLVGKL----------GED 125
Query: 146 VQLSLKDDGVLQAEKNSAQRCSCCG 170
K D + C+CC
Sbjct: 126 SHFGSKSDRSKYPNCSKLTDCTCCN 150
Score = 84.7 bits (208), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/78 (56%), Positives = 51/78 (65%)
Query: 309 NNLKRQVHLDRKSLMALYMELDEERXXXXXXXXXXXXXITRLQEEKAALQMDALQYQRMM 368
+ LKRQV DRK L LY EL+EER ITRLQEEKA+ QM+ALQ RMM
Sbjct: 891 DRLKRQVDYDRKLLTGLYKELEEERSASAVATNQAMAMITRLQEEKASFQMEALQNLRMM 950
Query: 369 EEQIEYDEEALQEYNDMI 386
EEQ EYD EA+Q ND++
Sbjct: 951 EEQAEYDMEAIQRLNDLL 968
>AT1G08800.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:2813578-2817078 REVERSE LENGTH=1113
Length = 1113
Score = 99.0 bits (245), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 77/145 (53%), Gaps = 11/145 (7%)
Query: 27 EWVLIIVLFLDGFLAFFAIELAKYFELEIPCWLCARMNHVLA-CKTPNFYYNNSICEAHK 85
EW+L+ +LF++ ++ A Y EL+ PC +C+ ++H+L K + + IC HK
Sbjct: 16 EWLLMFMLFVNSIFSYVIARFADYSELQSPCLMCSNLDHILRRTKDLKKTHWDIICSKHK 75
Query: 86 KDVSSLAFCHNHKKLSEISKLCDACLLSFATGKESDCDTYKSLVGVLNKNLECSVDNGQN 145
++SSL +CH H KL ++ +C+ CL SFAT +S+ +TY+ LVG L G++
Sbjct: 76 SEISSLVYCHAHGKLVDVRGMCETCLFSFATTNKSNAETYRLLVGKL----------GED 125
Query: 146 VQLSLKDDGVLQAEKNSAQRCSCCG 170
K D + C+CC
Sbjct: 126 SHFGSKSDRSKYPNCSKLTDCTCCN 150
Score = 84.7 bits (208), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/78 (56%), Positives = 51/78 (65%)
Query: 309 NNLKRQVHLDRKSLMALYMELDEERXXXXXXXXXXXXXITRLQEEKAALQMDALQYQRMM 368
+ LKRQV DRK L LY EL+EER ITRLQEEKA+ QM+ALQ RMM
Sbjct: 891 DRLKRQVDYDRKLLTGLYKELEEERSASAVATNQAMAMITRLQEEKASFQMEALQNLRMM 950
Query: 369 EEQIEYDEEALQEYNDMI 386
EEQ EYD EA+Q ND++
Sbjct: 951 EEQAEYDMEAIQRLNDLL 968
>AT5G16720.1 | Symbols: | Protein of unknown function, DUF593 |
chr5:5494309-5496562 FORWARD LENGTH=675
Length = 675
Score = 84.7 bits (208), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 60/111 (54%), Gaps = 1/111 (0%)
Query: 1 MATGSLIRFVEHQLGRIPHFLLCMVLEWVLIIVLFLDGFLAFFAIELAKYFELEIPCWLC 60
MA + + RI L+ LEW+L+ +FL+ F +F ++ A +F L+ C LC
Sbjct: 1 MAANNFATKLSRNTNRITVILVYAFLEWLLMFFIFLNSFFTYFIVKFASFFGLKQVCLLC 60
Query: 61 ARMNHVLACKTPN-FYYNNSICEAHKKDVSSLAFCHNHKKLSEISKLCDAC 110
+++ + K N F Y +C+ H +++SL+FC H KLSE + LC C
Sbjct: 61 PKLDRIFERKPENRFTYKELLCQNHIAELASLSFCRTHGKLSESANLCSDC 111
Score = 80.1 bits (196), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/106 (42%), Positives = 58/106 (54%)
Query: 301 GVDEADDVNNLKRQVHLDRKSLMALYMELDEERXXXXXXXXXXXXXITRLQEEKAALQMD 360
G D + L+ V ++++L LY EL+EER ITRLQEEKA +QM+
Sbjct: 350 GGDPLRTIERLRETVRAEQEALRDLYAELEEERSASAISANQTMAMITRLQEEKAKVQME 409
Query: 361 ALQYQRMMEEQIEYDEEALQEYNDMIXXXXXXXXXXXXXXGIYREK 406
ALQYQRMMEEQ EYD+EALQ N ++ +YR K
Sbjct: 410 ALQYQRMMEEQAEYDQEALQLLNHLMVKREKEKEQLQRELEVYRAK 455
>AT1G70750.1 | Symbols: | Protein of unknown function, DUF593 |
chr1:26680728-26683147 REVERSE LENGTH=749
Length = 749
Score = 83.2 bits (204), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 96/186 (51%), Gaps = 24/186 (12%)
Query: 308 VNNLKRQVHLDRKSLMALYMELDEERXXXXXXXXXXXXXITRLQEEKAALQMDALQYQRM 367
V+ LK ++ +RK+L ALY EL+ ER I RL EEKAA+QM+ALQYQRM
Sbjct: 413 VDKLKFELQEERKALHALYEELEVERNASAVAASETMAMINRLHEEKAAMQMEALQYQRM 472
Query: 368 MEEQIEYDEEALQEYNDMIXXXXXXXXXXXXXXGIYR---------EKYGCLT---EDDF 415
MEEQ E+D+EALQ N+++ +YR EK G L D
Sbjct: 473 MEEQAEFDQEALQLLNELMVNREKENAELEKELEVYRKRMEEYEAKEKMGMLRRRLRDSS 532
Query: 416 NEAHSSHGGNEDNEEKDLSLH--QAVS------SKAENGGG----KLNESLKDFRMEKTY 463
+++ ++G +++N +L + V+ ++ EN +L+E L D+ E+
Sbjct: 533 VDSYRNNGDSDENSNGELQFKNVEGVTDWKYRENEMENTPVDVVLRLDECLDDYDGERLS 592
Query: 464 VLGRMK 469
+LGR+K
Sbjct: 593 ILGRLK 598
Score = 55.5 bits (132), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 22/84 (26%), Positives = 39/84 (46%)
Query: 1 MATGSLIRFVEHQLGRIPHFLLCMVLEWVLIIVLFLDGFLAFFAIELAKYFELEIPCWLC 60
MA + + RI L+ LEW LI + L+ ++F + A YF L+ PC C
Sbjct: 1 MAANKFATLIHRKTNRITLILVYAFLEWSLIFFILLNSLFSYFILRFADYFGLKRPCLFC 60
Query: 61 ARMNHVLACKTPNFYYNNSICEAH 84
+R++ + + + +C+ H
Sbjct: 61 SRLDRFFDASGKSPSHRDLLCDDH 84
>AT2G30690.1 | Symbols: | Protein of unknown function, DUF593 |
chr2:13076229-13078595 FORWARD LENGTH=760
Length = 760
Score = 80.1 bits (196), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 75/156 (48%), Gaps = 26/156 (16%)
Query: 27 EWVLIIVLFLDGFLAFFAIELAKYFELEIPCWLCARMNHVLACKTPNFYYNNSICEAHKK 86
EW LI+++F+D L++ + A+Y L++PC+LC+++ H L ++ +C H+
Sbjct: 25 EWFLILLMFIDALLSYLLVWFARYCRLQMPCFLCSKLLHPL-------HWRFLLCRNHRS 77
Query: 87 DVSSLAFCHNH-KKLSEISKLCDACLLSFATGKESDCDTYKSLVGVLNKNLECSVDNGQN 145
+VSS C NH L++ +CD CLLSF + D + L+G L +L
Sbjct: 78 EVSSYMSCQNHGNNLADCRGMCDDCLLSFTKMTGPNPDMNRLLLGKLGYDL--------- 128
Query: 146 VQLSLKDDGVLQAEKNSAQRCSCCGAPLKIKAFLSR 181
+ ++ + CSCC P + + R
Sbjct: 129 ---------LSRSHFAHPRSCSCCNKPWRTRHHTQR 155
Score = 74.7 bits (182), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 54/100 (54%)
Query: 308 VNNLKRQVHLDRKSLMALYMELDEERXXXXXXXXXXXXXITRLQEEKAALQMDALQYQRM 367
V LK+Q+ RKSL L E +EER ITRLQEEKAAL M+ALQY RM
Sbjct: 613 VEVLKQQLEHGRKSLRDLNKEFEEERNASAIATNQAMAMITRLQEEKAALHMEALQYLRM 672
Query: 368 MEEQIEYDEEALQEYNDMIXXXXXXXXXXXXXXGIYREKY 407
M+EQ E+D +AL+ ND++ YR KY
Sbjct: 673 MDEQAEHDMDALERANDVLADREKEIQDLEMELEYYRVKY 712
>AT5G06560.1 | Symbols: | Protein of unknown function, DUF593 |
chr5:2003678-2005543 REVERSE LENGTH=518
Length = 518
Score = 55.1 bits (131), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 44/76 (57%)
Query: 311 LKRQVHLDRKSLMALYMELDEERXXXXXXXXXXXXXITRLQEEKAALQMDALQYQRMMEE 370
L+ V ++S+ LY ELDEER I RLQ +KA LQM+ Q++R EE
Sbjct: 74 LRETVSSQQQSIQDLYEELDEERNAASTAASEAMSMILRLQRDKAELQMELRQFKRFAEE 133
Query: 371 QIEYDEEALQEYNDMI 386
++E+D++ L + D+I
Sbjct: 134 KMEHDQQELLDLEDLI 149
>AT4G13630.2 | Symbols: | Protein of unknown function, DUF593 |
chr4:7934093-7936029 REVERSE LENGTH=569
Length = 569
Score = 49.7 bits (117), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 39/62 (62%)
Query: 318 DRKSLMALYMELDEERXXXXXXXXXXXXXITRLQEEKAALQMDALQYQRMMEEQIEYDEE 377
+R + +L +EL++ER I RLQEEKA+++M+A QYQRM+EE+ +D E
Sbjct: 232 ERAARASLALELEKERNAAASAADEALGMILRLQEEKASIEMEARQYQRMIEEKSAFDAE 291
Query: 378 AL 379
+
Sbjct: 292 EM 293
>AT4G13630.1 | Symbols: | Protein of unknown function, DUF593 |
chr4:7934093-7936029 REVERSE LENGTH=569
Length = 569
Score = 49.7 bits (117), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 39/62 (62%)
Query: 318 DRKSLMALYMELDEERXXXXXXXXXXXXXITRLQEEKAALQMDALQYQRMMEEQIEYDEE 377
+R + +L +EL++ER I RLQEEKA+++M+A QYQRM+EE+ +D E
Sbjct: 232 ERAARASLALELEKERNAAASAADEALGMILRLQEEKASIEMEARQYQRMIEEKSAFDAE 291
Query: 378 AL 379
+
Sbjct: 292 EM 293