Miyakogusa Predicted Gene
- Lj0g3v0302009.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0302009.1 Non Chatacterized Hit- tr|G7KKC4|G7KKC4_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,61.03,0,seg,NULL; DUF789,Protein of unknown function DUF789;
SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NULL,CUFF.20317.1
(1148 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G73210.1 | Symbols: | Protein of unknown function (DUF789) |... 68 5e-11
AT5G49220.1 | Symbols: | Protein of unknown function (DUF789) |... 67 9e-11
AT1G73210.2 | Symbols: | Protein of unknown function (DUF789) |... 66 1e-10
AT1G15030.1 | Symbols: | Protein of unknown function (DUF789) |... 64 5e-10
AT4G16100.1 | Symbols: | Protein of unknown function (DUF789) |... 63 1e-09
AT2G01260.1 | Symbols: | Protein of unknown function (DUF789) |... 60 1e-08
AT1G03610.1 | Symbols: | Protein of unknown function (DUF789) |... 59 2e-08
AT4G28150.2 | Symbols: | Protein of unknown function (DUF789) |... 59 2e-08
AT4G28150.1 | Symbols: | Protein of unknown function (DUF789) |... 59 3e-08
AT1G17830.1 | Symbols: | Protein of unknown function (DUF789) |... 56 1e-07
AT4G03420.1 | Symbols: | Protein of unknown function (DUF789) |... 56 2e-07
AT5G23380.1 | Symbols: | Protein of unknown function (DUF789) |... 53 1e-06
>AT1G73210.1 | Symbols: | Protein of unknown function (DUF789) |
chr1:27528428-27530453 REVERSE LENGTH=314
Length = 314
Score = 67.8 bits (164), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 108/281 (38%), Gaps = 47/281 (16%)
Query: 822 PSCLTCSQNHGGVSLCRHEMPALSLGYLWQWYEEHGSYGLEIRAQEYENLKKLGGVGQFP 881
P + Q G + E+ LG LW Y+E +YG + +L V Q
Sbjct: 25 PPSFSLPQEQG-----KEEIEYFRLGDLWDCYDEMSAYGFGTQV----DLNNGETVMQ-- 73
Query: 882 FRAYFSSSLSAVQLFKNREHQCASSSDDLPNCKVSEACQMIDISEHSIFSVPQPHDQDAN 941
Y+ LSA+Q+ N+ + + +++ + SE D + S +D
Sbjct: 74 ---YYVPYLSAIQIHTNKPALLSRNQNEVAESESSEGWS--DSESEKLLSRSMSNDSSKT 128
Query: 942 IQIPKKTASMNSASNPSINSACSGXXXXXXXXXXXXPPQQRRPLYEKIQELVRGDVPIQS 1001
+ + + +P + PP +R PL +KI LV
Sbjct: 129 WDAVSEDSVFDPDGSPLLKDRLGNLDFKYIERD---PPHKRIPLTDKINVLVEK------ 179
Query: 1002 KIYGDPTNLNSVNLRDLHPRSWYSVAWYPIYRIP----EGNFRAAFLTYHSLGHLVRRST 1057
Y L SV D+ P SW +VAWYPIY IP E + FLTYH+L + +
Sbjct: 180 --YPGLMTLRSV---DMSPASWMAVAWYPIYHIPTCRNEKDLTTGFLTYHTLSSSFQDNV 234
Query: 1058 --------NSDLQTVDSCIVS-----PAVGLQSYNAKGECW 1085
N + + + +++ P G+ +Y +G+ W
Sbjct: 235 VEGDQSNNNEETEFCEDSVINKRMPLPPFGVTTYKMQGDLW 275
>AT5G49220.1 | Symbols: | Protein of unknown function (DUF789) |
chr5:19956627-19958453 FORWARD LENGTH=409
Length = 409
Score = 66.6 bits (161), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 74/173 (42%), Gaps = 40/173 (23%)
Query: 978 PPQQRRPLYEKIQELVRGDVPIQSKIYGDPTNLNSVNLRDLHPRSWYSVAWYPIYRIPEG 1037
PP R PL KI +L VP L + DL P SW SV+WYPIYRIP G
Sbjct: 265 PPFGREPLANKISDLA-SRVP----------ELMTYRSCDLLPSSWVSVSWYPIYRIPVG 313
Query: 1038 ----NFRAAFLTYHSLGHLVRRSTN--SDLQTVDSCIVSPAVGLQSYNAKGECWFQLSHS 1091
N A FLT+HSL +S SD Q + + P GL SY K W Q
Sbjct: 314 PTLQNLDACFLTFHSLSTAPPQSAMGCSDSQP-STKLPLPTFGLASYKLKVSVWNQ---- 368
Query: 1092 ALEAEMLGMNPSFILKERLRTLEETASLMARAVVNKGNQICTNNHPDFEFFMS 1144
R++ ++ SL+ A +K + +HPD+ FF S
Sbjct: 369 ----------------NRIQESQKMTSLLQAA--DKWLKRLQVDHPDYRFFTS 403
>AT1G73210.2 | Symbols: | Protein of unknown function (DUF789) |
chr1:27528428-27530453 REVERSE LENGTH=312
Length = 312
Score = 66.2 bits (160), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/265 (23%), Positives = 104/265 (39%), Gaps = 42/265 (15%)
Query: 838 RHEMPALSLGYLWQWYEEHGSYGLEIRAQEYENLKKLGGVGQFPFRAYFSSSLSAVQLFK 897
+ E+ LG LW Y+E +YG + +L V Q Y+ LSA+Q+
Sbjct: 34 KEEIEYFRLGDLWDCYDEMSAYGFGTQV----DLNNGETVMQ-----YYVPYLSAIQIHT 84
Query: 898 NREHQCASSSDDLPNCKVSEACQMIDISEHSIFSVPQPHDQDANIQIPKKTASMNSASNP 957
N+ + + +++ + SE D + S +D + + + +P
Sbjct: 85 NKPALLSRNQNEVAESESSEGWS--DSESEKLLSRSMSNDSSKTWDAVSEDSVFDPDGSP 142
Query: 958 SINSACSGXXXXXXXXXXXXPPQQRRPLYEKIQELVRGDVPIQSKIYGDPTNLNSVNLRD 1017
+ PP +R PL +KI LV Y L SV D
Sbjct: 143 LLKDRLGNLDFKYIERD---PPHKRIPLTDKINVLVEK--------YPGLMTLRSV---D 188
Query: 1018 LHPRSWYSVAWYPIYRIP----EGNFRAAFLTYHSLGHLVRRST--------NSDLQTVD 1065
+ P SW +VAWYPIY IP E + FLTYH+L + + N + + +
Sbjct: 189 MSPASWMAVAWYPIYHIPTCRNEKDLTTGFLTYHTLSSSFQDNVVEGDQSNNNEETEFCE 248
Query: 1066 SCIVS-----PAVGLQSYNAKGECW 1085
+++ P G+ +Y +G+ W
Sbjct: 249 DSVINKRMPLPPFGVTTYKMQGDLW 273
>AT1G15030.1 | Symbols: | Protein of unknown function (DUF789) |
chr1:5177895-5179853 FORWARD LENGTH=360
Length = 360
Score = 64.3 bits (155), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 55/178 (30%), Positives = 75/178 (42%), Gaps = 45/178 (25%)
Query: 989 IQELVRGDVPIQSKIYGDPTN--------LNSVNLRDLHPRSWYSVAWYPIYRIPEG--- 1037
I E + D+P + + D + L ++ DL P SW+SVAWYPIY+IP G
Sbjct: 207 IFEYLERDLPYVREPFADKMSDLASRFPELKTLRSCDLLPSSWFSVAWYPIYKIPTGPTL 266
Query: 1038 -NFRAAFLTYHSL-----GHLVRRSTNSDLQTVDSC--IVSPAVGLQSYNAKGECW--FQ 1087
+ A FLTYHSL G V + +Q +S + P GL SY +G W F
Sbjct: 267 KDLDACFLTYHSLHTPFQGPGVTTGSMHVVQPRESVEKMELPVFGLASYKLRGSVWTSFG 326
Query: 1088 LSHSALEAEMLGMNPSFILKERLRTLEETASLMARAVVNKGNQICTNNHPDFEFFMSR 1145
S L + +++ RLR + NHPDF FF R
Sbjct: 327 GSGHQLANSLFQAADNWL---RLRQV---------------------NHPDFIFFCRR 360
>AT4G16100.1 | Symbols: | Protein of unknown function (DUF789) |
chr4:9105809-9107986 FORWARD LENGTH=394
Length = 394
Score = 62.8 bits (151), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 113/296 (38%), Gaps = 55/296 (18%)
Query: 807 LLHCCSPVICQSPNSPSCLTCSQNHGGVSLCRHEMPALSLGYLWQWYEEHGSYGLEIRAQ 866
L C +P++ S L + + G + P L LW +EE +YG+ +
Sbjct: 92 FLDCTTPIV-----STQHLPLTSSKGWRTREPEYRPYFLLNDLWDSFEEWSAYGVGVPLL 146
Query: 867 EYENLKKLGGVGQFPFRAYFSSSLSAVQLFKNREHQCASSSDDLPNCKVSEACQMIDISE 926
L + V Q Y+ LS +QL+++ C + + E
Sbjct: 147 ----LNGIDSVVQ-----YYVPYLSGIQLYEDPSRACTTRRR---------------VGE 182
Query: 927 HSIFSVPQPHDQDANIQIPKKTASMNSASNPSINSACSGXXXXXXXXXXXXPPQQRRPLY 986
S P+ D + + + ++ AS C G P
Sbjct: 183 ESDGDSPRDMSSDGSNDCRELSQNLYRASLE--EKPCIGSSSDESEASSNSPG------- 233
Query: 987 EKIQELVRGDVPIQSKIYGDP-TNLNS--VNLR-----DLHPRSWYSVAWYPIYRIPEG- 1037
E + E + G +P + D +NL+S LR DL P SW SVAWYPIYRIP G
Sbjct: 234 ELVFEYLEGAMPFGREPLTDKISNLSSQFPALRTYRSCDLSPSSWVSVAWYPIYRIPLGQ 293
Query: 1038 ---NFRAAFLTYHSLGHLVRRSTNSDLQ-----TVDSCIVSPAVGLQSYNAKGECW 1085
N A FLT+HSL R ++N + Q + + P GL SY K W
Sbjct: 294 SLQNLDACFLTFHSLSTPCRGTSNEEGQSSSKSVASAKLPLPTFGLASYKFKLSEW 349
>AT2G01260.1 | Symbols: | Protein of unknown function (DUF789) |
chr2:135494-137504 REVERSE LENGTH=369
Length = 369
Score = 60.1 bits (144), Expect = 1e-08, Method: Composition-based stats.
Identities = 50/138 (36%), Positives = 62/138 (44%), Gaps = 31/138 (22%)
Query: 1017 DLHPRSWYSVAWYPIYRIPEG----NFRAAFLTYHSL----GHLVRRSTNSDLQTVDSCI 1068
DL SW+SVAWYPIYRIP G + A FLTYHSL G + S Q +S
Sbjct: 254 DLLRSSWFSVAWYPIYRIPTGPTLKDLDACFLTYHSLHTSFGGEGSEQSMSLTQPRESEK 313
Query: 1069 VS-PAVGLQSYNAKGECWFQLSHSALEAEMLGMNPSFILKERLRTLEETASLMARAVVNK 1127
+S P GL SY +G W + S E +++ SL A +K
Sbjct: 314 MSLPVFGLASYKFRGSLWTPIGGS--EHQLVN------------------SLFQAA--DK 351
Query: 1128 GNQICTNNHPDFEFFMSR 1145
C +HPDF FF R
Sbjct: 352 WLHSCHVSHPDFLFFCRR 369
>AT1G03610.1 | Symbols: | Protein of unknown function (DUF789) |
chr1:901304-902672 FORWARD LENGTH=308
Length = 308
Score = 58.9 bits (141), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 118/306 (38%), Gaps = 68/306 (22%)
Query: 804 FEMLLHCCSPVICQSPNS-PSCLTCSQNHGGVSLCRHEMPALSLGYLWQWYEEHGSYG-- 860
+ LHC +P++ P S P + N R ++ L LW Y+E +YG
Sbjct: 13 LDRFLHCITPLV--PPQSLPKTEIRTLNRLWHPWERQKVEFFRLSDLWDCYDEWSAYGAS 70
Query: 861 LEIRAQEYENLKKLGGVGQFPFRAYFSSSLSAVQLFKNREHQCASSSDDLPNCKVSEACQ 920
+ I E+L + Y+ LSA+Q+F +S L ++ E +
Sbjct: 71 VPIHVTNGESLVQ-----------YYVPYLSAIQIF--------TSHSSL--IRLREESE 109
Query: 921 MIDISEHSIFSVPQPHDQDANIQIPKKTASMNSASNPSINSACSGXXXXXXXXXXXXPPQ 980
+ FS D ++ + ++ N+ +PS P
Sbjct: 110 DGECEGRDPFS-----DSGSDESVSEEGLENNTLLHPS-----DRLGYLYLQYFERSAPY 159
Query: 981 QRRPLYEKIQELVRGDVPIQSKIYGDPTNLNSVNLRDLHPRSWYSVAWYPIYRIPEG--- 1037
R PL +KI EL ++ Y +L SV DL P SW SVAWYPIY IP G
Sbjct: 160 TRVPLMDKINEL--------AQRYPGLMSLRSV---DLSPASWMSVAWYPIYHIPMGRTI 208
Query: 1038 -NFRAAFLTYHSLGHLVRRSTNSDLQTVDSC------------IVSPAVGLQSYNAKGEC 1084
+ FLTYH+L S+ D++ ++ I G+ +Y +G+
Sbjct: 209 KDLSTCFLTYHTLS-----SSFQDMEPEENGGDKERVRREGEDITLLPFGMATYKMQGDV 263
Query: 1085 WFQLSH 1090
W H
Sbjct: 264 WLSQDH 269
>AT4G28150.2 | Symbols: | Protein of unknown function (DUF789) |
chr4:13977642-13978912 REVERSE LENGTH=283
Length = 283
Score = 58.5 bits (140), Expect = 2e-08, Method: Composition-based stats.
Identities = 41/114 (35%), Positives = 54/114 (47%), Gaps = 24/114 (21%)
Query: 979 PQQRRPLYEKIQELVRGDVPIQSKIYGDPTNLNSVNLRDLHPRSWYSVAWYPIYRIPEG- 1037
P R PL +KI+EL V ++S L SV DL P SW +VAWYPIY IP
Sbjct: 151 PYSRVPLMDKIKELGERYVGLRS--------LRSV---DLSPASWMAVAWYPIYHIPMNR 199
Query: 1038 ---NFRAAFLTYHSLG---HLVRRSTNSDLQTVDSCIVSPAVGLQSYNAKGECW 1085
+ FLTYH+L V+R + +V A G+ +Y +G W
Sbjct: 200 SIKDLSTCFLTYHTLSSSFQDVKREEEKERISVS------AFGMATYKMQGRLW 247
>AT4G28150.1 | Symbols: | Protein of unknown function (DUF789) |
chr4:13977642-13978912 REVERSE LENGTH=285
Length = 285
Score = 58.5 bits (140), Expect = 3e-08, Method: Composition-based stats.
Identities = 41/114 (35%), Positives = 54/114 (47%), Gaps = 24/114 (21%)
Query: 979 PQQRRPLYEKIQELVRGDVPIQSKIYGDPTNLNSVNLRDLHPRSWYSVAWYPIYRIPEG- 1037
P R PL +KI+EL V ++S L SV DL P SW +VAWYPIY IP
Sbjct: 153 PYSRVPLMDKIKELGERYVGLRS--------LRSV---DLSPASWMAVAWYPIYHIPMNR 201
Query: 1038 ---NFRAAFLTYHSLG---HLVRRSTNSDLQTVDSCIVSPAVGLQSYNAKGECW 1085
+ FLTYH+L V+R + +V A G+ +Y +G W
Sbjct: 202 SIKDLSTCFLTYHTLSSSFQDVKREEEKERISVS------AFGMATYKMQGRLW 249
>AT1G17830.1 | Symbols: | Protein of unknown function (DUF789) |
chr1:6136118-6138172 REVERSE LENGTH=337
Length = 337
Score = 56.2 bits (134), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/274 (24%), Positives = 104/274 (37%), Gaps = 57/274 (20%)
Query: 838 RHEMPALSLGYLWQWYEEHGSYGLEIRAQEYENLKKLGGVGQFPFRAYFSSSLSAVQLFK 897
+ E+ L LW ++E +YGL + +L V Q Y+ LSA+Q++
Sbjct: 53 KDEIEYFRLSDLWDCFDEPSAYGLGSKV----DLNNGESVMQ-----YYVPYLSAIQIYT 103
Query: 898 NREHQCASSSDDLPNCKV------SEACQMIDISEHSIFSVPQPHDQDANIQIPKKTASM 951
N+ + D+ +C+ SE ++ + D+ +I T+S+
Sbjct: 104 NKSTAISRIHSDVVDCESECWSDDSEIEKLSRSMSSGSSKIWDSVSDDSGYEI-DGTSSL 162
Query: 952 NSASNPSINSACSGXXXXXXXXXXXXPPQQRRPLYEKIQELVRGDVPIQSKIYGDPTNLN 1011
SI+ P R PL K+ EL Y + L
Sbjct: 163 MRDKLGSID----------FQYFESVKPHLRVPLTAKVNELAEK--------YPGLSTLR 204
Query: 1012 SVNLRDLHPRSWYSVAWYPIYRIP----EGNFRAAFLTYHSL-----GHLV--------- 1053
SV DL P SW ++AWYPIY IP + + FL+YH+L G+L+
Sbjct: 205 SV---DLSPASWLAIAWYPIYHIPSRKTDKDLSTCFLSYHTLSSAFQGNLIEGDDEINET 261
Query: 1054 --RRSTNSDLQTVDSCIVSPAVGLQSYNAKGECW 1085
+ D V I GL SY +G+ W
Sbjct: 262 MKEETLCFDEGPVTKSIPLAPFGLVSYKLQGDLW 295
>AT4G03420.1 | Symbols: | Protein of unknown function (DUF789) |
chr4:1512226-1513594 FORWARD LENGTH=310
Length = 310
Score = 55.8 bits (133), Expect = 2e-07, Method: Composition-based stats.
Identities = 33/75 (44%), Positives = 40/75 (53%), Gaps = 15/75 (20%)
Query: 979 PQQRRPLYEKIQELVRGDVPIQSKIYGDPTNLNSVNLRDLHPRSWYSVAWYPIYRIPEG- 1037
P R PL +KI EL ++ Y +L SV DL P SW +VAWYPIY IP G
Sbjct: 160 PYARVPLMDKINEL--------AQRYPGLMSLRSV---DLSPASWMAVAWYPIYHIPMGR 208
Query: 1038 ---NFRAAFLTYHSL 1049
+ FLTYH+L
Sbjct: 209 TIKDLSTCFLTYHTL 223
>AT5G23380.1 | Symbols: | Protein of unknown function (DUF789) |
chr5:7866742-7870098 FORWARD LENGTH=301
Length = 301
Score = 53.1 bits (126), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 55/139 (39%), Gaps = 32/139 (23%)
Query: 1020 PRSWYSVAWYPIYRIPE----GNFRAAFLTYHSLGHLVRRSTNSD------LQTVDSCIV 1069
P SW S+AWYPIY IP AAFLTYH L + D ++ ++
Sbjct: 181 PNSWISIAWYPIYPIPPVIGVDGISAAFLTYHLLKPNFPETIGKDDKGNEQGESSTPEVL 240
Query: 1070 SPAVGLQSYNAKGECWFQLSHSALEAEMLGMNPSFILKERLRTLEETASLMARAVVNKGN 1129
P G +Y A G W S + + EE+A R +G
Sbjct: 241 LPPFGAMTYKAFGNLWMMPGTSDYQNREMN--------------EESADSWLR---KRG- 282
Query: 1130 QICTNNHPDFEFFMSRRRY 1148
+H DF FFMSR+ Y
Sbjct: 283 ----FSHSDFNFFMSRKFY 297