Miyakogusa Predicted Gene
- Lj0g3v0099839.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0099839.2 Non Characterized Hit- tr|A3C5M6|A3C5M6_ORYSJ
Putative uncharacterized protein OS=Oryza sativa
subsp,57.14,0.00000001,SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL; MHD2,Mammalian uncoordinated homology 13, domain
2;,CUFF.5606.2
(316 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score      E
Sequences producing significant alignments:                         (bits)  Value
AT1G04470.1 | Symbols: | Protein of unknown function (DUF810) |...     442  e-124
AT2G33420.1 | Symbols: | Protein of unknown function (DUF810) |...     437  e-123
AT2G25800.1 | Symbols: | Protein of unknown function (DUF810) |...     181  6e-46
AT2G20010.2 | Symbols: | Protein of unknown function (DUF810) |...     159  2e-39
AT2G20010.1 | Symbols: | Protein of unknown function (DUF810) |...     159  2e-39
AT5G06970.1 | Symbols: | Protein of unknown function (DUF810) |...     110  1e-24
AT4G11670.1 | Symbols: | Protein of unknown function (DUF810) |...      59  6e-09
>AT1G04470.1 | Symbols: | Protein of unknown function (DUF810) |
chr1:1211177-1214591 REVERSE LENGTH=1035
Length = 1035
Score = 442 bits (1137), Expect = e-124, Method: Compositional matrix adjust.
Identities = 209/315 (66%), Positives = 250/315 (79%), Gaps = 2/315 (0%)
Query: 2 LPPLTRCNRDSKFTKLWKKAAPCGANFQDLHHMKGAFEGHHPRSSTSRGTQRLYVRLNTL 61
LPPLTRCNRDSKF KLWKKA PC A+ ++L+ M A G+HPR STSRGTQRLY+RLNTL
Sbjct: 723 LPPLTRCNRDSKFVKLWKKATPCAASGEELNQMGEAPGGNHPRPSTSRGTQRLYIRLNTL 782
Query: 62 HYLLTHIHSLEKSISMNPGIVPSNRLRFANNRRAQSNSYFESVNLSILAACQHVSEVAAY 121
H+L + +HSL KS+S+NP ++P+ R R R +S+SYFE I +ACQHVSEVAAY
Sbjct: 783 HFLSSQLHSLNKSLSLNPRVLPATRKRC--RERTKSSSYFEFTQAGIESACQHVSEVAAY 840
Query: 122 RLIFLDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMKEVMKASF 181
RLIFLDS VFY+SLY G V G+IKPAL++LKQNL+LMT IL D+AQ LAMKEVMKASF
Sbjct: 841 RLIFLDSYSVFYESLYPGDVANGRIKPALRILKQNLTLMTAILADKAQALAMKEVMKASF 900
Query: 182 DALLMVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXXXXXXXIA 241
+ +L VLLAGGHSR+F R+DH++I EDFE LK+V+ C +GLI I
Sbjct: 901 EVVLTVLLAGGHSRVFCRTDHDLIEEDFESLKKVYCTCGEGLIPEEVVDREAETVEGVIQ 960
Query: 242 LMGQNTEQLMEDFSIVTCESSGIGIMGNGQKLPMPPTTGKWNRADPNTILRVLCYRNDRA 301
LMGQ TEQLMEDFSIVTCESSG+G++G GQKLPMPPTTG+WNR+DPNTILRVLCYR+DR
Sbjct: 961 LMGQPTEQLMEDFSIVTCESSGMGLVGTGQKLPMPPTTGRWNRSDPNTILRVLCYRDDRV 1020
Query: 302 ADQFLKRTFQLAKRR 316
A+QFLK++FQL KRR
Sbjct: 1021 ANQFLKKSFQLGKRR 1035
>AT2G33420.1 | Symbols: | Protein of unknown function (DUF810) |
chr2:14158782-14162304 FORWARD LENGTH=1039
Length = 1039
Score = 437 bits (1124), Expect = e-123, Method: Compositional matrix adjust.
Identities = 205/316 (64%), Positives = 252/316 (79%), Gaps = 2/316 (0%)
Query: 2 LPPLTRCNRDSKFTKLWKKAAPCGANFQDLHHMKGAF-EGHHPRSSTSRGTQRLYVRLNT 60
LPPLTRCNRDS+F KLWK+A PC + +DL + +GHHPR STSRGTQRLY+RLNT
Sbjct: 725 LPPLTRCNRDSRFVKLWKRATPCTTSNEDLKYTTSVISDGHHPRPSTSRGTQRLYIRLNT 784
Query: 61 LHYLLTHIHSLEKSISMNPGIVPSNRLRFANNRRAQSNSYFESVNLSILAACQHVSEVAA 120
LH+L +HIHSL K++S+NP I+P+ R R+ +R S+SYF+ I +ACQHVSEVAA
Sbjct: 785 LHFLSSHIHSLNKTLSLNPRILPATRKRY-RHRNNNSSSYFDFTYAGIESACQHVSEVAA 843
Query: 121 YRLIFLDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMKEVMKAS 180
YRLIFLDS+ V Y+SLYVG V +I+PAL+++KQNL+LM+ IL DRAQ LAM+EVMK+S
Sbjct: 844 YRLIFLDSNSVLYESLYVGEVANARIRPALRIMKQNLTLMSAILADRAQSLAMREVMKSS 903
Query: 181 FDALLMVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXXXXXXXI 240
F+A LMVLLAGG+SR+F+RSDH II EDFE+LKRVF C +GLI I
Sbjct: 904 FEAFLMVLLAGGYSRVFYRSDHSIIEEDFENLKRVFCTCGEGLIPEEVVDREAETVEGVI 963
Query: 241 ALMGQNTEQLMEDFSIVTCESSGIGIMGNGQKLPMPPTTGKWNRADPNTILRVLCYRNDR 300
LM Q TEQLMEDFSIVTCE+SG+G++G+GQKLPMPPTTG+WNR+DPNTILRVLC+RNDR
Sbjct: 964 QLMSQPTEQLMEDFSIVTCETSGMGMVGSGQKLPMPPTTGRWNRSDPNTILRVLCHRNDR 1023
Query: 301 AADQFLKRTFQLAKRR 316
A+QFLK++FQL KRR
Sbjct: 1024 VANQFLKKSFQLPKRR 1039
>AT2G25800.1 | Symbols: | Protein of unknown function (DUF810) |
chr2:11006138-11009728 REVERSE LENGTH=987
Length = 987
Score = 181 bits (459), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 108/321 (33%), Positives = 166/321 (51%), Gaps = 29/321 (9%)
Query: 2 LPPLTRCNRDSKFTKLWKKAAPCGANFQDLHHMKGAFEGHHPRSSTSRGTQRLYVRLNTL 61
+P LTRC SKF WKK Q G + S G ++ VR+N+L
Sbjct: 688 MPALTRCTTGSKFQ--WKKKEKTPTT-QKRESQVSVMNGEN-----SFGVTQICVRINSL 739
Query: 62 HYLLTHIHSLEKSISMNPGIVPSNRLRFANNRRAQSNSY-------FESVNLSILAACQH 114
H + + + +EK + + N A ++ + FE + + Q
Sbjct: 740 HKIRSELDVVEKRVITH----------LRNCESAHTDDFSNGLEKKFELTPAACIEGVQQ 789
Query: 115 VSEVAAYRLIFLDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMK 174
+SE AY+++F D S +D LY+G + +I P LK L+QNL+++ + +R + +
Sbjct: 790 LSESLAYKVVFHDLSHTLWDGLYIGDLSSSRIDPFLKELEQNLTVIAETVHERVRTRIIT 849
Query: 175 EVMKASFDALLMVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXX 234
++M+AS D L+VLLAGG SR F R D +I+ EDF+ +K +F DGL A
Sbjct: 850 DIMRASLDGFLLVLLAGGPSRAFTRQDSQIMEEDFKSMKDMFWANGDGL-AMDLIDKFST 908
Query: 235 XXXXXIALMGQNTEQLMEDFSIVTCESSGIGIMGNGQKLPMPPTTGKWNRADPNTILRVL 294
+ L +T+ L+E F T E+ G +LP+PPT+G+WN +PNT+LRVL
Sbjct: 909 TVRGVLPLFSTDTDSLIERFKGTTLEAYG---SSAKSRLPLPPTSGQWNGMEPNTLLRVL 965
Query: 295 CYRNDRAADQFLKRTFQLAKR 315
CYRND +A +FLK+T+ L K+
Sbjct: 966 CYRNDESATRFLKKTYNLPKK 986
>AT2G20010.2 | Symbols: | Protein of unknown function (DUF810) |
chr2:8637977-8641184 REVERSE LENGTH=952
Length = 952
Score = 159 bits (402), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 103/320 (32%), Positives = 162/320 (50%), Gaps = 29/320 (9%)
Query: 1 MLPPLTRCNRDSKFTKLWKKAAPCGANFQDLHHMKGAFEGHHPRSSTSRGTQRLYV---- 56
+LP LTRC S+ ++KK K H +S G +
Sbjct: 655 VLPALTRCTVGSRLHGVFKKKE------------KPMVASHRRKSQLGTGNDSAEILQFC 702
Query: 57 -RLNTLHYLLTHIHSLEKSISMNPGIVPSNRLRFANNRRAQSNSYFESVNLSILAACQHV 115
R+NTL Y+ T I S + ++N +P + + + + FE Q +
Sbjct: 703 CRINTLQYIRTEIESSGRK-TLNR--LPESEVAALDAK----GKIFEQSISYCSKGIQQL 755
Query: 116 SEVAAYRLIFLDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMKE 175
SE AY+++F D S V +D LY+G V +I+P L+ L++ L ++++ + DR + + +
Sbjct: 756 SEATAYKIVFHDLSNVLWDGLYLGEVPSSRIEPFLQELERCLEIISSSVHDRVRTRVISD 815
Query: 176 VMKASFDALLMVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXXX 235
+M+ASFD L+VLLAGG SR F D + EDF+ L +F + DGL
Sbjct: 816 IMRASFDGFLLVLLAGGPSRGFTIQDSAAVEEDFKFLCDLFWSNGDGL-PLDLIEKVSTT 874
Query: 236 XXXXIALMGQNTEQLMEDFSIVTCESSGIGIMGNGQKLPMPPTTGKWNRADPNTILRVLC 295
+ L+ +T+ L+E F V E+ G + KLP+PPT+G W+ +PNT+LRVLC
Sbjct: 875 VKSILPLLRTDTDSLIERFKAVCLENHG----SDRGKLPLPPTSGPWSPTEPNTLLRVLC 930
Query: 296 YRNDRAADQFLKRTFQLAKR 315
YR D A +FLK+T+ L ++
Sbjct: 931 YRYDEPATKFLKKTYNLPRK 950
>AT2G20010.1 | Symbols: | Protein of unknown function (DUF810) |
chr2:8637977-8640830 REVERSE LENGTH=834
Length = 834
Score = 159 bits (402), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 160/321 (49%), Gaps = 31/321 (9%)
Query: 1 MLPPLTRCNRDSKFTKLWKKAAPCGANFQDLHHMKGAFEGHHPRSSTSRGTQRLYV---- 56
+LP LTRC S+ ++KK K H +S G +
Sbjct: 537 VLPALTRCTVGSRLHGVFKKKE------------KPMVASHRRKSQLGTGNDSAEILQFC 584
Query: 57 -RLNTLHYLLTHIHSLEKSISMNPGIVPSNRLRFANNRRAQSNSYFESVNLSILA-ACQH 114
R+NTL Y+ T I S G NRL + + ++S + Q
Sbjct: 585 CRINTLQYIRTEIES--------SGRKTLNRLPESEVAALDAKGKIFEQSISYCSKGIQQ 636
Query: 115 VSEVAAYRLIFLDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMK 174
+SE AY+++F D S V +D LY+G V +I+P L+ L++ L ++++ + DR + +
Sbjct: 637 LSEATAYKIVFHDLSNVLWDGLYLGEVPSSRIEPFLQELERCLEIISSSVHDRVRTRVIS 696
Query: 175 EVMKASFDALLMVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXX 234
++M+ASFD L+VLLAGG SR F D + EDF+ L +F + DGL
Sbjct: 697 DIMRASFDGFLLVLLAGGPSRGFTIQDSAAVEEDFKFLCDLFWSNGDGL-PLDLIEKVST 755
Query: 235 XXXXXIALMGQNTEQLMEDFSIVTCESSGIGIMGNGQKLPMPPTTGKWNRADPNTILRVL 294
+ L+ +T+ L+E F V E+ G + KLP+PPT+G W+ +PNT+LRVL
Sbjct: 756 TVKSILPLLRTDTDSLIERFKAVCLENHG----SDRGKLPLPPTSGPWSPTEPNTLLRVL 811
Query: 295 CYRNDRAADQFLKRTFQLAKR 315
CYR D A +FLK+T+ L ++
Sbjct: 812 CYRYDEPATKFLKKTYNLPRK 832
>AT5G06970.1 | Symbols: | Protein of unknown function (DUF810) |
chr5:2158431-2166004 REVERSE LENGTH=1101
Length = 1101
Score = 110 bits (274), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 131/274 (47%), Gaps = 26/274 (9%)
Query: 50 GTQRLYVRLNTLHYLLTHIHSLEKSISMN-PGIVPSNRLRFANNRRAQSNSY-----FES 103
T L V+LNTLHY ++ + LE S+ + P ++ + +S S+ FE
Sbjct: 842 ATAMLCVQLNTLHYAVSQLSKLEDSMWLRWIAKKPREKIVIRKSMVEKSKSFNQKESFEG 901
Query: 104 VNLSILAACQHVSEVAAYRLIFLDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTI 163
I AA + E ++IF D F ++LY V + +++ ++ L L + ++
Sbjct: 902 SRKDINAALDRICEFTGTKIIFCDLREPFIENLYKPNVSQSRLEGLIEALDTELGQLCSV 961
Query: 164 LTDRAQPLAMKEVMKASFDALLMVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGL 223
+ + + + +++AS D LL VLL GG SR+FH S+ +++ ED E LK F + DGL
Sbjct: 962 IMEPLRDRIVTSLLQASLDGLLRVLLDGGASRVFHPSESKLLEEDVEVLKEFFISGGDGL 1021
Query: 224 IAXXXXXXXXXXXXXXIALMGQNTEQLMEDF---SIVTCESSGIGIMGNGQKLPMPPTTG 280
+ L G T +L++D S + + G G +G
Sbjct: 1022 -PRGVVENQVARVRLVVKLHGYETRELIDDLRSRSSLEMQQGGKGKLG------------ 1068
Query: 281 KWNRADPNTILRVLCYRNDRAADQFLKRTFQLAK 314
AD T++RVLC+RND A QFLK+ +++ +
Sbjct: 1069 ----ADTQTLVRVLCHRNDSEASQFLKKQYKIPR 1098
>AT4G11670.1 | Symbols: | Protein of unknown function (DUF810) |
chr4:7044401-7052971 REVERSE LENGTH=1117
Length = 1117
Score = 58.5 bits (140), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 60/275 (21%), Positives = 119/275 (43%), Gaps = 40/275 (14%)
Query: 53 RLYVRLNTLHYLLTHIHSLEKSISMNPGIVPSN---RLRFANNRRAQSNSYFESVNLSIL 109
+L + LNTL Y+ I + E I + +V ++ R + NS S + L
Sbjct: 813 KLCIILNTLCYIQKQISATEVGIRKSLTLVEASLNKRSEIETDEAEVENSLTHSEAVDEL 872
Query: 110 AACQH----------VSEVAAYRLIFLDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSL 159
A + +++ +++ +F+FY + + Q+ L
Sbjct: 873 FATTYDSLRDTNANCITKTRDLIVLWQKYAFLFYWLILMDEKCNAQV----------LDT 922
Query: 160 MTTILTDRAQPLAMKEVMKASFDALLMVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNC 219
+ ++ + ++ + + + +++ +A + VLL GG +R F SD ++ ED LK F
Sbjct: 923 VCSLSYEDSRDMVVLSICRSALEAYVRVLLDGGPTRAFSDSDITLMEEDLSILKEFFIAD 982
Query: 220 VDGLIAXXXXXXXXXXXXXXIALMGQNTEQLMEDFSIVTCESSGI--GIMGNGQKLPMPP 277
+GL +L+ Q +Q E + + ES + +M + + M
Sbjct: 983 GEGLPR---------------SLVEQEAKQAKEILDLYSLESDMLIQMLMTASELINMGV 1027
Query: 278 TTGKWNRADPNTILRVLCYRNDRAADQFLKRTFQL 312
++ + D T++RVLC++ DR A +FLKR ++L
Sbjct: 1028 SSEQRRLEDAQTLVRVLCHKKDRNASKFLKRQYEL 1062