Miyakogusa Predicted Gene
- Lj4g3v2593570.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2593570.1 Non Chatacterized Hit- tr|I1K3T6|I1K3T6_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.39655
PE,81.31,0,UNCHARACTERIZED,Magnesium transporter NIPA; seg,NULL;
Mg_trans_NIPA,Magnesium transporter NIPA; Mult,CUFF.51201.1
(322 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G13800.1 | Symbols: | Protein of unknown function (DUF803) |... 416 e-116
AT1G71900.1 | Symbols: | Protein of unknown function (DUF803) |... 403 e-112
AT3G23870.1 | Symbols: | Protein of unknown function (DUF803) |... 402 e-112
AT1G34470.1 | Symbols: | Protein of unknown function (DUF803) |... 390 e-109
AT4G09640.1 | Symbols: | Protein of unknown function (DUF803) |... 367 e-102
AT2G21120.1 | Symbols: | Protein of unknown function (DUF803) |... 335 2e-92
AT4G38730.1 | Symbols: | Protein of unknown function (DUF803) |... 330 1e-90
>AT4G13800.1 | Symbols: | Protein of unknown function (DUF803) |
chr4:8002129-8003858 REVERSE LENGTH=336
Length = 336
Score = 416 bits (1068), Expect = e-116, Method: Compositional matrix adjust.
Identities = 205/313 (65%), Positives = 232/313 (74%), Gaps = 1/313 (0%)
Query: 3 ISSDNVTGLVLAXXXXXXXXXXXXXKKMGLKKAGNTGKSAASGGHAYLYEPWWWVGMTSM 62
+S DN+ G++LA KK GLKKAG +G A GG+ YLYEPWWW GM +M
Sbjct: 4 MSPDNIHGVILAVSSSIFIGSSFIIKKKGLKKAGVSGARAGEGGYGYLYEPWWWAGMITM 63
Query: 63 IVGEIXXXXXXXXXXXLLVTPLGALSIIFSAVLAHFILEEKLHIFGVLGCALCVVGSTTI 122
IVGEI +LVTPLGALSIIFSAVLAHFILEEKLH+FG+LGC LCVVGSTTI
Sbjct: 64 IVGEIANFAAYAFAPAILVTPLGALSIIFSAVLAHFILEEKLHMFGILGCVLCVVGSTTI 123
Query: 123 VLHAPHERIIHSVKEVWQLATEPGFLVYTCXXXXXXXXXXFYFVPRYGPTHLIIYVGICS 182
VLHAPHE+ I SVK+VW LATEPGFL Y+ FY+ PRYG TH+I+YVGICS
Sbjct: 124 VLHAPHEQGIESVKQVWHLATEPGFLAYSAVVLVVVLALIFYYEPRYGKTHMIVYVGICS 183
Query: 183 LTGSLTVMGVKAVGIALKLTFEGSNQFTYFQTWFFTLVVIGCCLLQINYLNKALDTFNTA 242
L GSLTVM VKAV IA+KLTF G NQF YF W F +VV CC+LQINYLNKALD FNTA
Sbjct: 184 LMGSLTVMSVKAVAIAIKLTFSGMNQFKYFHAWIFIIVVTICCILQINYLNKALDNFNTA 243
Query: 243 VVSPVYYVMFTSLTIFASIIMFKDWDTQNASQIATELCGFVTILSGTFLLHKTKDMGKKP 302
V+SPVYYVMFT+ TI AS+IMFKDW +Q+ QIATELCGFVTILSGTFLLHKTKDMG
Sbjct: 244 VISPVYYVMFTTFTILASMIMFKDWASQSGLQIATELCGFVTILSGTFLLHKTKDMGNST 303
Query: 303 TDQPPVFSTPDRD 315
+ + +P RD
Sbjct: 304 SLRGSTSHSP-RD 315
>AT1G71900.1 | Symbols: | Protein of unknown function (DUF803) |
chr1:27061754-27064053 FORWARD LENGTH=343
Length = 343
Score = 403 bits (1035), Expect = e-112, Method: Compositional matrix adjust.
Identities = 195/297 (65%), Positives = 226/297 (76%)
Query: 2 GISSDNVTGLVLAXXXXXXXXXXXXXKKMGLKKAGNTGKSAASGGHAYLYEPWWWVGMTS 61
G+SSDN+ GLVLA KK GLKKA +TG A GG++YLYEP WW+GMT+
Sbjct: 14 GMSSDNIKGLVLALSSSLFIGASFIVKKKGLKKAASTGTRAGVGGYSYLYEPLWWIGMTT 73
Query: 62 MIVGEIXXXXXXXXXXXLLVTPLGALSIIFSAVLAHFILEEKLHIFGVLGCALCVVGSTT 121
M++GEI +LVTPLGA+SII SAVLAH IL EKLHIFG+LGCALCVVGSTT
Sbjct: 74 MLLGEIANFAAYAFAPAILVTPLGAVSIIISAVLAHIILREKLHIFGILGCALCVVGSTT 133
Query: 122 IVLHAPHERIIHSVKEVWQLATEPGFLVYTCXXXXXXXXXXFYFVPRYGPTHLIIYVGIC 181
IVLHAP ER I SV EVW LATEP F+ Y FVP+YG T++++Y+GIC
Sbjct: 134 IVLHAPQEREIDSVIEVWNLATEPAFMFYASLVIGAAVFLIIRFVPQYGQTNVMVYIGIC 193
Query: 182 SLTGSLTVMGVKAVGIALKLTFEGSNQFTYFQTWFFTLVVIGCCLLQINYLNKALDTFNT 241
SL GSL+VM VKA+GIALKLTF G+NQ Y QTW FTLVV+ C + Q+NYLNKALDTFNT
Sbjct: 194 SLVGSLSVMSVKALGIALKLTFSGTNQLFYPQTWIFTLVVLTCVVTQLNYLNKALDTFNT 253
Query: 242 AVVSPVYYVMFTSLTIFASIIMFKDWDTQNASQIATELCGFVTILSGTFLLHKTKDM 298
A+VSP+YYVMFTSLTI AS+IMFKDWD QN +QI TE+CGFVTILSGTFLLH+TKDM
Sbjct: 254 AIVSPIYYVMFTSLTILASVIMFKDWDRQNGTQIVTEICGFVTILSGTFLLHRTKDM 310
>AT3G23870.1 | Symbols: | Protein of unknown function (DUF803) |
chr3:8620253-8621755 FORWARD LENGTH=335
Length = 335
Score = 402 bits (1033), Expect = e-112, Method: Compositional matrix adjust.
Identities = 205/313 (65%), Positives = 236/313 (75%), Gaps = 1/313 (0%)
Query: 3 ISSDNVTGLVLAXXXXXXXXXXXXXKKMGLKKAGNTGKSAASGGHAYLYEPWWWVGMTSM 62
+S DN+ G++LA KK GLKKAG +G A GG+ YL EPWWW GM +M
Sbjct: 4 MSPDNINGVILAVSSSIFIGSSFIIKKKGLKKAGASGVRAGEGGYGYLKEPWWWAGMITM 63
Query: 63 IVGEIXXXXXXXXXXXLLVTPLGALSIIFSAVLAHFILEEKLHIFGVLGCALCVVGSTTI 122
IVGE+ +LVTPLGALSIIFSAVLAHFIL+EKLH+FG+LGC LCVVGSTTI
Sbjct: 64 IVGEVANFAAYAFAPAILVTPLGALSIIFSAVLAHFILKEKLHMFGILGCILCVVGSTTI 123
Query: 123 VLHAPHERIIHSVKEVWQLATEPGFLVYTCXXXXXXXXXXFYFVPRYGPTHLIIYVGICS 182
VLHAPHE+ I SVK++WQLA EPGFLVY+ FY+ PRYG TH+I+YVGICS
Sbjct: 124 VLHAPHEQKIESVKQIWQLAIEPGFLVYSAVIVIVVAILIFYYEPRYGKTHMIVYVGICS 183
Query: 183 LTGSLTVMGVKAVGIALKLTFEGSNQFTYFQTWFFTLVVIGCCLLQINYLNKALDTFNTA 242
L GSLTVM VKAV IA+KLTF G+NQF YF TW F LVV CC+LQINYLNKALDTFNTA
Sbjct: 184 LMGSLTVMSVKAVAIAIKLTFSGTNQFKYFNTWIFILVVATCCILQINYLNKALDTFNTA 243
Query: 243 VVSPVYYVMFTSLTIFASIIMFKDWDTQNASQIATELCGFVTILSGTFLLHKTKDMGKKP 302
V+SPVYYVMFT+ TI AS+IMFKDW +Q+ +IATELCGFVTILSGTFLLHKTKDMG
Sbjct: 244 VISPVYYVMFTTFTIIASMIMFKDWASQSGLKIATELCGFVTILSGTFLLHKTKDMGNSA 303
Query: 303 TDQPPVFSTPDRD 315
+ + + S P RD
Sbjct: 304 SGRGSI-SMPTRD 315
>AT1G34470.1 | Symbols: | Protein of unknown function (DUF803) |
chr1:12604574-12606758 FORWARD LENGTH=368
Length = 368
Score = 390 bits (1002), Expect = e-109, Method: Compositional matrix adjust.
Identities = 192/297 (64%), Positives = 221/297 (74%)
Query: 2 GISSDNVTGLVLAXXXXXXXXXXXXXKKMGLKKAGNTGKSAASGGHAYLYEPWWWVGMTS 61
G+SSDN+ GLVLA KK GLK+AG +G A SGG++YL EP WWVGM +
Sbjct: 14 GMSSDNIKGLVLALSSSLFIGASFIVKKKGLKRAGASGLRAGSGGYSYLLEPLWWVGMIT 73
Query: 62 MIVGEIXXXXXXXXXXXLLVTPLGALSIIFSAVLAHFILEEKLHIFGVLGCALCVVGSTT 121
MIVGEI +LVTPLGALSII SA LAH IL EKLH FG+LGC LCVVGS T
Sbjct: 74 MIVGEIANFAAYAFAPAILVTPLGALSIIISAALAHVILHEKLHTFGLLGCVLCVVGSIT 133
Query: 122 IVLHAPHERIIHSVKEVWQLATEPGFLVYTCXXXXXXXXXXFYFVPRYGPTHLIIYVGIC 181
IVLHAP E+ I SV +VW LATEP FL+Y FVP+YG +H+++Y+G+C
Sbjct: 134 IVLHAPQEQEIDSVLQVWNLATEPAFLLYAAAVVGAAIILIVQFVPQYGQSHVMVYIGVC 193
Query: 182 SLTGSLTVMGVKAVGIALKLTFEGSNQFTYFQTWFFTLVVIGCCLLQINYLNKALDTFNT 241
SL GSL+VM VKA+GIALKLTF G NQ Y QTW FTL+V+ C + Q+NYLNKALDTFNT
Sbjct: 194 SLVGSLSVMSVKALGIALKLTFSGMNQLIYPQTWVFTLIVLTCVITQMNYLNKALDTFNT 253
Query: 242 AVVSPVYYVMFTSLTIFASIIMFKDWDTQNASQIATELCGFVTILSGTFLLHKTKDM 298
AVVSP+YYVMFTSLTI AS+IMFKDWD Q+ +QI TELCGFVTILSGTFLLHKTKDM
Sbjct: 254 AVVSPIYYVMFTSLTILASVIMFKDWDRQDGTQIVTELCGFVTILSGTFLLHKTKDM 310
>AT4G09640.1 | Symbols: | Protein of unknown function (DUF803) |
chr4:6088433-6090604 REVERSE LENGTH=386
Length = 386
Score = 367 bits (942), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/297 (63%), Positives = 220/297 (74%)
Query: 2 GISSDNVTGLVLAXXXXXXXXXXXXXKKMGLKKAGNTGKSAASGGHAYLYEPWWWVGMTS 61
G+SSDNV GLVLA KK GLKKAG +G A SGG++YL EP WW+GM +
Sbjct: 14 GMSSDNVKGLVLALSSSIFIGASFIVKKKGLKKAGASGLRAGSGGYSYLLEPLWWIGMIT 73
Query: 62 MIVGEIXXXXXXXXXXXLLVTPLGALSIIFSAVLAHFILEEKLHIFGVLGCALCVVGSTT 121
MIVGEI +LVTPLGALSII SA LAH IL+EKLH FG+LGCALC+VGS T
Sbjct: 74 MIVGEIANFAAYAFAPAILVTPLGALSIIISASLAHIILQEKLHTFGILGCALCIVGSVT 133
Query: 122 IVLHAPHERIIHSVKEVWQLATEPGFLVYTCXXXXXXXXXXFYFVPRYGPTHLIIYVGIC 181
IVLHAP E+ I SV EVW LATEP FL Y F+P YG +H+++Y+G+C
Sbjct: 134 IVLHAPQEQDIVSVLEVWNLATEPAFLFYAAAVVGAAIVLIVQFIPLYGQSHVMVYIGVC 193
Query: 182 SLTGSLTVMGVKAVGIALKLTFEGSNQFTYFQTWFFTLVVIGCCLLQINYLNKALDTFNT 241
SL GSL+VM VKA+GIALKLTF G+NQ Y QTW FT++V+ C + Q+NYLNKALDTFNT
Sbjct: 194 SLIGSLSVMSVKALGIALKLTFSGTNQLGYPQTWVFTVIVLFCVITQMNYLNKALDTFNT 253
Query: 242 AVVSPVYYVMFTSLTIFASIIMFKDWDTQNASQIATELCGFVTILSGTFLLHKTKDM 298
AVVSP+YYVMFTSLTI AS+IMFKDWD Q+ +QI TELCGFVTILSGTFLLH T DM
Sbjct: 254 AVVSPIYYVMFTSLTILASVIMFKDWDRQSGTQIMTELCGFVTILSGTFLLHTTTDM 310
>AT2G21120.1 | Symbols: | Protein of unknown function (DUF803) |
chr2:9052103-9054266 REVERSE LENGTH=328
Length = 328
Score = 335 bits (860), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 165/319 (51%), Positives = 211/319 (66%), Gaps = 5/319 (1%)
Query: 3 ISSDNVTGLVLAXXXXXXXXXXXXXKKMGLKKAGNTGKSAASGGHAYLYEPWWWVGMTSM 62
+ +DN GL+LA KK GLK+AG G A GG+ YL EP WW GM +M
Sbjct: 1 METDNGKGLILAVASSVFIGSSFILKKKGLKRAGAIGTRAGYGGYTYLLEPLWWAGMVTM 60
Query: 63 IVGEIXXXXXXXXXXXLLVTPLGALSIIFSAVLAHFILEEKLHIFGVLGCALCVVGSTTI 122
IVGE +LVTPLGALSII SAVLAHF+L+EKL GVLGC C+VGS I
Sbjct: 61 IVGEAANFVAYIYAPAVLVTPLGALSIIISAVLAHFLLKEKLKKMGVLGCVSCIVGSVVI 120
Query: 123 VLHAPHERIIHSVKEVWQLATEPGFLVYTCXXXXXXXXXXFYFVPRYGPTHLIIYVGICS 182
V+HAP E+ +SV+E+W LAT+P FL+Y +F P G T++++Y+GICS
Sbjct: 121 VIHAPKEQTPNSVEEIWNLATQPAFLIYVAITMSIVLALILHFEPLCGQTNILVYIGICS 180
Query: 183 LTGSLTVMGVKAVGIALKLTFEGSNQFTYFQTWFFTLVVIGCCLLQINYLNKALDTFNTA 242
L G+LTVM +KA+GIA+KLT EG +Q Y QTW F +V + C + Q+ YLNKALDTFN A
Sbjct: 181 LMGALTVMSIKAIGIAIKLTMEGVSQIGYPQTWLFVMVAVTCVVTQLIYLNKALDTFNAA 240
Query: 243 VVSPVYYVMFTSLTIFASIIMFKDWDTQNASQIATELCGFVTILSGTFLLHKTKDMGKKP 302
+VSPVYYVMFT+LTI AS IMFKDW Q+A+ +A+ELCGF+T+L+GT +LH T++
Sbjct: 241 IVSPVYYVMFTTLTIVASAIMFKDWSGQDAASVASELCGFITVLTGTMILHGTRE----- 295
Query: 303 TDQPPVFSTPDRDYDNSAS 321
+Q S R YD+ S
Sbjct: 296 EEQQQASSEHVRWYDSRKS 314
>AT4G38730.1 | Symbols: | Protein of unknown function (DUF803) |
chr4:18080484-18082305 REVERSE LENGTH=326
Length = 326
Score = 330 bits (845), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 160/293 (54%), Positives = 200/293 (68%)
Query: 5 SDNVTGLVLAXXXXXXXXXXXXXKKMGLKKAGNTGKSAASGGHAYLYEPWWWVGMTSMIV 64
SDN GLVLA KK GLK+A G A GG+ YL EP WWVG+ +M
Sbjct: 3 SDNEMGLVLAVSSSVFIGSSFILKKKGLKRAAANGTRAGFGGYTYLLEPLWWVGLVTMTF 62
Query: 65 GEIXXXXXXXXXXXLLVTPLGALSIIFSAVLAHFILEEKLHIFGVLGCALCVVGSTTIVL 124
GEI +LVTPLGALSII SAVLAHF+L+EKL GV GC C+VGS IV+
Sbjct: 63 GEIANFVAYVYAPAVLVTPLGALSIIISAVLAHFLLDEKLRKMGVWGCVCCIVGSVMIVI 122
Query: 125 HAPHERIIHSVKEVWQLATEPGFLVYTCXXXXXXXXXXFYFVPRYGPTHLIIYVGICSLT 184
HAP E+ +SV+E+W+LA +P FL+Y Y P G T++++Y+GICSL
Sbjct: 123 HAPQEQTPNSVEEIWKLAMQPAFLIYVAISMSIVLALILYCEPLCGQTNILVYIGICSLM 182
Query: 185 GSLTVMGVKAVGIALKLTFEGSNQFTYFQTWFFTLVVIGCCLLQINYLNKALDTFNTAVV 244
GSLTVM +KAVGIA+KLTFEG NQ Y +TWFF +V C ++Q+ YLNKALDTFN A+V
Sbjct: 183 GSLTVMSIKAVGIAIKLTFEGINQIWYPETWFFAMVAAICVVMQMIYLNKALDTFNAAIV 242
Query: 245 SPVYYVMFTSLTIFASIIMFKDWDTQNASQIATELCGFVTILSGTFLLHKTKD 297
SP+YYVMFT+LTI AS IMFKDW+ QN IA+E+CGF+T+L+GT +LH T++
Sbjct: 243 SPIYYVMFTTLTIVASAIMFKDWNGQNTDSIASEICGFITVLTGTVILHSTRE 295