Miyakogusa Predicted Gene
- Lj1g3v1465200.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v1465200.1 Non Chatacterized Hit- tr|I1N7W1|I1N7W1_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.27482
PE,29.51,0.000000000005,OS09G0509400 PROTEIN,NULL; FAMILY NOT
NAMED,NULL; seg,NULL; DUF4210,Domain of unknown function
DUF42,CUFF.27351.1
(499 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G41110.1 | Symbols: | unknown protein; INVOLVED IN: biologic... 124 2e-28
AT3G26890.5 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 117 2e-26
AT3G26890.4 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 117 2e-26
AT3G26890.3 | Symbols: | unknown protein; BEST Arabidopsis thal... 117 2e-26
AT3G26890.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 117 2e-26
AT3G26890.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 117 2e-26
AT3G26890.6 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 110 2e-24
>AT5G41110.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT3G26890.5); Has 1807 Blast hits to 1807
proteins in 277 species: Archae - 0; Bacteria - 0;
Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0;
Other Eukaryotes - 339 (source: NCBI BLink). |
chr5:16452598-16454989 FORWARD LENGTH=621
Length = 621
Score = 124 bits (311), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 131/411 (31%), Positives = 185/411 (45%), Gaps = 58/411 (14%)
Query: 94 VNTEQTPTSRTVGFRIRASTPCVNGFGGNGYSSSVFNATNDATEA---SKSQVRKRLLSP 150
VN +T R VGF + N Y+S N T + E S VRKR+ SP
Sbjct: 93 VNLSRTEVRRIVGFESNEVSSV-----SNEYTSITVNRTGGSNEVDIIGGSLVRKRVSSP 147
Query: 151 LNGMMLADHFRGDPLDIGEGNHRTCPKVGDDCHNAIHEYKKVHIGDD---INIQPTIWSS 207
LN + + FRGD LDI N P H + +G D NI S
Sbjct: 148 LN-KLFPEKFRGDSLDISHSNQ---PMTSAGLPKGFH----IPVGQDHKKANI------S 193
Query: 208 AYFQEFVNSSCNDSGMNKNKIVXXXXXXXXXXXXXMCYKHLKSSSELNDSKERTKLKSLP 267
+ F S+C++ G + + L S++L S++ L S P
Sbjct: 194 GRLRLFTTSNCSEWGNDTSHT-------GKLSSTVFTDGPLLDSNDLQPSQDVHCLYS-P 245
Query: 268 V--ALSIPQKKVS-------SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLSFKDV 317
V +P K + SP LSPLG + S K L G ++ DD L
Sbjct: 246 VHETFQVPNKPLPCHRNISVSPPLSLSPLGPRFSERMKALQGGLNGNIFEDDVCLKNTGE 305
Query: 318 EESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTMSRLP 377
E L + F N +Q+ F+ D+ I+ + P S P K F ++S P
Sbjct: 306 EAELRTGHRSFDDT---NGIQR---AFSMDRAIE-----SVPTS-PCKR--FSRSLSGRP 351
Query: 378 SRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQGDKY 437
+RSL+ + +QKI+GFLA+L++ GGN SP+SQK+PF V+SV D +
Sbjct: 352 IQRSLVGSFEESLFSGRLSYGQANQKIDGFLAILSIAGGNISPKSQKLPFSVTSVGDDCF 411
Query: 438 LLYYSSINLS-GKLSSKSRVSKFQRTLSMDESRSEKSRIRIPMKGRIQLVL 487
LLYY+SI+LS G L SK K + + ++++ R+RIPMKGRIQLVL
Sbjct: 412 LLYYASIDLSGGSLPSKLWGQKLKTNQNKSDAQTINKRLRIPMKGRIQLVL 462
>AT3G26890.5 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G41110.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr3:9907456-9910463 REVERSE LENGTH=649
Length = 649
Score = 117 bits (292), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 48/359 (13%)
Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
VRKR+LSPLN + FRGD DI GNH+ G C++ ++ K + ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202
Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
+ T +S + + V++S S M ++ +C Y L +S
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260
Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
T K LP K++S SP LSPLG K S K + C+T ++ D N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310
Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
++ E +D+ +L+ D + + M + P K F ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353
Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
S P +RSL+ +QKI+GFLAVL++ GGN SP+SQK+PF V+S
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQKIDGFLAVLSIAGGNISPKSQKLPFSVTSAG 413
Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
D LLYY+SI+L+G S+++KF +T M+ +++S KS++RIPMKGRIQLVL
Sbjct: 414 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 468
>AT3G26890.4 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G41110.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr3:9907456-9910463 REVERSE LENGTH=649
Length = 649
Score = 117 bits (292), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 48/359 (13%)
Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
VRKR+LSPLN + FRGD DI GNH+ G C++ ++ K + ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202
Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
+ T +S + + V++S S M ++ +C Y L +S
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260
Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
T K LP K++S SP LSPLG K S K + C+T ++ D N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310
Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
++ E +D+ +L+ D + + M + P K F ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353
Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
S P +RSL+ +QKI+GFLAVL++ GGN SP+SQK+PF V+S
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQKIDGFLAVLSIAGGNISPKSQKLPFSVTSAG 413
Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
D LLYY+SI+L+G S+++KF +T M+ +++S KS++RIPMKGRIQLVL
Sbjct: 414 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 468
>AT3G26890.3 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G41110.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr3:9907456-9910463 REVERSE LENGTH=649
Length = 649
Score = 117 bits (292), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 48/359 (13%)
Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
VRKR+LSPLN + FRGD DI GNH+ G C++ ++ K + ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202
Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
+ T +S + + V++S S M ++ +C Y L +S
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260
Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
T K LP K++S SP LSPLG K S K + C+T ++ D N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310
Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
++ E +D+ +L+ D + + M + P K F ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353
Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
S P +RSL+ +QKI+GFLAVL++ GGN SP+SQK+PF V+S
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQKIDGFLAVLSIAGGNISPKSQKLPFSVTSAG 413
Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
D LLYY+SI+L+G S+++KF +T M+ +++S KS++RIPMKGRIQLVL
Sbjct: 414 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 468
>AT3G26890.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G41110.1); Has 393 Blast hits to 383 proteins
in 134 species: Archae - 0; Bacteria - 61; Metazoa -
171; Fungi - 74; Plants - 80; Viruses - 0; Other
Eukaryotes - 7 (source: NCBI BLink). |
chr3:9907456-9910463 REVERSE LENGTH=649
Length = 649
Score = 117 bits (292), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 48/359 (13%)
Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
VRKR+LSPLN + FRGD DI GNH+ G C++ ++ K + ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202
Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
+ T +S + + V++S S M ++ +C Y L +S
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260
Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
T K LP K++S SP LSPLG K S K + C+T ++ D N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310
Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
++ E +D+ +L+ D + + M + P K F ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353
Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
S P +RSL+ +QKI+GFLAVL++ GGN SP+SQK+PF V+S
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQKIDGFLAVLSIAGGNISPKSQKLPFSVTSAG 413
Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
D LLYY+SI+L+G S+++KF +T M+ +++S KS++RIPMKGRIQLVL
Sbjct: 414 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 468
>AT3G26890.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G41110.1); Has 393 Blast hits to 383 proteins
in 134 species: Archae - 0; Bacteria - 61; Metazoa -
171; Fungi - 74; Plants - 80; Viruses - 0; Other
Eukaryotes - 7 (source: NCBI BLink). |
chr3:9907456-9910463 REVERSE LENGTH=649
Length = 649
Score = 117 bits (292), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 48/359 (13%)
Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
VRKR+LSPLN + FRGD DI GNH+ G C++ ++ K + ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202
Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
+ T +S + + V++S S M ++ +C Y L +S
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260
Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
T K LP K++S SP LSPLG K S K + C+T ++ D N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310
Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
++ E +D+ +L+ D + + M + P K F ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353
Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
S P +RSL+ +QKI+GFLAVL++ GGN SP+SQK+PF V+S
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQKIDGFLAVLSIAGGNISPKSQKLPFSVTSAG 413
Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
D LLYY+SI+L+G S+++KF +T M+ +++S KS++RIPMKGRIQLVL
Sbjct: 414 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 468
>AT3G26890.6 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G41110.1). | chr3:9907456-9910463 REVERSE
LENGTH=648
Length = 648
Score = 110 bits (276), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 114/359 (31%), Positives = 174/359 (48%), Gaps = 49/359 (13%)
Query: 143 VRKRLLSPLNGMMLADHFRGDPLDIGEGNHRTCPKVGDD---CHNAIHEYKKVHIGDDIN 199
VRKR+LSPLN + FRGD DI GNH+ G C++ ++ K + ++
Sbjct: 144 VRKRVLSPLN-TLFPVKFRGDLHDISCGNHQQITYSGQSNGFCNSVAQDHIKANTPSRLH 202
Query: 200 IQPT-IWSSAYFQEFVNSSCNDSGM--NKNKIVXXXXXXXXXXXXXMC-YKHLKSSSELN 255
+ T +S + + V++S S M ++ +C Y L +S
Sbjct: 203 LSTTPTTTSCWEWKNVSNSGRLSSMVFTDGPLLDSVDLRQPVKGGEVCLYSPLYETSS-- 260
Query: 256 DSKERTKLKSLPVALSIPQKKVS-SPRFPLSPLGKKSSTNEK-LGGCRTIDVMLDDGNLS 313
T K LP K++S SP LSPLG K S K + C+T ++ D N+S
Sbjct: 261 -----TPNKPLPC-----DKEISVSPPLCLSPLGPKFSERVKAVRSCQTGKILEDLRNIS 310
Query: 314 FKDVEESLDKTCQGFKSQLYYNSMQQKSDMFTRDKIIDMNDYWTRPASFPPKHANFCGTM 373
++ E +D+ +L+ D + + M + P K F ++
Sbjct: 311 -EEAELRVDR-------RLF-------DDAYAIRRAFSMERSTESAPASPCKR--FIRSL 353
Query: 374 SRLPSRRSLIXXXXXXXXXXXXXXXKVSQKIEGFLAVLNVTGGNFSPQSQKVPFGVSSVQ 433
S P +RSL+ +Q I+GFLAVL++ GGN SP+SQK+PF V+S
Sbjct: 354 SGRPIQRSLVGSFEESLLTGRLSCGPTNQ-IDGFLAVLSIAGGNISPKSQKLPFSVTSAG 412
Query: 434 GDKYLLYYSSINLSGKLSSKSRVSKFQ----RTLSMD-ESRSEKSRIRIPMKGRIQLVL 487
D LLYY+SI+L+G S+++KF +T M+ +++S KS++RIPMKGRIQLVL
Sbjct: 413 DDCLLLYYASIDLAG----GSKLNKFWGQKVKTSQMNSDAQSSKSQLRIPMKGRIQLVL 467