Miyakogusa Predicted Gene
- Lj1g3v4027100.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4027100.1 Non Characterized Hit- tr|Q9AV94|Q9AV94_SOYBN
Putative uncharacterized protein OS=Glycine max PE=2
S,70.43,0,seg,NULL,CUFF.31787.1
(808 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
AT5G17910.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   234   2e-61
AT5G17910.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   234   2e-61
AT2G29620.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   116   8e-26
AT1G07330.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   109   8e-24
AT5G58880.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    89   1e-17
>AT5G17910.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: mitochondrion;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT2G29620.1). |
chr5:5927906-5932292 FORWARD LENGTH=1342
Length = 1342
Score = 234 bits (596), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 179/436 (41%), Positives = 242/436 (55%), Gaps = 61/436 (13%)
Query: 9   SKSAIKWTEDDQKNLLDLGTXXXXXXXXXXXXIXXXXXX---XLMAEKNLIDFDSADIPC 65
           SKSAIKWTE DQ+N++DLG+            I          LMAE+NLIDFDSADIP
Sbjct: 351 SKSAIKWTEADQRNVMDLGSLELERNQRLENLIARRRARHNMRLMAERNLIDFDSADIPF 410
Query: 66  NVAPIAV-RRNPFDFPDDSYAGMGLPPIPGSAPSILQPRRNPFDIPYDSNEEKPDLKGDS 124
           N+ PI+  R NPFD   DSY  M   PIPGSAPSI+  RRNPFD+PY+ NEEKPDLKGD
Sbjct: 411 NMPPISTARHNPFDVSYDSYDDM---PIPGSAPSIMFARRNPFDLPYEPNEEKPDLKGDG 467
Query: 125 FQQEFTQFVQKDTFFRRHESFSMGPSVLGISKQERHDISWKPVFISERMASEGTSYPSFQ 184
           FQ+EF+    KD  FRRHESFS+GPS+LG  + +R     +P F+ ER+A+EGTSY  F+
Sbjct: 468 FQEEFSSQQPKDPMFRRHESFSVGPSMLGGPRHDR----LRPFFVLERLANEGTSYYPFE 523
Query: 185 RQSSEASDSKLSSVPDTESVSSG-DQDERKFSEQELSQETELTSNMDHLSDVVEHGSQSS 243
           RQ SE S+SK+SS+PDTESV +  + DE+K  E    +ET++ + +D +SD  E  + S+
Sbjct: 524 RQLSEVSESKVSSIPDTESVCTVLEDDEKKVDENNADRETKI-AKVDMVSDNDEENNHSA 582
Query: 244 GEND----------------SVELINVDESS----VHHDEGEIVLGGVE---DPSEMELY 280
            ++D                S E  + DE +    +HHD  EIVLG  E   + S+M +
Sbjct: 583 SDHDEENSHSASDHDEEKSHSSEDSDFDEQADSKKLHHDVAEIVLGSGETHHEQSDM-ME 641
Query: 281 TETGEVGTHEHFNAGETHLRREPSDEVXXXXXXXXXXXLQSEVID-------------GI 327
            ET + G  +  +  ++ L  +  +E            +  +V+D             G
Sbjct: 642 GETSDKGKLDEVSDSDSSLSEK--EEKIRDISEDEAMLISEQVVDLHEELGASSLPSFGE 699
Query: 328 PDENMEQTSNSQQEDSHLPESR-----ISRQASLEGSNFQNESGDVEDILHVEPVYDSSP 382
            +  NM   +   ++D H  E+R     I+   SL+ S      G + D  H EPVYDSSP
Sbjct: 700 LEINM---ARGVEDDYHHDEARAEESFITAHPSLDESAIHVLCG-LGDGDHEEPVYDSSP 755
Query: 383 SAAENLHSFPSVSSDF 398
            +     SF SVSSD+
Sbjct: 756 PSGSRFPSFSSVSSDY 771
>AT5G17910.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: mitochondrion;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT2G29620.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:5927906-5932292 FORWARD LENGTH=1342
Length = 1342
Score = 234 bits (596), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 179/436 (41%), Positives = 242/436 (55%), Gaps = 61/436 (13%)
Query: 9   SKSAIKWTEDDQKNLLDLGTXXXXXXXXXXXXIXXXXXX---XLMAEKNLIDFDSADIPC 65
           SKSAIKWTE DQ+N++DLG+            I          LMAE+NLIDFDSADIP
Sbjct: 351 SKSAIKWTEADQRNVMDLGSLELERNQRLENLIARRRARHNMRLMAERNLIDFDSADIPF 410
Query: 66  NVAPIAV-RRNPFDFPDDSYAGMGLPPIPGSAPSILQPRRNPFDIPYDSNEEKPDLKGDS 124
           N+ PI+  R NPFD   DSY  M   PIPGSAPSI+  RRNPFD+PY+ NEEKPDLKGD
Sbjct: 411 NMPPISTARHNPFDVSYDSYDDM---PIPGSAPSIMFARRNPFDLPYEPNEEKPDLKGDG 467
Query: 125 FQQEFTQFVQKDTFFRRHESFSMGPSVLGISKQERHDISWKPVFISERMASEGTSYPSFQ 184
           FQ+EF+    KD  FRRHESFS+GPS+LG  + +R     +P F+ ER+A+EGTSY  F+
Sbjct: 468 FQEEFSSQQPKDPMFRRHESFSVGPSMLGGPRHDR----LRPFFVLERLANEGTSYYPFE 523
Query: 185 RQSSEASDSKLSSVPDTESVSSG-DQDERKFSEQELSQETELTSNMDHLSDVVEHGSQSS 243
           RQ SE S+SK+SS+PDTESV +  + DE+K  E    +ET++ + +D +SD  E  + S+
Sbjct: 524 RQLSEVSESKVSSIPDTESVCTVLEDDEKKVDENNADRETKI-AKVDMVSDNDEENNHSA 582
Query: 244 GEND----------------SVELINVDESS----VHHDEGEIVLGGVE---DPSEMELY 280
            ++D                S E  + DE +    +HHD  EIVLG  E   + S+M +
Sbjct: 583 SDHDEENSHSASDHDEEKSHSSEDSDFDEQADSKKLHHDVAEIVLGSGETHHEQSDM-ME 641
Query: 281 TETGEVGTHEHFNAGETHLRREPSDEVXXXXXXXXXXXLQSEVID-------------GI 327
            ET + G  +  +  ++ L  +  +E            +  +V+D             G
Sbjct: 642 GETSDKGKLDEVSDSDSSLSEK--EEKIRDISEDEAMLISEQVVDLHEELGASSLPSFGE 699
Query: 328 PDENMEQTSNSQQEDSHLPESR-----ISRQASLEGSNFQNESGDVEDILHVEPVYDSSP 382
            +  NM   +   ++D H  E+R     I+   SL+ S      G + D  H EPVYDSSP
Sbjct: 700 LEINM---ARGVEDDYHHDEARAEESFITAHPSLDESAIHVLCG-LGDGDHEEPVYDSSP 755
Query: 383 SAAENLHSFPSVSSDF 398
            +     SF SVSSD+
Sbjct: 756 PSGSRFPSFSSVSSDY 771
>AT2G29620.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G07330.1); Has 887 Blast hits to 750 proteins
in 151 species: Archae - 2; Bacteria - 63; Metazoa -
270; Fungi - 51; Plants - 111; Viruses - 6; Other
Eukaryotes - 384 (source: NCBI BLink). |
chr2:12663200-12665803 REVERSE LENGTH=747
Length = 747
Score = 116 bits (290), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 66/139 (47%), Positives = 78/139 (56%), Gaps = 10/139 (7%)
Query: 10  KSAIKWTEDDQKNLLDLGTXXXXXXXXXXXXIXXXXXXXL---MAEKNLIDFDSADIPCN 66
           K  + WTEDDQKNL+DLGT            I            AE +L+D
Sbjct: 222 KVVVAWTEDDQKNLMDLGTSEIERNKRLENLISRRRSRRFFLLAAEGSLMD------DME 275
Query: 67  VAPIAVRRNPFDFPDDSYAGMGLPPIPGSAPSILQPRRNPFDIPYDSNEEKPDLKGDSFQ 126
           V  I + RN + F   +Y   GL  +PGSAPS+L PRRNPFD+PYD  EEKP+L GDSFQ
Sbjct: 276 VPRICIGRNFYGFDKGNYEIDGLV-MPGSAPSVLLPRRNPFDLPYDPLEEKPNLTGDSFQ 334
Query: 127 QEFTQFVQKDTFFRRHESF 145
           QEF +   KD FF RHESF
Sbjct: 335 QEFAETNPKDIFFCRHESF 353
>AT1G07330.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G29620.1); Has 597 Blast hits to 536 proteins
in 121 species: Archae - 2; Bacteria - 47; Metazoa -
170; Fungi - 43; Plants - 98; Viruses - 0; Other
Eukaryotes - 237 (source: NCBI BLink). |
chr1:2251131-2253585 FORWARD LENGTH=685
Length = 685
Score = 109 bits (273), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 69/160 (43%), Positives = 83/160 (51%), Gaps = 16/160 (10%)
Query: 10  KSAIKWTEDDQKNLLDLGTXXXXXXXXXXXXIXXXXXXXL---MAEKNLIDFDSADIPCN 66
           K  + WTEDDQKNL+DLG             I       L    AE +L+D +
Sbjct: 156 KKIVAWTEDDQKNLMDLGNSEMERNKRLEHLITRRRMRRLVRLAAESSLMDME------- 208
Query: 67  VAPIAVRRNPFDFPDDSYAGMGLPPIPGSAPSILQPRRNPFDIPYDSNEEKPDLKGDSFQ 126
           V P+ V RN F    ++Y   GL  +P SAPS+L P +NPFDIPYD  EEKP+L GDSFQ
Sbjct: 209 VPPVCVGRNYFGLDQENYIVDGLQ-MPESAPSVLLPTKNPFDIPYDPQEEKPNLSGDSFQ 267
Query: 127 QEFTQFVQKDTFFRRHESFSMGPSVLGISKQERHDISWKP 166
           QEF      D FF RHESF     V  +  Q   D  W+P
Sbjct: 268 QEFAAN-PNDIFFCRHESFCR--RVFPLDNQ--LDTKWEP 302
>AT5G58880.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G29620.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:23775966-23779504
FORWARD LENGTH=1088
Length = 1088
Score = 89.0 bits (219), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 38/55 (69%), Positives = 43/55 (78%)
Query: 92  IPGSAPSILQPRRNPFDIPYDSNEEKPDLKGDSFQQEFTQFVQKDTFFRRHESFS 146
           IPGSAPS++   RNPFDIPYD  EE+P+L GDSF QEF+ F QKD FF RHESF
Sbjct: 284 IPGSAPSVMLQGRNPFDIPYDPQEERPNLTGDSFDQEFSLFNQKDLFFCRHESFC 338