Miyakogusa Predicted Gene
- Lj6g3v0003330.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v0003330.1 Non Characterized Hit- tr|C6T8U3|C6T8U3_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.30803
PE,88.76,0,UNCHARACTERIZED,Protein of unknown function DUF2363;
seg,NULL; DUF2363,Protein of unknown function D,CUFF.57356.1
(436 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments:                         Score (bits)  E Value
AT5G18420.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...           484    e-137
AT5G18420.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...           483    e-137
AT5G18420.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul...           476    e-134
>AT5G18420.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 25 plant
structures; EXPRESSED DURING: 15 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF2363
(InterPro:IPR019312); Has 1807 Blast hits to 1807
proteins in 277 species: Archaea - 0; Bacteria - 0;
Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0;
Other Eukaryotes - 339 (source: NCBI BLink). |
chr5:6105638-6109337 REVERSE LENGTH=441
Length = 441
Score = 484 bits (1245), Expect = e-137, Method: Compositional matrix adjust.
Identities = 242/437 (55%), Positives = 307/437 (70%), Gaps = 3/437 (0%)
Query: 1 MSIRRLSGEESMSLFSLLKADQRPIHEILAEFNSTLSRNRHFTLCSYLLMLLQDNKVLTT 60
M I R+ EES + SLLK+D RP+ ++L+EF+S R+ ++C+ L M+LQD ++
Sbjct: 6 MMIMRI--EESAIVLSLLKSDLRPMEDVLSEFDSKFESARYLSVCNSLSMMLQDQQMFKN 63
Query: 61 TERLIAFSLLLEAYSSQKPASNPFITFIINASCDEGSIKVERAFILQLLGVDSSNSGKEF 120
TERLIAF ++ + YSSQKP+ NPF++ +I+A+C+E K ERAF+L LL +S N+ KE
Sbjct: 64 TERLIAFGIIYQCYSSQKPSFNPFLSEMISAACNEQLEKSERAFLLHLLQWNSYNNVKEI 123
Query: 121 LKQSASDYVQGFDQSLHEFPPLDQLQQQFSDKVHPKPYHCLFKDGXXXXXXXXXXXXXXX 180
LK SA DY++ FD S H+FP L +LQ+++ DK P P +F D
Sbjct: 124 LKLSAVDYIRSFDPSTHDFPELGELQREYGDKADPGPSSHIFADYSLKKLLHDPDVPRGC 183
Query: 181 XXXXXEFDLRPGAKPKLGTGDKDEAVVXXXXXXXXXXXXPHWXXXXXXXXXXXDGELVWL 240
EFD++ G P++G+GD+DEA+ P W EL+W+
Sbjct: 184 DPNSPEFDVQAGVNPRIGSGDRDEALSGFLGNLTMGGLGPRWIRPCPPRYPVHQSELLWI 243
Query: 241 NPDDNHELMWDYGMCVDTSRGAAVRDLIAKALKGALAPTQQEQVLLELGNDPKLVYHCGL 300
+PD+ HEL+WD MC DTS GA VRDL+ K LK L+PT+QE ++ EL NDPKLV+HCG+
Sbjct: 244 DPDNKHELVWDDKMCADTSSGATVRDLLVKGLKVTLSPTEQEDIITELANDPKLVFHCGI 303
Query: 301 TPRKLPELVENNPLIAVDVLTKLINSPEIAEYFTVLVNMDMSLHSMEVVNRLTTAVELPS 360
TPRKLP+LVE+NP IAV++LTKL NS EI +Y+ L NMDMSLHSMEVVNRLTTAVELP
Sbjct: 304 TPRKLPQLVEHNPQIAVEILTKLNNSTEINDYYEALGNMDMSLHSMEVVNRLTTAVELPK 363
Query: 361 EFIHMYITNCISSCVSIK-DKYMQNRLVRLVCVFLQSLIRNNIINVKDLFIEVQAFCIEF 419
+FI MYITNCISSC + K DKYMQNRLVRLVCVFLQSLIRN IINVKDLFIEVQAFCIEF
Sbjct: 364 DFIRMYITNCISSCENAKQDKYMQNRLVRLVCVFLQSLIRNKIINVKDLFIEVQAFCIEF 423
Query: 420 SRIREAAALFRLLKSLE 436
SR+REAA LFRLLK L+
Sbjct: 424 SRVREAAGLFRLLKQLD 440
>AT5G18420.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 25 plant
structures; EXPRESSED DURING: 15 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF2363
(InterPro:IPR019312); Has 233 Blast hits to 233 proteins
in 86 species: Archaea - 0; Bacteria - 0; Metazoa - 106;
Fungi - 2; Plants - 59; Viruses - 0; Other Eukaryotes -
66 (source: NCBI BLink). | chr5:6105638-6109337 REVERSE
LENGTH=442
Length = 442
Score = 483 bits (1244), Expect = e-137, Method: Compositional matrix adjust.
Identities = 242/438 (55%), Positives = 307/438 (70%), Gaps = 4/438 (0%)
Query: 1 MSIRRLSGEESMSLFSLLKADQRPIHEILAEFNSTLSRNRHFTLCSYLLMLLQDNKVLTT 60
M I R+ EES + SLLK+D RP+ ++L+EF+S R+ ++C+ L M+LQD ++
Sbjct: 6 MMIMRI--EESAIVLSLLKSDLRPMEDVLSEFDSKFESARYLSVCNSLSMMLQDQQMFKN 63
Query: 61 TERLIAFSLLLEAYSSQKPASNPFITFIINASCDEGSIKVERAFILQLLGVDSSNSGKEF 120
TERLIAF ++ + YSSQKP+ NPF++ +I+A+C+E K ERAF+L LL +S N+ KE
Sbjct: 64 TERLIAFGIIYQCYSSQKPSFNPFLSEMISAACNEQLEKSERAFLLHLLQWNSYNNVKEI 123
Query: 121 LKQSASDYVQGFDQSLHEFPPLDQLQQQFSDKVHPKPYHCLFKDGXXXXXXXXXXXXXXX 180
LK SA DY++ FD S H+FP L +LQ+++ DK P P +F D
Sbjct: 124 LKLSAVDYIRSFDPSTHDFPELGELQREYGDKADPGPSSHIFADYSLKKLLHDPDVPRGC 183
Query: 181 XXXXXEFDLRPGAKPKLGTGDKDEAVVXXXXXXXXXXXXPHWXXXXXXXXXXXDGELVWL 240
EFD++ G P++G+GD+DEA+ P W EL+W+
Sbjct: 184 DPNSPEFDVQAGVNPRIGSGDRDEALSGFLGNLTMGGLGPRWIRPCPPRYPVHQSELLWI 243
Query: 241 NPDDNHELMWDYGMCVDTSRGAAVRDLIAKALKGALAPTQQEQVLLELGNDPKLVYHCGL 300
+PD+ HEL+WD MC DTS GA VRDL+ K LK L+PT+QE ++ EL NDPKLV+HCG+
Sbjct: 244 DPDNKHELVWDDKMCADTSSGATVRDLLVKGLKVTLSPTEQEDIITELANDPKLVFHCGI 303
Query: 301 TPRKLPELVENNPLIAVDVLTKLINSPEIAEYFTVLVNMDMSLHSMEVVNRLTTAVELPS 360
TPRKLP+LVE+NP IAV++LTKL NS EI +Y+ L NMDMSLHSMEVVNRLTTAVELP
Sbjct: 304 TPRKLPQLVEHNPQIAVEILTKLNNSTEINDYYEALGNMDMSLHSMEVVNRLTTAVELPK 363
Query: 361 EFIHMYITNCISSCVSIK--DKYMQNRLVRLVCVFLQSLIRNNIINVKDLFIEVQAFCIE 418
+FI MYITNCISSC + K DKYMQNRLVRLVCVFLQSLIRN IINVKDLFIEVQAFCIE
Sbjct: 364 DFIRMYITNCISSCENAKQQDKYMQNRLVRLVCVFLQSLIRNKIINVKDLFIEVQAFCIE 423
Query: 419 FSRIREAAALFRLLKSLE 436
FSR+REAA LFRLLK L+
Sbjct: 424 FSRVREAAGLFRLLKQLD 441
>AT5G18420.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 25 plant
structures; EXPRESSED DURING: 15 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF2363
(InterPro:IPR019312); Has 35333 Blast hits to 34131
proteins in 2444 species: Archaea - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr5:6105638-6109337 REVERSE LENGTH=439
Length = 439
Score = 476 bits (1224), Expect = e-134, Method: Compositional matrix adjust.
Identities = 241/438 (55%), Positives = 305/438 (69%), Gaps = 7/438 (1%)
Query: 1 MSIRRLSGEESMSLFSLLKADQRPIHEILAEFNSTLSRNRHFTLCSYLLMLLQDNKVLTT 60
M I R+ EES + SLLK+D RP+ ++L+EF+S R+ ++C+ L M+LQ +
Sbjct: 6 MMIMRI--EESAIVLSLLKSDLRPMEDVLSEFDSKFESARYLSVCNSLSMMLQ---MFKN 60
Query: 61 TERLIAFSLLLEAYSSQKPASNPFITFIINASCDEGSIKVERAFILQLLGVDSSNSGKEF 120
TERLIAF ++ + YSSQKP+ NPF++ +I+A+C+E K ERAF+L LL +S N+ KE
Sbjct: 61 TERLIAFGIIYQCYSSQKPSFNPFLSEMISAACNEQLEKSERAFLLHLLQWNSYNNVKEI 120
Query: 121 LKQSASDYVQGFDQSLHEFPPLDQLQQQFSDKVHPKPYHCLFKDGXXXXXXXXXXXXXXX 180
LK SA DY++ FD S H+FP L +LQ+++ DK P P +F D
Sbjct: 121 LKLSAVDYIRSFDPSTHDFPELGELQREYGDKADPGPSSHIFADYSLKKLLHDPDVPRGC 180
Query: 181 XXXXXEFDLRPGAKPKLGTGDKDEAVVXXXXXXXXXXXXPHWXXXXXXXXXXXDGELVWL 240
EFD++ G P++G+GD+DEA+ P W EL+W+
Sbjct: 181 DPNSPEFDVQAGVNPRIGSGDRDEALSGFLGNLTMGGLGPRWIRPCPPRYPVHQSELLWI 240
Query: 241 NPDDNHELMWDYGMCVDTSRGAAVRDLIAKALKGALAPTQQEQVLLELGNDPKLVYHCGL 300
+PD+ HEL+WD MC DTS GA VRDL+ K LK L+PT+QE ++ EL NDPKLV+HCG+
Sbjct: 241 DPDNKHELVWDDKMCADTSSGATVRDLLVKGLKVTLSPTEQEDIITELANDPKLVFHCGI 300
Query: 301 TPRKLPELVENNPLIAVDVLTKLINSPEIAEYFTVLVNMDMSLHSMEVVNRLTTAVELPS 360
TPRKLP+LVE+NP IAV++LTKL NS EI +Y+ L NMDMSLHSMEVVNRLTTAVELP
Sbjct: 301 TPRKLPQLVEHNPQIAVEILTKLNNSTEINDYYEALGNMDMSLHSMEVVNRLTTAVELPK 360
Query: 361 EFIHMYITNCISSCVSIK--DKYMQNRLVRLVCVFLQSLIRNNIINVKDLFIEVQAFCIE 418
+FI MYITNCISSC + K DKYMQNRLVRLVCVFLQSLIRN IINVKDLFIEVQAFCIE
Sbjct: 361 DFIRMYITNCISSCENAKQQDKYMQNRLVRLVCVFLQSLIRNKIINVKDLFIEVQAFCIE 420
Query: 419 FSRIREAAALFRLLKSLE 436
FSR+REAA LFRLLK L+
Sbjct: 421 FSRVREAAGLFRLLKQLD 438
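
Note on the percentages in the Identities / Positives / Gaps lines above: judging only from the figures in this report, they appear to be truncated rather than rounded (305/438 is about 69.6% but is printed as 69%; 7/438 is about 1.6% but is printed as 1%). The following is a minimal Python sketch for rechecking such a line. The function name `check_stats_line` and the regular expression are illustrative assumptions, not part of BLAST, and the truncation rule is inferred from this output, not taken from BLAST documentation.

```python
import re

# Matches fields such as "Identities = 241/438 (55%)" in a BLAST statistics line.
STATS_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def check_stats_line(line):
    """Return {field: (count, aligned_length, reported_pct, recomputed_pct)}.

    The recomputed percentage uses integer (floor) division, which is an
    assumption inferred from the numbers in this report.
    """
    result = {}
    for name, count, length, reported in STATS_RE.findall(line):
        count, length, reported = int(count), int(length), int(reported)
        result[name] = (count, length, reported, count * 100 // length)
    return result


if __name__ == "__main__":
    # Statistics line copied from the AT5G18420.3 hit above.
    line = "Identities = 241/438 (55%), Positives = 305/438 (69%), Gaps = 7/438 (1%)"
    for name, (count, length, reported, recomputed) in check_stats_line(line).items():
        status = "ok" if reported == recomputed else "mismatch"
        print(f"{name}: {count}/{length} reported {reported}% recomputed {recomputed}% {status}")
```

Applied to the three statistics lines in this report, the recomputed values agree with the printed ones under the truncation assumption.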