Miyakogusa Predicted Gene
- Lj1g3v3300040.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3300040.1 Non Chatacterized Hit- tr|I1N5B1|I1N5B1_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.41437
PE,69.18,0,DUF4378,Domain of unknown function DUF4378; VARLMGL,NULL;
seg,NULL,CUFF.30300.1
(922 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G67040.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 262 7e-70
AT5G26910.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 67 5e-11
AT5G26910.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 67 5e-11
AT3G58650.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 59 1e-08
>AT1G67040.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 20 plant
structures; EXPRESSED DURING: 11 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G26910.3); Has 89 Blast hits to 84 proteins in
15 species: Archae - 0; Bacteria - 0; Metazoa - 5; Fungi
- 2; Plants - 82; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr1:25019105-25021922 REVERSE
LENGTH=826
Length = 826
Score = 262 bits (670), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 277/929 (29%), Positives = 412/929 (44%), Gaps = 144/929 (15%)
Query: 10 AITEKKVQKPGGCVGIFLQLIDWKRRLAKKKLFSQKLLTPAR--AKKFRGDEKMPNSKLH 67
AITEK+ + GGCVG+F QL DW RR AKKKLFS+K L P + +K+F G+EKM SKL+
Sbjct: 14 AITEKRPNRLGGCVGVFFQLFDWNRRFAKKKLFSRKSLLPGKQVSKRFGGNEKMLKSKLN 73
Query: 68 LIANENSEGFPSAKKGGSHGVDVEQKTDMRAPSLVARLMGLEYIPAAQRNNSKEVLNSGS 127
LI +EN FP+ + V +K +MR+PSLVARLMGLE +P+ R+ K
Sbjct: 74 LIDDENRGSFPNRNE-----VMEVKKHEMRSPSLVARLMGLESMPSNHRDKGKNKKKKPL 128
Query: 128 CGDGKESLANNCELHNRKGVDLDMGVVKHDSRPQKLQKTGAACERR-AVTRFGAEALHIR 186
+++ + C+L + + + D GV K RPQK+Q+T C+RR AV +FG+EAL I+
Sbjct: 129 FSQIQDT--DKCDLFDVEEEEEDSGVDK--LRPQKMQRTTGVCDRRVAVKKFGSEALQIK 184
Query: 187 SVLSRARKXXXXXXXXPKLASPLKXXXXXXXXXXXXXXXLIGAATRILEPGLQARKGSLT 246
+VL+R RK KLASP++ LI AA RILEPG + KG++
Sbjct: 185 NVLTRVRKHHQYNHQHQKLASPVR-----SPRMNRRSSRLIDAAARILEPGKRNAKGAIA 239
Query: 247 YPACTYPHETNTVTKDAQDWSTVMQNQSCYDAGRSKNLMGQTSCKNCGNLLDVIECKQEV 306
YP T K+ V+ + + G + ++ SCK+CG+L+DV
Sbjct: 240 YPGSTGIRRFENAAKEP-----VVSPE--FQCGYNNSV---ASCKSCGSLVDVNGS---- 285
Query: 307 PVPISVVSDVATANSMVSSSQKEKSFTPSHGQKRDIVLLRSKEKLISLATEEEGKNNAQQ 366
I VV D T N+M S+ TP KR+ V R+++ +S++ GK++ Q
Sbjct: 286 ---IQVVQD--TGNNMACVSES----TPFQRSKRN-VFWRNEDSSVSVS----GKDSTDQ 331
Query: 367 SWNEPATRRMAMSCEDNASSFPSKHKIQTQKQMLSTEKYSPGSTMSSNMQVKRGSSSA-- 424
+ R +D S +++ + K++L E+ P S + KR SS
Sbjct: 332 MVKKALHR---AQFKDEMSLPGYRNRSEYHKKVLHREERFPPEARSFALPSKRSCSSPAN 388
Query: 425 ---SGTKDFVALNRSLSGRTRM-RSPTKLDSSKFDIEKKPCNRQLSPLSHERTLEKKRRI 480
S KDF+A+NR + R+ +SP K ++S ++++K SH R E R
Sbjct: 389 AINSKEKDFIAMNRGSTSRSHHSKSPVKFENSDLNLQRK---------SHTRVEESCNR- 438
Query: 481 LNVSQLEGTASVGLKQKDLCSDALGGKRRDFPPSSLNSFKVKNKRDGQGEEINKVIDNKI 540
G ++ G K++ C G P V + D E + N+
Sbjct: 439 ------SGLSTPGRKRRLACESGHGRGSSSMSP-------VSRRLDS---EYSCACSNET 482
Query: 541 DDVVSFTFNSPLKQKTGIPLEKEETSCNNE--RNAYCNRLSSPLKADALGAFLEQKLKEL 598
S S + + E +E R ++ R PL ++QKLKEL
Sbjct: 483 -AFSSLKLGSSNRHYSQCCRETKERRGVQRVPRPSFTKR---PLLDVGTLGLIQQKLKEL 538
Query: 599 TSQEDDEL-ASSALPKKPSSVILQELLS--ALNSEHLVCHDGHVFNDNCVAKQERLIGNT 655
SQE+DE S P KP+S+IL ELLS AL + V + + IGN
Sbjct: 539 ASQEEDEANGESGFPNKPASLILHELLSSLALQQQPYVRDIDMPYRRKGKTEFWSSIGNA 598
Query: 656 FNGNQLSPGXXXXXXXXXXXXXXXXGHGFHPYSMNYSYG-QPEQWDHDIELSDSATSFNN 714
N SPG + F S +P + D DI L D ATSF N
Sbjct: 599 -NSEYTSPG---SVLDASFSNESCFSNSFDNISGQMRLPLEPIEPDWDI-LEDYATSFKN 653
Query: 715 GM-------IGEILSQIPSALQCLHSFGRQFPRSKVNNVKDVLLNAELVLRIATDHNEGE 767
I ++S + + L+CL + G + + ++V+++ EL++ T
Sbjct: 654 STSDGNYQAIASLISHVSNVLRCLSNTGLILTQQRFTIAREVIIHTELLVGTTTTQENYL 713
Query: 768 VSXXXXXXXXXXXXDTMASDAMWTEFEGFMAYKGDSKQRNKLK----GFLFDCVVEYLES 823
+ F+ M Y S L GFL D ++E+LE
Sbjct: 714 IGPEL--------------------FDELMIYAARSDNLVNLPGLTGGFLVDAMIEHLE- 752
Query: 824 NCCKYFHSGFRAWTKLPLCVKGNVLAQEVKREMKKWECKAGMMPDEIIEWEMSHSLGKWT 883
+ PL K + L + V E+ KW A + DE+I EM
Sbjct: 753 ------ETNISCGLLKPLTAKQDELIRGVIEEVPKW---ARVNMDEVIGIEM-------- 795
Query: 884 DFDIEAFEAGVGIDGDILQILVGEIVEDL 912
D + F G I +IL+ L+GE+ DL
Sbjct: 796 DLETHLFGVGSEIAYEILRCLIGELATDL 824
>AT5G26910.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: mitochondrion;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT3G58650.1). |
chr5:9466169-9469523 REVERSE LENGTH=852
Length = 852
Score = 67.4 bits (163), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 67/137 (48%), Gaps = 12/137 (8%)
Query: 784 MASDAMWTEFEGFMAYKGDSKQRNKLKGFLFDCVVEYLESNCCKYFHSGFRAWTKLPLCV 843
MA+D + M +G+ + LFD V + L C + F R L
Sbjct: 694 MATDVLPASLFDEMEGRGEVTAAKIKRKTLFDFVNKCLALRCEQMFMGSCRG-----LLG 748
Query: 844 KGNVL-------AQEVKREMKKWECKAGMMPDEIIEWEMSHSLGKWTDFDIEAFEAGVGI 896
KG L A+E+ RE+ + MM DE+++ EMS G+W DF+ E +E G+ I
Sbjct: 749 KGGFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMSSFEGRWLDFERETYEEGIDI 808
Query: 897 DGDILQILVGEIVEDLV 913
+G+I+ LV ++V DLV
Sbjct: 809 EGEIVSTLVDDLVNDLV 825
>AT5G26910.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G58650.1); Has 1322 Blast hits to 684 proteins
in 162 species: Archae - 4; Bacteria - 497; Metazoa -
157; Fungi - 101; Plants - 155; Viruses - 0; Other
Eukaryotes - 408 (source: NCBI BLink). |
chr5:9466169-9469523 REVERSE LENGTH=853
Length = 853
Score = 67.4 bits (163), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 67/137 (48%), Gaps = 12/137 (8%)
Query: 784 MASDAMWTEFEGFMAYKGDSKQRNKLKGFLFDCVVEYLESNCCKYFHSGFRAWTKLPLCV 843
MA+D + M +G+ + LFD V + L C + F R L
Sbjct: 695 MATDVLPASLFDEMEGRGEVTAAKIKRKTLFDFVNKCLALRCEQMFMGSCRG-----LLG 749
Query: 844 KGNVL-------AQEVKREMKKWECKAGMMPDEIIEWEMSHSLGKWTDFDIEAFEAGVGI 896
KG L A+E+ RE+ + MM DE+++ EMS G+W DF+ E +E G+ I
Sbjct: 750 KGGFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMSSFEGRWLDFERETYEEGIDI 809
Query: 897 DGDILQILVGEIVEDLV 913
+G+I+ LV ++V DLV
Sbjct: 810 EGEIVSTLVDDLVNDLV 826
>AT3G58650.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 14 plant structures; EXPRESSED DURING: 8
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G26910.1); Has 2350 Blast
hits to 1412 proteins in 248 species: Archae - 0;
Bacteria - 487; Metazoa - 577; Fungi - 236; Plants -
184; Viruses - 4; Other Eukaryotes - 862 (source: NCBI
BLink). | chr3:21696349-21699219 REVERSE LENGTH=820
Length = 820
Score = 59.3 bits (142), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 62/112 (55%), Gaps = 14/112 (12%)
Query: 813 LFDCV-----VEY---LESNCCKYFHSGFRAWTKLPLCVKGNVLAQEVKREMKKWECKAG 864
LFDCV V++ L +C SG L ++LA+EV RE+K +
Sbjct: 708 LFDCVNQCLAVKFERMLIGSCKGMMMSGG------ILLEHRDLLAEEVNREVKGLKKMRE 761
Query: 865 MMPDEIIEWEMSHSLGKWTDFDIEAFEAGVGIDGDILQILVGEIVEDLVGSN 916
MM DE+++ +MS G+W ++ E FE G+ ++G+I+ LV ++V D++ ++
Sbjct: 762 MMIDELVDHDMSCFEGRWIGYEREMFEEGIDMEGEIVSALVDDLVSDILSTS 813