Miyakogusa Predicted Gene
- Lj4g3v2021640.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2021640.1 Non Characterized Hit- tr|I3SVM3|I3SVM3_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2
SV=1,99.64,0,coiled-coil,NULL; seg,NULL,CUFF.50144.1
(304 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                     Score  E
Sequences producing significant alignments:                         (bits)  Value
AT1G67170.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     154  6e-38
AT5G61920.2 | Symbols: | unknown protein; INVOLVED IN: biologic...     140  8e-34
AT5G61920.1 | Symbols: | unknown protein; INVOLVED IN: biologic...     140  8e-34
AT3G14750.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     135  5e-32
AT1G55170.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     114  7e-26
AT2G30120.2 | Symbols: | unknown protein; EXPRESSED IN: 22 plan...      74  1e-13
AT2G30120.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      61  1e-09
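The summary lines above each end with a bit score and an E-value, so they can be read back by splitting off the last two whitespace-separated fields. The following is a minimal sketch, not part of the BLAST output; the function name `parse_hit_line` is hypothetical, and the handling of E-values printed without a leading digit (such as `e-100`) is an assumption about legacy BLAST text output.

```python
def parse_hit_line(line):
    """Split one hit-summary line into (identifier, bit score, E-value).

    Returns None when the line does not end in two numeric fields.
    """
    parts = line.rsplit(None, 2)  # description, bit score, E-value
    if len(parts) != 3:
        return None
    description, bits_text, evalue_text = parts
    # Assumption: very small E-values may be printed as "e-100"; make that parseable.
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    try:
        bits = float(bits_text)
        evalue = float(evalue_text)
    except ValueError:
        return None
    identifier = description.split()[0]  # first token, e.g. "AT1G67170.1"
    return identifier, bits, evalue


hit = parse_hit_line(
    "AT1G67170.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 154 6e-38"
)
```

Because only the last two fields are split off, numbers inside the description (for example "22 plan...") do not interfere with the parse.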
>AT1G67170.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G14750.1); Has 5478 Blast hits to 4354 proteins
in 533 species: Archae - 87; Bacteria - 653; Metazoa -
2554; Fungi - 380; Plants - 418; Viruses - 16; Other
Eukaryotes - 1370 (source: NCBI BLink). |
chr1:25127727-25129145 FORWARD LENGTH=359
Length = 359
Score = 154 bits (390), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 162/281 (57%), Gaps = 27/281 (9%)
Query: 1 MDARGKV-PTSSFERRLVQASG--------MMRHGQLPGLXXXXXXXXXXXXXXHRSLES 51
M+++G++ P+ RR + G HG +P + S
Sbjct: 1 MESKGRIHPSHHHMRRPLPGPGGCIAHPETFGNHGAIP---------PSAAQGVYPSFNM 51
Query: 52 LPQSHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTES 111
LP ++E K Q E++RLA +N+RL TH +LR +L +A ++Q L + I S+++E
Sbjct: 52 LPPPEVMEQKFVAQHGELQRLAIENQRLGGTHGSLRQELAAAQHEIQMLHAQIGSMKSER 111
Query: 112 DIQIRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAI 171
+ ++ L +K+AKME +++ ++V+ ++QQA EA+SL +R+EL +++ + QE++K+
Sbjct: 112 EQRMMGLAEKVAKMETELQKSEAVKLEMQQARAEARSLVVAREELMSKVHQLTQELQKSR 171
Query: 172 SDVKSLPDLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQA 231
SDV+ +P L +EL++L QE Q+ R+T++YEK + ++ ++ EKN + MAREVE LQA
Sbjct: 172 SDVQQIPALMSELENLRQEYQQCRATYDYEKKFYNDHLESLQAMEKNYMTMAREVEKLQA 231
Query: 232 EIL---NAEKRANAPNLFRATTPVD------GSGSFSDPYG 263
+++ N+++RA P +D G+G + D +G
Sbjct: 232 QLMNNANSDRRAGGPYGNNINAEIDASGHQSGNGYYEDAFG 272
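The Identities, Positives and Gaps figures in an alignment header are counts over the alignment length. The sketch below re-derives the percentages for the AT1G67170.1 hit; it assumes the percentages are truncated to an integer rather than rounded, which matches this report (162/281 is shown as 57%). The names `parse_stats` and `truncated_percent` are hypothetical.

```python
import re

# Matches entries such as "Positives = 162/281 (57%)".
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STATS.findall(line)
    }


def truncated_percent(count, length):
    """Integer percentage, truncated (assumed to mirror the report)."""
    return 100 * count // length


stats = parse_stats(
    "Identities = 90/281 (32%), Positives = 162/281 (57%), Gaps = 27/281 (9%)"
)
recomputed = {name: truncated_percent(c, n) for name, (c, n, _) in stats.items()}
```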
>AT5G61920.2 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4
anthesis, F mature embryo stage, petal differentiation
and expansion stage, E expanded cotyledon stage, D
bilateral stage; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G67170.1); Has 30201 Blast
hits to 17322 proteins in 780 species: Archae - 12;
Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:24864830-24865628 FORWARD LENGTH=238
Length = 238
Score = 140 bits (354), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 79/186 (42%), Positives = 128/186 (68%)
Query: 55 SHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESDIQ 114
S +LE+K+A Q EI+RL+ DNR+LA ++VAL++DL A ++VQ L++HIR +T+ +IQ
Sbjct: 50 SDILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKTETDHEIQ 109
Query: 115 IRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAISDV 174
IR L+KIAKME ++ +++R+++Q A IEA LA R+EL+++++ +++KK +
Sbjct: 110 IRSTLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEA 169
Query: 175 KSLPDLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQAEIL 234
+SL EL+ L +E QRLR FE EKS N+E + Q+K E+ +I + +E L++EI
Sbjct: 170 ESLEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEIS 229
Query: 235 NAEKRA 240
A +A
Sbjct: 230 TARNKA 235
>AT5G61920.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4
anthesis, F mature embryo stage, petal differentiation
and expansion stage, E expanded cotyledon stage, D
bilateral stage; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G67170.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:24864830-24865628 FORWARD LENGTH=238
Length = 238
Score = 140 bits (354), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 79/186 (42%), Positives = 128/186 (68%)
Query: 55 SHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESDIQ 114
S +LE+K+A Q EI+RL+ DNR+LA ++VAL++DL A ++VQ L++HIR +T+ +IQ
Sbjct: 50 SDILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKTETDHEIQ 109
Query: 115 IRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAISDV 174
IR L+KIAKME ++ +++R+++Q A IEA LA R+EL+++++ +++KK +
Sbjct: 110 IRSTLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEA 169
Query: 175 KSLPDLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQAEIL 234
+SL EL+ L +E QRLR FE EKS N+E + Q+K E+ +I + +E L++EI
Sbjct: 170 ESLEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEIS 229
Query: 235 NAEKRA 240
A +A
Sbjct: 230 TARNKA 235
>AT3G14750.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G67170.1); Has 4036 Blast hits to 3091 proteins
in 519 species: Archae - 61; Bacteria - 669; Metazoa -
1503; Fungi - 255; Plants - 421; Viruses - 4; Other
Eukaryotes - 1123 (source: NCBI BLink). |
chr3:4953765-4955373 REVERSE LENGTH=331
Length = 331
Score = 135 bits (339), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 123/190 (64%)
Query: 53 PQSHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESD 112
PQ +LED++A Q ++++ L DN+RLA THVAL+ +L A ++Q++ +I S++ E +
Sbjct: 65 PQFSILEDRLAAQNQDVQGLLADNQRLAATHVALKQELEVAQHELQRIMHYIDSLRAEEE 124
Query: 113 IQIRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAIS 172
I +R + DK + E+++R D++R ++Q+ + + + RQEL++++ Q++ + +
Sbjct: 125 IMMREMYDKSMRSEMELREVDAMRAEIQKIRADIKEFTSGRQELTSQVHLMTQDLARLTA 184
Query: 173 DVKSLPDLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQAE 232
D++ +P L AE+++ QE QR R+ +YEK E + KI E L+AMARE+E L+AE
Sbjct: 185 DLQQIPTLTAEIENTKQELQRARAAIDYEKKGYAENYEHGKIMEHKLVAMARELEKLRAE 244
Query: 233 ILNAEKRANA 242
I N+E A A
Sbjct: 245 IANSETSAYA 254
>AT1G55170.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G14750.1); Has 13439 Blast hits to 8993
proteins in 828 species: Archae - 344; Bacteria - 1469;
Metazoa - 6958; Fungi - 1008; Plants - 683; Viruses -
29; Other Eukaryotes - 2948 (source: NCBI BLink). |
chr1:20580578-20581706 FORWARD LENGTH=283
Length = 283
Score = 114 bits (286), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 72/208 (34%), Positives = 119/208 (57%), Gaps = 6/208 (2%)
Query: 59 EDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESDIQIRVL 118
E ++ Q+ EI RL DN LA + L +LV+A +++ ++ I ++ E D+Q+R
Sbjct: 49 EGEIRRQDAEIRRLLSDNHGLADDRMVLERELVAAKEELHRMNLMISDLRAEQDLQLREF 108
Query: 119 LDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAISDVKSLP 178
+K K+E D+RA +S +K+ Q E Q L ++ELS +Q +++ K SD K +P
Sbjct: 109 SEKRHKLEGDVRAMESYKKEASQLRGEVQKLDEIKRELSGNVQLLRKDLAKLQSDNKQIP 168
Query: 179 DLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQAEILNAEK 238
++AE+ DL +E R EYEK + EL++Q + EKN+++MAREVE L+AE+ +
Sbjct: 169 GMRAEVKDLQKELMHARDAIEYEKKEKFELMEQRQTMEKNMVSMAREVEKLRAELATVDS 228
Query: 239 RANAPNLFRATTPVDGS---GSFSDPYG 263
R P F + ++ + G+F YG
Sbjct: 229 R---PWGFGGSYGMNYNNMDGTFRGSYG 253
>AT2G30120.2 | Symbols: | unknown protein; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 12 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G14750.1); Has 2527 Blast hits to 2101 proteins
in 358 species: Archae - 77; Bacteria - 245; Metazoa -
1087; Fungi - 215; Plants - 350; Viruses - 4; Other
Eukaryotes - 549 (source: NCBI BLink). |
chr2:12860607-12861862 REVERSE LENGTH=288
Length = 288
Score = 73.9 bits (180), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 106/210 (50%), Gaps = 1/210 (0%)
Query: 54 QSHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESDI 113
+S +LED++A Q EI+ L DN+RLA H+ L+D L A +++++L ++ E +
Sbjct: 35 RSVILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEA 94
Query: 114 QIRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAISD 173
++R + +ME + R D + +L Q + Q L + RQEL+ E+ E+ KA +
Sbjct: 95 KVREVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKAKPN 154
Query: 174 VKSLPDLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQAEI 233
+++ E++ L E ++ R+ E EK + + EK + + RE+ L+ E+
Sbjct: 155 SDRAIEVKLEIEILRGEIRKGRAALELEKKTRASNLHHERGMEKTIDHLNREIVKLEEEL 214
Query: 234 LNAEKRANAPNLFRATTPVDGSGSFSDPYG 263
++ E +A N P G + YG
Sbjct: 215 VDLETKAREANAAAEAAPTPSPG-LAASYG 243
>AT2G30120.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G14750.1); Has 275 Blast hits to 241 proteins
in 42 species: Archae - 4; Bacteria - 15; Metazoa - 15;
Fungi - 4; Plants - 188; Viruses - 0; Other Eukaryotes -
49 (source: NCBI BLink). | chr2:12861332-12861862
REVERSE LENGTH=176
Length = 176
Score = 60.8 bits (146), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 66/117 (56%)
Query: 54 QSHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESDI 113
+S +LED++A Q EI+ L DN+RLA H+ L+D L A +++++L ++ E +
Sbjct: 35 RSVILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEA 94
Query: 114 QIRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKA 170
++R + +ME + R D + +L Q + Q L + RQEL+ E+ E+ KA
Sbjct: 95 KVREVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKA 151
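The Expect values in this report are related to the bit scores by the standard BLAST relation E = m * n * 2^(-S'), where S' is the bit score, m the query length and n the database length. The sketch below applies it to the top hit using the raw lengths stated in the report (304 query letters, 14,482,855 database letters). This is only an approximation: BLAST uses length-adjusted effective lengths, so the estimate (on the order of 2e-37) is somewhat larger than the reported 6e-38. The function name `expected_hits` is hypothetical.

```python
def expected_hits(bit_score, query_len, db_len):
    """Approximate E-value from a bit score using raw (unadjusted) lengths."""
    return query_len * db_len * 2.0 ** (-bit_score)


# Top hit AT1G67170.1: 154 bits, query of 304 letters, database of 14,482,855 letters.
estimate = expected_hits(154, 304, 14_482_855)
print(f"{estimate:.1e}")
```

Each additional bit of score halves the expected number of chance hits, which is why the 140-bit hits have E-values roughly four orders of magnitude larger than the 154-bit hit.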