Miyakogusa Predicted Gene
- Lj1g3v3105130.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3105130.2 Non Characterized Hit- tr|I1KWP5|I1KWP5_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.28833
PE,70.87,0,coiled-coil,NULL; seg,NULL,CUFF.30060.2
(420 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Sequences producing significant alignments:    Score (bits)    E value
AT1G67170.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 281 4e-76
AT3G14750.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 151 8e-37
AT1G55170.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 132 5e-31
AT5G61920.2 | Symbols: | unknown protein; INVOLVED IN: biologic... 100 1e-21
AT5G61920.1 | Symbols: | unknown protein; INVOLVED IN: biologic... 100 1e-21
AT2G30120.2 | Symbols: | unknown protein; EXPRESSED IN: 22 plan... 90 3e-18
AT2G30120.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 60 3e-09
>AT1G67170.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G14750.1); Has 5478 Blast hits to 4354 proteins
in 533 species: Archae - 87; Bacteria - 653; Metazoa -
2554; Fungi - 380; Plants - 418; Viruses - 16; Other
Eukaryotes - 1370 (source: NCBI BLink). |
chr1:25127727-25129145 FORWARD LENGTH=359
Length = 359
Score = 281 bits (720), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 151/280 (53%), Positives = 192/280 (68%), Gaps = 9/280 (3%)
Query: 24 QVLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQM 83
+V+EQK +QH E+QRLA ENQRL THG+LRQ++AAAQHE+QMLHA MK+EREQ+M
Sbjct: 56 EVMEQKFVAQHGELQRLAIENQRLGGTHGSLRQELAAAQHEIQMLHAQIGSMKSEREQRM 115
Query: 84 RAVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQ 143
+ +K+ KME +LQ +E VK+E+QQAR E +SL+V+REEL++K H L QE+Q+ +DVQ
Sbjct: 116 MGLAEKVAKMETELQKSEAVKLEMQQARAEARSLVVAREELMSKVHQLTQELQKSRSDVQ 175
Query: 144 QIPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTN 203
QIPAL+SELE LRQEYQ CRAT++YEKK Y+DHLESLQ MEKNY++M+REVEKL+A+L N
Sbjct: 176 QIPALMSELENLRQEYQQCRATYDYEKKFYNDHLESLQAMEKNYMTMAREVEKLQAQLMN 235
Query: 204 KANVDQRSSGPYGGTSGTNENEASGLPVGQNAYEDGYAVAQGRAPLPPASSGGNTNIXXX 263
AN D+R+ GPYG E +ASG G YED + QG P P A + N
Sbjct: 236 NANSDRRAGGPYGNNINA-EIDASGHQSGNGYYEDAFG-PQGYIPQPVAGNATGPNSVVG 293
Query: 264 XXXXXXXXXXXXXYDAPRGPGYVAPTGPTYDAQRSGAYDP 303
Y P+ PGY P GP G+YDP
Sbjct: 294 AAQYPYQGVTQPGY-FPQRPGYNFPRGP------PGSYDP 326
>AT3G14750.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G67170.1); Has 4036 Blast hits to 3091 proteins
in 519 species: Archae - 61; Bacteria - 669; Metazoa -
1503; Fungi - 255; Plants - 421; Viruses - 4; Other
Eukaryotes - 1123 (source: NCBI BLink). |
chr3:4953765-4955373 REVERSE LENGTH=331
Length = 331
Score = 151 bits (381), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 85/223 (38%), Positives = 138/223 (61%), Gaps = 8/223 (3%)
Query: 25 VLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQMR 84
+LE +L +Q+ ++Q L +NQRLAATH AL+Q++ AQHELQ + + ++AE E MR
Sbjct: 69 ILEDRLAAQNQDVQGLLADNQRLAATHVALKQELEVAQHELQRIMHYIDSLRAEEEIMMR 128
Query: 85 AVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQQ 144
+ DK ++ E +L+ + ++ E+Q+ R +++ R+EL ++ H + Q++ R+ AD+QQ
Sbjct: 129 EMYDKSMRSEMELREVDAMRAEIQKIRADIKEFTSGRQELTSQVHLMTQDLARLTADLQQ 188
Query: 145 IPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTNK 204
IP L +E+E +QE Q RA +YEKK Y+++ E + ME V+M+RE+EKLRAE+ N
Sbjct: 189 IPTLTAEIENTKQELQRARAAIDYEKKGYAENYEHGKIMEHKLVAMARELEKLRAEIAN- 247
Query: 205 ANVDQRSSGPYGG-----TSGTNENEASGLPVGQNAYEDGYAV 242
+ ++GP G G N +G PV N Y+ Y +
Sbjct: 248 SETSAYANGPVGNPGGVAYGGGYGNPEAGYPV--NPYQPNYTM 288
>AT1G55170.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G14750.1); Has 13439 Blast hits to 8993
proteins in 828 species: Archae - 344; Bacteria - 1469;
Metazoa - 6958; Fungi - 1008; Plants - 683; Viruses -
29; Other Eukaryotes - 2948 (source: NCBI BLink). |
chr1:20580578-20581706 FORWARD LENGTH=283
Length = 283
Score = 132 bits (332), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 130/232 (56%), Gaps = 4/232 (1%)
Query: 24 QVLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQM 83
Q+ E ++ Q E++RL ++N LA L +++ AA+ EL ++ + ++AE++ Q+
Sbjct: 46 QIQEGEIRRQDAEIRRLLSDNHGLADDRMVLERELVAAKEELHRMNLMISDLRAEQDLQL 105
Query: 84 RAVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQ 143
R +K K+E D++A E K E Q R EVQ L + EL LR+++ ++ +D +
Sbjct: 106 REFSEKRHKLEGDVRAMESYKKEASQLRGEVQKLDEIKRELSGNVQLLRKDLAKLQSDNK 165
Query: 144 QIPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTN 203
QIP + +E++ L++E H R EYEKK + +E Q MEKN VSM+REVEKLRAEL
Sbjct: 166 QIPGMRAEVKDLQKELMHARDAIEYEKKEKFELMEQRQTMEKNMVSMAREVEKLRAEL-- 223
Query: 204 KANVDQRSSGPYGGTSGTNENEASGLPVGQNAYEDGYAVAQGRAPLPPASSG 255
A VD R G +GG+ G N N G G D Y + R+ SG
Sbjct: 224 -ATVDSRPWG-FGGSYGMNYNNMDGTFRGSYGENDTYLGSSERSQYYSHGSG 273
>AT5G61920.2 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4
anthesis, F mature embryo stage, petal differentiation
and expansion stage, E expanded cotyledon stage, D
bilateral stage; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G67170.1); Has 30201 Blast
hits to 17322 proteins in 780 species: Archae - 12;
Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:24864830-24865628 FORWARD LENGTH=238
Length = 238
Score = 100 bits (250), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 107/182 (58%)
Query: 25 VLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQMR 84
+LE K+ Q E+ RL+ +N++LA+++ AL++D+ A E+Q L AH + + E Q+R
Sbjct: 52 ILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKTETDHEIQIR 111
Query: 85 AVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQQ 144
+ +KI KME ++ E ++ E+Q A E L REEL +K +++++V + +
Sbjct: 112 STLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEAES 171
Query: 145 IPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTNK 204
+ A ELE L++E+Q R FE EK + L L+GME+ + + +EKLR+E++
Sbjct: 172 LEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEISTA 231
Query: 205 AN 206
N
Sbjct: 232 RN 233
>AT5G61920.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4
anthesis, F mature embryo stage, petal differentiation
and expansion stage, E expanded cotyledon stage, D
bilateral stage; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G67170.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:24864830-24865628 FORWARD LENGTH=238
Length = 238
Score = 100 bits (250), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 107/182 (58%)
Query: 25 VLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQMR 84
+LE K+ Q E+ RL+ +N++LA+++ AL++D+ A E+Q L AH + + E Q+R
Sbjct: 52 ILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKTETDHEIQIR 111
Query: 85 AVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQQ 144
+ +KI KME ++ E ++ E+Q A E L REEL +K +++++V + +
Sbjct: 112 STLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEAES 171
Query: 145 IPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTNK 204
+ A ELE L++E+Q R FE EK + L L+GME+ + + +EKLR+E++
Sbjct: 172 LEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEISTA 231
Query: 205 AN 206
N
Sbjct: 232 RN 233
>AT2G30120.2 | Symbols: | unknown protein; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 12 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G14750.1); Has 2527 Blast hits to 2101 proteins
in 358 species: Archae - 77; Bacteria - 245; Metazoa -
1087; Fungi - 215; Plants - 350; Viruses - 4; Other
Eukaryotes - 549 (source: NCBI BLink). |
chr2:12860607-12861862 REVERSE LENGTH=288
Length = 288
Score = 89.7 bits (221), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 95/179 (53%)
Query: 25 VLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQMR 84
+LE ++ QH E+Q L +NQRLA H L+ + A+ EL+ L +KAE E ++R
Sbjct: 38 ILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEAKVR 97
Query: 85 AVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQQ 144
V ++MEA+ + + + EL Q R +VQ L R+EL + E+ + + +
Sbjct: 98 EVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKAKPNSDR 157
Query: 145 IPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTN 203
+ E+E LR E + RA E EKK + +L +GMEK ++RE+ KL EL +
Sbjct: 158 AIEVKLEIEILRGEIRKGRAALELEKKTRASNLHHERGMEKTIDHLNREIVKLEEELVD 216
>AT2G30120.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G14750.1); Has 275 Blast hits to 241 proteins
in 42 species: Archae - 4; Bacteria - 15; Metazoa - 15;
Fungi - 4; Plants - 188; Viruses - 0; Other Eukaryotes -
49 (source: NCBI BLink). | chr2:12861332-12861862
REVERSE LENGTH=176
Length = 176
Score = 59.7 bits (143), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 72/139 (51%)
Query: 25 VLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQMR 84
+LE ++ QH E+Q L +NQRLA H L+ + A+ EL+ L +KAE E ++R
Sbjct: 38 ILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEAKVR 97
Query: 85 AVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQQ 144
V ++MEA+ + + + EL Q R +VQ L R+EL + E+ + + +
Sbjct: 98 EVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKAKPNSDR 157
Query: 145 IPALLSELEGLRQEYQHCR 163
+ E+E LR E + R
Sbjct: 158 AIEVKLEIEILRGEIRKGR 176