Miyakogusa Predicted Gene
- Lj1g3v5031290.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v5031290.1 Non Chatacterized Hit- tr|F6HHD5|F6HHD5_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,31.88,4e-19,coiled-coil,NULL; FAMILY NOT
NAMED,NULL,NODE_37671_length_1831_cov_14.535773.path2.1
(220 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G41620.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 137 8e-33
AT1G64180.1 | Symbols: | intracellular protein transport protei... 122 2e-28
AT3G11590.1 | Symbols: | unknown protein; LOCATED IN: plasma me... 98 4e-21
AT2G46250.1 | Symbols: | myosin heavy chain-related | chr2:1899... 96 3e-20
AT1G50660.1 | Symbols: | unknown protein; INVOLVED IN: biologic... 90 9e-19
AT3G20350.1 | Symbols: | unknown protein; INVOLVED IN: biologic... 89 2e-18
AT5G22310.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 53 2e-07
>AT5G41620.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
chloroplast, plasma membrane; EXPRESSED IN: 9 plant
structures; EXPRESSED DURING: 6 growth stages; BEST
Arabidopsis thaliana protein match is: intracellular
protein transport protein USO1-related
(TAIR:AT1G64180.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:16646330-16648776 FORWARD LENGTH=623
Length = 623
Score = 137 bits (344), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 76/182 (41%), Positives = 124/182 (68%), Gaps = 5/182 (2%)
Query: 2 EVDLSRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDEK 61
EV SR R+K+LL+ + ++ ++ ++K+L E+KL+ K+ E ++ +AVQS+++ ++DE+
Sbjct: 226 EVAHSRVRIKELLRYQQADRHELDSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALEDER 285
Query: 62 RLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEVH 121
+LRK SESLHRK+ RELSEVKSS + +++LE K+ +++ LCD+FAKG + YE E+H
Sbjct: 286 KLRKRSESLHRKMARELSEVKSSLSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEEIH 345
Query: 122 SL----MHSVEKGQVKGD-CRLRISEAWLNERKKMKLVQASSDLTDRDSIIDKLGVDIET 176
L + G+ GD L I+E+WL+ER +M+L + S++DKL V+IET
Sbjct: 346 GLKKKNLDKDWAGRGGGDQLVLHIAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEIET 405
Query: 177 YL 178
+L
Sbjct: 406 FL 407
>AT1G64180.1 | Symbols: | intracellular protein transport protein
USO1-related | chr1:23821640-23824193 FORWARD LENGTH=593
Length = 593
Score = 122 bits (306), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/181 (39%), Positives = 120/181 (66%), Gaps = 18/181 (9%)
Query: 2 EVDLSRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDEK 61
E+ SRAR+KDLL+ + +K+ M+ +K+L E+KL + + EH ++ +AVQS+ +DE+
Sbjct: 218 ELAHSRARIKDLLRCKQADKRDMDDFVKQLAEEKLSKGTKEHDRLSSAVQSL----EDER 273
Query: 62 RLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEVH 121
+LRK SESL+RKL +ELSEVKS+ + ++++E +++ +L+ LCD+FAKG + YE E+H
Sbjct: 274 KLRKRSESLYRKLAQELSEVKSTLSNCVKEMERGTESKKILERLCDEFAKGIKSYEREIH 333
Query: 122 SLMHSVEKGQVKGDCR----LRISEAWLNERKKMKLVQASSDLTDRDSIIDKLGVDIETY 177
L ++K D + L I+E+WL+ER +Q+ + S ++KL +IET+
Sbjct: 334 GLKQKLDKNWKGWDEQDHMILCIAESWLDER-----IQSGN-----GSALEKLEFEIETF 383
Query: 178 L 178
L
Sbjct: 384 L 384
>AT3G11590.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G22310.1);
Has 22320 Blast hits to 15179 proteins in 1213 species:
Archae - 372; Bacteria - 2307; Metazoa - 10906; Fungi -
1700; Plants - 1146; Viruses - 65; Other Eukaryotes -
5824 (source: NCBI BLink). | chr3:3660628-3663537
FORWARD LENGTH=622
Length = 622
Score = 98.2 bits (243), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 117/182 (64%), Gaps = 3/182 (1%)
Query: 2 EVDLSRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDEK 61
E++ +R +V L+ E+K + L+K+ E+K V KSNE ++AA++S+ E++ E+
Sbjct: 263 ELERARLQVNQLIHEHKPENNDISYLMKRFAEEKAVWKSNEQEVVEAAIESVAGELEVER 322
Query: 62 RLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEVH 121
+LR+ ESL++KL +EL+E KS+ +++++E+E++AR++++ +CD+ A+ + + EV
Sbjct: 323 KLRRRFESLNKKLGKELAETKSALMKAVKEIENEKRARVMVEKVCDELARDISEDKAEVE 382
Query: 122 SL---MHSVEKGQVKGDCRLRISEAWLNERKKMKLVQASSDLTDRDSIIDKLGVDIETYL 178
L V++ K L++++A ER +MKL +A L ++++ +DKL ++TYL
Sbjct: 383 ELKRESFKVKEEVEKEREMLQLADALREERVQMKLSEAKHQLEEKNAAVDKLRNQLQTYL 442
Query: 179 LA 180
A
Sbjct: 443 KA 444
>AT2G46250.1 | Symbols: | myosin heavy chain-related |
chr2:18991386-18993201 FORWARD LENGTH=468
Length = 468
Score = 95.5 bits (236), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 93/153 (60%), Gaps = 16/153 (10%)
Query: 1 MEVDLSRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDE 60
ME+D RA +K++ Q K++ D+ +RK E ++K +SIK E+ DE
Sbjct: 191 MELDECRAEIKEVQQRKKLS-------------DRPLRKKKEEEEVKDVFRSIKRELDDE 237
Query: 61 KRLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEV 120
+++RK SE+LHRKL REL E K +L+DLE E + R+V++NLCD+FAK +DYE +V
Sbjct: 238 RKVRKESETLHRKLTRELCEAKHCLSKALKDLEKETQERVVVENLCDEFAKAVKDYEDKV 297
Query: 121 HSLMHSVEKGQVKGDCRLRISEAWLNERKKMKL 153
+ +K V ++I+E W ++R +MKL
Sbjct: 298 RRIG---KKSPVSDKVIVQIAEVWSDQRLQMKL 327
>AT1G50660.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT3G20350.1); Has 21445 Blast
hits to 15134 proteins in 1325 species: Archae - 461;
Bacteria - 2309; Metazoa - 11052; Fungi - 1737; Plants -
1035; Viruses - 42; Other Eukaryotes - 4809 (source:
NCBI BLink). | chr1:18771386-18774385 FORWARD LENGTH=725
Length = 725
Score = 90.1 bits (222), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 113/180 (62%), Gaps = 3/180 (1%)
Query: 2 EVDLSRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDEK 61
E++ + AR++DL E + +K+ +E+ ++K++E++ +S EH K++A + +K ++ EK
Sbjct: 245 ELEEAHARIEDLESEKRSHKKKLEQFLRKVSEERAAWRSREHEKVRAIIDDMKTDMNREK 304
Query: 62 RLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEVH 121
+ R+ E ++ KLV EL++ K + ++D E ERKAR +++ +CD+ AK + + E+
Sbjct: 305 KTRQRLEIVNHKLVNELADSKLAVKRYMQDYEKERKARELIEEVCDELAKEIGEDKAEIE 364
Query: 122 SL-MHSVEKGQVKGDCR--LRISEAWLNERKKMKLVQASSDLTDRDSIIDKLGVDIETYL 178
+L S+ + D R L+++E W ER +MKL+ A L +R S ++KL D+E++L
Sbjct: 365 ALKRESMSLREEVDDERRMLQMAEVWREERVQMKLIDAKVALEERYSQMNKLVGDLESFL 424
>AT3G20350.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: cotyledon; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G50660.1);
Has 15095 Blast hits to 11224 proteins in 1051 species:
Archae - 223; Bacteria - 1586; Metazoa - 7000; Fungi -
1255; Plants - 746; Viruses - 40; Other Eukaryotes -
4245 (source: NCBI BLink). | chr3:7096602-7099372
FORWARD LENGTH=673
Length = 673
Score = 89.0 bits (219), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 112/185 (60%), Gaps = 3/185 (1%)
Query: 6 SRARVKDLLQENKINKQAMEKLIKKLTEDKLVRKSNEHVKIKAAVQSIKEEIQDEKRLRK 65
+RA +KDL E + K+ +E+ +KK++E++ +S EH K++A + +K ++ EK+ R+
Sbjct: 226 ARACIKDLESEKRSQKKKLEQFLKKVSEERAAWRSREHEKVRAIIDDMKADMNQEKKTRQ 285
Query: 66 HSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAKGTRDYEHEVHSL-M 124
E ++ KLV EL++ K + + D + ERKAR +++ +CD+ AK + + E+ +L
Sbjct: 286 RLEIVNSKLVNELADSKLAVKRYMHDYQQERKARELIEEVCDELAKEIEEDKAEIEALKS 345
Query: 125 HSVEKGQVKGDCR--LRISEAWLNERKKMKLVQASSDLTDRDSIIDKLGVDIETYLLAMS 182
S+ + D R L+++E W ER +MKL+ A L ++ S ++KL D+E +L + +
Sbjct: 346 ESMNLREEVDDERRMLQMAEVWREERVQMKLIDAKVTLEEKYSQMNKLVGDMEAFLSSRN 405
Query: 183 SVNLR 187
+ ++
Sbjct: 406 TTGVK 410
>AT5G22310.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11590.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:7383742-7385345 REVERSE LENGTH=481
Length = 481
Score = 52.8 bits (125), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 68/122 (55%), Gaps = 17/122 (13%)
Query: 52 SIKEEIQDEKRLRKHSESLHRKLVRELSEVKSSFCSSLRDLESERKARIVLKNLCDDFAK 111
S++EE E++LR+ +E ++R+L REL+E K + +++ E++A+ VL+ +CD+ K
Sbjct: 248 SLQEEAMVERKLRRRTEKMNRRLGRELTEAKETERKMKEEMKREKRAKDVLEEVCDELTK 307
Query: 112 GTRDYEHEV---HSLMHSVEKGQVKGDCRLRISEAWLNERKKMKLVQASSDLTDRDSIID 168
G D + E+ +MH I++ ER +MKL +A + D+ + ++
Sbjct: 308 GIGDDKKEMEKEREMMH--------------IADVLREERVQMKLTEAKFEFEDKYAAVE 353
Query: 169 KL 170
+L
Sbjct: 354 RL 355