Miyakogusa Predicted Gene
- Lj5g3v1697640.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1697640.1 tr|C1EEU2|C1EEU2_MICSR Predicted protein
OS=Micromonas sp. (strain RCC299 / NOUM17)
GN=MICPUN_106293,24.46,1e-17,FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.55759.1
(533 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                      Score      E
Sequences producing significant alignments:                          (bits)  Value
AT3G11760.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...      405  e-113
AT5G04860.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      367  e-101
AT2G10560.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      194  1e-49
AT2G25460.1 | Symbols: | CONTAINS InterPro DOMAIN/s: C2 calcium...       94  3e-19
>AT3G11760.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 14 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G04860.1); Has 84 Blast hits to 73 proteins in
13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 84; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:3718529-3721123 FORWARD
LENGTH=702
Length = 702
Score = 405 bits (1042), Expect = e-113, Method: Compositional matrix adjust.
Identities = 231/507 (45%), Positives = 304/507 (59%), Gaps = 31/507 (6%)
Query: 34 EFSTLKAGLRRVKTFTDYVSTRRAKKASYKDEGSDGRSSTRSEDSEYKYTSDLDSQDNDD 93
+ S +KAGLR+VK FT++VSTR+AKKA ++EG + +++ +D D
Sbjct: 211 DVSAIKAGLRKVKIFTEFVSTRKAKKACREEEGRFSSFESSESLDDFE--TDFD------ 262
Query: 94 VNKSEERDEDSCVRHSMSYETLTCGNNVGESPYTSSTFNGKNEFKLNSGSQNSYFGDALV 153
E ++E +R S SY L+ N VG S + + ++E + + S G
Sbjct: 263 ----EGKEELMSMRKSFSYGPLSYANGVGTSLNCGAKVSDEDEDWVYYSHRKSDVGAGCS 318
Query: 154 ENYNTCDQVEYHIS---KYRILSWRKRKLQFRSRSFKVKGELLLKKHXXXXXXXXXXXXR 210
+ ++ + Y S + IL WRKRKL FRS K KGE LLKK R
Sbjct: 319 DAEDSAAGLVYEASLLPRRSILPWRKRKLSFRSP--KSKGEPLLKKDNGEEGGDDIDFDR 376
Query: 211 RVLSSSDEYTCQKWHKTQENITTTQSSISGFGENNFTVGTWEHKEVISQDGLMKLHTEIF 270
R LSS + + +++ ++S S FGE++F +G+WE KEVIS+DG MKL T +F
Sbjct: 377 RQLSSDEAHPPFGSKIDEDSSANPRTSFSEFGEDSFAIGSWEEKEVISRDGHMKLQTSVF 436
Query: 271 FASIDQRSECAAGESACAVLVVLIVDWLKLNQAEIPIKCEFDSLIKDGSSEWRIICENKD 330
ASIDQRSE AAGESAC LV +I DW + N +PIK +FDSLI++GS EWR +CEN+
Sbjct: 437 LASIDQRSERAAGESACTALVAVIADWFQKNGNLMPIKSQFDSLIREGSLEWRNLCENET 496
Query: 331 YMKNFPDKHFDLDTVFRAKTRRVSVVPEKSYVXXXXXXXXXXXXXXXXXXXAMSFDSIWE 390
YM+ FPDKHFDLDTV +AK R ++V+P KS+V AMSFDSIW
Sbjct: 497 YMQKFPDKHFDLDTVLQAKIRPLTVIPGKSFVGFFHPDGMINEGRFEFLQGAMSFDSIWA 556
Query: 391 EI------SHCASELHLFSEAIVYIVSWNDHFFVLKVQHDANYIIDTLGERLHEGCNQAY 444
EI S S VYIVSWNDHFFVLKV+ +A YIIDTLGERL+EGC+QAY
Sbjct: 557 EIISLEESSANGDSYDDDSPPHVYIVSWNDHFFVLKVEKEAYYIIDTLGERLYEGCDQAY 616
Query: 445 ILKFDANTRIEKLCNENQVLDAKPPSDEVNDVRKKEIICRGKESCKEYIKRFLASIPVRE 504
+LKFD T I K+ + + P + EI+ RGKESCKEYIK FLA+IP+RE
Sbjct: 617 VLKFDHKTVIHKILHTEEAGSESEP--------ESEILSRGKESCKEYIKNFLAAIPIRE 668
Query: 505 LQDDVKRGLKASMPLHRRLQIEFHYTT 531
LQ+D+K+GL ++ P+H RLQIEFHYTT
Sbjct: 669 LQEDIKKGLASTAPVHHRLQIEFHYTT 695
>AT5G04860.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11760.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:1411760-1414459
REVERSE LENGTH=782
Length = 782
Score = 367 bits (943), Expect = e-101, Method: Compositional matrix adjust.
Identities = 232/550 (42%), Positives = 324/550 (58%), Gaps = 52/550 (9%)
Query: 11 WPPLSSKKFEASLVKPEEVRSLVEFSTLKAGLRRVKTFTDYVSTRRAKKASYKDEGSDGR 70
W PLS++ +A S +K GLR++KTF + +S+ +A + + +GS G
Sbjct: 200 WSPLSAEAEKAE-------------SVVKVGLRKMKTFNNCMSSTQASEKESEKDGSSGS 246
Query: 71 SST-----RSEDSEYKYTSDLDSQDNDDV--NKSEERDEDSCVRHSMSYETLTCGNNVGE 123
S R+ DS+ Y D DS D D E ++ +S + ++Y+TL N
Sbjct: 247 GSDGKSPERNLDSDSSYPFDTDSLDEGDAADESEENKENESSLADPVNYKTLRSANWARG 306
Query: 124 SPYTSSTFNGKNEFKLNSGS---QNSYFGDALVENYNTCDQVEYHISKYRILSWRKRKLQ 180
S +T + ++ + S + + D + + + +Q + +SK R+LSW+KRKL
Sbjct: 307 SFHTVTNPEDEDLIYYSHRSPLAETGHCSDEVSNDVVSLEQAKGQMSKKRMLSWKKRKLS 366
Query: 181 FRSRSFKVKGELLLKKHXXXXXXXXXXXXRRVLSSSDEYTCQKWHKTQENITTTQSSISG 240
FRS K KGE LLKK RR LSSSDE + W+++ + I +S
Sbjct: 367 FRSP--KQKGEPLLKKDCLEEGGDDIDFDRRQLSSSDE-SNSDWYRSDDAIMKP---LSQ 420
Query: 241 FGENNFTVGTWEHKEVISQDGLMKLHTEIFFASIDQRSECAAGESACAVLVVLIVDWLKL 300
FG+++F VG+WE KE+IS+DGLMKL +F ASIDQRSE AAGESAC LV ++ WL
Sbjct: 421 FGDDDFVVGSWETKEIISRDGLMKLTARVFLASIDQRSERAAGESACTALVAVMAHWLGS 480
Query: 301 NQAEIPIKCEFDSLIKDGSSEWRIICENKDYMKNFPDKHFDLDTVFRAKTRRVSVVPEKS 360
N+ IP + EFDSLI++GSSEWR +CEN++Y + FPDKHFDL+TV +AK R + VVPE+S
Sbjct: 481 NRDIIPTRSEFDSLIREGSSEWRNMCENEEYRERFPDKHFDLETVLQAKVRPICVVPERS 540
Query: 361 YV-----XXXXXXXXXXXXXXXXXXXAMSFDSIWEEISHCASELHLFSEAIVYIVSWNDH 415
++ MSFDSIWEE+ E SE ++YIVSWNDH
Sbjct: 541 FIGFFHPEKSEEEEGKEDASLDFLKGVMSFDSIWEELMKQEPE-ESASEPVIYIVSWNDH 599
Query: 416 FFVLKVQHDANYIIDTLGERLHEGCNQAYILKFDANTRIEKLCN---------ENQVLDA 466
FFVL V HDA YIIDTLGERL+EGCNQAY+LKFD + I++L + NQ
Sbjct: 600 FFVLLVNHDAYYIIDTLGERLYEGCNQAYVLKFDKDAEIKRLPSVIKDNKADMGNQKQGG 659
Query: 467 KPPSDE------VNDVRKKEIICRGKESCKEYIKRFLASIPVRELQDDVKRGLKASMPLH 520
K S++ + ++E++CRGKESC+EYIK FLA+IP+++++ D+K+GL +S LH
Sbjct: 660 KNKSEQPERSKESEEQEEEEVVCRGKESCREYIKSFLAAIPIQQVKADMKKGLVSS--LH 717
Query: 521 RRLQIEFHYT 530
RLQIE HYT
Sbjct: 718 HRLQIELHYT 727
>AT2G10560.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G04860.1); Has 70 Blast hits to 70 proteins in
12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr2:4109862-4110698 REVERSE
LENGTH=278
Length = 278
Score = 194 bits (493), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 103/226 (45%), Positives = 141/226 (62%), Gaps = 23/226 (10%)
Query: 325 ICENKDYMKNFPDKHFDLDTVFRAKTRRVSVVPEKSYV-----XXXXXXXXXXXXXXXXX 379
+CEN++Y + FPDKHFDL+TV +AK R + VVPE++++
Sbjct: 1 MCENEEYRERFPDKHFDLETVLQAKVRPICVVPERTFIGFFHREKSKEEEEKEDVSLDFL 60
Query: 380 XXAMSFDSIWEEISHCASELHLFSEAIVYIVSWNDHFFVLKVQHDANYIIDTLGERLHEG 439
MSFDSIWEEI E SE ++YIVSWNDH+FVL V HDA YIIDTLGER++EG
Sbjct: 61 KGVMSFDSIWEEIMKQEPE-ESASEHVIYIVSWNDHYFVLLVNHDAYYIIDTLGERVYEG 119
Query: 440 CNQAYILKFDANTRIEKL---CNENQV------------LDAKPPSDEVNDVRKKEIICR 484
CNQAY+LKFD + I++L +N+ + S E + ++ ++CR
Sbjct: 120 CNQAYVLKFDQDAEIKRLPSVIKDNKADMGSQKQGGKNKYEQPERSKESEEQGEEVVVCR 179
Query: 485 GKESCKEYIKRFLASIPVRELQDDVKRGLKASMPLHRRLQIEFHYT 530
GKESC+EYIK FLA+IP+++++ D+K GL +S H RLQIE +YT
Sbjct: 180 GKESCREYIKSFLAAIPIQQVKADMKEGLVSS--FHHRLQIELYYT 223
>AT2G25460.1 | Symbols: | CONTAINS InterPro DOMAIN/s: C2
calcium-dependent membrane targeting
(InterPro:IPR000008); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT5G04860.1); Has 108
Blast hits to 69 proteins in 11 species: Archae - 0;
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 108;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr2:10833175-10835374 REVERSE LENGTH=423
Length = 423
Score = 93.6 bits (231), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/144 (41%), Positives = 85/144 (59%), Gaps = 7/144 (4%)
Query: 251 WEHKEVISQDGLMKLHTEIFFASIDQRSECAAGESACAVLVVLIVDWLKLNQAEI-PIKC 309
W K+++S+DG KL +E++ ASIDQRSE AAGE+ACA + V++ W N I P
Sbjct: 282 WVMKDLVSRDGKSKLKSEVYLASIDQRSEQAAGEAACAAVAVVVAHWFHANPKLINPSGT 341
Query: 310 EFDSLIKDGSSEWRIICENKDYMKNFPDKHFDLDTVFRAKTRRVSVVPEKSYVXXXXXXX 369
FDSLI GSS W+ +C+ + Y++ FP++HFDL+T+ A R V V +KS+
Sbjct: 342 AFDSLITQGSSLWQSLCDKESYLRLFPNRHFDLETIVSANLRPVRVCTDKSFTGLFSPER 401
Query: 370 XXXXXXXXXXXXAMSFDSIWEEIS 393
MSFD IW+E+S
Sbjct: 402 FASLDGL------MSFDQIWDELS 419