Miyakogusa Predicted Gene
- Lj6g3v1876730.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v1876730.1 Non Chatacterized Hit- tr|D8SIX1|D8SIX1_SELML
Putative uncharacterized protein (Fragment)
OS=Selagin,41.67,2e-18,seg,NULL; coiled-coil,NULL; FAMILY NOT
NAMED,NULL,CUFF.60017.1
(595 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G50660.1 | Symbols: | unknown protein; INVOLVED IN: biologic... 284 1e-76
AT3G20350.1 | Symbols: | unknown protein; INVOLVED IN: biologic... 224 1e-58
AT3G11590.1 | Symbols: | unknown protein; LOCATED IN: plasma me... 101 1e-21
AT1G11690.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 79 7e-15
AT5G22310.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 62 9e-10
>AT1G50660.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT3G20350.1); Has 21445 Blast
hits to 15134 proteins in 1325 species: Archae - 461;
Bacteria - 2309; Metazoa - 11052; Fungi - 1737; Plants -
1035; Viruses - 42; Other Eukaryotes - 4809 (source:
NCBI BLink). | chr1:18771386-18774385 FORWARD LENGTH=725
Length = 725
Score = 284 bits (726), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 197/607 (32%), Positives = 312/607 (51%), Gaps = 58/607 (9%)
Query: 43 TRRLAAGLWKLRFLEVSXXXXXXXXXXSFCHSLPKQAKGNNVKGTSRHRQDSEEKFKVRR 102
R+LAAGLW+L+ + S G + + K+R+
Sbjct: 116 VRKLAAGLWRLQVPDASSSGGERKGKEGLGFQGNGGYMGVPYLYHHSDKPSGGQSNKIRQ 175
Query: 103 -PVTILRSRDGLRCELETYMPCINFSKEEATKWNPALEDG-----------KFVGDRD-- 148
P TI +++G C+LE MP + + E ATKW+P D K + +
Sbjct: 176 NPSTIATTKNGFLCKLEPSMPFPHSAMEGATKWDPVCLDTMEEVHQIYSNMKRIDQQVNA 235
Query: 149 -SVVTVLLEELLRAQRSINKLKAAQKSSEKNVKHFLRNLEREKVFWKRRVRQKIEAMLDD 207
S+V+ L EL A I L++ ++S +K ++ FLR + E+ W+ R +K+ A++DD
Sbjct: 236 VSLVSSLEAELEEAHARIEDLESEKRSHKKKLEQFLRKVSEERAAWRSREHEKVRAIIDD 295
Query: 208 LKDKLAREKRSRERMEVLNTKLGHELAIANLSAKQFMANYXXXXXXXXXXXXVCNELAMY 267
+K + REK++R+R+E++N KL +ELA + L+ K++M +Y VC+ELA
Sbjct: 296 MKTDMNREKKTRQRLEIVNHKLVNELADSKLAVKRYMQDYEKERKARELIEEVCDELAKE 355
Query: 268 IGEGEAKLEEIIRDSMRIREEVEEERKMMQMAELWREERVQMKLADAKLFLENKYNQLLQ 327
IGE +A++E + R+SM +REEV++ER+M+QMAE+WREERVQMKL DAK+ LE +Y+Q+ +
Sbjct: 356 IGEDKAEIEALKRESMSLREEVDDERRMLQMAEVWREERVQMKLIDAKVALEERYSQMNK 415
Query: 328 LIAYLEMFLRSRGAELDTRELEEAELIKQVVESVNLQRIVELSYDFSKSDDTFPIYEELT 387
L+ LE FLRSR D +E+ EAEL+++ SVN+Q I E +Y + DD + ++EE+
Sbjct: 416 LVGDLESFLRSRDIVTDVKEVREAELLRETAASVNIQEIKEFTYVPANPDDIYAVFEEMN 475
Query: 388 KDNANERRIKLDSHTTLAGPSSKIHIESLE------EGLNKNSILHQLSPQSDYDVECLK 441
A++R ++ + SK+H SL+ +G + ++ HQ + D
Sbjct: 476 LGEAHDREMEKSVAYSPISHDSKVHTVSLDANMMNKKGRHSDAYTHQNGDIEEDDSGWET 535
Query: 442 LSSEPQRGDTY-------VINVNQERNKSESEAENSPESLNK--------GTIVNGVYYV 486
+S ++G +Y +N ++ + + ESL K T ++ V +
Sbjct: 536 VSHLEEQGSSYSPDGSIPSVNNKNHNHRHSNASSGGTESLGKVWDDTMTPTTEISEVCSI 595
Query: 487 SRRQSK--------WKANPASKQLR-------SYARPNDETTISSTKSSQHRRQGDRTST 531
RR SK W++ AS R S N + KSS DR S+
Sbjct: 596 PRRSSKKVSSIAKLWRSTGASNGDRDSNYKVISMEGMNGGRVSNGRKSSAGMVSPDRVSS 655
Query: 532 TSHCK------GSVEGNSKDNMNPHIAR-GMKGCIEWPRGIPKANSKVIPLEERVKSQKS 584
G + + +PH+ R GMKGCIEWPRG K++ K +E R++SQK
Sbjct: 656 KGGFSPMMDLVGQWNSSPESANHPHVNRGGMKGCIEWPRGAQKSSLKSKLIEARIESQKV 715
Query: 585 QLQHVLK 591
QL+HVLK
Sbjct: 716 QLKHVLK 722
>AT3G20350.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: cotyledon; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G50660.1);
Has 15095 Blast hits to 11224 proteins in 1051 species:
Archae - 223; Bacteria - 1586; Metazoa - 7000; Fungi -
1255; Plants - 746; Viruses - 40; Other Eukaryotes -
4245 (source: NCBI BLink). | chr3:7096602-7099372
FORWARD LENGTH=673
Length = 673
Score = 224 bits (572), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 186/602 (30%), Positives = 298/602 (49%), Gaps = 64/602 (10%)
Query: 37 TLRLNGTRRLAAGLWKLRFLEVSXXXXXXXXXXSFCHSLPKQAKGNNVKGTSRHRQDSEE 96
++R + R+LAAG+W+LR + GN H D +
Sbjct: 90 SVRPDTVRKLAAGVWRLRVPDAVSSGGDKRSKDRLRFQETAGPAGNLGPLFYYHHHDDKH 149
Query: 97 KFKVRRPVTILRSRDGLRCELETYMPCINFSKEEATKWNPALED---------------G 141
SR C+ E +P + + E ATKW+P D
Sbjct: 150 SGFQSNNSRNKHSR--FLCKHEPSVPFPHCAMEGATKWDPICLDTRDDVHQIYTNVKWNN 207
Query: 142 KFVGDRDSVVTVLLEELLRAQRSINKLKAAQKSSEKNVKHFLRNLEREKVFWKRRVRQKI 201
+ V D ++ L+ L A+ I L++ ++S +K ++ FL+ + E+ W+ R +K+
Sbjct: 208 QQVNDVSLASSIELK-LQEARACIKDLESEKRSQKKKLEQFLKKVSEERAAWRSREHEKV 266
Query: 202 EAMLDDLKDKLAREKRSRERMEVLNTKLGHELAIANLSAKQFMANYXXXXXXXXXXXXVC 261
A++DD+K + +EK++R+R+E++N+KL +ELA + L+ K++M +Y VC
Sbjct: 267 RAIIDDMKADMNQEKKTRQRLEIVNSKLVNELADSKLAVKRYMHDYQQERKARELIEEVC 326
Query: 262 NELAMYIGEGEAKLEEIIRDSMRIREEVEEERKMMQMAELWREERVQMKLADAKLFLENK 321
+ELA I E +A++E + +SM +REEV++ER+M+QMAE+WREERVQMKL DAK+ LE K
Sbjct: 327 DELAKEIEEDKAEIEALKSESMNLREEVDDERRMLQMAEVWREERVQMKLIDAKVTLEEK 386
Query: 322 YNQLLQLIAYLEMFLRSRGAELDTRELEEAELIKQVVESV-NLQRIVELSYDFSKSDDTF 380
Y+Q+ +L+ +E FL SR +E+ AEL+++ SV N+Q I E +Y+ +K DD
Sbjct: 387 YSQMNKLVGDMEAFLSSRNT-TGVKEVRVAELLRETAASVDNIQEIKEFTYEPAKPDDIL 445
Query: 381 PIYEELTKDNANERRIKLDSHTTLAGPSSKIHIES-----LEEGLNKNSILHQLSPQSDY 435
++E++ +R + + +SK H S + +G + N+ Q +
Sbjct: 446 MLFEQMNMGENQDRESEQYVAYSPVSHASKAHTVSPDVNLINKGRHSNAFTDQNGEFEED 505
Query: 436 DVECLKLSSEPQRGDTYVINVNQERNKSESEAENSPESLNKGT--------IVNGVYYVS 487
D +S + G +Y + + N S + NS S+N GT + V V
Sbjct: 506 DSGWETVSHSEEHGSSYSPDESIP-NISNTHHRNSNVSMN-GTEYEKTLLREIKEVCSVP 563
Query: 488 RRQSKWKANPASKQLRSYARPNDETTISSTKSSQHRRQGDRTST---TSHCKGSVEG--- 541
RRQ SK+L S A+ SS + R R ST S GS +G
Sbjct: 564 RRQ--------SKKLPSMAK-----LWSSLEGMNGRVSNARKSTVEMVSPETGSNKGGFN 610
Query: 542 ---------NSKDNMNPHIAR-GMKGCIEWPRGIPKANSKVIPLEERVKSQKSQLQHVLK 591
+S D+ N ++ R G KGCIEWPRG K + K +E +++SQK QL+HVL+
Sbjct: 611 TLDLVGQWSSSPDSANANLNRGGRKGCIEWPRGAHKNSLKTKLIEAQIESQKVQLKHVLE 670
Query: 592 PK 593
K
Sbjct: 671 HK 672
>AT3G11590.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G22310.1);
Has 22320 Blast hits to 15179 proteins in 1213 species:
Archae - 372; Bacteria - 2307; Metazoa - 10906; Fungi -
1700; Plants - 1146; Viruses - 65; Other Eukaryotes -
5824 (source: NCBI BLink). | chr3:3660628-3663537
FORWARD LENGTH=622
Length = 622
Score = 101 bits (252), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 119/205 (58%)
Query: 149 SVVTVLLEELLRAQRSINKLKAAQKSSEKNVKHFLRNLEREKVFWKRRVRQKIEAMLDDL 208
S+V+ L EL RA+ +N+L K ++ + ++ EK WK ++ +EA ++ +
Sbjct: 255 SLVSALHSELERARLQVNQLIHEHKPENNDISYLMKRFAEEKAVWKSNEQEVVEAAIESV 314
Query: 209 KDKLAREKRSRERMEVLNTKLGHELAIANLSAKQFMANYXXXXXXXXXXXXVCNELAMYI 268
+L E++ R R E LN KLG ELA + + + VC+ELA I
Sbjct: 315 AGELEVERKLRRRFESLNKKLGKELAETKSALMKAVKEIENEKRARVMVEKVCDELARDI 374
Query: 269 GEGEAKLEEIIRDSMRIREEVEEERKMMQMAELWREERVQMKLADAKLFLENKYNQLLQL 328
E +A++EE+ R+S +++EEVE+ER+M+Q+A+ REERVQMKL++AK LE K + +L
Sbjct: 375 SEDKAEVEELKRESFKVKEEVEKEREMLQLADALREERVQMKLSEAKHQLEEKNAAVDKL 434
Query: 329 IAYLEMFLRSRGAELDTRELEEAEL 353
L+ +L+++ + TRE + +L
Sbjct: 435 RNQLQTYLKAKRCKEKTREPPQTQL 459
>AT1G11690.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G20350.1); Has 5959 Blast hits to 4807 proteins
in 476 species: Archae - 156; Bacteria - 436; Metazoa -
2789; Fungi - 309; Plants - 336; Viruses - 9; Other
Eukaryotes - 1924 (source: NCBI BLink). |
chr1:3941469-3942212 FORWARD LENGTH=247
Length = 247
Score = 79.3 bits (194), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 66/225 (29%), Positives = 111/225 (49%), Gaps = 38/225 (16%)
Query: 117 LETYM---PCINFSKEEATKWNPALEDGKFVGDRDSVVTVLLEELLRAQRSINKLKAAQK 173
L TY P NF ++E +N +V L EL +AQ I +L+A +
Sbjct: 12 LRTYYSVEPSENFQEDEFLDFN--------------LVPCLQTELWKAQTRIKELEAEKF 57
Query: 174 SSEKNVKHFLRNLEREKVFWKRRVRQKIEAMLDDLKDKLAREKRSRERMEVLNTKLGHEL 233
SE+ ++ +RN EK + +D LK+KL++E+ ++R++ N++L ++
Sbjct: 58 KSEETIRCLIRNQRNEK-------EETTNPFVDYLKEKLSKEREEKKRVKAENSRLKKKI 110
Query: 234 AIANLSAKQFMANYXXXXXXXXXXXXVCNELAMYIGEGEAKLEEIIRDSMRIREEVEEER 293
S + VC EL +++E+ ++ R+ +E EEER
Sbjct: 111 LDMESSVNRL-------RRERDTMEKVCEELV-------TRIDELKVNTRRVWDETEEER 156
Query: 294 KMMQMAELWREERVQMKLADAKLFLENKYNQLLQLIAYLEMFLRS 338
+M+QMAE+WREERV++K DAKL L+ KY ++ + LE L +
Sbjct: 157 QMLQMAEMWREERVRVKFMDAKLALQEKYEEMNLFVVELEKCLET 201
>AT5G22310.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11590.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:7383742-7385345 REVERSE LENGTH=481
Length = 481
Score = 62.4 bits (150), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 73/148 (49%), Gaps = 21/148 (14%)
Query: 215 EKRSRERMEVLNTKLGHELAIANLSAKQFMANYXXXXXXXXXXXXVCNELAMYIGEGEAK 274
E++ R R E +N +LG EL A + ++ VC+EL IG+
Sbjct: 256 ERKLRRRTEKMNRRLGRELTEAKETERKMKEEMKREKRAKDVLEEVCDELTKGIGDD--- 312
Query: 275 LEEIIRDSMRIREEVEEERKMMQMAELWREERVQMKLADAKLFLENKYNQLLQLIAYLEM 334
++E+E+ER+MM +A++ REERVQMKL +AK E+KY + +L L
Sbjct: 313 -----------KKEMEKEREMMHIADVLREERVQMKLTEAKFEFEDKYAAVERLKKELRR 361
Query: 335 FLRSRGAELDTRELEEAELIKQVVESVN 362
LD E + + I++++E ++
Sbjct: 362 V-------LDGEEGKGSSEIRRILEVID 382