Miyakogusa Predicted Gene - Lj5g3v0841340.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v0841340.1 Non Characterized Hit- tr|G7I7Z4|G7I7Z4_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,79,0,coiled-coil,NULL; NT-C2,EEIG1/EHBP1 N-terminal domain;
FAMILY NOT NAMED,NULL; seg,NULL,CUFF.54067.1
(774 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score   E
Sequences producing significant alignments:                        (bits)  Value
AT3G11760.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   776   0.0
AT5G04860.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   583   e-166
AT2G10560.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   240   2e-63
AT2G25460.1 | Symbols: | CONTAINS InterPro DOMAIN/s: C2 calcium...   133   4e-31
>AT3G11760.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 14 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G04860.1); Has 84 Blast hits to 73 proteins in
13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 84; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:3718529-3721123 FORWARD
LENGTH=702
Length = 702
Score = 776 bits (2005), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/762 (56%), Positives = 515/762 (67%), Gaps = 87/762 (11%)
Query: 1 MVVKMMKKWRPWPPLVSRKYEVKLLVRTLQGCDLLREGAREG-MFAVEIRWKGPKLALSS 59
MVVKMMK WRPWPPLV+RKYEVKL V+ L+G DL+REG E VEIRWKGPK L S
Sbjct: 1 MVVKMMK-WRPWPPLVTRKYEVKLSVKKLEGWDLVREGVPEKDRLTVEIRWKGPKATLGS 59
Query: 60 LRRSAVARNFTKEAAAGCDGDNNNDVVLW-DEEFQSFCTLSAYKDNNNAFHPWEIAFTVF 118
LRRS V RNFTKEA +DVV W DEEFQS C+L++YKD+ F+PWEI F+VF
Sbjct: 60 LRRS-VKRNFTKEAVG------ESDVVSWEDEEFQSLCSLTSYKDS--LFYPWEITFSVF 110
Query: 119 -NGL--NQRPKVPVIGTASLNLAEFASVIDQKDFDLNIPLTIPGGSAXXXXXXXXXXXXX 175
NG+ Q+ K PV+GTA LNLAE+A V D+K+FD+NIPLT+ A
Sbjct: 111 TNGMKQGQKNKAPVVGTAFLNLAEYACVTDKKEFDINIPLTLSACVASETHPLLFVSLSL 170
Query: 176 XXLRAAQESSELVQKSVVPVASPLAQT----GETNLAEKDEVSTIKAGLRKVKILTEFVX 231
LR E+S+ ++ V + + ET+ EK++VS IKAGLRKVKI TEFV
Sbjct: 171 LELRTTPETSDSAAQTAVVPLPLPSPSPQQPTETHSVEKEDVSAIKAGLRKVKIFTEFVS 230
Query: 232 XXXXXXXXXXXXXXXXNLSARSEDGEYNYPFDSDSLDDFEEGESDEVKED-PNVRKSFSY 290
+ R E+G ++ S+SLDDFE + DE KE+ ++RKSFSY
Sbjct: 231 TRKAKK------------ACREEEGRFSSFESSESLDDFET-DFDEGKEELMSMRKSFSY 277
Query: 291 GKLAYANAEGSFYSSIRVKSDDDVDEGWVYYSNHISDTGXXXXXXXXXXXXXXXXXXXQS 350
G L+YAN G+ + SD+D D WVYYS+ SD G
Sbjct: 278 GPLSYANGVGTSLNCGAKVSDEDED--WVYYSHRKSDVGAGCSDAEDSAAGLVYEASLLP 335
Query: 351 SKRSILPWRKRKLSFRSPKSKGEPLLKKAYGEEGGDDIDFDRRQLSSDESLSP--GKTED 408
+RSILPWRKRKLSFRSPKSKGEPLLKK GEEGGDDIDFDRRQLSSDE+ P K ++
Sbjct: 336 -RRSILPWRKRKLSFRSPKSKGEPLLKKDNGEEGGDDIDFDRRQLSSDEAHPPFGSKIDE 394
Query: 409 DSCAN-RTSISEFGDDNFAVGSWEQKEVMSRDGHMKLQAQVFFASIDQRSERAAGESACT 467
DS AN RTS SEFG+D+FA+GSWE+KEV+SRDGHMKLQ VF ASIDQRSERAAGESACT
Sbjct: 395 DSSANPRTSFSEFGEDSFAIGSWEEKEVISRDGHMKLQTSVFLASIDQRSERAAGESACT 454
Query: 468 ALVAVIADWFQNNHDLMPIKSQFDSLIREGSLEWRNLCENQTYMERFPDKHFDLETVIQA 527
ALVAVIADWFQ N +LMPIKSQFDSLIREGSLEWRNLCEN+TYM++FPDKHFDL+TV+QA
Sbjct: 455 ALVAVIADWFQKNGNLMPIKSQFDSLIREGSLEWRNLCENETYMQKFPDKHFDLDTVLQA 514
Query: 528 KTRPLSVVPGKSFIGFFHPEGM-DEGRFDFLHGAMSFDNIWDEI-----SHNAGHDCTYN 581
K RPL+V+PGKSF+GFFHP+GM +EGRF+FL GAMSFD+IW EI S G +
Sbjct: 515 KIRPLTVIPGKSFVGFFHPDGMINEGRFEFLQGAMSFDSIWAEIISLEESSANGDSYDDD 574
Query: 582 GEPQVYIISWNDHFFILKVEVDAYYIIDTLGERLYEGCNQAYILKFDSNTLIHKMPEVAQ 641
P VYI+SWNDHFF+LKVE +AYYIIDTLGERLYEGC+QAY+LKFD T+IHK+ +
Sbjct: 575 SPPHVYIVSWNDHFFVLKVEKEAYYIIDTLGERLYEGCDQAYVLKFDHKTVIHKILHTEE 634
Query: 642 SSDEKTITDQQQTVADVLENNDKQIQQVNAKEADSVAAXXXXXXXXXXXXXXVVCRGKDA 701
+ E E +S ++ RGK++
Sbjct: 635 AGSE--------------------------SEPES----------------EILSRGKES 652
Query: 702 CKEYIKSFLAAIPIRELQADVKKGLVSSTPLHHRLQIEFHYT 743
CKEYIK+FLAAIPIRELQ D+KKGL S+ P+HHRLQIEFHYT
Sbjct: 653 CKEYIKNFLAAIPIRELQEDIKKGLASTAPVHHRLQIEFHYT 694
>AT5G04860.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11760.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:1411760-1414459
REVERSE LENGTH=782
Length = 782
Score = 583 bits (1502), Expect = e-166, Method: Compositional matrix adjust.
Identities = 363/813 (44%), Positives = 476/813 (58%), Gaps = 97/813 (11%)
Query: 1 MVVKM--MKKWRPWPPLVSRKYEVKLLVRTLQGC--------DLLREGAREGMFA----- 45
MVVKM + +W PWPPL + K++V ++V + G D + R G
Sbjct: 1 MVVKMKQIMRWPPWPPLFAVKFDVIVVVHQMDGLLDSDGGGDDSTDQSQRGGGTTTRKRP 60
Query: 46 -VEIRWKGPKLALSSLRRSAVARNFTKEAAAGCDGDNNNDVVLWDEEFQSFCTLSAYKDN 104
VEI+WKGPK +L+RS V RN T+E DG VV W+EEF+ C S YK+
Sbjct: 61 VVEIKWKGPKSV--TLKRSVV-RNLTEEGGFRGDG-----VVEWNEEFKRVCEFSVYKEG 112
Query: 105 NNAFHPWEIAFTVFNGLNQ--RPKVPVIGTASLNLAEFASVIDQKDFDLNIPLTIPGGSA 162
+F PW ++ TVF+GLNQ + KV G ASLN+AE+ S++ + D + +PL S+
Sbjct: 113 --SFLPWFVSLTVFSGLNQGSKEKVRSFGKASLNIAEYFSLMKEDDVQVKVPLKDCDSSS 170
Query: 163 XXXXXXXXXXXXXXXLRAAQESSELVQKSVVPVA-SPLAQTGETNLAEKDEVSTIKAGLR 221
+ +ES Q+S +PV SPL+ AEK E S +K GLR
Sbjct: 171 VRSPHVHISLQF-----SPKESLPERQRSALPVLWSPLSAE-----AEKAE-SVVKVGLR 219
Query: 222 KVKILTEFVXXXXXXXXXXXXXXXXXNLSA-----RSEDGEYNYPFDSDSLD--DFEEGE 274
K+K + + S R+ D + +YPFD+DSLD D +
Sbjct: 220 KMKTFNNCMSSTQASEKESEKDGSSGSGSDGKSPERNLDSDSSYPFDTDSLDEGDAADES 279
Query: 275 SDEVKEDPNVRKSFSYGKLAYAN-AEGSFYSSIRVKSDDDVDEGWVYYSNH--ISDTGXX 331
+ + + ++ +Y L AN A GSF++ + DE +YYS+ +++TG
Sbjct: 280 EENKENESSLADPVNYKTLRSANWARGSFHTVTNPE-----DEDLIYYSHRSPLAETGHC 334
Query: 332 XXXXXXXXXXXXXXXXXQSSKRSILPWRKRKLSFRSPKSKGEPLLKKAYGEEGGDDIDFD 391
Q SK+ +L W+KRKLSFRSPK KGEPLLKK EEGGDDIDFD
Sbjct: 335 SDEVSNDVVSLEQAKG-QMSKKRMLSWKKRKLSFRSPKQKGEPLLKKDCLEEGGDDIDFD 393
Query: 392 RRQLSS-DESLSPGKTEDDSCANRTSISEFGDDNFAVGSWEQKEVMSRDGHMKLQAQVFF 450
RRQLSS DES S DD+ +S+FGDD+F VGSWE KE++SRDG MKL A+VF
Sbjct: 394 RRQLSSSDESNSDWYRSDDAIMK--PLSQFGDDDFVVGSWETKEIISRDGLMKLTARVFL 451
Query: 451 ASIDQRSERAAGESACTALVAVIADWFQNNHDLMPIKSQFDSLIREGSLEWRNLCENQTY 510
ASIDQRSERAAGESACTALVAV+A W +N D++P +S+FDSLIREGS EWRN+CEN+ Y
Sbjct: 452 ASIDQRSERAAGESACTALVAVMAHWLGSNRDIIPTRSEFDSLIREGSSEWRNMCENEEY 511
Query: 511 MERFPDKHFDLETVIQAKTRPLSVVPGKSFIGFFHP------EGMDEGRFDFLHGAMSFD 564
ERFPDKHFDLETV+QAK RP+ VVP +SFIGFFHP EG ++ DFL G MSFD
Sbjct: 512 RERFPDKHFDLETVLQAKVRPICVVPERSFIGFFHPEKSEEEEGKEDASLDFLKGVMSFD 571
Query: 565 NIWDEISHNAGHDCTYNGEPQVYIISWNDHFFILKVEVDAYYIIDTLGERLYEGCNQAYI 624
+IW+E+ + EP +YI+SWNDHFF+L V DAYYIIDTLGERLYEGCNQAY+
Sbjct: 572 SIWEELMKQEPEESA--SEPVIYIVSWNDHFFVLLVNHDAYYIIDTLGERLYEGCNQAYV 629
Query: 625 LKFDSNTLIHKMPEVAQSSDEKTITDQQQTVADVLENNDKQIQQVNAKEADSVAAXXXXX 684
LKFD + I ++P V I D + + + + + +Q +
Sbjct: 630 LKFDKDAEIKRLPSV--------IKDNKADMGNQKQGGKNKSEQPERSKESEEQE----- 676
Query: 685 XXXXXXXXXVVCRGKDACKEYIKSFLAAIPIRELQADVKKGLVSSTPLHHRLQIEFHYTQ 744
VVCRGK++C+EYIKSFLAAIPI++++AD+KKGLVSS LHHRLQIE HYT+
Sbjct: 677 ------EEEVVCRGKESCREYIKSFLAAIPIQQVKADMKKGLVSS--LHHRLQIELHYTK 728
Query: 745 LLQSYDI---------VPVAEASMTVPETLALA 768
L + V V+EA+++V +LA
Sbjct: 729 HLHHHQPNMFESSATEVTVSEAAVSVTVAWSLA 761
>AT2G10560.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G04860.1); Has 70 Blast hits to 70 proteins in
12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr2:4109862-4110698 REVERSE
LENGTH=278
Length = 278
Score = 240 bits (613), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 136/280 (48%), Positives = 177/280 (63%), Gaps = 40/280 (14%)
Query: 504 LCENQTYMERFPDKHFDLETVIQAKTRPLSVVPGKSFIGFFH------PEGMDEGRFDFL 557
+CEN+ Y ERFPDKHFDLETV+QAK RP+ VVP ++FIGFFH E ++ DFL
Sbjct: 1 MCENEEYRERFPDKHFDLETVLQAKVRPICVVPERTFIGFFHREKSKEEEEKEDVSLDFL 60
Query: 558 HGAMSFDNIWDEISHNAGHDCTYNGEPQVYIISWNDHFFILKVEVDAYYIIDTLGERLYE 617
G MSFD+IW+EI + E +YI+SWNDH+F+L V DAYYIIDTLGER+YE
Sbjct: 61 KGVMSFDSIWEEIMKQEPEESA--SEHVIYIVSWNDHYFVLLVNHDAYYIIDTLGERVYE 118
Query: 618 GCNQAYILKFDSNTLIHKMPEVAQSSDEKTITDQQQTVADVLENNDKQIQQVNAKEADSV 677
GCNQAY+LKFD + I ++P V + ++ + Q+Q +K Q +KE++
Sbjct: 119 GCNQAYVLKFDQDAEIKRLPSVIK-DNKADMGSQKQG------GKNKYEQPERSKESEEQ 171
Query: 678 AAXXXXXXXXXXXXXXVVCRGKDACKEYIKSFLAAIPIRELQADVKKGLVSSTPLHHRLQ 737
VVCRGK++C+EYIKSFLAAIPI++++AD+K+GLVSS HHRLQ
Sbjct: 172 G------------EEVVVCRGKESCREYIKSFLAAIPIQQVKADMKEGLVSS--FHHRLQ 217
Query: 738 IEFHYTQLLQ---------SYDIVPVAEA--SMTVPETLA 766
IE +YT+ L S V V+EA SMTV LA
Sbjct: 218 IELYYTKHLHHRQPNMFESSTTKVTVSEATVSMTVAWLLA 257
>AT2G25460.1 | Symbols: | CONTAINS InterPro DOMAIN/s: C2
calcium-dependent membrane targeting
(InterPro:IPR000008); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT5G04860.1); Has 108
Blast hits to 69 proteins in 11 species: Archae - 0;
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 108;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr2:10833175-10835374 REVERSE LENGTH=423
Length = 423
Score = 133 bits (335), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 74/143 (51%), Positives = 100/143 (69%), Gaps = 6/143 (4%)
Query: 430 WEQKEVMSRDGHMKLQAQVFFASIDQRSERAAGESACTALVAVIADWFQNNHDLM-PIKS 488
W K+++SRDG KL+++V+ ASIDQRSE+AAGE+AC A+ V+A WF N L+ P +
Sbjct: 282 WVMKDLVSRDGKSKLKSEVYLASIDQRSEQAAGEAACAAVAVVVAHWFHANPKLINPSGT 341
Query: 489 QFDSLIREGSLEWRNLCENQTYMERFPDKHFDLETVIQAKTRPLSVVPGKSFIGFFHPEG 548
FDSLI +GS W++LC+ ++Y+ FP++HFDLET++ A RP+ V KSF G F PE
Sbjct: 342 AFDSLITQGSSLWQSLCDKESYLRLFPNRHFDLETIVSANLRPVRVCTDKSFTGLFSPE- 400
Query: 549 MDEGRFDFLHGAMSFDNIWDEIS 571
RF L G MSFD IWDE+S
Sbjct: 401 ----RFASLDGLMSFDQIWDELS 419