Miyakogusa Predicted Gene - Lj1g3v4752850.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4752850.1 Non Chatacterized Hit- tr|I1NAV7|I1NAV7_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.43823 PE,75.76,0,FAMILY
NOT NAMED,NULL; seg,NULL; NT-C2,EEIG1/EHBP1 N-terminal
domain,CUFF.33113.1
(753 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value
AT3G11760.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     745     0.0
AT5G04860.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     570   e-162
AT2G10560.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     253   3e-67
AT2G25460.1 | Symbols: | CONTAINS InterPro DOMAIN/s: C2 calcium...     138   1e-32
>AT3G11760.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 14 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G04860.1); Has 84 Blast hits to 73 proteins in
13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 84; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:3718529-3721123 FORWARD
LENGTH=702
Length = 702
Score = 745 bits (1923), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 413/745 (55%), Positives = 507/745 (68%), Gaps = 73/745 (9%)
Query: 1 MVVKMMRWRPWPP-PSKRFHVRLTVRKLTGCDLLRDD--SRSKLTLEIRWKGPKSSLPSL 57
MVVKMM+WRPWPP ++++ V+L+V+KL G DL+R+ + +LT+EIRWKGPK++L SL
Sbjct: 1 MVVKMMKWRPWPPLVTRKYEVKLSVKKLEGWDLVREGVPEKDRLTVEIRWKGPKATLGSL 60
Query: 58 RWNSVARNFTAEAAVDTTAGAVTW-DEEFQSLCNLTADKHNAFHPWEIAFTLF-NGLNQN 115
R SV RNFT EA ++ V+W DEEFQSLC+LT+ K + F+PWEI F++F NG+ Q
Sbjct: 61 R-RSVKRNFTKEAVGESDV--VSWEDEEFQSLCSLTSYKDSLFYPWEITFSVFTNGMKQG 117
Query: 116 IKRKVPIIGTALLNIAEFASPTDQKDFDLNIPLTLPGG-SVEPSPSLCISISLVEISGAQ 174
K K P++GTA LN+AE+A TD+K+FD+NIPLTL + E P L +S+SL+E+
Sbjct: 118 QKNKAPVVGTAFLNLAEYACVTDKKEFDINIPLTLSACVASETHPLLFVSLSLLELRTTP 177
Query: 175 GSLESVHRTIVPVSSPPAQSG----ETTMAEKGDELSAIKAGLRKVKIFTEYVXXXXXXX 230
+ +S +T V P+ S ET EK D +SAIKAGLRKVKIFTE+V
Sbjct: 178 ETSDSAAQTAVVPLPLPSPSPQQPTETHSVEKED-VSAIKAGLRKVKIFTEFVSTRKAKK 236
Query: 231 XXXXXXXXXXXXXXXXXDGECNYPVXXXXXXXXXXXXXXXXXXXXXFRKSFSYGPLAYAN 290
+G + RKSFSYGPL+YAN
Sbjct: 237 ACREE------------EGRFSSFESSESLDDFETDFDEGKEELMSMRKSFSYGPLSYAN 284
Query: 291 A-GGAFCSNMRVNCDDEGWVYYSHRMSD--AGCLRMEDSTLSSSEPNVQSSMRSILSWRK 347
G + +V+ +DE WVYYSHR SD AGC EDS RSIL WRK
Sbjct: 285 GVGTSLNCGAKVSDEDEDWVYYSHRKSDVGAGCSDAEDSAAGLVYEASLLPRRSILPWRK 344
Query: 348 RKLSFRSPKKANKGEPLLKKAYAEEGGDDIDFDRRQLSSDESLSLRLYKNEDDSCAN-RS 406
RKLSFRSPK +KGEPLLKK EEGGDDIDFDRRQLSSDE+ K ++DS AN R+
Sbjct: 345 RKLSFRSPK--SKGEPLLKKDNGEEGGDDIDFDRRQLSSDEAHPPFGSKIDEDSSANPRT 402
Query: 407 SISEFGDDNFAVGSWEQKEVMSRDGHMKLQTQVFFASIDQRSERAAGESACTALVAVIAD 466
S SEFG+D+FA+GSWE+KEV+SRDGHMKLQT VF ASIDQRSERAAGESACTALVAVIAD
Sbjct: 403 SFSEFGEDSFAIGSWEEKEVISRDGHMKLQTSVFLASIDQRSERAAGESACTALVAVIAD 462
Query: 467 WFQNSPDLMPIKSQFDSLIREGSSEWRSMCDNETYRERFPDKHFDLETVIQAKIRPLSVV 526
WFQ + +LMPIKSQFDSLIREGS EWR++C+NETY ++FPDKHFDL+TV+QAKIRPL+V+
Sbjct: 463 WFQKNGNLMPIKSQFDSLIREGSLEWRNLCENETYMQKFPDKHFDLDTVLQAKIRPLTVI 522
Query: 527 PSKSFIGFFHPEGM-DEEKFDILHGAMSFDNIWDEISGSGHESLSNGE-------PHVYI 578
P KSF+GFFHP+GM +E +F+ L GAMSFD+IW EI S ES +NG+ PHVYI
Sbjct: 523 PGKSFVGFFHPDGMINEGRFEFLQGAMSFDSIWAEII-SLEESSANGDSYDDDSPPHVYI 581
Query: 579 VSWNDHFFILKVEADCYYIIDTLGERLYEGCNQAYILKFDSSTVIHKMQNAAKSSLEDKT 638
VSWNDHFF+LKVE + YYIIDTLGERLYEGC+QAY+LKFD TVIHK+ + ++
Sbjct: 582 VSWNDHFFVLKVEKEAYYIIDTLGERLYEGCDQAYVLKFDHKTVIHKILHTEEAG----- 636
Query: 639 TSNQQTVAEVLERDNSKEVDSSSGVAEQQEEEVVLCRGKEACKEYIKSFLAAIPIRELQA 698
+E + E +L RGKE+CKEYIK+FLAAIPIRELQ
Sbjct: 637 -------------------------SESEPESEILSRGKESCKEYIKNFLAAIPIRELQE 671
Query: 699 DVKKGLVLMSSTQVHHRLQIEFHYT 723
D+KKGL S+ VHHRLQIEFHYT
Sbjct: 672 DIKKGLA--STAPVHHRLQIEFHYT 694
>AT5G04860.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11760.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:1411760-1414459
REVERSE LENGTH=782
Length = 782
Score = 570 bits (1470), Expect = e-162, Method: Compositional matrix adjust.
Identities = 359/793 (45%), Positives = 464/793 (58%), Gaps = 74/793 (9%)
Query: 1 MVVKM---MRWRPWPPP-SKRFHVRLTVRKLTGC---DLLRDDS------------RSKL 41
MVVKM MRW PWPP + +F V + V ++ G D DDS R +
Sbjct: 1 MVVKMKQIMRWPPWPPLFAVKFDVIVVVHQMDGLLDSDGGGDDSTDQSQRGGGTTTRKRP 60
Query: 42 TLEIRWKGPKSSLPSLRWNSVARNFTAEAAVDTTAGAVTWDEEFQSLCNLTADKHNAFHP 101
+EI+WKGPKS +L+ SV RN T E G V W+EEF+ +C + K +F P
Sbjct: 61 VVEIKWKGPKSV--TLK-RSVVRNLTEEGGF-RGDGVVEWNEEFKRVCEFSVYKEGSFLP 116
Query: 102 WEIAFTLFNGLNQNIKRKVPIIGTALLNIAEFASPTDQKDFDLNIPLTLPGGSVEPSPSL 161
W ++ T+F+GLNQ K KV G A LNIAE+ S + D + +PL S SP +
Sbjct: 117 WFVSLTVFSGLNQGSKEKVRSFGKASLNIAEYFSLMKEDDVQVKVPLKDCDSSSVRSPHV 176
Query: 162 CISISLVEISGAQGSLESVHRTIVPVSSPPAQSGETTMAEKGDELSAIKAGLRKVKIFTE 221
IS+ + SL R+ +PV P S E AE S +K GLRK+K F
Sbjct: 177 HISLQF----SPKESLPERQRSALPVLWSPL-SAEAEKAE-----SVVKVGLRKMKTFNN 226
Query: 222 YVXXXXXXXXXXXXXXXXXXXX-----XXXXDGECNYPVXXXXXXX--XXXXXXXXXXXX 274
+ D + +YP
Sbjct: 227 CMSSTQASEKESEKDGSSGSGSDGKSPERNLDSDSSYPFDTDSLDEGDAADESEENKENE 286
Query: 275 XXFRKSFSYGPLAYAN-AGGAFCSNMRVNCDDEGWVYYSHR--MSDAGCLRME--DSTLS 329
+Y L AN A G+F + N +DE +YYSHR +++ G E + +S
Sbjct: 287 SSLADPVNYKTLRSANWARGSF--HTVTNPEDEDLIYYSHRSPLAETGHCSDEVSNDVVS 344
Query: 330 SSEPNVQSSMRSILSWRKRKLSFRSPKKANKGEPLLKKAYAEEGGDDIDFDRRQLSSDES 389
+ Q S + +LSW+KRKLSFRSPK+ KGEPLLKK EEGGDDIDFDRRQLSS +
Sbjct: 345 LEQAKGQMSKKRMLSWKKRKLSFRSPKQ--KGEPLLKKDCLEEGGDDIDFDRRQLSSSDE 402
Query: 390 LSLRLYKNEDDSCANRSSISEFGDDNFAVGSWEQKEVMSRDGHMKLQTQVFFASIDQRSE 449
+ Y+++D A +S+FGDD+F VGSWE KE++SRDG MKL +VF ASIDQRSE
Sbjct: 403 SNSDWYRSDD---AIMKPLSQFGDDDFVVGSWETKEIISRDGLMKLTARVFLASIDQRSE 459
Query: 450 RAAGESACTALVAVIADWFQNSPDLMPIKSQFDSLIREGSSEWRSMCDNETYRERFPDKH 509
RAAGESACTALVAV+A W ++ D++P +S+FDSLIREGSSEWR+MC+NE YRERFPDKH
Sbjct: 460 RAAGESACTALVAVMAHWLGSNRDIIPTRSEFDSLIREGSSEWRNMCENEEYRERFPDKH 519
Query: 510 FDLETVIQAKIRPLSVVPSKSFIGFFHP------EGMDEEKFDILHGAMSFDNIWDEISG 563
FDLETV+QAK+RP+ VVP +SFIGFFHP EG ++ D L G MSFD+IW+E+
Sbjct: 520 FDLETVLQAKVRPICVVPERSFIGFFHPEKSEEEEGKEDASLDFLKGVMSFDSIWEELMK 579
Query: 564 SGHESLSNGEPHVYIVSWNDHFFILKVEADCYYIIDTLGERLYEGCNQAYILKFDSSTVI 623
E S EP +YIVSWNDHFF+L V D YYIIDTLGERLYEGCNQAY+LKFD I
Sbjct: 580 QEPEE-SASEPVIYIVSWNDHFFVLLVNHDAYYIIDTLGERLYEGCNQAYVLKFDKDAEI 638
Query: 624 HKMQNAAKSSLEDKTTSNQQTVAEVLERDNSKEVDSSSGVAEQQEEEVVLCRGKEACKEY 683
++ + K + D NQ+ + N E S +E+QEEE V+CRGKE+C+EY
Sbjct: 639 KRLPSVIKDNKAD--MGNQKQGGK-----NKSEQPERSKESEEQEEEEVVCRGKESCREY 691
Query: 684 IKSFLAAIPIRELQADVKKGLVLMSSTQVHHRLQIEFHYTQLLQ----SCPATPAVELEA 739
IKSFLAAIPI++++AD+KKGLV + +HHRLQIE HYT+ L + + A E+
Sbjct: 692 IKSFLAAIPIQQVKADMKKGLV----SSLHHRLQIELHYTKHLHHHQPNMFESSATEVTV 747
Query: 740 SIAATPETLALAI 752
S AA T+A ++
Sbjct: 748 SEAAVSVTVAWSL 760
>AT2G10560.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G04860.1); Has 70 Blast hits to 70 proteins in
12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr2:4109862-4110698 REVERSE
LENGTH=278
Length = 278
Score = 253 bits (646), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 129/239 (53%), Positives = 166/239 (69%), Gaps = 18/239 (7%)
Query: 495 MCDNETYRERFPDKHFDLETVIQAKIRPLSVVPSKSFIGFFH------PEGMDEEKFDIL 548
MC+NE YRERFPDKHFDLETV+QAK+RP+ VVP ++FIGFFH E ++ D L
Sbjct: 1 MCENEEYRERFPDKHFDLETVLQAKVRPICVVPERTFIGFFHREKSKEEEEKEDVSLDFL 60
Query: 549 HGAMSFDNIWDEISGSGHESLSNGEPHVYIVSWNDHFFILKVEADCYYIIDTLGERLYEG 608
G MSFD+IW+EI E S E +YIVSWNDH+F+L V D YYIIDTLGER+YEG
Sbjct: 61 KGVMSFDSIWEEIMKQEPEE-SASEHVIYIVSWNDHYFVLLVNHDAYYIIDTLGERVYEG 119
Query: 609 CNQAYILKFDSSTVIHKMQNAAKSSLEDKTTSNQQTVAEVLERDNSKEVDSSSGVAEQQE 668
CNQAY+LKFD I ++ + K + D + Q + + + SKE +E+Q
Sbjct: 120 CNQAYVLKFDQDAEIKRLPSVIKDNKADMGSQKQGGKNKYEQPERSKE-------SEEQG 172
Query: 669 EEVVLCRGKEACKEYIKSFLAAIPIRELQADVKKGLVLMSSTQVHHRLQIEFHYTQLLQ 727
EEVV+CRGKE+C+EYIKSFLAAIPI++++AD+K+GLV + HHRLQIE +YT+ L
Sbjct: 173 EEVVVCRGKESCREYIKSFLAAIPIQQVKADMKEGLV----SSFHHRLQIELYYTKHLH 227
>AT2G25460.1 | Symbols: | CONTAINS InterPro DOMAIN/s: C2
calcium-dependent membrane targeting
(InterPro:IPR000008); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT5G04860.1); Has 108
Blast hits to 69 proteins in 11 species: Archae - 0;
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 108;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr2:10833175-10835374 REVERSE LENGTH=423
Length = 423
Score = 138 bits (348), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 76/143 (53%), Positives = 102/143 (71%), Gaps = 6/143 (4%)
Query: 421 WEQKEVMSRDGHMKLQTQVFFASIDQRSERAAGESACTALVAVIADWFQNSPDLM-PIKS 479
W K+++SRDG KL+++V+ ASIDQRSE+AAGE+AC A+ V+A WF +P L+ P +
Sbjct: 282 WVMKDLVSRDGKSKLKSEVYLASIDQRSEQAAGEAACAAVAVVVAHWFHANPKLINPSGT 341
Query: 480 QFDSLIREGSSEWRSMCDNETYRERFPDKHFDLETVIQAKIRPLSVVPSKSFIGFFHPEG 539
FDSLI +GSS W+S+CD E+Y FP++HFDLET++ A +RP+ V KSF G F P
Sbjct: 342 AFDSLITQGSSLWQSLCDKESYLRLFPNRHFDLETIVSANLRPVRVCTDKSFTGLFSP-- 399
Query: 540 MDEEKFDILHGAMSFDNIWDEIS 562
E+F L G MSFD IWDE+S
Sbjct: 400 ---ERFASLDGLMSFDQIWDELS 419
Score = 64.7 bits (156), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 80/164 (48%), Gaps = 18/164 (10%)
Query: 16 KRFHVRLTVRKLTGCD-LLRDDSRSK---LTLEIRWKGPKSS-----LPSLRWNSVARNF 66
++ HV + +L G +L D++ K +E++WKGP S +P R N N
Sbjct: 6 RKLHVTVKPVRLDGLPAILGDETAGKNLSAMVEVKWKGPVSGFGLGFVPFYRSNRPV-NH 64
Query: 67 TAEAAVDTTAGAVTWDEEFQSLCNLTADKHNAFHPWEIAFTLFNGLNQNIKRKVPIIGTA 126
T+ + + V W+EEF+ +C + PW ++F +F G N + K K +IG A
Sbjct: 65 TSSKPIALGSNHVEWEEEFERVCCIVG-------PWNLSFNVFYGENMDAKNKKSLIGKA 117
Query: 127 LLNIAEFASPTDQKDFDLNIPLTLPGGSVEPSPSLCISISLVEI 170
L+++E AS + + +P+ G + +L ++++ E+
Sbjct: 118 SLDLSELAS-KQESTVERKLPIRSKGSVLSKEATLVVNVTFSEV 160