Miyakogusa Predicted Gene
- Lj1g3v2461220.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2461220.1 Non Characterized Hit- tr|I1N372|I1N372_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.41267 PE,80,0,seg,NULL;
coiled-coil,NULL,CUFF.29100.1
(876 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value
AT3G51650.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    622   e-178
AT3G51640.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    613   e-175
AT3G51640.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    151   2e-36
>AT3G51650.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G51640.1); Has 27645 Blast hits to 15097
proteins in 1246 species: Archae - 44; Bacteria - 3367;
Metazoa - 10036; Fungi - 2690; Plants - 1205; Viruses -
196; Other Eukaryotes - 10107 (source: NCBI BLink). |
chr3:19159449-19162267 FORWARD LENGTH=842
Length = 842
Score = 622 bits (1604), Expect = e-178, Method: Compositional matrix adjust.
Identities = 376/891 (42%), Positives = 488/891 (54%), Gaps = 64/891 (7%)
Query: 1 MCILCVIQKLSRRVATVLPWLVIPLIGLWALSQLLPPAFRFEITSPRLACVIVLLVTLFW 60
MCILCVIQK SR+VAT+LPW VIPLIGLWALSQLLPPAFRFEITSPRLACV VLLVTLFW
Sbjct: 1 MCILCVIQKWSRQVATMLPWFVIPLIGLWALSQLLPPAFRFEITSPRLACVFVLLVTLFW 60
Query: 61 YEVLMPQLSAWRAKRSARLRERKRSEAIELQKLRKTATRRCRNCLNPYRDQNPGGGRFMC 120
YEVLMPQLS WR +R+A+LRER+R EAIELQKL+K ATRRCRNC NPYRDQNPGGG+FMC
Sbjct: 61 YEVLMPQLSTWRVRRNAQLRERERLEAIELQKLKKNATRRCRNCSNPYRDQNPGGGKFMC 120
Query: 121 SYCGHVSKRXXXXXXXXXXXXISNSGIVKDLVGKSGKILNSKVWSENGWMCSQDWLENGN 180
SYCGHVSKR IS SGI+KDLVG+ GK+LN K WSENG++ Q+W +N
Sbjct: 121 SYCGHVSKRPVLDMALSSGLEISGSGILKDLVGRGGKMLNGKGWSENGYLHRQEWSDNST 180
Query: 181 WVGGSIPGNSSSWRTNENGGVYGDEHCLTERSYSGTLFFVCKLFTSFLLSIRWLWRKIFR 240
W G SS WR N GDE+CL E+SYSG + F C+L TSF +SI WLWRKIFR
Sbjct: 181 WTSG-----SSYWRNNSGDTFEGDENCLVEKSYSGGVVFACRLLTSFFMSILWLWRKIFR 235
Query: 241 VSSR-EECSSDAEHRALLAKQGENGASLNESXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 299
SS + S D E R +LA+QGENG S +ES
Sbjct: 236 FSSSVGDSSLDPEQRRMLARQGENGTSSHES--RVEKARRKAEEKRQARLEKEHSEEEER 293
Query: 300 XXXXXXXXXXXXXXXXXXXXXXXDRCRSSNPSKEKNSXXXXXXXXXXXXXXXXXGSSKSN 359
++C + + ++ SSKSN
Sbjct: 294 KQREEVARLVEERRRLRDEILEAEKCSKFSVAAKEKDTKEAEKKRQERRKERDRASSKSN 353
Query: 360 SDVEELERKAGKESERKRDLDKKSEMDRREHQKHGLESAKGQSTDHAHS---KNVIANNR 416
SD EE++++ KE+E+KR L+K D EH++H ++ +G + + H +N + +N
Sbjct: 354 SDGEEVDKRTRKETEQKRGLNKS---DHLEHERHAPDNLRGPNMERRHGHGLENNVTSNG 410
Query: 417 GSTGTRYLDRMRGTILSSSKAFG----FGRGTNVSATVAKDNKLSSSVDHFHTAASRRDI 472
+G RY DRM+ T SSSKAF FGRG N SAT A++NK + S D+ HT A I
Sbjct: 411 TKSGGRYFDRMKSTTFSSSKAFTDSRIFGRGVNTSATFARENKPTGSADNSHTYAHSSHI 470
Query: 473 CPPERPTAKSNLNADDRNINNSVLPEPQPWRAPIMSWQQLFTRSPTVPQSSNSNVICRPN 532
PP+ KS N ++RN NN V+ EP+P R P SW QLF RS P SSN N I RP+
Sbjct: 471 NPPDFVAMKSVPNEEERNTNNPVVSEPKPSREPRKSWHQLFARSTPAPVSSNVNTISRPS 530
Query: 533 SKVQVETKSPQSSGQSPVTQSFNNPIHFGLPSPFKISTHPNGSTSTSLGFSPAIEPLFSP 592
+ Q + Q Q ++F+N I FGLPSPF I + +GST++SLGFSP E +F
Sbjct: 531 TNPQPNVQISQVPSQVSSIRTFDNSISFGLPSPFTIPVYSSGSTTSSLGFSPPTEFVFPQ 590
Query: 593 AGSTSLDLRHDEQELFEDPCYDPDPVSLLGPVSESLDNFQLDLGSGFGTDMEVSKPHSLK 652
G E E FEDPCY PDP+SLLGPVSESLD +G+ T + K H++K
Sbjct: 591 PG---------EDERFEDPCYVPDPISLLGPVSESLD----LRAAGYETGIGQVKYHAMK 637
Query: 653 NISAGSDVNRLSPIESPLSREKHNCSNWFSSTPKGQDMHSSFMDDAAASEKGTWQMWSTS 712
N + + N+ SPIESPLSR + D + G+WQMW +
Sbjct: 638 N-TPSCEANKPSPIESPLSRSR--------------------AADEKQANDGSWQMWKSP 676
Query: 713 PXXXXXXXXXXXXXXXXXXQMNIPTKDDFVLPSSQNTMASFFNKDD-NIISSNHSSQNVF 771
+++ ++ + + Q+ S F+K+D + +S + +
Sbjct: 677 LGQNGLGLVGGSANWVLPSEISRSIEESDMHHAPQHRTESLFSKEDCQLHQGAYSQRKDY 736
Query: 772 VPNVHSGSNFSPVTVSSSYDPWLQSALFPPLS------TGFTAQEAATQNEIIYGSPSAS 825
+ + FSP+T ++ DPW Q FP LS + T ++ N Y SP+ S
Sbjct: 737 LEHDQRSGVFSPITGPTTTDPWSQKMFFPALSGIESPFSITTQTKSVLNNAAGYRSPTGS 796
Query: 826 VSSHVLEGSPANSWSKKEWPIHGSAESVGKPSSVSKTHDGLHPTSDLQSIW 876
+ E N W KK + S + GK V + + D++S W
Sbjct: 797 GPDNPFEHPSPNHWLKK---VKSSGDGTGK--QVLAAGEVENHQKDVESFW 842
>AT3G51640.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G51650.1); Has 26208 Blast hits to 14155
proteins in 1229 species: Archae - 43; Bacteria - 3230;
Metazoa - 9456; Fungi - 2551; Plants - 1160; Viruses -
177; Other Eukaryotes - 9591 (source: NCBI BLink). |
chr3:19154294-19157134 FORWARD LENGTH=842
Length = 842
Score = 613 bits (1582), Expect = e-175, Method: Compositional matrix adjust.
Identities = 376/889 (42%), Positives = 481/889 (54%), Gaps = 60/889 (6%)
Query: 1 MCILCVIQKLSRRVATVLPWLVIPLIGLWALSQLLPPAFRFEITSPRLACVIVLLVTLFW 60
MCILC IQK SR+VAT+LPW VIPLIGLWALSQLLPPAFRFEITSPRLACV VLLVTLFW
Sbjct: 1 MCILCGIQKWSRQVATMLPWFVIPLIGLWALSQLLPPAFRFEITSPRLACVFVLLVTLFW 60
Query: 61 YEVLMPQLSAWRAKRSARLRERKRSEAIELQKLRKTATRRCRNCLNPYRDQNPGGGRFMC 120
YEVLMPQLS WR +R+A+LRER+R EAIELQKL+K ATRRCRNC NPYRDQNPGGG+FMC
Sbjct: 61 YEVLMPQLSTWRVRRNAQLRERERLEAIELQKLKKNATRRCRNCSNPYRDQNPGGGKFMC 120
Query: 121 SYCGHVSKRXXXXXXXXXXXXISNSGIVKDLVGKSGKILNSKVWSENGWMCSQDWLENGN 180
SYCGHVSKR IS SGI+KDLVG+ GK+LN K WSENG++ Q+W +N
Sbjct: 121 SYCGHVSKRPVLDMALSSGLEISGSGILKDLVGRGGKMLNGKGWSENGYLHRQEWSDNST 180
Query: 181 WVGGSIPGNSSSWRTNENGGVYGDEHCLTERSYSGTLFFVCKLFTSFLLSIRWLWRKIFR 240
W G SS WR N GDE+CL E+SYSG + F C+L TSF +SI WLWRKIFR
Sbjct: 181 WTSG-----SSYWRNNSGDTFEGDENCLVEKSYSGGVVFACRLLTSFFMSILWLWRKIFR 235
Query: 241 VSSR-EECSSDAEHRALLAKQGENGASLNESXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 299
SS + S D E R +LA+QGENG S +ES
Sbjct: 236 FSSSVGDSSLDPEQRRMLARQGENGTSCHES--RVEKARRKAEEKRQARLEKEHSEEEER 293
Query: 300 XXXXXXXXXXXXXXXXXXXXXXXDRCRSSNPSKEKNSXXXXXXXXXXXXXXXXXGSSKSN 359
++C + + ++ SSKSN
Sbjct: 294 KQREEVARLVEERRRLRDEILEAEKCSKLSVAAKEKDTKEAEKKRQERRKERDRASSKSN 353
Query: 360 SDVEELERKAGKESERKRDLDKKSEMDRREHQKHGLESAKGQSTDHAHS-KNVIANNRGS 418
SD EE++++ KE+E+KR L K +++ H L H H +N + +N
Sbjct: 354 SDGEEVDKRTRKETEQKRGLYKSDHLEQERHAPDNLR-VPNMERRHGHGLENNVTSNGTK 412
Query: 419 TGTRYLDRMRGTILSSSKAFG----FGRGTNVSATVAKDNKLSSSVDHFHTAASRRDICP 474
+G RY DRM+GT LSSSKAF FGRG N SAT+A++NK S D+ HT A P
Sbjct: 413 SGGRYFDRMKGTFLSSSKAFTDSRLFGRGVNTSATIARENKPIGSADNSHTYAHSSHTNP 472
Query: 475 PERPTAKSNLNADDRNINNSVLPEPQPWRAPIMSWQQLFTRSPTVPQSSNSNVICRPNSK 534
PE K N ++RN NN V+ EP+P R P SW QLF RS P SSN N I RP++
Sbjct: 473 PEFVAMKYVPNEEERNTNNPVVSEPKPSREPKKSWHQLFARSTPAPVSSNVNTISRPSTN 532
Query: 535 VQVETKSPQSSGQSPVTQSFNNPIHFGLPSPFKISTHPNGSTSTSLGFSPAIEPLFSPAG 594
Q +S Q Q ++F+NPI FGLPSPF I + +GST++SLGFSP E +F G
Sbjct: 533 PQPNVQSSQVPSQVSSIRTFDNPISFGLPSPFTIPVYSSGSTTSSLGFSPPTELVFPQPG 592
Query: 595 STSLDLRHDEQELFEDPCYDPDPVSLLGPVSESLDNFQLDLGSGFGTDMEVSKPHSLKNI 654
E E FEDPCY PDP+SLLGPVSESLD +G+ T + K ++KN
Sbjct: 593 ---------EDERFEDPCYVPDPISLLGPVSESLD----LRAAGYETGIGQVKYQAMKN- 638
Query: 655 SAGSDVNRLSPIESPLSREKHNCSNWFSSTPKGQDMHSSFMDDAAASEKGTWQMWSTSPX 714
+ + N+ SPIESPLSR + D + G+WQMW +
Sbjct: 639 TPSCEANKPSPIESPLSRSR--------------------AADEKQANDGSWQMWKSPLG 678
Query: 715 XXXXXXXXXXXXXXXXXQMNIPTKDDFVLPSSQNTMASFFNKDD-NIISSNHSSQNVFVP 773
+++ ++ + + Q+ S F+K+D + +S + ++
Sbjct: 679 QNGLGLVGGSANWVIPSEISRSIEESDMHHAPQHRTESLFSKEDCQLHQGAYSQRKDYLE 738
Query: 774 NVHSGSNFSPVTVSSSYDPWLQSALFPPLS------TGFTAQEAATQNEIIYGSPSASVS 827
+ FSP+T ++ DPW Q FP LS + T ++ N Y SP+ S S
Sbjct: 739 HDQRSGVFSPITGPTTTDPWSQKMFFPALSGIESPFSTTTQTKSVLNNAAGYRSPTGSGS 798
Query: 828 SHVLEGSPANSWSKKEWPIHGSAESVGKPSSVSKTHDGLHPTSDLQSIW 876
+ E N W KK + S GK V + + D++S W
Sbjct: 799 DNPFEHPSPNHWLKK---VKSSGNGSGK--QVLAAGEVENHQKDVESFW 842
>AT3G51640.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT3G51650.1); Has 34 Blast hits to 34
proteins in 11 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 1; Plants - 32; Viruses - 0; Other
Eukaryotes - 1 (source: NCBI BLink). |
chr3:19153918-19157134 FORWARD LENGTH=359
Length = 359
Score = 151 bits (382), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 123/356 (34%), Positives = 173/356 (48%), Gaps = 41/356 (11%)
Query: 494 SVLPEPQPWRAPIMSWQQLFTRSPTVPQSSNSNVICRPNSKVQVETKSPQSSGQSPVTQS 553
SV+ EP+P R P SW QLF RS P SSN N I RP++ Q +S Q Q ++
Sbjct: 9 SVVSEPKPSREPKKSWHQLFARSTPAPVSSNVNTISRPSTNPQPNVQSSQVPSQVSSIRT 68
Query: 554 FNNPIHFGLPSPFKISTHPNGSTSTSLGFSPAIEPLFSPAGSTSLDLRHDEQELFEDPCY 613
F+NPI FGLPSPF I + +GST++SLGFSP E +F G E E FEDPCY
Sbjct: 69 FDNPISFGLPSPFTIPVYSSGSTTSSLGFSPPTELVFPQPG---------EDERFEDPCY 119
Query: 614 DPDPVSLLGPVSESLDNFQLDLGSGFGTDMEVSKPHSLKNISAGSDVNRLSPIESPLSRE 673
PDP+SLLGPVSESLD +G+ T + K ++KN + + N+ SPIESPLSR
Sbjct: 120 VPDPISLLGPVSESLDL----RAAGYETGIGQVKYQAMKN-TPSCEANKPSPIESPLSRS 174
Query: 674 KHNCSNWFSSTPKGQDMHSSFMDDAAASEKGTWQMWSTSPXXXXXXXXXXXXXXXXXXQM 733
+ D + G+WQMW + ++
Sbjct: 175 R--------------------AADEKQANDGSWQMWKSPLGQNGLGLVGGSANWVIPSEI 214
Query: 734 NIPTKDDFVLPSSQNTMASFFNKDD-NIISSNHSSQNVFVPNVHSGSNFSPVTVSSSYDP 792
+ ++ + + Q+ S F+K+D + +S + ++ + FSP+T ++ DP
Sbjct: 215 SRSIEESDMHHAPQHRTESLFSKEDCQLHQGAYSQRKDYLEHDQRSGVFSPITGPTTTDP 274
Query: 793 WLQSALFPPLS------TGFTAQEAATQNEIIYGSPSASVSSHVLEGSPANSWSKK 842
W Q FP LS + T ++ N Y SP+ S S + E N W KK
Sbjct: 275 WSQKMFFPALSGIESPFSTTTQTKSVLNNAAGYRSPTGSGSDNPFEHPSPNHWLKK 330