Miyakogusa Predicted Gene - Lj1g3v2447050.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2447050.1 CUFF.29030.1
(877 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits), E value
AT3G51650.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 357 2e-98
AT3G51640.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 354 2e-97
AT3G51640.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 128 1e-29
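The three summary lines above end in a bit score and an E-value, so they can be pulled apart by splitting from the right. The following is a minimal Python sketch, not part of the BLAST output; the names `SUMMARY` and `parse_hit_line` are hypothetical, and it assumes every summary line ends with exactly two whitespace-separated numeric fields. Legacy BLAST may print very small E-values without a leading digit (for example `e-100`), which the sketch pads with `1`.

```python
# Summary lines copied from the report above (description is truncated by BLAST).
SUMMARY = """\
AT3G51650.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 357 2e-98
AT3G51640.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 354 2e-97
AT3G51640.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 128 1e-29
"""


def parse_hit_line(line):
    """Split one summary line into (subject_id, bit_score, e_value).

    Assumes the last two whitespace-separated tokens are the bit score
    and the E-value, and the first token is the subject identifier.
    """
    description, score, evalue = line.rsplit(None, 2)
    if evalue.startswith("e"):  # e.g. "e-100" -> "1e-100"
        evalue = "1" + evalue
    subject_id = description.split()[0]
    return subject_id, float(score), float(evalue)


hits = [parse_hit_line(line) for line in SUMMARY.splitlines() if line.strip()]

for subject_id, score, evalue in hits:
    print(subject_id, score, evalue)
```

Splitting from the right avoids having to interpret the free-text description, which itself contains spaces, pipes, and semicolons.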
>AT3G51650.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G51640.1); Has 27645 Blast hits to 15097
proteins in 1246 species: Archae - 44; Bacteria - 3367;
Metazoa - 10036; Fungi - 2690; Plants - 1205; Viruses -
196; Other Eukaryotes - 10107 (source: NCBI BLink). |
chr3:19159449-19162267 FORWARD LENGTH=842
Length = 842
Score = 357 bits (916), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 175/272 (64%), Positives = 199/272 (73%), Gaps = 11/272 (4%)
Query: 1 MCILCVIQKCSRRVATMLPWLVVPLIGLWALSQLLPPAFRFEITSPRLGCVLVLLGTLFW 60
MCILCVIQK SR+VATMLPW V+PLIGLWALSQLLPPAFRFEITSPRL CV VLL TLFW
Sbjct: 1 MCILCVIQKWSRQVATMLPWFVIPLIGLWALSQLLPPAFRFEITSPRLACVFVLLVTLFW 60
Query: 61 YEILMPQLSXXXXXXXXXXXXXXXFEAIEMQKLRKTATRRCRNCLNPYKDQNPGGGRFMC 120
YE+LMPQLS EAIE+QKL+K ATRRCRNC NPY+DQNPGGG+FMC
Sbjct: 61 YEVLMPQLSTWRVRRNAQLRERERLEAIELQKLKKNATRRCRNCSNPYRDQNPGGGKFMC 120
Query: 121 SYCGHVSKRPVLDLP----VPISNSGIVKDLVGKSGKILNSKVWCENGWMCSQEWLENSN 176
SYCGHVSKRPVLD+ + IS SGI+KDLVG+ GK+LN K W ENG++ QEW +NS
Sbjct: 121 SYCGHVSKRPVLDMALSSGLEISGSGILKDLVGRGGKMLNGKGWSENGYLHRQEWSDNST 180
Query: 177 WVGGSILGNPSKWRMNGNAGIFRGDEHCLTERSYSSLLVFVCNLLTYFFLSIRWLWRKAC 236
W GS S WR N + F GDE+CL E+SYS +VF C LLT FF+SI WLWRK
Sbjct: 181 WTSGS-----SYWR-NNSGDTFEGDENCLVEKSYSGGVVFACRLLTSFFMSILWLWRKIF 234
Query: 237 RISSREGSLS-DAEHRALLAKQSEDGVSLNES 267
R SS G S D E R +LA+Q E+G S +ES
Sbjct: 235 RFSSSVGDSSLDPEQRRMLARQGENGTSSHES 266
Score = 191 bits (485), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 157/435 (36%), Positives = 212/435 (48%), Gaps = 54/435 (12%)
Query: 407 HGKNVATNSYNRGS-SGTRYLDRMRGTILSSSKAFG----FGRGANVPTTVVKESKLNSS 461
HG + N + G+ SG RY DRM+ T SSSKAF FGRG N T +E+K S
Sbjct: 398 HGHGLENNVTSNGTKSGGRYFDRMKSTTFSSSKAFTDSRIFGRGVNTSATFARENKPTGS 457
Query: 462 VDHVHTAASRRGTCPPDLPMAKSNLNGDDRNTIHSVLPEPPQAWTEPKKSWQQLFTRXXX 521
D+ HT A PPD KS N ++RNT + V+ EP + EP+KSW QLF R
Sbjct: 458 ADNSHTYAHSSHINPPDFVAMKSVPNEEERNTNNPVVSEPKPS-REPRKSWHQLFARSTP 516
Query: 522 XXXXXXXNVICRPNSKSQ-EAKSPPLSGQLPFTESFNNPIQFGLQSPFNVSAFPNGSTSC 580
N I RP++ Q + + Q+ +F+N I FGL SPF + + +GST+
Sbjct: 517 APVSSNVNTISRPSTNPQPNVQISQVPSQVSSIRTFDNSISFGLPSPFTIPVYSSGSTTS 576
Query: 581 SLGFTPAIERLFSPVKNPSHDFRHEEQELFEDPCYVPDPVSLLGPVFESLD----NFQLD 636
SLGF+P E +F P D E FEDPCYVPDP+SLLGPV ESLD ++
Sbjct: 577 SLGFSPPTEFVFP---QPGED------ERFEDPCYVPDPISLLGPVSESLDLRAAGYETG 627
Query: 637 LGSGFPHS-SNNPSIGSDVHKPSPIESPLSREKHSYSNQFQSTPQAQDTHAFPMDGVSAN 695
+G H+ N PS + +KPSPIESPLSR + + Q AN
Sbjct: 628 IGQVKYHAMKNTPSC--EANKPSPIESPLSRSRAADEKQ-------------------AN 666
Query: 696 EKGTWQMWSSSPLVQEXXXXXXXXXXXXXXXQRNLPNYVDSVLPSPQKTIASVF-DEDNS 754
+ G+WQMW SPL Q + + + +PQ S+F ED
Sbjct: 667 D-GSWQMW-KSPLGQNGLGLVGGSANWVLPSEISRSIEESDMHHAPQHRTESLFSKEDCQ 724
Query: 755 IISSTHSPQNIFLPNGRKSGGTISPITCSSGYEPWLQQSTFFPQLSS-------CLKAQE 807
+ +S + +L + ++S G SPIT + +PW Q+ FFP LS + +
Sbjct: 725 LHQGAYSQRKDYLEHDQRS-GVFSPITGPTTTDPWSQK-MFFPALSGIESPFSITTQTKS 782
Query: 808 SAQNEMIYRSPSGSA 822
N YRSP+GS
Sbjct: 783 VLNNAAGYRSPTGSG 797
>AT3G51640.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G51650.1); Has 26208 Blast hits to 14155
proteins in 1229 species: Archae - 43; Bacteria - 3230;
Metazoa - 9456; Fungi - 2551; Plants - 1160; Viruses -
177; Other Eukaryotes - 9591 (source: NCBI BLink). |
chr3:19154294-19157134 FORWARD LENGTH=842
Length = 842
Score = 354 bits (908), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 174/272 (63%), Positives = 198/272 (72%), Gaps = 11/272 (4%)
Query: 1 MCILCVIQKCSRRVATMLPWLVVPLIGLWALSQLLPPAFRFEITSPRLGCVLVLLGTLFW 60
MCILC IQK SR+VATMLPW V+PLIGLWALSQLLPPAFRFEITSPRL CV VLL TLFW
Sbjct: 1 MCILCGIQKWSRQVATMLPWFVIPLIGLWALSQLLPPAFRFEITSPRLACVFVLLVTLFW 60
Query: 61 YEILMPQLSXXXXXXXXXXXXXXXFEAIEMQKLRKTATRRCRNCLNPYKDQNPGGGRFMC 120
YE+LMPQLS EAIE+QKL+K ATRRCRNC NPY+DQNPGGG+FMC
Sbjct: 61 YEVLMPQLSTWRVRRNAQLRERERLEAIELQKLKKNATRRCRNCSNPYRDQNPGGGKFMC 120
Query: 121 SYCGHVSKRPVLDLP----VPISNSGIVKDLVGKSGKILNSKVWCENGWMCSQEWLENSN 176
SYCGHVSKRPVLD+ + IS SGI+KDLVG+ GK+LN K W ENG++ QEW +NS
Sbjct: 121 SYCGHVSKRPVLDMALSSGLEISGSGILKDLVGRGGKMLNGKGWSENGYLHRQEWSDNST 180
Query: 177 WVGGSILGNPSKWRMNGNAGIFRGDEHCLTERSYSSLLVFVCNLLTYFFLSIRWLWRKAC 236
W GS S WR N + F GDE+CL E+SYS +VF C LLT FF+SI WLWRK
Sbjct: 181 WTSGS-----SYWR-NNSGDTFEGDENCLVEKSYSGGVVFACRLLTSFFMSILWLWRKIF 234
Query: 237 RISSREGSLS-DAEHRALLAKQSEDGVSLNES 267
R SS G S D E R +LA+Q E+G S +ES
Sbjct: 235 RFSSSVGDSSLDPEQRRMLARQGENGTSCHES 266
Score = 195 bits (496), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 161/436 (36%), Positives = 218/436 (50%), Gaps = 54/436 (12%)
Query: 407 HGKNVATNSYNRGS-SGTRYLDRMRGTILSSSKAFG----FGRGANVPTTVVKESKLNSS 461
HG + N + G+ SG RY DRM+GT LSSSKAF FGRG N T+ +E+K S
Sbjct: 398 HGHGLENNVTSNGTKSGGRYFDRMKGTFLSSSKAFTDSRLFGRGVNTSATIARENKPIGS 457
Query: 462 VDHVHTAASRRGTCPPDLPMAKSNLNGDDRNTIHSVLPEPPQAWTEPKKSWQQLFTRXXX 521
D+ HT A T PP+ K N ++RNT + V+ EP + EPKKSW QLF R
Sbjct: 458 ADNSHTYAHSSHTNPPEFVAMKYVPNEEERNTNNPVVSEPKPS-REPKKSWHQLFARSTP 516
Query: 522 XXXXXXXNVICRPNSKSQ-EAKSPPLSGQLPFTESFNNPIQFGLQSPFNVSAFPNGSTSC 580
N I RP++ Q +S + Q+ +F+NPI FGL SPF + + +GST+
Sbjct: 517 APVSSNVNTISRPSTNPQPNVQSSQVPSQVSSIRTFDNPISFGLPSPFTIPVYSSGSTTS 576
Query: 581 SLGFTPAIERLFSPVKNPSHDFRHEEQELFEDPCYVPDPVSLLGPVFESLD----NFQLD 636
SLGF+P E +F P D E FEDPCYVPDP+SLLGPV ESLD ++
Sbjct: 577 SLGFSPPTELVFP---QPGED------ERFEDPCYVPDPISLLGPVSESLDLRAAGYETG 627
Query: 637 LGS-GFPHSSNNPSIGSDVHKPSPIESPLSREKHSYSNQFQSTPQAQDTHAFPMDGVSAN 695
+G + N PS + +KPSPIESPLSR + + Q AN
Sbjct: 628 IGQVKYQAMKNTPSC--EANKPSPIESPLSRSRAADEKQ-------------------AN 666
Query: 696 EKGTWQMWSSSPLVQEXXXXXXXXXXXXXXXQRNLPNYVDSVLPSPQKTIASVF-DEDNS 754
+ G+WQMW SPL Q + + + +PQ S+F ED
Sbjct: 667 D-GSWQMW-KSPLGQNGLGLVGGSANWVIPSEISRSIEESDMHHAPQHRTESLFSKEDCQ 724
Query: 755 IISSTHSPQNIFLPNGRKSGGTISPITCSSGYEPWLQQSTFFPQL-------SSCLKAQE 807
+ +S + +L + ++S G SPIT + +PW Q+ FFP L S+ + +
Sbjct: 725 LHQGAYSQRKDYLEHDQRS-GVFSPITGPTTTDPWSQK-MFFPALSGIESPFSTTTQTKS 782
Query: 808 SAQNEMIYRSPSGSAS 823
N YRSP+GS S
Sbjct: 783 VLNNAAGYRSPTGSGS 798
>AT3G51640.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT3G51650.1); Has 34 Blast hits to 34
proteins in 11 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 1; Plants - 32; Viruses - 0; Other
Eukaryotes - 1 (source: NCBI BLink). |
chr3:19153918-19157134 FORWARD LENGTH=359
Length = 359
Score = 128 bits (322), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 118/341 (34%), Positives = 163/341 (47%), Gaps = 48/341 (14%)
Query: 502 PQAWTEPKKSWQQLFTRXXXXXXXXXXNVICRPNSKSQ-EAKSPPLSGQLPFTESFNNPI 560
P+ EPKKSW QLF R N I RP++ Q +S + Q+ +F+NPI
Sbjct: 14 PKPSREPKKSWHQLFARSTPAPVSSNVNTISRPSTNPQPNVQSSQVPSQVSSIRTFDNPI 73
Query: 561 QFGLQSPFNVSAFPNGSTSCSLGFTPAIERLFSPVKNPSHDFRHEEQELFEDPCYVPDPV 620
FGL SPF + + +GST+ SLGF+P E +F + E E FEDPCYVPDP+
Sbjct: 74 SFGLPSPFTIPVYSSGSTTSSLGFSPPTELVFP---------QPGEDERFEDPCYVPDPI 124
Query: 621 SLLGPVFESLD----NFQLDLGS-GFPHSSNNPSIGSDVHKPSPIESPLSREKHSYSNQF 675
SLLGPV ESLD ++ +G + N PS + +KPSPIESPLSR + + Q
Sbjct: 125 SLLGPVSESLDLRAAGYETGIGQVKYQAMKNTPSC--EANKPSPIESPLSRSRAADEKQ- 181
Query: 676 QSTPQAQDTHAFPMDGVSANEKGTWQMWSSSPLVQEXXXXXXXXXXXXXXXQRNLPNYVD 735
AN+ G+WQMW SPL Q + +
Sbjct: 182 ------------------AND-GSWQMW-KSPLGQNGLGLVGGSANWVIPSEISRSIEES 221
Query: 736 SVLPSPQKTIASVFD-EDNSIISSTHSPQNIFLPNGRKSGGTISPITCSSGYEPWLQQST 794
+ +PQ S+F ED + +S + +L + ++S G SPIT + +PW Q+
Sbjct: 222 DMHHAPQHRTESLFSKEDCQLHQGAYSQRKDYLEHDQRS-GVFSPITGPTTTDPWSQK-M 279
Query: 795 FFPQL-------SSCLKAQESAQNEMIYRSPSGSASSRVHE 828
FFP L S+ + + N YRSP+GS S E
Sbjct: 280 FFPALSGIESPFSTTTQTKSVLNNAAGYRSPTGSGSDNPFE 320
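The "Identities", "Positives", and "Gaps" lines in the alignments above each give a count, an alignment length, and a percentage. The percentages in this report are consistent with truncation rather than rounding (for example, 212/435 is about 48.7% but is printed as 48%). The following minimal Python sketch, with the hypothetical names `STATS` and `check_stats_line`, recomputes the truncated percentage so it can be compared with the printed one; it assumes the single-space field layout shown in this report.

```python
import re

# Matches e.g. "Identities = 175/272 (64%)"; layout assumed from this report.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def check_stats_line(line):
    """Return {field: (count, length, reported_pct, truncated_pct)}."""
    result = {}
    for name, count, length, pct in STATS.findall(line):
        count, length, pct = int(count), int(length), int(pct)
        # Integer division truncates, matching the percentages printed here.
        result[name] = (count, length, pct, 100 * count // length)
    return result


# Statistics lines copied from three of the alignments above.
lines = [
    "Identities = 175/272 (64%), Positives = 199/272 (73%), Gaps = 11/272 (4%)",
    "Identities = 157/435 (36%), Positives = 212/435 (48%), Gaps = 54/435 (12%)",
    "Identities = 118/341 (34%), Positives = 163/341 (47%), Gaps = 48/341 (14%)",
]

checked = [check_stats_line(line) for line in lines]

for fields in checked:
    for name, (count, length, reported, truncated) in fields.items():
        print(name, count, length, reported, truncated)
```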