Miyakogusa Predicted Gene
- Lj0g3v0082949.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0082949.1 CUFF.4391.1
(356 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G14830.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 472 e-133
AT3G14830.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 472 e-133
AT1G53450.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 445 e-125
AT1G53450.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 445 e-125
>AT3G14830.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G53450.2);
Has 35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr3:4983386-4985666 FORWARD
LENGTH=476
Length = 476
Score = 472 bits (1214), Expect = e-133, Method: Compositional matrix adjust.
Identities = 237/356 (66%), Positives = 289/356 (81%), Gaps = 4/356 (1%)
Query: 1 MVGGGGDNLKSRMRSGVVVHEDLGLGSLSEKLRSHGFSETVTVSGGGTVDAELGGFNLGS 60
+V GD R+ V+ DLG+ + E+LR GFS+T + + E+ L +
Sbjct: 125 VVSVDGDKSTRSHRAYVITKGDLGMAT--ERLRDSGFSKTDDTASVTMSEEEVADSYLRA 182
Query: 61 AGHLGRRQGIINFTSTYDSRTQEVEGSVAARGDLWRVEASHGGSASTSGNENSSLFLVQL 120
AG LGR +G I+ +S+YDSRT +E S+AARGDLWRVEASH S+ST+ + NSSLFL+QL
Sbjct: 183 AGLLGRSKGTIDTSSSYDSRTNGMEHSLAARGDLWRVEASH--SSSTASDGNSSLFLLQL 240
Query: 121 GPLLFIRDSTLLLPVHLSKQHLLWYGYDRKNGMHSLCPAVWSKHRRWLLMSMLCLNPLAC 180
GPLLF+RDSTLLLP+HLSKQHLLWYGYDRK GMHSLCPA+WSKHRRWL+MSML LNPLAC
Sbjct: 241 GPLLFLRDSTLLLPLHLSKQHLLWYGYDRKKGMHSLCPAIWSKHRRWLMMSMLSLNPLAC 300
Query: 181 SFVDLQFPNGQLTYVSGEGLTTSAFLPVCGGLLQAQGQYPGEMKFSFSCKNKWGTRITPM 240
SF+DLQFPNGQLTYVSGEGLTTSAF+P CGGLLQAQGQYPG+M+FS+SCKNK GTRITPM
Sbjct: 301 SFMDLQFPNGQLTYVSGEGLTTSAFVPFCGGLLQAQGQYPGDMRFSYSCKNKCGTRITPM 360
Query: 241 VQWPDKSFSLGLAQALAWKRSGLIVRPTIQFSVCPTLGGSNPGLRAELIHSVKEKLSLIC 300
V WPDKSF L L+Q LAW+RSGL+++PTIQ SVCPT GGSNPG++AE+IHS+ + L+LIC
Sbjct: 361 VHWPDKSFGLDLSQPLAWRRSGLLMKPTIQVSVCPTFGGSNPGIKAEVIHSLSDDLNLIC 420
Query: 301 GCAFMTYPSAFASVSIGKSKWNGNVGNYGLVLRVDAPLCTVGRPSFSIQINSGIEF 356
G A +PSAFASV+ G+SKWNGN+G G+V+R D PL ++G+PSFSIQ+N+ EF
Sbjct: 421 GYALNAHPSAFASVAFGRSKWNGNIGRTGIVVRADTPLASIGQPSFSIQLNNAFEF 476
>AT3G14830.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G53450.2); Has 73 Blast hits to 73 proteins in
12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 73; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:4983386-4985666 FORWARD
LENGTH=476
Length = 476
Score = 472 bits (1214), Expect = e-133, Method: Compositional matrix adjust.
Identities = 237/356 (66%), Positives = 289/356 (81%), Gaps = 4/356 (1%)
Query: 1 MVGGGGDNLKSRMRSGVVVHEDLGLGSLSEKLRSHGFSETVTVSGGGTVDAELGGFNLGS 60
+V GD R+ V+ DLG+ + E+LR GFS+T + + E+ L +
Sbjct: 125 VVSVDGDKSTRSHRAYVITKGDLGMAT--ERLRDSGFSKTDDTASVTMSEEEVADSYLRA 182
Query: 61 AGHLGRRQGIINFTSTYDSRTQEVEGSVAARGDLWRVEASHGGSASTSGNENSSLFLVQL 120
AG LGR +G I+ +S+YDSRT +E S+AARGDLWRVEASH S+ST+ + NSSLFL+QL
Sbjct: 183 AGLLGRSKGTIDTSSSYDSRTNGMEHSLAARGDLWRVEASH--SSSTASDGNSSLFLLQL 240
Query: 121 GPLLFIRDSTLLLPVHLSKQHLLWYGYDRKNGMHSLCPAVWSKHRRWLLMSMLCLNPLAC 180
GPLLF+RDSTLLLP+HLSKQHLLWYGYDRK GMHSLCPA+WSKHRRWL+MSML LNPLAC
Sbjct: 241 GPLLFLRDSTLLLPLHLSKQHLLWYGYDRKKGMHSLCPAIWSKHRRWLMMSMLSLNPLAC 300
Query: 181 SFVDLQFPNGQLTYVSGEGLTTSAFLPVCGGLLQAQGQYPGEMKFSFSCKNKWGTRITPM 240
SF+DLQFPNGQLTYVSGEGLTTSAF+P CGGLLQAQGQYPG+M+FS+SCKNK GTRITPM
Sbjct: 301 SFMDLQFPNGQLTYVSGEGLTTSAFVPFCGGLLQAQGQYPGDMRFSYSCKNKCGTRITPM 360
Query: 241 VQWPDKSFSLGLAQALAWKRSGLIVRPTIQFSVCPTLGGSNPGLRAELIHSVKEKLSLIC 300
V WPDKSF L L+Q LAW+RSGL+++PTIQ SVCPT GGSNPG++AE+IHS+ + L+LIC
Sbjct: 361 VHWPDKSFGLDLSQPLAWRRSGLLMKPTIQVSVCPTFGGSNPGIKAEVIHSLSDDLNLIC 420
Query: 301 GCAFMTYPSAFASVSIGKSKWNGNVGNYGLVLRVDAPLCTVGRPSFSIQINSGIEF 356
G A +PSAFASV+ G+SKWNGN+G G+V+R D PL ++G+PSFSIQ+N+ EF
Sbjct: 421 GYALNAHPSAFASVAFGRSKWNGNIGRTGIVVRADTPLASIGQPSFSIQLNNAFEF 476
>AT1G53450.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G14830.2); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr1:19951747-19953839 REVERSE LENGTH=453
Length = 453
Score = 445 bits (1144), Expect = e-125, Method: Compositional matrix adjust.
Identities = 216/321 (67%), Positives = 265/321 (82%), Gaps = 2/321 (0%)
Query: 36 GFSETVTVSGGGTVDAELGGFNLGSAGHLGRRQGIINFTSTYDSRTQEVEGSVAARGDLW 95
FS+T T S G + ++ F+L + G R +G + +S+Y++RT +E S+AARGDLW
Sbjct: 135 AFSKTDTASSGTVYEEKVTEFDLRTIGLHRRAKGTVELSSSYETRTSSMEHSLAARGDLW 194
Query: 96 RVEASHGGSASTSGNENSSLFLVQLGPLLFIRDSTLLLPVHLSKQHLLWYGYDRKNGMHS 155
RVEAS S S +++SSLFL+QLGPLLF+RDSTLLLPVHLSKQHLLWYGYDRK GMHS
Sbjct: 195 RVEAS--TSNSPVRDDSSSLFLLQLGPLLFLRDSTLLLPVHLSKQHLLWYGYDRKKGMHS 252
Query: 156 LCPAVWSKHRRWLLMSMLCLNPLACSFVDLQFPNGQLTYVSGEGLTTSAFLPVCGGLLQA 215
LCPA+WSKHRRWL+MSMLCLNPL CSFVDLQFPNGQLTYVSGEGLTTS F+P+CGGLLQA
Sbjct: 253 LCPALWSKHRRWLMMSMLCLNPLDCSFVDLQFPNGQLTYVSGEGLTTSVFVPLCGGLLQA 312
Query: 216 QGQYPGEMKFSFSCKNKWGTRITPMVQWPDKSFSLGLAQALAWKRSGLIVRPTIQFSVCP 275
QGQYPG+M+FSFSCK+K GTRITPM+ WPDKS +LG++QALAW+RSG++++P IQ SVC
Sbjct: 313 QGQYPGDMRFSFSCKSKQGTRITPMINWPDKSLALGVSQALAWRRSGVMLKPAIQLSVCS 372
Query: 276 TLGGSNPGLRAELIHSVKEKLSLICGCAFMTYPSAFASVSIGKSKWNGNVGNYGLVLRVD 335
T GGSNPG++ E+I S+ + +++ICGCAF +PS FASVS G+SKWNGN+G G+V+R D
Sbjct: 373 TFGGSNPGIKTEVIQSLNDNINMICGCAFTAHPSTFASVSFGRSKWNGNIGRTGIVVRAD 432
Query: 336 APLCTVGRPSFSIQINSGIEF 356
PL V RPSFSIQIN+ EF
Sbjct: 433 TPLPNVARPSFSIQINNAFEF 453
>AT1G53450.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G14830.2); Has 71 Blast hits to 71 proteins in
12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 71; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr1:19951747-19953839 REVERSE
LENGTH=453
Length = 453
Score = 445 bits (1144), Expect = e-125, Method: Compositional matrix adjust.
Identities = 216/321 (67%), Positives = 265/321 (82%), Gaps = 2/321 (0%)
Query: 36 GFSETVTVSGGGTVDAELGGFNLGSAGHLGRRQGIINFTSTYDSRTQEVEGSVAARGDLW 95
FS+T T S G + ++ F+L + G R +G + +S+Y++RT +E S+AARGDLW
Sbjct: 135 AFSKTDTASSGTVYEEKVTEFDLRTIGLHRRAKGTVELSSSYETRTSSMEHSLAARGDLW 194
Query: 96 RVEASHGGSASTSGNENSSLFLVQLGPLLFIRDSTLLLPVHLSKQHLLWYGYDRKNGMHS 155
RVEAS S S +++SSLFL+QLGPLLF+RDSTLLLPVHLSKQHLLWYGYDRK GMHS
Sbjct: 195 RVEAS--TSNSPVRDDSSSLFLLQLGPLLFLRDSTLLLPVHLSKQHLLWYGYDRKKGMHS 252
Query: 156 LCPAVWSKHRRWLLMSMLCLNPLACSFVDLQFPNGQLTYVSGEGLTTSAFLPVCGGLLQA 215
LCPA+WSKHRRWL+MSMLCLNPL CSFVDLQFPNGQLTYVSGEGLTTS F+P+CGGLLQA
Sbjct: 253 LCPALWSKHRRWLMMSMLCLNPLDCSFVDLQFPNGQLTYVSGEGLTTSVFVPLCGGLLQA 312
Query: 216 QGQYPGEMKFSFSCKNKWGTRITPMVQWPDKSFSLGLAQALAWKRSGLIVRPTIQFSVCP 275
QGQYPG+M+FSFSCK+K GTRITPM+ WPDKS +LG++QALAW+RSG++++P IQ SVC
Sbjct: 313 QGQYPGDMRFSFSCKSKQGTRITPMINWPDKSLALGVSQALAWRRSGVMLKPAIQLSVCS 372
Query: 276 TLGGSNPGLRAELIHSVKEKLSLICGCAFMTYPSAFASVSIGKSKWNGNVGNYGLVLRVD 335
T GGSNPG++ E+I S+ + +++ICGCAF +PS FASVS G+SKWNGN+G G+V+R D
Sbjct: 373 TFGGSNPGIKTEVIQSLNDNINMICGCAFTAHPSTFASVSFGRSKWNGNIGRTGIVVRAD 432
Query: 336 APLCTVGRPSFSIQINSGIEF 356
PL V RPSFSIQIN+ EF
Sbjct: 433 TPLPNVARPSFSIQINNAFEF 453