Miyakogusa Predicted Gene
- Lj6g3v0433920.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v0433920.1 tr|D7MH52|D7MH52_ARALL Predicted protein
OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_658867
PE,43.44,6e-19, ,CUFF.57825.1
(150 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G70780.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 135 8e-33
AT1G23150.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 119 7e-28
AT5G37730.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 113 6e-26
AT2G01554.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 61 3e-10
AT2G27830.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 53 8e-08
AT4G22758.1 | Symbols: | unknown protein; LOCATED IN: chloropla... 49 2e-06
>AT1G70780.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: mitochondrion;
EXPRESSED IN: sperm cell, male gametophyte, pollen tube;
EXPRESSED DURING: L mature pollen stage, M germinated
pollen stage; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G23150.1); Has 143 Blast
hits to 143 proteins in 17 species: Archae - 0; Bacteria
- 0; Metazoa - 0; Fungi - 0; Plants - 143; Viruses - 0;
Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:26695462-26695975 REVERSE LENGTH=140
Length = 140
Score = 135 bits (341), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 70/131 (53%), Positives = 88/131 (67%), Gaps = 3/131 (2%)
Query: 23 DQKAKTNRFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVLGFDASNFL 82
+Q AK NR LI++ +LGSAGPIRFV E+ LV+ VIDTALK YAREGRLP+LG D ++FL
Sbjct: 10 NQNAKGNRILISVTVLGSAGPIRFVAYEDDLVASVIDTALKGYAREGRLPLLGSDFNDFL 69
Query: 83 LYRANAGFDALNPLEPIGSYGERNFVLCKKPV---YHPSKTEPQSELLSQKSSGGWKAWL 139
LY G +AL+ + IGS G RNF+LC+KP S S + + G KAW+
Sbjct: 70 LYCPMVGPEALSTWDAIGSLGARNFMLCRKPEEKKVEESNGRSDSTINGARKGGSLKAWI 129
Query: 140 NKSFGLKILSH 150
NKSF LK+ SH
Sbjct: 130 NKSFNLKVSSH 140
>AT1G23150.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G70780.1); Has 124 Blast hits to 124 proteins
in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 124; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:8206948-8207461 FORWARD
LENGTH=141
Length = 141
Score = 119 bits (298), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 69/134 (51%), Positives = 87/134 (64%), Gaps = 6/134 (4%)
Query: 23 DQKAKTNRFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVLGFDASNFL 82
+Q K NR LI++ LGSAGPIRFV NE LV+ VIDTALK YAREGRLP+LG D ++F+
Sbjct: 8 NQIVKGNRILISVTFLGSAGPIRFVANEGDLVASVIDTALKCYAREGRLPILGSDFNDFV 67
Query: 83 LYRANAGFDALNPLEPIGSYGERNFVLCKKPVYHPSKTEPQSEL-----LSQKSSGG-WK 136
Y G AL+P E IGS G RNF+LCKK E + ++K +GG +K
Sbjct: 68 FYCPMVGPGALSPWEAIGSVGVRNFMLCKKKPEEKKVEEDKGRSNFPINGARKGAGGSFK 127
Query: 137 AWLNKSFGLKILSH 150
AW+NKS LK+ +H
Sbjct: 128 AWINKSLRLKVTTH 141
>AT5G37730.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G23150.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:14986141-14986884
REVERSE LENGTH=182
Length = 182
Score = 113 bits (282), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 57/123 (46%), Positives = 75/123 (60%), Gaps = 1/123 (0%)
Query: 15 NRGSVYSDDQKAKTNRFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVL 74
NR K K + L+++N+LGS GPIRF+ NE+ VS I+T LK+YAR+GR+PVL
Sbjct: 2 NRNENVKGVMKRKNKKLLVSVNVLGSVGPIRFLANEDDEVSSAINTTLKAYARQGRIPVL 61
Query: 75 GFDASNFLLYRANAGFDALNPLEPIGSYGERNFVLCKKPVYHPSKTEPQSELLSQKSSGG 134
GFD NF+ Y NAGF+ L+P E IGS NF++CKK K E E + + G
Sbjct: 62 GFDVDNFIFYSINAGFNTLHPQEKIGSMDVTNFLMCKKEPRPLEKVEGIRESRA-RIGHG 120
Query: 135 WKA 137
WK
Sbjct: 121 WKT 123
>AT2G01554.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G70780.1);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr2:249687-250181 FORWARD
LENGTH=105
Length = 105
Score = 60.8 bits (146), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 64/134 (47%), Gaps = 38/134 (28%)
Query: 20 YSDDQKAKT-NRFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVLGFDA 78
+ +QK+KT NRFL++IN+LGSAG + ++P
Sbjct: 7 HQRNQKSKTTNRFLVSINVLGSAG----------------------FHFSDQIPAF---- 40
Query: 79 SNFLLYRANAGFDALN-PLE-PIGSYGERNFVLCKKPVYHPSKTEPQSELLSQKSSGGWK 136
+ F AL PL+ IGS G RNFVL K + + ++K+SG WK
Sbjct: 41 ---------SDFIALTVPLKGKIGSTGSRNFVLSNKLETQNLEDSMMTTTTTRKTSGRWK 91
Query: 137 AWLNKSFGLKILSH 150
AWLNKSFGL + SH
Sbjct: 92 AWLNKSFGLMVPSH 105
>AT2G27830.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G22758.1); Has 131 Blast hits to 131 proteins
in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 131; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr2:11860734-11861306 FORWARD
LENGTH=190
Length = 190
Score = 52.8 bits (125), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 1/87 (1%)
Query: 27 KTNRFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVLG-FDASNFLLYR 85
+ + L+ + + GS G ++ +++ E VS +ID A++ Y +E R P L + S F L+
Sbjct: 68 RLTKLLLNVTVQGSLGAVQIIISPESTVSDLIDAAVRQYVKEARRPFLPESEPSRFDLHY 127
Query: 86 ANAGFDALNPLEPIGSYGERNFVLCKK 112
+ +++ E + S G RNF LC +
Sbjct: 128 SQFSLESIVRDEKLISLGSRNFFLCGR 154
>AT4G22758.1 | Symbols: | unknown protein; LOCATED IN: chloroplast;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT2G27830.1). | chr4:11958477-11959904
FORWARD LENGTH=255
Length = 255
Score = 48.5 bits (114), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 1/83 (1%)
Query: 30 RFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVLGFDASNFLLYRANAG 89
+ +I++ + GS GP+R +V V I + Y +EGR P L D++ F L++++
Sbjct: 123 KVIISVAVEGSPGPVRAMVKLSCNVEETIKIVVDKYCKEGRTPKLDRDSA-FELHQSHFS 181
Query: 90 FDALNPLEPIGSYGERNFVLCKK 112
L E IG G R+F + KK
Sbjct: 182 IQCLEKREIIGELGSRSFYMRKK 204