Miyakogusa Predicted Gene
- Lj0g3v0325869.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0325869.1 Non Characterized Hit- tr|D7U2W3|D7U2W3_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,32.5,0.0000000000005,SUBFAMILY NOT NAMED,NULL;
NFRKB-RELATED,Nuclear factor related to kappa-B-binding protein;
seg,NULL,CUFF.22145.1
(389 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score   E
Sequences producing significant alignments:                        (bits)  Value
AT3G45830.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    320  1e-87
AT1G02290.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    121  7e-28
AT5G13950.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     73  3e-13
AT5G13950.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     73  3e-13
AT5G13950.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     73  4e-13
>AT3G45830.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G02290.1); Has 499 Blast hits to 438 proteins
in 100 species: Archae - 0; Bacteria - 7; Metazoa - 236;
Fungi - 15; Plants - 108; Viruses - 2; Other Eukaryotes
- 131 (source: NCBI BLink). | chr3:16841277-16845173
FORWARD LENGTH=1298
Length = 1298
Score = 320 bits (819), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 172/332 (51%), Positives = 210/332 (63%), Gaps = 15/332 (4%)
Query: 1   MAIEKNSFKVTRLDSEGSPLSRETMSSDEDEARRGNSAVXXXXXXXXXXXXXXXXX---X 57
           MAIEK++ KV+R D E S  S ++MSS E+  RR NS V
Sbjct: 1   MAIEKSNVKVSRFDLEYSHGSGDSMSSYEE--RRKNSVVNNVDSEDEDDDFDEDDSGAGS 58

Query: 58  XXXXLLELGETGAEFCQIGNQTCSIPLELYDLAGLEDILSVDVWNEILSEEERLELAKYL 117
               LLEL ETGAEFCQ+GN TCSIP ELYDL  LEDILSVDVWNE L+E+ER  L+ YL
Sbjct: 59  DDFDLLELAETGAEFCQVGNVTCSIPFELYDLPSLEDILSVDVWNECLTEKERFSLSSYL 118

Query: 118 PDVDQETFVQSLRELFTGCNLHFGSPVKKLFDMLKGGLCEPRVALYREGLNVFQKKQHYH 177
           PDVDQ TF+++L+ELF GCN HFGSPVKKLFDMLKGG CEPR  LY EG ++F + +HYH
Sbjct: 119 PDVDQLTFMRTLKELFEGCNFHFGSPVKKLFDMLKGGQCEPRNTLYLEGRSLFLRTKHYH 178

Query: 178 LLRKHQNNMVSSLCQIRDAWRNCRGYSIEERLRVLNITRSQKSLMYXXXXXXXXXXXXXX 237
            LRK+ N+MV +LCQ RDAW +C+GYSI+E+LRVLNI +SQK+LM
Sbjct: 179 SLRKYHNDMVVNLCQTRDAWTSCKGYSIDEKLRVLNIVKSQKTLMREKKDDFEDDSSEKD 238

Query: 238 XXV--IWSRKNKDRKSAAQKTGRYPLHGVGSGLELQPRGRSAVLEQEKYGKQNPKGILKL 295
                 W RK KDRKS  +K  R+  +GV SGLE  PR + A +EQ+ YGK   K
Sbjct: 239 EPFDKPWGRKGKDRKSTQKKLARHAGYGVDSGLEF-PRRQLAAVEQDLYGKPKSK----- 292

Query: 296 AGPKIPSAKDPTGNFSSAYHALDMNPGLNGSA 327
             PK P AK   G +++ Y+   MN   N S+
Sbjct: 293 --PKFPFAKTSVGPYATGYNGYGMNSAYNPSS 322
>AT1G02290.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G45830.1); Has 134 Blast hits to 134 proteins
in 37 species: Archae - 0; Bacteria - 0; Metazoa - 54;
Fungi - 0; Plants - 78; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:450646-451977 REVERSE
LENGTH=443
Length = 443
Score = 121 bits (304), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 85/133 (63%)
Query: 65  LGETGAEFCQIGNQTCSIPLELYDLAGLEDILSVDVWNEILSEEERLELAKYLPDVDQET 124
           + +   E   +  Q C+IP ELYDL  L  ILSV+ WN +L+EEER  L+ +LPD+D +T
Sbjct: 21  IAQVNCELALVEGQLCNIPYELYDLPDLTGILSVETWNSLLTEEERFFLSCFLPDMDPQT 80

Query: 125 FVQSLRELFTGCNLHFGSPVKKLFDMLKGGLCEPRVALYREGLNVFQKKQHYHLLRKHQN 184
           F  +++EL  G NL+FG+P  K +  L GGL  P+VA ++EG+   +++++Y+ L+ +
Sbjct: 81  FSLTMQELLDGANLYFGNPEDKFYKNLLGGLFTPKVACFKEGVMFVKRRKYYYSLKFYHE 140

Query: 185 NMVSSLCQIRDAW 197
            ++ +  +++  W
Sbjct: 141 KLIRTFTEMQRVW 153
>AT5G13950.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G02290.1); Has 147 Blast hits to 145 proteins
in 44 species: Archae - 0; Bacteria - 2; Metazoa - 56;
Fungi - 6; Plants - 81; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr5:4496196-4500206 REVERSE
LENGTH=939
Length = 939
Score = 72.8 bits (177), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 65/125 (52%), Gaps = 4/125 (3%)
Query: 78  QTCSIPLELYDLAGLEDILSVDVWNEILSEEERLELAKYLPD-VDQETFVQSLRELFTGC 136
           Q C +P E + L  L ++LS +VW   LS+ ER  L ++LP+ VD E  VQ+   L  G
Sbjct: 85  QVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYLRQFLPEGVDVEQVVQA---LLDGE 141

Query: 137 NLHFGSPVKKLFDMLKGGLCEPRVALYREGLNVFQKKQHYHLLRKHQNNMVSSLCQIRDA 196
           N HFG+P       +  G   P   + RE      K+++Y  L K+  +++  L  +++
Sbjct: 142 NFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRYYSNLEKYHQDIIDYLQTLKEK 201

Query: 197 WRNCR 201
           W +C+
Sbjct: 202 WESCK 206
>AT5G13950.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G02290.1); Has 147 Blast hits to 145 proteins
in 44 species: Archae - 0; Bacteria - 2; Metazoa - 56;
Fungi - 6; Plants - 81; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr5:4496196-4500206 REVERSE
LENGTH=939
Length = 939
Score = 72.8 bits (177), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 65/125 (52%), Gaps = 4/125 (3%)
Query: 78  QTCSIPLELYDLAGLEDILSVDVWNEILSEEERLELAKYLPD-VDQETFVQSLRELFTGC 136
           Q C +P E + L  L ++LS +VW   LS+ ER  L ++LP+ VD E  VQ+   L  G
Sbjct: 85  QVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYLRQFLPEGVDVEQVVQA---LLDGE 141

Query: 137 NLHFGSPVKKLFDMLKGGLCEPRVALYREGLNVFQKKQHYHLLRKHQNNMVSSLCQIRDA 196
           N HFG+P       +  G   P   + RE      K+++Y  L K+  +++  L  +++
Sbjct: 142 NFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRYYSNLEKYHQDIIDYLQTLKEK 201

Query: 197 WRNCR 201
           W +C+
Sbjct: 202 WESCK 206
>AT5G13950.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G02290.1). | chr5:4496196-4500206 REVERSE
LENGTH=954
Length = 954
Score = 72.8 bits (177), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 65/125 (52%), Gaps = 4/125 (3%)
Query: 78  QTCSIPLELYDLAGLEDILSVDVWNEILSEEERLELAKYLPD-VDQETFVQSLRELFTGC 136
           Q C +P E + L  L ++LS +VW   LS+ ER  L ++LP+ VD E  VQ+   L  G
Sbjct: 85  QVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYLRQFLPEGVDVEQVVQA---LLDGE 141

Query: 137 NLHFGSPVKKLFDMLKGGLCEPRVALYREGLNVFQKKQHYHLLRKHQNNMVSSLCQIRDA 196
           N HFG+P       +  G   P   + RE      K+++Y  L K+  +++  L  +++
Sbjct: 142 NFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRYYSNLEKYHQDIIDYLQTLKEK 201

Query: 197 WRNCR 201
           W +C+
Sbjct: 202 WESCK 206