Miyakogusa Predicted Gene
- Lj5g3v2112060.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v2112060.1 Non Chatacterized Hit- tr|F6H0B1|F6H0B1_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,26.83,1e-18,coiled-coil,NULL; seg,NULL; DUF760,Protein of unknown
function DUF760,CUFF.56696.1
(378 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G14910.1 | Symbols: | unknown protein; LOCATED IN: chloropla... 422 e-118
AT2G14910.2 | Symbols: | unknown protein; LOCATED IN: chloropla... 370 e-103
AT5G14970.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 199 2e-51
AT1G63610.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 103 2e-22
AT1G63610.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 100 2e-21
>AT2G14910.1 | Symbols: | unknown protein; LOCATED IN: chloroplast;
EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G14970.1); Has 605 Blast
hits to 425 proteins in 102 species: Archae - 0;
Bacteria - 300; Metazoa - 25; Fungi - 0; Plants - 89;
Viruses - 0; Other Eukaryotes - 191 (source: NCBI
BLink). | chr2:6406957-6409015 REVERSE LENGTH=386
Length = 386
Score = 422 bits (1086), Expect = e-118, Method: Compositional matrix adjust.
Identities = 227/333 (68%), Positives = 262/333 (78%), Gaps = 16/333 (4%)
Query: 53 SFGDFT-----SKPKKSVLRELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSD 107
S DFT PKK VL +LIQEIEPLDVS IQKDVP TT DAMKRTISGMLGLLPSD
Sbjct: 63 SLDDFTLHSDSRSPKKCVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGMLGLLPSD 122
Query: 108 QFHVVIEALWEPLSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQ 167
+F V IE+LWEPLSKLL+SSMMTGYTLRNAEYRL LEKNLDM L+ +E+T D++
Sbjct: 123 RFQVHIESLWEPLSKLLVSSMMTGYTLRNAEYRLFLEKNLDMSGGGLDSHASENTEYDME 182
Query: 168 GLLHDSVNVIDFGRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELH 227
G D +V S S+ + + E +D + LG +SSEAQ+YIL LQS+LSS+KKEL
Sbjct: 183 GTFPDEDHV-----SSKRDSRTQNLSETIDEEGLGRVSSEAQEYILRLQSQLSSVKKELQ 237
Query: 228 EVKRKNAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATL 287
E++RKNAALQMQQFVGEEKNDLLDYLRSLQPE+VA+LSE +PE+KE I SVVHGLLATL
Sbjct: 238 EMRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLLATL 297
Query: 288 SPKMHSK-PSTMSENATIGTANAGS-EDCAEVVENSSIQFHPVISLTRDYLARLLFWCML 345
SPKMHSK P+ SE T A S EDCAE+VEN+S+QF P+ISLTRDYLARLLFWCML
Sbjct: 298 SPKMHSKFPA--SEVPPTETVKAKSDEDCAELVENTSLQFQPLISLTRDYLARLLFWCML 355
Query: 346 LGHYLRGLEYRVDLTELLSLTSDAENNGNEQIA 378
LGHYLRGLEYR++L E+LSLT DA NG+E +A
Sbjct: 356 LGHYLRGLEYRMELMEVLSLTCDA--NGSENVA 386
>AT2G14910.2 | Symbols: | unknown protein; LOCATED IN: chloroplast;
EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G14970.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr2:6407195-6409015 REVERSE LENGTH=366
Length = 366
Score = 370 bits (951), Expect = e-103, Method: Compositional matrix adjust.
Identities = 201/297 (67%), Positives = 231/297 (77%), Gaps = 14/297 (4%)
Query: 53 SFGDFT-----SKPKKSVLRELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSD 107
S DFT PKK VL +LIQEIEPLDVS IQKDVP TT DAMKRTISGMLGLLPSD
Sbjct: 63 SLDDFTLHSDSRSPKKCVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGMLGLLPSD 122
Query: 108 QFHVVIEALWEPLSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQ 167
+F V IE+LWEPLSKLL+SSMMTGYTLRNAEYRL LEKNLDM L+ +E+T D++
Sbjct: 123 RFQVHIESLWEPLSKLLVSSMMTGYTLRNAEYRLFLEKNLDMSGGGLDSHASENTEYDME 182
Query: 168 GLLHDSVNVIDFGRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELH 227
G D +V S S+ + + E +D + LG +SSEAQ+YIL LQS+LSS+KKEL
Sbjct: 183 GTFPDEDHV-----SSKRDSRTQNLSETIDEEGLGRVSSEAQEYILRLQSQLSSVKKELQ 237
Query: 228 EVKRKNAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATL 287
E++RKNAALQMQQFVGEEKNDLLDYLRSLQPE+VA+LSE +PE+KE I SVVHGLLATL
Sbjct: 238 EMRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLLATL 297
Query: 288 SPKMHSK-PSTMSENATIGTANAGS-EDCAEVVENSSIQFHPVISLTRDYLARLLFW 342
SPKMHSK P+ SE T A S EDCAE+VEN+S+QF P+ISLTRDYLARLLFW
Sbjct: 298 SPKMHSKFPA--SEVPPTETVKAKSDEDCAELVENTSLQFQPLISLTRDYLARLLFW 352
>AT5G14970.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G14910.1); Has 579 Blast hits to 397 proteins
in 95 species: Archae - 0; Bacteria - 294; Metazoa - 0;
Fungi - 0; Plants - 86; Viruses - 0; Other Eukaryotes -
199 (source: NCBI BLink). | chr5:4847368-4848760 FORWARD
LENGTH=355
Length = 355
Score = 199 bits (506), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 128/307 (41%), Positives = 179/307 (58%), Gaps = 35/307 (11%)
Query: 64 SVLRELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKL 123
+V+ L+ I+PLD S I K + + D+MK+TIS MLGLLPSDQF V + +PL +L
Sbjct: 78 AVIANLVNRIQPLDTSVISKGLSDSAKDSMKQTISSMLGLLPSDQFSVSVTISEQPLYRL 137
Query: 124 LISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLHD--SVNVIDFGR 181
LISS++TGYTL NAEYR+ L +N D+ P+D + D S + + FG
Sbjct: 138 LISSIITGYTLWNAEYRVSLRRNFDI-------------PIDPRKEEEDQSSKDNVRFGS 184
Query: 182 KSNLSSKVEKVHEDVDI---QDLGEISSEAQQYILNLQSRLSSMKKELHEVKRKNAALQM 238
+ +S + E+ + Q G++S EA YI LQS LSSMK+EL K+K AL++
Sbjct: 185 EKGMSEDLGNCVEEFERLSPQVFGDLSPEALSYIQLLQSELSSMKEELDSQKKK--ALRI 242
Query: 239 QQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTM 298
+ G +NDLLDYLRSL PE V +LS+ +SPE++EI+ +V +L L + S
Sbjct: 243 ECEKG-NRNDLLDYLRSLDPEMVTELSQLSSPEVEEIVNQLVQNVLERLFEDQTT--SNF 299
Query: 299 SENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVD 358
+N I T G +V +RDYLA+LLFWCMLLGH+LRGLE R+
Sbjct: 300 MQNPGIRTTEGGDGTGRKV------------DTSRDYLAKLLFWCMLLGHHLRGLENRLH 347
Query: 359 LTELLSL 365
L+ ++ L
Sbjct: 348 LSCVVGL 354
>AT1G63610.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT2G14910.1); Has 537 Blast
hits to 411 proteins in 100 species: Archae - 0;
Bacteria - 231; Metazoa - 0; Fungi - 0; Plants - 94;
Viruses - 0; Other Eukaryotes - 212 (source: NCBI
BLink). | chr1:23583675-23585599 REVERSE LENGTH=340
Length = 340
Score = 103 bits (257), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/314 (26%), Positives = 153/314 (48%), Gaps = 61/314 (19%)
Query: 51 TPSFGDFTSKPKKSVLRELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFH 110
TP G K ++ +L E +Q ++P + K P +AM++T++ M+G LP F
Sbjct: 74 TPPNGTRQPKSRRDILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQTVTNMIGTLPPQFFA 133
Query: 111 VVIEALWEPLSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLL 170
V + ++ E L++L++S +MTGY RNA+YRL L+++L+ A P D +G
Sbjct: 134 VTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLEQV--------ALPEPRDQKGGD 185
Query: 171 HDSVNVIDFGRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELHEVK 230
D G + N+S +V + + + + +A++YI L++ + + + +V
Sbjct: 186 EDYAP----GTQKNVSGEVIRWNNVSGPEKI-----DAKKYIELLEAEIEELNR---QVG 233
Query: 231 RKNAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPK 290
RK+A ++N++L+YL+SL+P+ + +L+ ++ + + V LLA
Sbjct: 234 RKSA---------NQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAV---- 280
Query: 291 MHSKPSTMSENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYL 350
S P+ M N T E S+ LA+LL+W M++G+ +
Sbjct: 281 --SDPNQMKTNVT---------------ETSAAD-----------LAKLLYWLMVVGYSI 312
Query: 351 RGLEYRVDLTELLS 364
R +E R D+ +L
Sbjct: 313 RNIEVRFDMERVLG 326
>AT1G63610.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT2G14910.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr1:23583675-23585599 REVERSE LENGTH=341
Length = 341
Score = 100 bits (248), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 150/305 (49%), Gaps = 61/305 (20%)
Query: 60 KPKKSVLRELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEP 119
K ++ +L E +Q ++P + K P +AM++T++ M+G LP F V + ++ E
Sbjct: 84 KSRRDILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQTVTNMIGTLPPQFFAVTVTSVAEN 143
Query: 120 LSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLHDSVNVIDF 179
L++L++S +MTGY RNA+YRL L+++L+ A P D +G D
Sbjct: 144 LAQLMMSVLMTGYMFRNAQYRLELQQSLEQV--------ALPEPRDQKGGDEDYAP---- 191
Query: 180 GRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELHEVKRKNAALQMQ 239
G + N+S +V + + + + +A++YI L++ + + + +V RK+A
Sbjct: 192 GTQKNVSGEVIRWNNVSGPEKI-----DAKKYIELLEAEIEELNR---QVGRKSA----- 238
Query: 240 QFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMS 299
++N++L+YL+SL+P+ + +L+ ++ + + V LLA S P+ M
Sbjct: 239 ----NQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAV------SDPNQMK 288
Query: 300 ENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVDL 359
N T E S+ LA+LL+W M++G+ +R +E R D+
Sbjct: 289 TNVT---------------ETSAAD-----------LAKLLYWLMVVGYSIRNIEVRFDM 322
Query: 360 TELLS 364
+L
Sbjct: 323 ERVLG 327