Miyakogusa Predicted Gene
- Lj3g3v0349300.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0349300.1 Non Chatacterized Hit- tr|K4BLN0|K4BLN0_SOLLC
Uncharacterized protein OS=Solanum lycopersicum
GN=Sol,23.85,8e-19,seg,NULL; FAMILY NOT NAMED,NULL,CUFF.40576.1
(249 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G67550.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 145 2e-35
AT2G12400.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 94 1e-19
AT2G25270.1 | Symbols: | unknown protein; LOCATED IN: plasma me... 87 1e-17
AT1G80540.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 72 3e-13
AT1G71110.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 67 8e-12
>AT5G67550.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: flower; EXPRESSED DURING: 4
anthesis; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT1G71110.1); Has 161 Blast hits
to 154 proteins in 16 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 0; Plants - 161; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). |
chr5:26946908-26949112 REVERSE LENGTH=509
Length = 509
Score = 145 bits (367), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 77/261 (29%), Positives = 133/261 (50%), Gaps = 13/261 (4%)
Query: 1 MLVVLCWLTTVICWLFFGVYFFLEKFSNDACTALDNFQENPYNNSLSSILPCDELLSAKP 60
M++ LCW+ T +CW+ G FF+ F+ D C+A + F +NP N++L+++ PC + L +
Sbjct: 249 MVIFLCWIITTLCWVLTGFDFFIHTFAEDLCSAFNGFVQNPRNSTLTNLFPCMDPLHSDK 308
Query: 61 VLSDVSAGIYDLVNKVNANIS------ALQATS-----YPDLVQVCNPFSEPPNYFYQPE 109
L ++S I++ + ++N+ ++ AL S P+ +C+PF Y P+
Sbjct: 309 TLIEISLMIHNFITQLNSKVAESMRSNALTDRSNTVSWAPESGIICDPFVGQQINSYTPQ 368
Query: 110 NCPANTIRIGDIPKVLKAFTCLDAN-DGTCD-NGNLISSSEYVRVEAYTTSIQDLLNVYP 167
+C I IG+ P +L FTC D + TC G I + Y++V AY+ S Q +L++ P
Sbjct: 369 SCSNGAIPIGEFPNILSRFTCHDKDPPETCRITGKFIPEAAYLKVYAYSNSAQGMLDILP 428
Query: 168 SMEHLLECQIVKDAFSQVLSKQCKPMKKYARMVWVGMXXXXXXXXXXXXXWTIKARREHR 227
S ++L EC VKD S ++S QC P + +W + + KA +E
Sbjct: 429 SFQNLTECLAVKDTLSSIVSNQCDPFRASMYRLWASILALSLIMVVLVLLFLAKAFQEKG 488
Query: 228 YHLSDSSVEPLESRPSKEIEI 248
+ S+ P S +++ I
Sbjct: 489 KSFAWFSIHPTSSAEIRQVNI 509
>AT2G12400.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 25 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G25270.1);
Has 177 Blast hits to 172 proteins in 23 species: Archae
- 0; Bacteria - 2; Metazoa - 3; Fungi - 0; Plants - 164;
Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).
| chr2:5005144-5008140 REVERSE LENGTH=541
Length = 541
Score = 93.6 bits (231), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/258 (24%), Positives = 106/258 (41%), Gaps = 19/258 (7%)
Query: 2 LVVLCWLTTVICWLFFGVYFFLEKFSNDACTALDNFQENPY-NNSLSSILPCDELLSAKP 60
LV+L W+ + ++ G + L D C A+D + +NP + +L ILPC + +A+
Sbjct: 286 LVILGWILVTVTFVLCGGFLLLHNVVGDTCVAMDQWVQNPTAHTALDDILPCVDNATARE 345
Query: 61 VLSDVSAGIYDLVNKVNANISALQATSYPDLVQ-------------VCNPFSEPPNYFYQ 107
L+ Y LVN ++ IS + ++P + +CNPF N
Sbjct: 346 TLTRTKLVTYQLVNLLDNAISNMTNRNFPPQFRPLYYNQSGPLMPLLCNPF----NADLS 401
Query: 108 PENCPANTIRIGDIPKVLKAFTCLDANDGTCDNGNLISSSEYVRVEAYTTSIQDLLNVYP 167
C + + + +V K FTC GTC ++ Y ++ A L P
Sbjct: 402 DRQCQPGQVHLNNATEVWKNFTCQIVTPGTCSTPGRLTPKLYSQMAAAVNVSYGLYKYGP 461
Query: 168 SMEHLLECQIVKDAFSQVLSKQCKPMKKYARMVWVGMXXXXXXXXXXXXXWTIKAR-REH 226
+ L C V+ F+ + C +K+Y + ++VG+ W I AR R H
Sbjct: 462 FLADLQGCDFVRSTFTDIERDHCPGLKRYTQWIYVGLVVVSASVMSSLVFWVIYARERRH 521
Query: 227 RYHLSDSSVEPLESRPSK 244
R + D + E SK
Sbjct: 522 RVYTKDYNAMHSEDPRSK 539
>AT2G25270.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 18 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G12400.1);
Has 35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr2:10759779-10762358 FORWARD
LENGTH=545
Length = 545
Score = 86.7 bits (213), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 105/241 (43%), Gaps = 19/241 (7%)
Query: 2 LVVLCWLTTVICWLFFGVYFFLEKFSNDACTALDNFQENPYNNS-LSSILPCDELLSAKP 60
LV+L W+ ++ G + L + D C A+ + E P +N+ L ILPC + +A+
Sbjct: 291 LVILGWILVTGTFILSGTFLVLHNATADTCVAMSEWVERPSSNTALDEILPCTDNATAQE 350
Query: 61 VL---SDVSAGIYDLVNKVNANISALQAT----------SYPDLVQVCNPFSEPPNYFYQ 107
L +V+ + +L+N V N+S + + S P L +CNPF N+
Sbjct: 351 TLMRSREVTGQLVELINTVITNVSNINFSPVFVPMYYNQSGPLLPLLCNPF----NHDLT 406
Query: 108 PENCPANTIRIGDIPKVLKAFTCLDANDGTCDNGNLISSSEYVRVEAYTTSIQDLLNVYP 167
+C + + + + +F C + +GTC ++ + Y ++ + L+ P
Sbjct: 407 DRSCSPGDLDLNNATEAWTSFVCQVSQNGTCTTTGRLTPALYSQMASGVNISTGLIRDAP 466
Query: 168 SMEHLLECQIVKDAFSQVLSKQCKPMKKYARMVWVGMXXXXXXXXXXXXXWTIKAR-REH 226
+ L +C K F + + C +++Y V+VG+ W I +R R H
Sbjct: 467 FLVQLQDCSYAKQTFRDITNDHCPGLQRYGYWVYVGLAILATAVMLSLMFWIIYSRERRH 526
Query: 227 R 227
R
Sbjct: 527 R 527
>AT1G80540.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G12400.1); Has 175 Blast hits to 171 proteins
in 20 species: Archae - 0; Bacteria - 0; Metazoa - 2;
Fungi - 0; Plants - 171; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:30281638-30284258 REVERSE
LENGTH=538
Length = 538
Score = 72.4 bits (176), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/251 (20%), Positives = 97/251 (38%), Gaps = 19/251 (7%)
Query: 1 MLVVLCWLTTVICWLFFGVYFFLEKFSNDACTALDNFQENPYNNS-LSSILPC---DELL 56
+LV+L W+ L V+ D C A+D + +P +S LS +LPC +
Sbjct: 288 LLVILGWILVTATILLSAVFLVFHNVVADTCMAMDQWVHDPAADSALSQLLPCLDPKTIG 347
Query: 57 SAKPVLSDVSAGIYDLVNKVNANISA----------LQATSYPDLVQVCNPFSEPPNYFY 106
+ ++A D+ N N+S S P + +CNP + +
Sbjct: 348 ETLDITKTMTATAVDMTNAYTVNVSNHDQFPPNAPFYHNQSGPLVPLLCNPLDQN----H 403
Query: 107 QPENCPANTIRIGDIPKVLKAFTCLDANDGTCDNGNLISSSEYVRVEAYTTSIQDLLNVY 166
+P C + + + + +V K + C +G C ++ Y ++ L +
Sbjct: 404 KPRPCAPDEVLLANASQVYKGYICQVNAEGICTTQGRLTQGSYDQMMGAINVAFTLDHYG 463
Query: 167 PSMEHLLECQIVKDAFSQVLSKQCKPMKKYARMVWVGMXXXXXXXXXXXXXWTIKAR-RE 225
P + + +C V+D F + +K C + ++ ++ G+ W I R R
Sbjct: 464 PFLASIADCTFVRDTFRDITTKNCPGLSITSQWIYAGLASLSGAVMFSLIFWLIFVRERR 523
Query: 226 HRYHLSDSSVE 236
HR S ++
Sbjct: 524 HRSQTKKSMIQ 534
>AT1G71110.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT2G12400.1); Has 173 Blast hits
to 169 proteins in 21 species: Archae - 0; Bacteria - 0;
Metazoa - 3; Fungi - 0; Plants - 165; Viruses - 0; Other
Eukaryotes - 5 (source: NCBI BLink). |
chr1:26818244-26820852 FORWARD LENGTH=557
Length = 557
Score = 67.4 bits (163), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 53/240 (22%), Positives = 92/240 (38%), Gaps = 17/240 (7%)
Query: 1 MLVVLCWLTTVICWLFFGVYFFLEKFSNDACTALDNFQENPY-NNSLSSILPCDELLSAK 59
+ VV W+ + ++ GV+ L +D C A+ + +NP+ +LSSILPC + +
Sbjct: 286 IFVVSGWILVAVTFVLCGVFLILNNAISDTCVAMKEWVDNPHAETALSSILPCVDQQTTN 345
Query: 60 PVLSDVSAGIYDLVNKVNANISALQAT------------SYPDLVQVCNPFSEPPNYFYQ 107
LS I +V VN + A+ T S P + +C PF +
Sbjct: 346 QTLSQSKVVINSIVTVVNTFVYAVANTNPAPGQDRYYNQSGPPMPPLCIPFDAN----ME 401
Query: 108 PENCPANTIRIGDIPKVLKAFTCLDANDGTCDNGNLISSSEYVRVEAYTTSIQDLLNVYP 167
C + I + V + + C G C ++ + ++ A L + P
Sbjct: 402 DRQCSPWELSIENASSVWENYKCEVTPSGICTTVGRVTPDTFGQLVAAVNESYALEHYTP 461
Query: 168 SMEHLLECQIVKDAFSQVLSKQCKPMKKYARMVWVGMXXXXXXXXXXXXXWTIKARREHR 227
+ +C V++ F + S C P+ + R+V G+ W A R R
Sbjct: 462 PLLSFRDCNFVRETFMSITSDYCPPLVRNLRIVNAGLGLISVGVLLCLVLWIFYANRPQR 521