Miyakogusa Predicted Gene
- Lj1g3v1855350.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v1855350.1 Non Chatacterized Hit- tr|I1K8S0|I1K8S0_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.11385 PE,81.36,0,FAMILY
NOT NAMED,NULL,CUFF.28088.1
(284 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G25270.1 | Symbols: | unknown protein; LOCATED IN: plasma me... 364 e-101
AT2G12400.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 343 1e-94
AT1G80540.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 281 3e-76
AT1G71110.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 235 2e-62
AT5G67550.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 67 2e-11
>AT2G25270.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 18 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G12400.1);
Has 35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr2:10759779-10762358 FORWARD
LENGTH=545
Length = 545
Score = 364 bits (934), Expect = e-101, Method: Compositional matrix adjust.
Identities = 161/273 (58%), Positives = 213/273 (78%), Gaps = 4/273 (1%)
Query: 1 MVLFVATVFSIFGMQILVYILVFAGWFLVTGTLILCGCFLILHNVTADTCVAMDEWIQYP 60
+V F+ V SIFGMQ++VY LV GW LVTGT IL G FL+LHN TADTCVAM EW++ P
Sbjct: 271 VVTFLGLVSSIFGMQVIVYTLVILGWILVTGTFILSGTFLVLHNATADTCVAMSEWVERP 330
Query: 61 AANTALDDILPCVDNATAQETFLRSKEVTSELVNLVNQVITNVSNINFAPNFTPLFYNQS 120
++NTALD+ILPC DNATAQET +RS+EVT +LV L+N VITNVSNINF+P F P++YNQS
Sbjct: 331 SSNTALDEILPCTDNATAQETLMRSREVTGQLVELINTVITNVSNINFSPVFVPMYYNQS 390
Query: 121 GPVMPLLCNPFHPDMTDRLCDPGEVTLSNATQVYGNFVCQVSPSDICTTQGRLTPTFYNQ 180
GP++PLLCNPF+ D+TDR C PG++ L+NAT+ + +FVCQVS + CTT GRLTP Y+Q
Sbjct: 391 GPLLPLLCNPFNHDLTDRSCSPGDLDLNNATEAWTSFVCQVSQNGTCTTTGRLTPALYSQ 450
Query: 181 ISTGINVGNALYSYAPSLVELQDCTFVRETFSDISREHCPGLRRYSKWIYAGLVMVSFAV 240
+++G+N+ L AP LV+LQDC++ ++TF DI+ +HCPGL+RY W+Y GL +++ AV
Sbjct: 451 MASGVNISTGLIRDAPFLVQLQDCSYAKQTFRDITNDHCPGLQRYGYWVYVGLAILATAV 510
Query: 241 MFSLIFWVVYGRERRYRLYT----KETKELTPV 269
M SL+FW++Y RERR+R E+KE+ V
Sbjct: 511 MLSLMFWIIYSRERRHRKEALPEFSESKEIVRV 543
>AT2G12400.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 25 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G25270.1);
Has 177 Blast hits to 172 proteins in 23 species: Archae
- 0; Bacteria - 2; Metazoa - 3; Fungi - 0; Plants - 164;
Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).
| chr2:5005144-5008140 REVERSE LENGTH=541
Length = 541
Score = 343 bits (879), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 159/259 (61%), Positives = 203/259 (78%)
Query: 4 FVATVFSIFGMQILVYILVFAGWFLVTGTLILCGCFLILHNVTADTCVAMDEWIQYPAAN 63
F+ + SIFG+Q LVY LV GW LVT T +LCG FL+LHNV DTCVAMD+W+Q P A+
Sbjct: 269 FIGFLLSIFGLQCLVYTLVILGWILVTVTFVLCGGFLLLHNVVGDTCVAMDQWVQNPTAH 328
Query: 64 TALDDILPCVDNATAQETFLRSKEVTSELVNLVNQVITNVSNINFAPNFTPLFYNQSGPV 123
TALDDILPCVDNATA+ET R+K VT +LVNL++ I+N++N NF P F PL+YNQSGP+
Sbjct: 329 TALDDILPCVDNATARETLTRTKLVTYQLVNLLDNAISNMTNRNFPPQFRPLYYNQSGPL 388
Query: 124 MPLLCNPFHPDMTDRLCDPGEVTLSNATQVYGNFVCQVSPSDICTTQGRLTPTFYNQIST 183
MPLLCNPF+ D++DR C PG+V L+NAT+V+ NF CQ+ C+T GRLTP Y+Q++
Sbjct: 389 MPLLCNPFNADLSDRQCQPGQVHLNNATEVWKNFTCQIVTPGTCSTPGRLTPKLYSQMAA 448
Query: 184 GINVGNALYSYAPSLVELQDCTFVRETFSDISREHCPGLRRYSKWIYAGLVMVSFAVMFS 243
+NV LY Y P L +LQ C FVR TF+DI R+HCPGL+RY++WIY GLV+VS +VM S
Sbjct: 449 AVNVSYGLYKYGPFLADLQGCDFVRSTFTDIERDHCPGLKRYTQWIYVGLVVVSASVMSS 508
Query: 244 LIFWVVYGRERRYRLYTKE 262
L+FWV+Y RERR+R+YTK+
Sbjct: 509 LVFWVIYARERRHRVYTKD 527
>AT1G80540.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G12400.1); Has 175 Blast hits to 171 proteins
in 20 species: Archae - 0; Bacteria - 0; Metazoa - 2;
Fungi - 0; Plants - 171; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:30281638-30284258 REVERSE
LENGTH=538
Length = 538
Score = 281 bits (720), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 133/262 (50%), Positives = 180/262 (68%)
Query: 2 VLFVATVFSIFGMQILVYILVFAGWFLVTGTLILCGCFLILHNVTADTCVAMDEWIQYPA 61
V F+ +FS G+++LVY+LV GW LVT T++L FL+ HNV ADTC+AMD+W+ PA
Sbjct: 270 VAFLGLLFSFCGLRVLVYLLVILGWILVTATILLSAVFLVFHNVVADTCMAMDQWVHDPA 329
Query: 62 ANTALDDILPCVDNATAQETFLRSKEVTSELVNLVNQVITNVSNINFAPNFTPLFYNQSG 121
A++AL +LPC+D T ET +K +T+ V++ N NVSN + P P ++NQSG
Sbjct: 330 ADSALSQLLPCLDPKTIGETLDITKTMTATAVDMTNAYTVNVSNHDQFPPNAPFYHNQSG 389
Query: 122 PVMPLLCNPFHPDMTDRLCDPGEVTLSNATQVYGNFVCQVSPSDICTTQGRLTPTFYNQI 181
P++PLLCNP + R C P EV L+NA+QVY ++CQV+ ICTTQGRLT Y+Q+
Sbjct: 390 PLVPLLCNPLDQNHKPRPCAPDEVLLANASQVYKGYICQVNAEGICTTQGRLTQGSYDQM 449
Query: 182 STGINVGNALYSYAPSLVELQDCTFVRETFSDISREHCPGLRRYSKWIYAGLVMVSFAVM 241
INV L Y P L + DCTFVR+TF DI+ ++CPGL S+WIYAGL +S AVM
Sbjct: 450 MGAINVAFTLDHYGPFLASIADCTFVRDTFRDITTKNCPGLSITSQWIYAGLASLSGAVM 509
Query: 242 FSLIFWVVYGRERRYRLYTKET 263
FSLIFW+++ RERR+R TK++
Sbjct: 510 FSLIFWLIFVRERRHRSQTKKS 531
>AT1G71110.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT2G12400.1); Has 173 Blast hits
to 169 proteins in 21 species: Archae - 0; Bacteria - 0;
Metazoa - 3; Fungi - 0; Plants - 165; Viruses - 0; Other
Eukaryotes - 5 (source: NCBI BLink). |
chr1:26818244-26820852 FORWARD LENGTH=557
Length = 557
Score = 235 bits (600), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 116/257 (45%), Positives = 161/257 (62%), Gaps = 2/257 (0%)
Query: 4 FVATVFSIFGMQILVYILVFAGWFLVTGTLILCGCFLILHNVTADTCVAMDEWIQYPAAN 63
FV + S+ Q +V+I V +GW LV T +LCG FLIL+N +DTCVAM EW+ P A
Sbjct: 270 FVGLLLSVLRHQHVVHIFVVSGWILVAVTFVLCGVFLILNNAISDTCVAMKEWVDNPHAE 329
Query: 64 TALDDILPCVDNATAQETFLRSKEVTSELVNLVNQVITNVSNINFAPNFTPLFYNQSGPV 123
TAL ILPCVD T +T +SK V + +V +VN + V+N N AP +YNQSGP
Sbjct: 330 TALSSILPCVDQQTTNQTLSQSKVVINSIVTVVNTFVYAVANTNPAPG-QDRYYNQSGPP 388
Query: 124 MPLLCNPFHPDMTDRLCDPGEVTLSNATQVYGNFVCQVSPSDICTTQGRLTPTFYNQIST 183
MP LC PF +M DR C P E+++ NA+ V+ N+ C+V+PS ICTT GR+TP + Q+
Sbjct: 389 MPPLCIPFDANMEDRQCSPWELSIENASSVWENYKCEVTPSGICTTVGRVTPDTFGQLVA 448
Query: 184 GINVGNALYSYAPSLVELQDCTFVRETFSDISREHCPGLRRYSKWIYAGLVMVSFAVMFS 243
+N AL Y P L+ +DC FVRETF I+ ++CP L R + + AGL ++S V+
Sbjct: 449 AVNESYALEHYTPPLLSFRDCNFVRETFMSITSDYCPPLVRNLRIVNAGLGLISVGVLLC 508
Query: 244 LIFWVVYG-RERRYRLY 259
L+ W+ Y R +R ++
Sbjct: 509 LVLWIFYANRPQREEVF 525
>AT5G67550.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: flower; EXPRESSED DURING: 4
anthesis; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT1G71110.1); Has 161 Blast hits
to 154 proteins in 16 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 0; Plants - 161; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). |
chr5:26946908-26949112 REVERSE LENGTH=509
Length = 509
Score = 66.6 bits (161), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/235 (22%), Positives = 97/235 (41%), Gaps = 16/235 (6%)
Query: 20 ILVFAGWFLVTGTLILCGCFLILHNVTADTCVAMDEWIQYPAANTALDDILPCVDNATAQ 79
+++F W + T +L G +H D C A + ++Q P N+ L ++ PC+D +
Sbjct: 249 MVIFLCWIITTLCWVLTGFDFFIHTFAEDLCSAFNGFVQNP-RNSTLTNLFPCMDPLHSD 307
Query: 80 ETFLRSKEVTSELVNLVNQVITNVSNINFAPNFTPLFYNQS-GPVMPLLCNPFHPDM--- 135
+T + E++ + N + Q+ + V+ + T S P ++C+PF
Sbjct: 308 KTLI---EISLMIHNFITQLNSKVAESMRSNALTDRSNTVSWAPESGIICDPFVGQQINS 364
Query: 136 -TDRLCDPGEVTLSNATQVYGNFVCQ-VSPSDICTTQGRLTP-TFYNQISTGINVGNALY 192
T + C G + + + F C P + C G+ P Y ++ N +
Sbjct: 365 YTPQSCSNGAIPIGEFPNILSRFTCHDKDPPETCRITGKFIPEAAYLKVYAYSNSAQGML 424
Query: 193 SYAPSLVELQDCTFVRETFSDISREHCPGLRR--YSKW---IYAGLVMVSFAVMF 242
PS L +C V++T S I C R Y W + L+MV ++F
Sbjct: 425 DILPSFQNLTECLAVKDTLSSIVSNQCDPFRASMYRLWASILALSLIMVVLVLLF 479