Miyakogusa Predicted Gene
- Lj5g3v2166200.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v2166200.1 Non Chatacterized Hit- tr|I1LEM9|I1LEM9_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.53973
PE,80.36,0,coiled-coil,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.56792.1
(558 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G71110.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 584 e-167
AT2G12400.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 363 e-100
AT2G25270.1 | Symbols: | unknown protein; LOCATED IN: plasma me... 338 4e-93
AT1G80540.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 257 1e-68
AT5G67550.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 95 2e-19
>AT1G71110.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT2G12400.1); Has 173 Blast hits
to 169 proteins in 21 species: Archae - 0; Bacteria - 0;
Metazoa - 3; Fungi - 0; Plants - 165; Viruses - 0; Other
Eukaryotes - 5 (source: NCBI BLink). |
chr1:26818244-26820852 FORWARD LENGTH=557
Length = 557
Score = 584 bits (1505), Expect = e-167, Method: Compositional matrix adjust.
Identities = 287/498 (57%), Positives = 351/498 (70%), Gaps = 8/498 (1%)
Query: 35 EHPVKFIIGEENLGPWRNQLTQVAPAPGPNAEDT----LVLAANRTKRPDILQGFRHYRG 90
+ P++ I+G N G W+ ++ APGP ++D L+LAA+RTKRPDIL+ F+ Y G
Sbjct: 31 QDPLRLILGSPNFGTWKGGISL---APGPESDDVVSDYLLLAAHRTKRPDILRAFKPYHG 87
Query: 91 GWDITNQHYWASVGFTGGAGFILAVLWFVSFGLALAIHLCCGWGINIKDKESS-HSQRIC 149
GW+ITN HYWASVGFTG GFILAV+W +SFG L ++ C W I K K SS ++RIC
Sbjct: 88 GWNITNNHYWASVGFTGAPGFILAVIWLLSFGSLLVVYHCFKWRICDKAKGSSFDTRRIC 147
Query: 150 LMLLIVFTFAATTGCILLFVGQDKFHGEALDTLHYFVNQSDYSVQILRNVTQYLSLAKTI 209
+LLIVFT A GCILL VGQDKFH EA+ TL Y VNQSDY+V+IL+NVTQYLSLAKTI
Sbjct: 148 FILLIVFTCVAAVGCILLSVGQDKFHTEAMHTLKYVVNQSDYTVEILQNVTQYLSLAKTI 207
Query: 210 HVTQILLPSDVMDDIDKLTVDLNSAADTLSEKTNENAVKFRRVFKDVRXXXXXXXXXXXX 269
+VTQI++PSDVM +IDKL V+LN+AA TL E T +NA K +RVF VR
Sbjct: 208 NVTQIVIPSDVMGEIDKLNVNLNTAAVTLGETTTDNAAKIKRVFYAVRSALITVATVMLI 267
Query: 270 XXXXXXXXXXXGYQHAILIFVITGWLLVATTFILCGVFMILNNAISDTCMAMGEWVENPH 329
+QH + IFV++GW+LVA TF+LCGVF+ILNNAISDTC+AM EWV+NPH
Sbjct: 268 LSFVGLLLSVLRHQHVVHIFVVSGWILVAVTFVLCGVFLILNNAISDTCVAMKEWVDNPH 327
Query: 330 RESSLSDVLPCVDQRTTNQTLIQSKQVVTNIAGVVNRFIYETANINATQGTPGYYNQSGP 389
E++LS +LPCVDQ+TTNQTL QSK V+ +I VVN F+Y AN N G YYNQSGP
Sbjct: 328 AETALSSILPCVDQQTTNQTLSQSKVVINSIVTVVNTFVYAVANTNPAPGQDRYYNQSGP 387
Query: 390 AMMPLCYPFDSQLQEHQCSDQAVSSANASMVWKNNECEVSESGICTTVGRVTPEIYAQLV 449
M PLC PFD+ +++ QCS +S NAS VW+N +CEV+ SGICTTVGRVTP+ + QLV
Sbjct: 388 PMPPLCIPFDANMEDRQCSPWELSIENASSVWENYKCEVTPSGICTTVGRVTPDTFGQLV 447
Query: 450 AAVNASYALEHYTPLLLSLQNCNFVRDAFTGITSSHCPPLNHYLKITNXXXXXXXXXXXX 509
AAVN SYALEHYTP LLS ++CNFVR+ F ITS +CPPL L+I N
Sbjct: 448 AAVNESYALEHYTPPLLSFRDCNFVRETFMSITSDYCPPLVRNLRIVNAGLGLISVGVLL 507
Query: 510 XXXXWILYANRPQRGEVF 527
WI YANRPQR EVF
Sbjct: 508 CLVLWIFYANRPQREEVF 525
>AT2G12400.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 25 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G25270.1);
Has 177 Blast hits to 172 proteins in 23 species: Archae
- 0; Bacteria - 2; Metazoa - 3; Fungi - 0; Plants - 164;
Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).
| chr2:5005144-5008140 REVERSE LENGTH=541
Length = 541
Score = 363 bits (932), Expect = e-100, Method: Compositional matrix adjust.
Identities = 192/489 (39%), Positives = 267/489 (54%), Gaps = 17/489 (3%)
Query: 50 WRNQLTQVAPAPGPNAEDTLVLAANRTKRPDILQGFRHYRGGWDITNQHYWASVGFTGGA 109
WR + + A +L+LAA RT+R D F+ Y GGW+I+N HY SVG+T
Sbjct: 46 WRTSVIERVIAEESGENSSLILAAKRTRRKDPADNFKLYTGGWNISNSHYLTSVGYTAAP 105
Query: 110 GFILAVLWFVSFGLALAI----HLCCGWGINIKDKESSHSQRIC----LMLLIVFTFAAT 161
I+A++WFV FGL+L++ + CC ++S R+ L+LLI FT AA
Sbjct: 106 FIIIALVWFVFFGLSLSLICLCYCCCA-------RQSYGYSRVAYALSLILLISFTIAAI 158
Query: 162 TGCILLFVGQDKFHGEALDTLHYFVNQSDYSVQILRNVTQYLSLAKTIHVTQILLPSDVM 221
GC+ L+ GQ KFH DTL Y V+Q++ + + LRNV+ YL+ AK + V +LP DV+
Sbjct: 159 IGCVFLYTGQGKFHASTTDTLDYVVSQANLTSENLRNVSDYLNAAKKVDVQSSILPQDVL 218
Query: 222 DDIDKLTVDLNSAADTLSEKTNENAVKFRRVFKDVRXXXXXXXXXXXXXXXXXXXXXXXG 281
ID + +NS+A TLS KT EN K + V +R G
Sbjct: 219 SSIDNIQGKINSSATTLSVKTMENQDKIQNVLDIMRLALVIIAAVMLFLAFIGFLLSIFG 278
Query: 282 YQHAILIFVITGWLLVATTFILCGVFMILNNAISDTCMAMGEWVENPHRESSLSDVLPCV 341
Q + VI GW+LV TF+LCG F++L+N + DTC+AM +WV+NP ++L D+LPCV
Sbjct: 279 LQCLVYTLVILGWILVTVTFVLCGGFLLLHNVVGDTCVAMDQWVQNPTAHTALDDILPCV 338
Query: 342 DQRTTNQTLIQSKQVVTNIAGVVNRFIYETANIN-ATQGTPGYYNQSGPAMMPLCYPFDS 400
D T +TL ++K V + +++ I N N Q P YYNQSGP M LC PF++
Sbjct: 339 DNATARETLTRTKLVTYQLVNLLDNAISNMTNRNFPPQFRPLYYNQSGPLMPLLCNPFNA 398
Query: 401 QLQEHQCSDQAVSSANASMVWKNNECEVSESGICTTVGRVTPEIYAQLVAAVNASYALEH 460
L + QC V NA+ VWKN C++ G C+T GR+TP++Y+Q+ AAVN SY L
Sbjct: 399 DLSDRQCQPGQVHLNNATEVWKNFTCQIVTPGTCSTPGRLTPKLYSQMAAAVNVSYGLYK 458
Query: 461 YTPLLLSLQNCNFVRDAFTGITSSHCPPLNHYLKITNXXXXXXXXXXXXXXXXWILYANR 520
Y P L LQ C+FVR FT I HCP L Y + W++YA R
Sbjct: 459 YGPFLADLQGCDFVRSTFTDIERDHCPGLKRYTQWIYVGLVVVSASVMSSLVFWVIYA-R 517
Query: 521 PQRGEVFVK 529
+R V+ K
Sbjct: 518 ERRHRVYTK 526
>AT2G25270.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 18 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G12400.1);
Has 35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr2:10759779-10762358 FORWARD
LENGTH=545
Length = 545
Score = 338 bits (868), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 186/518 (35%), Positives = 283/518 (54%), Gaps = 23/518 (4%)
Query: 31 TGSIEHPVKFIIGEENL-GP--WRN-QLTQVAPAPGPNAEDTLVLAANRTKRPDILQGFR 86
TGS+ +KFI+ E L GP + N Q+ +VA ++ LAA RT R D L GF
Sbjct: 40 TGSV---MKFIVAEAPLLGPAGFNNPQVIEVA---------SVALAAQRTYRKDPLNGFE 87
Query: 87 HYRGGWDITNQHYWASVGFTGGAGFILAVLWFVSFGLALAIHLCCGWGINIKDKESSHSQ 146
Y GGW+I+NQHYWASV +T F+LA +WF+ FG+ L + C I + +S+
Sbjct: 88 KYTGGWNISNQHYWASVSYTAVPLFVLAAVWFLGFGICLLV--ICMCHICHRTNSVGYSK 145
Query: 147 R---ICLMLLIVFTFAATTGCILLFVGQDKFHGEALDTLHYFVNQSDYSVQILRNVTQYL 203
+ L+ L++FT A GC+LL+ GQ +++ +TL Y ++Q+D ++ LR ++ YL
Sbjct: 146 VAYVVSLIFLLIFTVIAIIGCVLLYSGQIRYNKSTTETLEYVMSQADSTISQLRAISDYL 205
Query: 204 SLAKTIHVTQILLPSDVMDDIDKLTVDLNSAADTLSEKTNENAVKFRRVFKDVRXXXXXX 263
+ AK V Q+LLP++V +ID++ V L+S+ T++EK+ ++ R VR
Sbjct: 206 ASAKQAAVLQVLLPANVQTEIDQIGVKLDSSVATITEKSTNSSNHIRHFLDSVRVALIVV 265
Query: 264 XXXXXXXXXXXXXXXXXGYQHAILIFVITGWLLVATTFILCGVFMILNNAISDTCMAMGE 323
G Q + VI GW+LV TFIL G F++L+NA +DTC+AM E
Sbjct: 266 SIVMLVVTFLGLVSSIFGMQVIVYTLVILGWILVTGTFILSGTFLVLHNATADTCVAMSE 325
Query: 324 WVENPHRESSLSDVLPCVDQRTTNQTLIQSKQVVTNIAGVVNRFIYETANINATQ-GTPG 382
WVE P ++L ++LPC D T +TL++S++V + ++N I +NIN + P
Sbjct: 326 WVERPSSNTALDEILPCTDNATAQETLMRSREVTGQLVELINTVITNVSNINFSPVFVPM 385
Query: 383 YYNQSGPAMMPLCYPFDSQLQEHQCSDQAVSSANASMVWKNNECEVSESGICTTVGRVTP 442
YYNQSGP + LC PF+ L + CS + NA+ W + C+VS++G CTT GR+TP
Sbjct: 386 YYNQSGPLLPLLCNPFNHDLTDRSCSPGDLDLNNATEAWTSFVCQVSQNGTCTTTGRLTP 445
Query: 443 EIYAQLVAAVNASYALEHYTPLLLSLQNCNFVRDAFTGITSSHCPPLNHYLKITNXXXXX 502
+Y+Q+ + VN S L P L+ LQ+C++ + F IT+ HCP L Y
Sbjct: 446 ALYSQMASGVNISTGLIRDAPFLVQLQDCSYAKQTFRDITNDHCPGLQRYGYWVYVGLAI 505
Query: 503 XXXXXXXXXXXWILYAN-RPQRGEVFVKLSLPEKIKNI 539
WI+Y+ R R E + S ++I +
Sbjct: 506 LATAVMLSLMFWIIYSRERRHRKEALPEFSESKEIVRV 543
>AT1G80540.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G12400.1); Has 175 Blast hits to 171 proteins
in 20 species: Archae - 0; Bacteria - 0; Metazoa - 2;
Fungi - 0; Plants - 171; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:30281638-30284258 REVERSE
LENGTH=538
Length = 538
Score = 257 bits (657), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 146/429 (34%), Positives = 222/429 (51%), Gaps = 8/429 (1%)
Query: 69 LVLAANRTKRPDILQGFRHYRGGWDITNQHYWASVGFTGGAGFILAVLWFVSFGLALAIH 128
LVLAA RT+RPD L F Y GW++TN HY ASVGF+ ++A+ WFV GL L
Sbjct: 64 LVLAAERTQRPDPLNHFNIYVDGWNVTNSHYIASVGFSAVPFIVIAIAWFVLLGLFLICS 123
Query: 129 LCCGWGINIKDKESSHSQRIC----LMLLIVFTFAATTGCILLFVGQDKFHGEALDTLHY 184
C + +S R+C L+ L++FT AA G +L+ GQ++F+G T Y
Sbjct: 124 CLCCCCCGCGRRNYGYS-RVCYTLSLVFLLLFTIAAVIGSAMLYTGQNEFYGSVERTFMY 182
Query: 185 FVNQSDYSVQILRNVTQYLSLAKTIHVT-QILLPSDVMDDIDKLTVDLNSAADTLSEKTN 243
V Q+ + L ++ + AK I + L P + +ID + + T ++
Sbjct: 183 IVKQATGVLTKLTSLWDSIQSAKDIQLDGHNLFPPEFRGNIDHFNNMIKMSNITYPDRVA 242
Query: 244 ENAVKFRR-VFKDVRXXXXXXXXXXXXXXXXXXXXXXXGYQHAILIFVITGWLLVATTFI 302
+++ VR G + + + VI GW+LV T +
Sbjct: 243 NQTIRYLTGALNPVRYVLNVIAGVMLAVAFLGLLFSFCGLRVLVYLLVILGWILVTATIL 302
Query: 303 LCGVFMILNNAISDTCMAMGEWVENPHRESSLSDVLPCVDQRTTNQTLIQSKQVVTNIAG 362
L VF++ +N ++DTCMAM +WV +P +S+LS +LPC+D +T +TL +K +
Sbjct: 303 LSAVFLVFHNVVADTCMAMDQWVHDPAADSALSQLLPCLDPKTIGETLDITKTMTATAVD 362
Query: 363 VVNRFIYETANINA-TQGTPGYYNQSGPAMMPLCYPFDSQLQEHQCSDQAVSSANASMVW 421
+ N + +N + P Y+NQSGP + LC P D + C+ V ANAS V+
Sbjct: 363 MTNAYTVNVSNHDQFPPNAPFYHNQSGPLVPLLCNPLDQNHKPRPCAPDEVLLANASQVY 422
Query: 422 KNNECEVSESGICTTVGRVTPEIYAQLVAAVNASYALEHYTPLLLSLQNCNFVRDAFTGI 481
K C+V+ GICTT GR+T Y Q++ A+N ++ L+HY P L S+ +C FVRD F I
Sbjct: 423 KGYICQVNAEGICTTQGRLTQGSYDQMMGAINVAFTLDHYGPFLASIADCTFVRDTFRDI 482
Query: 482 TSSHCPPLN 490
T+ +CP L+
Sbjct: 483 TTKNCPGLS 491
>AT5G67550.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: flower; EXPRESSED DURING: 4
anthesis; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT1G71110.1); Has 161 Blast hits
to 154 proteins in 16 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 0; Plants - 161; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). |
chr5:26946908-26949112 REVERSE LENGTH=509
Length = 509
Score = 94.7 bits (234), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 96/440 (21%), Positives = 166/440 (37%), Gaps = 46/440 (10%)
Query: 75 RTKRPDILQGFRHYRGGWDITNQHYWASVGFTGGAGFILAVLWFVSFGLALAIHLCCGWG 134
R KR D L FR+Y GG+++ N+HYWA+ FTG G+ +A G+ + + +C G
Sbjct: 36 RFKRRDPLNSFRYYDGGFNVRNKHYWAATAFTGIHGYAVA-------GVLIIVGICLGLY 88
Query: 135 INIKDKESSHSQRICLMLLIVFTF------------AATTGCILLFVGQDKFH----GEA 178
+ DK S L + TTG ++ + K E
Sbjct: 89 VAFSDKRRRVSSTRRRYLDRYYLPLFLLLLLFMFLSVVTTGIVIAANQRSKNRTEEMKET 148
Query: 179 LDTLHYFVNQSDYSVQILRNVTQYLSLAKTIHVTQILLPSDVMDDIDKLTVDLNSAADTL 238
+D VNQ+ +V + QYL L + T +L + T L + +
Sbjct: 149 IDKAGEDVNQNIRTVIVSLTKIQYLLLPYDQNTTHLL---------NVTTHRLGKGSRLI 199
Query: 239 SEKTNENAVKFRRVFKDVRXXXXXXXXXXXXXXXXXXXXXXXGYQHAILIFVITGWLLVA 298
+ K + ++ + W++
Sbjct: 200 QSFLHHKGRSIDLAIKISYVSHLMITSTNLFLLLLAFLPLLLHWHPGFIMVIFLCWIITT 259
Query: 299 TTFILCGVFMILNNAISDTCMAMGEWVENPHRESSLSDVLPCVDQRTTNQTLIQSKQVVT 358
++L G ++ D C A +V+NP R S+L+++ PC+D +++TLI+ ++
Sbjct: 260 LCWVLTGFDFFIHTFAEDLCSAFNGFVQNP-RNSTLTNLFPCMDPLHSDKTLIEISLMIH 318
Query: 359 NIAGVVNRFIYETANINA---TQGTPGYYNQSGPAMMPLCYPFDSQ----LQEHQCSDQA 411
N +N + E+ NA T + +SG +C PF Q CS+ A
Sbjct: 319 NFITQLNSKVAESMRSNALTDRSNTVSWAPESG----IICDPFVGQQINSYTPQSCSNGA 374
Query: 412 VSSANASMVWKNNECEVSE-SGICTTVGRVTPE-IYAQLVAAVNASYALEHYTPLLLSLQ 469
+ + C + C G+ PE Y ++ A N++ + P +L
Sbjct: 375 IPIGEFPNILSRFTCHDKDPPETCRITGKFIPEAAYLKVYAYSNSAQGMLDILPSFQNLT 434
Query: 470 NCNFVRDAFTGITSSHCPPL 489
C V+D + I S+ C P
Sbjct: 435 ECLAVKDTLSSIVSNQCDPF 454