Miyakogusa Predicted Gene

Lj1g3v1855350.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v1855350.1 Non Characterized Hit- tr|I1K8S0|I1K8S0_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.11385 PE,81.36,0,FAMILY
NOT NAMED,NULL,CUFF.28088.1
         (284 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G25270.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...   364   e-101
AT2G12400.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   343   1e-94
AT1G80540.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   281   3e-76
AT1G71110.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   235   2e-62
AT5G67550.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    67   2e-11
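
The hit table above can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself: it assumes each summary line has the layout "identifier, free-text description, bit score, E-value", and the names SUMMARY, LINE_RE, parse_evalue and parse_summary are hypothetical helpers introduced only for illustration. Legacy BLAST abbreviates an E-value of 1e-101 as "e-101", so the sketch restores the leading 1 before converting to a float.

```python
import re

# Summary lines copied verbatim from the hit table above.
SUMMARY = """\
AT2G25270.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...   364   e-101
AT2G12400.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   343   1e-94
AT1G80540.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   281   3e-76
AT1G71110.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   235   2e-62
AT5G67550.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    67   2e-11
"""

# Assumed layout: identifier, description, bit score, E-value (last two columns).
LINE_RE = re.compile(r"^(\S+)\s.*?\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")


def parse_evalue(text):
    """Convert an E-value string to float, expanding the 'e-101' shorthand."""
    return float("1" + text if text.startswith("e") else text)


def parse_summary(block):
    """Return a list of (hit_id, bit_score, e_value) tuples."""
    hits = []
    for line in block.splitlines():
        match = LINE_RE.match(line)
        if match:
            hit_id, score, evalue = match.groups()
            hits.append((hit_id, float(score), parse_evalue(evalue)))
    return hits


hits = parse_summary(SUMMARY)
for hit_id, score, evalue in hits:
    print(hit_id, score, evalue)
```

For anything beyond a quick look, a dedicated BLAST parser is likely a better choice than a regular expression, since the description column is truncated and free-form.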

>AT2G25270.1 | Symbols:  | unknown protein; LOCATED IN: plasma
           membrane; EXPRESSED IN: 18 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT2G12400.1);
           Has 35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr2:10759779-10762358 FORWARD
           LENGTH=545
          Length = 545

 Score =  364 bits (934), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 161/273 (58%), Positives = 213/273 (78%), Gaps = 4/273 (1%)

Query: 1   MVLFVATVFSIFGMQILVYILVFAGWFLVTGTLILCGCFLILHNVTADTCVAMDEWIQYP 60
           +V F+  V SIFGMQ++VY LV  GW LVTGT IL G FL+LHN TADTCVAM EW++ P
Sbjct: 271 VVTFLGLVSSIFGMQVIVYTLVILGWILVTGTFILSGTFLVLHNATADTCVAMSEWVERP 330

Query: 61  AANTALDDILPCVDNATAQETFLRSKEVTSELVNLVNQVITNVSNINFAPNFTPLFYNQS 120
           ++NTALD+ILPC DNATAQET +RS+EVT +LV L+N VITNVSNINF+P F P++YNQS
Sbjct: 331 SSNTALDEILPCTDNATAQETLMRSREVTGQLVELINTVITNVSNINFSPVFVPMYYNQS 390

Query: 121 GPVMPLLCNPFHPDMTDRLCDPGEVTLSNATQVYGNFVCQVSPSDICTTQGRLTPTFYNQ 180
           GP++PLLCNPF+ D+TDR C PG++ L+NAT+ + +FVCQVS +  CTT GRLTP  Y+Q
Sbjct: 391 GPLLPLLCNPFNHDLTDRSCSPGDLDLNNATEAWTSFVCQVSQNGTCTTTGRLTPALYSQ 450

Query: 181 ISTGINVGNALYSYAPSLVELQDCTFVRETFSDISREHCPGLRRYSKWIYAGLVMVSFAV 240
           +++G+N+   L   AP LV+LQDC++ ++TF DI+ +HCPGL+RY  W+Y GL +++ AV
Sbjct: 451 MASGVNISTGLIRDAPFLVQLQDCSYAKQTFRDITNDHCPGLQRYGYWVYVGLAILATAV 510

Query: 241 MFSLIFWVVYGRERRYRLYT----KETKELTPV 269
           M SL+FW++Y RERR+R        E+KE+  V
Sbjct: 511 MLSLMFWIIYSRERRHRKEALPEFSESKEIVRV 543


>AT2G12400.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 25 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT2G25270.1);
           Has 177 Blast hits to 172 proteins in 23 species: Archae
           - 0; Bacteria - 2; Metazoa - 3; Fungi - 0; Plants - 164;
           Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).
           | chr2:5005144-5008140 REVERSE LENGTH=541
          Length = 541

 Score =  343 bits (879), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 159/259 (61%), Positives = 203/259 (78%)

Query: 4   FVATVFSIFGMQILVYILVFAGWFLVTGTLILCGCFLILHNVTADTCVAMDEWIQYPAAN 63
           F+  + SIFG+Q LVY LV  GW LVT T +LCG FL+LHNV  DTCVAMD+W+Q P A+
Sbjct: 269 FIGFLLSIFGLQCLVYTLVILGWILVTVTFVLCGGFLLLHNVVGDTCVAMDQWVQNPTAH 328

Query: 64  TALDDILPCVDNATAQETFLRSKEVTSELVNLVNQVITNVSNINFAPNFTPLFYNQSGPV 123
           TALDDILPCVDNATA+ET  R+K VT +LVNL++  I+N++N NF P F PL+YNQSGP+
Sbjct: 329 TALDDILPCVDNATARETLTRTKLVTYQLVNLLDNAISNMTNRNFPPQFRPLYYNQSGPL 388

Query: 124 MPLLCNPFHPDMTDRLCDPGEVTLSNATQVYGNFVCQVSPSDICTTQGRLTPTFYNQIST 183
           MPLLCNPF+ D++DR C PG+V L+NAT+V+ NF CQ+     C+T GRLTP  Y+Q++ 
Sbjct: 389 MPLLCNPFNADLSDRQCQPGQVHLNNATEVWKNFTCQIVTPGTCSTPGRLTPKLYSQMAA 448

Query: 184 GINVGNALYSYAPSLVELQDCTFVRETFSDISREHCPGLRRYSKWIYAGLVMVSFAVMFS 243
            +NV   LY Y P L +LQ C FVR TF+DI R+HCPGL+RY++WIY GLV+VS +VM S
Sbjct: 449 AVNVSYGLYKYGPFLADLQGCDFVRSTFTDIERDHCPGLKRYTQWIYVGLVVVSASVMSS 508

Query: 244 LIFWVVYGRERRYRLYTKE 262
           L+FWV+Y RERR+R+YTK+
Sbjct: 509 LVFWVIYARERRHRVYTKD 527


>AT1G80540.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G12400.1); Has 175 Blast hits to 171 proteins
           in 20 species: Archae - 0; Bacteria - 0; Metazoa - 2;
           Fungi - 0; Plants - 171; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr1:30281638-30284258 REVERSE
           LENGTH=538
          Length = 538

 Score =  281 bits (720), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 133/262 (50%), Positives = 180/262 (68%)

Query: 2   VLFVATVFSIFGMQILVYILVFAGWFLVTGTLILCGCFLILHNVTADTCVAMDEWIQYPA 61
           V F+  +FS  G+++LVY+LV  GW LVT T++L   FL+ HNV ADTC+AMD+W+  PA
Sbjct: 270 VAFLGLLFSFCGLRVLVYLLVILGWILVTATILLSAVFLVFHNVVADTCMAMDQWVHDPA 329

Query: 62  ANTALDDILPCVDNATAQETFLRSKEVTSELVNLVNQVITNVSNINFAPNFTPLFYNQSG 121
           A++AL  +LPC+D  T  ET   +K +T+  V++ N    NVSN +  P   P ++NQSG
Sbjct: 330 ADSALSQLLPCLDPKTIGETLDITKTMTATAVDMTNAYTVNVSNHDQFPPNAPFYHNQSG 389

Query: 122 PVMPLLCNPFHPDMTDRLCDPGEVTLSNATQVYGNFVCQVSPSDICTTQGRLTPTFYNQI 181
           P++PLLCNP   +   R C P EV L+NA+QVY  ++CQV+   ICTTQGRLT   Y+Q+
Sbjct: 390 PLVPLLCNPLDQNHKPRPCAPDEVLLANASQVYKGYICQVNAEGICTTQGRLTQGSYDQM 449

Query: 182 STGINVGNALYSYAPSLVELQDCTFVRETFSDISREHCPGLRRYSKWIYAGLVMVSFAVM 241
              INV   L  Y P L  + DCTFVR+TF DI+ ++CPGL   S+WIYAGL  +S AVM
Sbjct: 450 MGAINVAFTLDHYGPFLASIADCTFVRDTFRDITTKNCPGLSITSQWIYAGLASLSGAVM 509

Query: 242 FSLIFWVVYGRERRYRLYTKET 263
           FSLIFW+++ RERR+R  TK++
Sbjct: 510 FSLIFWLIFVRERRHRSQTKKS 531


>AT1G71110.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT2G12400.1); Has 173 Blast hits
           to 169 proteins in 21 species: Archae - 0; Bacteria - 0;
           Metazoa - 3; Fungi - 0; Plants - 165; Viruses - 0; Other
           Eukaryotes - 5 (source: NCBI BLink). |
           chr1:26818244-26820852 FORWARD LENGTH=557
          Length = 557

 Score =  235 bits (600), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 116/257 (45%), Positives = 161/257 (62%), Gaps = 2/257 (0%)

Query: 4   FVATVFSIFGMQILVYILVFAGWFLVTGTLILCGCFLILHNVTADTCVAMDEWIQYPAAN 63
           FV  + S+   Q +V+I V +GW LV  T +LCG FLIL+N  +DTCVAM EW+  P A 
Sbjct: 270 FVGLLLSVLRHQHVVHIFVVSGWILVAVTFVLCGVFLILNNAISDTCVAMKEWVDNPHAE 329

Query: 64  TALDDILPCVDNATAQETFLRSKEVTSELVNLVNQVITNVSNINFAPNFTPLFYNQSGPV 123
           TAL  ILPCVD  T  +T  +SK V + +V +VN  +  V+N N AP     +YNQSGP 
Sbjct: 330 TALSSILPCVDQQTTNQTLSQSKVVINSIVTVVNTFVYAVANTNPAPG-QDRYYNQSGPP 388

Query: 124 MPLLCNPFHPDMTDRLCDPGEVTLSNATQVYGNFVCQVSPSDICTTQGRLTPTFYNQIST 183
           MP LC PF  +M DR C P E+++ NA+ V+ N+ C+V+PS ICTT GR+TP  + Q+  
Sbjct: 389 MPPLCIPFDANMEDRQCSPWELSIENASSVWENYKCEVTPSGICTTVGRVTPDTFGQLVA 448

Query: 184 GINVGNALYSYAPSLVELQDCTFVRETFSDISREHCPGLRRYSKWIYAGLVMVSFAVMFS 243
            +N   AL  Y P L+  +DC FVRETF  I+ ++CP L R  + + AGL ++S  V+  
Sbjct: 449 AVNESYALEHYTPPLLSFRDCNFVRETFMSITSDYCPPLVRNLRIVNAGLGLISVGVLLC 508

Query: 244 LIFWVVYG-RERRYRLY 259
           L+ W+ Y  R +R  ++
Sbjct: 509 LVLWIFYANRPQREEVF 525


>AT5G67550.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: flower; EXPRESSED DURING: 4
           anthesis; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT1G71110.1); Has 161 Blast hits
           to 154 proteins in 16 species: Archae - 0; Bacteria - 0;
           Metazoa - 0; Fungi - 0; Plants - 161; Viruses - 0; Other
           Eukaryotes - 0 (source: NCBI BLink). |
           chr5:26946908-26949112 REVERSE LENGTH=509
          Length = 509

 Score = 66.6 bits (161), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/235 (22%), Positives = 97/235 (41%), Gaps = 16/235 (6%)

Query: 20  ILVFAGWFLVTGTLILCGCFLILHNVTADTCVAMDEWIQYPAANTALDDILPCVDNATAQ 79
           +++F  W + T   +L G    +H    D C A + ++Q P  N+ L ++ PC+D   + 
Sbjct: 249 MVIFLCWIITTLCWVLTGFDFFIHTFAEDLCSAFNGFVQNP-RNSTLTNLFPCMDPLHSD 307

Query: 80  ETFLRSKEVTSELVNLVNQVITNVSNINFAPNFTPLFYNQS-GPVMPLLCNPFHPDM--- 135
           +T +   E++  + N + Q+ + V+    +   T      S  P   ++C+PF       
Sbjct: 308 KTLI---EISLMIHNFITQLNSKVAESMRSNALTDRSNTVSWAPESGIICDPFVGQQINS 364

Query: 136 -TDRLCDPGEVTLSNATQVYGNFVCQ-VSPSDICTTQGRLTP-TFYNQISTGINVGNALY 192
            T + C  G + +     +   F C    P + C   G+  P   Y ++    N    + 
Sbjct: 365 YTPQSCSNGAIPIGEFPNILSRFTCHDKDPPETCRITGKFIPEAAYLKVYAYSNSAQGML 424

Query: 193 SYAPSLVELQDCTFVRETFSDISREHCPGLRR--YSKW---IYAGLVMVSFAVMF 242
              PS   L +C  V++T S I    C   R   Y  W   +   L+MV   ++F
Sbjct: 425 DILPSFQNLTECLAVKDTLSSIVSNQCDPFRASMYRLWASILALSLIMVVLVLLF 479