Miyakogusa Predicted Gene

Lj0g3v0308479.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0308479.1 Non Chatacterized Hit- tr|I1LNY1|I1LNY1_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.18675 PE,84.91,0,
,CUFF.20847.1
         (472 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G30700.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   584   e-167
AT1G61900.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   275   3e-74
AT1G61900.3 | Symbols:  | unknown protein; BEST Arabidopsis thal...   273   2e-73
AT1G61900.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   270   1e-72

>AT2G30700.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G61900.1); Has 68 Blast hits to 67 proteins in
           13 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi
           - 0; Plants - 66; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr2:13082033-13084384 REVERSE
           LENGTH=480
          Length = 480

 Score =  584 bits (1506), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 307/463 (66%), Positives = 360/463 (77%), Gaps = 11/463 (2%)

Query: 1   MGCFQDVNCHRGSLCCQLILFFTWLPTFLDVTAQA---EPRRISSLVELAKEPTSGESGL 57
           MG  + V   +G L  + +LF  WL +F DV A     E    S+  ELA  P  G SG 
Sbjct: 8   MGSLETVCWLKGCLVYRFLLFIIWLSSFQDVAAHDKLNEHSSRSTTSELANPPGIGVSG- 66

Query: 58  FDPIEISPAVIPKFPHPSESF-PPMYPIFPSKYEPVLTGKCPVNFSESEISNMLDKTASD 116
             PI++SP+VIPK+  P+  + PPMYP FP  YEP LTGKCP +F    IS+++D  ASD
Sbjct: 67  --PIQVSPSVIPKYASPALPWTPPMYPTFPDTYEPKLTGKCPTDFQA--ISSVIDTAASD 122

Query: 117 CSAPLASLVGNVICCPQFSSLIHIFQGFISMKSNNLVLPNEAADPCFSDIISILASRGAN 176
           CS P A+LVGNVICCPQF SL+HIFQG  ++KSN LVLP+  A  CFSDI+SIL SR AN
Sbjct: 123 CSQPFAALVGNVICCPQFVSLLHIFQGQHNVKSNKLVLPDAVATDCFSDIVSILVSRRAN 182

Query: 177 SSIPTLCSVKSSNLTGGSCPVKDDSTFEKIVNTSKLLEACSTVDQLKECCRPICQHAITD 236
            +IP LCSV SSNLTGGSCPV D +TFEK+VN+SKLL+AC TVD LKECCRPICQ AI +
Sbjct: 183 MTIPALCSVTSSNLTGGSCPVTDVTTFEKVVNSSKLLDACRTVDPLKECCRPICQPAIME 242

Query: 237 AALQISGRQMMINNNENVAQEVNYTDYLNDCKGVVYSYLSKKLSFEAANTAFRILSSCKV 296
           AAL ISG QM + +   +A   N  + +NDCK VV+SYLS+KL  + AN AFRILSSCKV
Sbjct: 243 AALIISGHQMTVGDKIPLAGS-NNVNAINDCKNVVFSYLSRKLPADKANAAFRILSSCKV 301

Query: 297 NKVCPLTFKEPSEVIAACRNVAAPSPSCCSSLNTYIAGIQKQMLITNKQAIICASLFGSM 356
           NK CPL FKEP+EVI ACRNVAAPSPSCCSSLN YI+GIQ QMLITNKQAI+CA++ GSM
Sbjct: 302 NKACPLEFKEPTEVIKACRNVAAPSPSCCSSLNAYISGIQNQMLITNKQAIVCATVIGSM 361

Query: 357 LRVGGVMTNIYELCDVDLKDFSIQAYG-QQGCLLRSLPADVVFDNSSGFSFTCDLSDNIA 415
           LR GGVMTNIYELCDVDLKDFS+QAYG QQGCLLRS PAD++FDN+SG+SFTCDL+DNIA
Sbjct: 362 LRKGGVMTNIYELCDVDLKDFSVQAYGMQQGCLLRSYPADLIFDNTSGYSFTCDLTDNIA 421

Query: 416 APWPSSTSITSMSLCAPEMSLPALPTSQTLKNIGCNSAGVGFL 458
           APWPSS+S++S+SLCAPEMSLPALPTSQT+KN G  + GVG L
Sbjct: 422 APWPSSSSMSSLSLCAPEMSLPALPTSQTIKNHGFCNGGVGAL 464


>AT1G61900.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: anchored to
           plasma membrane, plasma membrane, anchored to membrane;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G30700.1); Has 65 Blast
           hits to 65 proteins in 12 species: Archae - 0; Bacteria
           - 0; Metazoa - 0; Fungi - 0; Plants - 65; Viruses - 0;
           Other Eukaryotes - 0 (source: NCBI BLink). |
           chr1:22882508-22884722 REVERSE LENGTH=433
          Length = 433

 Score =  275 bits (704), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 160/400 (40%), Positives = 231/400 (57%), Gaps = 14/400 (3%)

Query: 62  EISPAVIPKFPHPSESFPPMYPIFPSKYEPVLTGKCPVNFSESEISNMLDKTASDCSAPL 121
           EISP   P+   P  +  PM P   S   P L+G C +NFS SE  +++  T+ +C    
Sbjct: 35  EISPDTSPQPFLPFIAPSPMVPYINSTM-PKLSGLCSLNFSASE--SLIQTTSHNCWTVF 91

Query: 122 ASLVGNVICCPQFSSLIHIFQGFISMKSNNLVLPNEAADPCFSDIISILASRGANSSIPT 181
           A L+ NV+CCPQ  + + I  G  S ++  L L    +  C SD+  IL  +GA+  +  
Sbjct: 92  APLLANVMCCPQLDATLTIILGKASKETGLLALNRTQSKHCLSDLEQILVGKGASGQLNK 151

Query: 182 LCSVKSSNLTGGSCPVKDDSTFEKIVNTSKLLEACSTVDQLKECCRPICQHAITDAALQI 241
           +CS+ SSNLT  SCPV +   FE  V+T+KLL AC  +D +KECC   CQ+AI DAA  I
Sbjct: 152 ICSIHSSNLTSSSCPVINVDEFESTVDTAKLLLACEKIDPVKECCEEACQNAILDAATNI 211

Query: 242 SGRQMMINNNENVAQEVNYTDYLNDCKGVVYSYLSKKLSFEAANTAFRILSSCKVNKVCP 301
           S     +  +E +    + +D +NDCK VV  +L+ KL         R L++CK+N+VCP
Sbjct: 212 S-----LKASETL---TDNSDRINDCKNVVNRWLATKLDPSRVKETLRGLANCKINRVCP 263

Query: 302 LTFKEPSEVIAACRNVAAPSPSCCSSLNTYIAGIQKQMLITNKQAIICASLFGSMLRVGG 361
           L F     +   C N  +    CC ++ +Y++ +QKQ LITN QA+ CA+  G+ L+   
Sbjct: 264 LVFPHMKHIGGNCSNELSNQTGCCRAMESYVSHLQKQTLITNLQALDCATSLGTKLQKLN 323

Query: 362 VMTNIYELCDVDLKDFSIQAYGQQ-GCLLRSLPADVVFDNSSGFSFTCDLSDNIAAPWPS 420
           +  NI+ +C + LKDFS+Q   Q+ GCLL SLP+D +FD  +G SFTCDL+DNI APWPS
Sbjct: 324 ITKNIFSVCHISLKDFSLQVGNQESGCLLPSLPSDAIFDKDTGISFTCDLNDNIPAPWPS 383

Query: 421 STSITSMSLCAPEMSLPALPTSQTLKNIGCNSAGVGFLVI 460
           S+  ++ + C   + +PALP + +      +  GV  LVI
Sbjct: 384 SSLSSAST-CKKPVRIPALPAAAS-SQPRLHDEGVTRLVI 421


>AT1G61900.3 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G30700.1). | chr1:22882508-22884722 REVERSE
           LENGTH=429
          Length = 429

 Score =  273 bits (698), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 159/399 (39%), Positives = 230/399 (57%), Gaps = 16/399 (4%)

Query: 62  EISPAVIPKFPHPSESFPPMYPIFPSKYEPVLTGKCPVNFSESEISNMLDKTASDCSAPL 121
           EISP   P+   P  +  PM P   S   P L+G C +NFS SE  +++  T+ +C    
Sbjct: 35  EISPDTSPQPFLPFIAPSPMVPYINSTM-PKLSGLCSLNFSASE--SLIQTTSHNCWTVF 91

Query: 122 ASLVGNVICCPQFSSLIHIFQGFISMKSNNLVLPNEAADPCFSDIISILASRGANSSIPT 181
           A L+ NV+CCPQ  + + I  G  S ++  L L    +  C SD+  IL  +GA+  +  
Sbjct: 92  APLLANVMCCPQLDATLTIILGKASKETGLLALNRTQSKHCLSDLEQILVGKGASGQLNK 151

Query: 182 LCSVKSSNLTGGSCPVKDDSTFEKIVNTSKLLEACSTVDQLKECCRPICQHAITDAALQI 241
           +CS+ SSNLT  SCPV +   FE  V+T+KLL AC  +D +KECC   CQ+AI DAA  I
Sbjct: 152 ICSIHSSNLTSSSCPVINVDEFESTVDTAKLLLACEKIDPVKECCEEACQNAILDAATNI 211

Query: 242 SGRQMMINNNENVAQEVNYTDYLNDCKGVVYSYLSKKLSFEAANTAFRILSSCKVNKVCP 301
           S     +  +E +    + +D +NDCK VV  +L+ KL         R L++CK+N+VCP
Sbjct: 212 S-----LKASETL---TDNSDRINDCKNVVNRWLATKLDPSRVKETLRGLANCKINRVCP 263

Query: 302 LTFKEPSEVIAACRNVAAPSPSCCSSLNTYIAGIQKQMLITNKQAIICASLFGSMLRVGG 361
           L F     +   C N  +    CC ++ +Y++ +QKQ LITN QA+ CA+  G+ L+   
Sbjct: 264 LVFPHMKHIGGNCSNELSNQTGCCRAMESYVSHLQKQTLITNLQALDCATSLGTKLQKLN 323

Query: 362 VMTNIYELCDVDLKDFSIQAYGQQGCLLRSLPADVVFDNSSGFSFTCDLSDNIAAPWPSS 421
           +  NI+ +C + LKDFS+Q   + GCLL SLP+D +FD  +G SFTCDL+DNI APWPSS
Sbjct: 324 ITKNIFSVCHISLKDFSLQ---ESGCLLPSLPSDAIFDKDTGISFTCDLNDNIPAPWPSS 380

Query: 422 TSITSMSLCAPEMSLPALPTSQTLKNIGCNSAGVGFLVI 460
           +  ++ + C   + +PALP + +      +  GV  LVI
Sbjct: 381 SLSSAST-CKKPVRIPALPAAAS-SQPRLHDEGVTRLVI 417


>AT1G61900.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: anchored to
           plasma membrane, plasma membrane, anchored to membrane;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G30700.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr1:22882561-22884722 REVERSE LENGTH=413
          Length = 413

 Score =  270 bits (691), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 148/359 (41%), Positives = 210/359 (58%), Gaps = 12/359 (3%)

Query: 62  EISPAVIPKFPHPSESFPPMYPIFPSKYEPVLTGKCPVNFSESEISNMLDKTASDCSAPL 121
           EISP   P+   P  +  PM P   S   P L+G C +NFS SE  +++  T+ +C    
Sbjct: 35  EISPDTSPQPFLPFIAPSPMVPYINSTM-PKLSGLCSLNFSASE--SLIQTTSHNCWTVF 91

Query: 122 ASLVGNVICCPQFSSLIHIFQGFISMKSNNLVLPNEAADPCFSDIISILASRGANSSIPT 181
           A L+ NV+CCPQ  + + I  G  S ++  L L    +  C SD+  IL  +GA+  +  
Sbjct: 92  APLLANVMCCPQLDATLTIILGKASKETGLLALNRTQSKHCLSDLEQILVGKGASGQLNK 151

Query: 182 LCSVKSSNLTGGSCPVKDDSTFEKIVNTSKLLEACSTVDQLKECCRPICQHAITDAALQI 241
           +CS+ SSNLT  SCPV +   FE  V+T+KLL AC  +D +KECC   CQ+AI DAA  I
Sbjct: 152 ICSIHSSNLTSSSCPVINVDEFESTVDTAKLLLACEKIDPVKECCEEACQNAILDAATNI 211

Query: 242 SGRQMMINNNENVAQEVNYTDYLNDCKGVVYSYLSKKLSFEAANTAFRILSSCKVNKVCP 301
           S     +  +E +    + +D +NDCK VV  +L+ KL         R L++CK+N+VCP
Sbjct: 212 S-----LKASETL---TDNSDRINDCKNVVNRWLATKLDPSRVKETLRGLANCKINRVCP 263

Query: 302 LTFKEPSEVIAACRNVAAPSPSCCSSLNTYIAGIQKQMLITNKQAIICASLFGSMLRVGG 361
           L F     +   C N  +    CC ++ +Y++ +QKQ LITN QA+ CA+  G+ L+   
Sbjct: 264 LVFPHMKHIGGNCSNELSNQTGCCRAMESYVSHLQKQTLITNLQALDCATSLGTKLQKLN 323

Query: 362 VMTNIYELCDVDLKDFSIQAYGQQ-GCLLRSLPADVVFDNSSGFSFTCDLSDNIAAPWP 419
           +  NI+ +C + LKDFS+Q   Q+ GCLL SLP+D +FD  +G SFTCDL+DNI APWP
Sbjct: 324 ITKNIFSVCHISLKDFSLQVGNQESGCLLPSLPSDAIFDKDTGISFTCDLNDNIPAPWP 382