Miyakogusa Predicted Gene

Lj3g3v0966700.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v0966700.1 Non Chatacterized Hit- tr|I1N1C4|I1N1C4_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,94.18,0,SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NAMED,NULL; seg,NULL,CUFF.41964.1
         (361 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G70160.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   593   e-170
AT4G27020.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   516   e-147
AT5G54870.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   514   e-146

>AT1G70160.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 24 plant structures; EXPRESSED
           DURING: 15 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT4G27020.1);
           Has 108 Blast hits to 108 proteins in 20 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 89;
           Viruses - 0; Other Eukaryotes - 19 (source: NCBI BLink).
           | chr1:26420159-26422345 FORWARD LENGTH=523
          Length = 523

 Score =  593 bits (1529), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 281/361 (77%), Positives = 307/361 (85%)

Query: 1   MPSGMLGTLLSLIDVLPLFANNAWGQKANLDFLKKHMGATFEKRSQPWRATIDPADVHSG 60
           MPSGMLGTLLSLIDVLPLF+N AWGQ ANL FL KHMGATFEKRSQPWR+ I+P DVHSG
Sbjct: 163 MPSGMLGTLLSLIDVLPLFSNTAWGQNANLAFLTKHMGATFEKRSQPWRSMINPEDVHSG 222

Query: 61  DFLAVSKIRGRWGGFETLEKWVTGSFAGHTAVCLKDDMGNLWXXXXXXXXXXXXXXXXXX 120
           DFLAVSKIRGRWGGFETLEKWVTG+FAGHTAVCLKDD+GNLW                  
Sbjct: 223 DFLAVSKIRGRWGGFETLEKWVTGAFAGHTAVCLKDDLGNLWVGESGHENEKGEEIIVVI 282

Query: 121 XXXXXXXLALKDSSNPQIALLPLHPDLRAKFNSTAAWEYARSMSGKPYGYHNMIFSWIDT 180
                  L LKD+SNPQ+ALLPLHPD+RAKFN+TAAWEYARSM GKPYGYHNMIFSWIDT
Sbjct: 283 PWDEWWELTLKDNSNPQVALLPLHPDIRAKFNNTAAWEYARSMLGKPYGYHNMIFSWIDT 342

Query: 181 VADNYPPPLDAHLVISVMSMWTRLQPAYSANMWNEALNKRLGTEGLDLHDIIVETEKRGI 240
           + DNYPPPLDAHLVISVMSMWTR+QPAY+ANMWNEALNKRLGTE LDL+ I+ ET +RG+
Sbjct: 343 LGDNYPPPLDAHLVISVMSMWTRVQPAYAANMWNEALNKRLGTEDLDLYGILEETARRGM 402

Query: 241 PFDQLLTIPEQDEWVYSDGKSTTCVAFILSMYKEAGIFGPFSSSIQVTEFTIRDAYMLRL 300
            FD+LLTIPEQDEWVYSDGKSTTCVAFIL+MYK AGIF P +  IQVTEFTIRDAY L+L
Sbjct: 403 SFDELLTIPEQDEWVYSDGKSTTCVAFILAMYKAAGIFDPLADHIQVTEFTIRDAYTLKL 462

Query: 301 FEDNQTRLPSWCNNENDRLPFCQILGEYRMELPGYNTLEPYAHMNEYCPSLPPTYDRPSR 360
           FE NQTRLPSWCN E  +L FCQILGEYRMELPGYNT+ PY +MN+ CPSLPP Y+RPS+
Sbjct: 463 FESNQTRLPSWCNTEEGKLDFCQILGEYRMELPGYNTIYPYPNMNQNCPSLPPNYERPSK 522

Query: 361 C 361
           C
Sbjct: 523 C 523


>AT4G27020.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; LOCATED IN: vacuole; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G54870.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr4:13568604-13571381
           REVERSE LENGTH=523
          Length = 523

 Score =  516 bits (1329), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 240/363 (66%), Positives = 280/363 (77%), Gaps = 2/363 (0%)

Query: 1   MPSGMLGTLLSLIDVLPLFANNAWGQKANLDFLKKHMGATFEKRSQPWRATIDPADVHSG 60
           M +GMLGTL +L DV PLF N  WG+ +N+ FLK HMGA F  R +PW   I   ++HSG
Sbjct: 161 MEAGMLGTLRALWDVFPLFTNTGWGENSNIAFLKNHMGANFYPRPKPWVTNITTDEIHSG 220

Query: 61  DFLAVSKIRGRWGGFETLEKWVTGSFAGHTAVCLKDDMGNLWXXXXXXXXXXXXXXXXXX 120
           D LA+SKIRGRWGGFETLEKWV+G++AGHTAVCL+D  G LW                  
Sbjct: 221 DLLAISKIRGRWGGFETLEKWVSGAYAGHTAVCLRDSEGKLWVGESGNENEKGEDVIAIL 280

Query: 121 XXXXXXXL-ALKDSSNPQIALLPLHPDLRAKFNSTAAWEYARSMSGKPYGYHNMIFSWID 179
                      KD SNP IALLPLHPD RAKFN TAAWEYARSM GKPYGYHN+IFSWID
Sbjct: 281 PWEEWWEFEQTKDDSNPHIALLPLHPDYRAKFNVTAAWEYARSMDGKPYGYHNLIFSWID 340

Query: 180 TVADNYPPPLDAHLVISVMSMWTRLQPAYSANMWNEALNKRLGTEGLDLHDIIVETEKRG 239
           T++ NYPPPLDA LV SVM++W+++QP Y+ANMWNEALNKRLGTEGLDL D++VE EKRG
Sbjct: 341 TISGNYPPPLDAQLVASVMTVWSKIQPDYAANMWNEALNKRLGTEGLDLPDVLVEVEKRG 400

Query: 240 IPFDQLLTIPEQDEWVYSDGKSTTCVAFILSMYKEAGIFGPFSSSIQVTEFTIRDAYMLR 299
             FD+LL +PEQD+W+YSDGKST+C+AFIL MYKEAG+F P SSSIQVTEFTI+DAYML+
Sbjct: 401 SSFDELLAVPEQDDWIYSDGKSTSCIAFILEMYKEAGLFDPISSSIQVTEFTIKDAYMLK 460

Query: 300 LFEDNQTRLPSWCN-NENDRLPFCQILGEYRMELPGYNTLEPYAHMNEYCPSLPPTYDRP 358
            FE N +R P WCN N+  +LP+CQILG+YRMELPGYNT+EPY HMNE+CPSLPP Y RP
Sbjct: 461 FFESNASRFPKWCNDNDVVKLPYCQILGKYRMELPGYNTMEPYPHMNEHCPSLPPKYHRP 520

Query: 359 SRC 361
             C
Sbjct: 521 KNC 523


>AT5G54870.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: vacuole;
           EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT4G27020.1); Has 1807 Blast
           hits to 1807 proteins in 277 species: Archae - 0;
           Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
           Viruses - 0; Other Eukaryotes - 339 (source: NCBI
           BLink). | chr5:22289149-22291604 FORWARD LENGTH=531
          Length = 531

 Score =  514 bits (1323), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 236/363 (65%), Positives = 286/363 (78%), Gaps = 2/363 (0%)

Query: 1   MPSGMLGTLLSLIDVLPLFANNAWGQKANLDFLKKHMGATFEKRSQPWRATIDPADVHSG 60
           M +GMLGTL +L DV PLF+N  WG+ +NL FL+KHMGA FE R +PW   +    + SG
Sbjct: 169 MHAGMLGTLQALWDVFPLFSNTGWGESSNLAFLEKHMGANFEPRPEPWVTNVTTDQIQSG 228

Query: 61  DFLAVSKIRGRWGGFETLEKWVTGSFAGHTAVCLKDDMGNLWXXXXXXXXXXXXXXXXXX 120
           D LA+SKIRGRWGGFETLEKWV+G++AGH+AV L+D  G LW                  
Sbjct: 229 DLLAISKIRGRWGGFETLEKWVSGAYAGHSAVALRDSEGKLWVGESGNENDKGEDVIAIL 288

Query: 121 XXXXXXXL-ALKDSSNPQIALLPLHPDLRAKFNSTAAWEYARSMSGKPYGYHNMIFSWID 179
                      KD SNPQIALLPLHPD+RAKF+  AAW+YARSM GKPYGYHN+IFSWID
Sbjct: 289 PWEEWWAFEQTKDDSNPQIALLPLHPDVRAKFDVAAAWKYARSMEGKPYGYHNLIFSWID 348

Query: 180 TVADNYPPPLDAHLVISVMSMWTRLQPAYSANMWNEALNKRLGTEGLDLHDIIVETEKRG 239
           TV++NYPPPLDAHLV S M++W+++QP Y+ANMWNEALNKRLGTEGLDL D++VE EKRG
Sbjct: 349 TVSENYPPPLDAHLVASFMTVWSQMQPEYAANMWNEALNKRLGTEGLDLSDVLVEVEKRG 408

Query: 240 IPFDQLLTIPEQDEWVYSDGKSTTCVAFILSMYKEAGIFGPFSSSIQVTEFTIRDAYMLR 299
             FD+LL +PE D+W+YSDGKST+C+AFIL MYKEAG+FGP +SSIQVTEFTI+DAYML 
Sbjct: 409 SSFDKLLAVPELDDWIYSDGKSTSCIAFILEMYKEAGLFGPLASSIQVTEFTIKDAYMLN 468

Query: 300 LFEDNQTRLPSWCN-NENDRLPFCQILGEYRMELPGYNTLEPYAHMNEYCPSLPPTYDRP 358
            FE+N +RLP+WCN N++ +LP+CQILG+YRMELPGYNT+EPY+HMNE CP+LPP Y+RP
Sbjct: 469 FFENNASRLPTWCNDNDSVKLPYCQILGKYRMELPGYNTMEPYSHMNEQCPTLPPKYNRP 528

Query: 359 SRC 361
             C
Sbjct: 529 DNC 531