Miyakogusa Predicted Gene

Lj1g3v3438800.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v3438800.1 Non Chatacterized Hit- tr|A3AD48|A3AD48_ORYSJ
Putative uncharacterized protein OS=Oryza sativa
subsp,34.85,2e-17,seg,NULL,CUFF.30664.1
         (495 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G37960.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   267   1e-71
AT2G37960.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   267   1e-71
AT3G54060.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   220   2e-57
AT3G54060.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   220   2e-57

>AT2G37960.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT3G54060.2);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr2:15886962-15889180
           REVERSE LENGTH=480
          Length = 480

 Score =  267 bits (682), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 193/503 (38%), Positives = 269/503 (53%), Gaps = 58/503 (11%)

Query: 5   QSNSKPKKPETFGKGKVTPPQVAFIVDRYLADNNFSSTRSAFRIEASSLIAHSPVHEAPR 64
           +SN         G G+VTP QVAF+VDRYL DN FS TRS FR EASSLI++SPV E P 
Sbjct: 3   RSNRSKTSERIIGNGEVTPIQVAFLVDRYLCDNRFSKTRSLFRSEASSLISNSPVREVPN 62

Query: 65  SLLTLGQMLDEYICLKEQKVMVDQERAIVEQEKNRVQMLLQGMQNVMTAYNASGNLPLPA 124
           SLL L ++L+EYI LK++K+++DQE++ ++QEK RVQ LL GMQ+VM AYN+S     P 
Sbjct: 63  SLLPLNEILNEYIRLKKEKIVMDQEKSKLDQEKTRVQNLLNGMQDVMNAYNSSTAAAPPP 122

Query: 125 AKSAGAVVP---QKTFSNKPNSGVGVPTS------TQNRMNIQSLPPXXXXXXXXXXXXT 175
                +  P   Q   S    +  GV +S      TQN M + SLP             T
Sbjct: 123 PPVITSAAPMDKQVVASTSKQNNFGVSSSGCTVYNTQNAMTV-SLPGNKRVGNFTGPCIT 181

Query: 176 LLTNVSDRKRKDTKAVDAPLAAKKSRGRSSTRKIPSQGQNTLQQSNNVASIQMEAQPSAI 235
              +++ +++    +V AP     S  R   +KIP        Q+ N  + Q    PS +
Sbjct: 182 --QSITKKRKSPEVSVGAP-----SVSRKGMKKIP--------QAANYLTFQT---PSEM 223

Query: 236 RTSAEKCTPS-ESQVQGSNVAKCLFNQSSHAVSSNSPVPKTPPRGKSLQSDTNISPAEIS 294
           +T       + ES    S+VAKCLF++S  +  SNS  P+TP +  S QSD         
Sbjct: 224 QTPLNNGVATNESSDLTSSVAKCLFDKSGTSPPSNSTCPRTPQQKVSPQSD--------- 274

Query: 295 SVAPSNREAAPTRCTVISTRRVMVSPAKQMA--YIEMSHCI---SPVKTDSNRVNKRDHV 349
                 +E  PT CT+++  R+ VSP KQ+A   +E SH +   SPVK++    +KRDHV
Sbjct: 275 ------KEVTPTNCTIVTKERITVSPLKQIASYTVERSHTVSSFSPVKSNLKMSSKRDHV 328

Query: 350 RSRLDFDAVDIPESLNNPLPNEI---STSESEKDLDIFGIDFPNLDALGIDFSFSELLNY 406
           + RL+FD  +    L+ P   ++   S+S SE + D+F IDF N+D L  DFSFSELL  
Sbjct: 329 KGRLNFDDTEATMHLDAPATVDMVSTSSSGSEAEADLFDIDFSNIDLLSEDFSFSELLFD 388

Query: 407 LDFPCDCIDFSYNPTTSSPSEDNASGSYNECNSGHVTPD------ISNVAEVRCEKDMRI 460
            D  C+ +     P  S+   + ASGS  E  + ++ PD       S V E+   KDM  
Sbjct: 389 FDIGCEEMSNHSLPQPSNFHIETASGSSPESRNTNLEPDQVVSEYTSTVTEMIQGKDMNT 448

Query: 461 LGPDCLTAMKSVTKSITVFSPEK 483
            G D +T +KS+TK + + SP K
Sbjct: 449 QGSDSMTTVKSITKCLRILSPAK 471


>AT2G37960.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54060.2); Has 418 Blast hits to 247 proteins
           in 92 species: Archae - 0; Bacteria - 163; Metazoa - 49;
           Fungi - 80; Plants - 28; Viruses - 0; Other Eukaryotes -
           98 (source: NCBI BLink). | chr2:15886962-15889180
           REVERSE LENGTH=480
          Length = 480

 Score =  267 bits (682), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 193/503 (38%), Positives = 269/503 (53%), Gaps = 58/503 (11%)

Query: 5   QSNSKPKKPETFGKGKVTPPQVAFIVDRYLADNNFSSTRSAFRIEASSLIAHSPVHEAPR 64
           +SN         G G+VTP QVAF+VDRYL DN FS TRS FR EASSLI++SPV E P 
Sbjct: 3   RSNRSKTSERIIGNGEVTPIQVAFLVDRYLCDNRFSKTRSLFRSEASSLISNSPVREVPN 62

Query: 65  SLLTLGQMLDEYICLKEQKVMVDQERAIVEQEKNRVQMLLQGMQNVMTAYNASGNLPLPA 124
           SLL L ++L+EYI LK++K+++DQE++ ++QEK RVQ LL GMQ+VM AYN+S     P 
Sbjct: 63  SLLPLNEILNEYIRLKKEKIVMDQEKSKLDQEKTRVQNLLNGMQDVMNAYNSSTAAAPPP 122

Query: 125 AKSAGAVVP---QKTFSNKPNSGVGVPTS------TQNRMNIQSLPPXXXXXXXXXXXXT 175
                +  P   Q   S    +  GV +S      TQN M + SLP             T
Sbjct: 123 PPVITSAAPMDKQVVASTSKQNNFGVSSSGCTVYNTQNAMTV-SLPGNKRVGNFTGPCIT 181

Query: 176 LLTNVSDRKRKDTKAVDAPLAAKKSRGRSSTRKIPSQGQNTLQQSNNVASIQMEAQPSAI 235
              +++ +++    +V AP     S  R   +KIP        Q+ N  + Q    PS +
Sbjct: 182 --QSITKKRKSPEVSVGAP-----SVSRKGMKKIP--------QAANYLTFQT---PSEM 223

Query: 236 RTSAEKCTPS-ESQVQGSNVAKCLFNQSSHAVSSNSPVPKTPPRGKSLQSDTNISPAEIS 294
           +T       + ES    S+VAKCLF++S  +  SNS  P+TP +  S QSD         
Sbjct: 224 QTPLNNGVATNESSDLTSSVAKCLFDKSGTSPPSNSTCPRTPQQKVSPQSD--------- 274

Query: 295 SVAPSNREAAPTRCTVISTRRVMVSPAKQMA--YIEMSHCI---SPVKTDSNRVNKRDHV 349
                 +E  PT CT+++  R+ VSP KQ+A   +E SH +   SPVK++    +KRDHV
Sbjct: 275 ------KEVTPTNCTIVTKERITVSPLKQIASYTVERSHTVSSFSPVKSNLKMSSKRDHV 328

Query: 350 RSRLDFDAVDIPESLNNPLPNEI---STSESEKDLDIFGIDFPNLDALGIDFSFSELLNY 406
           + RL+FD  +    L+ P   ++   S+S SE + D+F IDF N+D L  DFSFSELL  
Sbjct: 329 KGRLNFDDTEATMHLDAPATVDMVSTSSSGSEAEADLFDIDFSNIDLLSEDFSFSELLFD 388

Query: 407 LDFPCDCIDFSYNPTTSSPSEDNASGSYNECNSGHVTPD------ISNVAEVRCEKDMRI 460
            D  C+ +     P  S+   + ASGS  E  + ++ PD       S V E+   KDM  
Sbjct: 389 FDIGCEEMSNHSLPQPSNFHIETASGSSPESRNTNLEPDQVVSEYTSTVTEMIQGKDMNT 448

Query: 461 LGPDCLTAMKSVTKSITVFSPEK 483
            G D +T +KS+TK + + SP K
Sbjct: 449 QGSDSMTTVKSITKCLRILSPAK 471


>AT3G54060.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G37960.2); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:20018902-20020826 REVERSE LENGTH=442
          Length = 442

 Score =  220 bits (560), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 161/413 (38%), Positives = 232/413 (56%), Gaps = 38/413 (9%)

Query: 8   SKPKKPETFGKGKVTPPQVAFIVDRYLADNNFSSTRSAFRIEASSLIAHSPVHEAPRSLL 67
           S+ K     GKG+VTP QVAFIVDRYL DN FS TR+ FR EASSLI+ SP+   P SL+
Sbjct: 5   SRSKGSNLIGKGEVTPTQVAFIVDRYLHDNRFSETRALFRSEASSLISDSPIRNVPNSLM 64

Query: 68  TLGQMLDEYICLKEQKVMVDQERAIVEQEKNRVQMLLQGMQNVMTAYNASGNLPLPAAKS 127
           TL  ML+ Y+ LK+QKV +DQE+  ++QEK RVQ LLQGM+NVM  YNAS   P PA+  
Sbjct: 65  TLDAMLNHYVSLKKQKVSLDQEKLKLDQEKIRVQNLLQGMENVMNTYNASLTAPPPAS-- 122

Query: 128 AGAVVPQKTFSNKPNSGVGVPTSTQNRMNIQSLPPXXXXXXXXXXXXTLLTNVSDRKRKD 187
             A   Q+   +  +SG+    +T N M++ SL              +   +++ +++  
Sbjct: 123 --APTSQQKNHSISSSGLS-QYNTLNGMSV-SLLGNKRVDFGNFSTPSTSQSITGKRKGP 178

Query: 188 TKAVDAPLAAKKSRGRSSTRKIPSQGQNTLQQSNNVASIQMEAQPSAIRTSAEKCTPSES 247
             +V AP  ++KSR   +T      G N L Q++  A+    ++  A+   A+    +E 
Sbjct: 179 EVSVTAPPVSRKSRITRAT------GTNKLPQADKAAN-NFTSETLAV---AKNSASNEL 228

Query: 248 QVQGSNVAKCLFNQSSHAVSSNSPVPKTPPRGKSLQSDTNISPAEISSVAPSNREAAPTR 307
              GS+V KCLFN++  +V ++S   +TP +  S  SD + S  +   V P+N     T 
Sbjct: 229 IGNGSSVVKCLFNKADSSVPTSSTCFRTPQKHASSGSDKSNSSQK--EVTPTN-----TN 281

Query: 308 CTVISTRRVMVSPAKQM-AY-IEMSHCI---SPVKTDSNRVNKRDHVRSRLDFDAVDIPE 362
           CT+++  R  +SP KQ+ +Y +E SH I   SPVK++    NKRDHV+ +L+FD  D   
Sbjct: 282 CTIVTKERFTISPLKQITSYSVERSHLISFSSPVKSNLKMSNKRDHVKGKLNFDDTDTET 341

Query: 363 SLNNPLPNEI---STSESEKDLDIFGIDFPNLDALGIDFSFSELLNYLDFPCD 412
            L  P   ++   S S SE ++D+F +DF NLD       FSELL   D  C+
Sbjct: 342 CLEAPATADLVSTSPSGSEPEVDLFDMDFSNLD-------FSELLVDFDLGCE 387


>AT3G54060.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G37960.2); Has 455 Blast hits to 322 proteins
           in 98 species: Archae - 0; Bacteria - 178; Metazoa - 88;
           Fungi - 75; Plants - 28; Viruses - 2; Other Eukaryotes -
           84 (source: NCBI BLink). | chr3:20018915-20020826
           REVERSE LENGTH=456
          Length = 456

 Score =  220 bits (560), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 161/413 (38%), Positives = 232/413 (56%), Gaps = 38/413 (9%)

Query: 8   SKPKKPETFGKGKVTPPQVAFIVDRYLADNNFSSTRSAFRIEASSLIAHSPVHEAPRSLL 67
           S+ K     GKG+VTP QVAFIVDRYL DN FS TR+ FR EASSLI+ SP+   P SL+
Sbjct: 5   SRSKGSNLIGKGEVTPTQVAFIVDRYLHDNRFSETRALFRSEASSLISDSPIRNVPNSLM 64

Query: 68  TLGQMLDEYICLKEQKVMVDQERAIVEQEKNRVQMLLQGMQNVMTAYNASGNLPLPAAKS 127
           TL  ML+ Y+ LK+QKV +DQE+  ++QEK RVQ LLQGM+NVM  YNAS   P PA+  
Sbjct: 65  TLDAMLNHYVSLKKQKVSLDQEKLKLDQEKIRVQNLLQGMENVMNTYNASLTAPPPAS-- 122

Query: 128 AGAVVPQKTFSNKPNSGVGVPTSTQNRMNIQSLPPXXXXXXXXXXXXTLLTNVSDRKRKD 187
             A   Q+   +  +SG+    +T N M++ SL              +   +++ +++  
Sbjct: 123 --APTSQQKNHSISSSGLS-QYNTLNGMSV-SLLGNKRVDFGNFSTPSTSQSITGKRKGP 178

Query: 188 TKAVDAPLAAKKSRGRSSTRKIPSQGQNTLQQSNNVASIQMEAQPSAIRTSAEKCTPSES 247
             +V AP  ++KSR   +T      G N L Q++  A+    ++  A+   A+    +E 
Sbjct: 179 EVSVTAPPVSRKSRITRAT------GTNKLPQADKAAN-NFTSETLAV---AKNSASNEL 228

Query: 248 QVQGSNVAKCLFNQSSHAVSSNSPVPKTPPRGKSLQSDTNISPAEISSVAPSNREAAPTR 307
              GS+V KCLFN++  +V ++S   +TP +  S  SD + S  +   V P+N     T 
Sbjct: 229 IGNGSSVVKCLFNKADSSVPTSSTCFRTPQKHASSGSDKSNSSQK--EVTPTN-----TN 281

Query: 308 CTVISTRRVMVSPAKQM-AY-IEMSHCI---SPVKTDSNRVNKRDHVRSRLDFDAVDIPE 362
           CT+++  R  +SP KQ+ +Y +E SH I   SPVK++    NKRDHV+ +L+FD  D   
Sbjct: 282 CTIVTKERFTISPLKQITSYSVERSHLISFSSPVKSNLKMSNKRDHVKGKLNFDDTDTET 341

Query: 363 SLNNPLPNEI---STSESEKDLDIFGIDFPNLDALGIDFSFSELLNYLDFPCD 412
            L  P   ++   S S SE ++D+F +DF NLD       FSELL   D  C+
Sbjct: 342 CLEAPATADLVSTSPSGSEPEVDLFDMDFSNLD-------FSELLVDFDLGCE 387