Miyakogusa Predicted Gene

Lj5g3v0616360.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v0616360.1 Non Characterized Hit- tr|F4IPZ8|F4IPZ8_ARATH
Uncharacterized protein OS=Arabidopsis thaliana
GN=At2,36.33,4e-18,seg,NULL,CUFF.53532.1
         (641 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G53320.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   114   2e-25
AT2G37070.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    97   3e-20
AT5G60150.1 | Symbols:  | unknown protein; Has 1807 Blast hits t...    50   4e-06
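
The Score (bits) and E Value columns above are related through the Karlin-Altschul estimate E ≈ m · n · 2^(-S'), where S' is the bit score, m the query length, and n the database size in letters. The sketch below plugs in the raw lengths from this report (641 and 14,482,855). It is only an approximation: BLAST uses edge-corrected "effective" lengths, so the values reported above are somewhat smaller than this estimate. The function name `approx_evalue` is hypothetical and not part of BLAST.

```python
def approx_evalue(bit_score: float, query_len: int, db_letters: int) -> float:
    """Rough E-value from a bit score: E ~= m * n * 2^(-S').

    Assumption: raw (uncorrected) lengths are used, so the result
    overestimates the E-value that BLAST itself reports.
    """
    return query_len * db_letters * 2.0 ** (-bit_score)


QUERY_LEN = 641            # "(641 letters)" in the report header
DB_LETTERS = 14_482_855    # "14,482,855 total letters" in TAIR10_pep

# Bit scores as given in the per-hit alignment headers of this report.
hits = {"AT3G53320.1": 114.0, "AT2G37070.1": 97.4, "AT5G60150.1": 50.4}

for name, bits in hits.items():
    print(f"{name}\t{bits:6.1f} bits\tE ~ {approx_evalue(bits, QUERY_LEN, DB_LETTERS):.1e}")
```

Each estimate should land within a small factor of the reported E value, which is a quick sanity check that a score and E value belong to the same search.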

>AT3G53320.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G37070.1); Has 11044 Blast hits to 5993
           proteins in 551 species: Archae - 8; Bacteria - 1486;
           Metazoa - 4078; Fungi - 1814; Plants - 348; Viruses -
           112; Other Eukaryotes - 3198 (source: NCBI BLink). |
           chr3:19769397-19772278 REVERSE LENGTH=553
          Length = 553

 Score =  114 bits (284), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 109/300 (36%), Positives = 148/300 (49%), Gaps = 46/300 (15%)

Query: 3   DDTFLLSNTDLRRLSLIDFSSADDSLIATSPQHSGPSFEDAAIKLREWEQEPQPNETQSP 62
           DD  L  + DL  +   D    DD ++A+S              + E E+  QP+E+  P
Sbjct: 39  DDKCLKEDKDLNFMR--DTQYCDDEILASS--------------VEEKEEVLQPHESPEP 82

Query: 63  EKT---SKCNLRASLAWDKAFFTSAGVLDPEELSSMIEKKGEKLALALPGIQEDVQGSCE 119
           EK     K NLR SLAWD  FFTSAGVL+PEELSSM+E   +    ALP I ED+  S E
Sbjct: 83  EKVMKKGKYNLRKSLAWDNEFFTSAGVLEPEELSSMMESNHKSGKKALPTILEDINRSTE 142

Query: 120 SFSTFESDSLTLESLEADLFGDVRASIQKSSSRVSNEASVNSKARV-------PSPRFRT 172
           S STF+SD     S E  LF DVRASIQ+S+   +++ +   K+ V        SP   T
Sbjct: 143 SISTFQSDCTVENSQEFVLFEDVRASIQRSAK--TSDVATPGKSNVLRATDVAISPTSST 200

Query: 173 DRFSKRDGMASCNKIPSSKSPSAVMQGAGKLTKKSPIFSQLPGTPVASRE-EPSILKQSK 231
              +   G       P  ++PS V QG GK TK+          PVA+R    SI K   
Sbjct: 201 VDVTATQGKTKSKGSP--RNPSRV-QGPGKATKQ----------PVATRGLSTSISKPPN 247

Query: 232 LLGKSTPSSTISSKRASPSNLHV-KSENDKAKGLIGGKVNSMTSTPVFKGSQVIVPKPTI 290
            L K  P ST S+ R+S   L + K++ +K   L  GK        + + ++ ++PKP +
Sbjct: 248 GLSKVRPLSTTSTNRSS---LDISKTQQEKNSKLPAGKEPLGPRISMSRRAKPVLPKPGV 304


>AT2G37070.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G53320.1); Has 1323 Blast hits to 775 proteins
           in 176 species: Archae - 0; Bacteria - 113; Metazoa -
           351; Fungi - 175; Plants - 115; Viruses - 13; Other
           Eukaryotes - 556 (source: NCBI BLink). |
           chr2:15578415-15581125 FORWARD LENGTH=530
          Length = 530

 Score = 97.4 bits (241), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 109/300 (36%), Positives = 140/300 (46%), Gaps = 49/300 (16%)

Query: 12  DLRRLSLIDFSSADDSLIA------TSPQHSGPSFEDAAIKLREWEQEPQPNET------ 59
           D+  LSLIDFS+ DD+L+       T+   S    ED  + L          ET      
Sbjct: 14  DINGLSLIDFSAEDDNLLLSSFLDPTTFDFSDTDKEDRGLCLFGDTHNCCDEETLGSTTL 73

Query: 60  --QSPEKTSKCNLRASLAWDKAFFTSAGVLDPEELSSMIEKKGEKLALALPGIQEDVQGS 117
             + P K  K NLR SLAWDKAFFT+AGVL+P+ELSSM+ +K      +LP +QED+  S
Sbjct: 74  EEKEPHKIMKTNLRKSLAWDKAFFTNAGVLEPDELSSMMGRK------SLPAVQEDLHRS 127

Query: 118 CESFSTFESDSLTLESLEADLFGDVRASIQKSSSRVSNEASVNSKARVPSPRFRT-DRFS 176
            ES ST +SD  T+E+ +     D  A+  K     S EA       VPSP   T D  S
Sbjct: 128 TESMSTLKSD-CTVETGQEFFMCDA-ATPDKRKDLGSTEA-------VPSPTTSTLDDPS 178

Query: 177 KRDGMASCNKIPSSKSPSAVMQGAGKLTKKSPIFSQLPGTPVASREE-PSILKQSKLLGK 235
             + M      P  K P    QG  K TK           PVAS E   SI + S  L +
Sbjct: 179 SEEKM---KPNPIRKRPGIRSQGLAKATKH----------PVASEEHNTSISRPSTGLNR 225

Query: 236 STPSSTIS-SKRASPSNLHVKSENDKAKGLIGGKVNSMTSTPVFKGSQVIVPKPTISSKS 294
             PSS +S +KRAS      K E +      GGK    +  P+ +  + IV  P +  KS
Sbjct: 226 --PSSGLSKTKRASVDTNKAKQETNPKSS--GGKEPLASRVPISRRPRPIVSTPVVPFKS 281


>AT5G60150.1 | Symbols:  | unknown protein; Has 1807 Blast hits to
           1807 proteins in 277 species: Archae - 0; Bacteria - 0;
           Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0;
           Other Eukaryotes - 339 (source: NCBI BLink). |
           chr5:24218211-24223245 FORWARD LENGTH=1195
          Length = 1195

 Score = 50.4 bits (119), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 80/170 (47%), Gaps = 26/170 (15%)

Query: 41  EDAAIKLREWEQEPQPNETQSPEKTSKCNLRASLAWDKAFFTSAGVLDPEELSSMIEKKG 100
           E+A ++L +   E Q  + +  +K +  NLR SLAWD+AF T  GVLD  ELS +     
Sbjct: 83  ENAKVELPKLSVERQ--QMKKKKKNAGFNLRKSLAWDRAFSTEEGVLDSSELSKITGTAC 140

Query: 101 EKLALALPGIQEDVQGSCESFSTFESD-SLTLESLEADLFGDVRASIQKSSSRVSNEASV 159
                 L  IQE+ +   ES S  + + S  L++LE +LF D+                V
Sbjct: 141 HLGGDRLAAIQEEYR---ESMSASKCNVSPGLQALEENLFNDL---------------PV 182

Query: 160 NSKARVPSPRFRTDRFSKRDGMASCNKIPSSKSPSAVMQGAGKLTKKSPI 209
           NSK R    +  +    K     S +K+P++KS    +    K T +SPI
Sbjct: 183 NSKNR--EKKLVSGIMPKE---LSISKVPTTKSDPVTVGNNMKRTTQSPI 227
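
The per-hit statistics lines (Score, Expect, Identities, Positives, Gaps) follow a fixed layout and can be pulled out with two regular expressions. The following is a minimal sketch, assuming the two-line layout seen in this report; `parse_stats` and `truncated_percent` are hypothetical helper names, and the sample text is copied from the first hit above. The percentages in this report appear to be truncated rather than rounded (for example, 140/300 is shown as 46%), so the helper uses integer division.

```python
import re

# Statistics lines copied from the AT3G53320.1 hit in this report.
STATS = """\
 Score =  114 bits (284), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 109/300 (36%), Positives = 148/300 (49%), Gaps = 46/300 (15%)
"""

SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def truncated_percent(numerator: int, denominator: int) -> int:
    """Percentage with the fractional part dropped, matching the report."""
    return 100 * numerator // denominator


def parse_stats(text: str) -> dict:
    """Extract bit score, raw score, E-value and count fractions from one hit."""
    match = SCORE_RE.search(text)
    if match is None:
        raise ValueError("no 'Score = ... Expect = ...' line found")
    stats = {
        "bits": float(match.group(1)),
        "raw": int(match.group(2)),
        "expect": float(match.group(3)),
    }
    # Each fraction is stored as (count, alignment_length).
    for label, num, den in FRACTION_RE.findall(text):
        stats[label.lower()] = (int(num), int(den))
    return stats


stats = parse_stats(STATS)
for key in ("identities", "positives", "gaps"):
    num, den = stats[key]
    print(f"{key}: {num}/{den} = {truncated_percent(num, den)}%")
```

For anything beyond a quick check, a maintained BLAST parser or a tabular output format is likely a better choice than hand-written regular expressions.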