Miyakogusa Predicted Gene

Lj5g3v2110820.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v2110820.1 CUFF.56671.1
         (360 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G33890.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   278   3e-75
AT4G33890.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   278   3e-75
AT2G14850.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   257   1e-68
AT2G24530.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   161   6e-40
AT4G31440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   160   2e-39
AT5G67410.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   151   5e-37

>AT4G33890.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr4:16250057-16251085 FORWARD LENGTH=342
          Length = 342

 Score =  278 bits (712), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 168/357 (47%), Positives = 238/357 (66%), Gaps = 35/357 (9%)

Query: 10  RVDTLKLKALIVRKVGQQRAGKYFGQLGRLLSSKISKSEFDRVCIRTIGRENIPLHNQFI 69
           R+DTL++KALI R++G QRA  YF QLGR  + KI+KSEFD++CI+TIGR+NI LHN+ I
Sbjct: 9   RLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNRLI 68

Query: 70  KAILKNACSSKVPPPRGPARIGSALSGRDSNGLQQSSVQIPYGDAVPSLLRKDGSVAPRE 129
           ++I+KNAC +K PP     + GS +   + +  + S +Q  +GD+  S      +   R 
Sbjct: 69  RSIIKNACIAKSPP--FIKKGGSFVRFGNGDSKKNSQIQPLHGDSAFS----PSTRKCRS 122

Query: 130 QKFKGRRNGFGRLGKPQSLTP--EKLIHKAQQQQSATELNSLGSRPPISVEEGEEVEQMA 187
           +K + R +  G LGKP SLT   E+ + KA   QSATEL SLGSRPP+ V   EE E++ 
Sbjct: 123 RKLRDRPSPLGPLGKPHSLTTTNEESMSKA---QSATELLSLGSRPPVEVVSVEEGEEVE 179

Query: 188 R----SPSIQSRSPVTAPLGISMNFGHG--RKLLSNVLGSKC----HPDTCQSSGDLPDT 237
           +    SPS+QSR P+TAPLG+SM+  +G  RK +SNV  S C    + +TCQ++G+LPDT
Sbjct: 180 QIAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNV--SMCSRSFNRETCQNNGELPDT 237

Query: 238 RSLRSRLEQKLEKEGLTVTVDCVNLLNNALDSYLKRLIESSIGLAGSRSGSEYLRMRNRQ 297
           R+LRSRLE++LE EGL +T+D V+LLN+ LD +++RLIE  + LA +R G++ +R  N Q
Sbjct: 238 RTLRSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDRVREMNYQ 297

Query: 298 SVTGSNVLLPARYMQTATQSAGVSVSDFRVAMELNPQVLGPDWPIQLEKICICASEE 354
                       Y Q + + + VS+SDFR  MELN ++LG DWP+ +EKIC  AS++
Sbjct: 298 ------------YTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSRASDK 342


>AT4G33890.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr4:16250057-16251085 FORWARD
           LENGTH=342
          Length = 342

 Score =  278 bits (712), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 168/357 (47%), Positives = 238/357 (66%), Gaps = 35/357 (9%)

Query: 10  RVDTLKLKALIVRKVGQQRAGKYFGQLGRLLSSKISKSEFDRVCIRTIGRENIPLHNQFI 69
           R+DTL++KALI R++G QRA  YF QLGR  + KI+KSEFD++CI+TIGR+NI LHN+ I
Sbjct: 9   RLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNRLI 68

Query: 70  KAILKNACSSKVPPPRGPARIGSALSGRDSNGLQQSSVQIPYGDAVPSLLRKDGSVAPRE 129
           ++I+KNAC +K PP     + GS +   + +  + S +Q  +GD+  S      +   R 
Sbjct: 69  RSIIKNACIAKSPP--FIKKGGSFVRFGNGDSKKNSQIQPLHGDSAFS----PSTRKCRS 122

Query: 130 QKFKGRRNGFGRLGKPQSLTP--EKLIHKAQQQQSATELNSLGSRPPISVEEGEEVEQMA 187
           +K + R +  G LGKP SLT   E+ + KA   QSATEL SLGSRPP+ V   EE E++ 
Sbjct: 123 RKLRDRPSPLGPLGKPHSLTTTNEESMSKA---QSATELLSLGSRPPVEVVSVEEGEEVE 179

Query: 188 R----SPSIQSRSPVTAPLGISMNFGHG--RKLLSNVLGSKC----HPDTCQSSGDLPDT 237
           +    SPS+QSR P+TAPLG+SM+  +G  RK +SNV  S C    + +TCQ++G+LPDT
Sbjct: 180 QIAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNV--SMCSRSFNRETCQNNGELPDT 237

Query: 238 RSLRSRLEQKLEKEGLTVTVDCVNLLNNALDSYLKRLIESSIGLAGSRSGSEYLRMRNRQ 297
           R+LRSRLE++LE EGL +T+D V+LLN+ LD +++RLIE  + LA +R G++ +R  N Q
Sbjct: 238 RTLRSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTDRVREMNYQ 297

Query: 298 SVTGSNVLLPARYMQTATQSAGVSVSDFRVAMELNPQVLGPDWPIQLEKICICASEE 354
                       Y Q + + + VS+SDFR  MELN ++LG DWP+ +EKIC  AS++
Sbjct: 298 ------------YTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSRASDK 342


>AT2G14850.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes -
           4 (source: NCBI BLink). | chr2:6386400-6387275 FORWARD
           LENGTH=291
          Length = 291

 Score =  257 bits (656), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 152/348 (43%), Positives = 207/348 (59%), Gaps = 64/348 (18%)

Query: 8   YIRVDTLKLKALIVRKVGQQRAGKYFGQLGRLLSSKISKSEFDRVCIRTIGRENIPLHNQ 67
           + R+++L++KALI +K+G QRA  YF QLG+ L+S+ISKSEFD++C +T+GRENI LHN+
Sbjct: 7   FSRLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGRENISLHNR 66

Query: 68  FIKAILKNACSSKVPPPRGPARIGSALSGRDSNGLQQSSVQIPYGDAV-PSLLRKDGSVA 126
            +++ILKNA  +K PPPR P +                     YGD V P   RK     
Sbjct: 67  LVRSILKNASVAKSPPPRYPKKSL-------------------YGDPVFPPSPRKC---- 103

Query: 127 PREQKFKGRRNGFGRLGKPQSLTPEKLIHKAQQQQSATELNSLGSRPPISVEEGEEVEQM 186
            R +KF+ R +  G LGKPQSLT       ++ Q+   E+        +SVE+GEEVEQM
Sbjct: 104 -RSRKFRDRPSPLGPLGKPQSLTTTNDESMSKAQRLPMEV--------VSVEDGEEVEQM 154

Query: 187 ARSPSIQSRSPVTAPLGISMNFGHGRKLLSNVLGSKCHPDTCQSSGDLPDTRSLRSRLEQ 246
             SPS+QSRSP+TAPLG+S +    +   S   G   + +TCQSSG+LPD  +LR+RLE+
Sbjct: 155 TGSPSVQSRSPLTAPLGVSFHL-KSKARFSTYNG--INRETCQSSGELPDMITLRARLEK 211

Query: 247 KLEKEGLTVTVDCVNLLNNALDSYLKRLIESSIGLAGSRSGSEYLRMRNRQSVTGSNVLL 306
           KLE EG+ +++D  NLLN  L++Y++RLIE  + LA                        
Sbjct: 212 KLEMEGIKLSMDSANLLNRGLNAYMRRLIEPCLSLAS----------------------- 248

Query: 307 PARYMQTATQSAGVSVSDFRVAMELNPQVLGPDWPIQLEKICICASEE 354
                Q     + VS+ DF  AME+NP+VLG +WPIQLEKIC  ASEE
Sbjct: 249 -----QQKRAVSNVSMLDFHAAMEVNPRVLGEEWPIQLEKICCRASEE 291


>AT2G24530.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10422597-10423820 FORWARD LENGTH=407
          Length = 407

 Score =  161 bits (408), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 131/402 (32%), Positives = 198/402 (49%), Gaps = 62/402 (15%)

Query: 10  RVDTLKLKALIVRKVGQQRAGKYFGQLGRLLSSKISKSEFDRVCIRTIGRENIPLHNQFI 69
           R+   +LK  IV+K G +R+ +YF  LGR LS K++KSEFD+ C+R +GREN+ LHNQ I
Sbjct: 8   RISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENLSLHNQLI 67

Query: 70  KAILKNACSSKVPPPR---GPARIGSALSGRDSNGLQQSSVQIP-YGDAVPSLLRKDGSV 125
           ++IL+NA  +K PPP    G +   +A   R  +GL+QS   IP +    P        +
Sbjct: 68  RSILRNATVAKSPPPDHEAGHSTKANAFQSR-GDGLEQSGTLIPNHSQHEPVWSNGVLPI 126

Query: 126 APRE-------QKFKGRRNGFGRLGKPQSLTPEKL----------IHKAQQQQS----AT 164
           +PR+       +K + R +  G  GK + +  + +          +     Q+S    A 
Sbjct: 127 SPRKVRSGMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENGDYQRSGRYVAD 186

Query: 165 ELNSLGSRP-------------PISVEEGEEVEQMARSPSIQSRSPVTAPLGI---SMNF 208
           E +    RP              +S+ + +  E+ AR     S SP+ APLGI   S + 
Sbjct: 187 EKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQARVN--LSMSPLIAPLGIPFCSASV 244

Query: 209 GHGRKLLSNVLGSKCHPDTCQSSGDLPDTRSLRSRLEQKLEKEGLT-VTVDCVNLLNNAL 267
           G   + +   + +     +C  SG LPD   LR R+E     +GL  V+++C   LNN L
Sbjct: 245 GGSPRTIP--VSTNAELISCYDSGGLPDIEMLRKRMENIAVAQGLEGVSMECAKTLNNML 302

Query: 268 DSYLKRLIESSIGLAGSRS-----GSEYLRMRNRQSVTGSNVLLPARYMQTATQSA---- 318
           D YLK+LI S   L G+RS     G + +  +  Q+    N + P   ++  T +     
Sbjct: 303 DVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKI-VNGVWPTNSLKIQTPNGSSDI 361

Query: 319 -----GVSVSDFRVAMELNPQVLGPDWPIQLEKICICASEER 355
                 VS+ DFR AMELNP+ LG DWP   E+I + + EE+
Sbjct: 362 RQDHHSVSMLDFRTAMELNPRQLGEDWPTLRERISLRSFEEQ 403


>AT4G31440.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins
           in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2;
           Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr4:15253731-15254870 FORWARD
           LENGTH=379
          Length = 379

 Score =  160 bits (404), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 129/382 (33%), Positives = 194/382 (50%), Gaps = 49/382 (12%)

Query: 10  RVDTLKLKALIVRKVGQQRAGKYFGQLGRLLSSKISKSEFDRVCIRTIGRENIPLHNQFI 69
           R+D  +LK  IV+KVG +R+ +YF  LGR LS K++KSEFD+ C R +GREN+ LHN+ I
Sbjct: 8   RIDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENLSLHNKLI 67

Query: 70  KAILKNACSSKVPP-------PRGPARIGSALSGRDSNGLQQSSVQ--IPYGDAVPSLLR 120
           ++IL+NA  +K PP       P     +G      +S  L    ++  +   + V + +R
Sbjct: 68  RSILRNASLAKSPPSVHQSGHPGKSLVLGKEDGPEESRSLNPDHIRNDLALSNGVLAKVR 127

Query: 121 KDGSVAPREQKFK-------GRRNG---FGRLGKPQSLTPEKLIHKAQQQQSATELNSLG 170
             G+   R  + K       G+  G   + R G+         +  A+Q+  + +     
Sbjct: 128 P-GTCDDRTIRDKPCPLGSNGKVLGPFAYSRPGRYPDERDSAFLCPAEQKAVSGKDQVAA 186

Query: 171 SRPPISVEEGEEVEQMARSPSIQSRSPVTAPLGI---SMNFGHGRKLLSNVLGSKCHPDT 227
              PIS ++  +V        I S  PV APLGI   S + G  R+ +   + +     +
Sbjct: 187 ---PISRDDEAQVR-------ILSTPPVMAPLGIPFCSASVGGDRRTVP--VSTSAAAIS 234

Query: 228 CQSSGDLPDTRSLRSRLEQKLEKEGLT-VTVDCVNLLNNALDSYLKRLIESSIGLAGSRS 286
           C  SG L DT  LR R+E     +GL  V+ +C  +LNN LD YLK+L++S + LAG+RS
Sbjct: 235 CYDSGGLSDTEMLRKRMENIAVTQGLGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARS 294

Query: 287 -----GSEYL-RMRNRQSVTGSNVLLPARYMQTATQSA-------GVSVSDFRVAMELNP 333
                G   L + ++R  +        + ++QT+ Q +        VS+ DFRVAMELNP
Sbjct: 295 MNGTPGKHSLEKQQSRDELVNGVRTNNSFHIQTSNQPSDITREQHSVSLLDFRVAMELNP 354

Query: 334 QVLGPDWPIQLEKICICASEER 355
             LG DWP+  E+I I   EER
Sbjct: 355 HQLGEDWPLLRERISISLFEER 376


>AT5G67410.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14850.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:26896600-26897463
           REVERSE LENGTH=287
          Length = 287

 Score =  151 bits (382), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 118/348 (33%), Positives = 175/348 (50%), Gaps = 67/348 (19%)

Query: 1   MTVPKRSYIRVDTLKLKALIVRKVGQQRAGKYFGQLGRLLSSKISKSEFDRVCIRTIGRE 60
           M   +   +R D  +LK+ I +++G+ +   Y   L + LS KISKS+FD++ I T+ RE
Sbjct: 1   MPTSQHHVVRTDISELKSQIEKRIGRAKTESYLNLLSKFLSLKISKSDFDKLIIVTVKRE 60

Query: 61  NIPLHNQFIKAILKNACSSKVPPPRGPARIGSALSGRDSNGLQQSSVQIPYGDAVPSLLR 120
           NI LHN  ++ ILKN C SK  PP          +G +S+  ++  +   +      L R
Sbjct: 61  NISLHNALLRGILKNICLSKTLPP-------FVKNGVESDNKKKKQLNGAFQSLCKELPR 113

Query: 121 KDGSVAPREQKFKGRRNGFGRLGKPQSLTPEKLIHKAQQQQSATELNSLGSRPPISVEEG 180
                +PR+ + + R N  G + K +SL               TE+ S   R   S+E  
Sbjct: 114 -----SPRKGRTQRRLNKDGNISKGKSL--------------VTEVVSSSGRQQWSMENV 154

Query: 181 EEVEQMARSPSIQSRSPVTAPLGISMNFGHGRKLLSNVLGSKCHPDTC-QSSGDLPDTRS 239
           EEV+Q+   P  +S+ P+ AP G++         L +V+  +   DTC  SSG+LPD+ S
Sbjct: 155 EEVDQLI--PCWRSQ-PIEAPFGVN---------LRDVIKKQHRIDTCCYSSGELPDSVS 202

Query: 240 LRSRLEQKLEKEGLTVTVDCVNLLNNALDSYLKRLIESSIGLAGSRSGSEYLRMRNRQSV 299
           L+ +LE  LE EGL V+V   N LN  LD +LKRLI+  + LA S               
Sbjct: 203 LKKKLEDDLE-EGLEVSVGFANSLNAGLDVFLKRLIKPCLELAAS--------------- 246

Query: 300 TGSNVLLPARYMQTATQSAGVSVSDFRVAMELNPQVLGPDWPIQLEKI 347
                       +++  S+  S+ DF+VAM LNP +LG DWP +LEKI
Sbjct: 247 ------------RSSNASSASSLVDFQVAMALNPSILGEDWPTKLEKI 282