Miyakogusa Predicted Gene

chr5.LjT42D02.30.nd
Show Alignment: 

BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= chr5.LjT42D02.30.nd - phase: 0 
         (1728 letters)

Database: TAIR8_pep 
           32,825 sequences; 13,166,001 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G42950.1 | Symbols:  | GYF domain-containing protein | chr5:1...   625   e-179
AT1G27430.1 | Symbols:  | GYF domain-containing protein | chr1:9...   103   1e-21
AT1G24300.1 | Symbols:  | GYF domain-containing protein | chr1:8...    76   2e-13

>AT5G42950.1 | Symbols:  | GYF domain-containing protein |
           chr5:17241664-17248272 FORWARD
          Length = 1714

 Score =  625 bits (1613), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 373/867 (43%), Positives = 504/867 (58%), Gaps = 82/867 (9%)

Query: 15  VAAAPPLQISKDVTGSDNPIPLSPQWLLPKPVESKPGSGNVGNNVISIPPYGTHSETVKT 74
           ++  PP QI KD+ GSDN IPLSPQWLL KP E+K G G    N      YG HS+ V+T
Sbjct: 17  LSVNPPHQIFKDIQGSDNAIPLSPQWLLSKPGENKTGMGTGDPNQ-----YGNHSDVVRT 71

Query: 75  SGNGEDGHDVQKRKDVFRPSMLXXXXXXXXXXXXXXXXTKSATRKDRWRDGDKDLGDSRK 134
           +GNGE+  D  K+KDVFRPS+L                T S+ R DRWR+GDKD GD++K
Sbjct: 72  TGNGEETLDNLKKKDVFRPSLLDAESGRRDRWRDEERDTLSSVRNDRWRNGDKDSGDNKK 131

Query: 135 VDRWADSLPSKNLGEARRGASDSHRRNDSGNRETNFDQRRESKWNTRWGPDNKEPEGPRE 194
           VDRW +  P    GE RRG +D  R  DSGN++   +QRRESKWN+RWGPD+KE E PR 
Sbjct: 132 VDRWDNVAP--KFGEQRRGPND--RWTDSGNKDAAPEQRRESKWNSRWGPDDKEAEIPRN 187

Query: 195 KWSDSAKDGDIHLDKGLSHISNLGKDEKEGDHYRPWRPNYSQSRGRVEPPHSQNTTPNKQ 254
           KW +  KDG+I  +KG S  ++      +GDHYRPWRP  SQ RGR E  H+Q +TPNKQ
Sbjct: 188 KWDEPGKDGEIIREKGPSLPTS------DGDHYRPWRP--SQGRGRGEALHNQ-STPNKQ 238

Query: 255 ASTFSYGRGRGEYTPPIXXXXXXXXXXXXXXXXXTYA----GAPLDKVESGHDDPGHFRY 310
            ++FS+ RGRGE T                           G+  DK ESG  +P H RY
Sbjct: 239 VTSFSHSRGRGENTAIFSAGRGRMSPGGSIFTSAPNQSHPPGSASDKGESGPGEPPHLRY 298

Query: 311 NRTKLLDLYRVTDVGTNRKQVDEFVQVPNLTQDEPVEPLALFAPTSEELTVLNGIDKGEI 370
           +R KLLD+YR+ D     K  D F++VP+LT +EP +PLAL AP+S+E+ VL+ I+KG+I
Sbjct: 299 SRMKLLDVYRMADTECYEKFPDGFIEVPSLTSEEPTDPLALCAPSSDEVNVLDAIEKGKI 358

Query: 371 ISSSAPQVPKD---GRGSTDFTHTRR---------MKLGSSPLQDRV------------- 405
           +SS APQ  KD   GR   +F+  RR         M  G+   +D               
Sbjct: 359 VSSGAPQTSKDGPTGRNPVEFSQPRRIRPAGSREDMTFGAEESKDESGETRNYPDDKFRP 418

Query: 406 ----EGGGSDRVADE--LSSNRDSSFEGNSNVHSGAGWRAMPGGEQATALLHDSRDVSND 459
               EG    R  +E  +   ++ S +GN++V S + WR   GGE++    HD  D S D
Sbjct: 419 EASHEGYAPFRRGNEAPVRELKEPSMQGNAHVQSASPWRQSSGGERSNRNSHDWNDPSAD 478

Query: 460 ARIRKSDMSSHQPKDPFNQRGNNLSYLSDSKDVGKWHASGDPNVKRQLSGIFDSELESRR 519
           +R++ SD     PKD  N  G N   L  SK   +W  S DP+++RQ S +FD E E R+
Sbjct: 479 SRLKSSDSVWSHPKDSINHLGGNNMMLPQSKGESRWQISEDPSLRRQPSLVFDREQEVRK 538

Query: 520 VPPTAPEELSLFYKDPNGQIQGPFKGIDIIGWFEAGYFGIDLPVRLENSAADSPWLSLGD 579
           + P++PEELSL+YKDP G IQGPF G DIIGWFEAGYFGIDL VRL ++  DSP+  LGD
Sbjct: 539 LLPSSPEELSLYYKDPQGLIQGPFSGSDIIGWFEAGYFGIDLLVRLASAPNDSPFSLLGD 598

Query: 580 AMPHLRAKARPPPGFSAPKPNEFTDISVRQNSSTFGNTLTGLSESELLKSDSRHRQGADS 639
            MPHLRAK+ PPPGF+  K NEF D +        G   +G+ E+++L++D R++  A +
Sbjct: 599 VMPHLRAKSGPPPGFTGAKQNEFVDAAGTSAFPGVGKVHSGMGETDMLQNDMRYKHVAGT 658

Query: 640 EAENRFMEPFMSGSKSSPALDSLKLSEGFHXXXXXXXXXXXXXXXXXXXXLYSVANRLAL 699
            AENRF+E  MSG  ++ A       +G                      +Y +A +L L
Sbjct: 659 VAENRFIESLMSGGLTNSA-------QGVQGYGVNSSGGLSLPVTDGGADMYLLAKKLEL 711

Query: 700 ERQMSLPKPYPYWPGRDAPSVAPTSNIVPDASLQSKLLSSVNDNSHQP-QSVNSELMSII 758
           ERQ S+P PY YWPGR++ ++ P S              +V++N+ QP +S +S+L+SI+
Sbjct: 712 ERQRSIPSPYSYWPGRESANLMPGSE-------------NVSENAQQPTRSPSSDLLSIL 758

Query: 759 QGLSDRTSAGVNNGAAGLPNY--PLQNNVDLLLNQSL-PQMPFGIQQQRLPTQNQLSMSN 815
           QG++DR+S  V   +  LP +  P+Q   DL   ++   Q+PFG+QQQRLP QN L +S 
Sbjct: 759 QGVTDRSSPAV---SGPLPAWSQPIQKESDLHHAKTFQTQIPFGVQQQRLPEQN-LPLSG 814

Query: 816 LLAQ-AADNPSSALTAEKLLSSGLSQD 841
           LL Q   +NP   L+ + +L++GLSQ+
Sbjct: 815 LLGQPMENNPGGMLSPDMMLAAGLSQE 841



 Score =  407 bits (1046), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 272/591 (46%), Positives = 348/591 (58%), Gaps = 30/591 (5%)

Query: 1149 QIKGKENQIGNAVPEAVDHQEVSCLPASVTKSITETVLVGESNAAAGSVSS-QNTGVPAG 1207
            +IKGK  +  + + +   H     + +S   +   + +  E ++  G  SS QNT    G
Sbjct: 1143 EIKGKTKKSADTLIDNDTH----LIKSSTATASNTSQMSSEVDSVRGEESSLQNTRTQPG 1198

Query: 1208 RAWKPAPGIRAKSLLEIQQEEQRKAEAEMLASKVATAVNSMSLASPWVGVGGGANSDSVK 1267
            RAWKPAPG + KSLLEIQ EEQR A+AE LA K+++ VNS+  A+PW G+    NSDS  
Sbjct: 1199 RAWKPAPGFKPKSLLEIQMEEQRVAQAEALAPKISSTVNSVGSAAPWAGIV--TNSDS-N 1255

Query: 1268 VSGESHRGGNIEYP--VLSETSQNIKSKKSQLHDLLAEEVLKKSNEKDAEVPDSALSSNN 1325
            +  E+H    I     V  E+   +K+KKS LHDLLA++V  KS++K+ EV +  + SNN
Sbjct: 1256 ILRETHGESAITQTGVVKPESVPTLKAKKSHLHDLLADDVFAKSSDKEREVME--IISNN 1313

Query: 1326 ------LTVHSEPLDDSSFIEAXXXXXXXXXXXXXXXXXXXXLVSVASAEAPMASSPIEK 1379
                   T ++E  DD +FI+A                       V + +  + ++ +EK
Sbjct: 1314 DAFMQVTTTNAESFDDDNFIDARETKKSRKKSARAKTSGAKIAAHVPAVDTSLQTNSVEK 1373

Query: 1380 SKSFRS-AQQEKEVLPPIPAGPSLGDFVLWKGEQEXXXXXXXXAWSTDSGRIPKPTSLRD 1438
             KS R   QQEKEVLP IP+GPSLGDFVLWKGE          AWS+   +  KP+SLRD
Sbjct: 1374 GKSSRILQQQEKEVLPAIPSGPSLGDFVLWKGES-VNNPPPAAAWSSGPKKSTKPSSLRD 1432

Query: 1439 ILKEQEK-KTSSAVPANPVPTPQKSQPTQATWNXXXXXXXXXXXXXXXXXXXXINSHAPS 1497
            I+KEQEK  TSS  P +PVPT QK+ P QA                         S + S
Sbjct: 1433 IVKEQEKMTTSSHPPPSPVPTTQKAIPPQA-------HQGGASWSRSASSPSQAVSQSSS 1485

Query: 1498 QSKYKGDDDLFWGPIEQPKQETKQSDFPQLASQGSRGSKNIPLKKGTSTGLLTRQKSASG 1557
            QSK KGDDDLFWGP+EQ  Q+TKQ DFP L SQ S G+KN P K    T L  RQKS S 
Sbjct: 1486 QSKSKGDDDLFWGPVEQSTQDTKQGDFPHLTSQNSWGTKNTPGKVNAGTSL-NRQKSVSM 1544

Query: 1558 KPTERPLSSSPASSQPVLKLKKDAMTKLSEAIGFRDWCENECVRLIGTKDTSFLEFCLKQ 1617
               +R LSS P  +Q   K KK+A+TKL+EA GFRDWC++EC+RL+G++DTS LEFCLK 
Sbjct: 1545 GSADRVLSS-PVVTQASHKGKKEAVTKLTEANGFRDWCKSECLRLLGSEDTSVLEFCLKL 1603

Query: 1618 SRSEAEMFLIENLGSFDPDHKFIDKFLNYKELLPSDVLDIAFKSRNDKKVTGLGAAGMVS 1677
            SRSEAE  LIENLGS DPDHKFIDKFLNYK+LLPS+V++IAF+S+     T         
Sbjct: 1604 SRSEAETLLIENLGSRDPDHKFIDKFLNYKDLLPSEVVEIAFQSKGSGVGTRNNTGEDYY 1663

Query: 1678 ANADLQDGDHTEGTSXXXXXXXXXXXVSPAVLGFNVVSNRIMMGEIQTVED 1728
             N    +   ++              +S +VLGFNVVSNRIMMGEIQT+ED
Sbjct: 1664 YNTTAANDGFSKVGGKKKAKKGKKVSLSASVLGFNVVSNRIMMGEIQTIED 1714


>AT1G27430.1 | Symbols:  | GYF domain-containing protein |
           chr1:9521032-9526915 REVERSE
          Length = 1531

 Score =  103 bits (256), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 94/266 (35%), Positives = 126/266 (47%), Gaps = 59/266 (22%)

Query: 131 DSRKVDRWADSLPSKNLGEARRGASDSHRRNDSGNRETNFDQRRESKWNTRWGPDNKEPE 190
           D RK +R  DS+ S+  G+ +  A+ S R ND  +R    + RR++KW++RWGPD+KE E
Sbjct: 108 DRRKTERRIDSVSSRETGDIKNAAA-SDRWNDVNSRAAVHEPRRDNKWSSRWGPDDKEKE 166

Query: 191 GPREKWSDSAKDGDIHLDKGLSHISNL-GKDEKEGDHYRPWRPNY---SQS--------- 237
              EK  D  KD +    +  S +SN+    E++ D    WRP +   SQS         
Sbjct: 167 ARCEK-VDINKDKEEPQSESQSVVSNVRATSERDSDTRDKWRPRHRMESQSGGPSSYRAA 225

Query: 238 ------RGRVEPPHSQNTTPNKQASTFSYGRGRGEYTPPIXXXXXXXXXXXXXXXXXTYA 291
                 RGR E P+   T    +AST     GRG  T  I                   A
Sbjct: 226 PGFGLDRGRAEGPNLGFTVGRGRASTI----GRGSSTSLIG------------------A 263

Query: 292 GAPLDKVESGHDDPGHFRYNRTKLLDLYRV----TDVGTNRKQVDEFVQVPNLTQDEPVE 347
           G+ L  V         FRY R KLLD+YR     + +G    ++DE   V ++TQ   +E
Sbjct: 264 GSALSPV---------FRYPRGKLLDMYRKQKPDSSLGRILTEMDE---VASITQVALIE 311

Query: 348 PLALFAPTSEELTVLNGIDKGEIISS 373
           PLA  AP +EE   LNGI KG IISS
Sbjct: 312 PLAFIAPDAEEEANLNGIWKGRIISS 337



 Score = 76.6 bits (187), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 61/111 (54%), Gaps = 13/111 (11%)

Query: 525 PEELSLFYKDPNGQIQGPFKGIDIIGWFEAGYFGIDLPVRLENSAADSPWLSLGDAMPHL 584
           PEE    Y DP G IQGPF G DII WFE G+FG DL VRL ++   +P+  LG  M ++
Sbjct: 474 PEEFLFLYIDPQGVIQGPFIGSDIISWFEQGFFGTDLQVRLASAPEGTPFQDLGRVMSYI 533

Query: 585 RAKARPPPGFSAPKPNEFTDISVRQNSSTFG-----------NTLTGLSES 624
           +A++       + + +E  + S++ NS   G           ++LTG+S S
Sbjct: 534 KAES--VHAHISDQKSELEETSLKANSEAGGSVAHVAESNDSSSLTGISRS 582


>AT1G24300.1 | Symbols:  | GYF domain-containing protein |
           chr1:8614504-8620409 REVERSE
          Length = 1417

 Score = 75.9 bits (185), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 32/64 (50%), Positives = 41/64 (64%)

Query: 525 PEELSLFYKDPNGQIQGPFKGIDIIGWFEAGYFGIDLPVRLENSAADSPWLSLGDAMPHL 584
           PE+    Y DP G IQGPF G DII WFE G+FG DL VRL N+   +P+  LG  M +L
Sbjct: 396 PEDFLFLYIDPQGVIQGPFIGSDIISWFEQGFFGTDLQVRLANAPEGTPFQDLGRVMSYL 455

Query: 585 RAKA 588
           + ++
Sbjct: 456 KTES 459



 Score = 61.6 bits (148), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 73/139 (52%), Gaps = 13/139 (9%)

Query: 131 DSRKVDRWADSLPSKNLGEARRGASDSHRRNDSGNRETNFDQRRESKWNTRWGPDNKEPE 190
           D RK +R  D++ S+  GE +  A+ S R ND  +R    + RR++KW++RWGPD+KE E
Sbjct: 108 DRRKTERRIDNVSSRETGEVKTTAA-SDRWNDVNSRAAVHEPRRDNKWSSRWGPDDKEKE 166

Query: 191 GPREKWSDSAKDGDIHLDKGLSHISNL-GKDEKEGDHYRPWRPNYSQSRGRVEPPHSQNT 249
              EK  +  KD +    +  S +SN+    E++ D    WRP     R R+E   SQ+ 
Sbjct: 167 ARCEK-VEINKDKEEPQSESQSVVSNVRATSERDSDPRDKWRP-----RHRME---SQSG 217

Query: 250 TPN--KQASTFSYGRGRGE 266
            P   + A  F   RGR E
Sbjct: 218 VPTSYRTAPGFGLDRGRAE 236