Miyakogusa Predicted Gene

chr5.CM0911.360.nd

BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= chr5.CM0911.360.nd - phase: 0 
         (1598 letters)

Database: TAIR8_pep 
           32,825 sequences; 13,166,001 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G27430.1 | Symbols:  | GYF domain-containing protein | chr1:9...   375   e-103
AT1G24300.1 | Symbols:  | GYF domain-containing protein | chr1:8...   204   3e-52
AT5G42950.1 | Symbols:  | GYF domain-containing protein | chr5:1...   103   1e-21

>AT1G27430.1 | Symbols:  | GYF domain-containing protein |
           chr1:9521032-9526915 REVERSE
          Length = 1531

 Score =  375 bits (962), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 282/793 (35%), Positives = 400/793 (50%), Gaps = 143/793 (18%)

Query: 1   MGDGKMNLPDDLFSSKPSDFHSSLLKDEAFGGHGGEKGIAALLXXXXXXXXXXXXIPLSP 60
           M +GK +LPDDL  SK SD    L  D +                          IPLSP
Sbjct: 1   MAEGKFDLPDDLIFSKSSDQLKELASDNS--------------------------IPLSP 34

Query: 61  QWLYSK----PVDVKT-TANPVGVNSTDPILKDSWRLEGSQDKKDWRRAAPDVDISXXXX 115
           QWLY+K     +DV++ T  P+G N +DP  KD+WRL+  +DKKDW++   + + S    
Sbjct: 35  QWLYTKSSEYKMDVRSPTPVPMG-NPSDPNPKDAWRLDAPEDKKDWKKIVHENETSRRWR 93

Query: 116 XXXXXTSLLGXXXXXXXXXXXXXTSTSENRS------LPADRWHD--SRGSVHDSRRENK 167
                T LLG              S S   +        +DRW+D  SR +VH+ RR+NK
Sbjct: 94  EEERETGLLGARKVDRRKTERRIDSVSSRETGDIKNAAASDRWNDVNSRAAVHEPRRDNK 153

Query: 168 WSSRWGPDDKEKDSRSEKRN-DVGKEDGHTEKQSSVASNRTGADRDTDSRDKWRPRHRVE 226
           WSSRWGPDDKEK++R EK + +  KE+  +E QS V++ R  ++RD+D+RDKWRPRHR+E
Sbjct: 154 WSSRWGPDDKEKEARCEKVDINKDKEEPQSESQSVVSNVRATSERDSDTRDKWRPRHRME 213

Query: 227 AQTAGVATYRAAPGFGLEKGRIEGSNVRFSPGRGRANFNENLQIGRPPLGSSAGSSLVDK 286
           +Q+ G ++YRAAPGFGL++GR EG N+ F+ GRGRA+      IGR   GSS        
Sbjct: 214 SQSGGPSSYRAAPGFGLDRGRAEGPNLGFTVGRGRAS-----TIGR---GSST------- 258

Query: 287 NKTILGKSSLGADSYYYPRGKILDIYRKQKVDPTFESMPSEMEHTSPITQLSSVEPLAFV 346
             +++G  S  +  + YPRGK+LD+YRKQK D +   + +EM+  + ITQ++ +EPLAF+
Sbjct: 259 --SLIGAGSALSPVFRYPRGKLLDMYRKQKPDSSLGRILTEMDEVASITQVALIEPLAFI 316

Query: 347 APAVEEEGVLKDIWKGKITSSEV---SGYSVRGKDGGLNEDISGLGVTLSEGKQLTIGSG 403
           AP  EEE  L  IWKG+I SSEV   SG    G +  L   I   G T  +G  L   +G
Sbjct: 317 APDAEEEANLNGIWKGRIISSEVYTSSGEESLGGNSLLKCRIPESGETKVDGALLGFMNG 376

Query: 404 EKVISRMNIQNESEQIFIGSASTADGSSKNVVKEVATSQEIKQKHMPSLGVYEKDEISGN 463
           +                       +GS KN    +  S      H   LG          
Sbjct: 377 D-----------------------NGSMKNNDSGLLGS------HNGGLGA--------- 398

Query: 464 NTREGSIPRIKVAESETFDYHQGQLSAFKEHATQDGVESIGASAISSNLPDDARSLFDFS 523
                S+PR+    SE+  Y  G       H + + V S+      S++ D + S+    
Sbjct: 399 ---ASSVPRLNSVASES--YGSGGAGYQLSHGSPEAVRSV---FTKSSVLDGSESVVGSF 450

Query: 524 SLQQNASVNPQDLKLNEKMYAL--EELSLCYLDPQGMVQGPFLGIDIIMWFEQGFFGLDL 581
                  +   D +++    A+  EE    Y+DPQG++QGPF+G DII WFEQGFFG DL
Sbjct: 451 EQAYTGKLQQPDTEVDHSEGAMPPEEFLFLYIDPQGVIQGPFIGSDIISWFEQGFFGTDL 510

Query: 582 PVRLFEAPEGSPFHELGDVMPHLKVKTGLDSGSNLVNQSEPSDANERNLKVDVHTFDYGS 641
            VRL  APEG+PF +LG VM ++K ++     +++ +Q   S+  E +LK +      GS
Sbjct: 511 QVRLASAPEGTPFQDLGRVMSYIKAES---VHAHISDQK--SELEETSLKANSEA--GGS 563

Query: 642 DDQPWSSSRPDTTSNVGISSQMS-----------NQSYHSEIK----FSDEQRFNNIVAQ 686
                 S+  D++S  GIS   S            +   SE+      +++Q F +  AQ
Sbjct: 564 VAHVAESN--DSSSLTGISRSFSVYNNPSGQDNFQRKSESEVYGRPPHAEDQSFLDFSAQ 621

Query: 687 DEDATF------SNLAGSSNDNPLMRPVGANASYSHPTGRPIANEITGSDTQNSEADKLH 740
           DE+  F      S  A S   +  M    A   +S  +  P+  E+T + T+N   +KLH
Sbjct: 622 DEEIVFPGRARVSGYASSVKSSTSMH--DALMEFSGHSDIPV--EVTTAATRNQNENKLH 677

Query: 741 PFGLLMSELRDGS 753
           PFG+L SEL  GS
Sbjct: 678 PFGVLWSELEGGS 690



 Score = 74.7 bits (182), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 96/353 (27%), Positives = 156/353 (44%), Gaps = 57/353 (16%)

Query: 956  DANFGQSKHDLSRENLLDQVQL-RRYLHDMQQNS-HSLRNLDPSMEQIIQANMGLNAVQG 1013
            D  FGQS HD  R N +DQ+ L ++ L+++Q++S H  +N  P +EQ    N G    +G
Sbjct: 851  DTRFGQS-HDFPRSNSVDQMLLEQQMLNELQKSSGHPSQNFAPYIEQHAAGNFGRFTHEG 909

Query: 1014 RQADLSDLL------------------LQARHGNIL--PSEXXXXXXXXXXXXXXXXXXX 1053
             Q +L + L                  +Q++HG +   P                     
Sbjct: 910  HQRELLEQLFSTQMQSQYGQKQSQYGQMQSQHGQLQSEPIRSLEYQLLQQEQLMQLANGV 969

Query: 1054 XXXXGMDGERHFGRSWPINETGQLVR-NPSSHQLGHSAGFNVSDIHKQQQRLVAQEEQLN 1112
                 ++ +RH    WP + + QL+R +P  H+   SAGF   D H+QQQR    E+Q +
Sbjct: 970  RHNTLLEEQRHIDPLWPSDHSDQLLRTHPGIHRSHSSAGFRPLDFHQQQQR-PHFEDQFS 1028

Query: 1113 YLGRNHLEQNQ-RGFYDPSSMMFERSSPGS----------VQGRELLERRRYMHPAEQLG 1161
             L RN   Q Q R       + FERS+ G            QG EL +   +M  + +LG
Sbjct: 1029 QLERNRSYQQQLRLELLEHGLPFERSASGLNLDAVNGLGLSQGLELRDATAHMQSSGRLG 1088

Query: 1162 ---PVSSHH---LQSSDDLFGH-----HSLSGNNGHVENNWIDPRVQ------QHLEAVR 1204
               P  SH    +   +  F H        SG +  +  +W + + +      +H +   
Sbjct: 1089 NSTPGFSHQNPRIPLGESHFSHLEPTEGRWSGADTQLAGDWAESQFRRSNMDTEHDKMRS 1148

Query: 1205 QRRDLGDTIASADLNIPSAGAHEESSARGFMDLLHQKLGLQSSQSSNVDKWHP 1257
            + R LG+   S  +     G+ ++ S + FM+LLHQ+ G QS++S N+++ +P
Sbjct: 1149 EIRRLGEDPNSWMV----GGSTDDKSKQLFMELLHQRPGHQSAESPNMNRGYP 1197


>AT1G24300.1 | Symbols:  | GYF domain-containing protein |
           chr1:8614504-8620409 REVERSE
          Length = 1417

 Score =  204 bits (520), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 130/322 (40%), Positives = 176/322 (54%), Gaps = 58/322 (18%)

Query: 1   MGDGKMNLPDDLFSSKPSDFHSSLLKDEAFGGHGGEKGIAALLXXXXXXXXXXXXIPLSP 60
           M +GK +LPDDL  SK SD    L  D +                          IPLSP
Sbjct: 1   MAEGKFDLPDDLILSKSSDQLKELASDNS--------------------------IPLSP 34

Query: 61  QWLYSK----PVDVKT-TANPVGVNSTDPILKDSWRLEGSQDKKDWRRAAPDVDISXXXX 115
           QWLY+K     +DV++ T  P+G N +DP LKD+WRL+  +DKKDW++   + + +    
Sbjct: 35  QWLYTKSSESKMDVRSPTPMPMG-NPSDPNLKDAWRLDAPEDKKDWKKIVSENETNRRWR 93

Query: 116 XXXXXTSLLGXXXXXXXXXXX-----XXTSTSENRSLPA-DRWHD--SRGSVHDSRRENK 167
                T LLG                    T E ++  A DRW+D  SR +VH+ RR+NK
Sbjct: 94  EEERETGLLGARKVDRRKTERRIDNVSSRETGEVKTTAASDRWNDVNSRAAVHEPRRDNK 153

Query: 168 WSSRWGPDDKEKDSRSEKRN-DVGKEDGHTEKQSSVASNRTGADRDTDSRDKWRPRHRVE 226
           WSSRWGPDDKEK++R EK   +  KE+  +E QS V++ R  ++RD+D RDKWRPRHR+E
Sbjct: 154 WSSRWGPDDKEKEARCEKVEINKDKEEPQSESQSVVSNVRATSERDSDPRDKWRPRHRME 213

Query: 227 AQTAGVATYRAAPGFGLEKGRIEGSNVRFSPGRGRANFNENLQIGRPPLGSSAGSSLVDK 286
           +Q+    +YR APGFGL++GR EG N+ F+ GRGRA+      IGR   GSS        
Sbjct: 214 SQSGVPTSYRTAPGFGLDRGRAEGPNLGFTVGRGRAS-----TIGR---GSST------- 258

Query: 287 NKTILGKSSLGADSYYYPRGKI 308
             +++G  S  A  + YPR K 
Sbjct: 259 --SLIGAGSASAPVFRYPRVKC 278



 Score =  125 bits (313), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 124/408 (30%), Positives = 183/408 (44%), Gaps = 65/408 (15%)

Query: 375 RGKDGGLNEDISGLGVTLSEGKQLTIGSGEKVISRMNIQNESEQIFIGSASTADGSSKNV 434
           RG+  G N     LG T+  G+  TIG G          + +  I  GSAS        V
Sbjct: 232 RGRAEGPN-----LGFTVGRGRASTIGRG----------SSTSLIGAGSASAPVFRYPRV 276

Query: 435 VKEVATSQEIKQKHMPSLGVYEKDEISGNNTREG-------------SIPRIKVAESETF 481
              +  S E K      LG    D  S  N   G             S+PR+    SE++
Sbjct: 277 KCRIPESGETKVDGA-LLGFMNGDNGSMKNNDSGLLGSHNGGLGAASSVPRLNSVASESY 335

Query: 482 DYHQGQLSAFKE--HATQDGVESIGASAISSNLPDDARSLFDFSSLQQNASVNPQDLKLN 539
               G   A  +  H + + V S+      S + D + S+           +   D++++
Sbjct: 336 ----GSFGAGYQVSHGSPEAVRSV---FTKSPVLDGSESVVGSFEQDYMGKLQQPDVEVD 388

Query: 540 EKMYAL--EELSLCYLDPQGMVQGPFLGIDIIMWFEQGFFGLDLPVRLFEAPEGSPFHEL 597
           +   A+  E+    Y+DPQG++QGPF+G DII WFEQGFFG DL VRL  APEG+PF +L
Sbjct: 389 QSEAAMPPEDFLFLYIDPQGVIQGPFIGSDIISWFEQGFFGTDLQVRLANAPEGTPFQDL 448

Query: 598 GDVMPHLKVKTGLDSGSNLVNQSEPSDANERNLKVDVHTFDYGSDDQPWSSSRPDTTSNV 657
           G VM +LK ++   + +++ NQ   S+  E  LK +    D G    P + S  D++S  
Sbjct: 449 GRVMSYLKTES---AHAHISNQE--SELEETRLKANS---DTGLSIAPVAESN-DSSSMN 499

Query: 658 GISSQMS---------NQSYHSEIKF------SDEQRFNNIVAQDEDATFSNLAGSSNDN 702
           G S   S         N    SE +F      ++++ F +   QDE+  F   AG S   
Sbjct: 500 GTSRSFSVYNNPSAQDNFQRKSESEFYATPPHTEDRSFLDFSTQDEEIVFPGRAGVSGYA 559

Query: 703 PLMRPVGANASYSHPTGR-PIANEITGSDTQNSEADKLHPFGLLMSEL 749
            +      + ++   +G+  I  E T + TQ    +KLHPFG+L SEL
Sbjct: 560 SVKSSTSMHDAFMEVSGQSAIPVESTKAATQKQHENKLHPFGVLWSEL 607



 Score = 70.5 bits (171), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 94/347 (27%), Positives = 151/347 (43%), Gaps = 56/347 (16%)

Query: 956  DANFGQSKHDLSRENLLDQVQLR-RYLHDMQQNS-HSLRNLDPSMEQIIQANMGLNAVQG 1013
            D+ FGQS HD  R N +DQ+ L  + ++++Q++S H  +N  P +EQ+   N G    +G
Sbjct: 782  DSRFGQS-HDFPRSNNVDQMLLEHQLMNELQKSSGHPSQNFAPYIEQLAAGNFGQLPHEG 840

Query: 1014 RQADLSDLLL----QARHGNIL---------PSEXXXXXXXXXXXXXXXXXXXXXXXGMD 1060
             Q +L + LL    Q+++G +          P+                         ++
Sbjct: 841  HQRELLEQLLSTKMQSQYGPMQSPYGQLQSEPTRSLEYQLLQQEQLMQLANGVRHNTLLE 900

Query: 1061 GERHFGRSWPINETGQLVRN-PSSHQLGHSAGFNVSDIHKQQQRLVAQEEQLNYLGRNHL 1119
             +RH    WP +   QL+R+ P   +   S GF   D H+QQQR    E+Q   L RN L
Sbjct: 901  EQRHIDPLWPSDHNDQLLRSHPGIQRSRSSTGFRQLDFHQQQQR-PPFEDQFGQLERNLL 959

Query: 1120 EQNQ-RGFYDPSSMMFERSS--PGSV--------------QGRELLERRRYMHPAEQ-LG 1161
             Q Q R       + FERS+  P SV              QG EL +   +M      LG
Sbjct: 960  YQQQLRQELFEQGLPFERSASLPVSVSGMNLDPVNGLGLSQGLELRDATTHMQIGNSTLG 1019

Query: 1162 --------PVSSHHLQSSDDLFGHHSLSGNNGHVENNWIDPRVQ------QHLEAVRQRR 1207
                    P+   H    + + G    SG +  V  +W + ++       +H +   + R
Sbjct: 1020 FNHQNPRIPIGEPHFSQLESMEGR--WSGADTQVVGDWAESQLHRSNIDAEHHKMRSESR 1077

Query: 1208 DLGDTIASADLNIPSAGAHEESSARGFMDLLHQKLGLQSSQSSNVDK 1254
             +G+   S  L     G  E+ S + FM+LLHQ+ G QS++S ++++
Sbjct: 1078 RMGEDSNSWML----GGTTEDRSKQLFMELLHQRPGHQSAESPSMNR 1120


>AT5G42950.1 | Symbols:  | GYF domain-containing protein |
           chr5:17241664-17248272 FORWARD
          Length = 1714

 Score =  103 bits (256), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 49/80 (61%), Positives = 56/80 (70%)

Query: 546 EELSLCYLDPQGMVQGPFLGIDIIMWFEQGFFGLDLPVRLFEAPEGSPFHELGDVMPHLK 605
           EELSL Y DPQG++QGPF G DII WFE G+FG+DL VRL  AP  SPF  LGDVMPHL+
Sbjct: 545 EELSLYYKDPQGLIQGPFSGSDIIGWFEAGYFGIDLLVRLASAPNDSPFSLLGDVMPHLR 604

Query: 606 VKTGLDSGSNLVNQSEPSDA 625
            K+G   G     Q+E  DA
Sbjct: 605 AKSGPPPGFTGAKQNEFVDA 624



 Score = 95.5 bits (236), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 108/353 (30%), Positives = 150/353 (42%), Gaps = 44/353 (12%)

Query: 56  IPLSPQWLYSKPVDVKT---TANPVGVNSTDPILKDSWRLEGSQD---KKD-WRRAAPDV 108
           IPLSPQWL SKP + KT   T +P    +   +++ +   E + D   KKD +R +  D 
Sbjct: 36  IPLSPQWLLSKPGENKTGMGTGDPNQYGNHSDVVRTTGNGEETLDNLKKKDVFRPSLLDA 95

Query: 109 DISXXXXXXXXXTSLLGXXXXXXXXXXXXXTSTS--------------ENRSLPADRWHD 154
           +              L              +  +              E R  P DRW D
Sbjct: 96  ESGRRDRWRDEERDTLSSVRNDRWRNGDKDSGDNKKVDRWDNVAPKFGEQRRGPNDRWTD 155

Query: 155 S--RGSVHDSRRENKWSSRWGPDDKEKDSRSEKRNDVGKEDGHT--EKQSSVASNRTGAD 210
           S  + +  + RRE+KW+SRWGPDDKE +    K ++ GK DG    EK  S+ ++     
Sbjct: 156 SGNKDAAPEQRRESKWNSRWGPDDKEAEIPRNKWDEPGK-DGEIIREKGPSLPTS----- 209

Query: 211 RDTDSRDKWRP---RHRVEAQTAGVATYRAAPGFGLEKGRIEGSNVRFSPGRGRANFNEN 267
            D D    WRP   R R EA        +    F   +GR E + + FS GRGR +   +
Sbjct: 210 -DGDHYRPWRPSQGRGRGEALHNQSTPNKQVTSFSHSRGRGENTAI-FSAGRGRMSPGGS 267

Query: 268 LQIGRPPLGSSAGSSLVDKNKTILGKSSLGADSYY-YPRGKILDIYRKQKVDPTFESMPS 326
           +    P      GS+  DK     G+S  G   +  Y R K+LD+YR    +  +E  P 
Sbjct: 268 IFTSAPNQSHPPGSA-SDK-----GESGPGEPPHLRYSRMKLLDVYRMADTE-CYEKFPD 320

Query: 327 EMEHTSPITQLSSVEPLAFVAPAVEEEGVLKDIWKGKITSSEVSGYSVRGKDG 379
                  +T     +PLA  AP+ +E  VL  I KGKI SS     S  G  G
Sbjct: 321 GFIEVPSLTSEEPTDPLALCAPSSDEVNVLDAIEKGKIVSSGAPQTSKDGPTG 373