Miyakogusa Predicted Gene

chr2.LjT47E17.80.nc

BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= chr2.LjT47E17.80.nc - phase: 0 
         (1339 letters)

Database: TAIR8_pep 
           32,825 sequences; 13,166,001 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G68720.1 | Symbols:  | cytidine/deoxycytidylate deaminase fam...   432   e-121
AT5G28050.2 | Symbols:  | cytidine/deoxycytidylate deaminase fam...    83   1e-15
AT5G28050.1 | Symbols:  | cytidine/deoxycytidylate deaminase fam...    82   2e-15
AT1G48175.1 | Symbols: EMB2191 | EMB2191 (EMBRYO DEFECTIVE 2191)...    82   2e-15
AT3G05300.1 | Symbols:  | cytidine/deoxycytidylate deaminase fam...    65   3e-10

>AT1G68720.1 | Symbols:  | cytidine/deoxycytidylate deaminase family
            protein | chr1:25808210-25812483 FORWARD
          Length = 1307

 Score =  432 bits (1111), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 346/983 (35%), Positives = 473/983 (48%), Gaps = 162/983 (16%)

Query: 464  EENLQISETHIQ--ETSGEHEKFIGSTSTTGKNVINRSSQKYIGNSNIEDSE-------- 513
            EE+++I E H+   ETS +++K          + I   S    GN NIE S+        
Sbjct: 380  EEDMEIHEVHVNDAETSSQNQKLFNEREDYRVHSIRNDS----GNENIESSQHQLKERLE 435

Query: 514  -------RTSNIRMKNVGEKKISLSSVQGVEEQHQKGEMIITQ---AEER-------RRK 556
                   R S +R +     K S S  +G+       E+   Q    EER       RR 
Sbjct: 436  TRYSSEDRVSEMRRRT----KYSSSQEEGINVLQNFPEVTNNQQPLVEERISKQAGTRRT 491

Query: 557  SQQFSEASQAHGNNVEDTSIAKSRTSLKNWQGNLYFSSNARATELQTDKRTTESSVHSKG 616
            ++  SE+S+ H  ++ +T +++    ++N + +    S  ++   Q D     + + +  
Sbjct: 492  TEHISESSEIHDIDIRNTYVSQREDQIRNQEVHAGLVSGLQSERKQQDYHIEHNPLQTTQ 551

Query: 617  YEHVSTSSEGYASDEKQVSSSQRSSEKVRFIPKSKLTTV---VKTRESSSQT-DERIFEL 672
             +  S S   + SD  + +  QR SEK R I +   T V    K  ++ +Q  D R+   
Sbjct: 552  SDRTSVSV-SHTSDAVRYTEIQRKSEK-RLIGQGSTTAVQSDSKVEKNGAQKEDSRLDHA 609

Query: 673  TSEE-----------QRRWNTSREESSFKGSWNRVSVAGESVISAEGDERSSPITLIPSS 721
             S++           Q + +     S      +R  +    ++S E   + S  TLIP S
Sbjct: 610  NSKKDGQTTLGLQSYQSKLSEEASSSQSSLMASRTKLQLVDLVSEE--MQGSETTLIPPS 667

Query: 722  PQM-GGGSTHVESTSGTASPQIFLETLESGSFALYDK-------SGKSPALLSGPYSRHE 773
             Q+    S     T G +  +I   T ESG    ++        + +S   L G ++ HE
Sbjct: 668  SQLVSRRSGQSYRTGGVSIQEISHGTSESGYTTAFEHPRAGASVNSQSAGELMG-FTSHE 726

Query: 774  SDKVYSKPSNTMAPEDALGSAGRLAESSSQFVDEFVERVRHDVTTSETQEMEVTGTNLAF 833
                           DA+GSA RL ++S ++V EFV++ +H V   ET+E       L  
Sbjct: 727  ---------------DAMGSAHRLEQASEKYVGEFVKKAKHGVINPETEEQRAESNQL-- 769

Query: 834  DVEGNRVYSSTQQGTPIYPQSKKHDSSRSSGLPGIKGPADEMWDXXXXXXXXXXXXXXXX 893
                                 K+ DS RSSG  G KGP+DEMW                 
Sbjct: 770  ---------------------KRRDSRRSSGGSGAKGPSDEMW--VTDSAQGTPHPGATE 806

Query: 894  INKETAKPIVNRTGRSLWSIISDVVRLRWGSRPGSSTSAGRSGERNSPNKS-DSETWFSG 952
             N      I  R GRSLW++I+D+ RLRWGSR GS  S+ +   R+SPN+S  S TWFSG
Sbjct: 807  GNAAVGNAIFKRNGRSLWNVIADIARLRWGSRAGSPDSSAKPAGRSSPNESVSSATWFSG 866

Query: 953  QEHEETSKSNVLKETSVSPETMSSDKLIPGIRHPQTEGEVSDTQKLKEKVKHLEVGXXXX 1012
            +EH+ +S  N   +  +  E  S  ++  G   P+++ E   T KLK++ +  E      
Sbjct: 867  REHDGSSDDNTKGDKVLPQEAPSLHQVEVGQTSPRSQSEYPGTTKLKQRSERHEGVVSSP 926

Query: 1013 XXXXXXXXXLAASYASGEENAIL---TEDGKVLKESTSGTKNIELPISLPAR-------- 1061
                     ++   +S   N I+    E+G   +     T   E+P+ LP+R        
Sbjct: 927  SSTILEGGSVSNRMSSTSGNQIVGVDEEEGGNFEFRLPETALTEVPMKLPSRNLIRSPPI 986

Query: 1062 ------------------------------GQ-PIVGEIVNISGPDM------------- 1077
                                          GQ P++    N+  P +             
Sbjct: 987  KESSESSLTEASSDQNFTVGEGRRYPRMDAGQNPLLFPGRNLRSPAVMEPPVPRPRMVSG 1046

Query: 1078 -SGIEPVVEIKDPVAPVQSELSGSERKDGELKQRKFQRNKQVGRDRFDDWEEAYKVELEQ 1136
             S +   VE + P++    E +GS   D  L QRK QRNKQV RD F++WEEAYKVE E+
Sbjct: 1047 SSSLREQVEQQQPLSAKSQEETGSVSADSALIQRKLQRNKQVVRDSFEEWEEAYKVEAER 1106

Query: 1137 RKMDEMFMNEALLEARKAADAWEVPVGAVLVQDGKIIARGCNLVEESRDSTAHAEMICIR 1196
            R +DE+FM EAL+EA+KAAD WEVPVGAVLV DGKIIARG NLVEE RDSTAHAEMICIR
Sbjct: 1107 RTVDEIFMREALVEAKKAADTWEVPVGAVLVHDGKIIARGYNLVEELRDSTAHAEMICIR 1166

Query: 1197 EASKLLHTWRLSETTLYVTLEPCPMCAGAILQARVDTVVWGAPNKLLGADGSWIRLFPDG 1256
            E SK L +WRL++TTLYVTLEPCPMCAGAILQARV+T+VWGAPNKLLGADGSWIRLFP G
Sbjct: 1167 EGSKALRSWRLADTTLYVTLEPCPMCAGAILQARVNTLVWGAPNKLLGADGSWIRLFPGG 1226

Query: 1257 GQNVSEPRDIQPAPVHPFHPNIKIRRGVLATECANEMQQFFQLXXXXXXXXXXXXXXXLA 1316
              N SE  +  P PVHPFHP + IRRGVL +ECA  MQQFFQL                 
Sbjct: 1227 EGNGSEASEKPPPPVHPFHPKMTIRRGVLESECAQTMQQFFQLRRKKKDKNSDPPTPTDH 1286

Query: 1317 VTHHHPSKLLNKIQDMFHVMFCL 1339
              HH P KLLNK+  +    FCL
Sbjct: 1287 HHHHLP-KLLNKMHQVL-PFFCL 1307


>AT5G28050.2 | Symbols:  | cytidine/deoxycytidylate deaminase family
            protein | chr5:10044213-10045488 REVERSE
          Length = 204

 Score = 83.2 bits (204), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 1/100 (1%)

Query: 1140 DEMFMNEALLEARKAADAWEV-PVGAVLVQDGKIIARGCNLVEESRDSTAHAEMICIREA 1198
            D  F+ +A+ EA K  D  +  P GAV+V + +++A   N+V +  D TAHAE+  IREA
Sbjct: 49   DHKFLTQAVEEAYKGVDCGDGGPFGAVIVHNNEVVASCHNMVLKYTDPTAHAEVTAIREA 108

Query: 1199 SKLLHTWRLSETTLYVTLEPCPMCAGAILQARVDTVVWGA 1238
             K L+   LSE  +Y + EPCPMC GAI  +R+  +V+GA
Sbjct: 109  CKKLNKIELSECEIYASCEPCPMCFGAIHLSRLKRLVYGA 148


>AT5G28050.1 | Symbols:  | cytidine/deoxycytidylate deaminase family
            protein | chr5:10044213-10045759 REVERSE
          Length = 185

 Score = 82.4 bits (202), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 1/100 (1%)

Query: 1140 DEMFMNEALLEARKAADAWEV-PVGAVLVQDGKIIARGCNLVEESRDSTAHAEMICIREA 1198
            D  F+ +A+ EA K  D  +  P GAV+V + +++A   N+V +  D TAHAE+  IREA
Sbjct: 30   DHKFLTQAVEEAYKGVDCGDGGPFGAVIVHNNEVVASCHNMVLKYTDPTAHAEVTAIREA 89

Query: 1199 SKLLHTWRLSETTLYVTLEPCPMCAGAILQARVDTVVWGA 1238
             K L+   LSE  +Y + EPCPMC GAI  +R+  +V+GA
Sbjct: 90   CKKLNKIELSECEIYASCEPCPMCFGAIHLSRLKRLVYGA 129


>AT1G48175.1 | Symbols: EMB2191 | EMB2191 (EMBRYO DEFECTIVE 2191);
            catalytic/ hydrolase/ zinc ion binding |
            chr1:17794626-17795735 FORWARD
          Length = 182

 Score = 82.4 bits (202), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 49/122 (40%), Positives = 67/122 (54%), Gaps = 15/122 (12%)

Query: 1143 FMNEALLEARKAADAWEVPVGAVLVQDGKIIARGCNLVEESRDSTAHAEMICIREASKLL 1202
            +M  AL +A+ A +A EVPVG V ++DGK+IA G N   E+R++T HAEM  I    +L+
Sbjct: 12   YMGFALHQAKLALEALEVPVGCVFLEDGKVIASGRNRTNETRNATRHAEMEAI---DQLV 68

Query: 1203 HTW------------RLSETTLYVTLEPCPMCAGAILQARVDTVVWGAPNKLLGADGSWI 1250
              W            + S+  LYVT EPC MCA A+    +  V +G PN   G  GS +
Sbjct: 69   GQWQKDGLSPSQVAEKFSKCVLYVTCEPCIMCASALSFLGIKEVYYGCPNDKFGGCGSIL 128

Query: 1251 RL 1252
             L
Sbjct: 129  SL 130


>AT3G05300.1 | Symbols:  | cytidine/deoxycytidylate deaminase family
            protein | chr3:1508030-1508493 REVERSE
          Length = 113

 Score = 64.7 bits (156), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 1179 LVEESRDSTAHAEMICIREASKLLHTWRLSETTLYVTLEPCPMCAGAILQARVDTVVWGA 1238
            +V + +D TAHAE+I IREA K L+  +LSE  +Y + EPCPMC GAI  +R+  +V+ A
Sbjct: 1    MVFKYKDPTAHAEVIAIREACKKLNEIKLSECEIYASCEPCPMCFGAIHLSRLKRLVYEA 60

Query: 1239 PNKLLGADGSWIRLFPDGGQNV 1260
              +   A G + R+  DG + V
Sbjct: 61   KVEAALAIG-FNRILADGVRGV 81