Miyakogusa Predicted Gene
- Lj2g3v1192960.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1192960.1 Non Chatacterized Hit- tr|I1JEQ9|I1JEQ9_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,60.34,0,seg,NULL;
Cytidine deaminase-like,Cytidine deaminase-like;
dCMP_cyt_deam_1,CMP/dCMP deaminase, zinc-,CUFF.36441.1
(1339 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G68720.1 | Symbols: TADA, ATTADA | tRNA arginine adenosine de... 432 e-121
AT5G28050.2 | Symbols: | Cytidine/deoxycytidylate deaminase fam... 83 1e-15
AT5G28050.1 | Symbols: | Cytidine/deoxycytidylate deaminase fam... 82 2e-15
AT1G48175.1 | Symbols: emb2191 | Cytidine/deoxycytidylate deamin... 82 2e-15
AT3G05300.1 | Symbols: | Cytidine/deoxycytidylate deaminase fam... 65 4e-10
>AT1G68720.1 | Symbols: TADA, ATTADA | tRNA arginine adenosine
deaminase | chr1:25804547-25808820 FORWARD LENGTH=1307
Length = 1307
Score = 432 bits (1111), Expect = e-121, Method: Compositional matrix adjust.
Identities = 346/983 (35%), Positives = 473/983 (48%), Gaps = 162/983 (16%)
Query: 464 EENLQISETHIQ--ETSGEHEKFIGSTSTTGKNVINRSSQKYIGNSNIEDSE-------- 513
EE+++I E H+ ETS +++K + I S GN NIE S+
Sbjct: 380 EEDMEIHEVHVNDAETSSQNQKLFNEREDYRVHSIRNDS----GNENIESSQHQLKERLE 435
Query: 514 -------RTSNIRMKNVGEKKISLSSVQGVEEQHQKGEMIITQ---AEER-------RRK 556
R S +R + K S S +G+ E+ Q EER RR
Sbjct: 436 TRYSSEDRVSEMRRRT----KYSSSQEEGINVLQNFPEVTNNQQPLVEERISKQAGTRRT 491
Query: 557 SQQFSEASQAHGNNVEDTSIAKSRTSLKNWQGNLYFSSNARATELQTDKRTTESSVHSKG 616
++ SE+S+ H ++ +T +++ ++N + + S ++ Q D + + +
Sbjct: 492 TEHISESSEIHDIDIRNTYVSQREDQIRNQEVHAGLVSGLQSERKQQDYHIEHNPLQTTQ 551
Query: 617 YEHVSTSSEGYASDEKQVSSSQRSSEKVRFIPKSKLTTV---VKTRESSSQT-DERIFEL 672
+ S S + SD + + QR SEK R I + T V K ++ +Q D R+
Sbjct: 552 SDRTSVSV-SHTSDAVRYTEIQRKSEK-RLIGQGSTTAVQSDSKVEKNGAQKEDSRLDHA 609
Query: 673 TSEE-----------QRRWNTSREESSFKGSWNRVSVAGESVISAEGDERSSPITLIPSS 721
S++ Q + + S +R + ++S E + S TLIP S
Sbjct: 610 NSKKDGQTTLGLQSYQSKLSEEASSSQSSLMASRTKLQLVDLVSEE--MQGSETTLIPPS 667
Query: 722 PQM-GGGSTHVESTSGTASPQIFLETLESGSFALYDK-------SGKSPALLSGPYSRHE 773
Q+ S T G + +I T ESG ++ + +S L G ++ HE
Sbjct: 668 SQLVSRRSGQSYRTGGVSIQEISHGTSESGYTTAFEHPRAGASVNSQSAGELMG-FTSHE 726
Query: 774 SDKVYSKPSNTMAPEDALGSAGRLAESSSQFVDEFVERVRHDVTTSETQEMEVTGTNLAF 833
DA+GSA RL ++S ++V EFV++ +H V ET+E L
Sbjct: 727 ---------------DAMGSAHRLEQASEKYVGEFVKKAKHGVINPETEEQRAESNQL-- 769
Query: 834 DVEGNRVYSSTQQGTPIYPQSKKHDSSRSSGLPGIKGPADEMWDXXXXXXXXXXXXXXXX 893
K+ DS RSSG G KGP+DEMW
Sbjct: 770 ---------------------KRRDSRRSSGGSGAKGPSDEMW--VTDSAQGTPHPGATE 806
Query: 894 INKETAKPIVNRTGRSLWSIISDVVRLRWGSRPGSSTSAGRSGERNSPNKS-DSETWFSG 952
N I R GRSLW++I+D+ RLRWGSR GS S+ + R+SPN+S S TWFSG
Sbjct: 807 GNAAVGNAIFKRNGRSLWNVIADIARLRWGSRAGSPDSSAKPAGRSSPNESVSSATWFSG 866
Query: 953 QEHEETSKSNVLKETSVSPETMSSDKLIPGIRHPQTEGEVSDTQKLKEKVKHLEVGXXXX 1012
+EH+ +S N + + E S ++ G P+++ E T KLK++ + E
Sbjct: 867 REHDGSSDDNTKGDKVLPQEAPSLHQVEVGQTSPRSQSEYPGTTKLKQRSERHEGVVSSP 926
Query: 1013 XXXXXXXXXLAASYASGEENAIL---TEDGKVLKESTSGTKNIELPISLPAR-------- 1061
++ +S N I+ E+G + T E+P+ LP+R
Sbjct: 927 SSTILEGGSVSNRMSSTSGNQIVGVDEEEGGNFEFRLPETALTEVPMKLPSRNLIRSPPI 986
Query: 1062 ------------------------------GQ-PIVGEIVNISGPDM------------- 1077
GQ P++ N+ P +
Sbjct: 987 KESSESSLTEASSDQNFTVGEGRRYPRMDAGQNPLLFPGRNLRSPAVMEPPVPRPRMVSG 1046
Query: 1078 -SGIEPVVEIKDPVAPVQSELSGSERKDGELKQRKFQRNKQVGRDRFDDWEEAYKVELEQ 1136
S + VE + P++ E +GS D L QRK QRNKQV RD F++WEEAYKVE E+
Sbjct: 1047 SSSLREQVEQQQPLSAKSQEETGSVSADSALIQRKLQRNKQVVRDSFEEWEEAYKVEAER 1106
Query: 1137 RKMDEMFMNEALLEARKAADAWEVPVGAVLVQDGKIIARGCNLVEESRDSTAHAEMICIR 1196
R +DE+FM EAL+EA+KAAD WEVPVGAVLV DGKIIARG NLVEE RDSTAHAEMICIR
Sbjct: 1107 RTVDEIFMREALVEAKKAADTWEVPVGAVLVHDGKIIARGYNLVEELRDSTAHAEMICIR 1166
Query: 1197 EASKLLHTWRLSETTLYVTLEPCPMCAGAILQARVDTVVWGAPNKLLGADGSWIRLFPDG 1256
E SK L +WRL++TTLYVTLEPCPMCAGAILQARV+T+VWGAPNKLLGADGSWIRLFP G
Sbjct: 1167 EGSKALRSWRLADTTLYVTLEPCPMCAGAILQARVNTLVWGAPNKLLGADGSWIRLFPGG 1226
Query: 1257 GQNVSEPRDIQPAPVHPFHPNIKIRRGVLATECANEMQQFFQLXXXXXXXXXXXXXXXLA 1316
N SE + P PVHPFHP + IRRGVL +ECA MQQFFQL
Sbjct: 1227 EGNGSEASEKPPPPVHPFHPKMTIRRGVLESECAQTMQQFFQLRRKKKDKNSDPPTPTDH 1286
Query: 1317 VTHHHPSKLLNKIQDMFHVMFCL 1339
HH P KLLNK+ + FCL
Sbjct: 1287 HHHHLP-KLLNKMHQVL-PFFCL 1307
>AT5G28050.2 | Symbols: | Cytidine/deoxycytidylate deaminase family
protein | chr5:10044209-10045484 REVERSE LENGTH=204
Length = 204
Score = 83.2 bits (204), Expect = 1e-15, Method: Composition-based stats.
Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 1/100 (1%)
Query: 1140 DEMFMNEALLEARKAADAWEV-PVGAVLVQDGKIIARGCNLVEESRDSTAHAEMICIREA 1198
D F+ +A+ EA K D + P GAV+V + +++A N+V + D TAHAE+ IREA
Sbjct: 49 DHKFLTQAVEEAYKGVDCGDGGPFGAVIVHNNEVVASCHNMVLKYTDPTAHAEVTAIREA 108
Query: 1199 SKLLHTWRLSETTLYVTLEPCPMCAGAILQARVDTVVWGA 1238
K L+ LSE +Y + EPCPMC GAI +R+ +V+GA
Sbjct: 109 CKKLNKIELSECEIYASCEPCPMCFGAIHLSRLKRLVYGA 148
>AT5G28050.1 | Symbols: | Cytidine/deoxycytidylate deaminase family
protein | chr5:10044209-10045755 REVERSE LENGTH=185
Length = 185
Score = 82.4 bits (202), Expect = 2e-15, Method: Composition-based stats.
Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 1/100 (1%)
Query: 1140 DEMFMNEALLEARKAADAWEV-PVGAVLVQDGKIIARGCNLVEESRDSTAHAEMICIREA 1198
D F+ +A+ EA K D + P GAV+V + +++A N+V + D TAHAE+ IREA
Sbjct: 30 DHKFLTQAVEEAYKGVDCGDGGPFGAVIVHNNEVVASCHNMVLKYTDPTAHAEVTAIREA 89
Query: 1199 SKLLHTWRLSETTLYVTLEPCPMCAGAILQARVDTVVWGA 1238
K L+ LSE +Y + EPCPMC GAI +R+ +V+GA
Sbjct: 90 CKKLNKIELSECEIYASCEPCPMCFGAIHLSRLKRLVYGA 129
>AT1G48175.1 | Symbols: emb2191 | Cytidine/deoxycytidylate deaminase
family protein | chr1:17790957-17792066 FORWARD
LENGTH=182
Length = 182
Score = 82.4 bits (202), Expect = 2e-15, Method: Composition-based stats.
Identities = 49/122 (40%), Positives = 67/122 (54%), Gaps = 15/122 (12%)
Query: 1143 FMNEALLEARKAADAWEVPVGAVLVQDGKIIARGCNLVEESRDSTAHAEMICIREASKLL 1202
+M AL +A+ A +A EVPVG V ++DGK+IA G N E+R++T HAEM I +L+
Sbjct: 12 YMGFALHQAKLALEALEVPVGCVFLEDGKVIASGRNRTNETRNATRHAEMEAI---DQLV 68
Query: 1203 HTW------------RLSETTLYVTLEPCPMCAGAILQARVDTVVWGAPNKLLGADGSWI 1250
W + S+ LYVT EPC MCA A+ + V +G PN G GS +
Sbjct: 69 GQWQKDGLSPSQVAEKFSKCVLYVTCEPCIMCASALSFLGIKEVYYGCPNDKFGGCGSIL 128
Query: 1251 RL 1252
L
Sbjct: 129 SL 130
>AT3G05300.1 | Symbols: | Cytidine/deoxycytidylate deaminase family
protein | chr3:1508024-1508487 REVERSE LENGTH=113
Length = 113
Score = 64.7 bits (156), Expect = 4e-10, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 1179 LVEESRDSTAHAEMICIREASKLLHTWRLSETTLYVTLEPCPMCAGAILQARVDTVVWGA 1238
+V + +D TAHAE+I IREA K L+ +LSE +Y + EPCPMC GAI +R+ +V+ A
Sbjct: 1 MVFKYKDPTAHAEVIAIREACKKLNEIKLSECEIYASCEPCPMCFGAIHLSRLKRLVYEA 60
Query: 1239 PNKLLGADGSWIRLFPDGGQNV 1260
+ A G + R+ DG + V
Sbjct: 61 KVEAALAIG-FNRILADGVRGV 81