Miyakogusa Predicted Gene
- chr3.CM1570.200.nc
BLASTP 2.2.18 [Mar-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= chr3.CM1570.200.nc - phase: 0
(744 letters)
Database: TAIR8_pep
32,825 sequences; 13,166,001 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G45620.1 | Symbols: | nucleotidyltransferase family protein ... 619 e-177
AT3G45750.1 | Symbols: | similar to unknown protein [Arabidopsi... 112 9e-25
AT2G39740.1 | Symbols: | similar to unknown protein [Arabidopsi... 84 2e-16
AT5G53770.1 | Symbols: | nucleotidyltransferase family protein ... 77 6e-14
AT3G45760.1 | Symbols: | similar to unknown protein [Arabidopsi... 49 2e-05
>AT2G45620.1 | Symbols: | nucleotidyltransferase family protein |
chr2:18800017-18802824 FORWARD
Length = 764
Score = 619 bits (1595), Expect = e-177, Method: Compositional matrix adjust.
Identities = 378/797 (47%), Positives = 461/797 (57%), Gaps = 98/797 (12%)
Query: 1 MHGGGGDFPSPQPP-NSGEYLLSLIXXXXXXXXXXXXXXXXXXXXXXAIDPAVAFMGPSI 59
M GG + P+P N+GE+LLS++ A+DPA+A +GP++
Sbjct: 1 MADGGAEPPAPPSSINAGEFLLSILHGSPSPSSQGPQHHQSF-----ALDPAIAAIGPTV 55
Query: 60 --PVAASPWQSNGLDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFFGLPHN------PF 111
P S WQSNG F PHN F
Sbjct: 56 NNPFPPSNWQSNG-------------------HRPSNHNPPSWPLAFSPPHNLSPNFLGF 96
Query: 112 PQPRPTGNHYPAAAAQLHYNSGAALSDDLRRLGFPIEGN------------DKSTFVQQQ 159
PQ P+ P Q N + +D RLGFP N + Q +
Sbjct: 97 PQFPPS----PFTTNQFDGNQRVS-PEDAYRLGFPGTTNPAIQSMVQQQQQQQLPPPQSE 151
Query: 160 ELKLKFGSLP-------------SVSYASSPE---------VPSNGDSLPNLKFDNGFDR 197
KL FGS ++ Y S+ SN + PNL D
Sbjct: 152 TRKLVFGSFSGDATQSLNGLHNGNLKYDSNQHEQLMRHPQSTLSNSNMDPNLSHHRNHDL 211
Query: 198 NLHVDPKSGPNNHGVVGGYRVLGSAPETTRXXXXXGFGNKSRGTGYWGSGTTRKGSEVGE 257
+ SG N G +G G ++T GF + RG W K + G
Sbjct: 212 HEQRGGHSGRGNWGHIGNN---GRGLKSTPPPPPPGFSSNQRG---WDMSLGSKDDDRGM 265
Query: 258 DRGLAVGSGEFG-ARNENLHSKKESGRM-GSGGRSNTRGNVAREVGLPDQIDXXXXXXXX 315
R GE N+++ E+ R+ G ++ ++ N+++ QID
Sbjct: 266 GRNHDQAMGEHSKVWNQSVDFSAEANRLRGLSIQNESKFNLSQ------QIDHPGPPKGA 319
Query: 316 XXXXXXXXXIEESRSSLNRVGV------VEDGVSDKHMGVGSRGGADVDLLGEQIVESLL 369
+S S LN+ E G K G+ +++ GE IV+SLL
Sbjct: 320 SLHSVSAADAADSFSMLNKEARRGGERREELGQLSKAKREGNANSDEIEDFGEDIVKSLL 379
Query: 370 LEDESDDK--NNNSKQRRTPREKDARLLDSRGEQMLSQRGRMYKRQMMCRRDIDSFNGSF 427
LEDE+ +K N+ K +T REK++R+ D+RG+++L Q+ RM K M CR DI ++ +F
Sbjct: 380 LEDETGEKDANDGKKDSKTSREKESRV-DNRGQRLLGQKARMVKMYMACRNDIHRYDATF 438
Query: 428 LAIYESLIPPEEEKLKQKQLLGVLENLVSKEWPTSKLYLYGSCANSFGVSKSDIDVCLAI 487
+AIY+SLIP EEE KQ+QL+ LENLV+KEWP +KLYLYGSCANSFG KSDIDVCLAI
Sbjct: 439 IAIYKSLIPAEEELEKQRQLMAHLENLVAKEWPHAKLYLYGSCANSFGFPKSDIDVCLAI 498
Query: 488 KEAED--KSKIIMKLADILQSDNLQNVQALTRARVPIVKLMDPATGISCDICVNNILAVV 545
E +D KS++++KLA+IL+SDNLQNVQALTRARVPIVKLMDP TGISCDIC+NN+LAVV
Sbjct: 499 -EGDDINKSEMLLKLAEILESDNLQNVQALTRARVPIVKLMDPVTGISCDICINNVLAVV 557
Query: 546 NTKLLRDYGLIDARLRQLAFIIKHWAKSRGVNETYHGTLSSYAYVLMCIHFLQLRRPAIL 605
NTKLLRDY ID RLRQLAFI+KHWAKSR VNETY GTLSSYAYVLMCIHFLQ RRP IL
Sbjct: 558 NTKLLRDYAQIDVRLRQLAFIVKHWAKSRRVNETYQGTLSSYAYVLMCIHFLQQRRPPIL 617
Query: 606 PCLQEMESTYSVTVDDTYCSYFDQVDRLCNFGRNNKETIARLVWGFFYYWAYCHDYANTV 665
PCLQEME TYSV VD+ C+YFD VDRL NFG NN+ETIA LVWGFF YWAY HDYA V
Sbjct: 618 PCLQEMEPTYSVRVDNIRCTYFDNVDRLRNFGSNNRETIAELVWGFFNYWAYAHDYAYNV 677
Query: 666 ISVRTGSILSKREKDWTRRIGNDRHLICIEDPFETSHDLGRVVDKRSIKVLREEFERAAD 725
+SVRTGSIL KREKDWTRR+GNDRHLICIEDPFETSHDLGRVVDK SI+VLREEFERAA
Sbjct: 678 VSVRTGSILGKREKDWTRRVGNDRHLICIEDPFETSHDLGRVVDKFSIRVLREEFERAAR 737
Query: 726 IMQNDPNPCIKLFEPYV 742
IM DPNPC KL EPY+
Sbjct: 738 IMHQDPNPCAKLLEPYI 754
>AT3G45750.1 | Symbols: | similar to unknown protein [Arabidopsis
thaliana] (TAIR:AT3G45760.1); similar to hypothetical
protein OsJ_027139 [Oryza sativa (japonica
cultivar-group)] (GB:EAZ43656.1); similar to
Os08g0559900 [Oryza sativa (japonica cultivar-group)]
(GB:NP_001062505.1); contains domain SSF81631
(SSF81631); contains domain PTHR12271:SF13
(PTHR12271:SF13); contains domain PTHR12271 (PTHR12271);
contains domain SSF81301 (SSF81301) |
chr3:16804840-16808365 REVERSE
Length = 682
Score = 112 bits (280), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 148/320 (46%), Gaps = 42/320 (13%)
Query: 430 IYESLIPPEEEKLKQKQLLGVLENLV-----SKEWPTSKLYLYGSCANSFGVSKSDIDVC 484
+Y S P + +K+L+ L + E + L YGS S+SD+DV
Sbjct: 50 VYCSFRPVSADYNTRKELVKNLNTMALDIYGKSEESSPVLEAYGSFVMDMYSSQSDLDVS 109
Query: 485 LAIKEA------EDKSKIIMKLADILQS----DNLQNVQALTRARVPIVKLMDPATGISC 534
+ E K +I+ + A L+S ++NV+++ A+VPIVK D TG+ C
Sbjct: 110 INFGNGTSEIPREKKLEILKRFAKKLRSLQGEGQVKNVESIFSAKVPIVKFSDQGTGVEC 169
Query: 535 DICVNNILAVVNTKLLRDYGLIDARLRQLAFIIKHWAKSRGVNETYHGTLSSYAYVLMCI 594
D+ V N ++N++++R ID R ++L ++KHWAK+ VN H TL+S + L+
Sbjct: 170 DLSVENKDGILNSQIVRIISQIDGRFQKLCLLVKHWAKAHEVNSALHRTLNSVSITLLVA 229
Query: 595 HFLQLRRPAILPCLQEMESTYSVTVDDTY--CSYFDQVDRLCNFGRNNKETIARLVWGFF 652
LQ + P ILP +S+ + D + + + N+G+ N+E++ RL FF
Sbjct: 230 LHLQTQNPPILP-------PFSMLLKDGMDPPNVEKRAQKFLNWGQRNQESLGRLFATFF 282
Query: 653 -------YYWAYCHDYANTVISVRTGSILSKREKDWTRRIGNDRHLICIEDPFETSHDLG 705
+ W +SV G +SK+ W +++G I +ED S ++
Sbjct: 283 IKLQSVEFLWR-----QGLCVSVLNGLWISKK---W-KKVGVGS--ISVEDFTNISQNVA 331
Query: 706 RVVDKRSIKVLREEFERAAD 725
R V+ K + R +
Sbjct: 332 RRVNGAGAKKIYSSINRTVE 351
>AT2G39740.1 | Symbols: | similar to unknown protein [Arabidopsis
thaliana] (TAIR:AT3G45750.1); similar to unnamed protein
product [Vitis vinifera] (GB:CAO69145.1); contains
domain SSF81631 (SSF81631); contains domain
PTHR12271:SF13 (PTHR12271:SF13); contains domain
PTHR12271 (PTHR12271); contains domain SSF81301
(SSF81301) | chr2:16583328-16585782 FORWARD
Length = 511
Score = 84.3 bits (207), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 127/280 (45%), Gaps = 32/280 (11%)
Query: 467 YGSCANSFGVSKSDIDVCLAIKEA--------EDKSKIIMKLADILQSDNLQ-NVQALTR 517
+GS ++ D+D+ + + + K ++ L L++ L +Q +
Sbjct: 53 FGSFVSNLFTRWGDLDISVDLFSGSSILFTGKKQKQTLLGHLLRALRASGLWYKLQFVIH 112
Query: 518 ARVPIVKLMDPATGISCDICVNNILAVVNTKLLRDYGLIDARLRQLAFIIKHWAKSRGVN 577
ARVPI+K++ ISCDI ++N+ ++ ++ L ID R R L ++K WAK+ +N
Sbjct: 113 ARVPILKVVSGHQRISCDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNIN 172
Query: 578 ETYHGTLSSYAYVLMCIHFLQLRRPAILPCLQEMESTYSVT----VDDTYCSYFDQVDRL 633
++ GT +SY+ L+ I Q PAILP L+ + +V V T QV
Sbjct: 173 DSKTGTFNSYSLSLLVIFHFQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVT-A 231
Query: 634 CNFGR--------NNKETIARLVWGFFYYWA----YCHDYANTVISVRTGSILSKREKDW 681
N R N+ +++ L+ FF ++ ++ + R +I S W
Sbjct: 232 ANIARFKSERAKSVNRSSLSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISS--NTTW 289
Query: 682 TRRIGNDRHLICIEDPFETSHDLGRVVDKRSIKVLREEFE 721
+ + + +EDPFE + R V +R++ + + F+
Sbjct: 290 LPKT----YSLFVEDPFEQPVNAARSVSRRNLDRIAQVFQ 325
>AT5G53770.1 | Symbols: | nucleotidyltransferase family protein |
chr5:21843959-21847084 FORWARD
Length = 530
Score = 76.6 bits (187), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 66/276 (23%), Positives = 119/276 (43%), Gaps = 39/276 (14%)
Query: 434 LIPPEEEKLKQKQLLGVLENLVSKEWPTSKLYLYGSCANSFGVSKSDIDVCLAIKEAEDK 493
L+P + EK ++ + + +++ WP+ K+ ++GS + SDIDV + +
Sbjct: 132 LLPTQAEKAERDAAVESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSDIDVVILESGLTNP 191
Query: 494 SKIIMKLADILQSDNL-QNVQALTRARVPIVKLMDPATGISCDICVNNILAVVNTKLLRD 552
+ L+ L + +N+ + +ARVPI+K ++ + I+ D+ + + ++D
Sbjct: 192 QLGLRALSRALSQRGIAKNLLVIAKARVPIIKFVEKKSNIAFDLSFDMENGPKAAEFIQD 251
Query: 553 YGLIDARLRQLAFIIKHWAKSRGVNETYHGTLSSYAYVLMCIHFLQLRRPAILPCLQEME 612
LR L I+K + + R +NE Y G + SYA + M I FL+ +
Sbjct: 252 AVSKLPPLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAFLKYLKD---------- 301
Query: 613 STYSVTVDDTYCSYFDQVDRLCNFGRNNKE-TIARLVWGFFYYWAYCHDYANTVISVRT- 670
GR+ E + L+ FF ++ + A+ IS +
Sbjct: 302 ------------------------GRSAPEHNLGVLLVKFFDFYGRKLNTADVGISCKMG 337
Query: 671 GSILSKREKDWTRRIGNDRHLICIEDPFETSHDLGR 706
GS SK K + R LI IEDP +D+G+
Sbjct: 338 GSFFSKYNKGFLNRA--RPSLISIEDPQTPENDIGK 371
>AT3G45760.1 | Symbols: | similar to unknown protein [Arabidopsis
thaliana] (TAIR:AT3G45750.1); similar to hypothetical
protein OsJ_027139 [Oryza sativa (japonica
cultivar-group)] (GB:EAZ43656.1); similar to
Os08g0559900 [Oryza sativa (japonica cultivar-group)]
(GB:NP_001062505.1); contains domain PTHR12271:SF1
(PTHR12271:SF1); contains domain PTHR12271 (PTHR12271);
contains domain SSF81301 (SSF81301) |
chr3:16812981-16815744 REVERSE
Length = 442
Score = 48.5 bits (114), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/279 (23%), Positives = 112/279 (40%), Gaps = 62/279 (22%)
Query: 399 GEQMLSQRG--RMYKRQMMC---RRDIDSF-----NGSFLAIYESLIPPEEEKLKQKQLL 448
GE+ +S +G + K M R +IDS+ + Y S P + +K+L+
Sbjct: 9 GEKNVSSKGIQKKVKNTMSIASKRYNIDSYILLDLDKVLDDAYSSFRPVSADYNTRKELV 68
Query: 449 GVLENLV-----SKEWPTSKLYLYGSCANSFGVSKSDIDVCLAIKEA------EDKSKII 497
L + E + L YGS A + S+ D+DV + E K +I+
Sbjct: 69 KNLNAMAIDIFGKSEESSPVLEAYGSFAMNTFSSQKDLDVSINFSSGTSEFYREKKLEIL 128
Query: 498 MKLADILQSDN----LQNVQALTRARVPIVKLMDPATGISCDICVNNILAVVNTKLLRDY 553
+ A L+S ++NV + ARVPIV+ D TGI CD+ V + ++ ++++R
Sbjct: 129 TRFATKLRSLEGQGFVRNVVPILSARVPIVRFCDQGTGIECDLTVESKDGILTSQIIRII 188
Query: 554 GLIDARLRQLAFIIKHWAKSRGVNETYHGTLSSYAYVLMCIHFLQLRRPAILPCLQEMES 613
ID R ++L + + P ILP +
Sbjct: 189 SQIDDRFQKLCLLTQS--------------------------------PPILPPFSTL-- 214
Query: 614 TYSVTVDDTYCSYFDQVDRLCNFGRNNKETIARLVWGFF 652
+ +D + + N+G+ N+E++ RL FF
Sbjct: 215 -FKDGIDPPIVE--KRTQKFLNWGQRNQESLGRLFATFF 250