Miyakogusa Predicted Gene
- chr1.CM0029.50.nc
BLASTP 2.2.18 [Mar-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= chr1.CM0029.50.nc + phase: 0
(459 letters)
Database: TAIR8_pep
32,825 sequences; 13,166,001 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G58560.1 | Symbols: | endonuclease/exonuclease/phosphatase f... 625 e-179
AT3G58580.1 | Symbols: | hydrolase | chr3:21671843-21674674 REV... 612 e-175
AT5G11350.1 | Symbols: | endonuclease/exonuclease/phosphatase f... 85 1e-16
AT1G31500.3 | Symbols: | endonuclease/exonuclease/phosphatase f... 84 3e-16
AT1G31500.1 | Symbols: | endonuclease/exonuclease/phosphatase f... 83 4e-16
AT1G31500.2 | Symbols: | endonuclease/exonuclease/phosphatase f... 82 6e-16
AT3G18500.2 | Symbols: | similar to endonuclease/exonuclease/ph... 82 6e-16
AT1G31500.4 | Symbols: | endonuclease/exonuclease/phosphatase f... 75 1e-13
AT1G73875.1 | Symbols: | endonuclease/exonuclease/phosphatase f... 63 4e-10
AT3G18500.1 | Symbols: | similar to endonuclease/exonuclease/ph... 55 1e-07
AT1G31530.1 | Symbols: | endonuclease/exonuclease/phosphatase f... 48 1e-05
>AT3G58560.1 | Symbols: | endonuclease/exonuclease/phosphatase
family protein | chr3:21661857-21664873 REVERSE
Length = 602
Score = 625 bits (1612), Expect = e-179, Method: Compositional matrix adjust.
Identities = 304/426 (71%), Positives = 348/426 (81%), Gaps = 12/426 (2%)
Query: 1 MLSVLRVHLPSDIPIVGCELTPYVLLRRPDKTVTTDDALETAPLDGHFLRYKWYRVQSDK 60
MLSV+RVHLPS+IPIVGCELTPYVLLRRPDKT +TDD E+APL+GHFL+Y+W+RVQSDK
Sbjct: 1 MLSVIRVHLPSEIPIVGCELTPYVLLRRPDKTPSTDDVPESAPLEGHFLKYRWFRVQSDK 60
Query: 61 KVAVCSVHPSEQATLQCLGCVKAKIPVSKSYHCTPKCFSDAWQHHRVLHDRAASAXXXXX 120
KVA+CSVHPSE ATLQCLGC+K+K+PV+KSYHC+ KCFSDAWQHHRVLH+RAASA
Sbjct: 61 KVAICSVHPSETATLQCLGCLKSKVPVAKSYHCSTKCFSDAWQHHRVLHERAASAATEGN 120
Query: 121 XXXXXXXXXXXXAXXXXXXXXXXXXXXXXXXXXXXAPLYPAAVTQRSG--GETWFEVGRS 178
+ +YP+A+TQ++G GET EVGRS
Sbjct: 121 DEEEL-------PRLNSSGSGSGVLSTSVSLTNGSSSVYPSAITQKTGAGGETLVEVGRS 173
Query: 179 KTYTPTADDIGHVLKFECVVVDAETKLPVGHANTILTSRVIPAPSPSPRRLIPVDGM--- 235
KTYTP ADDI HVLKFECVVV+AETK VG + TILTSRVIPAPSPSPRRLI + G
Sbjct: 174 KTYTPMADDICHVLKFECVVVNAETKQNVGLSCTILTSRVIPAPSPSPRRLISISGTDVT 233
Query: 236 GNVDADGRVSSSGTFTVLSYNILSDSYASSDLYNYCPSWALSWPYRRQNLLREIVGYRAD 295
G++D++GR S GTFTVLSYNILSD+YASSD+Y+YCP+WAL+W YRRQNLLREIV YRAD
Sbjct: 234 GHLDSNGRPLSMGTFTVLSYNILSDTYASSDIYSYCPTWALAWTYRRQNLLREIVKYRAD 293
Query: 296 IICLQEVQNDHYEEFFAPELDKHGYNGLYKRKTNEVFNGNINTIDGCATFFRRDRFSHVK 355
I+CLQEVQNDH+EEFF PELDKHGY GL+KRKTNEVF GN NTIDGCATFFRRDRFSHVK
Sbjct: 294 IVCLQEVQNDHFEEFFLPELDKHGYQGLFKRKTNEVFIGNTNTIDGCATFFRRDRFSHVK 353
Query: 356 KYEVEFNKAAQSLTDAMIPTTQKKSALNRLVKDNVALIVVLEAKVNNQPVDNPGKRQLLC 415
KYEVEFNKAAQSLT+A+IP +QKK+ALNRLVKDNVALIVVLEAK +Q DNPGKRQLLC
Sbjct: 354 KYEVEFNKAAQSLTEAIIPVSQKKNALNRLVKDNVALIVVLEAKFGSQAADNPGKRQLLC 413
Query: 416 VVSSFI 421
V ++ +
Sbjct: 414 VANTHV 419
>AT3G58580.1 | Symbols: | hydrolase | chr3:21671843-21674674
REVERSE
Length = 603
Score = 612 bits (1577), Expect = e-175, Method: Compositional matrix adjust.
Identities = 291/424 (68%), Positives = 336/424 (79%), Gaps = 5/424 (1%)
Query: 1 MLSVLRVHLPSDIPIVGCELTPYVLLRRPDKTVTTDDALETAPLDGHFLRYKWYRVQSDK 60
MLSV+RVHLPS+IPIVGCELTPYVL+RRPDK TTDD E+APL+G+FLRY+WYRVQSDK
Sbjct: 1 MLSVIRVHLPSEIPIVGCELTPYVLVRRPDKNSTTDDVPESAPLEGYFLRYRWYRVQSDK 60
Query: 61 KVAVCSVHPSEQATLQCLGCVKAKIPVSKSYHCTPKCFSDAWQHHRVLHDRAASAXXXXX 120
KV +CSVHP+EQATLQC+ C K + V KSYHC+PKCF+DAWQHHR LH+RAA+
Sbjct: 61 KVTICSVHPTEQATLQCVFCSKRRSLVPKSYHCSPKCFTDAWQHHRTLHERAAAENNANE 120
Query: 121 XXXXXXXXXXXXAXXXXXXXXXXXXXXXXXXXXXXAPLYPAAVTQRSGGETWFEVGRSKT 180
AP YP+ +TQ++GGET EVG KT
Sbjct: 121 DDDLNRNNSAGSGSLAGSLSGSMSNLSIANNGP--APFYPSNITQKNGGETLVEVGGCKT 178
Query: 181 YTPTADDIGHVLKFECVVVDAETKLPVGHANTILTSRVIPAPSPSPRRLIPV---DGMGN 237
YTPTADDI HVLKFECVV +AETK VGH +TILTSRVIPAPSPSPR+LIPV DGMG+
Sbjct: 179 YTPTADDISHVLKFECVVANAETKQIVGHPSTILTSRVIPAPSPSPRKLIPVNGADGMGH 238
Query: 238 VDADGRVSSSGTFTVLSYNILSDSYASSDLYNYCPSWALSWPYRRQNLLREIVGYRADII 297
+D D R+ S+G+FTVLSYNILSD+ ASSDLY+YCP WALSWPYRRQNLLREIVGYRAD++
Sbjct: 239 LDQDARIQSAGSFTVLSYNILSDTSASSDLYSYCPPWALSWPYRRQNLLREIVGYRADVV 298
Query: 298 CLQEVQNDHYEEFFAPELDKHGYNGLYKRKTNEVFNGNINTIDGCATFFRRDRFSHVKKY 357
CLQEVQ+DH+ E FAPELDKHGY LYKRKTNEV +G+ + IDGCATFFRRDRFSHVKKY
Sbjct: 299 CLQEVQSDHFHEIFAPELDKHGYQALYKRKTNEVLSGSTSAIDGCATFFRRDRFSHVKKY 358
Query: 358 EVEFNKAAQSLTDAMIPTTQKKSALNRLVKDNVALIVVLEAKVNNQPVDNPGKRQLLCVV 417
+VEFNKAAQSLTDA+IP QK++ALNRLVKDN+ALIVVLEAK NQP D GKRQL+CV
Sbjct: 359 DVEFNKAAQSLTDALIPQAQKRTALNRLVKDNIALIVVLEAKFGNQPTDPSGKRQLICVA 418
Query: 418 SSFI 421
++ +
Sbjct: 419 NTHV 422
>AT5G11350.1 | Symbols: | endonuclease/exonuclease/phosphatase
family protein | chr5:3621584-3625413 FORWARD
Length = 754
Score = 84.7 bits (208), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 84/164 (51%), Gaps = 33/164 (20%)
Query: 247 SGTFTVLSYNILSDSYASS---DLYNYCPSWALSWPYRRQNLLREIVGYRADIICLQEVQ 303
S F VLSYNIL+D A+ LY + P LSW +R+ L+ E+ + ADI+CLQEV
Sbjct: 180 SEKFVVLSYNILADYLANDHWRSLYFHIPRNMLSWGWRKSKLVFELSLWSADIMCLQEV- 238
Query: 304 NDHYEEFFAPELDKHGYNGLYKRKTNEVFNGNINTIDGCATFFRRDRFSHVKKYEVEFNK 363
D +++ E+ GY+ ++K +T N +DGCA F+R +RF V + ++FN+
Sbjct: 239 -DKFQD-LEEEMKHRGYSAIWKMRTG-------NAVDGCAIFWRSNRFKLVHEESIQFNQ 289
Query: 364 AAQSLTDAMIPTTQKKSALNRLVKDNVALIVVLEAKVNNQPVDN 407
++DNVA I VLE + + +N
Sbjct: 290 LG--------------------LRDNVAQICVLETLLTSHTKEN 313
>AT1G31500.3 | Symbols: | endonuclease/exonuclease/phosphatase
family protein | chr1:11274211-11276515 REVERSE
Length = 283
Score = 83.6 bits (205), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/187 (31%), Positives = 88/187 (47%), Gaps = 19/187 (10%)
Query: 211 NTILTSRVI-------PAPSPSPRRLIPVDGMGNVDADGRVSSSGT-FTVLSYNILSDSY 262
N +L +VI PA P R+ V+G VD R S G F ++SYNIL+ Y
Sbjct: 14 NLLLPRKVISRRMSTNPAIEPKVRKFESVEG---VDIGSRNKSDGIRFRLVSYNILAQVY 70
Query: 263 ASSDLYNYCPSWALSWPYRRQNLLREIVGYRADIICLQEVQNDHYEEFFAPELDKHGYNG 322
S L + P L W R +L + +AD CLQEV D Y+ F+ +D GY+G
Sbjct: 71 VKSALLPHSPPACLKWKARSHAILSVLKNLQADFFCLQEV--DEYDSFYRNNMDSLGYSG 128
Query: 323 LYKRKTNEVFNGNINTIDGCATFFRRDRFSHVKKYEVEFNKAAQSLTDAMIPTTQKKSAL 382
+Y ++T + DGCA F++ V K +E+N S+ + +++K
Sbjct: 129 IYIQRTGQ------RKRDGCAIFYKPSCAELVTKERIEYNDLVDSIKADSVSCSEQKIET 182
Query: 383 NRLVKDN 389
+ KD+
Sbjct: 183 SNEGKDS 189
>AT1G31500.1 | Symbols: | endonuclease/exonuclease/phosphatase
family protein | chr1:11273802-11276515 REVERSE
Length = 388
Score = 82.8 bits (203), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 82/171 (47%), Gaps = 12/171 (7%)
Query: 220 PAPSPSPRRLIPVDGMGNVDADGRVSSSGT-FTVLSYNILSDSYASSDLYNYCPSWALSW 278
PA P R+ V+G VD R S G F ++SYNIL+ Y S L + P L W
Sbjct: 35 PAIEPKVRKFESVEG---VDIGSRNKSDGIRFRLVSYNILAQVYVKSALLPHSPPACLKW 91
Query: 279 PYRRQNLLREIVGYRADIICLQEVQNDHYEEFFAPELDKHGYNGLYKRKTNEVFNGNINT 338
R +L + +AD CLQEV D Y+ F+ +D GY+G+Y ++T +
Sbjct: 92 KARSHAILSVLKNLQADFFCLQEV--DEYDSFYRNNMDSLGYSGIYIQRTGQ------RK 143
Query: 339 IDGCATFFRRDRFSHVKKYEVEFNKAAQSLTDAMIPTTQKKSALNRLVKDN 389
DGCA F++ V K +E+N S+ + +++K + KD+
Sbjct: 144 RDGCAIFYKPSCAELVTKERIEYNDLVDSIKADSVSCSEQKIETSNEGKDS 194
>AT1G31500.2 | Symbols: | endonuclease/exonuclease/phosphatase
family protein | chr1:11273802-11276128 REVERSE
Length = 358
Score = 82.4 bits (202), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 82/171 (47%), Gaps = 12/171 (7%)
Query: 220 PAPSPSPRRLIPVDGMGNVDADGRVSSSGT-FTVLSYNILSDSYASSDLYNYCPSWALSW 278
PA P R+ V+G VD R S G F ++SYNIL+ Y S L + P L W
Sbjct: 5 PAIEPKVRKFESVEG---VDIGSRNKSDGIRFRLVSYNILAQVYVKSALLPHSPPACLKW 61
Query: 279 PYRRQNLLREIVGYRADIICLQEVQNDHYEEFFAPELDKHGYNGLYKRKTNEVFNGNINT 338
R +L + +AD CLQEV D Y+ F+ +D GY+G+Y ++T +
Sbjct: 62 KARSHAILSVLKNLQADFFCLQEV--DEYDSFYRNNMDSLGYSGIYIQRTGQ------RK 113
Query: 339 IDGCATFFRRDRFSHVKKYEVEFNKAAQSLTDAMIPTTQKKSALNRLVKDN 389
DGCA F++ V K +E+N S+ + +++K + KD+
Sbjct: 114 RDGCAIFYKPSCAELVTKERIEYNDLVDSIKADSVSCSEQKIETSNEGKDS 164
>AT3G18500.2 | Symbols: | similar to
endonuclease/exonuclease/phosphatase family protein
[Arabidopsis thaliana] (TAIR:AT1G73875.1); similar to
unnamed protein product [Vitis vinifera]
(GB:CAO68025.1); contains InterPro domain
Endonuclease/exonuclease/phosphatase
(InterPro:IPR005135) | chr3:6352550-6355303 FORWARD
Length = 448
Score = 82.0 bits (201), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 83/156 (53%), Gaps = 32/156 (20%)
Query: 250 FTVLSYNILSDSYAS--SDLYNYCPSWALSWPYRRQNLLREIVGYRADIICLQEVQNDHY 307
FTV+SYNIL D +S +LY+ L W YR++ + E++ DII +QEV D Y
Sbjct: 109 FTVVSYNILGDGNSSYHRELYSNVSVPYLKWGYRKRLICEELIRLNPDIISMQEV--DKY 166
Query: 308 EEFFAPELDKHGYNGLYKRKTNEVFNGNINTIDGCATFFRRDRFSHVKKYEVEFNKAAQS 367
+ F+ ++K GY G YKR+T + +DGCA F++ DRF +++ +EF++
Sbjct: 167 FDLFS-MMEKAGYAGSYKRRTGD-------NVDGCAMFWKADRFGVLERENIEFSQFG-- 216
Query: 368 LTDAMIPTTQKKSALNRLVKDNVALIVVLEAKVNNQ 403
++DNVA + VLE + +N+
Sbjct: 217 ------------------MRDNVAQLAVLELRKSNK 234
>AT1G31500.4 | Symbols: | endonuclease/exonuclease/phosphatase
family protein | chr1:11273802-11276515 REVERSE
Length = 417
Score = 74.7 bits (182), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 71/146 (48%), Gaps = 8/146 (5%)
Query: 244 VSSSGTFTVLSYNILSDSYASSDLYNYCPSWALSWPYRRQNLLREIVGYRADIICLQEVQ 303
V S F ++SYNIL+ Y S L + P L W R +L + +AD CLQEV
Sbjct: 86 VFSGIRFRLVSYNILAQVYVKSALLPHSPPACLKWKARSHAILSVLKNLQADFFCLQEV- 144
Query: 304 NDHYEEFFAPELDKHGYNGLYKRKTNEVFNGNINTIDGCATFFRRDRFSHVKKYEVEFNK 363
D Y+ F+ +D GY+G+Y ++T + DGCA F++ V K +E+N
Sbjct: 145 -DEYDSFYRNNMDSLGYSGIYIQRTGQ------RKRDGCAIFYKPSCAELVTKERIEYND 197
Query: 364 AAQSLTDAMIPTTQKKSALNRLVKDN 389
S+ + +++K + KD+
Sbjct: 198 LVDSIKADSVSCSEQKIETSNEGKDS 223
>AT1G73875.1 | Symbols: | endonuclease/exonuclease/phosphatase
family protein | chr1:27784773-27788232 FORWARD
Length = 454
Score = 62.8 bits (151), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 71/157 (45%), Gaps = 32/157 (20%)
Query: 250 FTVLSYNILSDSYASS--DLYNYCPSWALSWPYRRQNLLREIVGYRADIICLQEVQNDHY 307
++SYN+L AS+ DLY P L W R+ + +EI Y A I+CLQEV D +
Sbjct: 100 LVLVSYNLLGVDNASNHMDLYYNVPRKHLEWSRRKHLICKEISRYNASILCLQEV--DRF 157
Query: 308 EEFFAPELDKHGYNGLYKRKTNEVFNGNINTIDGCATFFRRDRFSHVKKYEVEFNKAAQS 367
++ L G+ G++K +T E DGCA F++ + F + +EF+K
Sbjct: 158 DDLDVL-LKNRGFRGVHKSRTGEA-------SDGCAIFWKENLFELLDHQHIEFDKFG-- 207
Query: 368 LTDAMIPTTQKKSALNRLVKDNVALIVVLEAKVNNQP 404
+++NVA + VLE P
Sbjct: 208 ------------------MRNNVAQLCVLEMNCEEDP 226
>AT3G18500.1 | Symbols: | similar to
endonuclease/exonuclease/phosphatase family protein
[Arabidopsis thaliana] (TAIR:AT1G73875.1); similar to
unnamed protein product [Vitis vinifera]
(GB:CAO68025.1); contains InterPro domain
Endonuclease/exonuclease/phosphatase
(InterPro:IPR005135) | chr3:6352550-6355303 FORWARD
Length = 426
Score = 55.1 bits (131), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 66/156 (42%), Gaps = 54/156 (34%)
Query: 250 FTVLSYNILSDSYAS--SDLYNYCPSWALSWPYRRQNLLREIVGYRADIICLQEVQNDHY 307
FTV+SYNIL D +S +LY+ L W YR++ + E++ DII +Q D+
Sbjct: 109 FTVVSYNILGDGNSSYHRELYSNVSVPYLKWGYRKRLICEELIRLNPDIISMQRRTGDN- 167
Query: 308 EEFFAPELDKHGYNGLYKRKTNEVFNGNINTIDGCATFFRRDRFSHVKKYEVEFNKAAQS 367
+DGCA F++ DRF +++ +EF++
Sbjct: 168 -------------------------------VDGCAMFWKADRFGVLERENIEFSQFG-- 194
Query: 368 LTDAMIPTTQKKSALNRLVKDNVALIVVLEAKVNNQ 403
++DNVA + VLE + +N+
Sbjct: 195 ------------------MRDNVAQLAVLELRKSNK 212
>AT1G31530.1 | Symbols: | endonuclease/exonuclease/phosphatase
family protein | chr1:11281169-11282449 REVERSE
Length = 283
Score = 48.1 bits (113), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 41/87 (47%), Gaps = 15/87 (17%)
Query: 263 ASSDLYNYCPSWALSWPYRRQNLLREIVGYRADIICLQEVQNDHYEEFFAPELDKHGYNG 322
ASS L P ++ W R + +L I + AD ICLQEV D Y FF ++ GY+G
Sbjct: 2 ASSSLMYVSPPESILWEKRSKAILDNIKNFEADFICLQEV--DEYHSFFDRNMEAQGYSG 59
Query: 323 LYKRKTNEVFNGNINTIDG--CATFFR 347
+ I +G CA FF+
Sbjct: 60 I-----------PIENKEGYECAIFFK 75