Miyakogusa Predicted Gene
- Lj1g3v0049850.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v0049850.1 CUFF.25160.1
(850 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G44750.1 | Symbols: REV1, ATREV1 | DNA-directed DNA polymeras... 847 0.0
AT5G44750.2 | Symbols: REV1, ATREV1 | DNA-directed DNA polymeras... 843 0.0
AT1G49980.1 | Symbols: | DNA/RNA polymerases superfamily protei... 129 6e-30
AT5G44740.2 | Symbols: POLH | Y-family DNA polymerase H | chr5:1... 117 4e-26
AT5G44740.1 | Symbols: POLH | Y-family DNA polymerase H | chr5:1... 75 3e-13
>AT5G44750.1 | Symbols: REV1, ATREV1 | DNA-directed DNA polymerases
| chr5:18052669-18059581 FORWARD LENGTH=1101
Length = 1101
Score = 847 bits (2188), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 445/815 (54%), Positives = 557/815 (68%), Gaps = 46/815 (5%)
Query: 45 LGVAWGS-------NALPPSDFSGYMALKNRKLHDQFHAQXXXXXXXXXXG-KPIFTGVS 96
LG AWG+ + P SDF YM +KNRKL +QF + K IF GVS
Sbjct: 33 LGAAWGAASSRSSFRSSPFSDFGSYMEVKNRKLQNQFETEASAASRGVSGSEKLIFQGVS 92
Query: 97 IFVDGFPVPSSQELRGYMFKYGGRFENYFSRHRVTHIICSNLPHTKLKNLRAFSAGLPLV 156
IFVDGF +PS QEL+GYM KYGGRFENYFSR VTHIICSNLP +K+KNLR FS GLP+V
Sbjct: 93 IFVDGFTIPSHQELKGYMMKYGGRFENYFSRRSVTHIICSNLPDSKVKNLRTFSRGLPVV 152
Query: 157 KPTWILDSVAANRLLPWLPYQLHQLANNQPTLSAFFSFKTSKLSDMEDSTT----NVCQS 212
KPTWI+DS++ANRLL W+PYQL QL + QP LSAFF+ ++ M T + S
Sbjct: 153 KPTWIVDSISANRLLGWVPYQLDQLNDTQPKLSAFFAPRSHLTPQMASPVTSFQPDTGYS 212
Query: 213 QDRHSSKVNGLISTVSNDTIPENENTNAFMLEDSTNLVRVKYDEGSTAATQ--------- 263
+ S + S + D I ++ + +E++T + + G +++
Sbjct: 213 EAEEGSSIRADDSEEARDHI--DDEIDGVYIENTTPELTEQTGTGDLKSSEMNAEGLGNY 270
Query: 264 --EKTNVQDELEPNHQEPSTSVSSPCSDDQNVKEFQSSAATRL--SKQGHSTLTDPNFVE 319
E+ V EL+ STS + S N K S AT S + HSTL DPNFVE
Sbjct: 271 DIEEKEVSSELQSTTNLHSTSDNK--SVHANGKNGGKSIATAAGSSTRRHSTLEDPNFVE 328
Query: 320 NYFKSSRLHFIGTWRNRYRKRFPMPSAGLN-KEXXXXXXXXXXXXXXXHVDMDCFFVSVV 378
NYFK+SRLHFIGTWRNRYRKRF S GL + H+D+DCFFVSVV
Sbjct: 329 NYFKNSRLHFIGTWRNRYRKRFHGSSNGLKWADSGQNTAEMAKKSTIIHIDLDCFFVSVV 388
Query: 379 IRSQPELFDKPVAVCHSNNSKGTSEISSANYPARSYGIKAGMFVRDAKALCPDLVIFPYN 438
I+++ EL DKPVAVCHS+N KGT+EISSANYPAR+YG+KAGMFVR AK LCP LVI PYN
Sbjct: 389 IKNRLELHDKPVAVCHSDNPKGTAEISSANYPARAYGVKAGMFVRHAKDLCPQLVIVPYN 448
Query: 439 FEAYEEVADQFYTILHRHCNKVQAVSCDEAFLDVTHSEVKDPELLASSIREEICKTTRCT 498
FEAYEEVADQFY ILHRHC KVQA+SCDEAFLDV+ + E+LAS+IR EI +TT C+
Sbjct: 449 FEAYEEVADQFYDILHRHCRKVQALSCDEAFLDVSDLSDVETEVLASTIRNEILETTGCS 508
Query: 499 ASAGIAGNMLMARIATRTAKPNGQYHITPERVDDYLCQLPIDSLPGVGHVLQEKLKKQNV 558
ASAGI G MLMAR+ATR AKP GQ +I+ E+V+++L QLP+ +LPGVG VL+EKL KQN+
Sbjct: 509 ASAGIGGTMLMARLATRVAKPAGQLYISAEKVEEFLDQLPVGTLPGVGSVLKEKLVKQNI 568
Query: 559 HTCGQLRTISKALLQKDYGMKTGEMLWNYSRGIDSRAVGVFQESKSIGADVNWGVRFNDV 618
TCGQLR ISK LQKD+G+KTGEMLW+YSRG+D R+V QESKSIGA+VNWGVRF D
Sbjct: 569 QTCGQLRLISKDSLQKDFGVKTGEMLWSYSRGLDLRSVTAVQESKSIGAEVNWGVRFRDQ 628
Query: 619 KDCDRFLIDLCKEVSLRLQCCGVQGRTFSLKIKKRRKDAVEPAKFMGCGDCENLSHSITI 678
+D FL LCKEVSLRLQ C + GRTF+LKIKKR+KDA EP K+MGCGDC+NLS SIT+
Sbjct: 629 QDVQHFLQCLCKEVSLRLQGCEMIGRTFTLKIKKRKKDAEEPTKYMGCGDCDNLSRSITV 688
Query: 679 PLATDNVEVLQRIVKQLFGCFYIDVKEIRGIGLQVSRFESAETSKQEKYSLKSWVASGNA 738
P ATD++EVLQRI K+LFG F +DVKE+RG+GLQVS+ +SA+ S + +LKSW++S A
Sbjct: 689 PAATDDIEVLQRISKKLFGSFCLDVKEVRGVGLQVSKLDSADPSNKGSRTLKSWLSSAPA 748
Query: 739 SMQKQKYPMDQD--------KQNMD---GTSSGECRGGSSVKIQNNQANANPISTPPPLC 787
+Q ++QD ++N D + G R S +++ + + S+ PP+C
Sbjct: 749 VVQ-----IEQDDNVFAAKVRENSDCNRPVTGGVSRLRESNSEESSIQSGDTNSSLPPMC 803
Query: 788 HLDVEVIRNLPPEVFSELNEIYGGKLIDYIAKGKG 822
+LD+EV+ NLPPE+ SEL+ YGGKL + I K +G
Sbjct: 804 YLDMEVLENLPPELLSELDGTYGGKLFELIEKKRG 838
>AT5G44750.2 | Symbols: REV1, ATREV1 | DNA-directed DNA polymerases
| chr5:18052669-18059581 FORWARD LENGTH=1105
Length = 1105
Score = 843 bits (2177), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 445/819 (54%), Positives = 557/819 (68%), Gaps = 50/819 (6%)
Query: 45 LGVAWGS-------NALPPSDFSGYMALKNRKLHDQFHAQXXXXXXXXXXG-KPIFTGVS 96
LG AWG+ + P SDF YM +KNRKL +QF + K IF GVS
Sbjct: 33 LGAAWGAASSRSSFRSSPFSDFGSYMEVKNRKLQNQFETEASAASRGVSGSEKLIFQGVS 92
Query: 97 IFVDGFPVPSSQELRGYMFKYGGRFENYFSRHRVTHIICSNLPHTKLKNLRAFSAGLPLV 156
IFVDGF +PS QEL+GYM KYGGRFENYFSR VTHIICSNLP +K+KNLR FS GLP+V
Sbjct: 93 IFVDGFTIPSHQELKGYMMKYGGRFENYFSRRSVTHIICSNLPDSKVKNLRTFSRGLPVV 152
Query: 157 KPTWILDSVAANRLLPWLPYQLHQLANNQPTLSAFFSFKTSKLSDMEDSTT----NVCQS 212
KPTWI+DS++ANRLL W+PYQL QL + QP LSAFF+ ++ M T + S
Sbjct: 153 KPTWIVDSISANRLLGWVPYQLDQLNDTQPKLSAFFAPRSHLTPQMASPVTSFQPDTGYS 212
Query: 213 QDRHSSKVNGLISTVSNDTIPENENTNAFMLEDSTNLVRVKYDEGSTAATQ--------- 263
+ S + S + D I ++ + +E++T + + G +++
Sbjct: 213 EAEEGSSIRADDSEEARDHI--DDEIDGVYIENTTPELTEQTGTGDLKSSEMNAEGLGNY 270
Query: 264 --EKTNVQDELEPNHQEPSTSVSSPCSDDQNVKEFQSSAATRL--SKQGHSTLTDPNFVE 319
E+ V EL+ STS + S N K S AT S + HSTL DPNFVE
Sbjct: 271 DIEEKEVSSELQSTTNLHSTSDNK--SVHANGKNGGKSIATAAGSSTRRHSTLEDPNFVE 328
Query: 320 NYFKSSRLHFIGTWRNRYRKRFPMPSAGLN-KEXXXXXXXXXXXXXXXHVDMDCFFVSVV 378
NYFK+SRLHFIGTWRNRYRKRF S GL + H+D+DCFFVSVV
Sbjct: 329 NYFKNSRLHFIGTWRNRYRKRFHGSSNGLKWADSGQNTAEMAKKSTIIHIDLDCFFVSVV 388
Query: 379 IRSQPELFDKPVAVCHSNNSKGTSEISSANYPARSYGIKAGMFVRDAKALCPDLVIFPYN 438
I+++ EL DKPVAVCHS+N KGT+EISSANYPAR+YG+KAGMFVR AK LCP LVI PYN
Sbjct: 389 IKNRLELHDKPVAVCHSDNPKGTAEISSANYPARAYGVKAGMFVRHAKDLCPQLVIVPYN 448
Query: 439 FEAYEEVADQFYTILHRHCNKVQAVSCDEAFLDVTHSEVKDPELLASSIREEICKTTRCT 498
FEAYEEVADQFY ILHRHC KVQA+SCDEAFLDV+ + E+LAS+IR EI +TT C+
Sbjct: 449 FEAYEEVADQFYDILHRHCRKVQALSCDEAFLDVSDLSDVETEVLASTIRNEILETTGCS 508
Query: 499 ASAGIAGNMLMARIATRTAKPNGQYHITPERVDDYLCQLPIDSLPGVGHVLQEKLKKQNV 558
ASAGI G MLMAR+ATR AKP GQ +I+ E+V+++L QLP+ +LPGVG VL+EKL KQN+
Sbjct: 509 ASAGIGGTMLMARLATRVAKPAGQLYISAEKVEEFLDQLPVGTLPGVGSVLKEKLVKQNI 568
Query: 559 HTCGQLRTISKALLQKDYGMKTGEMLWNYSRGIDSRAVGVFQESKSIGADVNWGVRFNDV 618
TCGQLR ISK LQKD+G+KTGEMLW+YSRG+D R+V QESKSIGA+VNWGVRF D
Sbjct: 569 QTCGQLRLISKDSLQKDFGVKTGEMLWSYSRGLDLRSVTAVQESKSIGAEVNWGVRFRDQ 628
Query: 619 KD----CDRFLIDLCKEVSLRLQCCGVQGRTFSLKIKKRRKDAVEPAKFMGCGDCENLSH 674
+D FL LCKEVSLRLQ C + GRTF+LKIKKR+KDA EP K+MGCGDC+NLS
Sbjct: 629 QDVFILVQHFLQCLCKEVSLRLQGCEMIGRTFTLKIKKRKKDAEEPTKYMGCGDCDNLSR 688
Query: 675 SITIPLATDNVEVLQRIVKQLFGCFYIDVKEIRGIGLQVSRFESAETSKQEKYSLKSWVA 734
SIT+P ATD++EVLQRI K+LFG F +DVKE+RG+GLQVS+ +SA+ S + +LKSW++
Sbjct: 689 SITVPAATDDIEVLQRISKKLFGSFCLDVKEVRGVGLQVSKLDSADPSNKGSRTLKSWLS 748
Query: 735 SGNASMQKQKYPMDQD--------KQNMD---GTSSGECRGGSSVKIQNNQANANPISTP 783
S A +Q ++QD ++N D + G R S +++ + + S+
Sbjct: 749 SAPAVVQ-----IEQDDNVFAAKVRENSDCNRPVTGGVSRLRESNSEESSIQSGDTNSSL 803
Query: 784 PPLCHLDVEVIRNLPPEVFSELNEIYGGKLIDYIAKGKG 822
PP+C+LD+EV+ NLPPE+ SEL+ YGGKL + I K +G
Sbjct: 804 PPMCYLDMEVLENLPPELLSELDGTYGGKLFELIEKKRG 842
>AT1G49980.1 | Symbols: | DNA/RNA polymerases superfamily protein |
chr1:18508026-18512111 FORWARD LENGTH=671
Length = 671
Score = 129 bits (325), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 113/418 (27%), Positives = 190/418 (45%), Gaps = 47/418 (11%)
Query: 367 HVDMDCFFVSVVIRSQPELFDKPVAVCHSNNSKGTSEISSANYPARSYGIKAGMFVRDAK 426
HVDMD F+ +V S P + KP+AV G S IS+ANY AR +G++A M A+
Sbjct: 107 HVDMDAFYAAVETLSDPSIKGKPMAV------GGLSMISTANYEARKFGVRAAMPGFIAR 160
Query: 427 ALCPDLVIFPYNFEAYEEVADQFYTILHRHCNKVQAVSCDEAFLDVTHSEVKDPEL---- 482
LCPDL+ P +F Y +D + + A S DEA+LD+T ++ L
Sbjct: 161 KLCPDLIFVPVDFTKYTHYSDLTRKVFRNYDPHFIAGSLDEAYLDITEV-CRERGLSGGE 219
Query: 483 LASSIREEICKTTRCTASAGIAGNMLMARIATRTAKPNGQYHITPER--VDDYLCQLPID 540
+A +R + T T SAG+A N L+A++ + KPNGQ+ + +R V ++ LP+
Sbjct: 220 IAEELRSSVYSETGLTCSAGVAANRLLAKVCSDINKPNGQFVLQNDRSTVMTFVSFLPVR 279
Query: 541 SLPGVGHVLQEKLKKQ-NVHTCGQLRTISKALLQKDYGMKTGEMLWNYSRGIDSRAVGVF 599
+ G+G V + LK + TC ++ +LL + + + + G+
Sbjct: 280 KIGGIGKVTEHILKDALGIKTCEEM-VQKGSLLYALFSQSSADFFLSVGLGLGGTNTPQV 338
Query: 600 QESKSIGADVNWGVRFNDVKDCDRFLIDLCKEVSLRLQCCGVQGRTFSLKIK----KRRK 655
+ KSI ++ + D + L +L + +S ++ G+ RT +LK+K + R
Sbjct: 339 RSRKSISSERTFAAT-GDERLLYSKLDELAEMLSHDMKKEGLTARTLTLKLKTASFEIRS 397
Query: 656 DAVEPAKFMGCGDCENLSHSITIPLATDNVEVLQRIVKQLFGCFYIDVKEIRGIGLQVSR 715
AV ++ C + L H+ + A V V R IGL++S+
Sbjct: 398 RAVSLQRYT-CSSDDILKHATKLLKAELPVSV-------------------RLIGLRMSQ 437
Query: 716 F-ESAETSKQEKYSLKSWVASGNASMQKQ------KYPMDQDKQNMDGTSSGECRGGS 766
F E S + ++ ++ ++S Q Q + +D +K + SG GS
Sbjct: 438 FVEEIRNSDPSQGTITKFIVQKDSSRQAQDLDDNDSFDLDANKNCLSNDESGNVSFGS 495
>AT5G44740.2 | Symbols: POLH | Y-family DNA polymerase H |
chr5:18047903-18051779 REVERSE LENGTH=672
Length = 672
Score = 117 bits (292), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 141/334 (42%), Gaps = 66/334 (19%)
Query: 367 HVDMDCFFVSVVIRSQPELFDKPVAVCHSNNSKGTSEISSANYPARSYGIKAGMFVRDAK 426
HVDMDCF+V V R QPEL P AV N +G I+ +Y AR G+K M +AK
Sbjct: 16 HVDMDCFYVQVEQRKQPELRGLPSAVVQYNEWQGGGLIA-VSYEARKCGVKRSMRGDEAK 74
Query: 427 ALCPD--LVIFPY-----NFEAYEEVADQFYTILHRHCNKVQAVSCDEAFLDVTHS---- 475
A CP LV P + Y + +IL + K + S DE +LD+T +
Sbjct: 75 AACPQIQLVQVPVARGKADLNLYRSAGSEVVSILAK-SGKCERASIDEVYLDLTDAAESM 133
Query: 476 ---------EVKDPELLASSI--------------------------------------- 487
E+ D E+L S I
Sbjct: 134 LADAPPESLELIDEEVLKSHILGMNREDGDDFKESVRNWICREDADRRDKLLSCGIIIVA 193
Query: 488 --REEICKTTRCTASAGIAGNMLMARIATRTAKPNGQYHITPERVDDYLCQLPIDSLPGV 545
R+++ K T T SAGIA N ++A++A+ KP Q + V + L LPI + +
Sbjct: 194 ELRKQVLKETEFTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQELLSSLPIKKMKQL 253
Query: 546 GHVLQEKLKKQ-NVHTCGQLRTISKALLQKDYGMKTGEMLWNYSRGIDSRAVGVFQESKS 604
G L L+ V T G L S+ LQ+ YG+ TG LWN +RGI V KS
Sbjct: 254 GGKLGTSLQTDLGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGISGEEVQGRLLPKS 313
Query: 605 IGADVNW-GVR-FNDVKDCDRFLIDLCKEVSLRL 636
G+ + G R + +L L +E+S RL
Sbjct: 314 HGSGKTFPGPRALKSLSTVQHWLNQLSEELSERL 347
>AT5G44740.1 | Symbols: POLH | Y-family DNA polymerase H |
chr5:18047903-18051197 REVERSE LENGTH=588
Length = 588
Score = 74.7 bits (182), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 109/245 (44%), Gaps = 14/245 (5%)
Query: 482 LLASSIREEICKTTRCTASAGIAGNMLMARIATRTAKPNGQYHITPERVDDYLCQLPIDS 541
++ + +R+++ K T T SAGIA N ++A++A+ KP Q + V + L LPI
Sbjct: 106 IIVAELRKQVLKETEFTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQELLSSLPIKK 165
Query: 542 LPGVGHVLQEKLKKQ-NVHTCGQLRTISKALLQKDYGMKTGEMLWNYSRGIDSRAVGVFQ 600
+ +G L L+ V T G L S+ LQ+ YG+ TG LWN +RGI V
Sbjct: 166 MKQLGGKLGTSLQTDLGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGISGEEVQGRL 225
Query: 601 ESKSIGADVNW-GVR-FNDVKDCDRFLIDLCKEVSLRLQCCGVQGR----TFSLKIKKRR 654
KS G+ + G R + +L L +E+S RL Q + T +L R
Sbjct: 226 LPKSHGSGKTFPGPRALKSLSTVQHWLNQLSEELSERLGSDLEQNKRIASTLTLHASAFR 285
Query: 655 KDAVEPAKFMGCGDCENLSHSITIPLATDNVEVLQRIVKQLFGCFYI-----DVKEIRGI 709
+ K C + + +T + D + Q +++ G F I ++ R
Sbjct: 286 SKDSDSHKKFPSKSCP-MRYGVT-KIQEDAFNLFQAALREYMGSFGIKPQGNKLETWRIT 343
Query: 710 GLQVS 714
GL VS
Sbjct: 344 GLSVS 348