Miyakogusa Predicted Gene
- Lj3g3v0965940.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0965940.1 Non Chatacterized Hit- tr|I1MJ65|I1MJ65_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.50128
PE,79.78,0,PAT1,Topoisomerase II-associated protein PAT1;
TOPOISOMERASE II-ASSOCIATED PROTEIN PAT1,NULL; seg,NU,CUFF.41924.1
(820 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G79090.2 | Symbols: | FUNCTIONS IN: molecular_function unkno... 608 e-174
AT1G79090.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 608 e-174
AT3G22270.1 | Symbols: | Topoisomerase II-associated protein PA... 504 e-142
AT4G14990.1 | Symbols: | Topoisomerase II-associated protein PA... 474 e-133
>AT1G79090.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 25 plant
structures; EXPRESSED DURING: 15 growth stages; CONTAINS
InterPro DOMAIN/s: Topoisomerase II-associated protein
PAT1 (InterPro:IPR019167); BEST Arabidopsis thaliana
protein match is: Topoisomerase II-associated protein
PAT1 (TAIR:AT3G22270.1); Has 1260 Blast hits to 1163
proteins in 186 species: Archae - 0; Bacteria - 32;
Metazoa - 596; Fungi - 277; Plants - 212; Viruses - 0;
Other Eukaryotes - 143 (source: NCBI BLink). |
chr1:29749551-29752945 REVERSE LENGTH=793
Length = 793
Score = 608 bits (1567), Expect = e-174, Method: Compositional matrix adjust.
Identities = 344/678 (50%), Positives = 428/678 (63%), Gaps = 77/678 (11%)
Query: 177 GESVPNWFDQHAYDSETTQDGKRWSSQPHSSIAHLEESKSLYRTSLYPEKQ----QEH-- 230
GE +PNW+ + DS+ +D K WS+QP SS+ +E+ + RT LYPE Q Q+H
Sbjct: 125 GEELPNWYGRQILDSDAIKDDKVWSAQPFSSLDRVEQ-RIPDRTKLYPEPQRQLHQDHNQ 183
Query: 231 PHFSSEPVLVPNSSFTXXXXXXXXXXXXXXXNNTGYLNIPHNAIGAQMALSSQNRSRFSN 290
FSSEP+LVP SSF G+ NIP+ + G QM S N S F N
Sbjct: 184 QQFSSEPILVPKSSFVSYPPPGSISPD----QRLGHPNIPYQSGGPQMG--SPNFSPFPN 237
Query: 291 PALQLGGLNHGLP-FSGNMNQFPTGSPFNQRIQNQLVNQAGFYSGD-----------HPN 338
QL ++HG P +GN QF P N Q +N+ + GD P
Sbjct: 238 LQPQLPSMHHGSPQHTGNRPQFRPALPLNNLPPAQWMNRQNMHPGDSSGIMNNAMLQQPP 297
Query: 339 FSSGL--------------PM------------------------LNKYDQMLGIMELRD 360
+GL PM YD MLG +LR+
Sbjct: 298 HQNGLMPPQMQGSQNRLPHPMQPPLGHMPGMQPQLFNSHLSRSSSSGNYDGMLGFGDLRE 357
Query: 361 QLPKSALLGRQNLRFPPQGFDLSFNRSNNGWPRFRSKYMTTEEIENILRMQLAATHSNDP 420
P S RQN+RFP QGFD R +P FRSKYM+ EIENILRMQL ATHSNDP
Sbjct: 358 VRPGSGHGNRQNVRFPQQGFDAGVQRR---YP-FRSKYMSAGEIENILRMQLVATHSNDP 413
Query: 421 YVDDYYHQGCLAKKSAGAKLRHHFCPNQIKEHPLRGSANTEQHAFLQVDALGRVPFSSIR 480
YVDDYYHQ CLAKKSAGAKL+HHFCPN +++ R +N E HAFLQV+ALGRVPFSSIR
Sbjct: 414 YVDDYYHQACLAKKSAGAKLKHHFCPNHLRDLQQRARSNNEPHAFLQVEALGRVPFSSIR 473
Query: 481 RPRPLLEVDPPNSSRASSPEQNVSEKPLEQEPLLAARVTIEDGLCLLLDVDDIDRFLQFN 540
RPRPLLEVDPPNS++ + E ++KPL+QEP+LAARV IEDGLCLLL+VDDIDRFL+FN
Sbjct: 474 RPRPLLEVDPPNSAKFGNAEHKPTDKPLDQEPMLAARVYIEDGLCLLLEVDDIDRFLEFN 533
Query: 541 QLQDGGIQLKHKRQGLLEGLAASLHLVDPLGKSGHTVMHVANDDVVFLRIVSLPKGRKLL 600
QLQDGG QLK +RQ LL+ LA SL L DPL K+G + + DD +FLR++SLPKGRKLL
Sbjct: 534 QLQDGGHQLKQRRQALLQSLAVSLQLGDPLAKNGQS---QSLDDFLFLRVISLPKGRKLL 590
Query: 601 ARYLQILFPGGELMRIVCMAIFRHFRFLFGGLPSDPVAAETVSNLARVVSKCLREMDXXX 660
RYLQ++FPG +LMRIVCMAIFRH R LFG L SDP +T + LA V++ C++ M+
Sbjct: 591 IRYLQLIFPGSDLMRIVCMAIFRHLRSLFGVLSSDPDIIKTTNKLATVINLCIQNMELGP 650
Query: 661 XXXXXXXXXXXXEQPPLRPLGSPAGDGASLILVSVLERATELLTDPHAASNYNIANRSLW 720
EQ PLRPLGSP GDGAS +L S+L+RA+EL+ A+N+N A +LW
Sbjct: 651 VSTCLAAVSCSSEQAPLRPLGSPVGDGASTVLKSILDRASELIR----ANNFNNAGIALW 706
Query: 721 QASFDEFFGLLAKYCVNKYDSIMQSFLNQGTPNMAV-IGSEAARAISREMPVELLRASLP 779
+ASF+EFF +L +YC++KYDSIMQS Q P+ A I EAA+AI REMP+ELLR+S P
Sbjct: 707 RASFNEFFNMLMRYCISKYDSIMQSL--QLPPHFATEISEEAAKAIVREMPIELLRSSFP 764
Query: 780 HTDDRQKKILLDFAHRSL 797
H D++QK+IL++F RS+
Sbjct: 765 HIDEQQKRILMEFLKRSM 782
>AT1G79090.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 25 plant
structures; EXPRESSED DURING: 15 growth stages; CONTAINS
InterPro DOMAIN/s: Topoisomerase II-associated protein
PAT1 (InterPro:IPR019167); BEST Arabidopsis thaliana
protein match is: Topoisomerase II-associated protein
PAT1 (TAIR:AT3G22270.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr1:29749551-29752945 REVERSE LENGTH=793
Length = 793
Score = 608 bits (1567), Expect = e-174, Method: Compositional matrix adjust.
Identities = 344/678 (50%), Positives = 428/678 (63%), Gaps = 77/678 (11%)
Query: 177 GESVPNWFDQHAYDSETTQDGKRWSSQPHSSIAHLEESKSLYRTSLYPEKQ----QEH-- 230
GE +PNW+ + DS+ +D K WS+QP SS+ +E+ + RT LYPE Q Q+H
Sbjct: 125 GEELPNWYGRQILDSDAIKDDKVWSAQPFSSLDRVEQ-RIPDRTKLYPEPQRQLHQDHNQ 183
Query: 231 PHFSSEPVLVPNSSFTXXXXXXXXXXXXXXXNNTGYLNIPHNAIGAQMALSSQNRSRFSN 290
FSSEP+LVP SSF G+ NIP+ + G QM S N S F N
Sbjct: 184 QQFSSEPILVPKSSFVSYPPPGSISPD----QRLGHPNIPYQSGGPQMG--SPNFSPFPN 237
Query: 291 PALQLGGLNHGLP-FSGNMNQFPTGSPFNQRIQNQLVNQAGFYSGD-----------HPN 338
QL ++HG P +GN QF P N Q +N+ + GD P
Sbjct: 238 LQPQLPSMHHGSPQHTGNRPQFRPALPLNNLPPAQWMNRQNMHPGDSSGIMNNAMLQQPP 297
Query: 339 FSSGL--------------PM------------------------LNKYDQMLGIMELRD 360
+GL PM YD MLG +LR+
Sbjct: 298 HQNGLMPPQMQGSQNRLPHPMQPPLGHMPGMQPQLFNSHLSRSSSSGNYDGMLGFGDLRE 357
Query: 361 QLPKSALLGRQNLRFPPQGFDLSFNRSNNGWPRFRSKYMTTEEIENILRMQLAATHSNDP 420
P S RQN+RFP QGFD R +P FRSKYM+ EIENILRMQL ATHSNDP
Sbjct: 358 VRPGSGHGNRQNVRFPQQGFDAGVQRR---YP-FRSKYMSAGEIENILRMQLVATHSNDP 413
Query: 421 YVDDYYHQGCLAKKSAGAKLRHHFCPNQIKEHPLRGSANTEQHAFLQVDALGRVPFSSIR 480
YVDDYYHQ CLAKKSAGAKL+HHFCPN +++ R +N E HAFLQV+ALGRVPFSSIR
Sbjct: 414 YVDDYYHQACLAKKSAGAKLKHHFCPNHLRDLQQRARSNNEPHAFLQVEALGRVPFSSIR 473
Query: 481 RPRPLLEVDPPNSSRASSPEQNVSEKPLEQEPLLAARVTIEDGLCLLLDVDDIDRFLQFN 540
RPRPLLEVDPPNS++ + E ++KPL+QEP+LAARV IEDGLCLLL+VDDIDRFL+FN
Sbjct: 474 RPRPLLEVDPPNSAKFGNAEHKPTDKPLDQEPMLAARVYIEDGLCLLLEVDDIDRFLEFN 533
Query: 541 QLQDGGIQLKHKRQGLLEGLAASLHLVDPLGKSGHTVMHVANDDVVFLRIVSLPKGRKLL 600
QLQDGG QLK +RQ LL+ LA SL L DPL K+G + + DD +FLR++SLPKGRKLL
Sbjct: 534 QLQDGGHQLKQRRQALLQSLAVSLQLGDPLAKNGQS---QSLDDFLFLRVISLPKGRKLL 590
Query: 601 ARYLQILFPGGELMRIVCMAIFRHFRFLFGGLPSDPVAAETVSNLARVVSKCLREMDXXX 660
RYLQ++FPG +LMRIVCMAIFRH R LFG L SDP +T + LA V++ C++ M+
Sbjct: 591 IRYLQLIFPGSDLMRIVCMAIFRHLRSLFGVLSSDPDIIKTTNKLATVINLCIQNMELGP 650
Query: 661 XXXXXXXXXXXXEQPPLRPLGSPAGDGASLILVSVLERATELLTDPHAASNYNIANRSLW 720
EQ PLRPLGSP GDGAS +L S+L+RA+EL+ A+N+N A +LW
Sbjct: 651 VSTCLAAVSCSSEQAPLRPLGSPVGDGASTVLKSILDRASELIR----ANNFNNAGIALW 706
Query: 721 QASFDEFFGLLAKYCVNKYDSIMQSFLNQGTPNMAV-IGSEAARAISREMPVELLRASLP 779
+ASF+EFF +L +YC++KYDSIMQS Q P+ A I EAA+AI REMP+ELLR+S P
Sbjct: 707 RASFNEFFNMLMRYCISKYDSIMQSL--QLPPHFATEISEEAAKAIVREMPIELLRSSFP 764
Query: 780 HTDDRQKKILLDFAHRSL 797
H D++QK+IL++F RS+
Sbjct: 765 HIDEQQKRILMEFLKRSM 782
>AT3G22270.1 | Symbols: | Topoisomerase II-associated protein PAT1
| chr3:7874480-7877857 FORWARD LENGTH=782
Length = 782
Score = 504 bits (1298), Expect = e-142, Method: Compositional matrix adjust.
Identities = 302/681 (44%), Positives = 391/681 (57%), Gaps = 69/681 (10%)
Query: 180 VPNWFDQHAYDSETTQDGKRWSSQPHSSIAHLEESKSLYRTSLYPEKQQEHPHFSSEPVL 239
+ +W D E Q+ KRWSSQP S AH SK LYRTS YP++Q + H++SEP++
Sbjct: 127 LTSWLD------EQDQEAKRWSSQPQS-FAH---SKPLYRTSSYPQQQPQLQHYNSEPII 176
Query: 240 VPNSSFTXXXXXXXXXXXXXXXNNTGYLNIPHNAIGAQMALSSQNRSRFSNPALQLGGLN 299
+P S+FT N ++P G+Q+ S+ S SN L GL+
Sbjct: 177 LPESNFTSFPPPGNRSPQASPGNLHRAPSLPG---GSQLTYSAP--SPLSNSGFHLSGLS 231
Query: 300 HGLPFSGNMNQFPTGSP-FNQRIQNQLVNQAGFYSGDHPNF--------SSGLPMLN--- 347
G + GN+ ++ + P +Q V G GDH LP N
Sbjct: 232 QGPHYGGNLTRYASCGPTLGNMVQPHWVTDPGHLHGDHSGLLHNLVQQQHQQLPPRNAIM 291
Query: 348 -----------KYDQM-------------------LGIMELRDQLPKSALLGRQNLRFPP 377
Y Q+ G+ E+R+ KS+ R+N
Sbjct: 292 SQHLLALQQRQSYAQLAALQSQLYSSYPSPSRKVPFGVGEVREHKHKSSHRSRKNRGLSQ 351
Query: 378 QGFDLSFNRSNNGWPRFRSKYMTTEEIENILRMQLAATHSNDPYVDDYYHQGCLAKKSAG 437
Q D + +S G +FRSK+MT+EEIE+IL+MQ + +HSNDPYV+DYYHQ LAKKSAG
Sbjct: 352 QTSDAASQKSETGL-QFRSKHMTSEEIESILKMQHSNSHSNDPYVNDYYHQAKLAKKSAG 410
Query: 438 AKLRHHFCPNQIKEHPLRGSANTEQHAFLQVDALGRVPFSSIRRPRPLLEVDPPNSSRAS 497
+K HF P Q+K+H R ++EQH + VDALG++ S+RRP LLEVD
Sbjct: 411 SKAISHFYPAQLKDHQPRSRNSSEQHPQVHVDALGKITLPSVRRPHALLEVDSSPGFNDG 470
Query: 498 SPEQNVSEKPLEQEPLLAARVTIEDGLCLLLDVDDIDRFLQFNQLQDGGIQLKHKRQGLL 557
S + S K LEQEPL+AARVTIED L +L+D+ DIDR LQ + QDGG QLK KRQ LL
Sbjct: 471 SGDHKGSGKHLEQEPLVAARVTIEDALGVLIDIVDIDRTLQNTRPQDGGAQLKRKRQILL 530
Query: 558 EGLAASLHLVDPLGKSGHTVMHVANDDVVFLRIVSLPKGRKLLARYLQILFPGGELMRIV 617
EGLA +L L DP K+G A DD+VFLRI +LPKGRKLL +YLQ+L PG E R+V
Sbjct: 531 EGLATALQLADPFSKTGQKSGMTAKDDIVFLRIATLPKGRKLLTKYLQLLVPGTENARVV 590
Query: 618 CMAIFRHFRFLFGGLPSDPVAAETVSNLARVVSKCLREMDXXXXXXXXXXXXXXXEQPPL 677
CMAIFRH RFLFGGLPSD +AAET+SNLA+ V+ C++ MD EQPPL
Sbjct: 591 CMAIFRHLRFLFGGLPSDTLAAETISNLAKAVTVCVQAMDLRALSACLAAVVCSSEQPPL 650
Query: 678 RPLGSPAGDGASLILVSVLERATELLTDPHAASNYNIANRSLWQASFDEFFGLLAKYCVN 737
RP+GS AGDGAS++L+S+LERA E++ P + +N LW+ASFDEFF LL KYC +
Sbjct: 651 RPIGSSAGDGASVVLISLLERAAEVVVVPRVM--HGNSNDGLWRASFDEFFNLLTKYCRS 708
Query: 738 KYDSIMQSFLNQGTPNMAVIGSEAARAISREMPVELLRASLPHTDDRQKKILLDFAHRSL 797
KYD+I NQG+ + AI REMP ELLRASL HT+D Q+ LL+F +
Sbjct: 709 KYDTIRGQ--NQGSAADVL-----ELAIKREMPAELLRASLRHTNDDQRNYLLNFGRK-- 759
Query: 798 PVVGFNSSAGGNGNHVNSESV 818
P S++ G +NSESV
Sbjct: 760 PSAISESASHARGGQINSESV 780
Score = 55.1 bits (131), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 46/88 (52%), Gaps = 15/88 (17%)
Query: 81 TNFWKLNKVESGPKSAAVIGDQG-----SRENSTAEWANRNDVQNWFEQSAYDSEGSLDG 135
T F KLN+V +GPK VIGD+G +S +W ++ +W + E +
Sbjct: 85 TTFAKLNRVVTGPKHPGVIGDRGSGSFSRESSSATDWTQDAELTSWLD------EQDQEA 138
Query: 136 RRLSSQPYSSLSHLPEPKPLYRTASYPE 163
+R SSQP S KPLYRT+SYP+
Sbjct: 139 KRWSSQPQS----FAHSKPLYRTSSYPQ 162
>AT4G14990.1 | Symbols: | Topoisomerase II-associated protein PAT1
| chr4:8566259-8569511 REVERSE LENGTH=787
Length = 787
Score = 474 bits (1219), Expect = e-133, Method: Compositional matrix adjust.
Identities = 287/687 (41%), Positives = 385/687 (56%), Gaps = 76/687 (11%)
Query: 182 NWFDQHAYDSETTQDGKRWSSQPHSSIAHLEESKSLYRTSLYPEKQQEHPHFSSEPVLVP 241
+W DQH + + + + S S SLYRTS YP++Q + H+SSEP++VP
Sbjct: 125 SWLDQHTVEEQVQE------ASWSSQPQSSPNSNSLYRTSSYPQQQTQLQHYSSEPIIVP 178
Query: 242 NSSFTXXXXXXXXXXXXXXXNNTGYLNIPHNAIGAQMALSSQNRSRFSNPALQLGGLNHG 301
S+FT + ++P G+Q S+ N S SN L GL+HG
Sbjct: 179 ESTFTSFPSPGKRSQQSSPSHIHRAPSLPG---GSQSNFSAPNASPLSNSTFHLSGLSHG 235
Query: 302 LPFSGN-MNQFPTGSPFNQRIQNQ---LVNQAGFYSGDHPNF------------------ 339
GN + ++ + P + Q V G GDH
Sbjct: 236 PSHYGNNLARYASCGPTLGNMVQQPPHWVTDPGLLHGDHSALLHSLMQQQHLQQLPPRNG 295
Query: 340 --SSGLPMLNK----------------------YDQMLGIMELRDQLPKSALLGRQNLR- 374
S L L + + + G+ E+R+ KS+ R+N
Sbjct: 296 FTSQQLISLQQRQSLAHLAALQSQLYSSYPSPSHKALFGVGEVREHKHKSSHRSRKNRGG 355
Query: 375 FPPQGFDLSFNRSNNGWPRFRSKYMTTEEIENILRMQLAATHSNDPYVDDYYHQGCLAKK 434
Q DL+ +S +G +FRSKYMT+EEIE+IL+MQ + +HS+DPYV+DYYHQ LAKK
Sbjct: 356 ISQQTSDLASQKSESGL-QFRSKYMTSEEIESILKMQHSNSHSSDPYVNDYYHQARLAKK 414
Query: 435 SAGAKLRHHFCPNQIKEHPLRGSANTEQHAFLQVDALGRVPFSSIRRPRPLLEVDPPNSS 494
S+G++ + P+ +K+H R +++Q + VDALG++ SI RPR LLEVD P SS
Sbjct: 415 SSGSRTKPQLYPSHLKDHQSRSRNSSDQQPQVHVDALGKITLPSICRPRALLEVDSPPSS 474
Query: 495 RASSPEQNVSEKPLEQEPLLAARVTIEDGLCLLLDVDDIDRFLQFNQLQDGGIQLKHKRQ 554
K LE EPL+AARVTIED +L+D+ DIDR LQFN+ QDGG QL+ KRQ
Sbjct: 475 ---------GHKHLEDEPLVAARVTIEDAFGVLIDIVDIDRTLQFNRPQDGGAQLRRKRQ 525
Query: 555 GLLEGLAASLHLVDPLGKSGHTVMHVANDDVVFLRIVSLPKGRKLLARYLQILFPGGELM 614
LLEGLA SL LVDP K+G DD+VFLRI +LPKGRKLL +YLQ+L PG E+
Sbjct: 526 ILLEGLATSLQLVDPFSKTGQKTGLTTKDDIVFLRITTLPKGRKLLTKYLQLLVPGTEIA 585
Query: 615 RIVCMAIFRHFRFLFGGLPSDPVAAETVSNLARVVSKCLREMDXXXXXXXXXXXXXXXEQ 674
R+VCMA+FRH RFLFGGLPSD +AAET++NLA+ V+ C++ MD EQ
Sbjct: 586 RVVCMAVFRHLRFLFGGLPSDSLAAETIANLAKAVTVCVQAMDLRALSACLAAVVCSSEQ 645
Query: 675 PPLRPLGSPAGDGASLILVSVLERATELLTD--PHAASNYNIANRSLWQASFDEFFGLLA 732
PPLRP+GS +GDGAS++LVS+LERA E++ P SN+ N LW+ASFDEFF LL
Sbjct: 646 PPLRPIGSSSGDGASVVLVSLLERAAEVIVAVVPPRVSNHGNPNDGLWRASFDEFFSLLT 705
Query: 733 KYCVNKYDSIMQSFLNQGTPNMAVIGSEAARAISREMPVELLRASLPHTDDRQKKILLDF 792
KYC +KY++I Q N A + AI REMP ELLRASL HT++ Q+ LL+
Sbjct: 706 KYCRSKYETIH----GQNHDNAADV---LELAIKREMPAELLRASLRHTNEDQRNFLLNV 758
Query: 793 AHRSLPVVGFNSS-AGGNGNHVNSESV 818
+ PV ++ A +G +NSE V
Sbjct: 759 GRSASPVSESTTTRASASGGQINSEFV 785