Miyakogusa Predicted Gene
- Lj3g3v0965940.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0965940.1 Non Characterized Hit- tr|I1MJ65|I1MJ65_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.50128
PE,79.78,0,PAT1,Topoisomerase II-associated protein PAT1;
TOPOISOMERASE II-ASSOCIATED PROTEIN PAT1,NULL; seg,NU,CUFF.41924.1
(820 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr2g069550.1 | topoisomerase II-associated protein PAT1, puta... 1120 0.0
Medtr2g069530.1 | topoisomerase II-associated protein PAT1 | HC ... 885 0.0
>Medtr2g069550.1 | topoisomerase II-associated protein PAT1,
putative | HC | chr2:28950076-28956943 | 20130731
Length = 825
Score = 1120 bits (2896), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 571/811 (70%), Positives = 631/811 (77%), Gaps = 11/811 (1%)
Query: 13 VQSGDVPTEGEVFDASQYAFFGK-XXXXXXXXXXXXXXXXXPXXXXXXXXXXXXXXXXXX 71
+ +GD+P EG VFDA+QYAFFG+ P
Sbjct: 21 LAAGDIPKEGAVFDATQYAFFGQDACVEEVELGGLDDDDGLPESNEEEFLLNREEVEDVK 80
Query: 72 XXXXXXXXXTNFWKLNKVESGPKSAAVIGDQGSRENSTAEWANRNDVQNWFEQSAYDSEG 131
T+FWKLNKVE GP++ A+ G++GSRENSTAEW RND NWF+Q+ YDSEG
Sbjct: 81 SLSDIDDLSTDFWKLNKVEIGPQNTAIFGERGSRENSTAEWTQRNDNLNWFDQNPYDSEG 140
Query: 132 SLDGRRLSSQPYSSLSHLPEPKPLYRTASYPEXXXXXXXXXXX-GPGESVPNWFDQHAYD 190
SLDGRRLSSQPYSS + L E LYRT+SYPE G ESVPN FDQ D
Sbjct: 141 SLDGRRLSSQPYSSFAPLQESNSLYRTSSYPEQQRQQQQHYLQHGSSESVPNRFDQPFRD 200
Query: 191 SETTQDGKRWSSQPHSSIAHLEESKSLYRTSLYPEKQQEHPHFSSEPVLVPNSSFTXXXX 250
SET QDGKRWSSQPHS+IAH+EESKSLY TSLYP+KQQE HFS++ L P FT
Sbjct: 201 SET-QDGKRWSSQPHSNIAHIEESKSLYSTSLYPDKQQELLHFSNDSTLAP---FTSYPP 256
Query: 251 XXXXXXXXXXXNNTGYLNIPHNAIGAQMALSSQNRSRFSNPALQLGGLNHGLPFSGNMNQ 310
+NTG +NIP++A GAQMALSSQNRS SN ALQLGGLNHG P SGNM Q
Sbjct: 257 PGGRALQSSPSHNTGPINIPYHAAGAQMALSSQNRSHLSNSALQLGGLNHGSPLSGNMRQ 316
Query: 311 FPTGSPFNQRIQNQLVNQAGFYSGDHPNFSSGLPMLNKYDQMLGIMELRDQLPKSALLGR 370
P GSP NQRIQN LV+QAG Y+G HP SSGLPM+NKYDQMLG+MELRDQ+PKSA +GR
Sbjct: 317 IPMGSPLNQRIQNPLVSQAGLYNGAHPKISSGLPMINKYDQMLGMMELRDQMPKSAQIGR 376
Query: 371 QNLRFPPQGFDLSFN-RSNNGWPRFRSKYMTTEEIENILRMQLAATHSNDPYVDDYYHQG 429
QNLRF PQGFD+S N + NNGWPRFRSKYMTTEE+ENI R+QLAATH+NDPYVDDYY+QG
Sbjct: 377 QNLRFSPQGFDMSNNHKFNNGWPRFRSKYMTTEELENIFRLQLAATHTNDPYVDDYYNQG 436
Query: 430 CLAKKSAGAKLRHHFCPNQIKEHPLRGSANTEQHAFLQVDALGRVPFSSIRRPRPLLEVD 489
CLAKKS+GAKLR HF P QI+E PLR S+N E HAFLQVDALGRVPFSSIRRPRPLLEV+
Sbjct: 437 CLAKKSSGAKLRRHFSPAQIREIPLRASSN-EPHAFLQVDALGRVPFSSIRRPRPLLEVE 495
Query: 490 PPNSSRASSPEQNVSEKPLEQEPLLAARVTIEDGLCLLLDVDDIDRFLQFN-QLQDGGIQ 548
PPNSSRA EQN+SEKPLEQEP+LAARVTIEDGLC+LLDVDDIDRFLQFN QLQDGG
Sbjct: 496 PPNSSRAGGSEQNISEKPLEQEPMLAARVTIEDGLCVLLDVDDIDRFLQFNKQLQDGGDH 555
Query: 549 LKHKRQGLLEGLAASLHLVDPLGKSGHTVMHVANDDVVFLRIVSLPKGRKLLARYLQILF 608
LK KRQGLLEGL+ASL LVDPLG+ G TV A DD VFLRI+SLPKGRKLLARYL++LF
Sbjct: 556 LKRKRQGLLEGLSASLQLVDPLGRIGQTVELAAKDDYVFLRILSLPKGRKLLARYLELLF 615
Query: 609 PGGELMRIVCMAIFRHFRFLFGGLPSDPVAAETVSNLARVVSKCLREMDXXXXXXXXXXX 668
PGG+LMRIVCMAIFRH RFLFG LPSDPVA+ETV NLA+VVSKC+REMD
Sbjct: 616 PGGDLMRIVCMAIFRHLRFLFGALPSDPVASETVINLAKVVSKCVREMDLVALSVCLASV 675
Query: 669 XXXXEQPPLRPLGSPAGDGASLILVSVLERATELLTDPHAASNYNIANRSLWQASFDEFF 728
E PPLRPLGSP GDG SLILVSVLERATE+LTDPHAASNYNI NRSLWQASFDEFF
Sbjct: 676 VCSAEPPPLRPLGSPVGDGTSLILVSVLERATEVLTDPHAASNYNIRNRSLWQASFDEFF 735
Query: 729 GLLAKYCVNKYDSIMQSFLNQGTPNMAVIGSEAARAISREMPVELLRASLPHTDDRQKKI 788
GLL KYCVNKYDSIMQSFL QGT NMAVIG + ARAISREMPVELLRASLPHTDDRQKKI
Sbjct: 736 GLLTKYCVNKYDSIMQSFLAQGTSNMAVIGPDVARAISREMPVELLRASLPHTDDRQKKI 795
Query: 789 LLDFAHRSLPVVGFNSSAGGNGNHVNSESVL 819
LLDFA RS VVG++S+AGGNG+H NS+SV+
Sbjct: 796 LLDFAQRS--VVGYSSNAGGNGHHANSQSVM 824
>Medtr2g069530.1 | topoisomerase II-associated protein PAT1 | HC |
chr2:28933010-28942411 | 20130731
Length = 880
Score = 885 bits (2286), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 491/857 (57%), Positives = 565/857 (65%), Gaps = 50/857 (5%)
Query: 14 QSGDVPTEG-EVFDASQYAFFGKXXXXXXXXXXXXXXXXXPXXXXXXXXXXXXXXXXXXX 72
Q G VP G VFDASQYAFFGK P
Sbjct: 24 QLGHVPVSGGAVFDASQYAFFGKDAVQEVELGGLEEDARLPTFESNEEEFFFNREQDEDV 83
Query: 73 XXXXXXX--XTNFWKLNKVESGPKSAAVIGDQGSRENSTAEWANRNDVQNWFEQSAYDSE 130
T FWKLNK SGP++ VIG++GSRENST++ A R+DV +WF+Q+AYDSE
Sbjct: 84 RSLSDIDDLTTTFWKLNKAVSGPRNPGVIGERGSRENSTSDLAQRDDVHSWFDQNAYDSE 143
Query: 131 GSLDGRRLSSQPYSSLSHLPEPKPLYRTASYPEXXXXXXXXX-XXGPGESVPNWFDQHAY 189
GS+DG+R SSQP SSL+HL K LYRT+SYPE E V NW DQH Y
Sbjct: 144 GSIDGKRWSSQPQSSLAHLQNSKSLYRTSSYPEQLRQEQNYHLQHCSSEPVHNWLDQHIY 203
Query: 190 DSETTQDGKRWSSQPHSSIAHLEESKSLYRTSLYPEKQQEHPHFSSEPVLVPNSSFTXXX 249
D ET D KRWSSQPH+SI HL ES LYRTS YP+KQQE FSSEP+LV SSFT
Sbjct: 204 DIETAHDEKRWSSQPHTSIPHLHESTPLYRTSSYPDKQQELTRFSSEPILVQKSSFTSYP 263
Query: 250 XXXXXXXXXXXXNNTGYLNIP-HNAIGAQMALSSQNRSRFSNPALQLGGLNHGLPFSGNM 308
++TG+LNIP H AQMALSSQNR FSN ALQL GLN G F GN
Sbjct: 264 PPGGRSQQGSPSHSTGHLNIPYHTGAAAQMALSSQNRPHFSNSALQLSGLNLGSHFGGNT 323
Query: 309 NQFPTGS-PFNQRIQNQLVNQAGFYSGDHPNFSSGL------------------------ 343
Q+ TGS P +QRIQNQLV+QAG Y GD N S +
Sbjct: 324 RQYSTGSSPLSQRIQNQLVSQAGLYPGDRSNLLSNMLQQQLQLHNGSVSPHLMTQLQQQQ 383
Query: 344 -----PMLNKYDQMLG---------------IMELRDQLPKSALLGRQNLRFPPQGFDLS 383
P+ + G + D KS+ G+ + R QG D S
Sbjct: 384 HRLHHPVQQSAGYLSGFQSHLFNPHLSPSSSVNSKYDHRHKSSQKGKHSHRLSHQGSDAS 443
Query: 384 FNRSNNGWPRFRSKYMTTEEIENILRMQLAATHSNDPYVDDYYHQGCLAKKSAGAKLRHH 443
+S + +FRSK+MT++EIE+ILRMQLA THSNDPY+DDYYHQG LAKK + AKL+H
Sbjct: 444 SQKSESSSLQFRSKHMTSDEIESILRMQLAVTHSNDPYIDDYYHQGRLAKKPSVAKLKHP 503
Query: 444 FCPNQIKEHPLRGSANTEQHAFLQVDALGRVPFSSIRRPRPLLEVDPPNSSRASSPEQNV 503
FCP QIKE P R ++ + HAFLQVD+LGRV FSSIRRPRPLLE D NSS S + +
Sbjct: 504 FCPTQIKELPSRTRSSNDPHAFLQVDSLGRVSFSSIRRPRPLLEADSSNSSIPGSSDLKI 563
Query: 504 SEKPLEQEPLLAARVTIEDGLCLLLDVDDIDRFLQFNQLQDGGIQLKHKRQGLLEGLAAS 563
SEKPLEQEP AARVTIEDGLCLLLDVDDIDR+LQ NQ QDGG QL+ +RQ LLEGLA S
Sbjct: 564 SEKPLEQEPTFAARVTIEDGLCLLLDVDDIDRYLQSNQPQDGGTQLRRRRQVLLEGLATS 623
Query: 564 LHLVDPLGKSGHTVMHVANDDVVFLRIVSLPKGRKLLARYLQILFPGGELMRIVCMAIFR 623
L LVDPLGK+GH V A DD+VF+RIVS+PKGRKLLA+YLQ+L PG +LMRIVCM + R
Sbjct: 624 LQLVDPLGKNGHKVGLAAKDDLVFIRIVSVPKGRKLLAKYLQLLLPGSDLMRIVCMTVLR 683
Query: 624 HFRFLFGGLPSDPVAAETVSNLARVVSKCLREMDXXXXXXXXXXXXXXXEQPPLRPLGSP 683
H RFLFGGLPSDP AA T NLA VV +C+R MD E PPLRP+GS
Sbjct: 684 HLRFLFGGLPSDPAAAVTTCNLAEVVCQCIRGMDLGALSACLAAVVCSAEHPPLRPVGST 743
Query: 684 AGDGASLILVSVLERATELLTDPHAASNYNIANRSLWQASFDEFFGLLAKYCVNKYDSIM 743
AGDGASLILVSVLERATELLTDP AA NYN+ NRS WQASFDEFFGL+ KYC+NKY SIM
Sbjct: 744 AGDGASLILVSVLERATELLTDPQAACNYNMGNRSFWQASFDEFFGLITKYCMNKYHSIM 803
Query: 744 QSFLNQGTPNMAVIGSEAARAISREMPVELLRASLPHTDDRQKKILLDFAHRSLPVVGFN 803
QS L Q PN+ IGS+AA+AIS+EMPVELLRASLPHTDDRQ+K+LLDFA RS+PV+GFN
Sbjct: 804 QSLLLQNAPNVDDIGSDAAKAISKEMPVELLRASLPHTDDRQRKLLLDFAQRSVPVIGFN 863
Query: 804 SSAGGNGNHVNSESVLS 820
S+AGG+G VNSE+VLS
Sbjct: 864 SNAGGSGGLVNSETVLS 880