Miyakogusa Predicted Gene
- Lj2g3v0486970.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v0486970.1 Non Chatacterized Hit- tr|I1MQ84|I1MQ84_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.26345
PE,88.52,0,MMS1_N,NULL; seg,NULL; CLEAVAGE AND POLYADENYLATION
SPECIFICITY FACTOR SUBUNIT 1 (CLEAVAGE AND POLYA,CUFF.34661.1
(999 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G51660.1 | Symbols: CPSF160, ATCPSF160 | cleavage and polyade... 1413 0.0
AT4G21100.1 | Symbols: DDB1B | damaged DNA binding protein 1B | ... 50 9e-06
>AT5G51660.1 | Symbols: CPSF160, ATCPSF160 | cleavage and
polyadenylation specificity factor 160 |
chr5:20980250-20989268 FORWARD LENGTH=1442
Length = 1442
Score = 1413 bits (3657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 682/1005 (67%), Positives = 801/1005 (79%), Gaps = 17/1005 (1%)
Query: 1 MSFAAYKMMQWPTGVEHCASGFLTHSRADLASHTIIXXXXXXXXXXXXXXXRHPSPLPNL 60
MSFAAYKMM WPTGVE+CASG++THS +D I R PLPN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 61 VVTAANVLEVYIVRLQD-------RHPSKPSSARGTLLDGISGASLELVCHYRLHGNVKS 113
V+TAAN+LEVYIVR Q+ R+P + RG ++DG+ G SLELVCHYRLHGNV+S
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKL--AKRGGVMDGVYGVSLELVCHYRLHGNVES 118
Query: 114 MAVLSIGGGDTSKRRDSIILTFKDAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGRE 173
+AVL +GGG++SK RDSIILTF+DAKISVLE+DDSIH LR +S+HCFEGP+WLHLKRGRE
Sbjct: 119 IAVLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRE 178
Query: 174 QFARGPVVKVDPQGRCGGVLVYDLQMIILKTTQAGSGLVGEDDPLGSGGAVAARIDSSYM 233
F RGP+VKVDPQGRCGGVLVY LQMIILKT+Q GSGLVG+DD SGG V+AR++SSY+
Sbjct: 179 SFPRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYI 238
Query: 234 INLRDLDMRHVKDFTFVHGYIEPVMVILHERELSWAGRVAWKHHTCMISALSISTTLKQH 293
INLRDL+M+HVKDF F+HGYIEPV+VIL E E +WAGRV+WKHHTC++SALSI++TLKQH
Sbjct: 239 INLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQH 298
Query: 294 PLIWSAVNLPHDAYKLIAVPSPIGGVLVIAANTIHYHSQSASCALALNSYAVSLDSSQEM 353
P+IWSA+NLPHDAYKL+AVPSPIGGVLV+ ANTIHYHSQSASCALALN+YA S DSSQE+
Sbjct: 299 PVIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQEL 358
Query: 354 PRSFFNVELDAANATWLSNDVAXXXXXXXXXXXXXXIFDGRVVQRLDLSKSKASVLSSGI 413
P S F+VELDAA+ TW+SNDVA I+DGR VQRLDLSKSKASVL+S I
Sbjct: 359 PASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDI 418
Query: 414 TTIGNSLFFLASRLGDSMLVQFSCGSSVSMLSSNLKEEVGDIEGDASSTKRLRRSPSDSL 473
T++GNSLFFL SRLGDS+LVQFSC S + L++E DIEG+ KRLR + SD+
Sbjct: 419 TSVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMT-SDTF 477
Query: 474 HDMVSGEELSLYGSAPNRTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADANATGIAK 533
D + EELSL+GS PN ++SAQKSFSFAVRDSL+N+GP+KDF+YGLRINADANATG++K
Sbjct: 478 QDTIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSK 537
Query: 534 QSNYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSARSHISDSSKLA 593
QSNYELVCCSGHGKNG+LCVLRQSIRPE+ITEVELPGCKGIWTVYHKS+R H +DSSK+A
Sbjct: 538 QSNYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMA 597
Query: 594 DDDDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYER 653
D+DEYHAYLIISLEARTMVLETADLL+EVTESVDYYVQG+T+AAGNLFGRRRVIQV+E
Sbjct: 598 ADEDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEH 657
Query: 654 GARILDGSFMTQDIXXXXXXXXXXXXXXXALALSVSIADPYVLLRMSDGSIRLLVGDPST 713
GARILDGSFM Q++ + SVSIADPYVLLRM+D SIRLLVGDPST
Sbjct: 658 GARILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPST 717
Query: 714 CTISVTXXXXXXXXXXXXXXCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAPQDH 773
CT+S++ CTLYHDKGPEPWLRK STDAWLS+GVGEA+D DG PQD
Sbjct: 718 CTVSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQ 777
Query: 774 GDIYCVVCYENGNLEIFDVPNFSCVFSVENFMSGKSHLVDALTKEVAKDSQKGDKGSDAV 833
GDIYCVVCYE+G LEIFDVP+F+CVFSV+ F SG+ HL D E+ + K +
Sbjct: 778 GDIYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHELEYELNKNSED---- 833
Query: 834 ANQGRKENVLNMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDS 893
N KE + N +VVELAMQRWSG H+RPFLF +L+DGTILCYHAYL++ D T K E+S
Sbjct: 834 -NTSSKE-IKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENS 890
Query: 894 VSASGPVDLSSTSVSRLRNLRFVRLPLDAYPREETSNGSPGQHITIFKNIGSYEGFFLSG 953
+S+ P L+S+ S+LRNL+F+R+PLD RE TS+G Q IT+FKNI ++GFFLSG
Sbjct: 891 LSSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSG 950
Query: 954 SRPAWVMVLRERLRVHPQLCDGSILAFTVLHNVNCNHGLIYVTSQ 998
SRP W M+ RERLR H QLCDGSI AFTVLHNVNCNHG IYVT+Q
Sbjct: 951 SRPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQ 995
>AT4G21100.1 | Symbols: DDB1B | damaged DNA binding protein 1B |
chr4:11258916-11265309 REVERSE LENGTH=1088
Length = 1088
Score = 50.1 bits (118), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 80/359 (22%), Positives = 147/359 (40%), Gaps = 69/359 (19%)
Query: 94 ISGASLELVCHYRLHGNVKSMAVLSIGGGDTSKRRDSIILTFKDAKISVLEYDDSIHGLR 153
+S L+ + L+G + +M + G + +D + + + K VL++D L
Sbjct: 45 LSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWDYESSELI 100
Query: 154 TSSLHCFEGPEWLHLKRGREQFARGPVVKVDPQGRCGGVLVYDLQMIILKTTQAGSGLVG 213
T ++ G + R + G + +DP R G+ +YD ++
Sbjct: 101 TRAM----GDVSDRIGRPTDN---GQIGIIDPDCRVIGLHLYDGLFKVI----------- 142
Query: 214 EDDPLGSGGAVAARIDSSYMINLRDLDMRHVKDFTFVHGYIEPVMVILHERELSWAGRVA 273
P + G ++ ++ I L +L V D F++G +P + +L++ + A
Sbjct: 143 ---PFDNKG----QLKEAFNIRLEEL---QVLDIKFLYGCTKPTIAVLYQD-----NKDA 187
Query: 274 WKHHTCMISALSISTTLKQHPLI---WSAVNLPHDAYKLIAVPSPIGGVLVIAANTIHYH 330
T +S LK + WS NL + A LI VPSP+ GVL+I TI Y
Sbjct: 188 RHVKTYEVS-------LKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYC 240
Query: 331 SQSASCALALNSYAVSLDSSQEMPRSFFNVELDAANATWLSNDVAXXXXXXXXXXXXXXI 390
S +A A+ + + +++ V+LD + +L D A
Sbjct: 241 SANAFKAIPIRP---------SITKAYGRVDLDGSR--YLLGDHAGLIHLLVITH----- 284
Query: 391 FDGRVVQRLDLSKSKASVLSSGITTIGNSLFFLASRLGDSMLVQFSC-----GSSVSML 444
+ V L + + ++S I+ + N++ F+ S GDS L++ + GS V +L
Sbjct: 285 -EKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSYVEIL 342