Miyakogusa Predicted Gene
- Lj2g3v3105900.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v3105900.1 Non Characterized Hit- tr|I1JDH7|I1JDH7_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.48939
PE,87.76,0,Beta-Casp,Beta-Casp domain;
Lactamase_B,Beta-lactamase-like;
Metallo-hydrolase/oxidoreductase,NULL; ,CUFF.39714.1
(392 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value

AT2G01730.1 | Symbols: ATCPSF73-II, EDA26, CPSF73-II | cleavage ...    684   0.0
AT1G61010.3 | Symbols: CPSF73-I | cleavage and polyadenylation s...    271   5e-73
AT1G61010.2 | Symbols: CPSF73-I | cleavage and polyadenylation s...    271   5e-73
AT1G61010.1 | Symbols: CPSF73-I | cleavage and polyadenylation s...    271   5e-73
AT5G23880.1 | Symbols: EMB1265, CPSF100, ESP5, ATCPSF100 | cleav...    144   1e-34
AT3G07530.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Beta-Casp ...     50   3e-06
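
The hit summary above can be read programmatically. The following is a minimal sketch, assuming every row starts with the sequence identifier and ends with the bit score followed by the E-value. The names `parse_summary` and `LINE_RE` are hypothetical and are not part of BLAST. The handling of a bare exponent such as `e-100` is also an assumption about abbreviated E-values and is not exercised by this report.

```python
import re

# Rows copied from the hit summary of this report.
SUMMARY = """\
AT2G01730.1 | Symbols: ATCPSF73-II, EDA26, CPSF73-II | cleavage ... 684 0.0
AT1G61010.3 | Symbols: CPSF73-I | cleavage and polyadenylation s... 271 5e-73
AT1G61010.2 | Symbols: CPSF73-I | cleavage and polyadenylation s... 271 5e-73
AT1G61010.1 | Symbols: CPSF73-I | cleavage and polyadenylation s... 271 5e-73
AT5G23880.1 | Symbols: EMB1265, CPSF100, ESP5, ATCPSF100 | cleav... 144 1e-34
AT3G07530.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Beta-Casp ... 50 3e-06
"""

# Hypothetical pattern: identifier, free-text description, bit score, E-value.
LINE_RE = re.compile(
    r"^(?P<id>\S+)\s+\|.*?\s(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_summary(text):
    """Return a list of (identifier, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if not match:
            continue  # skip headers and blank lines
        evalue = match.group("evalue")
        if evalue.startswith("e"):
            # Assumption: a bare exponent such as "e-100" means 1e-100.
            evalue = "1" + evalue
        hits.append(
            (match.group("id"), float(match.group("bits")), float(evalue))
        )
    return hits


hits = parse_summary(SUMMARY)
for seq_id, bits, evalue in hits:
    print(seq_id, bits, evalue)
```

The three AT1G61010 rows are kept as separate hits because they are distinct gene models, even though their scores are identical.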
>AT2G01730.1 | Symbols: ATCPSF73-II, EDA26, CPSF73-II | cleavage and
polyadenylation specificity factor 73 kDa subunit-II |
chr2:320597-323845 FORWARD LENGTH=613
Length = 613
Score = 684 bits (1766), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/393 (80%), Positives = 359/393 (91%), Gaps = 1/393 (0%)
Query: 1 MAIETLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGHLDHRRYPDFSLISQ-GHYDAAL 59
MAI+ LVLGAGQE+GKSCVVVTINGK+IMFDCGMHMG DH RYP+FSLIS+ G +D A+
Sbjct: 1 MAIDCLVLGAGQEIGKSCVVVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAI 60
Query: 60 SCIIITHFHLDHVGALAYFTEVCGYRGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEELF 119
SCIIITHFH+DHVGAL YFTEVCGY GPIYM+YPTKAL+PLMLEDYR+VMVDRRGEEELF
Sbjct: 61 SCIIITHFHMDHVGALPYFTEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEEELF 120
Query: 120 TSENIAECMKKVIAIDLRQTVQVDEDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNM 179
T+ +IA CMKKVIAIDL+QT+QVDEDLQIRAYYAGHV+GA M YAK+GDA +VYTGDYNM
Sbjct: 121 TTTHIANCMKKVIAIDLKQTIQVDEDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNM 180
Query: 180 TADRHLGAAQIDRLRLDLLITESTYATTIRDSKYAREREFLKAVHKCVSGGGKVLIPTFA 239
T DRHLGAA+IDRL+LDLLI+ESTYATTIR SKY REREFL+AVHKCV+GGGK LIP+FA
Sbjct: 181 TTDRHLGAAKIDRLQLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFA 240
Query: 240 LGRAQELCILLDDYWERMNLKVPIYFSAGLTIQANMYYKMLISWTSQKIKDTYSTHNAFD 299
LGRAQELC+LLDDYWERMN+KVPIYFS+GLTIQANMYYKMLISWTSQ +K+ ++THN FD
Sbjct: 241 LGRAQELCMLLDDYWERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTHNPFD 300
Query: 300 FKNVHHFERSMINAPGPCVLFATPGMISGGFSLEVFKHWAPSENNLITLPGYCVAGTVGH 359
FKNV F+RS+I+APGPCVLFATPGM+ GFSLEVFKHWAPS NL+ LPGY VAGTVGH
Sbjct: 301 FKNVKDFDRSLIHAPGPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGH 360
Query: 360 RLMSGKATKVDVDPDTQIDVRCQIHQLAFSPHT 392
+LM+GK T VD+ T++DVRC++HQ+AFSPHT
Sbjct: 361 KLMAGKPTTVDLYNGTKVDVRCKVHQVAFSPHT 393
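
The percentages on the "Identities" line can be checked against the raw counts. The sketch below assumes the percentage is the count divided by the alignment length and truncated to an integer. That assumption is consistent with the values in this report (for example, 10/387 is shown as 2%, not 3%). The function name `blast_percent` is hypothetical.

```python
def blast_percent(count, alignment_length):
    """Percentage as shown in this report: truncated, not rounded."""
    return (100 * count) // alignment_length


# (count, alignment length, percentage printed in the report)
reported = [
    (315, 393, 80),  # AT2G01730.1 identities
    (359, 393, 91),  # AT2G01730.1 positives
    (1, 393, 0),     # AT2G01730.1 gaps
    (145, 387, 37),  # AT1G61010 identities
    (221, 387, 57),  # AT1G61010 positives
    (10, 387, 2),    # AT1G61010 gaps
]

for count, length, shown in reported:
    computed = blast_percent(count, length)
    print(f"{count}/{length} -> {computed}% (report shows {shown}%)")
```

The alignment length of 393 for AT2G01730.1 is the 392-residue query plus the single gap column.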
>AT1G61010.3 | Symbols: CPSF73-I | cleavage and polyadenylation
specificity factor 73-I | chr1:22474954-22477660 REVERSE
LENGTH=693
Length = 693
Score = 271 bits (693), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 145/387 (37%), Positives = 221/387 (57%), Gaps = 10/387 (2%)
Query: 8 LGAGQEVGKSCVVVTINGKRIMFDCGMHMGHLDHRRYPDFSLISQGHYDAALSCIIITHF 67
LGAG EVG+SCV ++ GK I+FDCG+H + P F I D ++ITHF
Sbjct: 27 LGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPSSID----VLLITHF 82
Query: 68 HLDHVGALAYFTEVCGYRGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEELFTSENIAEC 127
H+DH +L YF E + G ++MT+ TKA+ L+L DY KV E+ LF ++I +
Sbjct: 83 HIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQDINKS 141
Query: 128 MKKVIAIDLRQTVQVDEDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNMTADRHLGA 187
M K+ ID QTV+V+ ++ Y AGHV+GAAMF + ++YTGDY+ DRHL A
Sbjct: 142 MDKIEVIDFHQTVEVN-GIKFWCYTAGHVLGAAMFMVDIAGVRILYTGDYSREEDRHLRA 200
Query: 188 AQIDRLRLDLLITESTYATTIRDSKYAREREFLKAVHKCVSGGGKVLIPTFALGRAQELC 247
A++ + D+ I EST + S++ RE+ F +H V+ GG+VLIP FALGRAQEL
Sbjct: 201 AELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRVLIPAFALGRAQELL 260
Query: 248 ILLDDYW-ERMNL-KVPIYFSAGLTIQANMYYKMLISWTSQKIKDTYSTHNAFDFKNVHH 305
++LD+YW +L +PIY+++ L + Y+ I + +I++ ++ N F FK++
Sbjct: 261 LILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFANSNPFVFKHISP 320
Query: 306 FER-SMINAPGPCVLFATPGMISGGFSLEVFKHWAPSENNLITLPGYCVAGTVGHRLMSG 364
N GP V+ ATPG + G S ++F W + N +PGY V GT+ +++
Sbjct: 321 LNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVEGTLAKTIIN- 379
Query: 365 KATKVDVDPDTQIDVRCQIHQLAFSPH 391
+ +V + + Q+H ++FS H
Sbjct: 380 EPKEVTLMNGLTAPLNMQVHYISFSAH 406
>AT1G61010.2 | Symbols: CPSF73-I | cleavage and polyadenylation
specificity factor 73-I | chr1:22474954-22477660 REVERSE
LENGTH=693
Length = 693
Score = 271 bits (693), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 145/387 (37%), Positives = 221/387 (57%), Gaps = 10/387 (2%)
Query: 8 LGAGQEVGKSCVVVTINGKRIMFDCGMHMGHLDHRRYPDFSLISQGHYDAALSCIIITHF 67
LGAG EVG+SCV ++ GK I+FDCG+H + P F I D ++ITHF
Sbjct: 27 LGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPSSID----VLLITHF 82
Query: 68 HLDHVGALAYFTEVCGYRGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEELFTSENIAEC 127
H+DH +L YF E + G ++MT+ TKA+ L+L DY KV E+ LF ++I +
Sbjct: 83 HIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQDINKS 141
Query: 128 MKKVIAIDLRQTVQVDEDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNMTADRHLGA 187
M K+ ID QTV+V+ ++ Y AGHV+GAAMF + ++YTGDY+ DRHL A
Sbjct: 142 MDKIEVIDFHQTVEVN-GIKFWCYTAGHVLGAAMFMVDIAGVRILYTGDYSREEDRHLRA 200
Query: 188 AQIDRLRLDLLITESTYATTIRDSKYAREREFLKAVHKCVSGGGKVLIPTFALGRAQELC 247
A++ + D+ I EST + S++ RE+ F +H V+ GG+VLIP FALGRAQEL
Sbjct: 201 AELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRVLIPAFALGRAQELL 260
Query: 248 ILLDDYW-ERMNL-KVPIYFSAGLTIQANMYYKMLISWTSQKIKDTYSTHNAFDFKNVHH 305
++LD+YW +L +PIY+++ L + Y+ I + +I++ ++ N F FK++
Sbjct: 261 LILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFANSNPFVFKHISP 320
Query: 306 FER-SMINAPGPCVLFATPGMISGGFSLEVFKHWAPSENNLITLPGYCVAGTVGHRLMSG 364
N GP V+ ATPG + G S ++F W + N +PGY V GT+ +++
Sbjct: 321 LNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVEGTLAKTIIN- 379
Query: 365 KATKVDVDPDTQIDVRCQIHQLAFSPH 391
+ +V + + Q+H ++FS H
Sbjct: 380 EPKEVTLMNGLTAPLNMQVHYISFSAH 406
>AT1G61010.1 | Symbols: CPSF73-I | cleavage and polyadenylation
specificity factor 73-I | chr1:22474954-22477660 REVERSE
LENGTH=693
Length = 693
Score = 271 bits (693), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 145/387 (37%), Positives = 221/387 (57%), Gaps = 10/387 (2%)
Query: 8 LGAGQEVGKSCVVVTINGKRIMFDCGMHMGHLDHRRYPDFSLISQGHYDAALSCIIITHF 67
LGAG EVG+SCV ++ GK I+FDCG+H + P F I D ++ITHF
Sbjct: 27 LGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPSSID----VLLITHF 82
Query: 68 HLDHVGALAYFTEVCGYRGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEELFTSENIAEC 127
H+DH +L YF E + G ++MT+ TKA+ L+L DY KV E+ LF ++I +
Sbjct: 83 HIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQDINKS 141
Query: 128 MKKVIAIDLRQTVQVDEDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNMTADRHLGA 187
M K+ ID QTV+V+ ++ Y AGHV+GAAMF + ++YTGDY+ DRHL A
Sbjct: 142 MDKIEVIDFHQTVEVN-GIKFWCYTAGHVLGAAMFMVDIAGVRILYTGDYSREEDRHLRA 200
Query: 188 AQIDRLRLDLLITESTYATTIRDSKYAREREFLKAVHKCVSGGGKVLIPTFALGRAQELC 247
A++ + D+ I EST + S++ RE+ F +H V+ GG+VLIP FALGRAQEL
Sbjct: 201 AELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRVLIPAFALGRAQELL 260
Query: 248 ILLDDYW-ERMNL-KVPIYFSAGLTIQANMYYKMLISWTSQKIKDTYSTHNAFDFKNVHH 305
++LD+YW +L +PIY+++ L + Y+ I + +I++ ++ N F FK++
Sbjct: 261 LILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFANSNPFVFKHISP 320
Query: 306 FER-SMINAPGPCVLFATPGMISGGFSLEVFKHWAPSENNLITLPGYCVAGTVGHRLMSG 364
N GP V+ ATPG + G S ++F W + N +PGY V GT+ +++
Sbjct: 321 LNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVEGTLAKTIIN- 379
Query: 365 KATKVDVDPDTQIDVRCQIHQLAFSPH 391
+ +V + + Q+H ++FS H
Sbjct: 380 EPKEVTLMNGLTAPLNMQVHYISFSAH 406
>AT5G23880.1 | Symbols: EMB1265, CPSF100, ESP5, ATCPSF100 | cleavage
and polyadenylation specificity factor 100 |
chr5:8052550-8058147 FORWARD LENGTH=739
Length = 739
Score = 144 bits (362), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 171/363 (47%), Gaps = 25/363 (6%)
Query: 20 VVTINGKRIMFDCGMHMGHLDHRRYPDFSLISQ-GHYDAALSCIIITHFHLDHVGALAYF 78
+V+I+G + DCG + D SL+ + + ++++H H+GAL Y
Sbjct: 22 LVSIDGFNFLIDCGWN-------DLFDTSLLEPLSRVASTIDAVLLSHPDTLHIGALPYA 74
Query: 79 TEVCGYRGPIYMTYPTKALAPLMLEDY---RKVMVDRRGEEELFTSENIAECMKKVIAID 135
+ G P+Y T P L L + D RK + D +LFT ++I + VI +
Sbjct: 75 MKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDF----DLFTLDDIDSAFQNVIRLT 130
Query: 136 LRQTVQVD---EDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNMTADRHLGAAQIDR 192
Q + E + I + AGH++G +++ +++Y DYN +RHL +
Sbjct: 131 YSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKERHLNGTVLQS 190
Query: 193 -LRLDLLITESTYAT-TIRDSKYAREREFLKAVHKCVSGGGKVLIPTFALGRAQELCILL 250
+R +LIT++ +A T + ++ R++EFL + K + GG VL+P GR EL ++L
Sbjct: 191 FVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTAGRVLELLLIL 250
Query: 251 DDYWERMNLKVPIYFSAGLTIQANMYYKMLISWTSQKIKDTYSTH--NAFDFKNVHHF-- 306
+ +W + PIYF ++ Y K + W S I ++ T NAF ++V
Sbjct: 251 EQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAFLLRHVTLLIN 310
Query: 307 ERSMINA-PGPCVLFATPGMISGGFSLEVFKHWAPSENNLITLPGYCVAGTVGHRLMSGK 365
+ + NA PGP V+ A+ + GF+ E+F WA NL+ GT+ L S
Sbjct: 311 KTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFGTLARMLQSAP 370
Query: 366 ATK 368
K
Sbjct: 371 PPK 373
>AT3G07530.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Beta-Casp
domain (InterPro:IPR022712); BEST Arabidopsis thaliana
protein match is: cleavage and polyadenylation
specificity factor 73 kDa subunit-II (TAIR:AT2G01730.1);
Has 624 Blast hits to 615 proteins in 160 species:
Archae - 54; Bacteria - 6; Metazoa - 333; Fungi - 44;
Plants - 93; Viruses - 0; Other Eukaryotes - 94 (source:
NCBI BLink). | chr3:2400793-2404280 FORWARD LENGTH=699
Length = 699
Score = 49.7 bits (117), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 83/385 (21%), Positives = 148/385 (38%), Gaps = 95/385 (24%)
Query: 55 YDAALSCIIITHFHLDHVGALAYFTEVCGYRGPIYMTYPTKALAPLMLED---------- 104
++A+ I++ + +G L + T+ G+ IYMT T + LM+ED
Sbjct: 98 WEASFIDIVLISNPMGLLG-LPFLTQNPGFFAKIYMTEVTAKIGQLMMEDIVSMHKEFRC 156
Query: 105 ------------------------YRKVMVDRRGEE-----ELFTSENIAECMKKVIAID 135
+KV+ G++ L++ ++I CMKKV +
Sbjct: 157 FHGPDNSSFPGWIKNLDSEQVPALLKKVVFGESGDDLGSWMRLYSLDDIESCMKKVQGVK 216
Query: 136 LRQTVQVDEDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNMTADRHLGAAQIDRLR- 194
+ V + L I+A +G IGA + + + Y D ++ H + L+
Sbjct: 217 FAEEVCYNGTLIIKALSSGLDIGACNWLINGPNGSLSYVSD-SIFVSHHARSFDFHGLKE 275
Query: 195 LDLLI---------------------TESTYATTIRDSKYA--------REREFLKAVHK 225
D+LI +++ Y +TI D+K + E E L V
Sbjct: 276 TDVLIYSDFSSLQSAEVTEDGCISPDSDNNYISTISDNKDSLLNTEDSLEEMEKLAFVCS 335
Query: 226 CVS----GGGKVLIPTFALGRAQELCILLDDYWERMNLKVPIYFSAGLTIQANMYYKMLI 281
C + GG LI +G +L LL + E +LKVPI+ + + + Y +
Sbjct: 336 CAAESADAGGSTLITITRIGIVLQLLELLSNSLESSSLKVPIFVISSVAEELLAYTNTIP 395
Query: 282 SW-TSQKIKDTYSTHNAF------DFKNVHHFERSMINAPG-----------PCVLFATP 323
W Q+ + S +F K +H F I++P PC++FA+
Sbjct: 396 EWLCEQRQEKLISGEPSFGHLKFIKNKKIHLF--PAIHSPNLIYANRTSWQEPCIVFASH 453
Query: 324 GMISGGFSLEVFKHWAPSENNLITL 348
+ G S+++ + W +L+ L
Sbjct: 454 WSLRLGPSVQLLQRWRGDPKSLLVL 478