Miyakogusa Predicted Gene

Lj2g3v3105900.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj2g3v3105900.1 Non Characterized Hit- tr|I1JDH7|I1JDH7_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.48939
PE,87.76,0,Beta-Casp,Beta-Casp domain;
Lactamase_B,Beta-lactamase-like;
Metallo-hydrolase/oxidoreductase,NULL; ,CUFF.39714.1
         (392 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G01730.1 | Symbols: ATCPSF73-II, EDA26, CPSF73-II | cleavage ...   684   0.0  
AT1G61010.3 | Symbols: CPSF73-I | cleavage and polyadenylation s...   271   5e-73
AT1G61010.2 | Symbols: CPSF73-I | cleavage and polyadenylation s...   271   5e-73
AT1G61010.1 | Symbols: CPSF73-I | cleavage and polyadenylation s...   271   5e-73
AT5G23880.1 | Symbols: EMB1265, CPSF100, ESP5, ATCPSF100 | cleav...   144   1e-34
AT3G07530.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Beta-Casp ...    50   3e-06

>AT2G01730.1 | Symbols: ATCPSF73-II, EDA26, CPSF73-II | cleavage and
           polyadenylation specificity factor 73 kDa subunit-II |
           chr2:320597-323845 FORWARD LENGTH=613
          Length = 613

 Score =  684 bits (1766), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/393 (80%), Positives = 359/393 (91%), Gaps = 1/393 (0%)

Query: 1   MAIETLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGHLDHRRYPDFSLISQ-GHYDAAL 59
           MAI+ LVLGAGQE+GKSCVVVTINGK+IMFDCGMHMG  DH RYP+FSLIS+ G +D A+
Sbjct: 1   MAIDCLVLGAGQEIGKSCVVVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAI 60

Query: 60  SCIIITHFHLDHVGALAYFTEVCGYRGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEELF 119
           SCIIITHFH+DHVGAL YFTEVCGY GPIYM+YPTKAL+PLMLEDYR+VMVDRRGEEELF
Sbjct: 61  SCIIITHFHMDHVGALPYFTEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEEELF 120

Query: 120 TSENIAECMKKVIAIDLRQTVQVDEDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNM 179
           T+ +IA CMKKVIAIDL+QT+QVDEDLQIRAYYAGHV+GA M YAK+GDA +VYTGDYNM
Sbjct: 121 TTTHIANCMKKVIAIDLKQTIQVDEDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNM 180

Query: 180 TADRHLGAAQIDRLRLDLLITESTYATTIRDSKYAREREFLKAVHKCVSGGGKVLIPTFA 239
           T DRHLGAA+IDRL+LDLLI+ESTYATTIR SKY REREFL+AVHKCV+GGGK LIP+FA
Sbjct: 181 TTDRHLGAAKIDRLQLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFA 240

Query: 240 LGRAQELCILLDDYWERMNLKVPIYFSAGLTIQANMYYKMLISWTSQKIKDTYSTHNAFD 299
           LGRAQELC+LLDDYWERMN+KVPIYFS+GLTIQANMYYKMLISWTSQ +K+ ++THN FD
Sbjct: 241 LGRAQELCMLLDDYWERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTHNPFD 300

Query: 300 FKNVHHFERSMINAPGPCVLFATPGMISGGFSLEVFKHWAPSENNLITLPGYCVAGTVGH 359
           FKNV  F+RS+I+APGPCVLFATPGM+  GFSLEVFKHWAPS  NL+ LPGY VAGTVGH
Sbjct: 301 FKNVKDFDRSLIHAPGPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGH 360

Query: 360 RLMSGKATKVDVDPDTQIDVRCQIHQLAFSPHT 392
           +LM+GK T VD+   T++DVRC++HQ+AFSPHT
Sbjct: 361 KLMAGKPTTVDLYNGTKVDVRCKVHQVAFSPHT 393


>AT1G61010.3 | Symbols: CPSF73-I | cleavage and polyadenylation
           specificity factor 73-I | chr1:22474954-22477660 REVERSE
           LENGTH=693
          Length = 693

 Score =  271 bits (693), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 145/387 (37%), Positives = 221/387 (57%), Gaps = 10/387 (2%)

Query: 8   LGAGQEVGKSCVVVTINGKRIMFDCGMHMGHLDHRRYPDFSLISQGHYDAALSCIIITHF 67
           LGAG EVG+SCV ++  GK I+FDCG+H  +      P F  I     D     ++ITHF
Sbjct: 27  LGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPSSID----VLLITHF 82

Query: 68  HLDHVGALAYFTEVCGYRGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEELFTSENIAEC 127
           H+DH  +L YF E   + G ++MT+ TKA+  L+L DY KV      E+ LF  ++I + 
Sbjct: 83  HIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQDINKS 141

Query: 128 MKKVIAIDLRQTVQVDEDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNMTADRHLGA 187
           M K+  ID  QTV+V+  ++   Y AGHV+GAAMF   +    ++YTGDY+   DRHL A
Sbjct: 142 MDKIEVIDFHQTVEVN-GIKFWCYTAGHVLGAAMFMVDIAGVRILYTGDYSREEDRHLRA 200

Query: 188 AQIDRLRLDLLITESTYATTIRDSKYAREREFLKAVHKCVSGGGKVLIPTFALGRAQELC 247
           A++ +   D+ I EST    +  S++ RE+ F   +H  V+ GG+VLIP FALGRAQEL 
Sbjct: 201 AELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRVLIPAFALGRAQELL 260

Query: 248 ILLDDYW-ERMNL-KVPIYFSAGLTIQANMYYKMLISWTSQKIKDTYSTHNAFDFKNVHH 305
           ++LD+YW    +L  +PIY+++ L  +    Y+  I   + +I++ ++  N F FK++  
Sbjct: 261 LILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFANSNPFVFKHISP 320

Query: 306 FER-SMINAPGPCVLFATPGMISGGFSLEVFKHWAPSENNLITLPGYCVAGTVGHRLMSG 364
                  N  GP V+ ATPG +  G S ++F  W   + N   +PGY V GT+   +++ 
Sbjct: 321 LNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVEGTLAKTIIN- 379

Query: 365 KATKVDVDPDTQIDVRCQIHQLAFSPH 391
           +  +V +       +  Q+H ++FS H
Sbjct: 380 EPKEVTLMNGLTAPLNMQVHYISFSAH 406


>AT1G61010.2 | Symbols: CPSF73-I | cleavage and polyadenylation
           specificity factor 73-I | chr1:22474954-22477660 REVERSE
           LENGTH=693
          Length = 693

 Score =  271 bits (693), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 145/387 (37%), Positives = 221/387 (57%), Gaps = 10/387 (2%)

Query: 8   LGAGQEVGKSCVVVTINGKRIMFDCGMHMGHLDHRRYPDFSLISQGHYDAALSCIIITHF 67
           LGAG EVG+SCV ++  GK I+FDCG+H  +      P F  I     D     ++ITHF
Sbjct: 27  LGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPSSID----VLLITHF 82

Query: 68  HLDHVGALAYFTEVCGYRGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEELFTSENIAEC 127
           H+DH  +L YF E   + G ++MT+ TKA+  L+L DY KV      E+ LF  ++I + 
Sbjct: 83  HIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQDINKS 141

Query: 128 MKKVIAIDLRQTVQVDEDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNMTADRHLGA 187
           M K+  ID  QTV+V+  ++   Y AGHV+GAAMF   +    ++YTGDY+   DRHL A
Sbjct: 142 MDKIEVIDFHQTVEVN-GIKFWCYTAGHVLGAAMFMVDIAGVRILYTGDYSREEDRHLRA 200

Query: 188 AQIDRLRLDLLITESTYATTIRDSKYAREREFLKAVHKCVSGGGKVLIPTFALGRAQELC 247
           A++ +   D+ I EST    +  S++ RE+ F   +H  V+ GG+VLIP FALGRAQEL 
Sbjct: 201 AELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRVLIPAFALGRAQELL 260

Query: 248 ILLDDYW-ERMNL-KVPIYFSAGLTIQANMYYKMLISWTSQKIKDTYSTHNAFDFKNVHH 305
           ++LD+YW    +L  +PIY+++ L  +    Y+  I   + +I++ ++  N F FK++  
Sbjct: 261 LILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFANSNPFVFKHISP 320

Query: 306 FER-SMINAPGPCVLFATPGMISGGFSLEVFKHWAPSENNLITLPGYCVAGTVGHRLMSG 364
                  N  GP V+ ATPG +  G S ++F  W   + N   +PGY V GT+   +++ 
Sbjct: 321 LNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVEGTLAKTIIN- 379

Query: 365 KATKVDVDPDTQIDVRCQIHQLAFSPH 391
           +  +V +       +  Q+H ++FS H
Sbjct: 380 EPKEVTLMNGLTAPLNMQVHYISFSAH 406


>AT1G61010.1 | Symbols: CPSF73-I | cleavage and polyadenylation
           specificity factor 73-I | chr1:22474954-22477660 REVERSE
           LENGTH=693
          Length = 693

 Score =  271 bits (693), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 145/387 (37%), Positives = 221/387 (57%), Gaps = 10/387 (2%)

Query: 8   LGAGQEVGKSCVVVTINGKRIMFDCGMHMGHLDHRRYPDFSLISQGHYDAALSCIIITHF 67
           LGAG EVG+SCV ++  GK I+FDCG+H  +      P F  I     D     ++ITHF
Sbjct: 27  LGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPSSID----VLLITHF 82

Query: 68  HLDHVGALAYFTEVCGYRGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEELFTSENIAEC 127
           H+DH  +L YF E   + G ++MT+ TKA+  L+L DY KV      E+ LF  ++I + 
Sbjct: 83  HIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQDINKS 141

Query: 128 MKKVIAIDLRQTVQVDEDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNMTADRHLGA 187
           M K+  ID  QTV+V+  ++   Y AGHV+GAAMF   +    ++YTGDY+   DRHL A
Sbjct: 142 MDKIEVIDFHQTVEVN-GIKFWCYTAGHVLGAAMFMVDIAGVRILYTGDYSREEDRHLRA 200

Query: 188 AQIDRLRLDLLITESTYATTIRDSKYAREREFLKAVHKCVSGGGKVLIPTFALGRAQELC 247
           A++ +   D+ I EST    +  S++ RE+ F   +H  V+ GG+VLIP FALGRAQEL 
Sbjct: 201 AELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRVLIPAFALGRAQELL 260

Query: 248 ILLDDYW-ERMNL-KVPIYFSAGLTIQANMYYKMLISWTSQKIKDTYSTHNAFDFKNVHH 305
           ++LD+YW    +L  +PIY+++ L  +    Y+  I   + +I++ ++  N F FK++  
Sbjct: 261 LILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFANSNPFVFKHISP 320

Query: 306 FER-SMINAPGPCVLFATPGMISGGFSLEVFKHWAPSENNLITLPGYCVAGTVGHRLMSG 364
                  N  GP V+ ATPG +  G S ++F  W   + N   +PGY V GT+   +++ 
Sbjct: 321 LNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVEGTLAKTIIN- 379

Query: 365 KATKVDVDPDTQIDVRCQIHQLAFSPH 391
           +  +V +       +  Q+H ++FS H
Sbjct: 380 EPKEVTLMNGLTAPLNMQVHYISFSAH 406


>AT5G23880.1 | Symbols: EMB1265, CPSF100, ESP5, ATCPSF100 | cleavage
           and polyadenylation specificity factor 100 |
           chr5:8052550-8058147 FORWARD LENGTH=739
          Length = 739

 Score =  144 bits (362), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 102/363 (28%), Positives = 171/363 (47%), Gaps = 25/363 (6%)

Query: 20  VVTINGKRIMFDCGMHMGHLDHRRYPDFSLISQ-GHYDAALSCIIITHFHLDHVGALAYF 78
           +V+I+G   + DCG +          D SL+       + +  ++++H    H+GAL Y 
Sbjct: 22  LVSIDGFNFLIDCGWN-------DLFDTSLLEPLSRVASTIDAVLLSHPDTLHIGALPYA 74

Query: 79  TEVCGYRGPIYMTYPTKALAPLMLEDY---RKVMVDRRGEEELFTSENIAECMKKVIAID 135
            +  G   P+Y T P   L  L + D    RK + D     +LFT ++I    + VI + 
Sbjct: 75  MKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDF----DLFTLDDIDSAFQNVIRLT 130

Query: 136 LRQTVQVD---EDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNMTADRHLGAAQIDR 192
             Q   +    E + I  + AGH++G +++       +++Y  DYN   +RHL    +  
Sbjct: 131 YSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKERHLNGTVLQS 190

Query: 193 -LRLDLLITESTYAT-TIRDSKYAREREFLKAVHKCVSGGGKVLIPTFALGRAQELCILL 250
            +R  +LIT++ +A  T + ++  R++EFL  + K +  GG VL+P    GR  EL ++L
Sbjct: 191 FVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTAGRVLELLLIL 250

Query: 251 DDYWERMNLKVPIYFSAGLTIQANMYYKMLISWTSQKIKDTYSTH--NAFDFKNVHHF-- 306
           + +W +     PIYF   ++     Y K  + W S  I  ++ T   NAF  ++V     
Sbjct: 251 EQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAFLLRHVTLLIN 310

Query: 307 ERSMINA-PGPCVLFATPGMISGGFSLEVFKHWAPSENNLITLPGYCVAGTVGHRLMSGK 365
           +  + NA PGP V+ A+   +  GF+ E+F  WA    NL+        GT+   L S  
Sbjct: 311 KTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFGTLARMLQSAP 370

Query: 366 ATK 368
             K
Sbjct: 371 PPK 373


>AT3G07530.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Beta-Casp
           domain (InterPro:IPR022712); BEST Arabidopsis thaliana
           protein match is: cleavage and polyadenylation
           specificity factor 73 kDa subunit-II (TAIR:AT2G01730.1);
           Has 624 Blast hits to 615 proteins in 160 species:
           Archae - 54; Bacteria - 6; Metazoa - 333; Fungi - 44;
           Plants - 93; Viruses - 0; Other Eukaryotes - 94 (source:
           NCBI BLink). | chr3:2400793-2404280 FORWARD LENGTH=699
          Length = 699

 Score = 49.7 bits (117), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 83/385 (21%), Positives = 148/385 (38%), Gaps = 95/385 (24%)

Query: 55  YDAALSCIIITHFHLDHVGALAYFTEVCGYRGPIYMTYPTKALAPLMLED---------- 104
           ++A+   I++    +  +G L + T+  G+   IYMT  T  +  LM+ED          
Sbjct: 98  WEASFIDIVLISNPMGLLG-LPFLTQNPGFFAKIYMTEVTAKIGQLMMEDIVSMHKEFRC 156

Query: 105 ------------------------YRKVMVDRRGEE-----ELFTSENIAECMKKVIAID 135
                                    +KV+    G++      L++ ++I  CMKKV  + 
Sbjct: 157 FHGPDNSSFPGWIKNLDSEQVPALLKKVVFGESGDDLGSWMRLYSLDDIESCMKKVQGVK 216

Query: 136 LRQTVQVDEDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNMTADRHLGAAQIDRLR- 194
             + V  +  L I+A  +G  IGA  +     +  + Y  D ++    H  +     L+ 
Sbjct: 217 FAEEVCYNGTLIIKALSSGLDIGACNWLINGPNGSLSYVSD-SIFVSHHARSFDFHGLKE 275

Query: 195 LDLLI---------------------TESTYATTIRDSKYA--------REREFLKAVHK 225
            D+LI                     +++ Y +TI D+K +         E E L  V  
Sbjct: 276 TDVLIYSDFSSLQSAEVTEDGCISPDSDNNYISTISDNKDSLLNTEDSLEEMEKLAFVCS 335

Query: 226 CVS----GGGKVLIPTFALGRAQELCILLDDYWERMNLKVPIYFSAGLTIQANMYYKMLI 281
           C +     GG  LI    +G   +L  LL +  E  +LKVPI+  + +  +   Y   + 
Sbjct: 336 CAAESADAGGSTLITITRIGIVLQLLELLSNSLESSSLKVPIFVISSVAEELLAYTNTIP 395

Query: 282 SW-TSQKIKDTYSTHNAF------DFKNVHHFERSMINAPG-----------PCVLFATP 323
            W   Q+ +   S   +F        K +H F    I++P            PC++FA+ 
Sbjct: 396 EWLCEQRQEKLISGEPSFGHLKFIKNKKIHLF--PAIHSPNLIYANRTSWQEPCIVFASH 453

Query: 324 GMISGGFSLEVFKHWAPSENNLITL 348
             +  G S+++ + W     +L+ L
Sbjct: 454 WSLRLGPSVQLLQRWRGDPKSLLVL 478