Miyakogusa Predicted Gene - chr2.CM0904.170.nd
BLASTP 2.2.18 [Mar-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= chr2.CM0904.170.nd - phase: 0 /pseudo/partial
(514 letters)
Database: trembl
6,964,485 sequences; 2,268,126,488 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
A7QG18_VITVI (tr|A7QG18) Chromosome undetermined scaffold_91, wh... 519 e-145
A3C1P2_ORYSJ (tr|A3C1P2) Putative uncharacterized protein OS=Ory... 457 e-127
Q0IZH3_ORYSJ (tr|Q0IZH3) Os09g0569400 protein OS=Oryza sativa su... 457 e-126
B6U1S6_MAIZE (tr|B6U1S6) Putative uncharacterized protein OS=Zea... 456 e-126
A2Z484_ORYSI (tr|A2Z484) Putative uncharacterized protein OS=Ory... 455 e-126
A9RUR7_PHYPA (tr|A9RUR7) Predicted protein OS=Physcomitrella pat... 308 9e-82
A7RPX1_NEMVE (tr|A7RPX1) Predicted protein OS=Nematostella vecte... 184 1e-44
Q6DHE5_DANRE (tr|Q6DHE5) Cleavage and polyadenylation specific f... 173 3e-41
B6LND4_BRAFL (tr|B6LND4) Putative uncharacterized protein (Fragm... 169 4e-40
Q4S732_TETNG (tr|Q4S732) Chromosome 14 SCAF14723, whole genome s... 168 1e-39
B4JTB6_DROGR (tr|B4JTB6) GH10247 OS=Drosophila grimshawi GN=GH10... 165 1e-38
B5X4U8_SALSA (tr|B5X4U8) Cleavage and polyadenylation specificit... 162 6e-38
B4QZF3_DROSI (tr|B4QZF3) GD21403 OS=Drosophila simulans GN=GD214... 159 4e-37
B3S6C6_TRIAD (tr|B3S6C6) Putative uncharacterized protein OS=Tri... 159 4e-37
B4KDT1_DROMO (tr|B4KDT1) GI23051 OS=Drosophila mojavensis GN=GI2... 158 1e-36
B3P5M6_DROER (tr|B3P5M6) GG11633 OS=Drosophila erecta GN=GG11633... 157 2e-36
Q8IML7_DROME (tr|Q8IML7) CG1957-PB, isoform B OS=Drosophila mela... 157 2e-36
Q17Q35_AEDAE (tr|Q17Q35) Cleavage and polyadenylation specificit... 157 2e-36
B3LXN9_DROAN (tr|B3LXN9) GF16269 OS=Drosophila ananassae GN=GF16... 157 3e-36
B4G4W9_DROPE (tr|B4G4W9) GL24248 (Fragment) OS=Drosophila persim... 156 4e-36
B4PQ86_DROYA (tr|B4PQ86) GE26465, isoform A OS=Drosophila yakuba... 156 5e-36
B5DYG6_DROPS (tr|B5DYG6) GA26549 OS=Drosophila pseudoobscura pse... 154 2e-35
B4NF98_DROWI (tr|B4NF98) GK22564 OS=Drosophila willistoni GN=GK2... 152 5e-35
Q5F3I9_CHICK (tr|Q5F3I9) Putative uncharacterized protein OS=Gal... 148 1e-33
B4HZ45_DROSE (tr|B4HZ45) GM12754 OS=Drosophila sechellia GN=GM12... 148 1e-33
Q4R8N0_MACFA (tr|Q4R8N0) Testis cDNA clone: QtsA-12018, similar ... 147 2e-33
B3KMI0_HUMAN (tr|B3KMI0) cDNA FLJ11050 fis, clone PLACE1004564, ... 147 2e-33
B3KME1_HUMAN (tr|B3KME1) cDNA FLJ10765 fis, A-NT2RP4000111, high... 147 2e-33
B3KN45_HUMAN (tr|B3KN45) cDNA FLJ13521 fis, clone PLACE1005876, ... 147 3e-33
A4RR19_OSTLU (tr|A4RR19) Predicted protein OS=Ostreococcus lucim... 144 1e-32
B1H337_XENTR (tr|B1H337) LOC100145546 protein OS=Xenopus tropica... 140 2e-31
Q7QC68_ANOGA (tr|Q7QC68) AGAP002474-PA OS=Anopheles gambiae GN=A... 140 3e-31
B0WQG5_CULQU (tr|B0WQG5) Cleavage and polyadenylation specificit... 139 4e-31
Q01GI5_OSTTA (tr|Q01GI5) Polyadenylation cleavage/specificity fa... 139 6e-31
A8PAE9_BRUMA (tr|A8PAE9) Cleavage and polyadenylation specificit... 136 5e-30
B4PQ87_DROYA (tr|B4PQ87) GE26465, isoform B OS=Drosophila yakuba... 132 9e-29
Q8WPK6_OIKDI (tr|Q8WPK6) Similar to cleavage and polyadenylation... 130 3e-28
B6ECP4_9EUCA (tr|B6ECP4) Cyclin B OS=Fenneropenaeus penicillatus... 127 2e-27
B6K3N6_SCHJP (tr|B6K3N6) Cleavage factor two Cft2/polyadenylatio... 123 3e-26
Q3US33_MOUSE (tr|Q3US33) Putative uncharacterized protein (Fragm... 121 1e-25
A8XUS3_CAEBR (tr|A8XUS3) CBR-CPSF-2 protein OS=Caenorhabditis br... 118 9e-25
B6NF52_BRAFL (tr|B6NF52) Putative uncharacterized protein (Fragm... 104 2e-20
Q3URN8_MOUSE (tr|Q3URN8) Putative uncharacterized protein (Fragm... 102 8e-20
Q3UGU6_MOUSE (tr|Q3UGU6) Putative uncharacterized protein (Fragm... 101 2e-19
B4M067_DROVI (tr|B4M067) GJ24636 OS=Drosophila virilis GN=GJ2463... 100 2e-19
B0CXU7_LACBS (tr|B0CXU7) Predicted protein OS=Laccaria bicolor (... 92 1e-16
A9VA41_MONBE (tr|A9VA41) Predicted protein OS=Monosiga brevicoll... 90 3e-16
B6NF51_BRAFL (tr|B6NF51) Putative uncharacterized protein OS=Bra... 83 5e-14
A8NYN1_COPC7 (tr|A8NYN1) Putative uncharacterized protein OS=Cop... 80 4e-13
A5DGP1_PICGU (tr|A5DGP1) Putative uncharacterized protein OS=Pic... 80 5e-13
A2R7F5_ASPNC (tr|A2R7F5) Contig An16c0120, complete genome. OS=A... 74 4e-11
B2VU84_PYRTR (tr|B2VU84) Cleavage and polyadenylation specificit... 72 8e-11
Q7S0J8_NEUCR (tr|Q7S0J8) Putative uncharacterized protein OS=Neu... 72 1e-10
B6GY49_PENCH (tr|B6GY49) Pc12g15810 protein OS=Penicillium chrys... 71 2e-10
Q4R8A1_MACFA (tr|Q4R8A1) Testis cDNA clone: QtsA-13018, similar ... 69 1e-09
Q6CAZ0_YARLI (tr|Q6CAZ0) YALI0C23232p OS=Yarrowia lipolytica GN=... 68 2e-09
A8PTN4_MALGO (tr|A8PTN4) Putative uncharacterized protein OS=Mal... 67 3e-09
Q0UZX3_PHANO (tr|Q0UZX3) Putative uncharacterized protein OS=Pha... 67 5e-09
A8IL96_CHLRE (tr|A8IL96) Predicted protein (Fragment) OS=Chlamyd... 66 5e-09
Q5KIP3_CRYNE (tr|Q5KIP3) Cleavage and polyadenylation specificit... 65 1e-08
Q4S6C2_TETNG (tr|Q4S6C2) Chromosome 9 SCAF14729, whole genome sh... 65 1e-08
A7TPD0_VANPO (tr|A7TPD0) Putative uncharacterized protein OS=Van... 65 2e-08
B6ABK4_9CRYT (tr|B6ABK4) Putative uncharacterized protein OS=Cry... 65 2e-08
B6PQF2_BRAFL (tr|B6PQF2) Putative uncharacterized protein OS=Bra... 64 4e-08
A8Y5J3_MOUSE (tr|A8Y5J3) Cleavage and polyadenylation specific f... 62 7e-08
Q9N0D6_MACFA (tr|Q9N0D6) Unnamed protein product OS=Macaca fasci... 62 7e-08
B3RHF9_YEAS1 (tr|B3RHF9) Putative uncharacterized protein OS=Sac... 62 8e-08
A7A1G1_YEAS7 (tr|A7A1G1) Cleavage factor II (CF II) component OS... 62 8e-08
A2ADA4_MOUSE (tr|A2ADA4) Cleavage and polyadenylation specific f... 62 9e-08
Q17BY3_AEDAE (tr|Q17BY3) Cleavage and polyadenylation specificit... 62 1e-07
B0WXD2_CULQU (tr|B0WXD2) Cleavage and polyadenylation specificit... 62 1e-07
Q5AEE3_CANAL (tr|Q5AEE3) Putative uncharacterized protein CFT2 O... 62 1e-07
Q7Q184_ANOGA (tr|Q7Q184) AGAP009923-PA OS=Anopheles gambiae GN=A... 62 1e-07
B4JYM6_DROGR (tr|B4JYM6) GH14325 OS=Drosophila grimshawi GN=GH14... 60 3e-07
Q9VAH9_DROME (tr|Q9VAH9) CG1972-PA (LD08814p) OS=Drosophila mela... 60 3e-07
B3P5D8_DROER (tr|B3P5D8) GG11690 OS=Drosophila erecta GN=GG11690... 60 3e-07
Q29AN7_DROPS (tr|Q29AN7) GA15164 OS=Drosophila pseudoobscura pse... 60 3e-07
B4R083_DROSI (tr|B4R083) GD21460 OS=Drosophila simulans GN=GD214... 60 3e-07
B4PPK8_DROYA (tr|B4PPK8) GE23879 OS=Drosophila yakuba GN=GE23879... 60 3e-07
B4HZE1_DROSE (tr|B4HZE1) GM12816 OS=Drosophila sechellia GN=GM12... 60 3e-07
B4M5T1_DROVI (tr|B4M5T1) GJ10637 OS=Drosophila virilis GN=GJ1063... 60 3e-07
Q5TA44_HUMAN (tr|Q5TA44) Cleavage and polyadenylation specific f... 60 3e-07
B4NBL1_DROWI (tr|B4NBL1) GK11920 OS=Drosophila willistoni GN=GK1... 60 3e-07
B4G664_DROPE (tr|B4G664) GL23717 OS=Drosophila persimilis GN=GL2... 60 3e-07
A8K5S2_HUMAN (tr|A8K5S2) cDNA FLJ77752 OS=Homo sapiens PE=2 SV=1 60 4e-07
B4K612_DROMO (tr|B4K612) GI10422 OS=Drosophila mojavensis GN=GI1... 60 4e-07
B3KPR3_HUMAN (tr|B3KPR3) cDNA FLJ32090 fis, clone OCBBF2000719, ... 60 4e-07
B6K781_SCHJP (tr|B6K781) Endoribonuclease ysh1 OS=Schizosaccharo... 60 4e-07
B3S010_TRIAD (tr|B3S010) Putative uncharacterized protein OS=Tri... 60 4e-07
A7RL47_NEMVE (tr|A7RL47) Predicted protein OS=Nematostella vecte... 60 4e-07
Q96HV7_HUMAN (tr|Q96HV7) CPSF3L protein OS=Homo sapiens GN=CPSF3... 60 4e-07
B4DM87_HUMAN (tr|B4DM87) cDNA FLJ55759, highly similar to Homo s... 60 4e-07
B3MTI3_DROAN (tr|B3MTI3) GF23370 OS=Drosophila ananassae GN=GF23... 60 5e-07
Q750X1_ASHGO (tr|Q750X1) AGL182Cp OS=Ashbya gossypii GN=AGL182C ... 60 5e-07
Q4TC63_TETNG (tr|Q4TC63) Chromosome undetermined SCAF7053, whole... 59 1e-06
A7S9J6_NEMVE (tr|A7S9J6) Predicted protein OS=Nematostella vecte... 58 2e-06
Q5CXL4_CRYPV (tr|Q5CXL4) Inactive CPSFs Cft2p metallobeta-lactam... 57 4e-06
Q581U7_9TRYP (tr|Q581U7) Cleavage and polyadenylation specificit... 56 5e-06
A2EDT7_TRIVA (tr|A2EDT7) Putative uncharacterized protein OS=Tri... 56 6e-06
B6QRB9_PENMA (tr|B6QRB9) Cleavage and polyadenylylation specific... 56 7e-06
A7E3Q0_BOVIN (tr|A7E3Q0) Related to CPSF subunits 68 kDa OS=Bos ... 56 8e-06
A7T4V8_NEMVE (tr|A7T4V8) Predicted protein OS=Nematostella vecte... 55 9e-06
Q4RM75_TETNG (tr|Q4RM75) Chromosome 10 SCAF15019, whole genome s... 55 1e-05
B5X1A0_SALSA (tr|B5X1A0) Cleavage and polyadenylation specificit... 55 2e-05
A7F0N0_SCLS1 (tr|A7F0N0) Putative uncharacterized protein OS=Scl... 55 2e-05
Q6DIH6_XENTR (tr|Q6DIH6) Cleavage and polyadenylation specific f... 54 2e-05
Q8SUC3_ENCCU (tr|Q8SUC3) Putative uncharacterized protein ECU10_... 54 2e-05
Q28FT5_XENTR (tr|Q28FT5) Cleavage and polyadenylation specific f... 54 2e-05
Q6DRG6_DANRE (tr|Q6DRG6) Cleavage and polyadenylation specificit... 54 2e-05
Q5U3T4_DANRE (tr|Q5U3T4) Cleavage and polyadenylation specific f... 54 2e-05
A4S2M5_OSTLU (tr|A4S2M5) Predicted protein OS=Ostreococcus lucim... 54 2e-05
Q53RS2_HUMAN (tr|Q53RS2) Putative uncharacterized protein CPSF3 ... 54 2e-05
Q53F02_HUMAN (tr|Q53F02) Cleavage and polyadenylation specific f... 54 2e-05
Q2GQR7_CHAGB (tr|Q2GQR7) Putative uncharacterized protein OS=Cha... 54 2e-05
A8QG81_BRUMA (tr|A8QG81) RNA-metabolising metallo-beta-lactamase... 54 2e-05
Q4R7A0_MACFA (tr|Q4R7A0) Testis cDNA, clone: QtsA-15820, similar... 54 3e-05
Q5R428_PONAB (tr|Q5R428) Putative uncharacterized protein DKFZp4... 54 3e-05
Q5XGZ1_XENLA (tr|Q5XGZ1) LOC495111 protein OS=Xenopus laevis GN=... 54 3e-05
Q4R6V4_MACFA (tr|Q4R6V4) Testis cDNA, clone: QtsA-17044, similar... 54 3e-05
Q497G2_MOUSE (tr|Q497G2) Cpsf3 protein OS=Mus musculus GN=Cpsf3 ... 54 4e-05
A6SPR2_BOTFB (tr|A6SPR2) Putative uncharacterized protein OS=Bot... 54 4e-05
A8Q4U4_MALGO (tr|A8Q4U4) Putative uncharacterized protein OS=Mal... 54 4e-05
B2AL57_PODAN (tr|B2AL57) Predicted CDS Pa_5_8800 OS=Podospora an... 53 5e-05
Q011P7_OSTTA (tr|Q011P7) MRNA cleavage and polyadenylation facto... 53 5e-05
A8JGK6_CHLRE (tr|A8JGK6) Subunit of mRNA cleavage and polyadenyl... 53 5e-05
Q499P4_RAT (tr|Q499P4) Cleavage and polyadenylation specificity ... 53 5e-05
Q8CIM0_MOUSE (tr|Q8CIM0) Cleavage and polyadenylation specificit... 53 6e-05
A9RRH1_PHYPA (tr|A9RRH1) Predicted protein OS=Physcomitrella pat... 53 6e-05
Q3U057_MOUSE (tr|Q3U057) Putative uncharacterized protein OS=Mus... 53 6e-05
B5RTE7_DEBHA (tr|B5RTE7) DEHA2D07304p OS=Debaryomyces hansenii G... 53 6e-05
Q3UDS1_MOUSE (tr|Q3UDS1) Putative uncharacterized protein OS=Mus... 53 6e-05
Q3TC91_MOUSE (tr|Q3TC91) Putative uncharacterized protein OS=Mus... 53 6e-05
A4HB61_LEIBR (tr|A4HB61) Cleavage and polyadenylation specificit... 53 7e-05
A8WQT1_CAEBR (tr|A8WQT1) CBR-CPSF-3 protein OS=Caenorhabditis br... 52 8e-05
A4IAA9_LEIIN (tr|A4IAA9) Cleavage and polyadenylation specificit... 52 9e-05
Q95PY8_CAEEL (tr|Q95PY8) Protein Y67H2A.1, partially confirmed b... 52 1e-04
Q4Q2K1_LEIMA (tr|Q4Q2K1) Cleavage and polyadenylation specificit... 52 1e-04
Q05BZ5_HUMAN (tr|Q05BZ5) CPSF3 protein (Fragment) OS=Homo sapien... 52 1e-04
A3BYK8_ORYSJ (tr|A3BYK8) Putative uncharacterized protein OS=Ory... 52 1e-04
A2Z109_ORYSI (tr|A2Z109) Putative uncharacterized protein OS=Ory... 52 1e-04
B6KMV5_TOXGO (tr|B6KMV5) Cleavage and polyadenylation specificit... 52 1e-04
Q6ERD4_ORYSJ (tr|Q6ERD4) Putative FEG protein (Os09g0397900 prot... 52 2e-04
A9SGX1_PHYPA (tr|A9SGX1) Predicted protein OS=Physcomitrella pat... 52 2e-04
Q6BCB3_TRYCR (tr|Q6BCB3) Cleavage polyadenylation specificity fa... 51 2e-04
B4JEZ6_DROGR (tr|B4JEZ6) GH19212 OS=Drosophila grimshawi GN=GH19... 51 2e-04
B3RKJ0_TRIAD (tr|B3RKJ0) Putative uncharacterized protein OS=Tri... 51 2e-04
Q9U3K2_CAEEL (tr|Q9U3K2) Protein F10B5.8, partially confirmed by... 51 2e-04
Q4DTW2_TRYCR (tr|Q4DTW2) Cleavage and polyadenylation specificit... 51 3e-04
Q170J6_AEDAE (tr|Q170J6) Cleavage and polyadenylation specificit... 50 3e-04
Q9C952_ARATH (tr|Q9C952) Putative cleavage and polyadenylation s... 50 4e-04
Q8VY18_ARATH (tr|Q8VY18) Putative cleavage and polyadenylation s... 50 4e-04
A8WN58_CAEBR (tr|A8WN58) Putative uncharacterized protein OS=Cae... 50 4e-04
B4K8L8_DROMO (tr|B4K8L8) GI24246 OS=Drosophila mojavensis GN=GI2... 50 4e-04
Q4DNS8_TRYCR (tr|Q4DNS8) Cleavage and polyadenylation specificit... 50 5e-04
B4M3X4_DROVI (tr|B4M3X4) GJ10819 OS=Drosophila virilis GN=GJ1081... 50 5e-04
Q8SUE4_ENCCU (tr|Q8SUE4) Putative uncharacterized protein ECU10_... 50 6e-04
B0X8N8_CULQU (tr|B0X8N8) Cleavage and polyadenylation specificit... 49 7e-04
B4QU05_DROSI (tr|B4QU05) GD20157 OS=Drosophila simulans GN=GD201... 49 8e-04
B3LW78_DROAN (tr|B3LW78) GF18101 OS=Drosophila ananassae GN=GF18... 49 8e-04
A8N7F0_COPC7 (tr|A8N7F0) Putative uncharacterized protein OS=Cop... 49 8e-04
Q9VE51_DROME (tr|Q9VE51) CG7698-PA (RE31408p) OS=Drosophila mela... 49 9e-04
B4I269_DROSE (tr|B4I269) GM18692 OS=Drosophila sechellia GN=GM18... 49 9e-04
Q7PX56_ANOGA (tr|Q7PX56) AGAP001224-PA (Fragment) OS=Anopheles g... 49 9e-04
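
The summary table above can be read programmatically. The following is a minimal sketch, not part of BLAST itself: `parse_hit_line` and `HIT_LINE` are hypothetical names, and the code assumes each summary line ends with a bit score and an E-value separated by whitespace, as in the lines above. It also handles the abbreviated form `e-145` (meaning 1e-145), which Python's `float()` does not accept as written.

```python
import re

# Hypothetical helper, not part of BLAST: parse one line of the hit summary.
# Assumption: the line has the shape "<entry name> <description> <bits> <E-value>".
HIT_LINE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)$")


def parse_hit_line(line):
    """Return (entry_name, description, bit_score, e_value), or None if no match."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    name, desc, bits, evalue = m.groups()
    # The report abbreviates 1e-145 as "e-145"; prepend the implied mantissa.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return name, desc, float(bits), float(evalue)


if __name__ == "__main__":
    sample = ("A7QG18_VITVI (tr|A7QG18) Chromosome undetermined "
              "scaffold_91, wh... 519 e-145")
    print(parse_hit_line(sample))
```

The description is kept as a single free-text field because the report truncates it with `...`; recovering the full description requires the alignment section or the database entry.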
>A7QG18_VITVI (tr|A7QG18) Chromosome undetermined scaffold_91, whole genome
shotgun sequence OS=Vitis vinifera GN=GSVIVT00037665001
PE=4 SV=1
Length = 740
Score = 519 bits (1336), Expect = e-145, Method: Compositional matrix adjust.
Identities = 247/315 (78%), Positives = 280/315 (88%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW + +L YPI+FL+YV+SSTIDYVKSFLEWMSD+IAKSFE TR+N FLLK++TLL++K+
Sbjct: 252 YWTQHHLNYPIFFLTYVASSTIDYVKSFLEWMSDSIAKSFEHTRDNAFLLKHVTLLISKS 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPP 319
EL+ VPDGPK+VLASMASLEAGFSHDIFVEW +D KNLVLF+E+GQFATLARMLQADPPP
Sbjct: 312 ELEKVPDGPKIVLASMASLEAGFSHDIFVEWATDAKNLVLFSERGQFATLARMLQADPPP 371
Query: 320 KAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVI 379
KAVKVT+SKRVPLVG+EL AYEEEQ RIKKEEALKASL K +E+KAS G+D+ DPMVI
Sbjct: 372 KAVKVTMSKRVPLVGEELAAYEEEQERIKKEEALKASLSKEDEMKASRGSDNKLGDPMVI 431
Query: 380 DTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYV 439
DT + +VA P GG+RDI +DGFVPPSTSVAPMFP YEN+SEWDDFGEVINP+DYV
Sbjct: 432 DTTTPPASSDVAVPHVGGHRDILIDGFVPPSTSVAPMFPFYENSSEWDDFGEVINPEDYV 491
Query: 440 IKDEDMDQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSD 499
IKDEDMDQ M VG D+NGKLDEGAASLI DT PSKVIS+E TVQV+C LVYMDFEGRSD
Sbjct: 492 IKDEDMDQATMQVGDDLNGKLDEGAASLIFDTTPSKVISNELTVQVKCMLVYMDFEGRSD 551
Query: 500 GRSIKNILSHVAPLK 514
GRSIK+ILSHVAPLK
Sbjct: 552 GRSIKSILSHVAPLK 566
>A3C1P2_ORYSJ (tr|A3C1P2) Putative uncharacterized protein OS=Oryza sativa subsp.
japonica GN=OsJ_029214 PE=4 SV=1
Length = 1017
Score = 457 bits (1176), Expect = e-127, Method: Compositional matrix adjust.
Identities = 221/316 (69%), Positives = 265/316 (83%), Gaps = 2/316 (0%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YWA+ +L+YPIYFL+ VS+ST+DYVKSFLEWM+D+I+KSFE TR+N FLLK +T ++NK
Sbjct: 252 YWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWMNDSISKSFEHTRDNAFLLKCVTQIINKD 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPP 319
EL+ + D PKVVLASMASLE GFSHDIFV+ ++ KNLVLFTEKGQF TLARMLQ DPPP
Sbjct: 312 ELEKLGDAPKVVLASMASLEVGFSHDIFVDMANEAKNLVLFTEKGQFGTLARMLQVDPPP 371
Query: 320 KAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVI 379
KAVKVT+SKR+PLVGDEL AYEEEQ RIKKEEALKASL K EE KAS G+++ SDPMVI
Sbjct: 372 KAVKVTMSKRIPLVGDELKAYEEEQERIKKEEALKASLNKEEEKKASLGSNAKASDPMVI 431
Query: 380 DTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYV 439
D + P AG + GG DI +DGFVPPS+SVAPMFP +ENTSEWDDFGEVINP+DY+
Sbjct: 432 DASTSR-KPSNAGSKFGGNVDILIDGFVPPSSSVAPMFPFFENTSEWDDFGEVINPEDYL 490
Query: 440 IKDEDMDQTAMHVGGD-INGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRS 498
+K E+MD T M GD ++ LDEG+A L+LD+ PSKVIS+E TVQV+CSL YMDFEGRS
Sbjct: 491 MKQEEMDNTLMPGAGDGMDSMLDEGSARLLLDSTPSKVISNEMTVQVKCSLAYMDFEGRS 550
Query: 499 DGRSIKNILSHVAPLK 514
DGRS+K++++HVAPLK
Sbjct: 551 DGRSVKSVIAHVAPLK 566
>Q0IZH3_ORYSJ (tr|Q0IZH3) Os09g0569400 protein OS=Oryza sativa subsp. japonica
GN=Os09g0569400 PE=4 SV=1
Length = 738
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 221/316 (69%), Positives = 265/316 (83%), Gaps = 2/316 (0%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YWA+ +L+YPIYFL+ VS+ST+DYVKSFLEWM+D+I+KSFE TR+N FLLK +T ++NK
Sbjct: 252 YWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWMNDSISKSFEHTRDNAFLLKCVTQIINKD 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPP 319
EL+ + D PKVVLASMASLE GFSHDIFV+ ++ KNLVLFTEKGQF TLARMLQ DPPP
Sbjct: 312 ELEKLGDAPKVVLASMASLEVGFSHDIFVDMANEAKNLVLFTEKGQFGTLARMLQVDPPP 371
Query: 320 KAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVI 379
KAVKVT+SKR+PLVGDEL AYEEEQ RIKKEEALKASL K EE KAS G+++ SDPMVI
Sbjct: 372 KAVKVTMSKRIPLVGDELKAYEEEQERIKKEEALKASLNKEEEKKASLGSNAKASDPMVI 431
Query: 380 DTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYV 439
D + P AG + GG DI +DGFVPPS+SVAPMFP +ENTSEWDDFGEVINP+DY+
Sbjct: 432 DASTSR-KPSNAGSKFGGNVDILIDGFVPPSSSVAPMFPFFENTSEWDDFGEVINPEDYL 490
Query: 440 IKDEDMDQTAMHVGGD-INGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRS 498
+K E+MD T M GD ++ LDEG+A L+LD+ PSKVIS+E TVQV+CSL YMDFEGRS
Sbjct: 491 MKQEEMDNTLMPGAGDGMDSMLDEGSARLLLDSTPSKVISNEMTVQVKCSLAYMDFEGRS 550
Query: 499 DGRSIKNILSHVAPLK 514
DGRS+K++++HVAPLK
Sbjct: 551 DGRSVKSVIAHVAPLK 566
>B6U1S6_MAIZE (tr|B6U1S6) Putative uncharacterized protein OS=Zea mays PE=2 SV=1
Length = 673
Score = 456 bits (1173), Expect = e-126, Method: Compositional matrix adjust.
Identities = 218/316 (68%), Positives = 263/316 (83%), Gaps = 3/316 (0%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW E L YPIYFL+ VS+ST+DYVKSFLEWM D IAKSFE +R N FLLK +TL++NK
Sbjct: 188 YWDERRLQYPIYFLTNVSTSTVDYVKSFLEWMGDQIAKSFESSRANAFLLKKVTLIINKE 247
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPP 319
EL+ + D PKVVLASMASLE GFSHDIFVE ++ +NLVLFTEKGQF TLARMLQ DPPP
Sbjct: 248 ELEKLGDAPKVVLASMASLEVGFSHDIFVEMANEARNLVLFTEKGQFGTLARMLQVDPPP 307
Query: 320 KAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVI 379
KA+KVT+SKR+PLVG+EL AYEEEQ RIKKE++LKASL+K EELKASHG+++ S+PMVI
Sbjct: 308 KALKVTMSKRIPLVGNELKAYEEEQERIKKEKSLKASLVKEEELKASHGSNTKASEPMVI 367
Query: 380 DTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYV 439
D ++ V GG DI +DGFVPP TSVAPMFP +ENT+EWDDFGEVINPDDY+
Sbjct: 368 DASSS--RKSVNASHFGGNNDILIDGFVPPLTSVAPMFPFFENTAEWDDFGEVINPDDYM 425
Query: 440 IKDEDMDQTAMHVGGD-INGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRS 498
+K E+MD T M GD ++G++D+G+A L+LD+ PSKVIS+E TVQV+CSLVYMDFEGRS
Sbjct: 426 MKQEEMDNTLMLGPGDGLDGRIDDGSARLLLDSTPSKVISNEMTVQVKCSLVYMDFEGRS 485
Query: 499 DGRSIKNILSHVAPLK 514
DGRS+K+I++HVAPLK
Sbjct: 486 DGRSVKSIIAHVAPLK 501
>A2Z484_ORYSI (tr|A2Z484) Putative uncharacterized protein OS=Oryza sativa subsp.
indica GN=OsI_031377 PE=4 SV=1
Length = 1139
Score = 455 bits (1170), Expect = e-126, Method: Compositional matrix adjust.
Identities = 221/316 (69%), Positives = 265/316 (83%), Gaps = 2/316 (0%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YWA+ +L+YPIYFL+ VS+ST+DYVKSFLEWM+D+I+KSFE TR+N FLLK +T ++NK
Sbjct: 252 YWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWMNDSISKSFEHTRDNAFLLKCVTQIINKD 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPP 319
EL+ + D PKVVLASMASLE GFSHDIFV+ ++ KNLVLFTEKGQF TLARMLQ DPPP
Sbjct: 312 ELEKLGDAPKVVLASMASLEVGFSHDIFVDMANEAKNLVLFTEKGQFGTLARMLQVDPPP 371
Query: 320 KAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVI 379
KAVKVT+SKR+PLVGDEL AYEEEQ RIKKEEALKASL K EE KAS G+++ SDPMVI
Sbjct: 372 KAVKVTMSKRIPLVGDELKAYEEEQERIKKEEALKASLNKEEEKKASLGSNAKASDPMVI 431
Query: 380 DTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYV 439
D + P AG + GG DI +DGFVPPS+SVAPMFP +ENTSEWDDFGEVINP+DY+
Sbjct: 432 DASTSR-KPSNAGSKFGGNVDILIDGFVPPSSSVAPMFPFFENTSEWDDFGEVINPEDYL 490
Query: 440 IKDEDMDQTAMHVGGD-INGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRS 498
+K E+MD T M GD ++ LDEG+A L+LD+ PSKVIS+E TVQV+CSL YMDFEGRS
Sbjct: 491 MKQEEMDNTLMPGAGDGMDSMLDEGSARLLLDSTPSKVISNEMTVQVKCSLAYMDFEGRS 550
Query: 499 DGRSIKNILSHVAPLK 514
DGRS+K++++HVAPLK
Sbjct: 551 DGRSVKSVIAHVAPLK 566
>A9RUR7_PHYPA (tr|A9RUR7) Predicted protein OS=Physcomitrella patens subsp.
patens GN=PHYPADRAFT_205745 PE=4 SV=1
Length = 724
Score = 308 bits (788), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 159/317 (50%), Positives = 212/317 (66%), Gaps = 13/317 (4%)
Query: 201 WAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTE 260
WA + L YP+ L+ VS ST+++ KS LEWMSD+IA+SF +REN FLLKY+ L ++ E
Sbjct: 253 WAHQRLSYPVALLTNVSYSTVEFAKSLLEWMSDSIARSFGSSRENSFLLKYLKLCHDRKE 312
Query: 261 LDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPK 320
D +P GPKVV ASMASLE GF+ D+FVEW +D +NLVLFTE+GQ TLA+ LQA+PPPK
Sbjct: 313 FDELPSGPKVVFASMASLEGGFARDLFVEWATDSRNLVLFTERGQMGTLAKKLQAEPPPK 372
Query: 321 AVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVID 380
VKVT+S+++PL G+EL AYE EQ R+K + L+ EE+ + + P+
Sbjct: 373 IVKVTMSQKIPLTGEELQAYELEQ-RLKMATETEVDLV--EEVGPNSPEAKAVTGPL--- 426
Query: 381 TGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVI 440
L E A R I +DGF + PMFP YEN S+WD++GEVINP+DY +
Sbjct: 427 ---PLTVAEPATNEIPSQRQILIDGFTASDKTAGPMFPLYENPSDWDEYGEVINPEDYRV 483
Query: 441 KD-EDMDQTAMHVG--GDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGR 497
+D E MD + D+ D+ A +++ D +PSKV+ + TV V+C+L YMDFEGR
Sbjct: 484 EDTEMMDYQSSQQAPVADVEDNTDQEAEAILAD-RPSKVVVKDYTVYVKCALYYMDFEGR 542
Query: 498 SDGRSIKNILSHVAPLK 514
SDGRSIKNIL+HVAP+K
Sbjct: 543 SDGRSIKNILAHVAPIK 559
>A7RPX1_NEMVE (tr|A7RPX1) Predicted protein OS=Nematostella vectensis GN=v1g89357
PE=4 SV=1
Length = 737
Score = 184 bits (468), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 118/308 (38%), Positives = 167/308 (54%), Gaps = 28/308 (9%)
Query: 208 YPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDG 267
Y + L+ VS + I++ KS +EWMSD I K+FE R N + +Y L + +L VP+
Sbjct: 264 YSLAMLNNVSYNVIEFAKSQVEWMSDKIMKAFEIGRNNPYQFRYCHLCHSLADLARVPE- 322
Query: 268 PKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVS 327
PKVVLASM L AGFS D+FVEW + KN V+FT + TLAR L + K V++ V
Sbjct: 323 PKVVLASMMDLTAGFSRDLFVEWADNPKNTVIFTARSSPGTLARTLIDNLELKQVELEVK 382
Query: 328 KRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNNLIA 387
+RV L G+EL Y EE + +K+ + A L A +DS D + ++L+
Sbjct: 383 QRVRLGGEELERYLEENKKKEKDYPVLA----ISTLVAEDDSDSEVEDEVASGARHDLMM 438
Query: 388 PEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD-EDMD 446
E R F + S PMFPC+E ++WDD+GE I P+DY+ ++ +
Sbjct: 439 AEQKSGRKS--------SFFKQARSF-PMFPCHEEKAKWDDYGEFIRPEDYMQRELSATE 489
Query: 447 QTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNI 506
+ V D L P+K IS ++TV +RC+L ++DFEGRSDG SIK I
Sbjct: 490 EEKQKVVRD-------------LSKVPTKCISQKKTVSIRCTLAFIDFEGRSDGESIKRI 536
Query: 507 LSHVAPLK 514
L+ V P K
Sbjct: 537 LNLVNPRK 544
>Q6DHE5_DANRE (tr|Q6DHE5) Cleavage and polyadenylation specific factor 2 OS=Danio
rerio GN=cpsf2 PE=2 SV=1
Length = 790
Score = 173 bits (438), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 169/310 (54%), Gaps = 17/310 (5%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VY + L+ VS + +++ KS +EWMSD + + FE R N F ++++L + ++L VP
Sbjct: 263 VYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVP- 321
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVL S LE+GFS ++F++W D KN V+ T + TLAR L +P K +++ +
Sbjct: 322 SPKVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYRTTPGTLARYLIDNPGEKRIELEI 381
Query: 327 SKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGAD--SNTSDPMVIDTGNN 384
KR L G EL Y E++ R+KKE A K K +L +S +D + P V+ T ++
Sbjct: 382 RKRCRLEGRELEEYMEKE-RMKKEAAKKLEQAKEVDLDSSDESDMEDDLEQPAVVKTKHH 440
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDED 444
+ + G R G GF + MFP +E +WD++GE+I P+D+++ +
Sbjct: 441 DLMMKGEGGRKG--------GFFKQAKKSYSMFPTHEERIKWDEYGEIIRPEDFLVPEL- 491
Query: 445 MDQTAMHVGGDINGKLDEGAASLILDTK--PSKVISDERTVQVRCSLVYMDFEGRSDGRS 502
Q + L G + D P+K S +T+ +R ++Y+D+EGRSDG S
Sbjct: 492 --QATEEEKSKLESGLTNGEEPMEQDLSDVPTKCTSTTQTLDIRARVMYIDYEGRSDGDS 549
Query: 503 IKNILSHVAP 512
IK I++ + P
Sbjct: 550 IKKIINQMKP 559
>B6LND4_BRAFL (tr|B6LND4) Putative uncharacterized protein (Fragment)
OS=Branchiostoma floridae GN=BRAFLDRAFT_209615 PE=4 SV=1
Length = 607
Score = 169 bits (428), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 166/316 (52%), Gaps = 9/316 (2%)
Query: 200 YW--AEENL-VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLV 256
YW AE L Y + L+ V+ + +++ KS +EWMSD I + FE R N F K++ L
Sbjct: 70 YWQNAETGLQAYNLCLLNNVAYNVVEFAKSQVEWMSDKIMRVFEDNRNNPFQFKHLKLCH 129
Query: 257 NKTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD 316
+ +EL VPD PKVVLAS+ LE+GFS ++FV+W + KN V+ T + TL RML +
Sbjct: 130 SLSELHKVPD-PKVVLASVPDLESGFSRELFVQWCQNQKNTVVLTSRPGPGTLGRMLIDN 188
Query: 317 PPPKAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDP 376
P K + KRV L G EL Y +E+ + K+E+ + S K +E S D +
Sbjct: 189 PKMKTFTLQARKRVRLEGPELEEYLQEEKKEKEEKKRRESKAKGDESDTSESEDEMEVEG 248
Query: 377 MVIDTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPD 436
G +A + G R GF + PMFP E +WDD+GE+I P+
Sbjct: 249 SSFPGGVKGVAKHDLMMQAEGGRK---GGFFKQAKKAYPMFPAPEERVKWDDYGEIIKPE 305
Query: 437 DYVIKDEDMDQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEG 496
DY++ +M Q + + L P+K I E T+ ++C +VY+DFEG
Sbjct: 306 DYMVV--EMTQAEEEKAKAEGEAAAQEEFAEELTDVPTKSIVQELTLDIKCRVVYIDFEG 363
Query: 497 RSDGRSIKNILSHVAP 512
RSDG S+K IL+ + P
Sbjct: 364 RSDGESMKKILTQLKP 379
>Q4S732_TETNG (tr|Q4S732) Chromosome 14 SCAF14723, whole genome shotgun sequence.
(Fragment) OS=Tetraodon nigroviridis
GN=GSTENG00023013001 PE=4 SV=1
Length = 765
Score = 168 bits (425), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 107/318 (33%), Positives = 172/318 (54%), Gaps = 31/318 (9%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VYP+ L+ VS + +++ KS +EWMSD + + FE R N F +++TL + +L VP
Sbjct: 239 VYPLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHSLADLARVP- 297
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVL S LE+GFS ++F++W D KN ++ T + TLAR L +P K + + V
Sbjct: 298 SPKVVLCSQPDLESGFSRELFIQWSKDSKNSIILTYRTTPGTLARYLIDNPGEKHLDLEV 357
Query: 327 SKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEEL------KASHGADSNTSDPMVID 380
KRV L G EL Y E++RIKKE A K L +A+E+ ++ D + P +
Sbjct: 358 RKRVRLEGRELEEY-LEKDRIKKEAAKK--LEQAKEVDVDSSDESDMDDDDDLDQPTTVK 414
Query: 381 TGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVI 440
+ ++ + + G R G F + PMFP +E +WD++GE+I +D+++
Sbjct: 415 SKHHDLMMKSEGSRKG--------SFFKQAKKSYPMFPTHEERIKWDEYGEIIRLEDFLV 466
Query: 441 KDEDMDQTAMHVGGDINGKLDEGAAS------LILDTKPSKVISDERTVQVRCSLVYMDF 494
+ + + KLD G + L P+K IS+ ++++R + Y+D+
Sbjct: 467 PE-------LQATEEEKSKLDSGLTNGDEPMDQDLSVLPTKCISNVESLEIRARVTYIDY 519
Query: 495 EGRSDGRSIKNILSHVAP 512
EGRSDG SIK I++ + P
Sbjct: 520 EGRSDGDSIKKIINQMKP 537
>B4JTB6_DROGR (tr|B4JTB6) GH10247 OS=Drosophila grimshawi GN=GH10247 PE=4 SV=1
Length = 754
Score = 165 bits (417), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 110/323 (34%), Positives = 165/323 (51%), Gaps = 31/323 (9%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + I++ KS +EWMSD + K+FE R N F K+I L ++ +P
Sbjct: 262 MAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHINLCHTLADVYKLP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLA-RMLQADPPPKAVKV 324
GPKVVLAS +E+GF+ D+FV+W + N ++FT + +L+ +++ P + +++
Sbjct: 322 VGPKVVLASTPDMESGFTRDLFVQWAGNPNNSIIFTTRTGPGSLSMELVENSVPGRQLEL 381
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL Y Q E L ++K E ++S + + VI TG +
Sbjct: 382 DVRRRVELEGAELEEYLRTQG-----EKLNPLIVKPEVEESSSSESEDDIEMSVI-TGKH 435
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDED 444
I G GF + MFP +E ++DD+GEVIN DDY I D +
Sbjct: 436 DIVVRAEGRHHS--------GFFKSNKRHHVMFPFHEEKIKYDDYGEVINLDDYRIVDAN 487
Query: 445 MDQTAM-------------HVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVY 491
D TAM H NG LD L KP+K+IS +T++V +
Sbjct: 488 YDYTAMDDQNKENVKKEEPHAELHSNGNLDNDVQLL---EKPTKLISQRKTIEVHAQIQR 544
Query: 492 MDFEGRSDGRSIKNILSHVAPLK 514
+DFEGRSDG S+ ILS + P +
Sbjct: 545 IDFEGRSDGESMLKILSQLRPRR 567
>B5X4U8_SALSA (tr|B5X4U8) Cleavage and polyadenylation specificity factor subunit
2 OS=Salmo salar GN=CPSF2 PE=2 SV=1
Length = 796
Score = 162 bits (410), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 174/315 (55%), Gaps = 26/315 (8%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VY + L+ VS + +++ KS +EWMSD + + FE R N F ++++L + +L VP
Sbjct: 263 VYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLADLARVP- 321
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVL S LE+GFS ++F++W + KN V+ T + TL R L +P K + + +
Sbjct: 322 SPKVVLCSQPDLESGFSRELFIQWCQEAKNSVILTYRTTPGTLGRYLIDNPGEKMLDLEI 381
Query: 327 SKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGAD--SNTSDPMVIDTGNN 384
KRV L G EL Y E++ R+KKE A K K ++ +S +D + P ++ T ++
Sbjct: 382 RKRVKLEGRELEEYLEKE-RMKKEAAKKLEQEKEVDVDSSDESDMEDDLELPAMVKTKHH 440
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVI---- 440
+ + G R G F + PMFP +E +WD++GE+I P+D+++
Sbjct: 441 DLMMKGDGVRKG--------SFFKQAKKSYPMFPTHEERVKWDEYGEIIRPEDFLVPELQ 492
Query: 441 ---KDEDMDQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGR 497
++++ ++ M G + +D+ ++S + P+K S ++++ + Y+D+EGR
Sbjct: 493 ATEEEKNKLESCMAKGDE---PMDQDSSSKV----PTKCTSTTENLEIKARVTYIDYEGR 545
Query: 498 SDGRSIKNILSHVAP 512
SDG SIK I++ + P
Sbjct: 546 SDGDSIKKIINQMKP 560
>B4QZF3_DROSI (tr|B4QZF3) GD21403 OS=Drosophila simulans GN=GD21403 PE=4 SV=1
Length = 664
Score = 159 bits (403), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 169/326 (51%), Gaps = 33/326 (10%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + I++ KS +EWMSD + K+FE R N F K+I L + ++ N+P
Sbjct: 170 MAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYNLP 229
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD-PPPKAVKV 324
GPKVVLAS LE+GF+ D+FV+W S+ N ++ T + TLA L + P K +++
Sbjct: 230 AGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIEL 289
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL Y Q E L ++K + + S + + VI TG +
Sbjct: 290 DVRRRVDLEGAELEEYLRTQG-----EKLNPLIVKPDVEEESSSESEDDIEMSVI-TGKH 343
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD-- 442
I V P G + GF + MFP +E + D++GE+IN DDY I D
Sbjct: 344 DI---VVRPEGRHH-----SGFFKSNKRHHVMFPYHEEKVKCDEYGEIINLDDYRIADAT 395
Query: 443 ------------EDMDQTAMHVGGD--INGKLDEGAASLILDTKPSKVISDERTVQVRCS 488
E++ + +G D NG + + L+ KP+K+I+ +T++V
Sbjct: 396 GYEFVPMEEQNKENVKKEEPGMGADQQANGAIVDNDVQLL--EKPTKLINQRKTIEVNAQ 453
Query: 489 LVYMDFEGRSDGRSIKNILSHVAPLK 514
+ +DFEGRSDG S+ ILS + P +
Sbjct: 454 VQRIDFEGRSDGESMLKILSQLRPRR 479
>B3S6C6_TRIAD (tr|B3S6C6) Putative uncharacterized protein OS=Trichoplax
adhaerens GN=TRIADDRAFT_30006 PE=4 SV=1
Length = 745
Score = 159 bits (403), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 108/313 (34%), Positives = 164/313 (52%), Gaps = 35/313 (11%)
Query: 208 YPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDG 267
YPI L++VS + +++ KS +EWM D + +FE R N F KYI L + EL +P+
Sbjct: 263 YPIALLNHVSYNVVEFAKSQVEWMCDKVLVAFEDNRNNPFQFKYIQLCHSLNELSGLPE- 321
Query: 268 PKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVS 327
PKVVLAS L GF+ D+F++W + KNL +FT + TL R + D P+++ VTV
Sbjct: 322 PKVVLASSPDLTCGFARDLFLQWAGNSKNLTIFTGRSSPGTLGRHI-LDERPQSIDVTVK 380
Query: 328 KRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNNLIA 387
RV L G+EL Y +++ +K + L LK + ID+ + L
Sbjct: 381 TRVELSGNELEEYLQKEREKEKVKELDG-------LKF-----------VTIDSDDELTT 422
Query: 388 PEVAGPRGGGYRDIFVDGFVPPST-----SVAPMFPCYENTSEWDDFGEVINPDDYVIKD 442
G RD+ + S+ V PM+P E +WD++GE+INP+D+ + D
Sbjct: 423 ITGGYHTGKVKRDLMIKDDDRRSSFFKKAVVHPMYPFSETRIKWDEYGEIINPEDFTLID 482
Query: 443 ---EDMDQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSD 499
ED + H D + L++G + P+K +S + + + C + +DFEGRSD
Sbjct: 483 VSEEDKPKKVTH--SDRHYFLNKGNPKI-----PTKCVSFLKHIDINCRISLIDFEGRSD 535
Query: 500 GRSIKNILSHVAP 512
G SI+NILS V P
Sbjct: 536 GESIRNILSLVNP 548
>B4KDT1_DROMO (tr|B4KDT1) GI23051 OS=Drosophila mojavensis GN=GI23051 PE=4 SV=1
Length = 754
Score = 158 bits (399), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 106/323 (32%), Positives = 164/323 (50%), Gaps = 31/323 (9%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + I++ KS +EWMSD + K+FE R N F K+I L ++ +P
Sbjct: 262 MAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHINLCHTLADIYKLP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLAR-MLQADPPPKAVKV 324
GPKVVLAS +E+GF+ D+FV+W + N ++FT + +L+ +++ P + +++
Sbjct: 322 AGPKVVLASTPDMESGFTRDLFVQWAGNPNNSIIFTTRTGPGSLSMDLVENYSPGRQIEL 381
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
+ +RV L G EL Y Q E L ++K E + S + + VI TG +
Sbjct: 382 DLRRRVELEGAELEEYLRTQG-----EKLNPLIVKPEVEEESSSESEDDIEMSVI-TGKH 435
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD-- 442
I G GF + MFP +E ++DD+GEVIN DDY I D
Sbjct: 436 DIVVRSEGRHHS--------GFFKSNKRHHVMFPYHEEKIKYDDYGEVINLDDYRIVDTG 487
Query: 443 -----------EDMDQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVY 491
E++ + HV NG L+ L KP+K+IS +T++V +
Sbjct: 488 YDYAPTDDQNKENIKKEEPHVEPQSNGNLNNDVQLL---EKPTKLISQRKTIEVNAQIQR 544
Query: 492 MDFEGRSDGRSIKNILSHVAPLK 514
+DFEGRSDG S+ ILS + P +
Sbjct: 545 IDFEGRSDGESMLKILSQLRPRR 567
>B3P5M6_DROER (tr|B3P5M6) GG11633 OS=Drosophila erecta GN=GG11633 PE=4 SV=1
Length = 756
Score = 157 bits (397), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 164/324 (50%), Gaps = 29/324 (8%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + I++ KS +EWMSD + K+FE R N F K+I L + ++ +P
Sbjct: 262 MAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD-PPPKAVKV 324
GPKVVLAS LE+GF+ D+FV+W S+ N ++ T + TLA L + P K +++
Sbjct: 322 AGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIEL 381
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL Y Q E L ++K + S + + VI TG +
Sbjct: 382 DVRRRVELEGAELEEYLRTQG-----EKLNPLIVKPDVEDESSSESEDDIEMSVI-TGKH 435
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD-- 442
I V P G + GF + MFP +E + D++GE+IN DDY I D
Sbjct: 436 DI---VVRPEGRHH-----SGFFKSNKRHHVMFPYHEEKVKCDEYGEIINLDDYRIADAT 487
Query: 443 ------------EDMDQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLV 490
E++ + +G D G + L KP+K+I+ +T++V +
Sbjct: 488 GYDFVPMEEQNKENVKKEEPGMGADQQANGGIGDNDVQLLEKPTKLINQRKTIEVNAQVQ 547
Query: 491 YMDFEGRSDGRSIKNILSHVAPLK 514
+DFEGRSDG S+ ILS + P +
Sbjct: 548 RIDFEGRSDGESMLKILSQLRPRR 571
>Q8IML7_DROME (tr|Q8IML7) CG1957-PB, isoform B OS=Drosophila melanogaster
GN=CG1957 PE=2 SV=1
Length = 664
Score = 157 bits (397), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 168/326 (51%), Gaps = 33/326 (10%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + I++ KS +EWMSD + K+FE R N F K+I L + ++ +P
Sbjct: 170 MAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLP 229
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD-PPPKAVKV 324
GPKVVLAS LE+GF+ D+FV+W S+ N ++ T + TLA L + P K +++
Sbjct: 230 AGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIEL 289
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL Y Q E L ++K + + S + + VI TG +
Sbjct: 290 DVRRRVDLEGAELEEYLRTQG-----EKLNPLIVKPDVEEESSSESEDDIEMSVI-TGKH 343
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD-- 442
I V P G + GF + MFP +E + D++GE+IN DDY I D
Sbjct: 344 DI---VVRPEGRHH-----SGFFKSNKRHHVMFPYHEEKVKCDEYGEIINLDDYRIADAT 395
Query: 443 ------------EDMDQTAMHVGGD--INGKLDEGAASLILDTKPSKVISDERTVQVRCS 488
E++ + +G + NG + + L+ KP+K+IS +T++V
Sbjct: 396 GYEFVPMEEQNKENVKKEEPGIGAEQQANGGIVDNDVQLL--EKPTKLISQRKTIEVNAQ 453
Query: 489 LVYMDFEGRSDGRSIKNILSHVAPLK 514
+ +DFEGRSDG S+ ILS + P +
Sbjct: 454 VQRIDFEGRSDGESMLKILSQLRPRR 479
>Q17Q35_AEDAE (tr|Q17Q35) Cleavage and polyadenylation specificity factor
OS=Aedes aegypti GN=AAEL000118 PE=4 SV=1
Length = 744
Score = 157 bits (396), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 114/317 (35%), Positives = 173/317 (54%), Gaps = 27/317 (8%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + +++ KS +EWMSD + KSFE R N F K++ L EL VP
Sbjct: 262 MAYSLALLNNVSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFQFKHLRLCHTMAELAKVP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLAR-MLQADPPPKAVKV 324
PKVVLAS A +E+GFS ++FV+W S+V N ++ T + TLAR +++ + +++
Sbjct: 322 -SPKVVLASSADMESGFSRELFVQWASNVNNSIIITCRSSPGTLARDLIENGGNGRKIEL 380
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL EE R + E+ + S+IK++ S + + VI TG +
Sbjct: 381 DVRRRVELEGAEL----EEYMRTEGEKHNR-SIIKSDMDLDSSSDSDDELEMSVI-TGKH 434
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD-- 442
I V P G + GF S MFP +E ++D++GE+I PDDY + D
Sbjct: 435 DI---VVRPEGRSHT-----GFFKSSKKQYAMFPFHEEKIKFDEYGEIIQPDDYKMIDLG 486
Query: 443 -----EDMDQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGR 497
ED + + DI + DE + +LD KP+K IS + V+V + ++DFEGR
Sbjct: 487 PDGGFEDNKENQIK-PEDIKKEKDEELS--VLD-KPTKCISSRKLVEVNAQVQFIDFEGR 542
Query: 498 SDGRSIKNILSHVAPLK 514
SDG S+ ILS + P +
Sbjct: 543 SDGESMLKILSQLRPRR 559
>B3LXN9_DROAN (tr|B3LXN9) GF16269 OS=Drosophila ananassae GN=GF16269 PE=4 SV=1
Length = 756
Score = 157 bits (396), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 165/324 (50%), Gaps = 29/324 (8%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + I++ KS +EWMSD + K+FE R N F K+I L + ++ +P
Sbjct: 262 MAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADIYKLP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD-PPPKAVKV 324
GPKVVLAS LE+GF+ D+FV+W S+ N ++ T + TLA L + P + +++
Sbjct: 322 AGPKVVLASTPDLESGFTRDLFVQWASNSNNSIILTTRTSPGTLAMELVENCTPGRQIEL 381
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
+ +RV L G EL Y Q E L ++K + + S + + VI TG +
Sbjct: 382 DIRRRVELEGAELDEYLRTQG-----EKLNPLIVKPDVEEESSSESEDDIEMSVI-TGKH 435
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDED 444
I V P G + GF + MFP +E ++D++GE+IN DDY I D
Sbjct: 436 DI---VVRPEGRHH-----SGFFKSNKRHHVMFPYHEEKVKYDEYGEIINLDDYRIADTS 487
Query: 445 -MDQTAMHVGGDINGKLDE-------------GAASLILDTKPSKVISDERTVQVRCSLV 490
D M N K +E G + L KP+K+I+ +T++V +
Sbjct: 488 GYDFVPMEEQNKENVKKEEPGSGIDHQTNGTIGDTDVQLLEKPTKLINQRKTIEVNAQIQ 547
Query: 491 YMDFEGRSDGRSIKNILSHVAPLK 514
+DFEGRSDG S+ ILS + P +
Sbjct: 548 RIDFEGRSDGESMLKILSQLRPRR 571
>B4G4W9_DROPE (tr|B4G4W9) GL24248 (Fragment) OS=Drosophila persimilis GN=GL24248
PE=4 SV=1
Length = 583
Score = 156 bits (394), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 106/322 (32%), Positives = 168/322 (52%), Gaps = 27/322 (8%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + +++ KS +EWMSD + K+FE R N F K+I L ++ +P
Sbjct: 88 MAYSLALLNNVSYNVVEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHTLADVYKLP 147
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLA-RMLQADPPPKAVKV 324
GPKVVLAS LE+GF+ D+F++W S+ N ++ T + TLA +++ P + +++
Sbjct: 148 AGPKVVLASTPDLESGFTRDLFIQWASNANNSIILTTRTSPGTLAMELVENYAPGRQIEL 207
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL Y Q E + + K E + S + + VI TG +
Sbjct: 208 DVRRRVELEGAELEEYLRTQG-----EKINPLIAKPEPEEESSSESEDDIEMSVI-TGKH 261
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDED 444
I V P G + GF + MFP +E ++D++GE+IN DDY I D +
Sbjct: 262 DI---VVRPEGRHH-----SGFFKSNKRHHVMFPYHEEKIKYDEYGEIINLDDYRIADMN 313
Query: 445 ------MDQTAMHVGGDINGKLDEGAASLILDT------KPSKVISDERTVQVRCSLVYM 492
+Q +V + G E A+ +DT KP+K+I+ +T++V + +
Sbjct: 314 NTEFPPEEQNKENVKKEEPGIGIEQQANGAMDTDVQLLEKPTKLINQRKTIEVNAQIQRI 373
Query: 493 DFEGRSDGRSIKNILSHVAPLK 514
DFEGRSDG S+ ILS + P +
Sbjct: 374 DFEGRSDGESMLKILSQLRPRR 395
>B4PQ86_DROYA (tr|B4PQ86) GE26465, isoform A OS=Drosophila yakuba GN=GE26465 PE=4
SV=1
Length = 756
Score = 156 bits (394), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 108/326 (33%), Positives = 167/326 (51%), Gaps = 33/326 (10%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + I++ KS +EWMSD + K+FE R N F K+I L + ++ +P
Sbjct: 262 MAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD-PPPKAVKV 324
GPKVVLAS LE+GF+ D+FV+W S+ N ++ T + TLA L + P K +++
Sbjct: 322 AGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIEL 381
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL Y Q E L ++K + + S + + VI TG +
Sbjct: 382 DVRRRVELEGAELEEYLRTQG-----EKLNPLIVKPDVEEESSSESEDDIEMSVI-TGKH 435
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD-- 442
I V P G + GF + MFP +E + D++GE+IN DDY I D
Sbjct: 436 DI---VVRPEGRHH-----SGFFKSNKRHHVMFPYHEEKVKCDEYGEIINLDDYRIADAT 487
Query: 443 ------------EDMDQTAMHVGGD--INGKLDEGAASLILDTKPSKVISDERTVQVRCS 488
E++ + +G D NG + + L+ KP+K+ + +T++V
Sbjct: 488 GYDFVPMEEQNKENVKKEEPGLGADQQTNGGIGDNDVQLL--EKPTKLXNQRKTIEVNAQ 545
Query: 489 LVYMDFEGRSDGRSIKNILSHVAPLK 514
+ +DFEGRSDG S+ ILS + P +
Sbjct: 546 VQRIDFEGRSDGESMLKILSQLRPRR 571
>B5DYG6_DROPS (tr|B5DYG6) GA26549 OS=Drosophila pseudoobscura pseudoobscura
GN=Dpse\GA26549 PE=4 SV=1
Length = 757
Score = 154 bits (388), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 106/322 (32%), Positives = 169/322 (52%), Gaps = 27/322 (8%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + +++ KS +EWMSD + K+FE R N F K+I L ++ +P
Sbjct: 262 MAYSLALLNNVSYNVVEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHTLADVYKLP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLA-RMLQADPPPKAVKV 324
GPKVVLAS LE+GF+ D+F++W + N ++ T + TLA +++ P + +++
Sbjct: 322 AGPKVVLASTPDLESGFTRDLFIQWAGNANNSIILTTRTSPGTLAMELVENYAPGRQIEL 381
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL EE R + E+ LI E + ++S M + TG +
Sbjct: 382 DVRRRVELEGAEL----EEYLRTQGEKI--NPLIAKPEPEEESSSESEDDIEMSVITGKH 435
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDED 444
I V P G + GF + MFP +E ++D++GE+IN DDY I D +
Sbjct: 436 DI---VVRPEGRHH-----SGFFKSNKRHHVMFPYHEEKIKYDEYGEIINLDDYRIADMN 487
Query: 445 ------MDQTAMHVGGDINGKLDEGAASLILDT------KPSKVISDERTVQVRCSLVYM 492
+Q +V + G E A+ +DT KP+K+I+ +T++V + +
Sbjct: 488 NTEFPPEEQNKENVKKEEPGIGIEQQANGAMDTDVQLLEKPTKLINQRKTIEVNAQIQRI 547
Query: 493 DFEGRSDGRSIKNILSHVAPLK 514
DFEGRSDG S+ ILS + P +
Sbjct: 548 DFEGRSDGESMLKILSQLRPRR 569
>B4NF98_DROWI (tr|B4NF98) GK22564 OS=Drosophila willistoni GN=GK22564 PE=4 SV=1
Length = 757
Score = 152 bits (385), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 105/329 (31%), Positives = 164/329 (49%), Gaps = 37/329 (11%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + I++ KS +EWMSD + K+FE R N F K+I L + ++ +P
Sbjct: 262 MAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHINLCHSLADVFKLP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLA-RMLQADPPPKAVKV 324
GPKVVLAS +E+GF+ D+FV+W ++ N ++FT + +LA +++ P + +++
Sbjct: 322 AGPKVVLASTPDMESGFTRDLFVQWAANPNNSIIFTTRTSPGSLAMELVENAVPGRKIEL 381
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL Y Q E L +IK + + S + + VI TG +
Sbjct: 382 DVRRRVELEGPELEEYLRTQG-----EKLNPLIIKPDVEEESSSESEDDIEMSVI-TGKH 435
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD-- 442
I V P G GF + MFP +E ++D++GE+IN DDY I D
Sbjct: 436 DI---VVRPEGR-----HTSGFFKSNKRHHVMFPYHEEKIKYDEYGEIINLDDYRIADLG 487
Query: 443 -----------------EDMDQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQV 485
E+ NG +D L KP+K+I+ +T++V
Sbjct: 488 GYDYLPAEEQNKENVKKEEPGGGQQDQQQHANGDMDTDVQLL---EKPTKLINQRKTIEV 544
Query: 486 RCSLVYMDFEGRSDGRSIKNILSHVAPLK 514
+ +DFEGRSDG S+ ILS + P +
Sbjct: 545 NAQIQRIDFEGRSDGESMLKILSQLRPRR 573
>Q5F3I9_CHICK (tr|Q5F3I9) Putative uncharacterized protein OS=Gallus gallus
GN=RCJMB04_15m16 PE=2 SV=1
Length = 782
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 99/309 (32%), Positives = 162/309 (52%), Gaps = 16/309 (5%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VY + L+ VS + +++ KS +EWMSD + + FE R N F ++++L + ++L VP
Sbjct: 263 VYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVP- 321
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVLAS LE GFS D+F++W D KN ++ T + TLAR L +P K + + +
Sbjct: 322 CPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKVIDIEL 381
Query: 327 SKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAE-ELKASHGADSNTSDPMVIDTGNNL 385
+RV L G EL Y E++ K+ +A+ + A+ + P V T ++L
Sbjct: 382 RRRVKLEGKELEEYLEKEKLKKEAAKKLEQSKEADIDSSDESDAEEDIDQPTVHKTKHDL 441
Query: 386 IAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDEDM 445
+ +G G R F + PMFP E +WD++GE+I P+D+++ +
Sbjct: 442 MM------KGEGSRK---GSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPEL-- 490
Query: 446 DQTAMHVGGDINGKLDEGAASLILDTK--PSKVISDERTVQVRCSLVYMDFEGRSDGRSI 503
Q + L G + D P+K IS +++++ + Y+D+EGRSDG SI
Sbjct: 491 -QATEEEKSKLESGLTNGEEPMDQDLSDVPTKCISATESMEIKARVTYIDYEGRSDGDSI 549
Query: 504 KNILSHVAP 512
K I++ + P
Sbjct: 550 KKIINQMKP 558
>B4HZ45_DROSE (tr|B4HZ45) GM12754 OS=Drosophila sechellia GN=GM12754 PE=4 SV=1
Length = 743
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 101/326 (30%), Positives = 163/326 (50%), Gaps = 46/326 (14%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + I++ KS +EWMSD + K+FE R N F K+I L + ++ +P
Sbjct: 262 MAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD-PPPKAVKV 324
GPKVVLAS LE+GF+ D+FV+W S+ N ++ T + TLA L + P K +++
Sbjct: 322 AGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIEL 381
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL Y Q E L ++K + + S + + VI ++
Sbjct: 382 DVRRRVDLEGAELEEYLRTQG-----EKLNPLIVKPDVEEESSSESEDDIEMSVITGKHD 436
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD-- 442
+++ + MFP +E + D++GE+IN DDY I D
Sbjct: 437 IVS----------------------NKRHHVMFPYHEEKVKCDEYGEIINLDDYRIADAT 474
Query: 443 ------------EDMDQTAMHVGGD--INGKLDEGAASLILDTKPSKVISDERTVQVRCS 488
E++ + +G D NG + + L+ KP+K+I+ +T++V
Sbjct: 475 GYEFVPMEEQNKENVKKEEPGIGADQQANGAIVDNDVQLL--EKPTKLINQRKTIEVNAQ 532
Query: 489 LVYMDFEGRSDGRSIKNILSHVAPLK 514
+ +DFEGRSDG S+ ILS + P +
Sbjct: 533 VQRIDFEGRSDGESMLKILSQLRPRR 558
>Q4R8N0_MACFA (tr|Q4R8N0) Testis cDNA clone: QtsA-12018, similar to human
cleavage and polyadenylation specific factor 2,
100kDa(CPSF2), OS=Macaca fascicularis PE=2 SV=1
Length = 592
Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 165/310 (53%), Gaps = 18/310 (5%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VY + L+ VS + +++ KS +EWMSD + + FE R N F ++++L ++L VP
Sbjct: 73 VYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP- 131
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVLAS LE GFS D+F++W D KN ++ T + TLAR L +P K ++ +
Sbjct: 132 SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIEL 191
Query: 327 SKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGAD--SNTSDPMVIDTGNN 384
KRV L G EL Y E++ K+ K ++ +S +D + P T ++
Sbjct: 192 RKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-SKEADIDSSDESDIEEDIDQPSAHKTKHD 250
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDED 444
L+ +G G R F + PMFP E +WD++GE+I P+D+++ +
Sbjct: 251 LMM------KGEGSRK---GSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQ 301
Query: 445 M--DQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRS 502
++ + G NG DE + D P+K IS +++++ + Y+D+EGRSDG S
Sbjct: 302 ATEEEKSKLESGLTNG--DEPMDQDLSDV-PTKCISTTESIEIKARVTYIDYEGRSDGDS 358
Query: 503 IKNILSHVAP 512
IK I++ + P
Sbjct: 359 IKKIINQMKP 368
>B3KMI0_HUMAN (tr|B3KMI0) cDNA FLJ11050 fis, clone PLACE1004564, highly similar
to Cleavage and polyadenylation specificity factor 100
kDa subunit OS=Homo sapiens PE=2 SV=1
Length = 644
Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 165/310 (53%), Gaps = 18/310 (5%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VY + L+ VS + +++ KS +EWMSD + + FE R N F ++++L ++L VP
Sbjct: 125 VYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP- 183
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVLAS LE GFS D+F++W D KN ++ T + TLAR L +P K ++ +
Sbjct: 184 SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIEL 243
Query: 327 SKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGAD--SNTSDPMVIDTGNN 384
KRV L G EL Y E++ K+ K ++ +S +D + P T ++
Sbjct: 244 RKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-SKEADIDSSDESDIEEDIDQPSAHKTKHD 302
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDED 444
L+ +G G R F + PMFP E +WD++GE+I P+D+++ +
Sbjct: 303 LMM------KGEGSRK---GSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQ 353
Query: 445 M--DQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRS 502
++ + G NG DE + D P+K IS +++++ + Y+D+EGRSDG S
Sbjct: 354 ATEEEKSKLESGLTNG--DEPMDQDLSDV-PTKCISTTESIEIKARVTYIDYEGRSDGDS 410
Query: 503 IKNILSHVAP 512
IK I++ + P
Sbjct: 411 IKKIINQMKP 420
>B3KME1_HUMAN (tr|B3KME1) cDNA FLJ10765 fis, A-NT2RP4000111, highly similar to
Cleavage and polyadenylation specificity factor 100 kDa
subunit OS=Homo sapiens GN=CPSF2 PE=2 SV=1
Length = 782
Score = 147 bits (370), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 165/310 (53%), Gaps = 18/310 (5%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VY + L+ VS + +++ KS +EWMSD + + FE R N F ++++L ++L VP
Sbjct: 263 VYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP- 321
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVLAS LE GFS D+F++W D KN ++ T + TLAR L +P K ++ +
Sbjct: 322 SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIEL 381
Query: 327 SKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGAD--SNTSDPMVIDTGNN 384
KRV L G EL Y E++ K+ K ++ +S +D + P T ++
Sbjct: 382 RKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-SKEADIDSSDESDIEEDIDQPSAHKTKHD 440
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDED 444
L+ +G G R F + PMFP E +WD++GE+I P+D+++ +
Sbjct: 441 LMM------KGEGSRK---GSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQ 491
Query: 445 M--DQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRS 502
++ + G NG DE + D P+K IS +++++ + Y+D+EGRSDG S
Sbjct: 492 ATEEEKSKLESGLTNG--DEPMDQDLSDV-PTKCISTTESIEIKARVTYIDYEGRSDGDS 548
Query: 503 IKNILSHVAP 512
IK I++ + P
Sbjct: 549 IKKIINQMKP 558
>B3KN45_HUMAN (tr|B3KN45) cDNA FLJ13521 fis, clone PLACE1005876, highly similar
to Cleavage and polyadenylation specificity factor 100
kDa subunit OS=Homo sapiens PE=2 SV=1
Length = 690
Score = 147 bits (370), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 165/310 (53%), Gaps = 18/310 (5%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VY + L+ VS + +++ KS +EWMSD + + FE R N F ++++L ++L VP
Sbjct: 171 VYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP- 229
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVLAS LE GFS D+F++W D KN ++ T + TLAR L +P K ++ +
Sbjct: 230 SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIEL 289
Query: 327 SKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGAD--SNTSDPMVIDTGNN 384
KRV L G EL Y E++ K+ K ++ +S +D + P T ++
Sbjct: 290 RKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-SKEADIDSSDESDIEEDIDQPSAHKTKHD 348
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDED 444
L+ +G G R F + PMFP E +WD++GE+I P+D+++ +
Sbjct: 349 LMM------KGEGSRK---GSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQ 399
Query: 445 M--DQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRS 502
++ + G NG DE + D P+K IS +++++ + Y+D+EGRSDG S
Sbjct: 400 ATEEEKSKLESGLTNG--DEPMDQDLSDV-PTKCISTTESIEIKARVTYIDYEGRSDGDS 456
Query: 503 IKNILSHVAP 512
IK I++ + P
Sbjct: 457 IKKIINQMKP 466
>A4RR19_OSTLU (tr|A4RR19) Predicted protein OS=Ostreococcus lucimarinus (strain
CCE9901) GN=OSTLU_12108 PE=4 SV=1
Length = 715
Score = 144 bits (364), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 98/328 (29%), Positives = 157/328 (47%), Gaps = 49/328 (14%)
Query: 201 WAEENL-VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
WA+ L Y I L+ V+ +T+D+ KS LEWM D + +FE+ REN F K++TL +
Sbjct: 236 WAQRQLGSYQIVLLTNVAYNTLDFAKSHLEWMGDHVTNAFERRRENPFNTKFLTLCHSME 295
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLA----RMLQA 315
EL +P GPKVVLAS SLEAG S +F EW D NLV+ T + + +L ++
Sbjct: 296 ELQALPPGPKVVLASFGSLEAGPSRHLFAEWAEDKSNLVILTGQPEHGSLTEQVVQLSAK 355
Query: 316 DPPPKAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSD 375
K +K+T+S+R+PL G EL +E + E K +L+
Sbjct: 356 ATAKKKIKLTLSRRIPLEGSELAEHESSRKSSTSTELEKKESETEADLR----------- 404
Query: 376 PMVIDTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINP 435
RD +GF P ST PMFP D+G+ I+
Sbjct: 405 ----------------------RRDTLTEGFTPISTPHGPMFPDEVWEPTMTDYGQEIDI 442
Query: 436 DDY---------VIKDEDMDQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVR 486
+ + + E M +T V D++ E P+K++++ R + +R
Sbjct: 443 ETFHQISQMSSGIPIPEPMKETT--VVDDLDVANIEEDEEEEPQEVPTKLVTETREINIR 500
Query: 487 CSLVYMDFEGRSDGRSIKNILSHVAPLK 514
+++ +DFEG++DG+S++ +++ AP +
Sbjct: 501 ATIITVDFEGKADGKSVRTLITQAAPRR 528
>B1H337_XENTR (tr|B1H337) LOC100145546 protein OS=Xenopus tropicalis
GN=LOC100145546 PE=2 SV=1
Length = 783
Score = 140 bits (354), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/308 (31%), Positives = 154/308 (50%), Gaps = 14/308 (4%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VY + L+ VS + +++ KS +EWMSD + + FE R N F +++TL ++L VP
Sbjct: 263 VYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHGFSDLARVP- 321
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVLAS LE GFS ++F++W D KN V+ T + TLAR L P + + + +
Sbjct: 322 SPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRTTPGTLARFLIDHPSERIIDIEL 381
Query: 327 SKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNNLI 386
KRV L G EL Y E++ K+ +A+ + D + +
Sbjct: 382 RKRVKLEGKELEEYLEKEKLKKEAAKKLEQSKEADLDSSDDSDAEEDIDQTTSHKAKHDL 441
Query: 387 APEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDEDM- 445
+ G R G F + PMFP E +WD++GE+I P+D+++ +
Sbjct: 442 MMKNEGSRKG--------SFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQAT 493
Query: 446 -DQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRSIK 504
D+ G NG E L P+K IS +++++ + Y+D+EGRSDG SIK
Sbjct: 494 EDEKTKLESGLTNG---EEPMDQDLSDVPTKCISATESMEIKARVTYIDYEGRSDGDSIK 550
Query: 505 NILSHVAP 512
I++ + P
Sbjct: 551 KIINQMKP 558
>Q7QC68_ANOGA (tr|Q7QC68) AGAP002474-PA OS=Anopheles gambiae GN=AGAP002474 PE=4
SV=4
Length = 745
Score = 140 bits (352), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 101/316 (31%), Positives = 159/316 (50%), Gaps = 32/316 (10%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ S + +++ KS +EWMSD + KSFE R N F K++ L +L VP
Sbjct: 262 MAYSLALLNNQSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFTFKHLRLCHTMADLAKVP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLAR-MLQADPPPKAVKV 324
PKVVLAS LE+GFS ++F++W + N ++ T + TLAR +++ + +++
Sbjct: 322 -SPKVVLASSPDLESGFSRELFIQWAPNASNSIIITSRSSPGTLARDLIENGGNGRKIEM 380
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
+ +RV L G EL Y + E L S+ K +DS+ M + TG +
Sbjct: 381 DIRRRVELEGAELEEYMRTEG-----EKLNRSIKK--RDLDESSSDSDDELEMNVITGKH 433
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDED 444
I V P G + GF S MFP +E ++D++GE+I PDDY + D
Sbjct: 434 DI---VVRPEGRSHT-----GFFKSSKKNYAMFPFHEEKIKYDEYGEIIQPDDYRMVDLG 485
Query: 445 MDQTAMHVGGDING----------KLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDF 494
+ GGD N K ++ +LD KP+K + + ++V + ++DF
Sbjct: 486 PETN----GGDDNKENGGIKTEDIKKEKEDEVTVLD-KPTKCVQSRKPIEVNAQVQFIDF 540
Query: 495 EGRSDGRSIKNILSHV 510
EGRSDG S+ ILS +
Sbjct: 541 EGRSDGESLLKILSQL 556
>B0WQG5_CULQU (tr|B0WQG5) Cleavage and polyadenylation specificity factor subunit
2 OS=Culex quinquefasciatus GN=CpipJ_CPIJ009643 PE=4
SV=1
Length = 747
Score = 139 bits (351), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 105/313 (33%), Positives = 162/313 (51%), Gaps = 27/313 (8%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + +++ KS +EWMSD + KSFE R N F K++ L +L VP
Sbjct: 262 MAYSLALLNNVSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFQFKHLRLCHTMADLAKVP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLAR-MLQADPPPKAVKV 324
PKVVLAS +E+GFS ++FV+W +V N ++ T + TLAR ++ + +++
Sbjct: 322 -SPKVVLASSPDMESGFSRELFVQWAGNVNNSIIITCRSSPGTLARDLIDNGGNGRKLEL 380
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL Y + E S+IK++ S + + VI TG +
Sbjct: 381 DVRRRVELEGAELDEYMRTEG-----EKHNRSVIKSDMDLDSSSDSEDELEMSVI-TGKH 434
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD-- 442
I V P G + GF S MFP +E ++D++GE+I D+Y + D
Sbjct: 435 DI---VVRPEGRSHT-----GFFKSSKKQYAMFPFHEEKIKFDEYGEIIQADEYRMVDLG 486
Query: 443 ----EDMDQTAMHVGGDING-KLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGR 497
ED + DI K+D+ +LD KP+K I+ + V+V + ++DFEGR
Sbjct: 487 PDGAEDNKENHQIKPEDIKKEKMDDMT---VLD-KPTKCINSRKLVEVNAQVQFIDFEGR 542
Query: 498 SDGRSIKNILSHV 510
SDG S+ ILS +
Sbjct: 543 SDGESMLKILSQL 555
>Q01GI5_OSTTA (tr|Q01GI5) Polyadenylation cleavage/specificity factor 100 kDa
subunit (ISS) OS=Ostreococcus tauri GN=Ot01g02480 PE=4
SV=1
Length = 807
Score = 139 bits (349), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 101/346 (29%), Positives = 165/346 (47%), Gaps = 47/346 (13%)
Query: 201 WAEENL-VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
W + L Y I L+ V+ +T+D+ KS LEWM D + +FE+ REN F K+IT+
Sbjct: 282 WQQRQLGSYQIVLLTNVAYNTLDFAKSHLEWMGDLVTSAFERRRENPFNTKFITICHTMD 341
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFT---EKGQFATLARMLQAD 316
EL +P GPKVVLAS SLEAG + +F EW D NLV+ T E+G + +
Sbjct: 342 ELKALPPGPKVVLASFGSLEAGPARHLFAEWAGDKSNLVVLTGQPEEGSLMEEVVRVSSK 401
Query: 317 PPPKA-VKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKA---EELKASHGADSN 372
P K VK T+S+RVPL G+EL +E + K ++ + EE+ + +
Sbjct: 402 PAAKKNVKFTLSRRVPLEGEELATHESTRKADKSKKEEEKKPEHVSVEEEMVDIKPVEPD 461
Query: 373 TSDPMVIDTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWD----D 428
+PM + G + R+ +GF P T PMF WD D
Sbjct: 462 EPEPMDVLFGVTTVGSTAEADL--RRRETLTEGFTPIMTQHGPMFA----DEVWDPVMTD 515
Query: 429 FGEVINPDDYVIKDEDMDQTAMHVGGDINGKLDEGAASLILDTK---------------- 472
+G+ I+ + ++ +T+ G + +L + +++ D
Sbjct: 516 YGQEIDIELFM-------RTSQQASGRMVPELAKEPSTMFEDPSVEMIEEQQLVEAAQEA 568
Query: 473 ------PSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAP 512
P+K++S+ V V+ +++ +DFEG++DG+S++ ++ AP
Sbjct: 569 EEDEEIPTKLVSEAVEVSVKATILTIDFEGKADGQSVRTLIEQAAP 614
>A8PAE9_BRUMA (tr|A8PAE9) Cleavage and polyadenylation specificity factor,
putative OS=Brugia malayi GN=Bm1_20430 PE=4 SV=1
Length = 831
Score = 136 bits (342), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 110/358 (30%), Positives = 166/358 (46%), Gaps = 55/358 (15%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + LS+V+SS +++ KS +EWMSD + KSFE R N F +++ L +L V
Sbjct: 261 MTYNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRYNPFQFRHVQLCHTHIDLMRVR 320
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATL-ARML----QADPPP- 319
PKVVL S +E+GFS ++F+EW +D+KN V+ T + TL AR++ QA P
Sbjct: 321 -SPKVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRSGDRTLGARLIRMAEQAAENPN 379
Query: 320 ----KAVKVTVSKRVPLVGDELIAY-------EEEQNRIKKEE----------------- 351
+ + + V +R+ L G EL Y E E RI+ E
Sbjct: 380 GTINRNLTLEVKRRIRLDGVELENYRAKKRAEEREATRIRLEASRRNARLEQADSSDDSD 439
Query: 352 --------ALKASLIKAEELKASHGADSNTSDPMVIDTGNNLIAPEVAGPRGGGYRDIF- 402
A + ++ + + S+ S T +L A ++A R DI
Sbjct: 440 DDAVMVVPATTSGILNGKMTNSKRNIASSFSASTTTSTTADLSAAQIAEQRS---HDIMW 496
Query: 403 ------VDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDEDMDQTAMHVGGDI 456
F S PMFP E + WDD+GE+I P++Y+I D + D
Sbjct: 497 KWEQQQKSSFFKQSKKSFPMFPYIEEKTRWDDYGEIIRPEEYMIVDTPVVPQIPPEHKDG 556
Query: 457 NGKLDEGAASLILDTK--PSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAP 512
+G + + + PSK IS ++V C + ++DFEGRSDG S K ILS + P
Sbjct: 557 TDSTFDGQVVPLYEEREWPSKCISQIMKMEVLCKVDFIDFEGRSDGESAKKILSQIKP 614
>B4PQ87_DROYA (tr|B4PQ87) GE26465, isoform B OS=Drosophila yakuba GN=GE26465 PE=4
SV=1
Length = 548
Score = 132 bits (331), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 150/299 (50%), Gaps = 33/299 (11%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + I++ KS +EWMSD + K+FE R N F K+I L + ++ +P
Sbjct: 262 MAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD-PPPKAVKV 324
GPKVVLAS LE+GF+ D+FV+W S+ N ++ T + TLA L + P K +++
Sbjct: 322 AGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIEL 381
Query: 325 TVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNN 384
V +RV L G EL Y Q E L ++K + + S + + VI TG +
Sbjct: 382 DVRRRVELEGAELEEYLRTQG-----EKLNPLIVKPDVEEESSSESEDDIEMSVI-TGKH 435
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD-- 442
I V P G + GF + MFP +E + D++GE+IN DDY I D
Sbjct: 436 DI---VVRPEGRHH-----SGFFKSNKRHHVMFPYHEEKVKCDEYGEIINLDDYRIADAT 487
Query: 443 ------------EDMDQTAMHVGGD--INGKLDEGAASLILDTKPSKVISDERTVQVRC 487
E++ + +G D NG + + L+ KP+K+I+ +T++V
Sbjct: 488 GYDFVPMEEQNKENVKKEEPGLGADQQTNGGIGDNDVQLL--EKPTKLINQRKTIEVNA 544
>Q8WPK6_OIKDI (tr|Q8WPK6) Similar to cleavage and polyadenylation factor
OS=Oikopleura dioica GN=BAC001.26 PE=4 SV=1
Length = 765
Score = 130 bits (327), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 163/339 (48%), Gaps = 39/339 (11%)
Query: 199 GYWAEE-------NLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKY 251
G W +E NLV +S V++STI+ K +EWMS+ I F RENIF L
Sbjct: 251 GIWNDERYGLGRVNLV----MVSNVATSTIEAAKGMIEWMSEKIISKFTHKRENIFDLTK 306
Query: 252 ITLLVNKTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLAR 311
+ L + E+ +P+ PKV+LA+ ++ GFS ++FV + KN V+ + + +L R
Sbjct: 307 MKLRSSIQEIARIPE-PKVILATPMDMDTGFSRELFVMMAAHPKNAVIMSGRSTKGSLCR 365
Query: 312 MLQADPPPKAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADS 371
+ + ++ + ++KR+PLVG EL YE+++ + + +K ++ + + +++
Sbjct: 366 KIIENEGMSSITLEMNKRLPLVGPELEEYEKQKEQERNANLIKRLEEESSDESENEMSET 425
Query: 372 NTSDPMVI--DTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDF 429
+ + +++I P + GG+ F P+FP EN +WDD+
Sbjct: 426 ISVRKKTVKGKRTHDIIMPHHVQKKEGGF-------FKKARKEKFPLFPFNENRIKWDDY 478
Query: 430 GEVINPDDY----------------VIKDEDMDQTAMHVGGDINGKLDEGAASLILDTKP 473
GE+INPDDY + +++ H D K E + P
Sbjct: 479 GEIINPDDYKTHELIPESEPVNINNLTENQQSVTFGRHKPNDSRKKQKEEPVE--EEKAP 536
Query: 474 SKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAP 512
+K I V +RCS+ +++FEGR DG S +LS + P
Sbjct: 537 TKCIKTREQVSIRCSIEFINFEGRVDGESQLQLLSTIKP 575
>B6ECP4_9EUCA (tr|B6ECP4) Cyclin B OS=Fenneropenaeus penicillatus PE=2 SV=1
Length = 475
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 90/285 (31%), Positives = 152/285 (53%), Gaps = 29/285 (10%)
Query: 227 FLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGPKVVLASMASLEAGFSHDI 286
+EWMS+ + K+F+ R N F K++ N T+L +P PKVVLAS L G++ ++
Sbjct: 1 MIEWMSEKLTKAFDSLRTNPFSFKHLKFCHNLTDLSRLP-SPKVVLASFPDLGCGYAREL 59
Query: 287 FVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVSKRVPLVGDELIAYEEEQNR 346
FV+W ++ KN ++ T + TLAR L +P + K+ +R+ L G EL +E R
Sbjct: 60 FVQWATNPKNTIILTSRTGPDTLARRLIDNPQIRTFKLLEKRRMKLEGSEL----DEHYR 115
Query: 347 IKKEEALKASLIKAEELKASHGADSNTSDPMVIDTGNNLIAPEVAGPRGGGYRDIFVDGF 406
+K+EE + IK EE+++S +++ ++ G + I + G + +F
Sbjct: 116 MKREEEQQQQRIKMEEVESSSDSENEDG----LEAGKHDI---IVLHEKAGNQSMF---- 164
Query: 407 VPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVI---KDEDMDQTA-MHVGGDINGKLDE 462
S PMFP +E DD+GE IN +D+ I KD++ + + + + + +D
Sbjct: 165 --RSRKHHPMFPFHEEKIRGDDYGEYINLEDFDISSMKDDNKENLENLQIPYEDDDLMD- 221
Query: 463 GAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNIL 507
++ PSK +S TV+V ++++DFEGRSDG SI+ I+
Sbjct: 222 ------IEEPPSKCVSQTVTVRVTAQVLFIDFEGRSDGESIRKIV 260
>B6K3N6_SCHJP (tr|B6K3N6) Cleavage factor two Cft2/polyadenylation factor CPSF-73
OS=Schizosaccharomyces japonicus yFS275 GN=SJAG_03226
PE=4 SV=1
Length = 786
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 168/359 (46%), Gaps = 49/359 (13%)
Query: 200 YWAEEN--LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
+WA L +PIYFLSY S+ TI Y KS +EWM D I + F E++ ++I + +
Sbjct: 245 HWASSQPPLSFPIYFLSYSSAKTIGYAKSMIEWMGDNIVRDFGMN-ESLLEFRHIQTITH 303
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSD-VKNLVLFTEKGQFA--TLARML- 313
++L + GPKV++A+ +LE+GFS ++ ++ D NL+L T+K +++ +LA+
Sbjct: 304 PSQLSQISPGPKVIIATSLTLESGFSQNVLLDIMPDNSNNLILLTQKSRYSENSLAKQFY 363
Query: 314 -----QADPPPKAV---------KVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIK 359
+ P+ + V PL G+EL ++E++ + +A +L
Sbjct: 364 RYWERASRKSPENFSSVGMYFEQSIQVKHSEPLQGEELREFQEKEQSKRTRDAEDIAL-- 421
Query: 360 AEELKASHGADSNTSDPMVIDTGNNLIAPEVAGPR--------GGGYRDIFVDGFVPPST 411
EL+ D + S+ D PE++ G + D +
Sbjct: 422 --ELRNRTILDEDESEESSSDEDELTQVPELSNTNLGSAAFMSGKTFDLNLRDPNIASLQ 479
Query: 412 SVAPMFPCYENTSEWDDFGEVINPDDYVIK---------DEDMDQTAMHVGGDI------ 456
S MFP E +DD+GE++ +D+ ++ +E+ D H
Sbjct: 480 SKFKMFPYVEKRRRFDDYGEILRQEDFAMEERTAGIVEGEENEDYAPAHESTGKRKWAEV 539
Query: 457 -NGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAPLK 514
NG++ E + + PSK+++ R +++ C + ++D EG DGRS+K I+ V P +
Sbjct: 540 NNGQISENQLNEDMPDVPSKIVTTTRYLKISCQVAFIDMEGLHDGRSLKTIIPQVNPRR 598
>Q3US33_MOUSE (tr|Q3US33) Putative uncharacterized protein (Fragment) OS=Mus
musculus GN=Cpsf2 PE=2 SV=1
Length = 493
Score = 121 bits (304), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 79/236 (33%), Positives = 125/236 (52%), Gaps = 13/236 (5%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VY + L+ VS + +++ KS +EWMSD + + FE R N F ++++L ++L VP
Sbjct: 263 VYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP- 321
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVLAS LE GFS D+F++W D KN ++ T + TLAR L +P K ++ +
Sbjct: 322 SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIEL 381
Query: 327 SKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGAD--SNTSDPMVIDTGNN 384
KRV L G EL Y E++ K+ K ++ +S +D + P T ++
Sbjct: 382 RKRVKLEGKELEEYVEKEKLKKEAAKKLEQ-SKEADIDSSDESDVEEDVDQPSAHKTKHD 440
Query: 385 LIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVI 440
L+ +G G R F + PMFP E +WD++GE+I P+D+++
Sbjct: 441 LMM------KGEGSRK---GSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLV 487
>A8XUS3_CAEBR (tr|A8XUS3) CBR-CPSF-2 protein OS=Caenorhabditis briggsae
GN=Cbr-cpsf-2 PE=4 SV=2
Length = 842
Score = 118 bits (296), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 163/351 (46%), Gaps = 50/351 (14%)
Query: 201 WAEENL---VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAK-SFEQTRENIFLLKYITLLV 256
WA ++ Y + +S+V+SS + + KS LEWM + + + R N F LK + L+
Sbjct: 253 WANQDAGLSTYNLVMMSHVASSVVQFAKSQLEWMDEKLFRYDSSSARYNPFTLKNVNLVH 312
Query: 257 NKTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEK-GQFATLARMLQA 315
+ EL + PKVVL S +E GFS ++F++W +D +N V+ T + F AR+++
Sbjct: 313 SHLELIKIR-SPKVVLCSSQDMETGFSRELFLDWCADQRNGVILTARPASFTLAARLVEL 371
Query: 316 DP---------PPKAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKAS 366
K + + V KRVPL G+EL+ Y+ R K E + + I+ E +A
Sbjct: 372 AERANDGVLRNEDKHLSLLVRKRVPLEGEELLEYK----RRKAERDAEETRIRME--RAR 425
Query: 367 HGADSNTSDPMVIDTGNNLIAPEVAGPRGGGYRDIFVD----------------GFVPPS 410
A +N SD D I P ++ + I D F +
Sbjct: 426 RQAQANESDDSDDDDIAAPIVPRLSEKDHRSFDAIENDSHCFDIMAKWDNQQKASFFKST 485
Query: 411 TSVAPMFPCYENTSEWDDFGEVINPDDYVIKDEDMDQTAMHVGGD------INGKLDEGA 464
PM+P E +WDD+GEVI P+DY + + + M G + ++ + DE
Sbjct: 486 KKSFPMYPYIEEKVKWDDYGEVIKPEDYTV----ISKIDMRKGKNKDEPVVVHKREDEEE 541
Query: 465 ASLILDTK---PSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAP 512
D P+K + +++ C + ++++EG SDG S K +L+ + P
Sbjct: 542 VYNPNDHDEEMPTKCVEFRNRIEISCRVEFIEYEGISDGESTKKMLAGLMP 592
>B6NF52_BRAFL (tr|B6NF52) Putative uncharacterized protein (Fragment)
OS=Branchiostoma floridae GN=BRAFLDRAFT_127339 PE=4 SV=1
Length = 712
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/144 (43%), Positives = 85/144 (59%), Gaps = 4/144 (2%)
Query: 200 YW--AEENL-VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLV 256
YW AE L Y + L+ V+ + +++ KS +EWMSD I + FE R N F K++ L
Sbjct: 551 YWQNAETGLQAYNLCLLNNVAYNVVEFAKSQVEWMSDKIMRVFEDNRNNPFQFKHLKLCH 610
Query: 257 NKTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD 316
+ +EL VPD PKVVLAS+ LE+GFS ++FV+W + KN V+ T + TL RML +
Sbjct: 611 SLSELHKVPD-PKVVLASVPDLESGFSRELFVQWCQNQKNTVVLTSRPGPGTLGRMLIDN 669
Query: 317 PPPKAVKVTVSKRVPLVGDELIAY 340
P K + KRV L G EL Y
Sbjct: 670 PKMKTFTLQARKRVRLEGPELEEY 693
>Q3URN8_MOUSE (tr|Q3URN8) Putative uncharacterized protein (Fragment) OS=Mus
musculus GN=Cpsf2 PE=2 SV=1
Length = 396
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 55/134 (41%), Positives = 80/134 (59%), Gaps = 1/134 (0%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VY + L+ VS + +++ KS +EWMSD + + FE R N F ++++L ++L VP
Sbjct: 263 VYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP- 321
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVLAS LE GFS D+F++W D KN ++ T + TLAR L +P K ++ +
Sbjct: 322 SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIEL 381
Query: 327 SKRVPLVGDELIAY 340
KRV L G EL Y
Sbjct: 382 RKRVKLEGKELEEY 395
>Q3UGU6_MOUSE (tr|Q3UGU6) Putative uncharacterized protein (Fragment) OS=Mus
musculus GN=Cpsf2 PE=2 SV=1
Length = 412
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/131 (41%), Positives = 79/131 (60%), Gaps = 1/131 (0%)
Query: 207 VYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD 266
VY + L+ VS + +++ KS +EWMSD + + FE R N F ++++L ++L VP
Sbjct: 263 VYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP- 321
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV 326
PKVVLAS LE GFS D+F++W D KN ++ T + TLAR L +P K ++ +
Sbjct: 322 SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIEL 381
Query: 327 SKRVPLVGDEL 337
KRV L G EL
Sbjct: 382 RKRVKLEGKEL 392
>B4M067_DROVI (tr|B4M067) GJ24636 OS=Drosophila virilis GN=GJ24636 PE=4 SV=1
Length = 693
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 84/140 (60%), Gaps = 1/140 (0%)
Query: 206 LVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVP 265
+ Y + L+ VS + I++ KS +EWMSD + K+FE R N F K+I L ++ +P
Sbjct: 262 MAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIHLCHTLADIYKLP 321
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLA-RMLQADPPPKAVKV 324
GPKVVLAS +E+GF+ D+FV+W S+ N ++FT + +L+ +++ P + +++
Sbjct: 322 AGPKVVLASTPDMESGFTRDLFVQWASNPNNSIIFTTRTGPGSLSMELVENSTPGRQIEL 381
Query: 325 TVSKRVPLVGDELIAYEEEQ 344
V +RV L G EL Y Q
Sbjct: 382 DVRRRVELEGAELEEYLRTQ 401
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 35/62 (56%), Gaps = 3/62 (4%)
Query: 451 HVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHV 510
HV NG D L KP+K+IS +T++V + +DFEGRSDG S+ ILS +
Sbjct: 446 HVEQQTNGNQDNDVQML---EKPTKLISQRKTIEVNAQIQRIDFEGRSDGESMLKILSQL 502
Query: 511 AP 512
P
Sbjct: 503 RP 504
>B0CXU7_LACBS (tr|B0CXU7) Predicted protein OS=Laccaria bicolor (strain
S238N-H82) GN=LACBIDRAFT_231539 PE=4 SV=1
Length = 901
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/393 (25%), Positives = 163/393 (41%), Gaps = 90/393 (22%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKS---FEQT-REN---------- 245
+W L YPI LS + +V+S +EW+ TI+K E T R+N
Sbjct: 315 HWNYSRLRYPICLLSRTGREMLTFVRSMMEWLGGTISKEDVGEEGTGRQNQNKRRRDEEG 374
Query: 246 -------IFLLKYITLLVNKTELDNV--PDGPKVVLASMASLEAGFSHDIFVEWGSDVKN 296
+ +++ N L PK++LA ASL G S ++F ++ + N
Sbjct: 375 DEDALGALTFFRHLEFFPNPQALLQTYSSKDPKLILAVPASLSHGPSRNMFSDFAAVPDN 434
Query: 297 LVLFTEKGQFATLARML------QADPPPK--------------AVKVTVSKRVPLVGDE 336
+VL T + + TL R L P K A+ + ++ +VPL G E
Sbjct: 435 VVLLTGRSEEGTLGRALFDKWNNSQRPDDKWDKGKIGSNVMMDGAITIKMNHKVPLQGAE 494
Query: 337 LIAY-EEEQNRIKKEEALKASLIKAE----------------------ELKASHGADSNT 373
L A+ +EE+ +KE A +A+L + + E++ + G D
Sbjct: 495 LEAHLQEERVAKEKEAAHQAALARNQRMLEADEDDSDSDLDSDADEEAEVRQALGGDM-- 552
Query: 374 SDPMVIDTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAP----------MFPCYENT 423
M D G L ++ DI++ G V +TS MFP E
Sbjct: 553 ---MDTDDGEGLTKQLLSF-------DIYIKGNVSKATSFFKISGSQTQRFRMFPYVEKK 602
Query: 424 SEWDDFGEVINPDDYVIKDEDMDQTAM--HVGGDINGKLDEGAASLILDTKPSKVISDER 481
D++GE I+ ++ K + +++ A V E A + PSK ++ E
Sbjct: 603 RRVDEYGETIDVGMWLRKGKVLEEEAESDEVKDYKRRTQAEEEAKASIREPPSKYVTTEI 662
Query: 482 TVQVRCSLVYMDFEGRSDGRSIKNILSHVAPLK 514
+Q+ C L+++D EG +DGR++K I+ V P K
Sbjct: 663 EIQLACRLLFVDMEGLNDGRAVKTIVPQVNPRK 695
>A9VA41_MONBE (tr|A9VA41) Predicted protein OS=Monosiga brevicollis GN=34190 PE=4
SV=1
Length = 770
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 92/339 (27%), Positives = 147/339 (43%), Gaps = 39/339 (11%)
Query: 208 YPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNV--P 265
YP++FLS+ S + +++ ++ +E+MSD + + N F I KT +D V
Sbjct: 261 YPVFFLSHNSYNVLEFAQTLIEFMSDKMLVKLQTMTHNPFACPNIK--CQKT-VDGVMRS 317
Query: 266 DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATL-ARMLQADPPPKAVKV 324
G KVV+ +SLEAGF ++ + +N LF + +L AR+L ++
Sbjct: 318 AGAKVVIVPHSSLEAGFGRELLFRLAGEARNRFLFIARPPPHSLGARLLAKSGQIHTIQF 377
Query: 325 TVSKRVPLVGDELIAYEE---EQNRIKKEEALKASLIKAE-----------ELKASHGAD 370
RV L G+EL AY + E+ + +KE+AL + +AE E + H AD
Sbjct: 378 EHRFRVQLEGEELKAYRQHKAEEAKQQKEDALAQA--RAEGTFVGSDSEDDEDEDDHVAD 435
Query: 371 SNTSDPMV---IDTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSV--APMFPCYENTS- 424
P ID ++ A R R + F S V A + +
Sbjct: 436 LPMRLPGTQPSIDAVHHTPQQTRAKDRTFRSRRQALTTFPFQSNKVVRASTYDSFMGAQK 495
Query: 425 -EWDDFGEVINPDDYVIKDEDMDQTAMHVGGD----------INGKLDEGAASLILDTKP 473
EWDD+G + + + D + D + E AS+ +P
Sbjct: 496 VEWDDYGMTFDREKLKLLDSHLATGLEAPAADEADKPAEDSNLEAMQAELTASIQEAERP 555
Query: 474 SKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAP 512
SKV++ +R + VRC + Y+D EG SD S+ NIL + P
Sbjct: 556 SKVVAQQRDLSVRCQVEYLDLEGLSDRESMLNILERMRP 594
>B6NF51_BRAFL (tr|B6NF51) Putative uncharacterized protein OS=Branchiostoma
floridae GN=BRAFLDRAFT_60691 PE=4 SV=1
Length = 282
Score = 83.2 bits (204), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 61/108 (56%), Gaps = 15/108 (13%)
Query: 405 GFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDEDMDQTAMHVGGDINGKLDEGA 464
GF + PMFP E +WDD+GE+I P+DY++ + M Q +E
Sbjct: 10 GFFKQAKKAYPMFPAPEERVKWDDYGEIIKPEDYMVVE--MTQA------------EEEK 55
Query: 465 ASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAP 512
A + D P+K I E T+ ++C +VY+DFEGRSDG S+K IL+ + P
Sbjct: 56 AKALTDV-PTKSIVQELTLDIKCRVVYIDFEGRSDGESMKKILTQLKP 102
>A8NYN1_COPC7 (tr|A8NYN1) Putative uncharacterized protein OS=Coprinopsis cinerea
(strain Okayama-7 / 130 / FGSC 9003) GN=CC1G_01385 PE=4
SV=1
Length = 926
Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 97/409 (23%), Positives = 159/409 (38%), Gaps = 101/409 (24%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAK-------SFEQTREN------- 245
+W L YPI LS + +V+S +EW+ TI+K + Q R
Sbjct: 321 HWNYSRLTYPICLLSRTGREMLTFVRSMMEWLGGTISKEDVGEEGNKRQDRNKRRRDDED 380
Query: 246 ---------IFLLKYITLLVNKTEL--DNVPDGPKVVLASMASLEAGFSHDIFVEWGSDV 294
K++ N L + PK++LA ASL G S +F ++ +
Sbjct: 381 GVEEALGALALRFKHLEFFPNPQALLQRHSSKDPKLILAVPASLSHGPSRQLFADFAAVP 440
Query: 295 KNLVLFTEKGQFATLARML--------QADPP------------PKAVKVTVSKRVPLVG 334
N+VL T +G TL R L + D A+K+ + +VPL G
Sbjct: 441 DNVVLLTTRGAEGTLGRALFDKWNNSQRGDDKWDKGRIGRNVMMDGAIKIKMYHKVPLQG 500
Query: 335 DELIAY-EEEQNRIKKEEALKASLIKAEEL------------------------KASHGA 369
EL Y +E+ +KE A +A++ + + + + + G
Sbjct: 501 AELEEYLAKERAAKEKEAAQQAAMARNQRMLEADEDDSDSESDSDSDADDEEEVREALGG 560
Query: 370 DSNTSDPMVIDTGNNLIAPEVAGPRGG--------GYR------DIFVDGFVPPSTSVAP 415
D + V + GN + G GY DI++ G V STS
Sbjct: 561 DMD-----VDEAGNRRRRRGMKKSSDGADWGDGDEGYTKQLLSFDIYLKGKVSKSTSFFK 615
Query: 416 ----------MFPCYENTSEWDDFGEVINPDDYVIKDEDMDQTAMHVGGDINGKLDEGAA 465
MFP E D++GE ++ ++ K + +++ A G E
Sbjct: 616 SVGGQTQRFRMFPYVEKKRRVDEYGETVDVGLWLRKGKALEEEAEKKEKMEEGATIEEED 675
Query: 466 SLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAPLK 514
+ PSK ++ E VQ+ C L+++D EG +DGR++K I+ V P +
Sbjct: 676 KIA--EPPSKYVTSEVEVQLACRLLFIDMEGLNDGRAVKTIVPQVNPRR 722
>A5DGP1_PICGU (tr|A5DGP1) Putative uncharacterized protein OS=Pichia
guilliermondii GN=PGUG_02442 PE=4 SV=2
Length = 821
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 96/409 (23%), Positives = 161/409 (39%), Gaps = 112/409 (27%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFE--------QTRENIFLL--KYITLLVNK 258
P+YFLSY + ++Y S LEWMS ++ K +E T +N F + LL +
Sbjct: 154 PVYFLSYSGTKVLNYASSLLEWMSTSLVKEWEAASSASMNSTNKNNFPFDPSKVDLLSDP 213
Query: 259 TELDNVPDGPKVVLASMASLEAG-FSHDIFVEWGSDVKNLVLFTEKGQFAT--------- 308
EL + GPK+VL + + +G S ++ SD KN VL TEK F
Sbjct: 214 KELIQL-SGPKIVLCAGIDMNSGDVSFEVLKYLCSDQKNTVLLTEKTHFGADFSINAQLF 272
Query: 309 -----------------LARMLQADPPPKAVKVTVSKRVPLVGDELIAYEEEQNRIKK-- 349
LA + P + +S+ PL G EL +++E N +K
Sbjct: 273 TDWVRLSREKYGNAEDGLAIGYEGTIPLRG----LSREDPLSGSELTSFQERINHQRKKK 328
Query: 350 ----------EEALKASLIKAEELKASHGADSNTSD--------------PMVIDTGNNL 385
+ L A ++ E+ + G D+ +SD P IDT N
Sbjct: 329 LFEQVRDRKNQNLLNADNLEEEDSSSDDGEDAESSDEEMPTTTETEAGAMPGAIDTNVNA 388
Query: 386 IAPEVAGPRGGGYR--------DIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDD 437
I + A + D+ + + P ++ P P ++ ++DD+GEVI+ D
Sbjct: 389 IVTQDAFVADQVKQTLDDELPLDVKITHKLKPRQAMFPYIPPHKR--KFDDYGEVIDIKD 446
Query: 438 Y----------VIKD--------------EDMDQTAMHVGG----------DINGKLDEG 463
Y +I D D D+ + GG +N ++ +
Sbjct: 447 YQRAEDLTNAKLISDSKRKFEQEDKLKWGNDDDRRSGRGGGIQTNRLTPQETLNNQILQK 506
Query: 464 AASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAP 512
+ + +++ + ++ RCSL ++D G D RS+ I+S + P
Sbjct: 507 NLHTLFQPRKRVIVTKTQDLKFRCSLSFVDLAGLVDLRSLSLIVSSLKP 555
>A2R7F5_ASPNC (tr|A2R7F5) Contig An16c0120, complete genome. OS=Aspergillus niger
(strain CBS 513.88 / FGSC A1513) GN=An16g03340 PE=3 SV=1
Length = 865
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 82/340 (24%), Positives = 136/340 (40%), Gaps = 60/340 (17%)
Query: 210 IYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENI----FLLKYITLLVNKTELDNV- 264
+Y +++T+ +S LEWM + I + FE E F K++ +L K L+ +
Sbjct: 307 LYLAGRKANTTMRLARSMLEWMDENIVREFEAAEEGKGVGPFTFKHLRILERKKRLEKIL 366
Query: 265 -PDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVK 323
PKV+LAS SL+ GF+ D NL+L TE P +
Sbjct: 367 SDQKPKVILASDTSLDWGFAKDSLRLVAEGANNLLLLTE---------------PLHKER 411
Query: 324 VT---VSKRVPLVGDELIA-YEEEQNRIKKEEALKASLIKAEEL---------------- 363
+T S +G + YEE ++ + E+A +++ E L
Sbjct: 412 ITEGQESTHRKTLGSMIWQWYEERKDGVALEKASDGEMLEQENLDNAADALDDRSSSTSE 471
Query: 364 ---KASHGADSNTSDPMVIDTGNNL-IAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPC 419
G N S + N L ++ E G R D V MFP
Sbjct: 472 DSETEQQGRVLNFSTSLAHSNRNKLGLSDEDLGVNILLRRKNVYDYDVRGKKGRERMFPY 531
Query: 420 YENTSEWDDFGEVINPDDY-----VIKDEDMDQTAMHVGGDINGKLDEGAASLILDTKPS 474
+ D+FGE I P+D + +D ++D A+ ++ G+ EG P+
Sbjct: 532 VAPRKKGDEFGEFIRPEDTADELSLAEDGEVD-AAVSSEDEVEGQSFEG---------PA 581
Query: 475 KVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAPLK 514
K + ++ T+ + L Y+DF G D RS++ ++ + P K
Sbjct: 582 KAVYEKATLTINARLAYVDFTGLHDKRSLEMLIPLIQPRK 621
>B2VU84_PYRTR (tr|B2VU84) Cleavage and polyadenylation specificity factor subunit
2 OS=Pyrenophora tritici-repentis (strain Pt-1C-BFP)
GN=PTRG_02057 PE=4 SV=1
Length = 954
Score = 72.4 bits (176), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 78/176 (44%), Gaps = 43/176 (24%)
Query: 210 IYFLSYVSSSTIDYVKSFLEWMSDTIAKSFE-------------QTRENI----FLLKYI 252
+Y S +T+ YV+S LEWM + I K FE Q E + F ++I
Sbjct: 318 LYLASRTGGATMRYVRSMLEWMEEGIVKEFEASAADQDRRTKEGQEEERVAKVPFDFRHI 377
Query: 253 TLLVNKTELDNV--PDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEK-----GQ 305
TLL KT + + GP+V+LAS A+LE GFS D SD KNLV+ TE+ Q
Sbjct: 378 TLLERKTRVARMLAGAGPRVILASDATLEWGFSKDAIRSLASDEKNLVILTERSGELGSQ 437
Query: 306 FATLARML-------QADPPPKAVKVTV------------SKRVPLVGDELIAYEE 342
L R L A P A TV + V L GDE+ Y++
Sbjct: 438 KKGLGRYLWDLWNQRNASPGEDAPSTTVIDASGNQAPLDTVRTVALQGDEVPLYQQ 493
>Q7S0J8_NEUCR (tr|Q7S0J8) Putative uncharacterized protein OS=Neurospora crassa
GN=NCU06869 PE=4 SV=2
Length = 986
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 128/332 (38%), Gaps = 83/332 (25%)
Query: 202 AEENLVYP---IYFLSYVSSSTIDYVKSFLEWMSDTIAKSFE-----------------Q 241
A++N V+ ++ SST+ +S LEWM D+I + FE Q
Sbjct: 304 AKDNDVFKSAKLFLAGRTISSTMKNARSMLEWMDDSIIREFEAFADESRRNNRRDEGNHQ 363
Query: 242 TRENIFLLKYITLLVNKTELDNVPDG-------PKVVLASMASLEAGFSHDIFVEWGSDV 294
T F KY+ LL K ++D + KV+LAS SL+ GFS DI +D
Sbjct: 364 TGPGPFDFKYLRLLERKAQIDKILQQSDDAEPRAKVILASDTSLDWGFSKDILKSIAADA 423
Query: 295 KNLVLFTEK-----GQFATLARML-----------------------QADPPPKAVKVTV 326
+NLV+ TEK Q +++R L Q + +++
Sbjct: 424 RNLVILTEKPNLEPNQKPSISRTLWEWWKERRDGVATERTSNGDTFEQVYAGNRELEIET 483
Query: 327 SKRVPLVGDELIAYEEEQNRIKKEEALKASLIK-AEELKASHG-----ADSNT-SDPMVI 379
++R L GDEL Y Q + + L+A+L L + G ADS+T S+
Sbjct: 484 AERKGLEGDELNVY---QQWLATQRQLQATLQSGGTNLLEAPGDVLDDADSDTDSESEGS 540
Query: 380 DTGNNLIAPEVAGPRGGGYRDIFV------------------DGFVPPSTSVAPMFPCYE 421
DT A +A R V D V + MFP
Sbjct: 541 DTEQQGKALNIANTMAQASRKKVVLRDEDLGVTILIKKENVYDFNVRGTKGRDRMFPVAM 600
Query: 422 NTSEWDDFGEVINPDDYVIKDEDMDQTAMHVG 453
D+FGE+I P+DY+ +E D G
Sbjct: 601 RRRRADEFGELIRPEDYLRAEEREDAENQEAG 632
>B6GY49_PENCH (tr|B6GY49) Pc12g15810 protein OS=Penicillium chrysogenum Wisconsin
54-1255 GN=Pc12g15810 PE=4 SV=1
Length = 893
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/349 (23%), Positives = 136/349 (38%), Gaps = 65/349 (18%)
Query: 210 IYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRE--------------------NIFLL 249
+Y ++TI +S LEWM + I + FE F
Sbjct: 326 LYLAGKKVTNTIRLARSMLEWMDENIVREFEAAESADVTNGQRTGGQDKSTSKGGGPFTF 385
Query: 250 KYITLLVNKTELDNV--PDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFA 307
K++ ++ K L+ + GPKV+LAS S++ GFS + NL+L TE F
Sbjct: 386 KHLKIIERKKRLEKLLAEPGPKVILASDTSMDWGFSKHALRQVAEGPNNLLLMTE--SFR 443
Query: 308 TLARMLQADPPPKAVKVTVSKRVPLVGDELIA-YEEEQNRIKKEEALKASLIKAEELKAS 366
+ ++ P S+ +G + YEE ++ + E+ I+ E L+ +
Sbjct: 444 KDMQTQESKP---------SQSSATIGSMIWQWYEERRDGVALEKGSDGEHIEQETLETA 494
Query: 367 --------------------HGADSNTSDPMVIDTGNNL-IAPEVAGPRGGGYRDIFVDG 405
G N S + T + L ++ E G R D
Sbjct: 495 ADALDDGSSSTTSEDSDPEHQGRALNFSASLAHATRSKLAVSDEDLGINILLRRKNVYDY 554
Query: 406 FVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDEDMDQTAMHVGGDINGKLDEGAA 465
V MFP + D++GE I P+D V ED D A ++ G+ EG
Sbjct: 555 DVRGKKGRERMFPYVAPRKKGDEYGEFIRPEDLVSDGEDAD-VAAESEDEVEGQSFEG-- 611
Query: 466 SLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAPLK 514
P+KV+ + +T+ + + ++DF G D RS++ ++ + P K
Sbjct: 612 -------PAKVVYNTQTITINARIAFIDFMGLHDKRSLEMLIPLIQPQK 653
>Q4R8A1_MACFA (tr|Q4R8A1) Testis cDNA clone: QtsA-13018, similar to human
cleavage and polyadenylation specific factor 2,
100kDa(CPSF2), OS=Macaca fascicularis PE=2 SV=1
Length = 341
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 60/100 (60%), Gaps = 5/100 (5%)
Query: 415 PMFPCYENTSEWDDFGEVINPDDYVIKDEDM--DQTAMHVGGDINGKLDEGAASLILDTK 472
PMFP E +WD++GE+I P+D+++ + ++ + G NG DE + D
Sbjct: 21 PMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--DEPMDQDLSDV- 77
Query: 473 PSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAP 512
P+K IS +++++ + Y+D+EGRSDG SIK I++ + P
Sbjct: 78 PTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKP 117
>Q6CAZ0_YARLI (tr|Q6CAZ0) YALI0C23232p OS=Yarrowia lipolytica GN=YALI0C23232g
PE=4 SV=1
Length = 799
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/346 (21%), Positives = 144/346 (41%), Gaps = 45/346 (13%)
Query: 210 IYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGPK 269
+ L+++ + ++Y S LEWMS +I +E+ E+ F + ++ + + D V G
Sbjct: 282 LVLLTHLGARLLEYASSMLEWMSPSIIAEWEKKNESPFQTRNFKIVHSMDQFDKVVKGGN 341
Query: 270 ---VVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQ-------ADPPP 319
VV++ LE+GFS +F SD +N VLFTE+ + +LA LQ D
Sbjct: 342 GQFVVVSVGEDLESGFSRLLFNRLASDERNSVLFTERSEGNSLATELQDKWEKTERDGNS 401
Query: 320 KAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDPMVI 379
+ + ++P A +E + + ++KA EL+ + ++
Sbjct: 402 AKMDFQTTLKMPTYTPLSEAEMKEYRTTVESQQKDLQMVKAMELR-NKELLEEAEAEEMM 460
Query: 380 DTGNNLIAPEVAGPRGGGYRDIFVDGF---VPPSTSVAPM--FPCYENTSEWDDFGEVIN 434
D+ ++ +V+ +G G F+ G V +V + FP Y+ ++G I+
Sbjct: 461 DSSDD---EDVSRMQGSGQEYGFLHGTVLDVDVRDAVGSLRNFPFYQKRQRVSEYGIPIH 517
Query: 435 PDDYVIKDE--------------DMDQTAMHVGGDINGKLDEGAASLI------------ 468
P D+ +E D D+ +E ++
Sbjct: 518 PSDFARVEERPEVAWKERDRNEFDSDEPRKRQRRRTKAAAEEQEERVVEDADDAPETITS 577
Query: 469 LDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAPLK 514
LD +P +V ++ + + C + ++D GR D RS+ I+ + P K
Sbjct: 578 LDNQPIRVSYEDVDLNIICHVDFVDLSGRIDERSLGMIMHSIHPKK 623
>A8PTN4_MALGO (tr|A8PTN4) Putative uncharacterized protein OS=Malassezia globosa
(strain ATCC 96807 / CBS 7966) GN=MGL_0452 PE=4 SV=1
Length = 862
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 87/383 (22%), Positives = 146/383 (38%), Gaps = 75/383 (19%)
Query: 200 YWA--EENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRE------------- 244
+WA ++ +P+ +S+ ++ ++F+EWMS A E
Sbjct: 311 HWAFSYQHQRFPLCLVSHTGQEVVERARTFMEWMSREWAIQLLDAPEASSRRKTTSSSSS 370
Query: 245 ----------NIFLLKYITLLVNKTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDV 294
+ L++ + V P KVVLA+ +L G S + E+ D
Sbjct: 371 SSAATAKSPLDFSGLRFYSS-VEALHQALTPSQVKVVLATPPALSHGLSRQLLPEFLCDP 429
Query: 295 KNLVLFTEKGQFATLARML-------QADPP---------PKAVKVTVS----KRVPLVG 334
L++ T +G ++L R L QAD P +V +S +RVPL G
Sbjct: 430 DALLILTSRGTPSSLVRNLWDRWNAKQADRDAWRQGHVGVPVSVGGQLSYELRRRVPLAG 489
Query: 335 DELIAYEEEQN-RIKKEEALKASLIKAE----------------------ELKASHGADS 371
DEL Y E Q R +A +A + + + + + S +
Sbjct: 490 DELRTYVERQKAREAAADAPRARIQQPQREADDVDDDDASSSDSSSDDEFDGQPSRLPST 549
Query: 372 NTSDPMVIDTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENTSEWDDFGE 431
T P N APE P G + DIF+ G V MFP E + D +GE
Sbjct: 550 RTIAPERAQMQLNAAAPE---PVGMSF-DIFLRGQVSRDAVHYRMFPHIERKRKVDGYGE 605
Query: 432 VINPDDYVIKDEDMDQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVY 491
I+ ++ + ++ K + + P K SD VRC ++Y
Sbjct: 606 SIDTSRWLARRRRLEAEQEEQLNPERLKPQKKRTRPV--DVPCKYTSDTLNAAVRCHVLY 663
Query: 492 MDFEGRSDGRSIKNILSHVAPLK 514
+D +G +DGR++ ++ + P +
Sbjct: 664 VDLQGLNDGRALTTLVPQLQPRR 686
>Q0UZX3_PHANO (tr|Q0UZX3) Putative uncharacterized protein OS=Phaeosphaeria
nodorum GN=SNOG_02691 PE=4 SV=2
Length = 957
Score = 66.6 bits (161), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 62/129 (48%), Gaps = 26/129 (20%)
Query: 211 YFLSYVSSSTIDYVKSFLEWMSDTIAKSFE--------QTRENI---------------F 247
Y S +T+ YV+S LEWM + I K FE Q + N+ F
Sbjct: 318 YLASRTGGATMRYVRSMLEWMGEGIVKEFEAASGAAEGQGQRNVRGAPGRDDGRGIRTPF 377
Query: 248 LLKYITLLVNKTELDNVPDG--PKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEK-G 304
++ITLL K + + + P+V+LAS SLE GFS D SD KNLV+ TE+ G
Sbjct: 378 DFQHITLLEKKARVTRMLNATEPRVILASDTSLEWGFSKDAIRSLASDEKNLVILTERVG 437
Query: 305 QFATLARML 313
+ T + L
Sbjct: 438 ELGTQEKGL 446
>A8IL96_CHLRE (tr|A8IL96) Predicted protein (Fragment) OS=Chlamydomonas
reinhardtii GN=CHLREDRAFT_128192 PE=4 SV=1
Length = 389
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 58/101 (57%), Gaps = 1/101 (0%)
Query: 213 LSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGPKVVL 272
LSY + +++ ++ LE++ + ++F R F + + ++ +L +P GPKVVL
Sbjct: 290 LSYTIKTVLEFARTQLEYLGSEMVQAFSHKRTIPFTFRKLAVITRLEDLGAIP-GPKVVL 348
Query: 273 ASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARML 313
A++ SL+ G + + V+W + +N ++FTE+ TLA L
Sbjct: 349 ATLPSLDCGPARQLLVDWAAAPRNTIIFTERANPGTLAHAL 389
>Q5KIP3_CRYNE (tr|Q5KIP3) Cleavage and polyadenylation specificity factor
subunit, putative (Putative uncharacterized protein)
OS=Cryptococcus neoformans GN=CNBD4110 PE=4 SV=1
Length = 899
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 153/367 (41%), Gaps = 61/367 (16%)
Query: 208 YPIYFLSYVSSSTIDYVKSFLEWMSDTI------------------AKSFEQTRENIFLL 249
YP+ +S + + + +S ++WM + A+ + +
Sbjct: 311 YPLCIVSKTAQDMVAFARSLIDWMGGVVKDSAGDMVDVGRGKRARGARMALGSEYGVLDF 370
Query: 250 KYITLLVNKTEL-DNVP-DGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFA 307
+++ +N T+L P PK+VLA ++ G S +F + N+++ T + +
Sbjct: 371 RHVQFFLNTTDLLQTYPLTRPKLVLAVPPTMSHGPSRFLFTAMANTEGNVIMLTGRSEEQ 430
Query: 308 TLARML--------------------QADPPPKAVKVTVSKRVPLVGDELIAY-EEEQNR 346
TLAR L ++V V +VPL G EL A+ E E+ +
Sbjct: 431 TLARDLYNRWERSQTTGSKWGEGKIGHLTQLEGKLQVEVDSKVPLSGAELEAHVESERLQ 490
Query: 347 IKKEEALKAS------LIKAEELKASHG----ADSNTSDPMVIDT-GNNLIAPEVAGPRG 395
+KE A KA+ +++A++L++ AD + D V T G N A + R
Sbjct: 491 KEKEAAHKAAVDRSRRMLEADDLESDSDSESEADGHAGDITVRRTEGANAYAGDGEDVRT 550
Query: 396 GGYRDIFVDGFVPPSTSVAPM-----FPCYENTS-EWDDFGEVINPDDYVIKDEDMDQTA 449
+ DI+V G S A M FP E + D FGE ++ ++ K ++ +
Sbjct: 551 MSF-DIYVKGQQMRSGRGAEMARFRMFPFVERKGRKIDQFGEGLDIGQWMRKGREIAEEG 609
Query: 450 MHVGGDINGKLDEGAASLILDTK--PSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNIL 507
K E PSK +S+E V+++ + ++D EG DG+SIK I+
Sbjct: 610 ETEEVREAKKRKEEEEEKAKQAPEPPSKYVSEEVGVELKAMIGFVDMEGLHDGQSIKTII 669
Query: 508 SHVAPLK 514
S + P K
Sbjct: 670 SDLQPRK 676
>Q4S6C2_TETNG (tr|Q4S6C2) Chromosome 9 SCAF14729, whole genome shotgun sequence
OS=Tetraodon nigroviridis GN=GSTENG00023355001 PE=4 SV=1
Length = 698
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 54/104 (51%), Gaps = 5/104 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
+W NL PIYF + ++ Y K F+ W + I K+F Q N+F K+I +++
Sbjct: 323 FWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQ--RNMFEFKHIKAF-DRS 379
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEK 303
DN GP VV A+ L AG S IF +W + KN+V F +
Sbjct: 380 YADN--PGPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVQFLRR 421
>A7TPD0_VANPO (tr|A7TPD0) Putative uncharacterized protein OS=Vanderwaltozyma
polyspora (strain ATCC 22028 / DSM 70294) GN=Kpol_1009p4
PE=4 SV=1
Length = 778
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 61/110 (55%), Gaps = 4/110 (3%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD-G 267
PIY+ S ++ + ++++ M+D I K F ++ N F+ K+I+ L N LD D G
Sbjct: 272 PIYYASNLAKKCMSVYQTYVNMMNDDIRKKFRDSQTNPFIFKHISYLKN---LDEFQDFG 328
Query: 268 PKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
P V+LAS L+ G S D+ +W + KN+VL T T+A+ + +P
Sbjct: 329 PSVMLASPGMLQNGLSRDLLEKWCPEDKNMVLITGYSVEGTMAKYIMLEP 378
>B6ABK4_9CRYT (tr|B6ABK4) Putative uncharacterized protein OS=Cryptosporidium
muris RN66 GN=CMU_027660 PE=4 SV=1
Length = 1097
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 64/117 (54%), Gaps = 9/117 (7%)
Query: 201 WAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFL-LKYITLLVNKT 259
W + ++YPI +S +SSS I K+ +EWMS I F TR N F LK I + +
Sbjct: 299 WNKSVILYPIVVVSPMSSSFILSAKTLIEWMSLDIRSEFCNTRFNPFHGLKNILI---EN 355
Query: 260 ELDNVPDGP-----KVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLAR 311
L+NV GP KV+ AS AS++ G+S ++F + S N V+FT + + T A
Sbjct: 356 TLNNVRTGPSAKLPKVIFASPASMDYGYSRELFADLASSPNNSVIFTREPKLNTFAH 412
>B6PQF2_BRAFL (tr|B6PQF2) Putative uncharacterized protein OS=Branchiostoma
floridae GN=BRAFLDRAFT_289082 PE=4 SV=1
Length = 597
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 55/101 (54%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
+W N+ PIYF + ++ +Y + F+ W + I K+F + N+F K+I +++
Sbjct: 248 FWERMNIKAPIYFSTGLTEKANNYYRLFITWTNQKIRKTF--VKRNMFEFKHIKAF-DRS 304
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN GP VV A+ L AG S IF +W D KN+V+
Sbjct: 305 YIDN--PGPMVVFATPGMLHAGLSLQIFKKWAPDSKNMVIM 343
>A8Y5J3_MOUSE (tr|A8Y5J3) Cleavage and polyadenylation specific factor 3-like
OS=Mus musculus GN=Cpsf3l PE=4 SV=1
Length = 578
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
+W NL PIYF + ++ Y K F+ W + I K+F Q N+F K+I ++T
Sbjct: 233 FWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQ--RNMFEFKHIKAF-DRT 289
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
DN GP VV A+ L AG S IF +W + KN+V+
Sbjct: 290 FADN--PGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 328
>Q9N0D6_MACFA (tr|Q9N0D6) Unnamed protein product OS=Macaca fascicularis PE=2
SV=1
Length = 328
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 51/99 (51%), Gaps = 5/99 (5%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
+W NL PIYF + ++ Y K F+ W + I K+F Q N+F K+I ++
Sbjct: 191 FWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQ--RNMFEFKHIKAF-DRA 247
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLV 298
DN GP VV A+ L AG S IF +W + KN+V
Sbjct: 248 FADN--PGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMV 284
>B3RHF9_YEAS1 (tr|B3RHF9) Putative uncharacterized protein OS=Saccharomyces
cerevisiae (strain RM11-1a) GN=SCRG_04228 PE=4 SV=1
Length = 779
Score = 62.4 bits (150), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 69/126 (54%), Gaps = 8/126 (6%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD-G 267
PI++ S ++ + ++++ M+D I K F ++ N F+ K I+ L N L++ D G
Sbjct: 275 PIFYASNLAKKCMSVFQTYVNMMNDDIRKKFRDSQTNPFIFKNISYLRN---LEDFQDFG 331
Query: 268 PKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLAR--MLQADPPPKAV--K 323
P V+LAS L++G S D+ W + KNLVL T T+A+ ML+ D P +
Sbjct: 332 PSVMLASPGMLQSGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPE 391
Query: 324 VTVSKR 329
+T+ +R
Sbjct: 392 ITIPRR 397
>A7A1G1_YEAS7 (tr|A7A1G1) Cleavage factor II (CF II) component OS=Saccharomyces
cerevisiae (strain YJM789) GN=YSH1 PE=4 SV=1
Length = 779
Score = 62.4 bits (150), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 69/126 (54%), Gaps = 8/126 (6%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD-G 267
PI++ S ++ + ++++ M+D I K F ++ N F+ K I+ L N L++ D G
Sbjct: 275 PIFYASNLAKKCMSVFQTYVNMMNDDIRKKFRDSQTNPFIFKNISYLRN---LEDFQDFG 331
Query: 268 PKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLAR--MLQADPPPKAV--K 323
P V+LAS L++G S D+ W + KNLVL T T+A+ ML+ D P +
Sbjct: 332 PSVMLASPGMLQSGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPE 391
Query: 324 VTVSKR 329
+T+ +R
Sbjct: 392 ITIPRR 397
>A2ADA4_MOUSE (tr|A2ADA4) Cleavage and polyadenylation specific factor 3-like
OS=Mus musculus GN=Cpsf3l PE=4 SV=1
Length = 600
Score = 62.4 bits (150), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
+W NL PIYF + ++ Y K F+ W + I K+F Q N+F K+I ++T
Sbjct: 255 FWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQ--RNMFEFKHIKAF-DRT 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
DN GP VV A+ L AG S IF +W + KN+V+
Sbjct: 312 FADN--PGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350
>Q17BY3_AEDAE (tr|Q17BY3) Cleavage and polyadenylation specificity factor
OS=Aedes aegypti GN=AAEL004757 PE=4 SV=1
Length = 613
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ +Y K F+ W + I K+F Q N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFAVGLTEKANNYYKMFITWTNQKIRKTFVQ--RNMFDFKHIKPF-DKG 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>B0WXD2_CULQU (tr|B0WXD2) Cleavage and polyadenylation specificity factor subunit
3 OS=Culex quinquefasciatus GN=CpipJ_CPIJ011955 PE=4
SV=1
Length = 615
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YP+YF ++ +Y K F+ W + I K+F Q N+F K+I +K
Sbjct: 255 YWERMNLKYPVYFAVGLTEKANNYYKMFITWTNQKIRKTFVQ--RNMFDFKHIKPF-DKG 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>Q5AEE3_CANAL (tr|Q5AEE3) Putative uncharacterized protein CFT2 OS=Candida
albicans GN=CFT2 PE=4 SV=1
Length = 931
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 89/406 (21%), Positives = 156/406 (38%), Gaps = 104/406 (25%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGP 268
P+YFLSY + + Y + L+WMS + K +E+ F + LL++ +EL + GP
Sbjct: 267 PVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELLKL-SGP 325
Query: 269 KVVLASMASLEAG-FSHDIFVEWGSDVKNLVLFTEKG--QFAT------------LARML 313
K+V S L +G S + F +D ++ TEK FA+ LA+
Sbjct: 326 KIVFCSGIDLRSGDISAEAFQYLCNDEHTTIILTEKTTMNFASSLSSVLYTEWDSLAKKR 385
Query: 314 QADPPPKAVKVTV---------SKRVPLVGDELIAYEEEQNRIKKEEAL-------KASL 357
+ V + +K V L G EL ++E+ + +KE+ L ++
Sbjct: 386 GGGESEDGIAVPIDKNISLKNWTKEVELTGTELTEFQEKVAQKRKEKLLAKVRDQKNQNI 445
Query: 358 IKAEELKA-------------SHGADSNTSDPMVIDTGNNL------IAPEVAGPRGGGY 398
+ A+ + + + NTS ++I N+ +AP P
Sbjct: 446 LSADTVDSEDSSDDDDEGDNEAEKQKGNTSSNLLIKQYQNINVADSNVAPNEVNPLATHE 505
Query: 399 R----------------DIFVDGFVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKD 442
D+ + + P + P F + ++DD+GEVI +DY D
Sbjct: 506 AFITDHIKQSLEKNLPIDLKITHKLRPRQATFPYFAT-AHKQKFDDYGEVIKIEDYQRHD 564
Query: 443 E--------------DMDQTA---------------MHVGGDINGKLDEGAASLILDTKP 473
E D +TA + +N KL + + + K
Sbjct: 565 EVSHSKIIMEGKRKFDEKRTANNRRNKNQNKQQANKLTPQEQVNRKLLQKYLDTLSNPKK 624
Query: 474 -------SKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAP 512
+K S+ + ++VRC L ++D G+ D RS+ I+ + P
Sbjct: 625 RVGLNYGTKKKSETQKLKVRCGLSFVDLSGQVDLRSLGIIVQALKP 670
>Q7Q184_ANOGA (tr|Q7Q184) AGAP009923-PA OS=Anopheles gambiae GN=AGAP009923 PE=4
SV=4
Length = 608
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YP+YF ++ +Y K F+ W + I K+F Q N+F K+I +K
Sbjct: 255 YWERMNLKYPVYFAVGLTEKANNYYKMFITWTNQKIRKTFVQ--RNMFDFKHIKPF-DKG 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>B4JYM6_DROGR (tr|B4JYM6) GH14325 OS=Drosophila grimshawi GN=GH14325 PE=4 SV=1
Length = 597
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>Q9VAH9_DROME (tr|Q9VAH9) CG1972-PA (LD08814p) OS=Drosophila melanogaster
GN=CG1972 PE=2 SV=1
Length = 597
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>B3P5D8_DROER (tr|B3P5D8) GG11690 OS=Drosophila erecta GN=GG11690 PE=4 SV=1
Length = 597
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>Q29AN7_DROPS (tr|Q29AN7) GA15164 OS=Drosophila pseudoobscura pseudoobscura
GN=GA15164 PE=4 SV=1
Length = 597
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>B4R083_DROSI (tr|B4R083) GD21460 OS=Drosophila simulans GN=GD21460 PE=4 SV=1
Length = 597
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKN 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>B4PPK8_DROYA (tr|B4PPK8) GE23879 OS=Drosophila yakuba GN=GE23879 PE=4 SV=1
Length = 597
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>B4HZE1_DROSE (tr|B4HZE1) GM12816 OS=Drosophila sechellia GN=GM12816 PE=4 SV=1
Length = 597
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKN 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>B4M5T1_DROVI (tr|B4M5T1) GJ10637 OS=Drosophila virilis GN=GJ10637 PE=4 SV=1
Length = 597
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>Q5TA44_HUMAN (tr|Q5TA44) Cleavage and polyadenylation specific factor 3-like
(Fragment) OS=Homo sapiens GN=CPSF3L PE=4 SV=1
Length = 401
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
+W NL PIYF + ++ Y K F+ W + I K+F Q N+F K+I ++
Sbjct: 147 FWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQ--RNMFEFKHIKAF-DRA 203
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
DN GP VV A+ L AG S IF +W + KN+V+
Sbjct: 204 FADN--PGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 242
>B4NBL1_DROWI (tr|B4NBL1) GK11920 OS=Drosophila willistoni GN=GK11920 PE=4 SV=1
Length = 597
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>B4G664_DROPE (tr|B4G664) GL23717 OS=Drosophila persimilis GN=GL23717 PE=4 SV=1
Length = 597
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>A8K5S2_HUMAN (tr|A8K5S2) cDNA FLJ77752 OS=Homo sapiens PE=2 SV=1
Length = 606
Score = 60.5 bits (145), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
+W NL PIYF + ++ Y K F+ W + I K+F Q N+F K+I ++
Sbjct: 261 FWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQ--RNMFEFKHIKAF-DRA 317
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
DN GP VV A+ L AG S IF +W + KN+V+
Sbjct: 318 FADN--PGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 356
>B4K612_DROMO (tr|B4K612) GI10422 OS=Drosophila mojavensis GN=GI10422 PE=4 SV=1
Length = 597
Score = 60.5 bits (145), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>B3KPR3_HUMAN (tr|B3KPR3) cDNA FLJ32090 fis, clone OCBBF2000719, highly similar
to Homo sapiens cleavage and polyadenylation specific
factor 3-like, transcript variant 1, mRNA OS=Homo
sapiens PE=2 SV=1
Length = 606
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
+W NL PIYF + ++ Y K F+ W + I K+F Q N+F K+I ++
Sbjct: 261 FWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQ--RNMFEFKHIKAF-DRA 317
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
DN GP VV A+ L AG S IF +W + KN+V+
Sbjct: 318 FADN--PGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 356
>B6K781_SCHJP (tr|B6K781) Endoribonuclease ysh1 OS=Schizosaccharomyces japonicus
yFS275 GN=SJAG_04587 PE=4 SV=1
Length = 771
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/168 (25%), Positives = 86/168 (51%), Gaps = 13/168 (7%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGP 268
PIY+ S ++ + ++++ M+D I K+F + N F+ +YI L + + D++ GP
Sbjct: 245 PIYYASSLARKCMAVYQTYINMMNDKIRKAFAE--RNPFIFRYIKSLRSIDKFDDI--GP 300
Query: 269 KVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTV-- 326
V+LAS L+ G S + W D +N +L T T+A+++ +P +++T
Sbjct: 301 SVILASPGMLQNGVSRTLLERWAPDARNTLLLTGYSVEGTMAKLIANEP----IEITTLS 356
Query: 327 SKRVP-LVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNT 373
+++P + E +++ + I+ E + A + + + HG +NT
Sbjct: 357 GQKIPRRMTVEELSFAAHVDYIQNSEFIDA--VNPDHIILVHGEQTNT 402
>B3S010_TRIAD (tr|B3S010) Putative uncharacterized protein OS=Trichoplax
adhaerens GN=TRIADDRAFT_57642 PE=4 SV=1
Length = 596
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL PIYF + ++ Y K F+ W + I ++F Q N+F K+I ++
Sbjct: 255 YWDRMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRRTFVQ--HNMFEFKHIKPF-DRA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN P VV A+ L G S IF +W D KN+V+
Sbjct: 312 LIDN--PNPMVVFATPGMLHGGLSLQIFKKWAPDDKNMVIL 350
>A7RL47_NEMVE (tr|A7RL47) Predicted protein OS=Nematostella vectensis
GN=v1g179209 PE=4 SV=1
Length = 527
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL PIYF + ++ Y K F+ W + I +F Q N+F ++I +++
Sbjct: 226 YWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIKNTFVQ--RNMFEFEHIKPF-DRS 282
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN GP VV A+ L AG S IF +W S+ N+V+
Sbjct: 283 YIDN--PGPMVVFATPGMLHAGLSLQIFKKWASNENNMVVI 321
>Q96HV7_HUMAN (tr|Q96HV7) CPSF3L protein OS=Homo sapiens GN=CPSF3L PE=2 SV=1
Length = 389
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
+W NL PIYF + ++ Y K F+ W + I K+F Q N+F K+I ++
Sbjct: 135 FWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQ--RNMFEFKHIKAF-DRA 191
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
DN GP VV A+ L AG S IF +W + KN+V+
Sbjct: 192 FADN--PGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 230
>B4DM87_HUMAN (tr|B4DM87) cDNA FLJ55759, highly similar to Homo sapiens cleavage
and polyadenylation specific factor 3-like, transcript
variant 1, mRNA OS=Homo sapiens PE=2 SV=1
Length = 571
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
+W NL PIYF + ++ Y K F+ W + I K+F Q N+F K+I ++
Sbjct: 226 FWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQ--RNMFEFKHIKAF-DRA 282
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
DN GP VV A+ L AG S IF +W + KN+V+
Sbjct: 283 FADN--PGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 321
>B3MTI3_DROAN (tr|B3MTI3) GF23370 OS=Drosophila ananassae GN=GF23370 PE=4 SV=1
Length = 597
Score = 60.1 bits (144), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPIYF ++ Y K F+ W + I K+F N+F K+I +K
Sbjct: 255 YWDRMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHRNMFDFKHIKPF-DKA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+DN G VV A+ L AG S IF +W + N+V+
Sbjct: 312 YIDN--PGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350
>Q750X1_ASHGO (tr|Q750X1) AGL182Cp OS=Ashbya gossypii GN=AGL182C PE=4 SV=1
Length = 803
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 86/361 (23%), Positives = 148/361 (40%), Gaps = 63/361 (17%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENI--FLLKYITLLVNKTELDNVPD 266
P+ LSY T+ Y +S LEW+S + K++E +R+N F L +VN +L N P
Sbjct: 277 PVLLLSYSRGRTLTYARSMLEWLSSQLVKTWE-SRDNKSPFDLGNRLKIVNVNDLANYP- 334
Query: 267 GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQF-----ATLAR---------- 311
G K+ S +D + + K +++ TEK + A LA+
Sbjct: 335 GTKICFISQVE---TLINDALSKVCTKEKAMLVLTEKPTYYSHTIAILAKAYAKWERALN 391
Query: 312 -----MLQADPPP--KAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELK 364
++ +P +++ + SK PL G +L ++E +KE A S ++ +
Sbjct: 392 SNNLNAVEGNPIAYSESLSLQFSKTKPLTGSDLEEFKERIEARRKERAELLSSFQSNDNP 451
Query: 365 ASHGADSNTSDPMVIDTGNNLIAPEVAGPRGGGYRDIFVDGFVPPST-SVAPMFPCYENT 423
A GA + T+ D +++ P AG +I D + P+ MFP
Sbjct: 452 A--GASAFTAIEDDDDEEEDVLRPHGAGALSTKV-EIPTDLIIQPNALPKHKMFPFQPGK 508
Query: 424 SEWDDFGEVINPDDYV------------IKDEDMDQTAMHVGGDIN-----GKL------ 460
DD+GE+++ + ++ +ED + H DI GK
Sbjct: 509 VAHDDYGELVDFERFLPQSAPSSAKRGATNEEDEESYDPHDFRDIRTNDSGGKRRRREQD 568
Query: 461 -------DEGAASLILDTKPSKVISDERTVQVRCSLVYMDFEGRSDGRSIKNILSHVAPL 513
+ + L TKP S+ + V +RC++ ++D G D RS+ I + P
Sbjct: 569 ALQRQMNQDNLSYLDTLTKPQHRTSNTQKVVIRCTMAFVDLAGLVDERSMSIIWPALKPR 628
Query: 514 K 514
K
Sbjct: 629 K 629
>Q4TC63_TETNG (tr|Q4TC63) Chromosome undetermined SCAF7053, whole genome shotgun
sequence. (Fragment) OS=Tetraodon nigroviridis
GN=GSTENG00003449001 PE=4 SV=1
Length = 810
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 64/120 (53%), Gaps = 6/120 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K+ N F+ K+I+ L +
Sbjct: 312 YWQNHPELHDIPIYYASSLARKCMAVYQTYINAMNDKIRKAININ--NPFVFKHISNLKS 369
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++P
Sbjct: 370 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP 427
>A7S9J6_NEMVE (tr|A7S9J6) Predicted protein OS=Nematostella vectensis
GN=v1g243769 PE=4 SV=1
Length = 688
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 73/134 (54%), Gaps = 8/134 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K + N F+ K+I+ L +
Sbjct: 265 YWQNHPELHDIPIYYASQLAKKCMSVFQTYVNAMNDKIKKQIAIS--NPFVFKHISNLKS 322
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
+ D++ GP VV+AS +++G S ++F +W +D +N V+ TLA+ L ++
Sbjct: 323 IDQFDDI--GPSVVMASPGMMQSGLSRELFEQWCTDRRNGVIIAGYCVEGTLAKNLMSE- 379
Query: 318 PPKAVKVTVSKRVP 331
P+ V+ +++P
Sbjct: 380 -PEEVQTMSGQKIP 392
>Q5CXL4_CRYPV (tr|Q5CXL4) Inactive CPSFs Cft2p metallobeta-lactamase
OS=Cryptosporidium parvum Iowa II GN=cgd6_580 PE=4 SV=1
Length = 1196
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 3/116 (2%)
Query: 201 WAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFL-LKYITLLVNKT 259
W + ++YPI+ S + S + VK+ +EWMS I F +R N F LK I L N
Sbjct: 308 WNKGKILYPIFVTSPLIKSFLLSVKTLIEWMSLEIRSEFCDSRFNPFHDLKNIILETNLK 367
Query: 260 EL--DNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARML 313
+ +N+ PKV+ A S++ G+S ++F E ++ N ++F + + T A +
Sbjct: 368 NIRSENLSKVPKVIFAFPESMDYGYSRELFTELATNENNTIMFVREPKENTFAHYI 423
>Q581U7_9TRYP (tr|Q581U7) Cleavage and polyadenylation specificity factor
subunit, putative OS=Trypanosoma brucei GN=Tb927.4.1340
PE=4 SV=1
Length = 770
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 64/120 (53%), Gaps = 5/120 (4%)
Query: 200 YW-AEENLVY-PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW A + L + PIY+ S ++ + ++F+ M+D + K E R N F+ KYI L++
Sbjct: 281 YWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKKQHENHR-NPFVFKYIQSLLD 339
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
++ GP VVLAS L++G S ++F W D +N ++ T+A+ + + P
Sbjct: 340 TRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDKRNGIIVAGYCVDGTIAKDILSKP 397
>A2EDT7_TRIVA (tr|A2EDT7) Putative uncharacterized protein OS=Trichomonas
vaginalis G3 GN=TVAG_363680 PE=4 SV=1
Length = 700
Score = 56.2 bits (134), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 71/309 (22%), Positives = 136/309 (44%), Gaps = 42/309 (13%)
Query: 201 WAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRE----NIFLLKYITLLV 256
W + N + IYFLS+ S + + +S ++S I + N L Y + +
Sbjct: 249 WKKVNWKWKIYFLSHSSPAVLKNAQSLSNYLSLDIQEKINSGEYPFEFNDPDLSYFSCVT 308
Query: 257 NKTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD 316
+ ++D G VV++S +LE GFS +F++ ++ NL++FT++ +LA L+ +
Sbjct: 309 SIKDID-FSQGC-VVISSTDTLERGFSRKLFLD-KANSDNLIIFTQREPPYSLAEALRTN 365
Query: 317 PPPKAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTSDP 376
+ + + R PL G+EL+ + E+Q+ +++ KA E++ G S+ SD
Sbjct: 366 NAHRTFRFIIKHREPLTGEELVKFMEKQSALQE---------KANEIE---GDISDESDE 413
Query: 377 MVIDTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCY--ENTSEWDDFGEVIN 434
+ + N + ++A S+ F + + TS+ D+G I
Sbjct: 414 VSQENIEN--SSQIA-------------------QSLKKHFFQFKRKETSDLSDYGANIV 452
Query: 435 PDDYVIKDEDMDQTAMHVGGDINGKLDEGAASLILDTKPSKVISDERTVQVRCSLVYMDF 494
++Y+ M + M I+ L + L KPSK + + + V+ +
Sbjct: 453 VENYLKGANPMAPSKMDTSKMIDSSLTQQNFIQELVYKPSKFMITQYDYNFVGTAVFWNL 512
Query: 495 EGRSDGRSI 503
E SD +I
Sbjct: 513 ERTSDYSTI 521
>B6QRB9_PENMA (tr|B6QRB9) Cleavage and polyadenylylation specificity factor,
putative OS=Penicillium marneffei ATCC 18224
GN=PMAA_045880 PE=4 SV=1
Length = 1015
Score = 56.2 bits (134), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 78/182 (42%), Gaps = 43/182 (23%)
Query: 210 IYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQ---------------------------- 241
+Y + ST+ +S LEWM + I + FE
Sbjct: 335 LYLAGRKAHSTMRLARSMLEWMDEGIVREFEAVEGGDAAAVRGHKTTDSQNRNAGVTRDK 394
Query: 242 --TRENIFLLKYITLLVNKTELDNV-PDG-PKVVLASMASLEAGFSHDIFVEWGSDVKNL 297
T+ F LK++ ++ K +L+ V DG PKV++AS SL+ G+S + F +NL
Sbjct: 395 QGTKLGPFTLKHLKIVEQKRKLEKVLADGIPKVIIASDTSLDWGYSKETFRTLAQGSQNL 454
Query: 298 VLFTEKGQFATLARMLQADPPPKAVKVTVSKRVPLVGDELIAYEEEQNRIKKEEALKASL 357
+L TE TL Q D P + K+T+ + + YEE ++ + E A L
Sbjct: 455 ILLTE-----TLPIRYQTDDPEQPDKMTLGRMI------WRWYEERRDGVAMETASNGEL 503
Query: 358 IK 359
++
Sbjct: 504 LE 505
>A7E3Q0_BOVIN (tr|A7E3Q0) Related to CPSF subunits 68 kDa OS=Bos taurus GN=CPSF3L
PE=2 SV=1
Length = 599
Score = 55.8 bits (133), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
+W +L PIYF + ++ Y K F+ W + I K+F Q N+F K+I ++
Sbjct: 255 FWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQ--RNMFEFKHIKAF-DRA 311
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
D+ GP VV A+ L AG S IF +W + KN+V+
Sbjct: 312 FADS--PGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350
>A7T4V8_NEMVE (tr|A7T4V8) Predicted protein OS=Nematostella vectensis
GN=v1g222359 PE=4 SV=1
Length = 388
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 67/124 (54%), Gaps = 9/124 (7%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K + N F+ K+I+ L +
Sbjct: 191 YWQNHPELHDIPIYYASQLAKKCMSVFQTYVNAMNDKIKKQIAIS--NPFVFKHISNLKS 248
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVL---FTEKGQFATLARMLQ 314
+ D++ GP VV+AS +++G S ++F +W +D +N V+ + +G A ++
Sbjct: 249 IDQFDDI--GPSVVMASPGMMQSGLSRELFEQWCTDRRNGVIIAGYCVEGTLAKEVSLVV 306
Query: 315 ADPP 318
+PP
Sbjct: 307 HNPP 310
>Q4RM75_TETNG (tr|Q4RM75) Chromosome 10 SCAF15019, whole genome shotgun sequence.
(Fragment) OS=Tetraodon nigroviridis
GN=GSTENG00032180001 PE=4 SV=1
Length = 730
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 64/120 (53%), Gaps = 6/120 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K+ N F+ K+I+ L +
Sbjct: 253 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAININ--NPFVFKHISNLKS 310
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++P
Sbjct: 311 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP 368
>B5X1A0_SALSA (tr|B5X1A0) Cleavage and polyadenylation specificity factor subunit
3 OS=Salmo salar GN=CPSF3 PE=2 SV=1
Length = 690
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 64/120 (53%), Gaps = 6/120 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K+ N F+ K+I+ L +
Sbjct: 263 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAINVN--NPFVFKHISNLKS 320
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++P
Sbjct: 321 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYSVEGTLAKHIMSEP 378
>A7F0N0_SCLS1 (tr|A7F0N0) Putative uncharacterized protein OS=Sclerotinia
sclerotiorum (strain ATCC 18683 / 1980 / Ss-1)
GN=SS1G_11148 PE=4 SV=1
Length = 936
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 81/338 (23%), Positives = 125/338 (36%), Gaps = 101/338 (29%)
Query: 197 RTGYWAEENLVYPI--YFLSYVSSSTIDYVKSFLEWMSDTIAKSFE-------------- 240
RTG EE+ Y S T+ Y +S EWM + I + FE
Sbjct: 275 RTGNQEEESAFRSAKPYLAVSTSEMTMRYTRSMFEWMDEAIIREFEAQPGHEEQQTGQQR 334
Query: 241 ------QTRENI--FLLKYITLLVNKTELDNVPDGP--------KVVLASMASLEAGFSH 284
++++N F K++ LL K ++D + + KV+LAS S+E GFS
Sbjct: 335 RHAYSDESKQNAGPFEFKHLRLLGRKGQIDRMLNETDNLGRSVGKVILASDTSIEWGFSK 394
Query: 285 DIFVEWGSDVKNLVLFTEK----GQFATLARML-----------------------QADP 317
++ + D KNL++ TEK L R L Q
Sbjct: 395 EVLRKIADDDKNLLILTEKLNRIDGVTGLGRTLWSWWEERRNGVATEPSSNGGNLEQVYG 454
Query: 318 PPKAVKVTVSKRVPLVGDELIAYE-------EEQNRIK--KEEALKASLI---------- 358
+ +++ KR+PL G++L Y+ + QN ++ AL+AS
Sbjct: 455 GGRDLEIREPKRIPLEGNDLTVYQQWLATQRQLQNTLQPGGATALEASADIVDDASSDSS 514
Query: 359 ------------KAEELKASHG-ADSNTSDPMVIDTGNNLIAPEVAGPRGGGYRDIFVDG 405
KA + A+ G A+ D G N++ R G D V G
Sbjct: 515 SDSDDSETEQQGKALNISATMGQANRKKIGLSDEDLGINILL------RKKGVHDFDVRG 568
Query: 406 FVPPSTSVAPMFPCYENTSEWDDFGEVINPDDYVIKDE 443
MFP DDFGE+I P +++ +E
Sbjct: 569 ----KKGRDKMFPMAIRRKRNDDFGELIRPGEFLRAEE 602
>Q6DIH6_XENTR (tr|Q6DIH6) Cleavage and polyadenylation specific factor 3, 73kDa
OS=Xenopus tropicalis GN=cpsf3 PE=2 SV=1
Length = 692
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 70/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 263 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHISNLKS 320
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++
Sbjct: 321 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 377
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 378 -PEEIATMSGQKLPL 391
>Q8SUC3_ENCCU (tr|Q8SUC3) Putative uncharacterized protein ECU10_1350
OS=Encephalitozoon cuniculi GN=ECU10_1350 PE=4 SV=1
Length = 496
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 199 GYWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKS-FEQTRENIFLLKYITLLVN 257
GYW L P+YF S ++ + K F+ + ++T+ K FE+ N+F K+I
Sbjct: 249 GYWERTGLKTPVYFSSGLTEKANEIYKKFISYTNETVRKKIFER---NMFEYKHIKPF-Q 304
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+ +++ GP V+ AS L +G S IF EW D KNLV+
Sbjct: 305 RHYMES--KGPMVLFASPGMLHSGMSLKIFKEWCEDEKNLVII 345
>Q28FT5_XENTR (tr|Q28FT5) Cleavage and polyadenylation specific factor 3
OS=Xenopus tropicalis GN=cpsf3 PE=2 SV=1
Length = 692
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 70/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 263 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHISNLKS 320
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++
Sbjct: 321 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 377
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 378 -PEEIATMSGQKLPL 391
>Q6DRG6_DANRE (tr|Q6DRG6) Cleavage and polyadenylation specificity factor 3
OS=Danio rerio GN=cpsf3 PE=2 SV=1
Length = 690
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 71/135 (52%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K+ N F+ K+I+ L +
Sbjct: 263 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAI--NINNPFVFKHISNLKS 320
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++
Sbjct: 321 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 377
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 378 -PEEITTMSGQKLPL 391
>Q5U3T4_DANRE (tr|Q5U3T4) Cleavage and polyadenylation specific factor 3 OS=Danio
rerio GN=cpsf3 PE=2 SV=1
Length = 690
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 71/135 (52%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K+ N F+ K+I+ L +
Sbjct: 263 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAI--NINNPFVFKHISNLKS 320
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++
Sbjct: 321 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 377
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 378 -PEEITTMSGQKLPL 391
>A4S2M5_OSTLU (tr|A4S2M5) Predicted protein OS=Ostreococcus lucimarinus (strain
CCE9901) GN=OSTLU_33504 PE=4 SV=1
Length = 767
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 65/119 (54%), Gaps = 6/119 (5%)
Query: 200 YWAEENLV--YPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
+WA+ + PIY S ++ + ++++ ++ + +FE+ N F+ ++ +
Sbjct: 257 FWAQNPDLQRVPIYQASALARKAMTIYQTYINVLNSDMKAAFEEA--NPFVFNHVKHVSK 314
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD 316
+ELD+V GP VVLA+ + L++G S ++F W D KN V+ + TLAR + +D
Sbjct: 315 SSELDDV--GPCVVLATPSMLQSGLSRELFESWCEDPKNGVIIADFAVQGTLAREILSD 371
>Q53RS2_HUMAN (tr|Q53RS2) Putative uncharacterized protein CPSF3 (Cleavage and
polyadenylation specific factor 3, 73kDa, isoform CRA_a)
OS=Homo sapiens GN=CPSF3 PE=2 SV=1
Length = 684
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 70/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 313
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++
Sbjct: 314 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 370
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 371 -PEEITTMSGQKLPL 384
>Q53F02_HUMAN (tr|Q53F02) Cleavage and polyadenylation specific factor 3, 73kDa
variant (Fragment) OS=Homo sapiens PE=2 SV=1
Length = 684
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 70/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 313
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++
Sbjct: 314 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 370
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 371 -PEEITTMSGQKLPL 384
>Q2GQR7_CHAGB (tr|Q2GQR7) Putative uncharacterized protein OS=Chaetomium globosum
GN=CHGG_09687 PE=4 SV=1
Length = 956
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 29/137 (21%)
Query: 195 HCRTGYWAEENLVYP---IYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENI----- 246
H A++N V+ Y +ST+ +S LEWM D+I + FE
Sbjct: 295 HAWRAEIAKDNEVFKSTKAYLAGRTINSTMRNARSMLEWMDDSIVREFEAVAGGQRGNGG 354
Query: 247 -----------FLLKYITLLVNKTELDNV---------PDGPKVVLASMASLEAGFSHDI 286
F KY+ LL K +++ V P G +V++A+ +SLE GFS ++
Sbjct: 355 SGGGKGKDAGPFDFKYLRLLERKAQVERVLQQAADASEPKG-RVIVATDSSLEWGFSKEV 413
Query: 287 FVEWGSDVKNLVLFTEK 303
D +NLV+ TEK
Sbjct: 414 MRAIAGDPRNLVILTEK 430
>A8QG81_BRUMA (tr|A8QG81) RNA-metabolising metallo-beta-lactamase family protein
OS=Brugia malayi GN=Bm1_54785 PE=4 SV=1
Length = 589
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL YPI+F ++ Y + F+ W ++ I ++F N+F K+I ++
Sbjct: 256 YWERMNLKYPIFFSQGLAEKANQYYRLFISWTNEKIKRTF--VERNMFDFKHIRPF-EQS 312
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
+++ GP V+ ++ L G S +F +W SD KNL++
Sbjct: 313 YIES--PGPMVLFSTPGMLHGGQSLRVFTKWCSDEKNLIIM 351
>Q4R7A0_MACFA (tr|Q4R7A0) Testis cDNA, clone: QtsA-15820, similar to human
cleavage and polyadenylation specific factor 3,
73kDa(CPSF3), OS=Macaca fascicularis PE=2 SV=1
Length = 684
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 70/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 313
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++
Sbjct: 314 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 370
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 371 -PEEITTMSGQKLPL 384
>Q5R428_PONAB (tr|Q5R428) Putative uncharacterized protein DKFZp459J042 OS=Pongo
abelii GN=DKFZp459J042 PE=2 SV=1
Length = 647
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 70/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 219 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 276
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++
Sbjct: 277 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 333
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 334 -PEEITTMSGQKLPL 347
>Q5XGZ1_XENLA (tr|Q5XGZ1) LOC495111 protein OS=Xenopus laevis GN=cpsf3 PE=2 SV=1
Length = 692
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 70/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 263 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 320
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++
Sbjct: 321 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 377
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 378 -PEEIVTMSGQKLPL 391
>Q4R6V4_MACFA (tr|Q4R6V4) Testis cDNA, clone: QtsA-17044, similar to human
cleavage and polyadenylation specific factor 3,
73kDa(CPSF3), OS=Macaca fascicularis PE=2 SV=1
Length = 684
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 70/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 313
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F W +D +N V+ TLA+ + ++
Sbjct: 314 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 370
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 371 -PEEITTMSGQKLPL 384
>Q497G2_MOUSE (tr|Q497G2) Cpsf3 protein OS=Mus musculus GN=Cpsf3 PE=2 SV=1
Length = 512
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 69/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHISNLKS 313
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS ++ G S ++F W +D +N V+ TLA+ + ++
Sbjct: 314 MDHFDDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 370
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 371 -PEEITTMSGQKLPL 384
>A6SPR2_BOTFB (tr|A6SPR2) Putative uncharacterized protein OS=Botryotinia
fuckeliana (strain B05.10) GN=BC1G_14624 PE=4 SV=1
Length = 901
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 96/436 (22%), Positives = 154/436 (35%), Gaps = 142/436 (32%)
Query: 211 YFLSYVSSSTIDYVKSFLEWMSDTIAKSFE-----------QTRENI---------FLLK 250
Y S T+ Y +S EWM + I + FE Q R N F K
Sbjct: 291 YLAVSTSEMTMRYTRSMFEWMDEAIIREFEAQPGHEEQRTGQQRRNAEEAKQHIGPFEFK 350
Query: 251 YITLLVNKTELDNVPDGP--------KVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTE 302
++ LL K ++D + + KV+LAS S+E GFS ++ + D KNL++ TE
Sbjct: 351 HLRLLGRKGQIDRMLNETDNLGRSVGKVILASDTSIEWGFSKEVLCKIADDDKNLLILTE 410
Query: 303 K----GQFATLARML-----------------------QADPPPKAVKVTVSKRVPLVGD 335
+ L R L Q + +++ KR+PL G+
Sbjct: 411 RLNPISGAPGLGRTLWSWWEERRDGVISEPSSNGGVLEQVYGGGRDLEIKEPKRIPLEGN 470
Query: 336 ELIAYEE---EQNRIKKE------EALKASLI----------------------KAEELK 364
+L Y++ Q +++ AL+AS KA +
Sbjct: 471 DLTVYQQWLATQRQLQTTLQPGGATALEASADIVDDASSDSSSDSDDSENEQQGKALNIS 530
Query: 365 ASHG-ADSNTSDPMVIDTGNNLIAPEVAGPRGGGYRDIFVDGFVPPSTSVAPMFPCYENT 423
A+ G A+ D G N++ R G D V G MFP
Sbjct: 531 ATMGQANRKKIGLSDEDLGVNILL------RKKGVHDFDVRG----KKGRDKMFPMAIRR 580
Query: 424 SEWDDFGEVINPDDYVI---KDEDMDQTAMHVG----GDINGKL---------------D 461
DDFGE+I P +++ +DE Q G D GK +
Sbjct: 581 KRNDDFGELIRPGEFLRAEERDEVDGQEPQRPGKYDTKDTLGKKRKWDDVAASGKRRASN 640
Query: 462 EGAASLILDTK-----------------------PSKVISDERTVQVRCSLVYMDFEGRS 498
EG I + + PS++ +T+++ + ++DF G
Sbjct: 641 EGKRQQISNNEDGSVADSPEEDDLMDIVEEEIPGPSRLEISIKTLKINLRIAFVDFSGLH 700
Query: 499 DGRSIKNILSHVAPLK 514
D RS++ ++ + P K
Sbjct: 701 DKRSLQMLIPLIQPRK 716
>A8Q4U4_MALGO (tr|A8Q4U4) Putative uncharacterized protein OS=Malassezia globosa
(strain ATCC 96807 / CBS 7966) GN=MGL_2640 PE=4 SV=1
Length = 741
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 63/121 (52%), Gaps = 7/121 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+ I F + R+N F+ K+++ N
Sbjct: 177 YWEAHPELHSVPIYYASSLARKCMSIYQTYIHTMNQHIRARFHR-RDNPFVFKHVS---N 232
Query: 258 KTELDNVPD-GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD 316
LD D GP V++AS +++G S ++ W D +N V+ + T+AR + +D
Sbjct: 233 LRSLDKFDDKGPCVMMASPGFMQSGISRELLERWAPDKRNGVIVSGYSVEGTMARDILSD 292
Query: 317 P 317
P
Sbjct: 293 P 293
>B2AL57_PODAN (tr|B2AL57) Predicted CDS Pa_5_8800 OS=Podospora anserina PE=4 SV=1
Length = 967
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 56/122 (45%), Gaps = 28/122 (22%)
Query: 210 IYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQT---RENI-----------------FLL 249
+Y + ST+ + +S EWM D I + FE +E+ F
Sbjct: 314 LYLAGHSIGSTLKHARSLFEWMDDKIVREFEAAAGGKESHSRGQRGGHHHDHKVAGPFDF 373
Query: 250 KYITLL--------VNKTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFT 301
K++ LL V K L+++ +V+LA+ +SLE GFS ++ D +NLVL T
Sbjct: 374 KHLRLLERKGQVSWVLKQALEDLEPKGRVILATDSSLEWGFSKEVLKSIAGDARNLVLLT 433
Query: 302 EK 303
EK
Sbjct: 434 EK 435
>Q011P7_OSTTA (tr|Q011P7) MRNA cleavage and polyadenylation factor II complex,
BRR5 (CPSF subunit) (ISS) OS=Ostreococcus tauri
GN=Ot09g02240 PE=4 SV=1
Length = 572
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 65/119 (54%), Gaps = 6/119 (5%)
Query: 200 YWAEENLV--YPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
+WA+ + PIY S ++ + ++++ ++ + +FE+ N F+ ++ +
Sbjct: 245 FWAQNPDLQRVPIYQASTLARKAMTIYQTYINVLNADMKAAFEEA--NPFVFNHVKHISK 302
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD 316
+ELD+V GP VVLA+ + L++G S ++F W + KN V+ + TLAR + +D
Sbjct: 303 ASELDDV--GPCVVLATPSMLQSGLSRELFESWCEEPKNGVIIADFAVQGTLAREILSD 359
>A8JGK6_CHLRE (tr|A8JGK6) Subunit of mRNA cleavage and polyadenylation
specificity factor (Fragment) OS=Chlamydomonas
reinhardtii GN=CHLREDRAFT_122831 PE=4 SV=1
Length = 690
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 61/124 (49%), Gaps = 4/124 (3%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGP 268
PIY S + S + ++++E ++D I + F + N F +++ L N + GP
Sbjct: 255 PIYQASSMMSKALGVYQTYVESLNDDIKRVFHE--RNPFKFRHVQTLKNPAHFISDYSGP 312
Query: 269 KVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVSK 328
V++A+ + L++G S D F W D +N + + TLA+ + P ++ +
Sbjct: 313 CVIMATPSGLQSGASRDFFEAWCEDSRNTCIICDFAVQGTLAKEILGG--PSSITTREGR 370
Query: 329 RVPL 332
RVPL
Sbjct: 371 RVPL 374
>Q499P4_RAT (tr|Q499P4) Cleavage and polyadenylation specificity factor 3
OS=Rattus norvegicus GN=Cpsf3 PE=2 SV=1
Length = 685
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 69/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 313
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS ++ G S ++F W +D +N V+ TLA+ + ++
Sbjct: 314 MDHFDDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 370
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 371 -PEEITTMSGQKLPL 384
>Q8CIM0_MOUSE (tr|Q8CIM0) Cleavage and polyadenylation specificity factor 3
OS=Mus musculus GN=Cpsf3 PE=2 SV=1
Length = 684
Score = 53.1 bits (126), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 69/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 313
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS ++ G S ++F W +D +N V+ TLA+ + ++
Sbjct: 314 MDHFDDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 370
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 371 -PEEITTMSGQKLPL 384
>A9RRH1_PHYPA (tr|A9RRH1) Predicted protein OS=Physcomitrella patens subsp.
patens GN=PHYPADRAFT_177452 PE=4 SV=1
Length = 682
Score = 53.1 bits (126), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 67/135 (49%), Gaps = 8/135 (5%)
Query: 200 YW-AEENLVY-PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW A L + PIY+ S ++ + ++++ M+D I K FE + N F K+I L N
Sbjct: 262 YWEAHPELQHIPIYYASPLAKKCMAVYQTYINAMNDRIQKQFEVS--NPFDFKHIQPLKN 319
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS L++G S +F W D KN + TLA+ + +
Sbjct: 320 IDGFDDI--GPAVVMASPGGLQSGLSRQLFDIWCQDKKNSCIIPGYVVEGTLAKAIMNE- 376
Query: 318 PPKAVKVTVSKRVPL 332
PK V + VPL
Sbjct: 377 -PKEVTLLSGLVVPL 390
>Q3U057_MOUSE (tr|Q3U057) Putative uncharacterized protein OS=Mus musculus
GN=Cpsf3 PE=2 SV=1
Length = 684
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 69/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 313
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS ++ G S ++F W +D +N V+ TLA+ + ++
Sbjct: 314 MDHFDDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 370
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 371 -PEEITTMSGQKLPL 384
>B5RTE7_DEBHA (tr|B5RTE7) DEHA2D07304p OS=Debaryomyces hansenii GN=DEHA2D07304g
PE=4 SV=1
Length = 959
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 92/425 (21%), Positives = 145/425 (34%), Gaps = 123/425 (28%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENI----------FLLKYITLLVNK 258
P+YFLSY + + Y + LEWMS + K +E+ F + LL +
Sbjct: 267 PVYFLSYSGTKVLSYASNLLEWMSSQLVKEWEEASSVNNNSSNKNNFPFDPSKVDLLSDP 326
Query: 259 TELDNVPDGPKVVLASMASLEAG-FSHDIFVEWGSDVKNLVLFTEKGQFA---------- 307
+EL + GPK+V S L+ G S + D K ++ TEK F
Sbjct: 327 SELVQL-SGPKIVFCSGIDLKNGDMSSEALQYLCQDEKTTIVLTEKTHFGLDNTINSQLY 385
Query: 308 ----TLARMLQADPPPKAVKVTVSKRV---------PLVGDELIAYEEEQNRIKKEEAL- 353
L + Q V V + K + PL+G EL ++E+ N +K++ L
Sbjct: 386 HDWYNLTKQKQGGTVEDGVAVPLEKVISLENWNREEPLIGAELTDFQEKINLQRKQKLLA 445
Query: 354 ------KASLIKAEELKASH----------------GADSNTSDPMVIDTGNNLIAPEVA 391
+L+ A+ + A T P D P V
Sbjct: 446 KVRDRKNQNLLNADTINGDDSSSDEEDDVVSSDDEAAALKYTEAPANADASTTTNVPAVV 505
Query: 392 GPRGGGYRDIFVDGFVPPSTSV---------------APMFP--CYENTSEWDDFGEVIN 434
+ F+ V + MFP + ++DD+GEVI+
Sbjct: 506 KVDELSAHEAFITDHVKQTLEANRPLDLKITHKLKPRQAMFPYIVGSHKQKFDDYGEVID 565
Query: 435 PDDYVIKDED-------------MDQTAMHVGGDINGK---------------------- 459
D+ K ED +Q G+++ K
Sbjct: 566 IKDFQ-KQEDTSSNKLIMESKRKFEQNEKRKWGNVDNKGKGRGKNSDKDNNNQNKITPQE 624
Query: 460 -LDEGAASLILDT--KPSKVI---------SDERTVQVRCSLVYMDFEGRSDGRSIKNIL 507
L+ LDT P K I S + +++RC L ++D G D RS+ I+
Sbjct: 625 LLNNQLLQKNLDTLFSPRKRIPLNAASSFSSKPQELRMRCGLSFVDLSGLVDMRSLSLIV 684
Query: 508 SHVAP 512
S + P
Sbjct: 685 SSLKP 689
>Q3UDS1_MOUSE (tr|Q3UDS1) Putative uncharacterized protein OS=Mus musculus
GN=Cpsf3 PE=2 SV=1
Length = 684
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 69/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 313
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS ++ G S ++F W +D +N V+ TLA+ + ++
Sbjct: 314 MDHFDDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 370
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 371 -PEEITTMSGQKLPL 384
>Q3TC91_MOUSE (tr|Q3TC91) Putative uncharacterized protein OS=Mus musculus
GN=Cpsf3 PE=2 SV=1
Length = 684
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 69/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 313
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS ++ G S ++F W +D +N V+ TLA+ + ++
Sbjct: 314 MDHFDDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 370
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 371 -PEEITTMSGQKLPL 384
>A4HB61_LEIBR (tr|A4HB61) Cleavage and polyadenylation specificity
factor,putative OS=Leishmania braziliensis
GN=LbrM20_V2.3010 PE=4 SV=1
Length = 756
Score = 52.8 bits (125), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 63/124 (50%), Gaps = 5/124 (4%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGP 268
PIY+ S ++ + ++F+ M+D + K N F+ KYI L++ ++ +GP
Sbjct: 279 PIYYASSLAQRCMKLYQTFVSAMNDRV-KQQHANHHNPFVFKYIHSLIDTKSFED--NGP 335
Query: 269 KVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVSK 328
VVLAS L++G S ++F W D +N ++ T+A+ + A PK V K
Sbjct: 336 CVVLASPGMLQSGISLELFERWCGDRRNGIIMAGYCVDGTIAKDVLAK--PKEVAKPDGK 393
Query: 329 RVPL 332
+PL
Sbjct: 394 VLPL 397
>A8WQT1_CAEBR (tr|A8WQT1) CBR-CPSF-3 protein OS=Caenorhabditis briggsae
GN=Cbr-cpsf-3 PE=4 SV=2
Length = 727
Score = 52.4 bits (124), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 61/120 (50%), Gaps = 6/120 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ P+Y+ S ++ + ++F+ M+ I K +N F+ K+++ L
Sbjct: 257 YWEAHQELHDIPVYYASSLAKKCMSVYQTFVNGMNSRIQKQI--AIKNPFIFKHVSTLRG 314
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
+ ++ GP VVLA+ L++GFS ++F W SD KN + TLA+ + +P
Sbjct: 315 MDQFEDA--GPCVVLATPGMLQSGFSRELFENWCSDSKNGCIIAGYCVEGTLAKHILTEP 372
>A4IAA9_LEIIN (tr|A4IAA9) Cleavage and polyadenylation specificity factor,
putative OS=Leishmania infantum GN=LinJ34.2950 PE=4 SV=1
Length = 756
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 63/124 (50%), Gaps = 5/124 (4%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGP 268
PIY+ S ++ + ++F+ M+D + K N F+ KYI L++ ++ +GP
Sbjct: 279 PIYYASSLAQRCMKLYQTFVSAMNDRV-KQQHANHHNPFVFKYIHSLMDTKSFED--NGP 335
Query: 269 KVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVSK 328
VVLAS L++G S ++F W D +N ++ T+A+ + A PK V K
Sbjct: 336 CVVLASPGMLQSGISLELFERWCGDRRNGIIMAGYCVDGTIAKDVLAK--PKEVAKPDGK 393
Query: 329 RVPL 332
+PL
Sbjct: 394 VLPL 397
>Q95PY8_CAEEL (tr|Q95PY8) Protein Y67H2A.1, partially confirmed by transcript
evidence OS=Caenorhabditis elegans GN=cpsf-3 PE=4 SV=2
Length = 707
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 61/120 (50%), Gaps = 6/120 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ P+Y+ S ++ + ++F+ M+ I K +N F+ K+++ L
Sbjct: 257 YWESHQELHDIPVYYASSLAKKCMSVYQTFVNGMNSRIQKQI--AVKNPFIFKHVSTLRG 314
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
+ ++ GP VVLA+ L++GFS ++F W D KN + TLA+ + ++P
Sbjct: 315 MDQFEDA--GPCVVLATPGMLQSGFSRELFESWCPDTKNGCIIAGYCVEGTLAKHILSEP 372
>Q4Q2K1_LEIMA (tr|Q4Q2K1) Cleavage and polyadenylation specificity factor,
putative OS=Leishmania major GN=LmjF34.3430 PE=4 SV=1
Length = 756
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 57/109 (52%), Gaps = 3/109 (2%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGP 268
PIY+ S ++ + ++F+ M+D + K N F+ KYI L++ ++ +GP
Sbjct: 279 PIYYASSLAQRCMKLYQTFVSAMNDRV-KQQHANHHNPFVFKYIRSLMDTKSFED--NGP 335
Query: 269 KVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
VVLAS L++G S ++F W D +N ++ T+A+ + A P
Sbjct: 336 CVVLASPGMLQSGISLELFERWCGDRRNGIIMAGYCVDGTIAKDVLAKP 384
>Q05BZ5_HUMAN (tr|Q05BZ5) CPSF3 protein (Fragment) OS=Homo sapiens GN=CPSF3 PE=2
SV=1
Length = 554
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 61/116 (52%), Gaps = 6/116 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW ++ PIY+ S ++ + ++++ M+D I K N F+ K+I+ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQI--NINNPFVFKHISNLKS 313
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARML 313
D++ GP VV+AS +++G S ++F W +D +N V+ TLA++L
Sbjct: 314 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKIL 367
>A3BYK8_ORYSJ (tr|A3BYK8) Putative uncharacterized protein OS=Oryza sativa subsp.
japonica GN=OsJ_028130 PE=4 SV=1
Length = 572
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 5/121 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL PIYF + ++ Y K + W S I S+ T N F K++ ++
Sbjct: 270 YWERMNLKIPIYFSAGLTIQANMYYKMLIGWTSQKIKNSY--TVHNPFDFKHVCHF-ERS 326
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPP 319
++N GP V+ A+ + GFS ++F +W KNLV T+ L + P
Sbjct: 327 FINN--PGPCVLFATPGMISGGFSLEVFKKWAPSEKNLVTLPGYCVAGTIGHKLMSGKPT 384
Query: 320 K 320
+
Sbjct: 385 R 385
>A2Z109_ORYSI (tr|A2Z109) Putative uncharacterized protein OS=Oryza sativa subsp.
indica GN=OsI_030252 PE=4 SV=1
Length = 572
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 5/121 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL PIYF + ++ Y K + W S I S+ T N F K++ ++
Sbjct: 270 YWERMNLKIPIYFSAGLTIQANMYYKMLIGWTSQKIKNSY--TVHNPFDFKHVCHF-ERS 326
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPP 319
++N GP V+ A+ + GFS ++F +W KNLV T+ L + P
Sbjct: 327 FINN--PGPCVLFATPGMISGGFSLEVFKKWAPSEKNLVTLPGYCVAGTIGHKLMSGKPT 384
Query: 320 K 320
+
Sbjct: 385 R 385
>B6KMV5_TOXGO (tr|B6KMV5) Cleavage and polyadenylation specificity factor,
putative OS=Toxoplasma gondii ME49 GN=TGME49_085200 PE=4
SV=1
Length = 1100
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 58/123 (47%), Gaps = 7/123 (5%)
Query: 200 YWAEENLV--YPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW + PI FLS +SS +F++ + + +S EN F +++ V
Sbjct: 377 YWTAHPEIRHVPILFLSPLSSKCAVVFDAFVDMCGEAV-RSRALRGENPFAFRFVKN-VK 434
Query: 258 KTELDNV---PDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQ 314
E V DGP VV+A+ L++G S +IF W D KN V+ T TLA L+
Sbjct: 435 SVEAARVYIHHDGPAVVMAAPGMLQSGASREIFEAWAPDAKNGVILTGYSVKGTLADELK 494
Query: 315 ADP 317
+P
Sbjct: 495 REP 497
>Q6ERD4_ORYSJ (tr|Q6ERD4) Putative FEG protein (Os09g0397900 protein) OS=Oryza
sativa subsp. japonica GN=OJ1655_B12.22 PE=4 SV=1
Length = 559
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 5/121 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW NL PIYF + ++ Y K + W S I S+ T N F K++ ++
Sbjct: 257 YWERMNLKIPIYFSAGLTIQANMYYKMLIGWTSQKIKNSY--TVHNPFDFKHVCHF-ERS 313
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPP 319
++N GP V+ A+ + GFS ++F +W KNLV T+ L + P
Sbjct: 314 FINN--PGPCVLFATPGMISGGFSLEVFKKWAPSEKNLVTLPGYCVAGTIGHKLMSGKPT 371
Query: 320 K 320
+
Sbjct: 372 R 372
>A9SGX1_PHYPA (tr|A9SGX1) Predicted protein OS=Physcomitrella patens subsp.
patens GN=PHYPADRAFT_184823 PE=4 SV=1
Length = 682
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 54/99 (54%), Gaps = 6/99 (6%)
Query: 200 YW-AEENLVY-PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW A L + PIY+ S ++ + ++++ M++ I K FE + N F K+I L N
Sbjct: 262 YWEAHPELQHIPIYYASPLAKKCMAVYQTYINAMNERIQKQFEVS--NPFDFKHIQPLKN 319
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKN 296
E D++ GP VV+AS L++G S +F W D KN
Sbjct: 320 IDEFDDI--GPAVVMASPGGLQSGLSRQLFDIWCQDKKN 356
>Q6BCB3_TRYCR (tr|Q6BCB3) Cleavage polyadenylation specificity factor CPSF73
OS=Trypanosoma cruzi GN=CPSF73 PE=4 SV=1
Length = 762
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 67/135 (49%), Gaps = 7/135 (5%)
Query: 200 YW-AEENLVY-PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW A + L + PIY+ S ++ + ++F+ M+D + + R N F+ KYI L+
Sbjct: 280 YWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKQQHANHR-NPFVFKYIHSLME 338
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
++ GP VVLAS L++G S ++F W D +N ++ T+A+ +
Sbjct: 339 TRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDRRNGIIIAGYCVDGTIAKDILTK- 395
Query: 318 PPKAVKVTVSKRVPL 332
PK V K +PL
Sbjct: 396 -PKEVTKPDGKVLPL 409
>B4JEZ6_DROGR (tr|B4JEZ6) GH19212 OS=Drosophila grimshawi GN=GH19212 PE=4 SV=1
Length = 686
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/178 (21%), Positives = 89/178 (50%), Gaps = 11/178 (6%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW++ ++ PIY+ S ++ + ++++ M+D I + N F+ ++I+ L
Sbjct: 264 YWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRRQI--AVNNPFVFRHISNLKG 321
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP V++AS +++G S ++F W +D KN V+ TLA+ + ++
Sbjct: 322 IDHFDDI--GPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTILSE- 378
Query: 318 PPKAVKVTVSKRVPL-VGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTS 374
P+ + +++PL + + I++ + + E ++ L++ + HG + S
Sbjct: 379 -PEEIITLSGQKLPLNMSVDYISFSAHTDYQQTSEFIR--LLRPNHVVLVHGEQNEMS 433
>B3RKJ0_TRIAD (tr|B3RKJ0) Putative uncharacterized protein OS=Trichoplax
adhaerens GN=TRIADDRAFT_19764 PE=4 SV=1
Length = 636
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 64/120 (53%), Gaps = 6/120 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW+ ++ PIY+ S ++ + ++++ M+D I + N F+ K+I+ L
Sbjct: 267 YWSNHTELHDIPIYYASSLAKKCMAVYQTYVSAMNDKIRNQIAIS--NPFIFKHISNLKG 324
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D++ GP VV+AS +++G S ++F +W +D KN V+ TLA+ + ++P
Sbjct: 325 IDHFDDI--GPCVVMASPGMMQSGLSRELFEKWCTDSKNGVVIAGYCVEGTLAKEVMSEP 382
>Q9U3K2_CAEEL (tr|Q9U3K2) Protein F10B5.8, partially confirmed by transcript
evidence OS=Caenorhabditis elegans GN=F10B5.8 PE=2 SV=2
Length = 608
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW L PIYF ++ Y + F+ W ++ I K+F N+F K+I +
Sbjct: 260 YWERMALNVPIYFSQGLAERANQYYRLFISWTNENIKKTF--VERNMFEFKHIKPMEKGC 317
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
E + P GP+V+ ++ L G S +F +W SD N+++
Sbjct: 318 E--DQP-GPQVLFSTPGMLHGGQSLKVFKKWCSDPLNMIIM 355
>Q4DTW2_TRYCR (tr|Q4DTW2) Cleavage and polyadenylation specificity factor,
putative (Fragment) OS=Trypanosoma cruzi
GN=Tc00.1047053508693.10 PE=4 SV=1
Length = 625
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 67/135 (49%), Gaps = 7/135 (5%)
Query: 200 YW-AEENLVY-PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW A + L + PIY+ S ++ + ++F+ M+D + + R N F+ KYI L+
Sbjct: 143 YWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKQQHANHR-NPFVFKYIHSLME 201
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
++ GP VVLAS L++G S ++F W D +N ++ T+A+ +
Sbjct: 202 TRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDRRNGIIIAGYCVDGTIAKDILTK- 258
Query: 318 PPKAVKVTVSKRVPL 332
PK V K +PL
Sbjct: 259 -PKEVTKPDGKVLPL 272
>Q170J6_AEDAE (tr|Q170J6) Cleavage and polyadenylation specificity factor
OS=Aedes aegypti GN=AAEL007904 PE=4 SV=1
Length = 687
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 72/136 (52%), Gaps = 10/136 (7%)
Query: 200 YWAE--ENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW++ E +PIY+ S ++ + ++++ M+D I + N F+ ++I+ N
Sbjct: 263 YWSQNPELQEFPIYYASSLAKKCMAVYQTYINAMNDKIRRQI--AVNNPFVFRHIS---N 317
Query: 258 KTELDNVPD-GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD 316
+D+ D GP VV+AS +++G S ++F W +D KN V+ TLA+ + ++
Sbjct: 318 LKGIDHFEDIGPCVVMASPGMMQSGLSRELFETWCTDPKNGVIIAGYCVEGTLAKTILSE 377
Query: 317 PPPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 378 --PEEITSMSGQKLPL 391
>Q9C952_ARATH (tr|Q9C952) Putative cleavage and polyadenylation specificity
factor; 72745-70039 (Putative cleavage and
polyadenylation specificity factor) (Putative cleavage
and polyadenylation specificity factor 73 kDa subunit)
OS=Arabidopsis thaliana GN=At1g61010 PE=2 SV=1
Length = 693
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 63/120 (52%), Gaps = 6/120 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YWA ++ PIY+ S ++ + ++++ M+D I F + N F+ K+I+ L +
Sbjct: 266 YWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFANS--NPFVFKHISPLNS 323
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
+ ++V GP VV+A+ L++G S +F W SD KN + TLA+ + +P
Sbjct: 324 IDDFNDV--GPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVEGTLAKTIINEP 381
>Q8VY18_ARATH (tr|Q8VY18) Putative cleavage and polyadenylation specificity
factor OS=Arabidopsis thaliana GN=At1g61010 PE=1 SV=1
Length = 693
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 63/120 (52%), Gaps = 6/120 (5%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YWA ++ PIY+ S ++ + ++++ M+D I F + N F+ K+I+ L +
Sbjct: 266 YWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFANS--NPFVFKHISPLNS 323
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
+ ++V GP VV+A+ L++G S +F W SD KN + TLA+ + +P
Sbjct: 324 IDDFNDV--GPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVEGTLAKTIINEP 381
>A8WN58_CAEBR (tr|A8WN58) Putative uncharacterized protein OS=Caenorhabditis
briggsae GN=CBG00745 PE=4 SV=2
Length = 652
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 200 YWAEENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKT 259
YW L PIYF ++ Y + F+ W ++ I K+F N+F K+I +
Sbjct: 301 YWERMALNVPIYFSQGLAERANQYYRLFISWTNENIKKTF--VERNMFEFKHIRPMEKGC 358
Query: 260 ELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
E + P GP+V+ ++ L G S +F +W SD N+++
Sbjct: 359 E--DQP-GPQVLFSTPGMLHGGQSLKVFKKWCSDPLNMIIM 396
>B4K8L8_DROMO (tr|B4K8L8) GI24246 OS=Drosophila mojavensis GN=GI24246 PE=4 SV=1
Length = 686
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 65/124 (52%), Gaps = 6/124 (4%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGP 268
PIY+ S ++ + ++++ M+D I + N F+ ++I+ L D++ GP
Sbjct: 275 PIYYASSLAKKCMAVYQTYINAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFDDI--GP 330
Query: 269 KVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVSK 328
V++AS +++G S ++F W +D KN V+ TLA+ + ++ P+ + +
Sbjct: 331 CVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTILSE--PEEITTLSGQ 388
Query: 329 RVPL 332
++PL
Sbjct: 389 KLPL 392
>Q4DNS8_TRYCR (tr|Q4DNS8) Cleavage and polyadenylation specificity factor,
putative (Fragment) OS=Trypanosoma cruzi
GN=Tc00.1047053511003.221 PE=4 SV=1
Length = 430
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 55/103 (53%), Gaps = 5/103 (4%)
Query: 200 YW-AEENLVY-PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW A + L + PIY+ S ++ + ++F+ M+D + + R N F+ KYI L+
Sbjct: 280 YWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKQQHANHR-NPFVFKYIHSLME 338
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLF 300
++ GP VVLAS L++G S ++F W D +N ++
Sbjct: 339 TRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDRRNGIII 379
>B4M3X4_DROVI (tr|B4M3X4) GJ10819 OS=Drosophila virilis GN=GJ10819 PE=4 SV=1
Length = 686
Score = 49.7 bits (117), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 65/124 (52%), Gaps = 6/124 (4%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGP 268
PIY+ S ++ + ++++ M+D I + N F+ ++I+ L D++ GP
Sbjct: 275 PIYYASSLAKKCMAVYQTYINAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFDDI--GP 330
Query: 269 KVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVSK 328
V++AS +++G S ++F W +D KN V+ TLA+ + ++ P+ + +
Sbjct: 331 CVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTILSE--PEEITTLSGQ 388
Query: 329 RVPL 332
++PL
Sbjct: 389 KLPL 392
>Q8SUE4_ENCCU (tr|Q8SUE4) Putative uncharacterized protein ECU10_0900
OS=Encephalitozoon cuniculi GN=ECU10_0900 PE=4 SV=1
Length = 730
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 63/124 (50%), Gaps = 6/124 (4%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPDGP 268
PIY+ S ++ + ++++ M++ I K N F KY+ L D+ +GP
Sbjct: 349 PIYYASALAKRCMGVYQTYIGMMNERIQKL--SLVRNPFAFKYVKNLKGIDSFDD--EGP 404
Query: 269 KVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVSK 328
V++AS L++G S D+F W SD KN V+ TLA+ + ++ PK ++ K
Sbjct: 405 CVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCVDGTLAKEILSE--PKEIEAMNGK 462
Query: 329 RVPL 332
++ L
Sbjct: 463 KLRL 466
>B0X8N8_CULQU (tr|B0X8N8) Cleavage and polyadenylation specificity factor
OS=Culex quinquefasciatus GN=CpipJ_CPIJ015409 PE=4 SV=1
Length = 688
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 64/121 (52%), Gaps = 8/121 (6%)
Query: 200 YWAE--ENLVYPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW++ E PIY+ S ++ + ++++ M+D I + N F+ ++I+ N
Sbjct: 263 YWSQNPELQEIPIYYASSLAKKCMAVYQTYINAMNDKIRRQI--AVNNPFVFRHIS---N 317
Query: 258 KTELDNVPD-GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD 316
+D+ D GP VV+AS +++G S ++F W SD KN V+ TLA+ + ++
Sbjct: 318 LKGIDHFEDIGPCVVMASPGMMQSGLSRELFETWCSDPKNGVIIAGYCVEGTLAKTVLSE 377
Query: 317 P 317
P
Sbjct: 378 P 378
>B4QU05_DROSI (tr|B4QU05) GD20157 OS=Drosophila simulans GN=GD20157 PE=4 SV=1
Length = 684
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/168 (23%), Positives = 84/168 (50%), Gaps = 11/168 (6%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD-G 267
PIY+ S ++ + ++++ M+D I + N F+ ++I+ N +D+ D G
Sbjct: 273 PIYYASSLAKKCMAVYQTYINAMNDRIRRQI--AVNNPFVFRHIS---NLKGIDHFEDIG 327
Query: 268 PKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVS 327
P V++AS +++G S ++F W +D KN V+ TLA+ + ++ P+ +
Sbjct: 328 PCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTVLSE--PEEITTLSG 385
Query: 328 KRVPL-VGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTS 374
+++PL + + I++ + + E ++ L+K + HG + S
Sbjct: 386 QKLPLNMSVDYISFSAHTDYQQTSEFIR--LLKPTHVVLVHGEQNEMS 431
>B3LW78_DROAN (tr|B3LW78) GF18101 OS=Drosophila ananassae GN=GF18101 PE=4 SV=1
Length = 684
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/179 (21%), Positives = 90/179 (50%), Gaps = 13/179 (7%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
+W++ ++ PIY+ S ++ + ++++ M+D I + N F+ ++I+ N
Sbjct: 262 FWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRRQI--AVNNPFVFRHIS---N 316
Query: 258 KTELDNVPD-GPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQAD 316
+D+ D GP V++AS +++G S ++F W +D KN V+ TLA+ + ++
Sbjct: 317 LKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTILSE 376
Query: 317 PPPKAVKVTVSKRVPL-VGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTS 374
P+ + +++PL + + I++ + + E ++ ++K + HG + S
Sbjct: 377 --PEEITTLSGQKLPLNMSVDYISFSAHTDYQQTSEFIR--MLKPTHVVLVHGEQNEMS 431
>A8N7F0_COPC7 (tr|A8N7F0) Putative uncharacterized protein OS=Coprinopsis cinerea
(strain Okayama-7 / 130 / FGSC 9003) GN=CC1G_03293 PE=4
SV=1
Length = 802
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 63/122 (51%), Gaps = 5/122 (4%)
Query: 200 YWAEENLVY--PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW + ++ PIY+ S ++ + ++++ M+ I F + R+N F+ KYI+ L
Sbjct: 210 YWKKHPDLHNVPIYYASSLARKCMAVYQTYIHTMNANIRTRFAK-RDNPFVFKYISNLPQ 268
Query: 258 KTELDN-VPDGPK-VVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQA 315
+ + +GP VVLAS ++ G S ++F W D +N ++ T TLAR +
Sbjct: 269 TRGWEKKIAEGPPCVVLASPGFMQVGPSRELFELWAPDARNGLIITGYSIEGTLARDIMT 328
Query: 316 DP 317
+P
Sbjct: 329 EP 330
>Q9VE51_DROME (tr|Q9VE51) CG7698-PA (RE31408p) OS=Drosophila melanogaster
GN=CG7698 PE=1 SV=2
Length = 684
Score = 48.9 bits (115), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/168 (23%), Positives = 84/168 (50%), Gaps = 11/168 (6%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD-G 267
PIY+ S ++ + ++++ M+D I + N F+ ++I+ N +D+ D G
Sbjct: 273 PIYYASSLAKKCMAVYQTYINAMNDRIRRQI--AVNNPFVFRHIS---NLKGIDHFEDIG 327
Query: 268 PKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVS 327
P V++AS +++G S ++F W +D KN V+ TLA+ + ++ P+ +
Sbjct: 328 PCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKAVLSE--PEEITTLSG 385
Query: 328 KRVPL-VGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTS 374
+++PL + + I++ + + E ++ L+K + HG + S
Sbjct: 386 QKLPLNMSVDYISFSAHTDYQQTSEFIR--LLKPTHVVLVHGEQNEMS 431
>B4I269_DROSE (tr|B4I269) GM18692 OS=Drosophila sechellia GN=GM18692 PE=4 SV=1
Length = 684
Score = 48.9 bits (115), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/168 (23%), Positives = 84/168 (50%), Gaps = 11/168 (6%)
Query: 209 PIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVNKTELDNVPD-G 267
PIY+ S ++ + ++++ M+D I + N F+ ++I+ N +D+ D G
Sbjct: 273 PIYYASSLAKKCMAVYQTYINAMNDRIRRQI--AVNNPFVFRHIS---NLKGIDHFEDIG 327
Query: 268 PKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADPPPKAVKVTVS 327
P V++AS +++G S ++F W +D KN V+ TLA+ + ++ P+ +
Sbjct: 328 PCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTVLSE--PEEITTLSG 385
Query: 328 KRVPL-VGDELIAYEEEQNRIKKEEALKASLIKAEELKASHGADSNTS 374
+++PL + + I++ + + E ++ L+K + HG + S
Sbjct: 386 QKLPLNMSVDYISFSAHTDYQQTSEFIR--LLKPTHVVLVHGEQNEMS 431
>Q7PX56_ANOGA (tr|Q7PX56) AGAP001224-PA (Fragment) OS=Anopheles gambiae
GN=AGAP001224 PE=4 SV=4
Length = 588
Score = 48.9 bits (115), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 69/135 (51%), Gaps = 8/135 (5%)
Query: 200 YWAEENLV--YPIYFLSYVSSSTIDYVKSFLEWMSDTIAKSFEQTRENIFLLKYITLLVN 257
YW++ + PIY+ S ++ + ++++ M+D I + N F+ ++I+ L
Sbjct: 248 YWSQNPDLQEIPIYYASSLAKKCMAVYQTYINAMNDKIRRQI--AINNPFVFRFISNLKG 305
Query: 258 KTELDNVPDGPKVVLASMASLEAGFSHDIFVEWGSDVKNLVLFTEKGQFATLARMLQADP 317
D+V GP VV+AS +++G S ++F W +D KN V+ TLA+ + +
Sbjct: 306 IDHFDDV--GPCVVMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTILFE- 362
Query: 318 PPKAVKVTVSKRVPL 332
P+ + +++PL
Sbjct: 363 -PEEITSMNGQKLPL 376