FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8303, 600 aa
1>>>pF1KB8303 600 - 600 aa - 600 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.7238+/-0.00106; mu= 16.7059+/- 0.064
mean_var=68.0336+/-13.595, 0's: 0 Z-trim(102.9): 25 B-trim: 0 in 0/51
Lambda= 0.155494
statistics sampled from 7150 (7161) to 7150 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.588), E-opt: 0.2 (0.22), width: 16
Scan time: 3.170
The best scores are:                                opt  bits E(32554)
CCDS21.1 CPSF3L gene_id:54973|Hs108|chr1    ( 600) 4015 910.2        0
CCDS57960.1 CPSF3L gene_id:54973|Hs108|chr1 ( 606) 3955 896.7        0
CCDS57959.1 CPSF3L gene_id:54973|Hs108|chr1 ( 571) 3825 867.6        0
CCDS57961.1 CPSF3L gene_id:54973|Hs108|chr1 ( 499) 3032 689.7 2.4e-198
CCDS72678.1 CPSF3L gene_id:54973|Hs108|chr1 ( 502) 3020 687.0 1.6e-197
CCDS1664.1 CPSF3 gene_id:51692|Hs108|chr2   ( 684) 1147 266.8  6.3e-71
CCDS82417.1 CPSF3 gene_id:51692|Hs108|chr2  ( 647) 1039 242.6  1.2e-63
CCDS9902.1 CPSF2 gene_id:53981|Hs108|chr14  ( 782)  438 107.8  5.4e-23
>>CCDS21.1 CPSF3L gene_id:54973|Hs108|chr1 (600 aa)
initn: 4015 init1: 4015 opt: 4015 Z-score: 4864.9 bits: 910.2 E(32554): 0
Smith-Waterman score: 4015; 99.8% identity (99.8% similar) in 600 aa overlap (1-600:1-600)
10 20 30 40 50 60
pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDKKGEANF
:::::::::::::::::::::::::::::::::: :::::::::::::::::::::::::
CCDS21 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANF
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB8 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB8 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB8 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB8 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB8 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD
490 500 510 520 530 540
550 560 570 580 590 600
pF1KB8 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS
550 560 570 580 590 600
>>CCDS57960.1 CPSF3L gene_id:54973|Hs108|chr1 (606 aa)
initn: 3955 init1: 3955 opt: 3955 Z-score: 4792.0 bits: 896.7 E(32554): 0
Smith-Waterman score: 3955; 99.8% identity (99.8% similar) in 591 aa overlap (10-600:16-606)
10 20 30 40 50
pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQN
:::::::::::::::::::::::::::::::::::::::::::::
CCDS57 MCGAGFGHFEWLAGGGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQN
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB8 GRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDK
:::::::::::::::::::::::::::::::::::::::: :::::::::::::::::::
CCDS57 GRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDK
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB8 KGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 KGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVV
130 140 150 160 170 180
180 190 200 210 220 230
pF1KB8 YTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 YTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGK
190 200 210 220 230 240
240 250 260 270 280 290
pF1KB8 VLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 VLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF
250 260 270 280 290 300
300 310 320 330 340 350
pF1KB8 VQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 VQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYC
310 320 330 340 350 360
360 370 380 390 400 410
pF1KB8 VQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 VQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVH
370 380 390 400 410 420
420 430 440 450 460 470
pF1KB8 GEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 GEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAK
430 440 450 460 470 480
480 490 500 510 520 530
pF1KB8 KPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 KPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHL
490 500 510 520 530 540
540 550 560 570 580 590
pF1KB8 KSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 KSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKG
550 560 570 580 590 600
600
pF1KB8 LPQAPS
::::::
CCDS57 LPQAPS
>>CCDS57959.1 CPSF3L gene_id:54973|Hs108|chr1 (571 aa)
initn: 3825 init1: 3825 opt: 3825 Z-score: 4634.9 bits: 867.6 E(32554): 0
Smith-Waterman score: 3825; 99.8% identity (99.8% similar) in 571 aa overlap (30-600:1-571)
10 20 30 40 50 60
pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDF
:::::::::::::::::::::::::::::::
CCDS57 MLDCGMHMGFNDDRRFPDFSYITQNGRLTDF
10 20 30
70 80 90 100 110 120
pF1KB8 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDKKGEANF
:::::::::::::::::::::::::::::::::: :::::::::::::::::::::::::
CCDS57 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANF
40 50 60 70 80 90
130 140 150 160 170 180
pF1KB8 FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN
100 110 120 130 140 150
190 200 210 220 230 240
pF1KB8 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF
160 170 180 190 200 210
250 260 270 280 290 300
pF1KB8 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF
220 230 240 250 260 270
310 320 330 340 350 360
pF1KB8 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG
280 290 300 310 320 330
370 380 390 400 410 420
pF1KB8 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM
340 350 360 370 380 390
430 440 450 460 470 480
pF1KB8 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH
400 410 420 430 440 450
490 500 510 520 530 540
pF1KB8 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD
460 470 480 490 500 510
550 560 570 580 590 600
pF1KB8 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS
520 530 540 550 560 570
>>CCDS57961.1 CPSF3L gene_id:54973|Hs108|chr1 (499 aa)
initn: 3306 init1: 3026 opt: 3032 Z-score: 3674.4 bits: 689.7 E(32554): 2.4e-198
Smith-Waterman score: 3108; 83.2% identity (83.2% similar) in 600 aa overlap (1-600:1-499)
10 20 30 40 50 60
pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDF
::::::::::::::::::::::::::::::::::::::::::
CCDS57 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDD------------------
10 20 30 40
70 80 90 100 110 120
pF1KB8 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDKKGEANF
CCDS57 ------------------------------------------------------------
130 140 150 160 170 180
pF1KB8 FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN
:::::::::::::::::::::::::::::::::::::
CCDS57 -----------------------VDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN
50 60 70
190 200 210 220 230 240
pF1KB8 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF
80 90 100 110 120 130
250 260 270 280 290 300
pF1KB8 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF
140 150 160 170 180 190
310 320 330 340 350 360
pF1KB8 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG
200 210 220 230 240 250
370 380 390 400 410 420
pF1KB8 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM
260 270 280 290 300 310
430 440 450 460 470 480
pF1KB8 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH
320 330 340 350 360 370
490 500 510 520 530 540
pF1KB8 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD
380 390 400 410 420 430
550 560 570 580 590 600
pF1KB8 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS
440 450 460 470 480 490
>>CCDS72678.1 CPSF3L gene_id:54973|Hs108|chr1 (502 aa)
initn: 3081 init1: 3020 opt: 3020 Z-score: 3659.8 bits: 687.0 E(32554): 1.6e-197
Smith-Waterman score: 3020; 99.8% identity (100.0% similar) in 457 aa overlap (144-600:46-502)
120 130 140 150 160 170
pF1KB8 KKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESV
:.::::::::::::::::::::::::::::
CCDS72 SWNLVLPPGRGSRAPSSTDTRTGFHVLPHGVNDELEIKAYYAGHVLGAAMFQIKVGSESV
20 30 40 50 60 70
180 190 200 210 220 230
pF1KB8 VYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 VYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGG
80 90 100 110 120 130
240 250 260 270 280 290
pF1KB8 KVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 KVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKT
140 150 160 170 180 190
300 310 320 330 340 350
pF1KB8 FVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 FVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGY
200 210 220 230 240 250
360 370 380 390 400 410
pF1KB8 CVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 CVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLV
260 270 280 290 300 310
420 430 440 450 460 470
pF1KB8 HGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 HGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEA
320 330 340 350 360 370
480 490 500 510 520 530
pF1KB8 KKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 KKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSH
380 390 400 410 420 430
540 550 560 570 580 590
pF1KB8 LKSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 LKSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKK
440 450 460 470 480 490
600
pF1KB8 GLPQAPS
:::::::
CCDS72 GLPQAPS
500
>>CCDS1664.1 CPSF3 gene_id:51692|Hs108|chr2 (684 aa)
initn: 1017 init1: 378 opt: 1147 Z-score: 1386.9 bits: 266.8 E(32554): 6.3e-71
Smith-Waterman score: 1147; 38.2% identity (69.7% similar) in 502 aa overlap (3-494:11-496)
10 20 30 40 50
pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYIT
.. . ::::::.::::::.. . :...:::::.: :.. .: .. :
CCDS16 MSAIPAEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLID
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB8 QNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAV
.: ..:::::::::::::.: . ... : .:::.:.:: :: :: :..
CCDS16 PAE-----IDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVS-
70 80 90 100 110
120 130 140 150 160 170
pF1KB8 DKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSES
. ... ..: ... : :. ....:.. .: ... :.::::::::::.:.... .
CCDS16 NISADDMLYTETDLEESMDKIETINFHEVKEVAG-IKFWCYHAGHVLGAAMFMIEIAGVK
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB8 VVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERG
..::::.. :::: :: : . .:..:: ::::.: :..... :: : . ::. :.::
CCDS16 LLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRG
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB8 GKVLIPVFALGRAQELCILLETFWERM-NLK-VPIYFSTGLTEKANHYYKLFIPWTNQKI
:. ::::::::::::: ..:. .:. .:. .:::....:..: :. .. :.::
CCDS16 GRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKI
240 250 260 270 280 290
300 310 320 330 340
pF1KB8 RKTFVQRNMFEFKHI---KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNM
:: . : : :::: :..:. : :. :: ::.:.:::...: : ..:..: ...:
CCDS16 RKQININNPFVFKHISNLKSMDH-F-DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNG
300 310 320 330 340 350
350 360 370 380 390 400
pF1KB8 VIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEP
::. ::::.::....:.: ... . : : .::.:.:.:::::.: . ... .:
CCDS16 VIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKP
360 370 380 390 400 410
410 420 430 440 450 460
pF1KB8 ESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQ
:.::::: ..: :: . .: . : :. .: ...:.. ...:.
CCDS16 PHVILVHGEQNEMARLKAALIREYE------DNDEVHIEVHNPRNTEAVTLNFRGEKLAK
420 430 440 450 460
470 480 490 500 510 520
pF1KB8 --GLLPEAKKPRL---LHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRK
:.: . :::. . : :. .. :....:
CCDS16 VMGFLAD-KKPEQGQRVSGILVKRNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFN
470 480 490 500 510 520
530 540 550 560 570 580
pF1KB8 EQETALRVYSHLKSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEE
CCDS16 LLCYQLQKLTGDVEELEIQEKPALKVFKNITVIQEPGMVVLEWLANPSNDMYADTVTTVI
530 540 550 560 570 580
>>CCDS82417.1 CPSF3 gene_id:51692|Hs108|chr2 (647 aa)
initn: 913 init1: 378 opt: 1039 Z-score: 1256.3 bits: 242.6 E(32554): 1.2e-63
Smith-Waterman score: 1039; 38.0% identity (68.7% similar) in 479 aa overlap (30-494:1-459)
10 20 30 40 50 60
pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDF
:::::.: :.. .: .. :
CCDS82 MLDCGIHPGLEGMDALPYIDLIDPAE-----
10 20
70 80 90 100 110 120
pF1KB8 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDKKGEANF
.: ..:::::::::::::.: . ... : .:::.:.:: :: :: :.. . ... .
CCDS82 IDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVS-NISADDML
30 40 50 60 70 80
130 140 150 160 170 180
pF1KB8 FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN
.: ... : :. ....:.. .: ... :.::::::::::.:.... ...::::..
CCDS82 YTETDLEESMDKIETINFHEVKEVAG-IKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFS
90 100 110 120 130 140
190 200 210 220 230 240
pF1KB8 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF
:::: :: : . .:..:: ::::.: :..... :: : . ::. :.:::. :::::
CCDS82 RQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVF
150 160 170 180 190 200
250 260 270 280 290
pF1KB8 ALGRAQELCILLETFWERM-NLK-VPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRN
:::::::: ..:. .:. .:. .:::....:..: :. .. :.:::: . :
CCDS82 ALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININN
210 220 230 240 250 260
300 310 320 330 340 350
pF1KB8 MFEFKHI---KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCV
: :::: :..:. : :. :: ::.:.:::...: : ..:..: ...: ::. ::::
CCDS82 PFVFKHISNLKSMDH-F-DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCV
270 280 290 300 310 320
360 370 380 390 400 410
pF1KB8 QGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHG
.::....:.: ... . : : .::.:.:.:::::.: . ... .: :.::::
CCDS82 EGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHG
330 340 350 360 370 380
420 430 440 450 460
pF1KB8 EAKKMEFLKQKI------EQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGL
: ..: :: . ..:.... . : : :.::: : .:. ..:.
CCDS82 EQNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFR-----GEKLA-----KVMGF
390 400 410 420 430
470 480 490 500 510 520
pF1KB8 LPEAKKPRL---LHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQET
: . :::. . : :. .. :....:
CCDS82 LAD-KKPEQGQRVSGILVKRNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFNLLCY
440 450 460 470 480 490
>>CCDS9902.1 CPSF2 gene_id:53981|Hs108|chr14 (782 aa)
initn: 219 init1: 145 opt: 438 Z-score: 526.3 bits: 107.8 E(32554): 5.4e-23
Smith-Waterman score: 451; 27.2% identity (56.9% similar) in 401 aa overlap (4-387:5-396)
10 20 30 40 50
pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTD
:..: :.. :. . : :... .:::: :. : : . . .
CCDS99 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMD-------IIDSLRKHVH
10 20 30 40 50
60 70 80 90 100 110
pF1KB8 FLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDKKGEAN
.: :..:: : ::::: .: . :: : . . ... : . . . . .
CCDS99 QIDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFT
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB8 FFTSQMIKDCMKKVVAVHLHQTVQVDDE---LEIKAYYAGHVLGAAMFQI-KVGSESVVY
.:: . . . :. ... : :.. . : : :::..:.....: : : : .::
CCDS99 LFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVY
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB8 TGDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGK
. :.: . ::.. .. ::.::::.: :: .. .. :....: .: ::.. :.
CCDS99 AVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGN
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB8 VLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANH----YYKLFIPWTNQKI
::: : . ::. :: ::. .:. . . .: : .: ..... . : . : ..:.
CCDS99 VLIAVDTAGRVLELAQLLDQIWRTKDAGLGVY-SLALLNNVSYNVVEFSKSQVEWMSDKL
240 250 260 270 280 290
300 310 320 330 340
pF1KB8 RKTFVQR--NMFEFKHIKAFD--RAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKN
. : .. : :.:.:.. .: :.: ::.:. :. : : ..: .: . ::
CCDS99 MRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKN
300 310 320 330 340 350
350 360 370 380 390 400
pF1KB8 MVIMPGYCVQGTVGHKILSGQRK----LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV
.:. . ::... .... . .:.. : :: : ::.
CCDS99 SIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ
360 370 380 390 400 410
410 420 430 440 450 460
pF1KB8 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK
CCDS99 SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERI
420 430 440 450 460 470
600 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 11:33:08 2016 done: Fri Nov 4 11:33:09 2016
Total Scan time: 3.170 Total Display time: 0.090
Function used was FASTA [36.3.4 Apr, 2011]
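
The per-hit header lines in this report (the ">>" line, the "initn: ... E(32554): ..." line, and the "Smith-Waterman score: ..." line) follow a regular shape, so they can be pulled into records with a few regular expressions. The following is a minimal sketch, not part of the FASTA package: the function name parse_hits, the dictionary keys, and the regular expressions are assumptions inferred only from the lines shown above, and other FASTA versions or output options may format these lines differently.

```python
import re

# Assumed line shapes, inferred from the hit blocks in this report only.
# ">>ID description (NNN aa)"; the (?!>) skips the ">>>" query banner.
HIT_RE = re.compile(r"^>>(?!>)(\S+)\s+(.*?)\s+\(\s*(\d+) aa\)$")
SCORE_RE = re.compile(
    r"initn:\s*(\d+)\s+init1:\s*(\d+)\s+opt:\s*(\d+)\s+"
    r"Z-score:\s*([\d.]+)\s+bits:\s*([\d.]+)\s+E\(\d+\):\s*(\S+)"
)
SW_RE = re.compile(
    r"Smith-Waterman score:\s*(\d+);\s*([\d.]+)% identity\s+"
    r"\(([\d.]+)% similar\)\s+in\s+(\d+) aa overlap\s+"
    r"\((\d+)-(\d+):(\d+)-(\d+)\)"
)


def parse_hits(text):
    """Return one dict per '>>' hit block found in a FASTA report string."""
    hits = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        m = HIT_RE.match(line)
        if m:
            # Start a new hit record at each '>>' header line.
            current = {
                "id": m.group(1),
                "description": m.group(2),
                "length": int(m.group(3)),
            }
            hits.append(current)
            continue
        if current is None:
            continue  # still in the report preamble
        m = SCORE_RE.search(line)
        if m:
            current.update(
                initn=int(m.group(1)),
                init1=int(m.group(2)),
                opt=int(m.group(3)),
                z_score=float(m.group(4)),
                bits=float(m.group(5)),
                evalue=float(m.group(6)),
            )
            continue
        m = SW_RE.search(line)
        if m:
            current.update(
                sw_score=int(m.group(1)),
                identity=float(m.group(2)),
                similar=float(m.group(3)),
                overlap=int(m.group(4)),
                query_range=(int(m.group(5)), int(m.group(6))),
                subject_range=(int(m.group(7)), int(m.group(8))),
            )
    return hits
```

Calling parse_hits on the full text of a report like this one would yield one record per aligned library sequence. The alignment rows themselves are ignored by this sketch.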