FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5995, 577 aa
1>>>pF1KB5995 577 - 577 aa - 577 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 11.4998+/-0.00129; mu= -5.5941+/- 0.078
mean_var=483.5272+/-99.616, 0's: 0 Z-trim(114.3): 70 B-trim: 0 in 0/51
Lambda= 0.058326
statistics sampled from 14846 (14900) to 14846 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.756), E-opt: 0.2 (0.458), width: 16
Scan time: 4.200
The best scores are: opt bits E(32554)
CCDS14473.1 CSTF2 gene_id:1478|Hs108|chrX ( 577) 3935 345.7 9e-95
CCDS7245.1 CSTF2T gene_id:23283|Hs108|chr10 ( 616) 2535 228.0 2.7e-59
CCDS78498.1 CSTF2 gene_id:1478|Hs108|chrX ( 597) 2299 208.1 2.6e-53
>>CCDS14473.1 CSTF2 gene_id:1478|Hs108|chrX (577 aa)
initn: 3935 init1: 3935 opt: 3935 Z-score: 1814.9 bits: 345.7 E(32554): 9e-95
Smith-Waterman score: 3935; 100.0% identity (100.0% similar) in 577 aa overlap (1-577:1-577)
10 20 30 40 50 60
pF1KB5 MAGLTVRDPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MAGLTVRDPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGTGAPVIESPYGETIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGTGAPVIESPYGETIS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 PEDAPESISKAVASLPPEQMFELMKQMKLCVQNSPQEARNMLLQNPQLAYALLQAQVVMR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 PEDAPESISKAVASLPPEQMFELMKQMKLCVQNSPQEARNMLLQNPQLAYALLQAQVVMR
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 IVDPEIALKILHRQTNIPTLIAGNPQPVHGAGPGSGSNVSMNQQNPQAPQAQSLGGMHVN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 IVDPEIALKILHRQTNIPTLIAGNPQPVHGAGPGSGSNVSMNQQNPQAPQAQSLGGMHVN
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB5 GAPPLMQASMQGGVPAPGQMPAAVTGPGPGSLAPGGGMQAQVGMPGSGPVSMERGQVPMQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 GAPPLMQASMQGGVPAPGQMPAAVTGPGPGSLAPGGGMQAQVGMPGSGPVSMERGQVPMQ
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB5 DPRAAMQRGSLPANVPTPRGLLGDAPNDPRGGTLLSVTGEVEPRGYLGPPHQGPPMHHVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 DPRAAMQRGSLPANVPTPRGLLGDAPNDPRGGTLLSVTGEVEPRGYLGPPHQGPPMHHVP
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB5 GHESRGPPPHELRGGPLPEPRPLMAEPRGPMLDQRGPPLDGRGGRDPRGIDARGMEARAM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 GHESRGPPPHELRGGPLPEPRPLMAEPRGPMLDQRGPPLDGRGGRDPRGIDARGMEARAM
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB5 EARGLDARGLEARAMEARAMEARAMEARAMEARAMEVRGMEARGMDTRGPVPGPRGPIPS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 EARGLDARGLEARAMEARAMEARAMEARAMEARAMEVRGMEARGMDTRGPVPGPRGPIPS
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB5 GMQGPSPINMGAVVPQGSRQVPVMQGTGMQGASIQGGSQPGGFSPGQNQVTPQDHEKAAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 GMQGPSPINMGAVVPQGSRQVPVMQGTGMQGASIQGGSQPGGFSPGQNQVTPQDHEKAAL
490 500 510 520 530 540
550 560 570
pF1KB5 IMQVLQLTADQIAMLPPEQRQSILILKEQIQKSTGAP
:::::::::::::::::::::::::::::::::::::
CCDS14 IMQVLQLTADQIAMLPPEQRQSILILKEQIQKSTGAP
550 560 570
>>CCDS7245.1 CSTF2T gene_id:23283|Hs108|chr10 (616 aa)
initn: 1812 init1: 1221 opt: 2535 Z-score: 1177.9 bits: 228.0 E(32554): 2.7e-59
Smith-Waterman score: 2777; 69.4% identity (80.8% similar) in 625 aa overlap (1-570:1-609)
10 20 30 40 50 60
pF1KB5 MAGLTVRDPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYG
:..:.:::::.::::::::::::::::::::::::::::: :::::::::::::::::::
CCDS72 MSSLAVRDPAMDRSLRSVFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGTGAPVIESPYGETIS
::::::::::::::::::::::::::::::::::::::::::::: .::.:.::::. :.
CCDS72 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGPAAPIIDSPYGDPID
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 PEDAPESISKAVASLPPEQMFELMKQMKLCVQNSPQEARNMLLQNPQLAYALLQAQVVMR
::::::::..:::::::::::::::::::::::: :::::::::::::::::::::::::
CCDS72 PEDAPESITRAVASLPPEQMFELMKQMKLCVQNSHQEARNMLLQNPQLAYALLQAQVVMR
130 140 150 160 170 180
190 200 210 220 230
pF1KB5 IVDPEIALKILHRQTNIPTLIAGNPQ------PVHGAGPG--SGSNVSMNQQNPQAPQAQ
:.:::::::::::. .. :: :. : : : ::: : :: .::::: ::: :
CCDS72 IMDPEIALKILHRKIHVTPLIPGKSQSVSVSGPGPGPGPGLCPGPNVLLNQQNPPAPQPQ
190 200 210 220 230 240
240 250 260 270 280 290
pF1KB5 SLGGMHVNGAPPLMQASMQGGVPAPGQMPAAVTGPGPGSLAPGGGMQAQVGMPGSGPVSM
:. :. :::::. .:::.:::: .:::: : :::::.:::.:: :.:::: ::: .
CCDS72 HLARRPVKDIPPLMQTPIQGGIPAPGPIPAAVPGAGPGSLTPGGAMQPQLGMPGVGPVPL
250 260 270 280 290 300
300 310 320 330 340 350
pF1KB5 ERGQVPMQDPRAAMQRGSL-PANVPTPRGLLGDAPNDPRGGTLLSVTGEVEPRGYLGPPH
::::: :.:::: . :: . :...: ::::::::::::::::::::::::::::::::::
CCDS72 ERGQVQMSDPRAPIPRGPVTPGGLP-PRGLLGDAPNDPRGGTLLSVTGEVEPRGYLGPPH
310 320 330 340 350
360 370 380 390 400 410
pF1KB5 QGPPMHHVPGHESRGPPPHELRGGPLPEPRPLMAEPRGPMLDQRGPPLDGRGGRDPRGID
:::::::. ::..::: ::.::::: .:: :..::::::.:::: :.:::::::
CCDS72 QGPPMHHASGHDTRGPSSHEMRGGPLGDPRLLIGEPRGPMIDQRGLPMDGRGGRD-----
360 370 380 390 400 410
420 430 440 450 460 470
pF1KB5 ARGMEARAMEARGLDARGLEARAMEARAMEARAMEARAMEARAMEVRGMEARGMDTRGPV
.:.::.::::.. ..:.:.:: :.::. :::.:.::.:::.:::.. ::::
CCDS72 SRAMETRAMETE----------VLETRVMERRGMETCAMETRGMEARGMDARGLEMRGPV
420 430 440 450 460
480 490 500 510
pF1KB5 PGPRGPIPSGMQGPSPINMGAV-VPQGSRQVPVM---------------QGTGMQGASIQ
:. :::. .:.:::.:::.:: ::: :::: . ::::::::.::
CCDS72 PSSRGPMTGGIQGPGPINIGAGGPPQGPRQVPGISGVGNPGAGMQGTGIQGTGMQGAGIQ
470 480 490 500 510 520
520 530 540
pF1KB5 GG------------------------------SQPGGFSPGQNQVTPQDHEKAALIMQVL
:: :::..:::::.::::::.::::::::::
CCDS72 GGGMQGAGIQGVSIQGGGIQGGGIQGASKQGGSQPSSFSPGQSQVTPQDQEKAALIMQVL
530 540 550 560 570 580
550 560 570
pF1KB5 QLTADQIAMLPPEQRQSILILKEQIQKSTGAP
:::::::::::::::::::::::::
CCDS72 QLTADQIAMLPPEQRQSILILKEQIQKSTGAS
590 600 610
>>CCDS78498.1 CSTF2 gene_id:1478|Hs108|chrX (597 aa)
initn: 2043 init1: 2043 opt: 2299 Z-score: 1070.7 bits: 208.1 E(32554): 2.6e-53
Smith-Waterman score: 3885; 96.6% identity (96.6% similar) in 597 aa overlap (1-577:1-597)
10 20 30 40 50 60
pF1KB5 MAGLTVRDPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 MAGLTVRDPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGTGAPVIESPYGETIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGTGAPVIESPYGETIS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 PEDAPESISKAVASLPPEQMFELMKQMKLCVQNSPQEARNMLLQNPQLAYALLQAQVVMR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 PEDAPESISKAVASLPPEQMFELMKQMKLCVQNSPQEARNMLLQNPQLAYALLQAQVVMR
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 IVDPEIALKILHRQTNIPTLIAGNPQPVHGAGPGSGSNVSMNQQNPQAPQAQSLGGMHVN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 IVDPEIALKILHRQTNIPTLIAGNPQPVHGAGPGSGSNVSMNQQNPQAPQAQSLGGMHVN
190 200 210 220 230 240
250 260 270 280 290
pF1KB5 GAPPLMQASMQGGVPAPGQMPAAVTGPGPGSLAPGGGMQAQVGMPGSGPVSMERGQ----
::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 GAPPLMQASMQGGVPAPGQMPAAVTGPGPGSLAPGGGMQAQVGMPGSGPVSMERGQGTLQ
250 260 270 280 290 300
300 310 320 330 340
pF1KB5 ----------------VPMQDPRAAMQRGSLPANVPTPRGLLGDAPNDPRGGTLLSVTGE
::::::::::::::::::::::::::::::::::::::::::::
CCDS78 HSPVGPAGPASIERVQVPMQDPRAAMQRGSLPANVPTPRGLLGDAPNDPRGGTLLSVTGE
310 320 330 340 350 360
350 360 370 380 390 400
pF1KB5 VEPRGYLGPPHQGPPMHHVPGHESRGPPPHELRGGPLPEPRPLMAEPRGPMLDQRGPPLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 VEPRGYLGPPHQGPPMHHVPGHESRGPPPHELRGGPLPEPRPLMAEPRGPMLDQRGPPLD
370 380 390 400 410 420
410 420 430 440 450 460
pF1KB5 GRGGRDPRGIDARGMEARAMEARGLDARGLEARAMEARAMEARAMEARAMEARAMEVRGM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 GRGGRDPRGIDARGMEARAMEARGLDARGLEARAMEARAMEARAMEARAMEARAMEVRGM
430 440 450 460 470 480
470 480 490 500 510 520
pF1KB5 EARGMDTRGPVPGPRGPIPSGMQGPSPINMGAVVPQGSRQVPVMQGTGMQGASIQGGSQP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 EARGMDTRGPVPGPRGPIPSGMQGPSPINMGAVVPQGSRQVPVMQGTGMQGASIQGGSQP
490 500 510 520 530 540
530 540 550 560 570
pF1KB5 GGFSPGQNQVTPQDHEKAALIMQVLQLTADQIAMLPPEQRQSILILKEQIQKSTGAP
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 GGFSPGQNQVTPQDHEKAALIMQVLQLTADQIAMLPPEQRQSILILKEQIQKSTGAP
550 560 570 580 590
577 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 10:52:52 2016 done: Sat Nov 5 10:52:52 2016
Total Scan time: 4.200 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]