FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4038, 363 aa
1>>>pF1KB4038 363 - 363 aa - 363 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.6462+/-0.000773; mu= 7.9881+/- 0.047
mean_var=158.0736+/-32.181, 0's: 0 Z-trim(114.8): 38 B-trim: 43 in 1/52
Lambda= 0.102010
statistics sampled from 15307 (15343) to 15307 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.796), E-opt: 0.2 (0.471), width: 16
Scan time: 3.080
The best scores are:                                 opt  bits E(32554)
CCDS11286.1 RFFL gene_id:117584|Hs108|chr17  ( 363) 2533 383.9 1.2e-106
CCDS31915.1 RNF34 gene_id:80196|Hs108|chr12  ( 372)  981 155.5 6.8e-38
CCDS9221.1 RNF34 gene_id:80196|Hs108|chr12   ( 373)  981 155.5 6.8e-38
CCDS73538.1 RNF34 gene_id:80196|Hs108|chr12  ( 180)  607 100.2 1.4e-21
>>CCDS11286.1 RFFL gene_id:117584|Hs108|chr17 (363 aa)
initn: 2533 init1: 2533 opt: 2533 Z-score: 2028.3 bits: 383.9 E(32554): 1.2e-106
Smith-Waterman score: 2533; 100.0% identity (100.0% similar) in 363 aa overlap (1-363:1-363)
               10        20        30        40        50        60
pF1KB4 MWATCCNWFCLDGQPEEVPPPQGARMQAYSNPGYSSFPSPTGLEPSCKSCGAHFANTARK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MWATCCNWFCLDGQPEEVPPPQGARMQAYSNPGYSSFPSPTGLEPSCKSCGAHFANTARK
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB4 QTCLDCKKNFCMTCSSQVGNGPRLCLLCQRFRATAFQREELMKMKVKDLRDYLSLHDIST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 QTCLDCKKNFCMTCSSQVGNGPRLCLLCQRFRATAFQREELMKMKVKDLRDYLSLHDIST
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB4 EMCREKEELVLLVLGQQPVISQEDRTRASTLSPDFPEQQAFLTQPHSSMVPPTSPNLPSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 EMCREKEELVLLVLGQQPVISQEDRTRASTLSPDFPEQQAFLTQPHSSMVPPTSPNLPSS
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB4 SAQATSVPPAQVQENQQANGHVSQDQEEPVYLESVARVPAEDETQSIDSEDSFVPGRRAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 SAQATSVPPAQVQENQQANGHVSQDQEEPVYLESVARVPAEDETQSIDSEDSFVPGRRAS
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB4 LSDLTDLEDIEGLTVRQLKEILARNFVNYKGCCEKWELMERVTRLYKDQKGLQHLVSGAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 LSDLTDLEDIEGLTVRQLKEILARNFVNYKGCCEKWELMERVTRLYKDQKGLQHLVSGAE
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB4 DQNGGAVPSGLEENLCKICMDSPIDCVLLECGHMVTCTKCGKRMNECPICRQYVIRAVHV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DQNGGAVPSGLEENLCKICMDSPIDCVLLECGHMVTCTKCGKRMNECPICRQYVIRAVHV
              310       320       330       340       350       360
pF1KB4 FRS
       :::
CCDS11 FRS
>>CCDS31915.1 RNF34 gene_id:80196|Hs108|chr12 (372 aa)
initn: 939 init1: 432 opt: 981 Z-score: 793.8 bits: 155.5 E(32554): 6.8e-38
Smith-Waterman score: 981; 44.2% identity (69.1% similar) in 382 aa overlap (1-363:8-372)
10 20 30 40
pF1KB4 MWATCCNWF-------CLDGQPEEVPPPQGARMQAYSNPGYSSFPSPTGLE-P
:::.::. . . :: : .. :: .:..: :.. : :
CCDS31 MKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGP-FRFTPNPEFSTYP-PAATEGP
10 20 30 40 50
50 60 70 80 90 100
pF1KB4 S--CKSCGAHFANTARKQTCLDCKKNFCMTCSSQVGNGPRLCLLCQRFRATAFQREELMK
. ::.:: :. .:..: ::::.:: .:: . .. : : :. .. ::::: .::.
CCDS31 NIVCKACGLSFSVFRKKHVCCDCKKDFCSVCSV-LQENLRRCSTCHLLQETAFQRPQLMR
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB4 MKVKDLRDYLSLHDISTEMCREKEELVLLVLGQQPVISQEDRTRASTLSPDFPEQQAFLT
.::::::.:: :..: . :::::.:: ::: .. . :..: .:.:. . . ..:.:
CCDS31 LKVKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMD-TSSLNSSRSQTSSFFT
120 130 140 150 160 170
170 180 190 200 210
pF1KB4 QPHSSMVPPTSPNLPSSSAQ---------ATSVPPAQVQENQQANGHVSQDQEEPVYLES
. : :.:. :: : . : ::::: .. ..... .:... ..
CCDS31 RSFFSNY--TAPSATMSSFQGELMDGDQTSRSGVPAQVQ-SEITSANTEDDDDDDDEDDD
180 190 200 210 220 230
220 230 240 250 260 270
pF1KB4 VARVPAEDETQSIDSEDSFVPGRRASLSDLTDLEDIEGLTVRQLKEILARNFVNYKGCCE
. :::.. ....: :::::::..:.:.::..:::::::::::::::.::::
CCDS31 DEEENAEDRNPGLSKERV-----RASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCE
240 250 260 270 280
280 290 300 310 320 330
pF1KB4 KWELMERVTRLYKDQKGLQHLVSGAEDQNGGAVPSGLEENLCKICMDSPIDCVLLECGHM
::::.:.:.::::... :. : . : . . ...::.::::. :::::::::::
CCDS31 KWELVEKVNRLYKENEENQKSY-GERLQ----LQDEEDDSLCRICMDAVIDCVLLECGHM
290 300 310 320 330 340
340 350 360
pF1KB4 VTCTKCGKRMNECPICRQYVIRAVHVFRS
::::::::::.:::::::::.::::::.:
CCDS31 VTCTKCGKRMSECPICRQYVVRAVHVFKS
350 360 370
>>CCDS9221.1 RNF34 gene_id:80196|Hs108|chr12 (373 aa)
initn: 939 init1: 432 opt: 981 Z-score: 793.8 bits: 155.5 E(32554): 6.8e-38
Smith-Waterman score: 981; 44.2% identity (69.1% similar) in 382 aa overlap (1-363:9-373)
10 20 30 40
pF1KB4 MWATCCNWF-------CLDGQPEEVPPPQGARMQAYSNPGYSSFPSPTGLE-
:::.::. . . :: : .. :: .:..: :.. :
CCDS92 MRKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGP-FRFTPNPEFSTYP-PAATEG
10 20 30 40 50
50 60 70 80 90 100
pF1KB4 PS--CKSCGAHFANTARKQTCLDCKKNFCMTCSSQVGNGPRLCLLCQRFRATAFQREELM
:. ::.:: :. .:..: ::::.:: .:: . .. : : :. .. ::::: .::
CCDS92 PNIVCKACGLSFSVFRKKHVCCDCKKDFCSVCSV-LQENLRRCSTCHLLQETAFQRPQLM
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB4 KMKVKDLRDYLSLHDISTEMCREKEELVLLVLGQQPVISQEDRTRASTLSPDFPEQQAFL
..::::::.:: :..: . :::::.:: ::: .. . :..: .:.:. . . ..:.
CCDS92 RLKVKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMD-TSSLNSSRSQTSSFF
120 130 140 150 160 170
170 180 190 200 210
pF1KB4 TQPHSSMVPPTSPNLPSSSAQ---------ATSVPPAQVQENQQANGHVSQDQEEPVYLE
:. : :.:. :: : . : ::::: .. ..... .:... .
CCDS92 TRSFFSNY--TAPSATMSSFQGELMDGDQTSRSGVPAQVQ-SEITSANTEDDDDDDDEDD
180 190 200 210 220 230
220 230 240 250 260 270
pF1KB4 SVARVPAEDETQSIDSEDSFVPGRRASLSDLTDLEDIEGLTVRQLKEILARNFVNYKGCC
. . :::.. ....: :::::::..:.:.::..:::::::::::::::.:::
CCDS92 DDEEENAEDRNPGLSKERV-----RASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCC
240 250 260 270 280
280 290 300 310 320 330
pF1KB4 EKWELMERVTRLYKDQKGLQHLVSGAEDQNGGAVPSGLEENLCKICMDSPIDCVLLECGH
:::::.:.:.::::... :. : . : . . ...::.::::. ::::::::::
CCDS92 EKWELVEKVNRLYKENEENQKSY-GERLQ----LQDEEDDSLCRICMDAVIDCVLLECGH
290 300 310 320 330 340
340 350 360
pF1KB4 MVTCTKCGKRMNECPICRQYVIRAVHVFRS
:::::::::::.:::::::::.::::::.:
CCDS92 MVTCTKCGKRMSECPICRQYVVRAVHVFKS
350 360 370
>>CCDS73538.1 RNF34 gene_id:80196|Hs108|chr12 (180 aa)
initn: 636 init1: 339 opt: 607 Z-score: 500.7 bits: 100.2 E(32554): 1.4e-21
Smith-Waterman score: 621; 54.9% identity (80.6% similar) in 175 aa overlap (189-363:17-180)
160 170 180 190 200 210
pF1KB4 QAFLTQPHSSMVPPTSPNLPSSSAQATSVPPAQVQENQQANGHVSQDQEEPVYLESVARV
::::: .. ..... .:... .. .
CCDS73 MKGELMDGDQTSRSGVPAQVQ-SEITSANTEDDDDDDDEDDDDEEE
10 20 30 40
220 230 240 250 260 270
pF1KB4 PAEDETQSIDSEDSFVPGRRASLSDLTDLEDIEGLTVRQLKEILARNFVNYKGCCEKWEL
:::.. ....: :::::::..:.:.::..:::::::::::::::.::::::::
CCDS73 NAEDRNPGLSKERV-----RASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEKWEL
50 60 70 80 90 100
280 290 300 310 320 330
pF1KB4 MERVTRLYKDQKGLQHLVSGAEDQNGGAVPSGLEENLCKICMDSPIDCVLLECGHMVTCT
.:.:.::::... :. : . : . . ...::.::::. :::::::::::::::
CCDS73 VEKVNRLYKENEENQK-SYGERLQ----LQDEEDDSLCRICMDAVIDCVLLECGHMVTCT
110 120 130 140 150
340 350 360
pF1KB4 KCGKRMNECPICRQYVIRAVHVFRS
::::::.:::::::::.::::::.:
CCDS73 KCGKRMSECPICRQYVVRAVHVFKS
160 170 180
363 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 05:33:55 2016 done: Sat Nov 5 05:33:55 2016
Total Scan time: 3.080 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]