FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4070, 373 aa
1>>>pF1KE4070 373 - 373 aa - 373 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.8331+/-0.00092; mu= 6.7843+/- 0.056
mean_var=188.5480+/-39.104, 0's: 0 Z-trim(112.6): 34 B-trim: 42 in 1/50
Lambda= 0.093404
statistics sampled from 13328 (13361) to 13328 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.747), E-opt: 0.2 (0.41), width: 16
Scan time: 2.820
The best scores are:                               length   opt  bits E(32554)
CCDS9221.1 RNF34 gene_id:80196|Hs108|chr12         ( 373)  2562 357.3 1.3e-98
CCDS31915.1 RNF34 gene_id:80196|Hs108|chr12        ( 372)  2548 355.4 4.7e-98
CCDS73538.1 RNF34 gene_id:80196|Hs108|chr12        ( 180)  1215 175.5 3.3e-44
CCDS11286.1 RFFL gene_id:117584|Hs108|chr17        ( 363)   981 144.2 1.7e-34
>>CCDS9221.1 RNF34 gene_id:80196|Hs108|chr12 (373 aa)
initn: 2562 init1: 2562 opt: 2562 Z-score: 1884.0 bits: 357.3 E(32554): 1.3e-98
Smith-Waterman score: 2562; 100.0% identity (100.0% similar) in 373 aa overlap (1-373:1-373)
               10        20        30        40        50        60
pF1KE4 MRKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGPFRFTPNPEFSTYPPAATEGPN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS92 MRKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGPFRFTPNPEFSTYPPAATEGPN
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 IVCKACGLSFSVFRKKHVCCDCKKDFCSVCSVLQENLRRCSTCHLLQETAFQRPQLMRLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS92 IVCKACGLSFSVFRKKHVCCDCKKDFCSVCSVLQENLRRCSTCHLLQETAFQRPQLMRLK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 VKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMDTSSLNSSRSQTSSFFTRSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS92 VKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMDTSSLNSSRSQTSSFFTRSF
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 FSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQSEITSANTEDDDDDDDEDDDDEEENA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS92 FSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQSEITSANTEDDDDDDDEDDDDEEENA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 EDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEKWELVEKVNRL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS92 EDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEKWELVEKVNRL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE4 YKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHMVTCTKCGKRMSECPIC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS92 YKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHMVTCTKCGKRMSECPIC
              310       320       330       340       350       360

              370
pF1KE4 RQYVVRAVHVFKS
       :::::::::::::
CCDS92 RQYVVRAVHVFKS
              370
>>CCDS31915.1 RNF34 gene_id:80196|Hs108|chr12 (372 aa)
initn: 2548 init1: 2548 opt: 2548 Z-score: 1873.9 bits: 355.4 E(32554): 4.7e-98
Smith-Waterman score: 2548; 100.0% identity (100.0% similar) in 371 aa overlap (3-373:2-372)
               10        20        30        40        50        60
pF1KE4 MRKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGPFRFTPNPEFSTYPPAATEGPN
         ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31  MKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGPFRFTPNPEFSTYPPAATEGPN
                10        20        30        40        50

               70        80        90       100       110       120
pF1KE4 IVCKACGLSFSVFRKKHVCCDCKKDFCSVCSVLQENLRRCSTCHLLQETAFQRPQLMRLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 IVCKACGLSFSVFRKKHVCCDCKKDFCSVCSVLQENLRRCSTCHLLQETAFQRPQLMRLK
      60        70        80        90       100       110

              130       140       150       160       170       180
pF1KE4 VKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMDTSSLNSSRSQTSSFFTRSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 VKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMDTSSLNSSRSQTSSFFTRSF
     120       130       140       150       160       170

              190       200       210       220       230       240
pF1KE4 FSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQSEITSANTEDDDDDDDEDDDDEEENA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 FSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQSEITSANTEDDDDDDDEDDDDEEENA
     180       190       200       210       220       230

              250       260       270       280       290       300
pF1KE4 EDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEKWELVEKVNRL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 EDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEKWELVEKVNRL
     240       250       260       270       280       290

              310       320       330       340       350       360
pF1KE4 YKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHMVTCTKCGKRMSECPIC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 YKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHMVTCTKCGKRMSECPIC
     300       310       320       330       340       350

              370
pF1KE4 RQYVVRAVHVFKS
       :::::::::::::
CCDS31 RQYVVRAVHVFKS
     360       370
>>CCDS73538.1 RNF34 gene_id:80196|Hs108|chr12 (180 aa)
initn: 1215 init1: 1215 opt: 1215 Z-score: 907.2 bits: 175.5 E(32554): 3.3e-44
Smith-Waterman score: 1215; 99.4% identity (100.0% similar) in 179 aa overlap (195-373:2-180)
          170       180       190       200       210       220
pF1KE4 LNSSRSQTSSFFTRSFFSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQSEITSANTED
                                     .:::::::::::::::::::::::::::::
CCDS73                              MKGELMDGDQTSRSGVPAQVQSEITSANTED
                                            10        20        30

          230       240       250       260       270       280
pF1KE4 DDDDDDEDDDDEEENAEDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 DDDDDDEDDDDEEENAEDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNY
              40        50        60        70        80        90

          290       300       310       320       330       340
pF1KE4 SGCCEKWELVEKVNRLYKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 SGCCEKWELVEKVNRLYKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHM
             100       110       120       130       140       150

          350       360       370
pF1KE4 VTCTKCGKRMSECPICRQYVVRAVHVFKS
       :::::::::::::::::::::::::::::
CCDS73 VTCTKCGKRMSECPICRQYVVRAVHVFKS
             160       170       180
>>CCDS11286.1 RFFL gene_id:117584|Hs108|chr17 (363 aa)
initn: 939 init1: 432 opt: 981 Z-score: 732.8 bits: 144.2 E(32554): 1.7e-34
Smith-Waterman score: 981; 43.9% identity (70.0% similar) in 380 aa overlap (9-373:1-363)
10 20 30 40 50
pF1KE4 MRKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGP-FRFTPNPEFSTYP-PAATEG
:::.::. . . :: : .. :: .:..: :.. :
CCDS11 MWATCCNWF-------CLDGQPEEVPPPQGARMQAYSNPGYSSFPSPTGLE-
10 20 30 40
60 70 80 90 100 110
pF1KE4 PNIVCKACGLSFSVFRKKHVCCDCKKDFCSVCSVLQENLRR-CSTCHLLQETAFQRPQLM
:. ::.:: :. .:..: ::::.:: .:: : : : :. .. ::::: .::
CCDS11 PS--CKSCGAHFANTARKQTCLDCKKNFCMTCSSQVGNGPRLCLLCQRFRATAFQREELM
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE4 RLKVKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMD-TSSLNSSRSQTSSFF
..::::::.:: :..: . :::::.:: ::: .. . :..: .:.:. . . ..:.
CCDS11 KMKVKDLRDYLSLHDISTEMCREKEELVLLVLGQQPVISQEDRTRASTLSPDFPEQQAFL
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE4 TRSFFSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQ-SEITSANTEDDDDDDDEDDDD
:. :... :.. .: ... . : ::::: .. ..... .:... ..
CCDS11 TQPH-SSMVPPTSP------NLPSSSAQATSVPPAQVQENQQANGHVSQDQEEPVYLESV
170 180 190 200 210
240 250 260 270 280 290
pF1KE4 EEENAEDRNPGLSKERV-----RASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEK
. :::.. ....: :::::::..:.:.::..:::::::::::::::.:::::
CCDS11 ARVPAEDETQSIDSEDSFVPGRRASLSDLTDLEDIEGLTVRQLKEILARNFVNYKGCCEK
220 230 240 250 260 270
300 310 320 330 340
pF1KE4 WELVEKVNRLYKENEENQKSY-GERLQ----LQDEEDDSLCRICMDAVIDCVLLECGHMV
:::.:.:.::::... :. : . : . . ...::.::::. ::::::::::::
CCDS11 WELMERVTRLYKDQKGLQHLVSGAEDQNGGAVPSGLEENLCKICMDSPIDCVLLECGHMV
280 290 300 310 320 330
350 360 370
pF1KE4 TCTKCGKRMSECPICRQYVVRAVHVFKS
:::::::::.:::::::::.::::::.:
CCDS11 TCTKCGKRMNECPICRQYVIRAVHVFRS
340 350 360
373 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 16:54:54 2016 done: Mon Nov 7 16:54:55 2016
Total Scan time: 2.820 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]
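
The "best scores" lines of this report can be read programmatically. The following is a minimal Python sketch, not part of FASTA itself: the helper name `parse_hit` and the regular expression are assumptions fitted to the four hit lines shown above (accession, description, library sequence length in parentheses, opt score, bit score, E-value), not taken from the FASTA documentation.

```python
import re

# Hypothetical pattern, fitted to the hit lines in this report only:
# <accession> <description> ( <length>) <opt> <bits> <E-value>
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s+\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>[\d.eE+-]+)\s*$"
)


def parse_hit(line):
    """Return a dict for one 'best scores' line, or None if the line does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "acc": m.group("acc"),
        "desc": m.group("desc"),
        "length": int(m.group("length")),
        "opt": int(m.group("opt")),
        "bits": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }


if __name__ == "__main__":
    hit = parse_hit("CCDS11286.1 RFFL gene_id:117584|Hs108|chr17 ( 363) 981 144.2 1.7e-34")
    print(hit["acc"], hit["opt"], hit["evalue"])
```

Because the pattern separates fields on any run of whitespace, it accepts both column-aligned and single-space variants of the hit lines; the table header line is rejected and yields `None`.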