FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3903, 465 aa
1>>>pF1KE3903 465 - 465 aa - 465 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 10.1275+/-0.000971; mu= -2.4616+/- 0.059
mean_var=321.6713+/-64.858, 0's: 0 Z-trim(116.1): 58 B-trim: 0 in 0/52
Lambda= 0.071510
statistics sampled from 16592 (16647) to 16592 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.807), E-opt: 0.2 (0.511), width: 16
Scan time: 3.540
The best scores are:                                       opt bits E(32554)
CCDS6604.1 RNF38 gene_id:152006|Hs108|chr9         ( 465) 3360 360.0   3e-99
CCDS6603.1 RNF38 gene_id:152006|Hs108|chr9         ( 515) 3331 357.1 2.6e-98
CCDS4404.1 RNF44 gene_id:22838|Hs108|chr5          ( 432) 1836 202.7 6.1e-52
>>CCDS6604.1 RNF38 gene_id:152006|Hs108|chr9 (465 aa)
initn: 3360 init1: 3360 opt: 3360 Z-score: 1895.3 bits: 360.0 E(32554): 3e-99
Smith-Waterman score: 3360; 100.0% identity (100.0% similar) in 465 aa overlap (1-465:1-465)
               10        20        30        40        50        60
pF1KE3 MACKSEDSPSPKRQRLSHSVFDYTSASPAPSPPMRPWEMTSNRQPPSVRPSQHHFSGERC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 MACKSEDSPSPKRQRLSHSVFDYTSASPAPSPPMRPWEMTSNRQPPSVRPSQHHFSGERC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 NTPARNRRSPPVRRQRGRRDRLSRHNSISQDENYHHLPYAQQQAIEEPRAFHPPNVSPRL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 NTPARNRRSPPVRRQRGRRDRLSRHNSISQDENYHHLPYAQQQAIEEPRAFHPPNVSPRL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 LHPAAHPPQQNAVMVDIHDQLHQGTVPVSYTVTTVAPHGIPLCTGQHIPACSTQQVPGCS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 LHPAAHPPQQNAVMVDIHDQLHQGTVPVSYTVTTVAPHGIPLCTGQHIPACSTQQVPGCS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE3 VVFSGQHLPVCSVPPPMLQACSVQHLPVPYAAFPPLISSDPFLIHPPHLSPHHPPHLPPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 VVFSGQHLPVCSVPPPMLQACSVQHLPVPYAAFPPLISSDPFLIHPPHLSPHHPPHLPPP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE3 GQFVPFQTQQSRSPLQRIENEVELLGEHLPVGGFTYPPSAHPPTLPPSAPLQFLTHDPLH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 GQFVPFQTQQSRSPLQRIENEVELLGEHLPVGGFTYPPSAHPPTLPPSAPLQFLTHDPLH
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE3 QEVSFGVPYPPFMPRRLTGRSRYRSQQPIPPPPYHPSLLPYVLSMLPVPPAVGPTFSFEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 QEVSFGVPYPPFMPRRLTGRSRYRSQQPIPPPPYHPSLLPYVLSMLPVPPAVGPTFSFEL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE3 DVEDGEVENYEALLNLAERLGEAKPRGLTKADIEQLPSYRFNPNNHQSEQTLCVVCMCDF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 DVEDGEVENYEALLNLAERLGEAKPRGLTKADIEQLPSYRFNPNNHQSEQTLCVVCMCDF
              370       380       390       400       410       420

              430       440       450       460
pF1KE3 ESRQLLRVLPCNHEFHAKCVDKWLKANRTCPICRADASEVHRDSE
       :::::::::::::::::::::::::::::::::::::::::::::
CCDS66 ESRQLLRVLPCNHEFHAKCVDKWLKANRTCPICRADASEVHRDSE
              430       440       450       460
>>CCDS6603.1 RNF38 gene_id:152006|Hs108|chr9 (515 aa)
initn: 3331 init1: 3331 opt: 3331 Z-score: 1878.6 bits: 357.1 E(32554): 2.6e-98
Smith-Waterman score: 3331; 99.8% identity (100.0% similar) in 462 aa overlap (4-465:54-515)
                                          10        20        30
pF1KE3                            MACKSEDSPSPKRQRLSHSVFDYTSASPAPSPP
                                     .:::::::::::::::::::::::::::::
CCDS66 ERVRLQSLFPLLPSDQNTTVQEDAHFKAFFQSEDSPSPKRQRLSHSVFDYTSASPAPSPP
            30        40        50        60        70        80

            40        50        60        70        80        90
pF1KE3 MRPWEMTSNRQPPSVRPSQHHFSGERCNTPARNRRSPPVRRQRGRRDRLSRHNSISQDEN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 MRPWEMTSNRQPPSVRPSQHHFSGERCNTPARNRRSPPVRRQRGRRDRLSRHNSISQDEN
            90       100       110       120       130       140

           100       110       120       130       140       150
pF1KE3 YHHLPYAQQQAIEEPRAFHPPNVSPRLLHPAAHPPQQNAVMVDIHDQLHQGTVPVSYTVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 YHHLPYAQQQAIEEPRAFHPPNVSPRLLHPAAHPPQQNAVMVDIHDQLHQGTVPVSYTVT
           150       160       170       180       190       200

           160       170       180       190       200       210
pF1KE3 TVAPHGIPLCTGQHIPACSTQQVPGCSVVFSGQHLPVCSVPPPMLQACSVQHLPVPYAAF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 TVAPHGIPLCTGQHIPACSTQQVPGCSVVFSGQHLPVCSVPPPMLQACSVQHLPVPYAAF
           210       220       230       240       250       260

           220       230       240       250       260       270
pF1KE3 PPLISSDPFLIHPPHLSPHHPPHLPPPGQFVPFQTQQSRSPLQRIENEVELLGEHLPVGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 PPLISSDPFLIHPPHLSPHHPPHLPPPGQFVPFQTQQSRSPLQRIENEVELLGEHLPVGG
           270       280       290       300       310       320

           280       290       300       310       320       330
pF1KE3 FTYPPSAHPPTLPPSAPLQFLTHDPLHQEVSFGVPYPPFMPRRLTGRSRYRSQQPIPPPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 FTYPPSAHPPTLPPSAPLQFLTHDPLHQEVSFGVPYPPFMPRRLTGRSRYRSQQPIPPPP
           330       340       350       360       370       380

           340       350       360       370       380       390
pF1KE3 YHPSLLPYVLSMLPVPPAVGPTFSFELDVEDGEVENYEALLNLAERLGEAKPRGLTKADI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 YHPSLLPYVLSMLPVPPAVGPTFSFELDVEDGEVENYEALLNLAERLGEAKPRGLTKADI
           390       400       410       420       430       440

           400       410       420       430       440       450
pF1KE3 EQLPSYRFNPNNHQSEQTLCVVCMCDFESRQLLRVLPCNHEFHAKCVDKWLKANRTCPIC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 EQLPSYRFNPNNHQSEQTLCVVCMCDFESRQLLRVLPCNHEFHAKCVDKWLKANRTCPIC
           450       460       470       480       490       500

           460
pF1KE3 RADASEVHRDSE
       ::::::::::::
CCDS66 RADASEVHRDSE
           510
>>CCDS4404.1 RNF44 gene_id:22838|Hs108|chr5 (432 aa)
initn: 1381 init1: 730 opt: 1836 Z-score: 1046.0 bits: 202.7 E(32554): 6.1e-52
Smith-Waterman score: 1836; 60.9% identity (79.8% similar) in 445 aa overlap (34-465:1-432)
10 20 30 40 50 60
pF1KE3 KSEDSPSPKRQRLSHSVFDYTSASPAPSPPMRPWEMTSNRQPPSVRPSQHHFSGERCNTP
:::: .. .: :::. .:..::. .::
CCDS44 MRPWALAVTRWPPSAPVGQRRFSAGPGSTP
10 20 30
70 80 90 100 110
pF1KE3 ARNRRSP----PVRRQRGRRDRLSRHNSISQDENYHHLPYAQQQAIEEPRAFHPPNVSPR
.. :: :. .: .:: .. :. ::: .:: :: : . :::
CCDS44 GQLWGSPGLEGPLASPPARDERLPSQQPPSRPP---HLP------VEERRASAPAGGSPR
40 50 60 70 80
120 130 140 150 160 170
pF1KE3 LLHPAAHPPQQNAVMVDIHDQLHQGTVPVSYTVTTVAPHGIPLCTGQHIPACSTQQVPGC
.::::. ::. :::.:.:.::: ::.:::::::. .:.:: ::::::.::.::.:.:
CCDS44 MLHPAT---QQSPFMVDLHEQVHQGPVPLSYTVTTVTTQGFPLPTGQHIPGCSAQQLPAC
90 100 110 120 130
180 190 200 210 220 230
pF1KE3 SVVFSGQHLPVCSVPPPMLQACSVQHLPVPYAAFPPLISSDPFLIHPPHLSPH-HPPHLP
::.::::: :.: .:::..:::..:.::::: :.: ::::: ...::: .: .: :.
CCDS44 SVMFSGQHYPLCCLPPPLIQACTMQQLPVPYQAYPHLISSDHYILHPPPPAPPPQPTHMA
140 150 160 170 180 190
240 250 260 270 280 290
pF1KE3 PPGQFVPFQTQQSRSPLQRIENEVELLGEHLPVGGFTYPPSAHPPTLPPSAPLQFLTHDP
: :::: .:::. : ::::..:.:.: :.. .:.::: :: :.: ::.::..: :::
CCDS44 PLGQFVSLQTQHPRMPLQRLDNDVDLRGDQPSLGSFTYSTSAPGPALSPSVPLHYLPHDP
200 210 220 230 240 250
300 310 320 330 340 350
pF1KE3 LHQEVSFGVPYPPFMPRRLTGRSRYRSQQPIPPPP-------YHPSLLPYVLSMLPVPP-
::::.:::::: .:::::. .::: :::.:::: :.::.::: :::::. :
CCDS44 LHQELSFGVPYSHMMPRRLS-TQRYRLQQPLPPPPPPPPPPPYYPSFLPYFLSMLPMSPT
260 270 280 290 300 310
360 370 380 390 400 410
pF1KE3 AVGPTFSFELDVEDGEVENYEALLNLAERLGEAKPRGLTKADIEQLPSYRFNPNNHQSEQ
:.:::.:..:::.: :.::::::::::::::.:::::::::::::::::::::..:::::
CCDS44 AMGPTISLDLDVDDVEMENYEALLNLAERLGDAKPRGLTKADIEQLPSYRFNPDSHQSEQ
320 330 340 350 360 370
420 430 440 450 460
pF1KE3 TLCVVCMCDFESRQLLRVLPCNHEFHAKCVDKWLKANRTCPICRADASEVHRDSE
::::::. :::.::::::::::::::.::::::::::::::::::::::: :..:
CCDS44 TLCVVCFSDFEARQLLRVLPCNHEFHTKCVDKWLKANRTCPICRADASEVPREAE
380 390 400 410 420 430
465 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 09:06:07 2016 done: Sun Nov 6 09:06:07 2016
Total Scan time: 3.540 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]