FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4140, 557 aa
1>>>pF1KE4140 557 - 557 aa - 557 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.7313+/-0.00108; mu= 5.6387+/- 0.065
mean_var=268.4494+/-54.819, 0's: 0 Z-trim(112.8): 39 B-trim: 0 in 0/51
Lambda= 0.078279
statistics sampled from 13485 (13516) to 13485 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.73), E-opt: 0.2 (0.415), width: 16
Scan time: 3.720
The best scores are: opt bits E(32554)
CCDS10244.1 ARIH1 gene_id:25820|Hs108|chr15 ( 557) 4005 465.8 6.2e-131
CCDS2780.1 ARIH2 gene_id:10425|Hs108|chr3 ( 493) 1135 141.6 2.1e-33
CCDS4890.1 CUL9 gene_id:23113|Hs108|chr6 (2517) 517 72.5 6.6e-12
>>CCDS10244.1 ARIH1 gene_id:25820|Hs108|chr15 (557 aa)
initn: 4005 init1: 4005 opt: 4005 Z-score: 2464.1 bits: 465.8 E(32554): 6.2e-131
Smith-Waterman score: 4005; 100.0% identity (100.0% similar) in 557 aa overlap (1-557:1-557)
10 20 30 40 50 60
pF1KE4 MDSDEGYNYEFDEDEECSEEDSGAEEEEDEDDDEPDDDTLDLGEVELVEPGLGVGGERDG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MDSDEGYNYEFDEDEECSEEDSGAEEEEDEDDDEPDDDTLDLGEVELVEPGLGVGGERDG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 LLCGETGGGGGSALGPGGGGGGGGGGGGGGPGHEQEEDYRYEVLTAEQILQHMVECIREV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LLCGETGGGGGSALGPGGGGGGGGGGGGGGPGHEQEEDYRYEVLTAEQILQHMVECIREV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 NEVIQNPATITRILLSHFNWDKEKLMERYFDGNLEKLFAECHVINPSKKSRTRQMNTRSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NEVIQNPATITRILLSHFNWDKEKLMERYFDGNLEKLFAECHVINPSKKSRTRQMNTRSS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 AQDMPCQICYLNYPNSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDILV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 AQDMPCQICYLNYPNSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDILV
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 DDNTVMRLITDSKVKLKYQHLITNSFVECNRLLKWCPAPDCHHVVKVQYPDAKPVRCKCG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 DDNTVMRLITDSKVKLKYQHLITNSFVECNRLLKWCPAPDCHHVVKVQYPDAKPVRCKCG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 RQFCFNCGENWHDPVKCKWLKKWIKKCDDDSETSNWIAANTKECPKCHVTIEKDGGCNHM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 RQFCFNCGENWHDPVKCKWLKKWIKKCDDDSETSNWIAANTKECPKCHVTIEKDGGCNHM
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 VCRNQNCKAEFCWVCLGPWEPHGSAWYNCNRYNEDDAKAARDAQERSRAALQRYLFYCNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 VCRNQNCKAEFCWVCLGPWEPHGSAWYNCNRYNEDDAKAARDAQERSRAALQRYLFYCNR
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 YMNHMQSLRFEHKLYAQVKQKMEEMQQHNMSWIEVQFLKKAVDVLCQCRATLMYTYVFAF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YMNHMQSLRFEHKLYAQVKQKMEEMQQHNMSWIEVQFLKKAVDVLCQCRATLMYTYVFAF
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE4 YLKKNNQSIIFENNQADLENATEVLSGYLERDISQDSLQDIKQKVQDKYRYCESRRRVLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YLKKNNQSIIFENNQADLENATEVLSGYLERDISQDSLQDIKQKVQDKYRYCESRRRVLL
490 500 510 520 530 540
550
pF1KE4 QHVHEGYEKDLWEYIED
:::::::::::::::::
CCDS10 QHVHEGYEKDLWEYIED
550
>>CCDS2780.1 ARIH2 gene_id:10425|Hs108|chr3 (493 aa)
initn: 950 init1: 319 opt: 1135 Z-score: 713.1 bits: 141.6 E(32554): 2.1e-33
Smith-Waterman score: 1142; 33.0% identity (64.4% similar) in 537 aa overlap (12-545:11-492)
10 20 30 40 50 60
pF1KE4 MDSDEGYNYEFDEDEECSEEDSGAEEEEDEDDDEPDDDTLDLGEVELVEPGLGVGGERDG
: .:: . : . ::::.:..:.: :..: :.. :..:
CCDS27 MSVDMNSQGSDSNEE--DYDPNCEEEEEEEEDDP-------GDIEDYYVGVASDVEQQG
10 20 30 40 50
70 80 90 100 110 120
pF1KE4 LLCGETGGGGGSALGPGGGGGGGGGGGGGGPGHEQEEDYRYEVLTAEQILQHMVECIREV
..:. : :.:.. :: .. . : . .
CCDS27 ----------ADAFDP--------------------EEYQFTCLTYKESEGALNEHMTSL
60 70 80
130 140 150 160 170
pF1KE4 NEVIQNPATITRILLSHFNWDKEKLMERYFDGNLEKLFAECHVI-NPSKKSRTRQMNTRS
:.. ......: .:.:. ....:: .: .:..: .: :::: .. .
CCDS27 ASVLKVSHSVAKLILVNFHWQVSEILDRY-KSNSAQLLVEARVQPNPSK-------HVPT
90 100 110 120 130
180 190 200 210 220 230
pF1KE4 SAQDMPCQICYLNYPNSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDIL
: : .:. . . .: : :.:: .:: .. .. ....:.: .:: :. : .
CCDS27 SHPPHHCAVCMQFVRKENLLSLACQHQFCRSCWEQHCSV-LVKDGVGVGVSCMAQDCPLR
140 150 160 170 180 190
240 250 260 270 280 290
pF1KE4 VDDNTVMRLITDSKVKLKYQHLITNSFVECNRLLKWCPAPDCHHVVKVQYPDAKPVRC-K
. .. :. :. . ... ::.. . ..:: . :. ::. :: :..:: : :. :.: .
CCDS27 TPEDFVFPLLPNEELREKYRRYLFRDYVESHYQLQLCPGADCPMVIRVQEPRARRVQCNR
200 210 220 230 240 250
300 310 320 330 340 350
pF1KE4 CGRQFCFNCGENWHDPVKCKWLKKWIKKCDDDSETSNWIAANTKECPKCHVTIEKDGGCN
:.. :::.: . .: :. : ..::. :: :::::.:.:.:.::.::::.. :::.::::
CCDS27 CNEVFCFKCRQMYHAPTDCATIRKWLTKCADDSETANYISAHTKDCPKCNICIEKNGGCN
260 270 280 290 300 310
360 370 380 390 400 410
pF1KE4 HMVCRNQNCKAEFCWVCLGPWEPHGSAWYNCNRYNEDDAKAARDAQERSRAALQRYLFYC
:: : ..:: .:::.::: :. ::: .:.:.::.:. . .. : ..: ::..::::
CCDS27 HMQC--SKCKHDFCWMCLGDWKTHGSEYYECSRYKENPDIVNQSQQAQAREALKKYLFYF
320 330 340 350 360
420 430 440 450 460 470
pF1KE4 NRYMNHMQSLRFEHKLYAQVKQKMEEMQQHNM-SWIEVQFLKKAVDVLCQCRATLMYTYV
.:. :: .::..: . : ....:..: ..:. .::. :.:..:. .: .:: ::.:::
CCDS27 ERWENHNKSLQLEAQTYQRIHEKIQERVMNNLGTWIDWQYLQNAAKLLAKCRYTLQYTYP
370 380 390 400 410 420
480 490 500 510 520 530
pF1KE4 FAFYLKKNNQSIIFENNQADLENATEVLSGYLERDISQDSLQDIKQKVQDKYRYCESRRR
.:.:.... .. .:: .::.:: : :: .:: : : . ...... :.:::
CCDS27 YAYYMESGPRKKLFEYQQAQLEAEIENLSWKVERADSYD-----RGDLENQMHIAEQRRR
430 440 450 460 470 480
540 550
pF1KE4 VLLQHVHEGYEKDLWEYIED
.::. :.
CCDS27 TLLKDFHDT
490
>>CCDS4890.1 CUL9 gene_id:23113|Hs108|chr6 (2517 aa)
initn: 279 init1: 106 opt: 517 Z-score: 327.2 bits: 72.5 E(32554): 6.6e-12
Smith-Waterman score: 578; 26.0% identity (54.0% similar) in 454 aa overlap (104-545:1997-2417)
80 90 100 110 120 130
pF1KE4 LGPGGGGGGGGGGGGGGPGHEQEEDYRYEVLTAEQILQHMVECIREVNEVIQNPATITRI
.. ... : . .:.:.:... ...
CCDS48 PFCGSQSETSKPSPEAVATLASLQLPAGRTMSPQEVEGLMKQTVRQVQETLNLEPDVAQH
1970 1980 1990 2000 2010 2020
140 150 160 170 180 190
pF1KE4 LLSHFNWDKEKLMERYFDGNLEKLFAECHVINPSKKSRTRQMNTRSSAQDMPCQICYLNY
::.: .: :.:.. : . :.: .. .. .: . : .:
CCDS48 LLAHSHWGAEQLLQSYSEDPEPLLLAAGLCVHQAQAVPVRPDH---------CPVCVSPL
2030 2040 2050 2060 2070
200 210 220 230 240 250
pF1KE4 P-NSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDILVDDNTVMRLITDS
.. . .: : : : .::.:::::.: :... . .:: : . ....
CCDS48 GCDDDLPSLCCMHYCCKSCWNEYLTTRI-EQNLVLNCTCPIADCPAQPTGAFIRAIVSSP
2080 2090 2100 2110 2120 2130
260 270 280 290 300 310
pF1KE4 KVKLKYQHLITNSFVECNRLLKWCPAPD-CHHVVKVQYPDAKPVRCKCGRQFCFNCG-EN
.: ::.. . ..:: : :: :. : ... : . ::: ::::. .
CCDS48 EVISKYEKALLRGYVESCSNLTWCTNPQGCDRILCRQGLGCGTTCSKCGWASCFNCSFPE
2140 2150 2160 2170 2180 2190
320 330 340 350 360
pF1KE4 WHDPVKCKWLKKWIKKCDD---------DSETSNWIAANTKECPKCHVTIEKDGGCNHMV
: :..: ...:. :: ...... .:.::.:.. :::. :: ::.
CCDS48 AHYPASCGHMSQWV---DDGGYYDGMSVEAQSKHLAKLISKRCPSCQAPIEKNEGCLHMT
2200 2210 2220 2230 2240 2250
370 380 390 400 410 420
pF1KE4 CRNQNCKAEFCWVCLGPWEPHGSAWYNCNRYNEDDAKAARDAQERSRAALQRYLFYCNRY
: .:. ::: :: :.:. . .:::. . .:::: ::. :. : .:
CCDS48 C--AKCNHGFCWRCLKSWKPNHKDYYNCSAMV---SKAAR--QEK------RFQDYNERC
2260 2270 2280 2290 2300
430 440 450 460 470 480
pF1KE4 MNHMQSLRFEHKLYAQVKQKMEEMQQHNMSWIEVQFLKKAVDVLCQCRATLMYTYVFAFY
: :. .: .: .:. : .... ::. : . : : : .: :. :..::
CCDS48 TFHHQAREFAVNLRNRVSAIHEVPPPRSFT-----FLNDACQGLEQARKVLAYACVYSFY
2310 2320 2330 2340 2350
490 500 510 520 530 540
pF1KE4 LKKNNQSIIFENNQADLENATEVLSGYLERDISQDSLQDIKQKVQDKYRYCESRRRVLLQ
. . . :.. .:: :..:. ::. . . .:. .... : : ::.
CCDS48 SQDAEYMDVVEQQTENLELHTNALQILLEETLLR--CRDLASSLRLLRADCLSTGMELLR
2360 2370 2380 2390 2400 2410
550
pF1KE4 HVHEGYEKDLWEYIED
...:
CCDS48 RIQERLLAILQHSAQDFRVGLQSPSVEAWEAKGPNMPGSQPQASSGPEAEEEEEDDEDDV
2420 2430 2440 2450 2460 2470
557 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 01:50:27 2016 done: Sun Nov 6 01:50:27 2016
Total Scan time: 3.720 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]
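
The "The best scores are:" summary in the report above can be read programmatically. The following is a minimal sketch, assuming the whitespace-collapsed layout shown here (accession, free-text description, library sequence length in parentheses, then opt, bits and E-value). The names `Hit`, `HIT_RE` and `parse_best_scores` are hypothetical helpers introduced for illustration and are not part of the FASTA package.

```python
import re
from typing import Iterable, List, NamedTuple


class Hit(NamedTuple):
    """One row of the best-scores summary (hypothetical container)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed row layout: accession, description, "( len)", opt, bits, E-value.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<e>[0-9.eE+-]+)\s*$"
)


def parse_best_scores(lines: Iterable[str]) -> List[Hit]:
    """Collect summary rows between the table header and the first '>>' record."""
    hits: List[Hit] = []
    in_table = False
    for raw in lines:
        line = raw.strip()
        if line.startswith("The best scores are:"):
            in_table = True
            continue
        if not in_table:
            continue
        if line.startswith(">>"):
            break  # the per-hit alignment records start here
        match = HIT_RE.match(line)
        if match:
            hits.append(
                Hit(
                    accession=match["acc"],
                    description=match["desc"],
                    length=int(match["len"]),
                    opt=int(match["opt"]),
                    bits=float(match["bits"]),
                    evalue=float(match["e"]),
                )
            )
    return hits


# Rows copied from the report above.
SAMPLE = [
    "The best scores are: opt bits E(32554)",
    "CCDS10244.1 ARIH1 gene_id:25820|Hs108|chr15 ( 557) 4005 465.8 6.2e-131",
    "CCDS2780.1 ARIH2 gene_id:10425|Hs108|chr3 ( 493) 1135 141.6 2.1e-33",
    "CCDS4890.1 CUL9 gene_id:23113|Hs108|chr6 (2517) 517 72.5 6.6e-12",
    ">>CCDS10244.1 ARIH1 gene_id:25820|Hs108|chr15 (557 aa)",
]

if __name__ == "__main__":
    for hit in parse_best_scores(SAMPLE):
        print(hit.accession, hit.length, hit.opt, hit.bits, hit.evalue)
```

The parser stops at the first line starting with ">>", so alignment lines are never mistaken for summary rows. If the report was produced with different display options, the row pattern would likely need adjusting.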