FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0576, 271 aa
1>>>pF1KE0576 271 - 271 aa - 271 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4581+/-0.000671; mu= 14.5497+/- 0.040
mean_var=67.6817+/-13.747, 0's: 0 Z-trim(110.2): 16 B-trim: 11 in 1/50
Lambda= 0.155897
statistics sampled from 11412 (11428) to 11412 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.725), E-opt: 0.2 (0.351), width: 16
Scan time: 1.940
The best scores are:                                       opt  bits E(32554)
CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22    ( 386) 1180 273.8 1.3e-73
CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22    ( 373)  508 122.7 3.9e-28
CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22      ( 357)  494 119.5 3.3e-27
CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22      ( 382)  494 119.5 3.5e-27
CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22     ( 190)  331  82.7 2.1e-16
>>CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22 (386 aa)
initn: 1464 init1: 1180 opt: 1180 Z-score: 1435.3 bits: 273.8 E(32554): 1.3e-73
Smith-Waterman score: 1330; 74.5% identity (74.5% similar) in 271 aa overlap (1-271:1-202)
10 20 30 40 50 60
pF1KE0 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKIKRGRSNLLWDTGVFRGPVL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKIKRGRSNLLWDTGVFRGPVL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 PKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQITWFVSWNPCLPCVVKVTKFL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 PKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQITWFVSWNPCLPCVVKVTKFL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 AEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGARVKIMDYEGERCRGQGSMTGRNSLR
:::::::::::::::::::::::::::::::::::::::::::
CCDS46 AEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGARVKIMDYE-----------------
130 140 150 160
190 200 210 220 230 240
pF1KE0 DGWICNAMAGGVPGQPAGVGLALIATDSQETRPGRAGPGSGESLSASHLFISDFAYCWEN
::::::::
CCDS46 ----------------------------------------------------DFAYCWEN
170
250 260 270
pF1KE0 FVCNEGQPFMPWYKFDDNYASLHRTLKEILR
:::::::::::::::::::::::::::::::
CCDS46 FVCNEGQPFMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESWL
180 190 200 210 220 230
CCDS46 CFTMEVTKHHSAVFRKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNTNYEVTWYTSWS
240 250 260 270 280 290
>--
initn: 601 init1: 315 opt: 429 Z-score: 522.4 bits: 104.9 E(32554): 8.9e-23
Smith-Waterman score: 429; 42.0% identity (63.7% similar) in 157 aa overlap (7-163:203-347)
10 20 30
pF1KE0 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLC
:::: :: :: .:.: ::. .:::
CCDS46 VCNEGQPFMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESWLC
180 190 200 210 220 230
40 50 60 70 80 90
pF1KE0 YEVKIKRGRSNLLWDTGVFRGPVLPKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANR
. ... . .: .. ::::. : :. . ::: ::::::: . : :
CCDS46 FTMEVTKHHSAVFRKRGVFRNQVDPETHC------------HAERCFLSWFCDDILSPNT
240 250 260 270 280
100 110 120 130 140 150
pF1KE0 RFQITWFVSWNPCLPCVVKVTKFLAEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGAR
...::..::.:: :. .:..:::.: ::.::: .::: :. : :.. : : . ::
CCDS46 NYEVTWYTSWSPCPECAGEVAEFLARHSNVNLTIFTARLCYFWDTDYQEGLCSLSQEGAS
290 300 310 320 330 340
160 170 180 190 200 210
pF1KE0 VKIMDYEGERCRGQGSMTGRNSLRDGWICNAMAGGVPGQPAGVGLALIATDSQETRPGRA
:::: :.
CCDS46 VKIMGYKDFVSCWKNFVYSDDEPFKPWKGLQTNFRLLKRRLREILQ
350 360 370 380
>>CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22 (373 aa)
initn: 1095 init1: 481 opt: 508 Z-score: 618.7 bits: 122.7 E(32554): 3.9e-28
Smith-Waterman score: 771; 50.6% identity (59.8% similar) in 271 aa overlap (1-271:1-189)
10 20 30 40 50 60
pF1KE0 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKIKRGRSNLLWDTGVFRGPVL
:.:..:: .:::::::: :: :.::: :. .::::::: : : : :. .:::
CCDS33 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVKTK-GPSRPRLDAKIFRG---
10 20 30 40 50
70 80 90 100 110 120
pF1KE0 PKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQITWFVSWNPCLPCVVKVTKFL
.:: . :.:::::::::::::.::: . :::::::::.:: ::.:...::
CCDS33 ---------QVYSQPEHHAEMCFLSWFCGNQLPAYKCFQITWFVSWTPCPDCVAKLAEFL
60 70 80 90 100
130 140 150 160 170 180
pF1KE0 AEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGARVKIMDYEGERCRGQGSMTGRNSLR
:::::::::::::::::: .::.: .: :: .::::::::: :
CCDS33 AEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVKIMDDE-----------------
110 120 130 140 150
190 200 210 220 230 240
pF1KE0 DGWICNAMAGGVPGQPAGVGLALIATDSQETRPGRAGPGSGESLSASHLFISDFAYCWEN
.:::::::
CCDS33 ----------------------------------------------------EFAYCWEN
250 260 270
pF1KE0 FVCNEGQPFMPWYKFDDNYASLHRTLKEILR
:: .:::::::::::::::: ::::::::::
CCDS33 FVYSEGQPFMPWYKFDDNYAFLHRTLKEILRNPMEAMYPHIFYFHFKNLRKAYGRNESWL
160 170 180 190 200 210
CCDS33 CFTMEVVKHHSPVSWKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNTNYEVTWYTSWS
220 230 240 250 260 270
>--
initn: 626 init1: 320 opt: 457 Z-score: 556.7 bits: 111.2 E(32554): 1.1e-24
Smith-Waterman score: 458; 33.3% identity (48.1% similar) in 264 aa overlap (7-270:190-372)
10 20 30
pF1KE0 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLC
:::: :: :: .:.: :::. .:::
CCDS33 VYSEGQPFMPWYKFDDNYAFLHRTLKEILRNPMEAMYPHIFYFHFKNLRKAYGRNESWLC
160 170 180 190 200 210
40 50 60 70 80 90
pF1KE0 YEVKIKRGRSNLLWDTGVFRGPVLPKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANR
. ... . .: . : ::::. : :. . ::: ::::::: . : :
CCDS33 FTMEVVKHHSPVSWKRGVFRNQVDPETHC------------HAERCFLSWFCDDILSPNT
220 230 240 250 260
100 110 120 130 140 150
pF1KE0 RFQITWFVSWNPCLPCVVKVTKFLAEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGAR
...::..::.:: :. .:..:::.: ::.::: .:::::. : :.. : : . ::
CCDS33 NYEVTWYTSWSPCPECAGEVAEFLARHSNVNLTIFTARLYYFWDTDYQEGLRSLSQEGAS
270 280 290 300 310 320
160 170 180 190 200 210
pF1KE0 VKIMDYEGERCRGQGSMTGRNSLRDGWICNAMAGGVPGQPAGVGLALIATDSQETRPGRA
:.:: :.
CCDS33 VEIMGYK-----------------------------------------------------
330
220 230 240 250 260 270
pF1KE0 GPGSGESLSASHLFISDFAYCWENFVCNEGQPFMPWYKFDDNYASLHRTLKEILR
:: ::::::: :. .:: :: . :. : :.:::
CCDS33 ----------------DFKYCWENFVYNDDEPFKPWKGLKYNFLFLDSKLQEILE
340 350 360 370
>>CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22 (357 aa)
initn: 731 init1: 494 opt: 494 Z-score: 602.0 bits: 119.5 E(32554): 3.3e-27
Smith-Waterman score: 969; 58.7% identity (64.9% similar) in 271 aa overlap (1-271:1-190)
10 20 30 40 50 60
pF1KE0 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKIKRGRSNLLWDTGVFRGPVL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKIKRGRSNLLWDTGVFRG---
10 20 30 40 50
70 80 90 100 110 120
pF1KE0 PKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQITWFVSWNPCLPCVVKVTKFL
.:::. . :::::::::::::.::: . :::::::::.:: ::.:...::
CCDS58 ---------QVYFKPQYHAEMCFLSWFCGNQLPAYKCFQITWFVSWTPCPDCVAKLAEFL
60 70 80 90 100
130 140 150 160 170 180
pF1KE0 AEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGARVKIMDYEGERCRGQGSMTGRNSLR
.::::::::::::::::: .::.: .: :: .::::: :::::
CCDS58 SEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVTIMDYE-----------------
110 120 130 140 150
190 200 210 220 230 240
pF1KE0 DGWICNAMAGGVPGQPAGVGLALIATDSQETRPGRAGPGSGESLSASHLFISDFAYCWEN
.:::::::
CCDS58 ----------------------------------------------------EFAYCWEN
250 260 270
pF1KE0 FVCNEGQPFMPWYKFDDNYASLHRTLKEILR
:: :::: ::::::::.::: ::::::::::
CCDS58 FVYNEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYE
160 170 180 190 200 210
CCDS58 VERLDNGTWVLMDQHMGFLCNELDPAQIYRVTWFISWSPCFSWGCAGEVRAFLQENTHVR
220 230 240 250 260 270
>>CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22 (382 aa)
initn: 731 init1: 494 opt: 494 Z-score: 601.5 bits: 119.5 E(32554): 3.5e-27
Smith-Waterman score: 969; 58.7% identity (64.9% similar) in 271 aa overlap (1-271:1-190)
10 20 30 40 50 60
pF1KE0 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKIKRGRSNLLWDTGVFRGPVL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKIKRGRSNLLWDTGVFRG---
10 20 30 40 50
70 80 90 100 110 120
pF1KE0 PKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQITWFVSWNPCLPCVVKVTKFL
.:::. . :::::::::::::.::: . :::::::::.:: ::.:...::
CCDS13 ---------QVYFKPQYHAEMCFLSWFCGNQLPAYKCFQITWFVSWTPCPDCVAKLAEFL
60 70 80 90 100
130 140 150 160 170 180
pF1KE0 AEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGARVKIMDYEGERCRGQGSMTGRNSLR
.::::::::::::::::: .::.: .: :: .::::: :::::
CCDS13 SEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVTIMDYE-----------------
110 120 130 140 150
190 200 210 220 230 240
pF1KE0 DGWICNAMAGGVPGQPAGVGLALIATDSQETRPGRAGPGSGESLSASHLFISDFAYCWEN
.:::::::
CCDS13 ----------------------------------------------------EFAYCWEN
250 260 270
pF1KE0 FVCNEGQPFMPWYKFDDNYASLHRTLKEILR
:: :::: ::::::::.::: ::::::::::
CCDS13 FVYNEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYE
160 170 180 190 200 210
CCDS13 VERLDNGTWVLMDQHMGFLCNEAKNLLCGFYGRHAELRFLDLVPSLQLDPAQIYRVTWFI
220 230 240 250 260 270
>>CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22 (190 aa)
initn: 611 init1: 316 opt: 331 Z-score: 408.0 bits: 82.7 E(32554): 2.1e-16
Smith-Waterman score: 482; 36.4% identity (50.7% similar) in 272 aa overlap (1-271:1-190)
10 20 30 40 50
pF1KE0 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVK-IKRGRSNLLWDTGVFRGPV
:::::::::. :: ::: .:.: :. ::::. :. ::: :: . : :::::. :
CCDS13 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKR-RSVVSWKTGVFRNQV
10 20 30 40 50
60 70 80 90 100 110
pF1KE0 LPKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQITWFVSWNPCLPCVVKVTKF
. .: ::: ::::::: . : : ..:.::..::.:: :. .:..:
CCDS13 ---DSETH---------CHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEF
60 70 80 90 100
120 130 140 150 160 170
pF1KE0 LAEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGARVKIMDYEGERCRGQGSMTGRNSL
::.: ::.::: .:::::.. .. : : . :. :.:::::
CCDS13 LARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYE----------------
110 120 130 140 150
180 190 200 210 220 230
pF1KE0 RDGWICNAMAGGVPGQPAGVGLALIATDSQETRPGRAGPGSGESLSASHLFISDFAYCWE
:: ::::
CCDS13 -----------------------------------------------------DFKYCWE
240 250 260 270
pF1KE0 NFVCNEGQPFMPWYKFDDNYASLHRTLKEILR
::: :...:: :: . :. :.: :.: :.
CCDS13 NFVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ
160 170 180 190
271 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Wed Nov 2 22:31:48 2016 done: Wed Nov 2 22:31:48 2016
Total Scan time: 1.940 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]