FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1115, 190 aa
1>>>pF1KE1115 190 - 190 aa - 190 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4721+/-0.000723; mu= 11.4767+/- 0.043
mean_var=59.2261+/-12.145, 0's: 0 Z-trim(108.4): 22 B-trim: 129 in 1/49
Lambda= 0.166655
statistics sampled from 10163 (10181) to 10163 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.693), E-opt: 0.2 (0.313), width: 16
Scan time: 1.830
The best scores are: opt bits E(32554)
CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22 ( 190) 1374 338.3 1.7e-93
CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22 ( 373) 1067 264.6 5.1e-71
CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22 ( 386) 1030 255.7 2.5e-68
CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22 ( 357) 703 177.1 1.1e-44
CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22 ( 382) 703 177.1 1.2e-44
CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22 ( 384) 513 131.4 6.5e-31
CCDS41747.1 AICDA gene_id:57379|Hs108|chr12 ( 198) 485 124.6 3.8e-29
CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22 ( 199) 417 108.3 3.2e-24
CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6 ( 224) 383 100.1 1e-21
CCDS81662.1 AICDA gene_id:57379|Hs108|chr12 ( 188) 365 95.7 1.8e-20
CCDS13985.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 183) 353 92.9 1.3e-19
CCDS54531.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 182) 352 92.6 1.5e-19
CCDS54530.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 200) 352 92.6 1.6e-19
CCDS8579.1 APOBEC1 gene_id:339|Hs108|chr12 ( 236) 283 76.1 1.9e-14
CCDS54532.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 154) 266 71.9 2.1e-13
>>CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22 (190 aa)
initn: 1374 init1: 1374 opt: 1374 Z-score: 1792.2 bits: 338.3 E(32554): 1.7e-93
Smith-Waterman score: 1374; 100.0% identity (100.0% similar) in 190 aa overlap (1-190:1-190)
10 20 30 40 50 60
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKTGVFRNQVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKTGVFRNQVD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 SETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIFT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 SETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIFT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 ARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTNFRL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 ARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTNFRL
130 140 150 160 170 180
190
pF1KE1 LKRRLRESLQ
::::::::::
CCDS13 LKRRLRESLQ
190
>>CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22 (373 aa)
initn: 1651 init1: 1065 opt: 1067 Z-score: 1388.5 bits: 264.6 E(32554): 5.1e-71
Smith-Waterman score: 1067; 79.0% identity (89.8% similar) in 186 aa overlap (5-190:188-373)
10 20 30
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETW
.::::.:::: :::.:::: .: :::.:
CCDS33 NFVYSEGQPFMPWYKFDDNYAFLHRTLKEILRNPMEAMYPHIFYFHFKNLRKAYGRNESW
160 170 180 190 200 210
40 50 60 70 80 90
pF1KE1 LCFTVEGIKRRSVVSWKTGVFRNQVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSW
::::.: .:..: :::: :::::::: :::::::::::::::::::::::.:.:::::::
CCDS33 LCFTMEVVKHHSPVSWKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNTNYEVTWYTSW
220 230 240 250 260 270
100 110 120 130 140 150
pF1KE1 SPCPDCAGEVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFK
::::.::::::::::::::::::::::::::: ::::::::::::..:::: :.:::
CCDS33 SPCPECAGEVAEFLARHSNVNLTIFTARLYYFWDTDYQEGLRSLSQEGASVEIMGYKDFK
280 290 300 310 320 330
160 170 180 190
pF1KE1 YCWENFVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ
::::::::::.:::::::::: :: .: .:.: :.
CCDS33 YCWENFVYNDDEPFKPWKGLKYNFLFLDSKLQEILE
340 350 360 370
>--
initn: 600 init1: 503 opt: 624 Z-score: 812.9 bits: 158.1 E(32554): 5.9e-39
Smith-Waterman score: 624; 47.6% identity (71.7% similar) in 187 aa overlap (1-187:1-186)
10 20 30 40 50 60
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKTGVFRNQVD
:.:..:: .. :: :: ..: : . :: .:::. :. : : . .::.::
CCDS33 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVK-TKGPSRPRLDAKIFRGQVY
10 20 30 40 50
70 80 90 100 110 120
pF1KE1 SETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIFT
:. . ::: ::::::: . : .:.::..::.:::::....:::::.: ::.::: .
CCDS33 SQPEHHAEMCFLSWFCGNQLPAYKCFQITWFVSWTPCPDCVAKLAEFLAEHPNVTLTISA
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE1 ARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTNFRL
:::::. :...: ::: :. :.::: :.: ::::::::....:: :: . :. .
CCDS33 ARLYYYWERDYRRALCRLSQAGARVKIMDDEEFAYCWENFVYSEGQPFMPWYKFDDNYAF
120 130 140 150 160 170
190
pF1KE1 LKRRLRESLQ
:.: :.:
CCDS33 LHRTLKEILRNPMEAMYPHIFYFHFKNLRKAYGRNESWLCFTMEVVKHHSPVSWKRGVFR
180 190 200 210 220 230
>>CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22 (386 aa)
initn: 1030 init1: 1030 opt: 1030 Z-score: 1340.2 bits: 255.7 E(32554): 2.5e-68
Smith-Waterman score: 1030; 78.0% identity (88.7% similar) in 186 aa overlap (5-190:201-386)
10 20 30
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETW
.::::.:::: :::.:::: .: :::.:
CCDS46 NFVCNEGQPFMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESW
180 190 200 210 220 230
40 50 60 70 80 90
pF1KE1 LCFTVEGIKRRSVVSWKTGVFRNQVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSW
::::.: :..:.: : :::::::: :::::::::::::::::::::::.:.:::::::
CCDS46 LCFTMEVTKHHSAVFRKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNTNYEVTWYTSW
240 250 260 270 280 290
100 110 120 130 140 150
pF1KE1 SPCPDCAGEVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFK
::::.:::::::::::::::::::::::: :: ::::: ::::::..:.:: :.::
CCDS46 SPCPECAGEVAEFLARHSNVNLTIFTARLCYFWDTDYQEGLCSLSQEGASVKIMGYKDFV
300 310 320 330 340 350
160 170 180 190
pF1KE1 YCWENFVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ
::.::::.:.:::::::::.:::::::::::: ::
CCDS46 SCWKNFVYSDDEPFKPWKGLQTNFRLLKRRLREILQ
360 370 380
>--
initn: 625 init1: 442 opt: 627 Z-score: 816.5 bits: 158.8 E(32554): 3.7e-39
Smith-Waterman score: 627; 48.5% identity (67.5% similar) in 200 aa overlap (1-187:1-199)
10 20 30 40 50
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKR-RSVVSWKTGVFRNQV
:::::::::. :: ::: .:.: :. ::::. :. ::: :: . : :::::. :
CCDS46 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVK-IKRGRSNLLWDTGVFRGPV
10 20 30 40 50
60 70 80 90 100
pF1KE1 DSETHC------------HAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEF
. . ::: ::::::: . : : ..:.::..::.:: :. .:..:
CCDS46 LPKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQITWFVSWNPCLPCVVKVTKF
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE1 LARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEP
::.: ::.::: .:::::.. .. : : . :. :.::::::: ::::::: :...:
CCDS46 LAEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGARVKIMDYEDFAYCWENFVCNEGQP
120 130 140 150 160 170
170 180 190
pF1KE1 FKPWKGLKTNFRLLKRRLRESLQ
: :: . :. :.: :.:
CCDS46 FMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESWLCFTMEVTK
180 190 200 210 220 230
>>CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22 (357 aa)
initn: 669 init1: 669 opt: 703 Z-score: 915.8 bits: 177.1 E(32554): 1.1e-44
Smith-Waterman score: 703; 53.4% identity (73.8% similar) in 191 aa overlap (1-190:1-190)
10 20 30 40 50
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKR-RSVVSWKTGVFRNQV
:::::::::. :: ::: .:.: :. ::::. :. ::: :: . : :::::.::
CCDS58 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVK-IKRGRSNLLWDTGVFRGQV
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 DSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIF
. . ::: ::::::: . : .:.::..::.:::::....::::..: ::.:::
CCDS58 YFKPQYHAEMCFLSWFCGNQLPAYKCFQITWFVSWTPCPDCVAKLAEFLSEHPNVTLTIS
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 TARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTNFR
.:::::. :...: ::: :. : :::::.: :::::::::... : :: . :.
CCDS58 AARLYYYWERDYRRALCRLSQAGARVTIMDYEEFAYCWENFVYNEGQQFMPWYKFDENYA
120 130 140 150 160 170
180 190
pF1KE1 LLKRRLRESLQ
.:.: :.: :.
CCDS58 FLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVERLDNGTWVLMDQHMGFLC
180 190 200 210 220 230
>--
initn: 426 init1: 212 opt: 381 Z-score: 497.4 bits: 99.7 E(32554): 2.2e-21
Smith-Waterman score: 419; 40.3% identity (65.2% similar) in 181 aa overlap (12-190:193-353)
10 20 30 40
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEG
: : :: :.:.: . : .:.::. ::
CCDS58 NEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVE-
170 180 190 200 210 220
50 60 70 80 90
pF1KE1 IKRRSVVSWKTGVFRNQVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPD--
: . .: :. .: : ....:.. :.: :.:::. ::::: .
CCDS58 --RLDNGTW---VLMDQ-------H-----MGFLCNE-LDPAQIYRVTWFISWSPCFSWG
230 240 250 260
100 110 120 130 140 150
pF1KE1 CAGEVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWEN
::::: :: ....: : ::.::.: .. : :.:.:. : . :. : :: :..:.:::..
CCDS58 CAGEVRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDEFEYCWDT
270 280 290 300 310 320
160 170 180 190
pF1KE1 FVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ
::: .. ::.:: ::. . . :. ::: ::
CCDS58 FVYRQGCPFQPWDGLEEHSQALSGRLRAILQNQGN
330 340 350
>>CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22 (382 aa)
initn: 669 init1: 669 opt: 703 Z-score: 915.4 bits: 177.1 E(32554): 1.2e-44
Smith-Waterman score: 703; 53.4% identity (73.8% similar) in 191 aa overlap (1-190:1-190)
10 20 30 40 50
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKR-RSVVSWKTGVFRNQV
:::::::::. :: ::: .:.: :. ::::. :. ::: :: . : :::::.::
CCDS13 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVK-IKRGRSNLLWDTGVFRGQV
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 DSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIF
. . ::: ::::::: . : .:.::..::.:::::....::::..: ::.:::
CCDS13 YFKPQYHAEMCFLSWFCGNQLPAYKCFQITWFVSWTPCPDCVAKLAEFLSEHPNVTLTIS
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 TARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTNFR
.:::::. :...: ::: :. : :::::.: :::::::::... : :: . :.
CCDS13 AARLYYYWERDYRRALCRLSQAGARVTIMDYEEFAYCWENFVYNEGQQFMPWYKFDENYA
120 130 140 150 160 170
180 190
pF1KE1 LLKRRLRESLQ
.:.: :.: :.
CCDS13 FLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVERLDNGTWVLMDQHMGFLC
180 190 200 210 220 230
>--
initn: 449 init1: 212 opt: 433 Z-score: 564.5 bits: 112.2 E(32554): 4e-25
Smith-Waterman score: 433; 40.4% identity (64.9% similar) in 188 aa overlap (12-190:193-378)
10 20 30 40
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEG
: : :: :.:.: . : .:.::. ::
CCDS13 NEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVER
170 180 190 200 210 220
50 60 70 80 90
pF1KE1 IKRRSVV--SWKTGVFRNQVDSETHC-----HAERCFLSWFCDDILSPNTKYQVTWYTSW
. . : . . : . :.. .. : ::: ::. . :.: :.:::. ::
CCDS13 LDNGTWVLMDQHMGFLCNEA-KNLLCGFYGRHAELRFLDLVPSLQLDPAQIYRVTWFISW
230 240 250 260 270 280
100 110 120 130 140 150
pF1KE1 SPCPD--CAGEVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYED
::: . ::::: :: ....: : ::.::.: .. : :.:.:. : . :. : :: :..
CCDS13 SPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDE
290 300 310 320 330 340
160 170 180 190
pF1KE1 FKYCWENFVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ
:.:::..::: .. ::.:: ::. . . :. ::: ::
CCDS13 FEYCWDTFVYRQGCPFQPWDGLEEHSQALSGRLRAILQNQGN
350 360 370 380
>>CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22 (384 aa)
initn: 478 init1: 275 opt: 513 Z-score: 668.4 bits: 131.4 E(32554): 6.5e-31
Smith-Waterman score: 513; 43.6% identity (67.2% similar) in 195 aa overlap (1-190:1-194)
10 20 30 40 50 60
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKTGVFRNQVD
:.:..:: .. :: :: ..: : . :: .:::. :. : : . .::.::
CCDS13 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVK-TKGPSRPPLDAKIFRGQVY
10 20 30 40 50
70 80 90 100 110
pF1KE1 SETHCHAERCFLSWFCD-DILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIF
:: . : : :. :: : . .:.:::: ::::: :. ..: :::. .:.::::
CCDS13 SELKYHPEMRFFHWFSKWRKLHRDQEYEVTWYISWSPCTKCTRDMATFLAEDPKVTLTIF
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 TARLYYFQYPCYQEGLRSLSQ--EG--VAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLK
.:::::: : :::.:::: : .: ....::.:..:..:: .:::.. : :.::..:
CCDS13 VARLYYFWDPDYQEALRSLCQKRDGPRATMKIMNYDEFQHCWSKFVYSQRELFEPWNNLP
120 130 140 150 160 170
180 190
pF1KE1 TNFRLLKRRLRESLQ
. ::. : : :.
CCDS13 KYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVERMHNDTWVLLNQRR
180 190 200 210 220 230
>--
initn: 478 init1: 205 opt: 455 Z-score: 593.1 bits: 117.5 E(32554): 1e-26
Smith-Waterman score: 455; 41.4% identity (67.2% similar) in 186 aa overlap (11-190:196-380)
10 20 30 40
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVE
.: : :: :.:.: . :.::.::. ::
CCDS13 YSQRELFEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVE
170 180 190 200 210 220
50 60 70 80 90
pF1KE1 GIKRRSVV--SWKTGVFRNQVDSETHC----HAERCFLSWFCDDILSPNTKYQVTWYTSW
.. . : . . : . ::. . ::: :::. . :. . :.:: .:::
CCDS13 RMHNDTWVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLDVIPFWKLDLDQDYRVTCFTSW
230 240 250 260 270 280
100 110 120 130 140 150
pF1KE1 SPCPDCAGEVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFK
::: .:: :.:.:......:.: :::::.: : : :::::.:.. :. . :: : .::
CCDS13 SPCFSCAQEMAKFISKNKHVSLCIFTARIYDDQGRC-QEGLRTLAEAGAKISIMTYSEFK
290 300 310 320 330 340
160 170 180 190
pF1KE1 YCWENFVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ
.::..:: ... ::.:: :: . . :. ::: ::
CCDS13 HCWDTFVDHQGCPFQPWDGLDEHSQDLSGRLRAILQNQEN
350 360 370 380
>>CCDS41747.1 AICDA gene_id:57379|Hs108|chr12 (198 aa)
initn: 495 init1: 238 opt: 485 Z-score: 636.8 bits: 124.6 E(32554): 3.8e-29
Smith-Waterman score: 485; 44.9% identity (69.3% similar) in 176 aa overlap (17-189:11-180)
10 20 30 40 50
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKT--GVFRNQ
: .::::. :. : ::.::..:. .: :..:.. : .::.
CCDS41 MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVK--RRDSATSFSLDFGYLRNK
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 VDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTI
. ::.: :: .. : :.:. :.:::.:::::: ::: .::.:: . :..: :
CCDS41 ----NGCHVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLRGNPNLSLRI
60 70 80 90 100
120 130 140 150 160 170
pF1KE1 FTARLYYFQ-YPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTN
::::::. . :::: : . :: . :: ..:. :::..:: : .. :: :.::. :
CCDS41 FTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHERTFKAWEGLHEN
110 120 130 140 150 160
180 190
pF1KE1 FRLLKRRLRESLQ
:.:.::. :
CCDS41 SVRLSRQLRRILLPLYEVDDLRDAFRTLGL
170 180 190
>>CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22 (199 aa)
initn: 408 init1: 200 opt: 417 Z-score: 548.4 bits: 108.3 E(32554): 3.2e-24
Smith-Waterman score: 417; 39.1% identity (63.5% similar) in 192 aa overlap (8-190:9-195)
10 20 30 40 50
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSW--KTGVFRN
: . : : : .:.: . :..:.::. :: . . :. . : ..:
CCDS13 MEASPASGPRHLMDPHIFTSNFNN---GIGRHKTYLCYEVERLDNGTSVKMDQHRGFLHN
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 QVDSETHC-----HAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPD--CAGEVAEFLAR
:. . : ::: ::. . :.: :.:::. ::::: . ::::: :: .
CCDS13 QAKNLL-CGFYGRHAELRFLDLVPSLQLDPAQIYRVTWFISWSPCFSWGCAGEVRAFLQE
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 HSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKP
...: : ::.::.: .. : :.:.:. : . :. : :: :..::.::..:: ... ::.:
CCDS13 NTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDEFKHCWDTFVDHQGCPFQP
120 130 140 150 160 170
180 190
pF1KE1 WKGLKTNFRLLKRRLRESLQ
: :: . . :. ::: ::
CCDS13 WDGLDEHSQALSGRLRAILQNQGN
180 190
>>CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6 (224 aa)
initn: 377 init1: 243 opt: 383 Z-score: 503.3 bits: 100.1 E(32554): 1e-21
Smith-Waterman score: 383; 32.6% identity (69.1% similar) in 181 aa overlap (14-190:48-224)
10 20 30 40
pF1KE1 MNPQIRNPMKAMYPGTFY-FQFKNLWEANDRNETWLCFTVEGI
:..:. :::.:. .. ::.:.::..::.
CCDS48 GEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQ
20 30 40 50 60 70
50 60 70 80 90 100
pF1KE1 KRRSVVSWKTGVFRNQVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAG
. . :. . : .. : .. :::. :.. . ..: .:.::::.: ::: ::
CCDS48 GKGGQVQASRGYLE---DEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAACAD
80 90 100 110 120 130
110 120 130 140 150 160
pF1KE1 EVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVY
.. . :.. .:. : :...::.... : : .:..:.. : ..:: .::.: :.:::
CCDS48 RIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQNFVE
140 150 160 170 180 190
170 180 190
pF1KE1 ND---NEPFKPWKGLKTNFRLLKRRLRESLQ
.. .. :.::. .. :: ...: . :.
CCDS48 QEEGESKAFQPWEDIQENFLYYEEKLADILK
200 210 220
>>CCDS81662.1 AICDA gene_id:57379|Hs108|chr12 (188 aa)
initn: 410 init1: 238 opt: 365 Z-score: 481.2 bits: 95.7 E(32554): 1.8e-20
Smith-Waterman score: 402; 41.5% identity (64.8% similar) in 176 aa overlap (17-189:11-170)
10 20 30 40 50
pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKT--GVFRNQ
: .::::. :. : ::.::..:. .: :..:.. : .::.
CCDS81 MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVK--RRDSATSFSLDFGYLRNK
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 VDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTI
. ::.: :: .. : :.:. :.:::.:::::: ::: .::.:: . :..: :
CCDS81 ----NGCHVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLRGNPNLSLRI
60 70 80 90 100
120 130 140 150 160 170
pF1KE1 FTARLYYFQ-YPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTN
::::::. . :::: : . :: . :: ... : .. :: :.::. :
CCDS81 FTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKE----------NHERTFKAWEGLHEN
110 120 130 140 150
180 190
pF1KE1 FRLLKRRLRESLQ
:.:.::. :
CCDS81 SVRLSRQLRRILLPLYEVDDLRDAFRTLGL
160 170 180
190 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 04:37:09 2016 done: Mon Nov 7 04:37:10 2016
Total Scan time: 1.830 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]