FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5056, 224 aa
1>>>pF1KE5056 224 - 224 aa - 224 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4330+/-0.000724; mu= 12.7556+/- 0.043
mean_var=58.0059+/-11.471, 0's: 0 Z-trim(107.8): 17 B-trim: 0 in 0/50
Lambda= 0.168399
statistics sampled from 9797 (9813) to 9797 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.698), E-opt: 0.2 (0.301), width: 16
Scan time: 2.050
The best scores are:                                    opt  bits E(32554)
CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6     ( 224) 1489 369.7 8.5e-103
CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22 ( 373)  403 105.9 3.6e-23
CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22  ( 190)  383 101.0 5.6e-22
CCDS41747.1 AICDA gene_id:57379|Hs108|chr12     ( 198)  358  94.9 3.9e-20
CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22   ( 357)  326  87.2 1.5e-17
CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22   ( 382)  326  87.2 1.6e-17
CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22  ( 384)  294  79.4 3.4e-15
CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22 ( 386)  272  74.1 1.4e-13
CCDS81662.1 AICDA gene_id:57379|Hs108|chr12     ( 188)  265  72.3 2.4e-13
CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22 ( 199)  254  69.6 1.6e-12
CCDS54531.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 182)  243  66.9 9.3e-12
CCDS13985.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 183)  243  66.9 9.4e-12
CCDS54530.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 200)  243  67.0 1e-11
>>CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6 (224 aa)
initn: 1489 init1: 1489 opt: 1489 Z-score: 1959.1 bits: 369.7 E(32554): 8.5e-103
Smith-Waterman score: 1489; 100.0% identity (100.0% similar) in 224 aa overlap (1-224:1-224)
10 20 30 40 50 60
pF1KE5 MAQKEEAAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 MAQKEEAAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 YSSGRNKTFLCYVVEAQGKGGQVQASRGYLEDEHAAAHAEEAFFNTILPAFDPALRYNVT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 YSSGRNKTFLCYVVEAQGKGGQVQASRGYLEDEHAAAHAEEAFFNTILPAFDPALRYNVT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 WYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 WYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMK
130 140 150 160 170 180
190 200 210 220
pF1KE5 PQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
::::::::::::::::::::::::::::::::::::::::::::
CCDS48 PQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
190 200 210 220
>>CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22 (373 aa)
initn: 541 init1: 219 opt: 403 Z-score: 529.5 bits: 105.9 E(32554): 3.6e-23
Smith-Waterman score: 403; 32.2% identity (68.3% similar) in 199 aa overlap (30-224:184-373)
10 20 30 40 50
pF1KE5 MAQKEEAAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNV
:::... : : . ..: :.:.:.
CCDS33 YCWENFVYSEGQPFMPWYKFDDNYAFLHRTLKEILRNPM------EAMYPHIFYFHFKNL
160 170 180 190 200
60 70 80 90 100 110
pF1KE5 EYSSGRNKTFLCYVVEAQGKGGQVQASRGYLE---DEHAAAHAEEAFFNTILP-AFDPAL
. . :::...::...:. . . :. .:: .. : .. :::. :.. . ..:
CCDS33 RKAYGRNESWLCFTMEVVKHHSPVSWKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNT
210 220 230 240 250 260
120 130 140 150 160 170
pF1KE5 RYNVTWYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCK
:.::::.: ::: :: .. . :.. .:. : :...::... . . : .:..:.. : .
CCDS33 NYEVTWYTSWSPCPECAGEVAEFLARHSNVNLTIFTARLYYFWDTDYQEGLRSLSQEGAS
270 280 290 300 310 320
180 190 200 210 220
pF1KE5 LRIMKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
..:: .::.: :.::: ... . :.::. .. :::. . :: .::.
CCDS33 VEIMGYKDFKYCWENFVYNDD---EPFKPWKGLKYNFLFLDSKLQEILE
330 340 350 360 370
>--
initn: 337 init1: 206 opt: 332 Z-score: 436.3 bits: 88.7 E(32554): 5.6e-18
Smith-Waterman score: 332; 32.6% identity (64.1% similar) in 184 aa overlap (37-214:3-179)
10 20 30 40 50 60
pF1KE5 AAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRN
: :. : ::. . :...: : : ::
CCDS33 MKPHFRN-TVERMYRDTFSYNFYNRPILSRRN
10 20 30
70 80 90 100 110 120
pF1KE5 KTFLCYVVEAQGKG-GQVQAS--RG--YLEDEHAAAHAEEAFFNTILPAFDPALR-YNVT
..::: :...: . ...:. :: : . :: ::: :.. . :: . ...:
CCDS33 TVWLCYEVKTKGPSRPRLDAKIFRGQVYSQPEH---HAEMCFLSWFCGNQLPAYKCFQIT
40 50 60 70 80
130 140 150 160 170 180
pF1KE5 WYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMK
:.:: .:: :. .. . :.. :. : : ..::... : . . :: .:..:: ...::
CCDS33 WFVSWTPCPDCVAKLAEFLAEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVKIMD
90 100 110 120 130 140
190 200 210 220
pF1KE5 PQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
..: : :.::: .:.. :.:: ...:. .
CCDS33 DEEFAYCWENFVY---SEGQPFMPWYKFDDNYAFLHRTLKEILRNPMEAMYPHIFYFHFK
150 160 170 180 190 200
CCDS33 NLRKAYGRNESWLCFTMEVVKHHSPVSWKRGVFRNQVDPETHCHAERCFLSWFCDDILSP
210 220 230 240 250 260
>>CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22 (190 aa)
initn: 361 init1: 243 opt: 383 Z-score: 508.1 bits: 101.0 E(32554): 5.6e-22
Smith-Waterman score: 383; 32.6% identity (69.1% similar) in 181 aa overlap (48-224:14-190)
20 30 40 50 60 70
pF1KE5 GEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQ
:..:. :::.:. .. ::.:.::..::.
CCDS13 MNPQIRNPMKAMYPGTFY-FQFKNLWEANDRNETWLCFTVEGI
10 20 30 40
80 90 100 110 120 130
pF1KE5 GKGGQVQASRGYLE---DEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAACAD
. . :. . : .. : .. :::. :.. . ..: .:.::::.: ::: ::
CCDS13 KRRSVVSWKTGVFRNQVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAG
50 60 70 80 90 100
140 150 160 170 180 190
pF1KE5 RIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQNFVE
.. . :.. .:. : :...::.... : : .:..:.. : ..:: .::.: :.:::
CCDS13 EVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVY
110 120 130 140 150 160
200 210 220
pF1KE5 QEEGESKAFQPWEDIQENFLYYEEKLADILK
.. .. :.::. .. :: ...: . :.
CCDS13 ND---NEPFKPWKGLKTNFRLLKRRLRESLQ
170 180 190
>>CCDS41747.1 AICDA gene_id:57379|Hs108|chr12 (198 aa)
initn: 271 init1: 199 opt: 358 Z-score: 474.9 bits: 94.9 E(32554): 3.9e-20
Smith-Waterman score: 358; 35.1% identity (68.4% similar) in 174 aa overlap (52-223:11-180)
30 40 50 60 70 80
pF1KE5 ENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQGKGG
: .::.::....:: .:.:::::. . ..
CCDS41 MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVKRRDSAT
10 20 30 40
90 100 110 120 130 140
pF1KE5 QVQASRGYLEDEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAACADRIIKTLS
. . . :::..... :.: :. : .::. : :::..: ::: :: .. :
CCDS41 SFSLDFGYLRNKNGC-HVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLR
50 60 70 80 90
150 160 170 180 190
pF1KE5 KTKNLRLLILVGRLFMWEEPEIQA-ALKKLKEAGCKLRIMKPQDFEYVWQNFVEQEEGES
. :: : :...::.. :. . . .:..:..:: .. :: .:. : :..:::..:
CCDS41 GNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHE---
100 110 120 130 140 150
200 210 220
pF1KE5 KAFQPWEDIQENFLYYEEKLADILK
..:. :: ..:: . ..: ::
CCDS41 RTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTLGL
160 170 180 190
>>CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22 (357 aa)
initn: 379 init1: 204 opt: 326 Z-score: 428.7 bits: 87.2 E(32554): 1.5e-17
Smith-Waterman score: 326; 31.6% identity (63.1% similar) in 187 aa overlap (45-224:10-190)
20 30 40 50 60 70
pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV
::. . : .:.: ::. :.::: :
CCDS58 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEV
10 20 30
80 90 100 110 120
pF1KE5 EAQ-GKGGQVQAS---RG--YLEDEHAAAHAEEAFFNTILPAFDPALR-YNVTWYVSSSP
. . :... . . :: :.. .. ::: :.. . :: . ...::.:: .:
CCDS58 KIKRGRSNLLWDTGVFRGQVYFKPQY---HAEMCFLSWFCGNQLPAYKCFQITWFVSWTP
40 50 60 70 80 90
130 140 150 160 170 180
pF1KE5 CAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYV
: :. .. . ::. :. : : ..::... : . . :: .:..:: .. :: ..: :
CCDS58 CPDCVAKLAEFLSEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVTIMDYEEFAYC
100 110 120 130 140 150
190 200 210 220
pF1KE5 WQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
:.::: .: .. :.:: ..::. . .. : .::.
CCDS58 WENFVYNE---GQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQ
160 170 180 190 200 210
CCDS58 TYLCYEVERLDNGTWVLMDQHMGFLCNELDPAQIYRVTWFISWSPCFSWGCAGEVRAFLQ
220 230 240 250 260 270
>--
initn: 287 init1: 94 opt: 288 Z-score: 378.8 bits: 78.0 E(32554): 8.8e-15
Smith-Waterman score: 288; 31.7% identity (57.2% similar) in 180 aa overlap (47-224:193-353)
20 30 40 50 60 70
pF1KE5 NGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEA
. . : :.: : : .:.::: ::
CCDS58 NEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVER
170 180 190 200 210 220
80 90 100 110 120 130
pF1KE5 QGKGGQVQASRGYLEDEHAAAHAEEAFFNTILPAFDPALRYNVTWYVSSSPCAA--CADR
.: : : :.: . .: .::: : :::..: ::: . :: .
CCDS58 LDNGTWV------LMDQHMGFLCNE---------LDPAQIYRVTWFISWSPCFSWGCAGE
230 240 250 260
140 150 160 170 180 190
pF1KE5 IIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQNFVEQ
. :... ..:: :...:.. .. : . ::. :..:: .. :: ..::: :..:: .
CCDS58 VRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDEFEYCWDTFVYR
270 280 290 300 310 320
200 210 220
pF1KE5 EEGESKAFQPWEDIQENFLYYEEKLADILK
. . ::::. ..:. .: ::.
CCDS58 Q---GCPFQPWDGLEEHSQALSGRLRAILQNQGN
330 340 350
>>CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22 (382 aa)
initn: 379 init1: 204 opt: 326 Z-score: 428.2 bits: 87.2 E(32554): 1.6e-17
Smith-Waterman score: 326; 31.6% identity (63.1% similar) in 187 aa overlap (45-224:10-190)
20 30 40 50 60 70
pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV
::. . : .:.: ::. :.::: :
CCDS13 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEV
10 20 30
80 90 100 110 120
pF1KE5 EAQ-GKGGQVQAS---RG--YLEDEHAAAHAEEAFFNTILPAFDPALR-YNVTWYVSSSP
. . :... . . :: :.. .. ::: :.. . :: . ...::.:: .:
CCDS13 KIKRGRSNLLWDTGVFRGQVYFKPQY---HAEMCFLSWFCGNQLPAYKCFQITWFVSWTP
40 50 60 70 80 90
130 140 150 160 170 180
pF1KE5 CAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYV
: :. .. . ::. :. : : ..::... : . . :: .:..:: .. :: ..: :
CCDS13 CPDCVAKLAEFLSEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVTIMDYEEFAYC
100 110 120 130 140 150
190 200 210 220
pF1KE5 WQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
:.::: .: .. :.:: ..::. . .. : .::.
CCDS13 WENFVYNE---GQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQ
160 170 180 190 200 210
CCDS13 TYLCYEVERLDNGTWVLMDQHMGFLCNEAKNLLCGFYGRHAELRFLDLVPSLQLDPAQIY
220 230 240 250 260 270
>--
initn: 287 init1: 94 opt: 309 Z-score: 405.9 bits: 83.1 E(32554): 2.7e-16
Smith-Waterman score: 309; 31.9% identity (59.7% similar) in 191 aa overlap (47-224:193-378)
20 30 40 50 60 70
pF1KE5 NGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEA
. . : :.: : : .:.::: ::
CCDS13 NEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVER
170 180 190 200 210 220
80 90 100 110 120
pF1KE5 QGKGGQVQASR--GYLEDEHA-------AAHAEEAFFNTILPAF--DPALRYNVTWYVSS
.: : .. :.: .: . ::: :.. ..:.. ::: : :::..:
CCDS13 LDNGTWVLMDQHMGFLCNEAKNLLCGFYGRHAELRFLD-LVPSLQLDPAQIYRVTWFISW
230 240 250 260 270 280
130 140 150 160 170 180
pF1KE5 SPCAA--CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQD
::: . :: .. :... ..:: :...:.. .. : . ::. :..:: .. :: ..
CCDS13 SPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDE
290 300 310 320 330 340
190 200 210 220
pF1KE5 FEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
::: :..:: .. . ::::. ..:. .: ::.
CCDS13 FEYCWDTFVYRQ---GCPFQPWDGLEEHSQALSGRLRAILQNQGN
350 360 370 380
>>CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22 (384 aa)
initn: 316 init1: 143 opt: 294 Z-score: 386.2 bits: 79.4 E(32554): 3.4e-15
Smith-Waterman score: 300; 32.6% identity (62.5% similar) in 184 aa overlap (52-224:202-380)
30 40 50 60 70 80
pF1KE5 ENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQGKGG
: :.: : . ::..:.::: :: . .
CCDS13 FEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVERMHNDT
180 190 200 210 220 230
90 100 110 120 130
pF1KE5 QV--QASRGYLEDE----HA---AAHAEEAFFNTILP--AFDPALRYNVTWYVSSSPCAA
: . ::.: .. :. . ::: :.. ..: .: : :: ..: ::: .
CCDS13 WVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLD-VIPFWKLDLDQDYRVTCFTSWSPCFS
240 250 260 270 280 290
140 150 160 170 180 190
pF1KE5 CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQN
::... : .::.:.. : :...:.. .. . : .:. : ::: :. :: ..:.. :..
CCDS13 CAQEMAKFISKNKHVSLCIFTARIYD-DQGRCQEGLRTLAEAGAKISIMTYSEFKHCWDT
300 310 320 330 340
200 210 220
pF1KE5 FVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
::... . ::::. ..:. .: ::.
CCDS13 FVDHQ---GCPFQPWDGLDEHSQDLSGRLRAILQNQEN
350 360 370 380
>--
initn: 316 init1: 143 opt: 294 Z-score: 386.2 bits: 79.4 E(32554): 3.4e-15
Smith-Waterman score: 294; 29.4% identity (62.4% similar) in 197 aa overlap (37-224:3-194)
10 20 30 40 50 60
pF1KE5 AAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRN
: :. : ::. . :...: : : ::
CCDS13 MKPHFRN-TVERMYRDTFSYNFYNRPILSRRN
10 20 30
70 80 90 100 110 120
pF1KE5 KTFLCYVVEAQGKGG---QVQASRGYLEDEHAAAHAEEAFFN--TILPAFDPALRYNVTW
..::: :...: . ... :: . .: : : ::. . . .:.:::
CCDS13 TVWLCYEVKTKGPSRPPLDAKIFRGQVYSE-LKYHPEMRFFHWFSKWRKLHRDQEYEVTW
40 50 60 70 80 90
130 140 150 160 170
pF1KE5 YVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKL--KEAGCK--LR
:.: :::. :. . :.. .. : :.:.::... .:. : ::..: :. : . ..
CCDS13 YISWSPCTKCTRDMATFLAEDPKVTLTIFVARLYYFWDPDYQEALRSLCQKRDGPRATMK
100 110 120 130 140 150
180 190 200 210 220
pF1KE5 IMKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
::. ..:.. :..:: ... . :.::... . .. . :..::.
CCDS13 IMNYDEFQHCWSKFVYSQR---ELFEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFN
160 170 180 190 200
CCDS13 NEPWVRGRHETYLCYEVERMHNDTWVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLDVIP
210 220 230 240 250 260
>>CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22 (386 aa)
initn: 488 init1: 203 opt: 272 Z-score: 357.3 bits: 74.1 E(32554): 1.4e-13
Smith-Waterman score: 365; 31.5% identity (66.3% similar) in 184 aa overlap (45-224:206-386)
20 30 40 50 60 70
pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV
: . ..: :.:.:. . :::...::...
CCDS46 EGQPFMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESWLCFTM
180 190 200 210 220 230
80 90 100 110 120 130
pF1KE5 EAQGKGGQVQASRGYLE---DEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAA
:. . . : .:: .. : .. :::. :.. . ..: :.::::.: :::
CCDS46 EVTKHHSAVFRKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNTNYEVTWYTSWSPCPE
240 250 260 270 280 290
140 150 160 170 180 190
pF1KE5 CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQN
:: .. . :.. .:. : :...:: .. . . : .: .:.. : ...:: .:: :.:
CCDS46 CAGEVAEFLARHSNVNLTIFTARLCYFWDTDYQEGLCSLSQEGASVKIMGYKDFVSCWKN
300 310 320 330 340 350
200 210 220
pF1KE5 FVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
:: ... . :.::. .: :: ...: .::.
CCDS46 FVYSDD---EPFKPWKGLQTNFRLLKRRLREILQ
360 370 380
>--
initn: 512 init1: 203 opt: 305 Z-score: 400.6 bits: 82.1 E(32554): 5.4e-16
Smith-Waterman score: 305; 29.6% identity (61.2% similar) in 196 aa overlap (45-224:10-202)
20 30 40 50 60 70
pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV
::. . : .:.: ::. :.::: :
CCDS46 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEV
10 20 30
80 90 100 110
pF1KE5 EAQ-GKGGQVQAS---RGYLEDEHAAAHAEEAFFN-------TILPAFD----PA-LRYN
. . :... . . :: . .. . : .:..: .: : :: :..
CCDS46 KIKRGRSNLLWDTGVFRGPVLPKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQ
40 50 60 70 80 90
120 130 140 150 160 170
pF1KE5 VTWYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRI
.::.:: .:: :. .. : :.. :. : : ..::..... . . .: .:..:: ...:
CCDS46 ITWFVSWNPCLPCVVKVTKFLAEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGARVKI
100 110 120 130 140 150
180 190 200 210 220
pF1KE5 MKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
: .:: : :.::: .: .. :.:: ...:. .. : .::.
CCDS46 MDYEDFAYCWENFVCNE---GQPFMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFH
160 170 180 190 200 210
CCDS46 FKNLLKACGRNESWLCFTMEVTKHHSAVFRKRGVFRNQVDPETHCHAERCFLSWFCDDIL
220 230 240 250 260 270
>>CCDS81662.1 AICDA gene_id:57379|Hs108|chr12 (188 aa)
initn: 287 init1: 211 opt: 265 Z-score: 353.2 bits: 72.3 E(32554): 2.4e-13
Smith-Waterman score: 292; 32.8% identity (64.4% similar) in 174 aa overlap (52-223:11-170)
30 40 50 60 70 80
pF1KE5 ENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQGKGG
: .::.::....:: .:.:::::. . ..
CCDS81 MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVKRRDSAT
10 20 30 40
90 100 110 120 130 140
pF1KE5 QVQASRGYLEDEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAACADRIIKTLS
. . . :::..... :.: :. : .::. : :::..: ::: :: .. :
CCDS81 SFSLDFGYLRNKNGC-HVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLR
50 60 70 80 90
150 160 170 180 190
pF1KE5 KTKNLRLLILVGRLFMWEEPEIQA-ALKKLKEAGCKLRIMKPQDFEYVWQNFVEQEEGES
. :: : :...::.. :. . . .:..:..:: .. :: .: :..:
CCDS81 GNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIM----------TFKENHE---
100 110 120 130 140
200 210 220
pF1KE5 KAFQPWEDIQENFLYYEEKLADILK
..:. :: ..:: . ..: ::
CCDS81 RTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTLGL
150 160 170 180
>>CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22 (199 aa)
initn: 286 init1: 87 opt: 254 Z-score: 338.3 bits: 69.6 E(32554): 1.6e-12
Smith-Waterman score: 311; 31.5% identity (62.4% similar) in 197 aa overlap (43-224:7-195)
20 30 40 50 60 70
pF1KE5 AASQNGEDLENLDDPEKLKELIELPPFEIVTGER--LPANFFKFQFRNVEYSSGRNKTFL
.: : . ..: .: : . ::.::.:
CCDS13 MEASPASGPRHLMDPHIFTSNFNN---GIGRHKTYL
10 20 30
80 90 100 110
pF1KE5 CYVVEAQGKGGQVQAS--RGYLEDEHA-------AAHAEEAFFNTILPAF--DPALRYNV
:: :: .: .:. . ::.:... . ::: :.. . :.. ::: : :
CCDS13 CYEVERLDNGTSVKMDQHRGFLHNQAKNLLCGFYGRHAELRFLDLV-PSLQLDPAQIYRV
40 50 60 70 80 90
120 130 140 150 160 170
pF1KE5 TWYVSSSPCAA--CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLR
::..: ::: . :: .. :... ..:: :...:.. .. : . ::. :..:: ..
CCDS13 TWFISWSPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVS
100 110 120 130 140 150
180 190 200 210 220
pF1KE5 IMKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
:: ..:.. :..::... . ::::. ..:. .: ::.
CCDS13 IMTYDEFKHCWDTFVDHQ---GCPFQPWDGLDEHSQALSGRLRAILQNQGN
160 170 180 190
224 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 04:35:59 2016 done: Tue Nov 8 04:35:59 2016
Total Scan time: 2.050 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]