FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1115, 190 aa
  1>>>pF1KE1115 190 - 190 aa - 190 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4721+/-0.000723; mu= 11.4767+/- 0.043
 mean_var=59.2261+/-12.145, 0's: 0 Z-trim(108.4): 22 B-trim: 129 in 1/49
 Lambda= 0.166655
 statistics sampled from 10163 (10181) to 10163 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.693), E-opt: 0.2 (0.313), width: 16
 Scan time: 1.830

The best scores are:                                    opt  bits E(32554)
CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22  ( 190) 1374 338.3 1.7e-93
CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22 ( 373) 1067 264.6 5.1e-71
CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22 ( 386) 1030 255.7 2.5e-68
CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22   ( 357)  703 177.1 1.1e-44
CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22   ( 382)  703 177.1 1.2e-44
CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22  ( 384)  513 131.4 6.5e-31
CCDS41747.1 AICDA gene_id:57379|Hs108|chr12     ( 198)  485 124.6 3.8e-29
CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22 ( 199)  417 108.3 3.2e-24
CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6     ( 224)  383 100.1 1e-21
CCDS81662.1 AICDA gene_id:57379|Hs108|chr12     ( 188)  365  95.7 1.8e-20
CCDS13985.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 183)  353  92.9 1.3e-19
CCDS54531.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 182)  352  92.6 1.5e-19
CCDS54530.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 200)  352  92.6 1.6e-19
CCDS8579.1 APOBEC1 gene_id:339|Hs108|chr12      ( 236)  283  76.1 1.9e-14
CCDS54532.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 154)  266  71.9 2.1e-13

>>CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22 (190 aa)
 initn: 1374 init1: 1374 opt: 1374 Z-score: 1792.2 bits: 338.3 E(32554): 1.7e-93
Smith-Waterman score: 1374; 100.0% identity (100.0% similar) in 190 aa overlap (1-190:1-190)

10 20 30
40 50 60 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKTGVFRNQVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKTGVFRNQVD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 SETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 SETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIFT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 ARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTNFRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 ARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTNFRL 130 140 150 160 170 180 190 pF1KE1 LKRRLRESLQ :::::::::: CCDS13 LKRRLRESLQ 190 >>CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22 (373 aa) initn: 1651 init1: 1065 opt: 1067 Z-score: 1388.5 bits: 264.6 E(32554): 5.1e-71 Smith-Waterman score: 1067; 79.0% identity (89.8% similar) in 186 aa overlap (5-190:188-373) 10 20 30 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETW .::::.:::: :::.:::: .: :::.: CCDS33 NFVYSEGQPFMPWYKFDDNYAFLHRTLKEILRNPMEAMYPHIFYFHFKNLRKAYGRNESW 160 170 180 190 200 210 40 50 60 70 80 90 pF1KE1 LCFTVEGIKRRSVVSWKTGVFRNQVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSW ::::.: .:..: :::: :::::::: :::::::::::::::::::::::.:.::::::: CCDS33 LCFTMEVVKHHSPVSWKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNTNYEVTWYTSW 220 230 240 250 260 270 100 110 120 130 140 150 pF1KE1 SPCPDCAGEVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFK ::::.::::::::::::::::::::::::::: ::::::::::::..:::: :.::: CCDS33 SPCPECAGEVAEFLARHSNVNLTIFTARLYYFWDTDYQEGLRSLSQEGASVEIMGYKDFK 280 290 300 310 320 330 160 170 180 190 pF1KE1 YCWENFVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ ::::::::::.:::::::::: :: .: .:.: :. 
CCDS33 YCWENFVYNDDEPFKPWKGLKYNFLFLDSKLQEILE 340 350 360 370 >-- initn: 600 init1: 503 opt: 624 Z-score: 812.9 bits: 158.1 E(32554): 5.9e-39 Smith-Waterman score: 624; 47.6% identity (71.7% similar) in 187 aa overlap (1-187:1-186) 10 20 30 40 50 60 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKTGVFRNQVD :.:..:: .. :: :: ..: : . :: .:::. :. : : . .::.:: CCDS33 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVK-TKGPSRPRLDAKIFRGQVY 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 SETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIFT :. . ::: ::::::: . : .:.::..::.:::::....:::::.: ::.::: . CCDS33 SQPEHHAEMCFLSWFCGNQLPAYKCFQITWFVSWTPCPDCVAKLAEFLAEHPNVTLTISA 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 ARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTNFRL :::::. :...: ::: :. :.::: :.: ::::::::....:: :: . :. . CCDS33 ARLYYYWERDYRRALCRLSQAGARVKIMDDEEFAYCWENFVYSEGQPFMPWYKFDDNYAF 120 130 140 150 160 170 190 pF1KE1 LKRRLRESLQ :.: :.: CCDS33 LHRTLKEILRNPMEAMYPHIFYFHFKNLRKAYGRNESWLCFTMEVVKHHSPVSWKRGVFR 180 190 200 210 220 230 >>CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22 (386 aa) initn: 1030 init1: 1030 opt: 1030 Z-score: 1340.2 bits: 255.7 E(32554): 2.5e-68 Smith-Waterman score: 1030; 78.0% identity (88.7% similar) in 186 aa overlap (5-190:201-386) 10 20 30 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETW .::::.:::: :::.:::: .: :::.: CCDS46 NFVCNEGQPFMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESW 180 190 200 210 220 230 40 50 60 70 80 90 pF1KE1 LCFTVEGIKRRSVVSWKTGVFRNQVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSW ::::.: :..:.: : :::::::: :::::::::::::::::::::::.:.::::::: CCDS46 LCFTMEVTKHHSAVFRKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNTNYEVTWYTSW 240 250 260 270 280 290 100 110 120 130 140 150 pF1KE1 SPCPDCAGEVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFK ::::.:::::::::::::::::::::::: :: ::::: ::::::..:.:: :.:: CCDS46 SPCPECAGEVAEFLARHSNVNLTIFTARLCYFWDTDYQEGLCSLSQEGASVKIMGYKDFV 300 310 320 330 340 350 160 170 180 190 pF1KE1 YCWENFVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ 
::.::::.:.:::::::::.:::::::::::: :: CCDS46 SCWKNFVYSDDEPFKPWKGLQTNFRLLKRRLREILQ 360 370 380 >-- initn: 625 init1: 442 opt: 627 Z-score: 816.5 bits: 158.8 E(32554): 3.7e-39 Smith-Waterman score: 627; 48.5% identity (67.5% similar) in 200 aa overlap (1-187:1-199) 10 20 30 40 50 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKR-RSVVSWKTGVFRNQV :::::::::. :: ::: .:.: :. ::::. :. ::: :: . : :::::. : CCDS46 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVK-IKRGRSNLLWDTGVFRGPV 10 20 30 40 50 60 70 80 90 100 pF1KE1 DSETHC------------HAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEF . . ::: ::::::: . : : ..:.::..::.:: :. .:..: CCDS46 LPKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQITWFVSWNPCLPCVVKVTKF 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 LARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEP ::.: ::.::: .:::::.. .. : : . :. :.::::::: ::::::: :...: CCDS46 LAEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGARVKIMDYEDFAYCWENFVCNEGQP 120 130 140 150 160 170 170 180 190 pF1KE1 FKPWKGLKTNFRLLKRRLRESLQ : :: . :. :.: :.: CCDS46 FMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESWLCFTMEVTK 180 190 200 210 220 230 >>CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22 (357 aa) initn: 669 init1: 669 opt: 703 Z-score: 915.8 bits: 177.1 E(32554): 1.1e-44 Smith-Waterman score: 703; 53.4% identity (73.8% similar) in 191 aa overlap (1-190:1-190) 10 20 30 40 50 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKR-RSVVSWKTGVFRNQV :::::::::. :: ::: .:.: :. ::::. :. ::: :: . : :::::.:: CCDS58 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVK-IKRGRSNLLWDTGVFRGQV 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 DSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIF . . ::: ::::::: . : .:.::..::.:::::....::::..: ::.::: CCDS58 YFKPQYHAEMCFLSWFCGNQLPAYKCFQITWFVSWTPCPDCVAKLAEFLSEHPNVTLTIS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 TARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTNFR .:::::. :...: ::: :. : :::::.: :::::::::... : :: . :. 
CCDS58 AARLYYYWERDYRRALCRLSQAGARVTIMDYEEFAYCWENFVYNEGQQFMPWYKFDENYA 120 130 140 150 160 170 180 190 pF1KE1 LLKRRLRESLQ .:.: :.: :. CCDS58 FLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVERLDNGTWVLMDQHMGFLC 180 190 200 210 220 230 >-- initn: 426 init1: 212 opt: 381 Z-score: 497.4 bits: 99.7 E(32554): 2.2e-21 Smith-Waterman score: 419; 40.3% identity (65.2% similar) in 181 aa overlap (12-190:193-353) 10 20 30 40 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEG : : :: :.:.: . : .:.::. :: CCDS58 NEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVE- 170 180 190 200 210 220 50 60 70 80 90 pF1KE1 IKRRSVVSWKTGVFRNQVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPD-- : . .: :. .: : ....:.. :.: :.:::. ::::: . CCDS58 --RLDNGTW---VLMDQ-------H-----MGFLCNE-LDPAQIYRVTWFISWSPCFSWG 230 240 250 260 100 110 120 130 140 150 pF1KE1 CAGEVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWEN ::::: :: ....: : ::.::.: .. : :.:.:. : . :. : :: :..:.:::.. CCDS58 CAGEVRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDEFEYCWDT 270 280 290 300 310 320 160 170 180 190 pF1KE1 FVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ ::: .. ::.:: ::. . . :. ::: :: CCDS58 FVYRQGCPFQPWDGLEEHSQALSGRLRAILQNQGN 330 340 350 >>CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22 (382 aa) initn: 669 init1: 669 opt: 703 Z-score: 915.4 bits: 177.1 E(32554): 1.2e-44 Smith-Waterman score: 703; 53.4% identity (73.8% similar) in 191 aa overlap (1-190:1-190) 10 20 30 40 50 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKR-RSVVSWKTGVFRNQV :::::::::. :: ::: .:.: :. ::::. :. ::: :: . : :::::.:: CCDS13 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVK-IKRGRSNLLWDTGVFRGQV 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 DSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIF . . ::: ::::::: . : .:.::..::.:::::....::::..: ::.::: CCDS13 YFKPQYHAEMCFLSWFCGNQLPAYKCFQITWFVSWTPCPDCVAKLAEFLSEHPNVTLTIS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 TARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTNFR .:::::. :...: ::: :. : :::::.: :::::::::... : :: . :. 
CCDS13 AARLYYYWERDYRRALCRLSQAGARVTIMDYEEFAYCWENFVYNEGQQFMPWYKFDENYA 120 130 140 150 160 170 180 190 pF1KE1 LLKRRLRESLQ .:.: :.: :. CCDS13 FLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVERLDNGTWVLMDQHMGFLC 180 190 200 210 220 230 >-- initn: 449 init1: 212 opt: 433 Z-score: 564.5 bits: 112.2 E(32554): 4e-25 Smith-Waterman score: 433; 40.4% identity (64.9% similar) in 188 aa overlap (12-190:193-378) 10 20 30 40 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEG : : :: :.:.: . : .:.::. :: CCDS13 NEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVER 170 180 190 200 210 220 50 60 70 80 90 pF1KE1 IKRRSVV--SWKTGVFRNQVDSETHC-----HAERCFLSWFCDDILSPNTKYQVTWYTSW . . : . . : . :.. .. : ::: ::. . :.: :.:::. :: CCDS13 LDNGTWVLMDQHMGFLCNEA-KNLLCGFYGRHAELRFLDLVPSLQLDPAQIYRVTWFISW 230 240 250 260 270 280 100 110 120 130 140 150 pF1KE1 SPCPD--CAGEVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYED ::: . ::::: :: ....: : ::.::.: .. : :.:.:. : . :. : :: :.. CCDS13 SPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDE 290 300 310 320 330 340 160 170 180 190 pF1KE1 FKYCWENFVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ :.:::..::: .. ::.:: ::. . . :. ::: :: CCDS13 FEYCWDTFVYRQGCPFQPWDGLEEHSQALSGRLRAILQNQGN 350 360 370 380 >>CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22 (384 aa) initn: 478 init1: 275 opt: 513 Z-score: 668.4 bits: 131.4 E(32554): 6.5e-31 Smith-Waterman score: 513; 43.6% identity (67.2% similar) in 195 aa overlap (1-190:1-194) 10 20 30 40 50 60 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKTGVFRNQVD :.:..:: .. :: :: ..: : . :: .:::. :. : : . .::.:: CCDS13 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVK-TKGPSRPPLDAKIFRGQVY 10 20 30 40 50 70 80 90 100 110 pF1KE1 SETHCHAERCFLSWFCD-DILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIF :: . : : :. :: : . .:.:::: ::::: :. ..: :::. 
.:.:::: CCDS13 SELKYHPEMRFFHWFSKWRKLHRDQEYEVTWYISWSPCTKCTRDMATFLAEDPKVTLTIF 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 TARLYYFQYPCYQEGLRSLSQ--EG--VAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLK .:::::: : :::.:::: : .: ....::.:..:..:: .:::.. : :.::..: CCDS13 VARLYYFWDPDYQEALRSLCQKRDGPRATMKIMNYDEFQHCWSKFVYSQRELFEPWNNLP 120 130 140 150 160 170 180 190 pF1KE1 TNFRLLKRRLRESLQ . ::. : : :. CCDS13 KYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVERMHNDTWVLLNQRR 180 190 200 210 220 230 >-- initn: 478 init1: 205 opt: 455 Z-score: 593.1 bits: 117.5 E(32554): 1e-26 Smith-Waterman score: 455; 41.4% identity (67.2% similar) in 186 aa overlap (11-190:196-380) 10 20 30 40 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVE .: : :: :.:.: . :.::.::. :: CCDS13 YSQRELFEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVE 170 180 190 200 210 220 50 60 70 80 90 pF1KE1 GIKRRSVV--SWKTGVFRNQVDSETHC----HAERCFLSWFCDDILSPNTKYQVTWYTSW .. . : . . : . ::. . ::: :::. . :. . :.:: .::: CCDS13 RMHNDTWVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLDVIPFWKLDLDQDYRVTCFTSW 230 240 250 260 270 280 100 110 120 130 140 150 pF1KE1 SPCPDCAGEVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFK ::: .:: :.:.:......:.: :::::.: : : :::::.:.. :. . :: : .:: CCDS13 SPCFSCAQEMAKFISKNKHVSLCIFTARIYDDQGRC-QEGLRTLAEAGAKISIMTYSEFK 290 300 310 320 330 340 160 170 180 190 pF1KE1 YCWENFVYNDNEPFKPWKGLKTNFRLLKRRLRESLQ .::..:: ... ::.:: :: . . :. ::: :: CCDS13 HCWDTFVDHQGCPFQPWDGLDEHSQDLSGRLRAILQNQEN 350 360 370 380 >>CCDS41747.1 AICDA gene_id:57379|Hs108|chr12 (198 aa) initn: 495 init1: 238 opt: 485 Z-score: 636.8 bits: 124.6 E(32554): 3.8e-29 Smith-Waterman score: 485; 44.9% identity (69.3% similar) in 176 aa overlap (17-189:11-180) 10 20 30 40 50 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKT--GVFRNQ : .::::. :. : ::.::..:. .: :..:.. : .::. CCDS41 MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVK--RRDSATSFSLDFGYLRNK 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 VDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTI . ::.: :: .. : :.:. 
:.:::.:::::: ::: .::.:: . :..: : CCDS41 ----NGCHVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLRGNPNLSLRI 60 70 80 90 100 120 130 140 150 160 170 pF1KE1 FTARLYYFQ-YPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTN ::::::. . :::: : . :: . :: ..:. :::..:: : .. :: :.::. : CCDS41 FTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHERTFKAWEGLHEN 110 120 130 140 150 160 180 190 pF1KE1 FRLLKRRLRESLQ :.:.::. : CCDS41 SVRLSRQLRRILLPLYEVDDLRDAFRTLGL 170 180 190 >>CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22 (199 aa) initn: 408 init1: 200 opt: 417 Z-score: 548.4 bits: 108.3 E(32554): 3.2e-24 Smith-Waterman score: 417; 39.1% identity (63.5% similar) in 192 aa overlap (8-190:9-195) 10 20 30 40 50 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSW--KTGVFRN : . : : : .:.: . :..:.::. :: . . :. . : ..: CCDS13 MEASPASGPRHLMDPHIFTSNFNN---GIGRHKTYLCYEVERLDNGTSVKMDQHRGFLHN 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 QVDSETHC-----HAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPD--CAGEVAEFLAR :. . : ::: ::. . :.: :.:::. ::::: . ::::: :: . CCDS13 QAKNLL-CGFYGRHAELRFLDLVPSLQLDPAQIYRVTWFISWSPCFSWGCAGEVRAFLQE 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 HSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKP ...: : ::.::.: .. : :.:.:. : . :. : :: :..::.::..:: ... ::.: CCDS13 NTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDEFKHCWDTFVDHQGCPFQP 120 130 140 150 160 170 180 190 pF1KE1 WKGLKTNFRLLKRRLRESLQ : :: . . :. ::: :: CCDS13 WDGLDEHSQALSGRLRAILQNQGN 180 190 >>CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6 (224 aa) initn: 377 init1: 243 opt: 383 Z-score: 503.3 bits: 100.1 E(32554): 1e-21 Smith-Waterman score: 383; 32.6% identity (69.1% similar) in 181 aa overlap (14-190:48-224) 10 20 30 40 pF1KE1 MNPQIRNPMKAMYPGTFY-FQFKNLWEANDRNETWLCFTVEGI :..:. :::.:. .. ::.:.::..::. CCDS48 GEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQ 20 30 40 50 60 70 50 60 70 80 90 100 pF1KE1 KRRSVVSWKTGVFRNQVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAG . . :. . : .. : .. :::. :.. . 
..: .:.::::.: ::: :: CCDS48 GKGGQVQASRGYLE---DEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAACAD 80 90 100 110 120 130 110 120 130 140 150 160 pF1KE1 EVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVY .. . :.. .:. : :...::.... : : .:..:.. : ..:: .::.: :.::: CCDS48 RIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQNFVE 140 150 160 170 180 190 170 180 190 pF1KE1 ND---NEPFKPWKGLKTNFRLLKRRLRESLQ .. .. :.::. .. :: ...: . :. CCDS48 QEEGESKAFQPWEDIQENFLYYEEKLADILK 200 210 220

>>CCDS81662.1 AICDA gene_id:57379|Hs108|chr12 (188 aa)
 initn: 410 init1: 238 opt: 365 Z-score: 481.2 bits: 95.7 E(32554): 1.8e-20
Smith-Waterman score: 402; 41.5% identity (64.8% similar) in 176 aa overlap (17-189:11-170)

10 20 30 40 50 pF1KE1 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKT--GVFRNQ : .::::. :. : ::.::..:. .: :..:.. : .::. CCDS81 MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVK--RRDSATSFSLDFGYLRNK 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 VDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTI . ::.: :: .. : :.:. :.:::.:::::: ::: .::.:: . :..: : CCDS81 ----NGCHVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLRGNPNLSLRI 60 70 80 90 100 120 130 140 150 160 170 pF1KE1 FTARLYYFQ-YPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLKTN ::::::. . :::: : . :: . :: ... : .. :: :.::. : CCDS81 FTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKE----------NHERTFKAWEGLHEN 110 120 130 140 150 180 190 pF1KE1 FRLLKRRLRESLQ :.:.::. : CCDS81 SVRLSRQLRRILLPLYEVDDLRDAFRTLGL 160 170 180

190 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 04:37:09 2016 done: Mon Nov 7 04:37:10 2016
 Total Scan time: 1.830 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]