FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5056, 224 aa 1>>>pF1KE5056 224 - 224 aa - 224 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4330+/-0.000724; mu= 12.7556+/- 0.043 mean_var=58.0059+/-11.471, 0's: 0 Z-trim(107.8): 17 B-trim: 0 in 0/50 Lambda= 0.168399 statistics sampled from 9797 (9813) to 9797 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.698), E-opt: 0.2 (0.301), width: 16 Scan time: 2.050 The best scores are: opt bits E(32554) CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6 ( 224) 1489 369.7 8.5e-103 CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22 ( 373) 403 105.9 3.6e-23 CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22 ( 190) 383 101.0 5.6e-22 CCDS41747.1 AICDA gene_id:57379|Hs108|chr12 ( 198) 358 94.9 3.9e-20 CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22 ( 357) 326 87.2 1.5e-17 CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22 ( 382) 326 87.2 1.6e-17 CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22 ( 384) 294 79.4 3.4e-15 CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22 ( 386) 272 74.1 1.4e-13 CCDS81662.1 AICDA gene_id:57379|Hs108|chr12 ( 188) 265 72.3 2.4e-13 CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22 ( 199) 254 69.6 1.6e-12 CCDS54531.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 182) 243 66.9 9.3e-12 CCDS13985.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 183) 243 66.9 9.4e-12 CCDS54530.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 200) 243 67.0 1e-11 >>CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6 (224 aa) initn: 1489 init1: 1489 opt: 1489 Z-score: 1959.1 bits: 369.7 E(32554): 8.5e-103 Smith-Waterman score: 1489; 100.0% identity (100.0% similar) in 224 aa overlap (1-224:1-224) 10 20 30 40 50 60 pF1KE5 MAQKEEAAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 MAQKEEAAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 YSSGRNKTFLCYVVEAQGKGGQVQASRGYLEDEHAAAHAEEAFFNTILPAFDPALRYNVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 YSSGRNKTFLCYVVEAQGKGGQVQASRGYLEDEHAAAHAEEAFFNTILPAFDPALRYNVT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 WYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 WYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMK 130 140 150 160 170 180 190 200 210 220 pF1KE5 PQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK :::::::::::::::::::::::::::::::::::::::::::: CCDS48 PQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK 190 200 210 220 >>CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22 (373 aa) initn: 541 init1: 219 opt: 403 Z-score: 529.5 bits: 105.9 E(32554): 3.6e-23 Smith-Waterman score: 403; 32.2% identity (68.3% similar) in 199 aa overlap (30-224:184-373) 10 20 30 40 50 pF1KE5 MAQKEEAAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNV :::... : : . ..: :.:.:. CCDS33 YCWENFVYSEGQPFMPWYKFDDNYAFLHRTLKEILRNPM------EAMYPHIFYFHFKNL 160 170 180 190 200 60 70 80 90 100 110 pF1KE5 EYSSGRNKTFLCYVVEAQGKGGQVQASRGYLE---DEHAAAHAEEAFFNTILP-AFDPAL . . :::...::...:. . . :. .:: .. : .. :::. :.. . ..: CCDS33 RKAYGRNESWLCFTMEVVKHHSPVSWKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNT 210 220 230 240 250 260 120 130 140 150 160 170 pF1KE5 RYNVTWYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCK :.::::.: ::: :: .. . :.. .:. : :...::... . . : .:..:.. : . CCDS33 NYEVTWYTSWSPCPECAGEVAEFLARHSNVNLTIFTARLYYFWDTDYQEGLRSLSQEGAS 270 280 290 300 310 320 180 190 200 210 220 pF1KE5 LRIMKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK ..:: .::.: :.::: ... . :.::. .. :::. . :: .::. CCDS33 VEIMGYKDFKYCWENFVYNDD---EPFKPWKGLKYNFLFLDSKLQEILE 330 340 350 360 370 >-- initn: 337 init1: 206 opt: 332 Z-score: 436.3 bits: 88.7 E(32554): 5.6e-18 Smith-Waterman score: 332; 32.6% identity (64.1% similar) in 184 aa overlap (37-214:3-179) 10 20 30 40 50 60 pF1KE5 AAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRN : :. : ::. . :...: : : :: CCDS33 MKPHFRN-TVERMYRDTFSYNFYNRPILSRRN 10 20 30 70 80 90 100 110 120 pF1KE5 KTFLCYVVEAQGKG-GQVQAS--RG--YLEDEHAAAHAEEAFFNTILPAFDPALR-YNVT ..::: :...: . ...:. :: : . :: ::: :.. . :: . ...: CCDS33 TVWLCYEVKTKGPSRPRLDAKIFRGQVYSQPEH---HAEMCFLSWFCGNQLPAYKCFQIT 40 50 60 70 80 130 140 150 160 170 180 pF1KE5 WYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMK :.:: .:: :. .. . :.. :. : : ..::... : . . :: .:..:: ...:: CCDS33 WFVSWTPCPDCVAKLAEFLAEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVKIMD 90 100 110 120 130 140 190 200 210 220 pF1KE5 PQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK ..: : :.::: .:.. :.:: ...:. . CCDS33 DEEFAYCWENFVY---SEGQPFMPWYKFDDNYAFLHRTLKEILRNPMEAMYPHIFYFHFK 150 160 170 180 190 200 CCDS33 NLRKAYGRNESWLCFTMEVVKHHSPVSWKRGVFRNQVDPETHCHAERCFLSWFCDDILSP 210 220 230 240 250 260 >>CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22 (190 aa) initn: 361 init1: 243 opt: 383 Z-score: 508.1 bits: 101.0 E(32554): 5.6e-22 Smith-Waterman score: 383; 32.6% identity (69.1% similar) in 181 aa overlap (48-224:14-190) 20 30 40 50 60 70 pF1KE5 GEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQ :..:. :::.:. .. ::.:.::..::. CCDS13 MNPQIRNPMKAMYPGTFY-FQFKNLWEANDRNETWLCFTVEGI 10 20 30 40 80 90 100 110 120 130 pF1KE5 GKGGQVQASRGYLE---DEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAACAD . . :. . : .. : .. :::. :.. . ..: .:.::::.: ::: :: CCDS13 KRRSVVSWKTGVFRNQVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAG 50 60 70 80 90 100 140 150 160 170 180 190 pF1KE5 RIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQNFVE .. . :.. .:. : :...::.... : : .:..:.. : ..:: .::.: :.::: CCDS13 EVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVY 110 120 130 140 150 160 200 210 220 pF1KE5 QEEGESKAFQPWEDIQENFLYYEEKLADILK .. .. :.::. .. :: ...: . :. CCDS13 ND---NEPFKPWKGLKTNFRLLKRRLRESLQ 170 180 190 >>CCDS41747.1 AICDA gene_id:57379|Hs108|chr12 (198 aa) initn: 271 init1: 199 opt: 358 Z-score: 474.9 bits: 94.9 E(32554): 3.9e-20 Smith-Waterman score: 358; 35.1% identity (68.4% similar) in 174 aa overlap (52-223:11-180) 30 40 50 60 70 80 pF1KE5 ENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQGKGG : .::.::....:: .:.:::::. . .. CCDS41 MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVKRRDSAT 10 20 30 40 90 100 110 120 130 140 pF1KE5 QVQASRGYLEDEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAACADRIIKTLS . . . :::..... :.: :. : .::. : :::..: ::: :: .. : CCDS41 SFSLDFGYLRNKNGC-HVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLR 50 60 70 80 90 150 160 170 180 190 pF1KE5 KTKNLRLLILVGRLFMWEEPEIQA-ALKKLKEAGCKLRIMKPQDFEYVWQNFVEQEEGES . :: : :...::.. :. . . .:..:..:: .. :: .:. : :..:::..: CCDS41 GNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHE--- 100 110 120 130 140 150 200 210 220 pF1KE5 KAFQPWEDIQENFLYYEEKLADILK ..:. :: ..:: . ..: :: CCDS41 RTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTLGL 160 170 180 190 >>CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22 (357 aa) initn: 379 init1: 204 opt: 326 Z-score: 428.7 bits: 87.2 E(32554): 1.5e-17 Smith-Waterman score: 326; 31.6% identity (63.1% similar) in 187 aa overlap (45-224:10-190) 20 30 40 50 60 70 pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV ::. . : .:.: ::. :.::: : CCDS58 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEV 10 20 30 80 90 100 110 120 pF1KE5 EAQ-GKGGQVQAS---RG--YLEDEHAAAHAEEAFFNTILPAFDPALR-YNVTWYVSSSP . . :... . . :: :.. .. ::: :.. . :: . ...::.:: .: CCDS58 KIKRGRSNLLWDTGVFRGQVYFKPQY---HAEMCFLSWFCGNQLPAYKCFQITWFVSWTP 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE5 CAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYV : :. .. . ::. :. : : ..::... : . . :: .:..:: .. :: ..: : CCDS58 CPDCVAKLAEFLSEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVTIMDYEEFAYC 100 110 120 130 140 150 190 200 210 220 pF1KE5 WQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK :.::: .: .. :.:: ..::. . .. : .::. CCDS58 WENFVYNE---GQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQ 160 170 180 190 200 210 CCDS58 TYLCYEVERLDNGTWVLMDQHMGFLCNELDPAQIYRVTWFISWSPCFSWGCAGEVRAFLQ 220 230 240 250 260 270 >-- initn: 287 init1: 94 opt: 288 Z-score: 378.8 bits: 78.0 E(32554): 8.8e-15 Smith-Waterman score: 288; 31.7% identity (57.2% similar) in 180 aa overlap (47-224:193-353) 20 30 40 50 60 70 pF1KE5 NGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEA . . : :.: : : .:.::: :: CCDS58 NEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVER 170 180 190 200 210 220 80 90 100 110 120 130 pF1KE5 QGKGGQVQASRGYLEDEHAAAHAEEAFFNTILPAFDPALRYNVTWYVSSSPCAA--CADR .: : : :.: . .: .::: : :::..: ::: . :: . CCDS58 LDNGTWV------LMDQHMGFLCNE---------LDPAQIYRVTWFISWSPCFSWGCAGE 230 240 250 260 140 150 160 170 180 190 pF1KE5 IIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQNFVEQ . :... ..:: :...:.. .. : . ::. :..:: .. :: ..::: :..:: . CCDS58 VRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDEFEYCWDTFVYR 270 280 290 300 310 320 200 210 220 pF1KE5 EEGESKAFQPWEDIQENFLYYEEKLADILK . . ::::. ..:. .: ::. CCDS58 Q---GCPFQPWDGLEEHSQALSGRLRAILQNQGN 330 340 350 >>CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22 (382 aa) initn: 379 init1: 204 opt: 326 Z-score: 428.2 bits: 87.2 E(32554): 1.6e-17 Smith-Waterman score: 326; 31.6% identity (63.1% similar) in 187 aa overlap (45-224:10-190) 20 30 40 50 60 70 pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV ::. . : .:.: ::. :.::: : CCDS13 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEV 10 20 30 80 90 100 110 120 pF1KE5 EAQ-GKGGQVQAS---RG--YLEDEHAAAHAEEAFFNTILPAFDPALR-YNVTWYVSSSP . . :... . . :: :.. .. ::: :.. . :: . ...::.:: .: CCDS13 KIKRGRSNLLWDTGVFRGQVYFKPQY---HAEMCFLSWFCGNQLPAYKCFQITWFVSWTP 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE5 CAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYV : :. .. . ::. :. : : ..::... : . . :: .:..:: .. :: ..: : CCDS13 CPDCVAKLAEFLSEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVTIMDYEEFAYC 100 110 120 130 140 150 190 200 210 220 pF1KE5 WQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK :.::: .: .. :.:: ..::. . .. : .::. CCDS13 WENFVYNE---GQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQ 160 170 180 190 200 210 CCDS13 TYLCYEVERLDNGTWVLMDQHMGFLCNEAKNLLCGFYGRHAELRFLDLVPSLQLDPAQIY 220 230 240 250 260 270 >-- initn: 287 init1: 94 opt: 309 Z-score: 405.9 bits: 83.1 E(32554): 2.7e-16 Smith-Waterman score: 309; 31.9% identity (59.7% similar) in 191 aa overlap (47-224:193-378) 20 30 40 50 60 70 pF1KE5 NGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEA . . : :.: : : .:.::: :: CCDS13 NEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVER 170 180 190 200 210 220 80 90 100 110 120 pF1KE5 QGKGGQVQASR--GYLEDEHA-------AAHAEEAFFNTILPAF--DPALRYNVTWYVSS .: : .. :.: .: . ::: :.. ..:.. ::: : :::..: CCDS13 LDNGTWVLMDQHMGFLCNEAKNLLCGFYGRHAELRFLD-LVPSLQLDPAQIYRVTWFISW 230 240 250 260 270 280 130 140 150 160 170 180 pF1KE5 SPCAA--CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQD ::: . :: .. :... ..:: :...:.. .. : . ::. :..:: .. :: .. CCDS13 SPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDE 290 300 310 320 330 340 190 200 210 220 pF1KE5 FEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK ::: :..:: .. . ::::. ..:. .: ::. CCDS13 FEYCWDTFVYRQ---GCPFQPWDGLEEHSQALSGRLRAILQNQGN 350 360 370 380 >>CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22 (384 aa) initn: 316 init1: 143 opt: 294 Z-score: 386.2 bits: 79.4 E(32554): 3.4e-15 Smith-Waterman score: 300; 32.6% identity (62.5% similar) in 184 aa overlap (52-224:202-380) 30 40 50 60 70 80 pF1KE5 ENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQGKGG : :.: : . ::..:.::: :: . . CCDS13 FEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVERMHNDT 180 190 200 210 220 230 90 100 110 120 130 pF1KE5 QV--QASRGYLEDE----HA---AAHAEEAFFNTILP--AFDPALRYNVTWYVSSSPCAA : . ::.: .. :. . ::: :.. ..: .: : :: ..: ::: . CCDS13 WVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLD-VIPFWKLDLDQDYRVTCFTSWSPCFS 240 250 260 270 280 290 140 150 160 170 180 190 pF1KE5 CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQN ::... : .::.:.. : :...:.. .. . : .:. : ::: :. :: ..:.. :.. CCDS13 CAQEMAKFISKNKHVSLCIFTARIYD-DQGRCQEGLRTLAEAGAKISIMTYSEFKHCWDT 300 310 320 330 340 200 210 220 pF1KE5 FVEQEEGESKAFQPWEDIQENFLYYEEKLADILK ::... . ::::. ..:. .: ::. CCDS13 FVDHQ---GCPFQPWDGLDEHSQDLSGRLRAILQNQEN 350 360 370 380 >-- initn: 316 init1: 143 opt: 294 Z-score: 386.2 bits: 79.4 E(32554): 3.4e-15 Smith-Waterman score: 294; 29.4% identity (62.4% similar) in 197 aa overlap (37-224:3-194) 10 20 30 40 50 60 pF1KE5 AAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRN : :. : ::. . :...: : : :: CCDS13 MKPHFRN-TVERMYRDTFSYNFYNRPILSRRN 10 20 30 70 80 90 100 110 120 pF1KE5 KTFLCYVVEAQGKGG---QVQASRGYLEDEHAAAHAEEAFFN--TILPAFDPALRYNVTW ..::: :...: . ... :: . .: : : ::. . . .:.::: CCDS13 TVWLCYEVKTKGPSRPPLDAKIFRGQVYSE-LKYHPEMRFFHWFSKWRKLHRDQEYEVTW 40 50 60 70 80 90 130 140 150 160 170 pF1KE5 YVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKL--KEAGCK--LR :.: :::. :. . :.. .. : :.:.::... .:. : ::..: :. : . .. CCDS13 YISWSPCTKCTRDMATFLAEDPKVTLTIFVARLYYFWDPDYQEALRSLCQKRDGPRATMK 100 110 120 130 140 150 180 190 200 210 220 pF1KE5 IMKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK ::. ..:.. :..:: ... . :.::... . .. . :..::. CCDS13 IMNYDEFQHCWSKFVYSQR---ELFEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFN 160 170 180 190 200 CCDS13 NEPWVRGRHETYLCYEVERMHNDTWVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLDVIP 210 220 230 240 250 260 >>CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22 (386 aa) initn: 488 init1: 203 opt: 272 Z-score: 357.3 bits: 74.1 E(32554): 1.4e-13 Smith-Waterman score: 365; 31.5% identity (66.3% similar) in 184 aa overlap (45-224:206-386) 20 30 40 50 60 70 pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV : . ..: :.:.:. . :::...::... CCDS46 EGQPFMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESWLCFTM 180 190 200 210 220 230 80 90 100 110 120 130 pF1KE5 EAQGKGGQVQASRGYLE---DEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAA :. . . : .:: .. : .. :::. :.. . ..: :.::::.: ::: CCDS46 EVTKHHSAVFRKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNTNYEVTWYTSWSPCPE 240 250 260 270 280 290 140 150 160 170 180 190 pF1KE5 CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQN :: .. . :.. .:. : :...:: .. . . : .: .:.. : ...:: .:: :.: CCDS46 CAGEVAEFLARHSNVNLTIFTARLCYFWDTDYQEGLCSLSQEGASVKIMGYKDFVSCWKN 300 310 320 330 340 350 200 210 220 pF1KE5 FVEQEEGESKAFQPWEDIQENFLYYEEKLADILK :: ... . :.::. .: :: ...: .::. CCDS46 FVYSDD---EPFKPWKGLQTNFRLLKRRLREILQ 360 370 380 >-- initn: 512 init1: 203 opt: 305 Z-score: 400.6 bits: 82.1 E(32554): 5.4e-16 Smith-Waterman score: 305; 29.6% identity (61.2% similar) in 196 aa overlap (45-224:10-202) 20 30 40 50 60 70 pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV ::. . : .:.: ::. :.::: : CCDS46 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEV 10 20 30 80 90 100 110 pF1KE5 EAQ-GKGGQVQAS---RGYLEDEHAAAHAEEAFFN-------TILPAFD----PA-LRYN . . :... . . :: . .. . : .:..: .: : :: :.. CCDS46 KIKRGRSNLLWDTGVFRGPVLPKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQ 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE5 VTWYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRI .::.:: .:: :. .. : :.. :. : : ..::..... . . .: .:..:: ...: CCDS46 ITWFVSWNPCLPCVVKVTKFLAEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGARVKI 100 110 120 130 140 150 180 190 200 210 220 pF1KE5 MKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK : .:: : :.::: .: .. :.:: ...:. .. : .::. CCDS46 MDYEDFAYCWENFVCNE---GQPFMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFH 160 170 180 190 200 210 CCDS46 FKNLLKACGRNESWLCFTMEVTKHHSAVFRKRGVFRNQVDPETHCHAERCFLSWFCDDIL 220 230 240 250 260 270 >>CCDS81662.1 AICDA gene_id:57379|Hs108|chr12 (188 aa) initn: 287 init1: 211 opt: 265 Z-score: 353.2 bits: 72.3 E(32554): 2.4e-13 Smith-Waterman score: 292; 32.8% identity (64.4% similar) in 174 aa overlap (52-223:11-170) 30 40 50 60 70 80 pF1KE5 ENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQGKGG : .::.::....:: .:.:::::. . .. CCDS81 MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVKRRDSAT 10 20 30 40 90 100 110 120 130 140 pF1KE5 QVQASRGYLEDEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAACADRIIKTLS . . . :::..... :.: :. : .::. : :::..: ::: :: .. : CCDS81 SFSLDFGYLRNKNGC-HVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLR 50 60 70 80 90 150 160 170 180 190 pF1KE5 KTKNLRLLILVGRLFMWEEPEIQA-ALKKLKEAGCKLRIMKPQDFEYVWQNFVEQEEGES . :: : :...::.. :. . . .:..:..:: .. :: .: :..: CCDS81 GNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIM----------TFKENHE--- 100 110 120 130 140 200 210 220 pF1KE5 KAFQPWEDIQENFLYYEEKLADILK ..:. :: ..:: . ..: :: CCDS81 RTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTLGL 150 160 170 180 >>CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22 (199 aa) initn: 286 init1: 87 opt: 254 Z-score: 338.3 bits: 69.6 E(32554): 1.6e-12 Smith-Waterman score: 311; 31.5% identity (62.4% similar) in 197 aa overlap (43-224:7-195) 20 30 40 50 60 70 pF1KE5 AASQNGEDLENLDDPEKLKELIELPPFEIVTGER--LPANFFKFQFRNVEYSSGRNKTFL .: : . ..: .: : . ::.::.: CCDS13 MEASPASGPRHLMDPHIFTSNFNN---GIGRHKTYL 10 20 30 80 90 100 110 pF1KE5 CYVVEAQGKGGQVQAS--RGYLEDEHA-------AAHAEEAFFNTILPAF--DPALRYNV :: :: .: .:. . ::.:... . ::: :.. . :.. ::: : : CCDS13 CYEVERLDNGTSVKMDQHRGFLHNQAKNLLCGFYGRHAELRFLDLV-PSLQLDPAQIYRV 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE5 TWYVSSSPCAA--CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLR ::..: ::: . :: .. :... ..:: :...:.. .. : . ::. :..:: .. CCDS13 TWFISWSPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVS 100 110 120 130 140 150 180 190 200 210 220 pF1KE5 IMKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK :: ..:.. :..::... . ::::. ..:. .: ::. CCDS13 IMTYDEFKHCWDTFVDHQ---GCPFQPWDGLDEHSQALSGRLRAILQNQGN 160 170 180 190 224 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:35:59 2016 done: Tue Nov 8 04:35:59 2016 Total Scan time: 2.050 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]