FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1263, 607 aa 1>>>pF1KE1263 607 - 607 aa - 607 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.1610+/-0.00148; mu= -3.8906+/- 0.086 mean_var=314.8530+/-67.888, 0's: 0 Z-trim(106.8): 119 B-trim: 389 in 1/52 Lambda= 0.072280 statistics sampled from 9099 (9197) to 9099 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.638), E-opt: 0.2 (0.283), width: 16 Scan time: 3.780 The best scores are: opt bits E(32554) CCDS2503.2 ATG16L1 gene_id:55054|Hs108|chr2 ( 607) 3996 431.5 1.6e-120 CCDS2502.2 ATG16L1 gene_id:55054|Hs108|chr2 ( 588) 2165 240.5 4.6e-63 CCDS54438.1 ATG16L1 gene_id:55054|Hs108|chr2 ( 444) 2159 239.8 5.8e-63 CCDS31634.1 ATG16L2 gene_id:89849|Hs108|chr11 ( 619) 1059 125.2 2.5e-28 >>CCDS2503.2 ATG16L1 gene_id:55054|Hs108|chr2 (607 aa) initn: 3996 init1: 3996 opt: 3996 Z-score: 2277.4 bits: 431.5 E(32554): 1.6e-120 Smith-Waterman score: 3996; 100.0% identity (100.0% similar) in 607 aa overlap (1-607:1-607) 10 20 30 40 50 60 pF1KE1 MSSGLRAADFPRWKRHISEQLRRRDRLQRQAFEEIILQYNKLLEKSDLHSVLAQKLQAEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 MSSGLRAADFPRWKRHISEQLRRRDRLQRQAFEEIILQYNKLLEKSDLHSVLAQKLQAEK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 HDVPNRHEISPGHDGTWNDNQLQEMAQLRIKHQEELTELHKKRGELAQLVIDLNNQMQRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 HDVPNRHEISPGHDGTWNDNQLQEMAQLRIKHQEELTELHKKRGELAQLVIDLNNQMQRK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 DREMQMNEAKIAECLQTISDLETECLDLRTKLCDLERANQTLKDEYDALQITFTALEGKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 DREMQMNEAKIAECLQTISDLETECLDLRTKLCDLERANQTLKDEYDALQITFTALEGKL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 RKTTEENQELVTRWMAEKAQEANRLNAENEKDSRRRQARLQKELAEAAKEPLPVEQDDDI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 RKTTEENQELVTRWMAEKAQEANRLNAENEKDSRRRQARLQKELAEAAKEPLPVEQDDDI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 EVIVDETSDHTEETSPVRAISRAATKRLSQPAGGLLDSITNIFGRRSVSSFPVPQDNVDT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 EVIVDETSDHTEETSPVRAISRAATKRLSQPAGGLLDSITNIFGRRSVSSFPVPQDNVDT 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 HPGSGKEVRVPATALCVFDAHDGEVNAVQFSPGSRLLATGGMDRRVKLWEVFGEKCEFKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 HPGSGKEVRVPATALCVFDAHDGEVNAVQFSPGSRLLATGGMDRRVKLWEVFGEKCEFKG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 SLSGSNAGITSIEFDSAGSYLLAASNDFASRIWTVDDYRLRHTLTGHSGKVLSAKFLLDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 SLSGSNAGITSIEFDSAGSYLLAASNDFASRIWTVDDYRLRHTLTGHSGKVLSAKFLLDN 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 ARIVSGSHDRTLKLWDLRSKVCIKTVFAGSSCNDIVCTEQCVMSGHFDKKIRFWDIRSES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 ARIVSGSHDRTLKLWDLRSKVCIKTVFAGSSCNDIVCTEQCVMSGHFDKKIRFWDIRSES 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE1 IVREMELLGKITALDLNPERTELLSCSRDDLLKVIDLRTNAIKQTFSAPGFKCGSDWTRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 IVREMELLGKITALDLNPERTELLSCSRDDLLKVIDLRTNAIKQTFSAPGFKCGSDWTRV 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE1 VFSPDGSYVAAGSAEGSLYIWSVLTGKVEKVLSKQHSSSINAVAWSPSGSHVVSVDKGCK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 VFSPDGSYVAAGSAEGSLYIWSVLTGKVEKVLSKQHSSSINAVAWSPSGSHVVSVDKGCK 550 560 570 580 590 600 pF1KE1 AVLWAQY ::::::: CCDS25 AVLWAQY >>CCDS2502.2 ATG16L1 gene_id:55054|Hs108|chr2 (588 aa) initn: 2158 init1: 2158 opt: 2165 Z-score: 1245.7 bits: 240.5 E(32554): 4.6e-63 Smith-Waterman score: 3826; 96.9% identity (96.9% similar) in 607 aa overlap (1-607:1-588) 10 20 30 40 50 60 pF1KE1 MSSGLRAADFPRWKRHISEQLRRRDRLQRQAFEEIILQYNKLLEKSDLHSVLAQKLQAEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 MSSGLRAADFPRWKRHISEQLRRRDRLQRQAFEEIILQYNKLLEKSDLHSVLAQKLQAEK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 HDVPNRHEISPGHDGTWNDNQLQEMAQLRIKHQEELTELHKKRGELAQLVIDLNNQMQRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 HDVPNRHEISPGHDGTWNDNQLQEMAQLRIKHQEELTELHKKRGELAQLVIDLNNQMQRK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 DREMQMNEAKIAECLQTISDLETECLDLRTKLCDLERANQTLKDEYDALQITFTALEGKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 DREMQMNEAKIAECLQTISDLETECLDLRTKLCDLERANQTLKDEYDALQITFTALEGKL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 RKTTEENQELVTRWMAEKAQEANRLNAENEKDSRRRQARLQKELAEAAKEPLPVEQDDDI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 RKTTEENQELVTRWMAEKAQEANRLNAENEKDSRRRQARLQKELAEAAKEPLPVEQDDDI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 EVIVDETSDHTEETSPVRAISRAATKRLSQPAGGLLDSITNIFGRRSVSSFPVPQDNVDT ::::::::::::::::::::::::: :::::::::::::::: CCDS25 EVIVDETSDHTEETSPVRAISRAAT-------------------RRSVSSFPVPQDNVDT 250 260 270 280 310 320 330 340 350 360 pF1KE1 HPGSGKEVRVPATALCVFDAHDGEVNAVQFSPGSRLLATGGMDRRVKLWEVFGEKCEFKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 HPGSGKEVRVPATALCVFDAHDGEVNAVQFSPGSRLLATGGMDRRVKLWEVFGEKCEFKG 290 300 310 320 330 340 370 380 390 400 410 420 pF1KE1 SLSGSNAGITSIEFDSAGSYLLAASNDFASRIWTVDDYRLRHTLTGHSGKVLSAKFLLDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 SLSGSNAGITSIEFDSAGSYLLAASNDFASRIWTVDDYRLRHTLTGHSGKVLSAKFLLDN 350 360 370 380 390 400 430 440 450 460 470 480 pF1KE1 ARIVSGSHDRTLKLWDLRSKVCIKTVFAGSSCNDIVCTEQCVMSGHFDKKIRFWDIRSES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 ARIVSGSHDRTLKLWDLRSKVCIKTVFAGSSCNDIVCTEQCVMSGHFDKKIRFWDIRSES 410 420 430 440 450 460 490 500 510 520 530 540 pF1KE1 IVREMELLGKITALDLNPERTELLSCSRDDLLKVIDLRTNAIKQTFSAPGFKCGSDWTRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 IVREMELLGKITALDLNPERTELLSCSRDDLLKVIDLRTNAIKQTFSAPGFKCGSDWTRV 470 480 490 500 510 520 550 560 570 580 590 600 pF1KE1 VFSPDGSYVAAGSAEGSLYIWSVLTGKVEKVLSKQHSSSINAVAWSPSGSHVVSVDKGCK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 VFSPDGSYVAAGSAEGSLYIWSVLTGKVEKVLSKQHSSSINAVAWSPSGSHVVSVDKGCK 530 540 550 560 570 580 pF1KE1 AVLWAQY ::::::: CCDS25 AVLWAQY >>CCDS54438.1 ATG16L1 gene_id:55054|Hs108|chr2 (444 aa) initn: 2600 init1: 2158 opt: 2159 Z-score: 1243.9 bits: 239.8 E(32554): 5.8e-63 Smith-Waterman score: 2591; 73.1% identity (73.1% similar) in 607 aa overlap (1-607:1-444) 10 20 30 40 50 60 pF1KE1 MSSGLRAADFPRWKRHISEQLRRRDRLQRQAFEEIILQYNKLLEKSDLHSVLAQKLQAEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MSSGLRAADFPRWKRHISEQLRRRDRLQRQAFEEIILQYNKLLEKSDLHSVLAQKLQAEK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 HDVPNRHEISPGHDGTWNDNQLQEMAQLRIKHQEELTELHKKRGELAQLVIDLNNQMQRK ::::::::: CCDS54 HDVPNRHEI--------------------------------------------------- 130 140 150 160 170 180 pF1KE1 DREMQMNEAKIAECLQTISDLETECLDLRTKLCDLERANQTLKDEYDALQITFTALEGKL CCDS54 ------------------------------------------------------------ 190 200 210 220 230 240 pF1KE1 RKTTEENQELVTRWMAEKAQEANRLNAENEKDSRRRQARLQKELAEAAKEPLPVEQDDDI ::::::::::::::::::::::::::: CCDS54 ---------------------------------RRRQARLQKELAEAAKEPLPVEQDDDI 70 80 90 250 260 270 280 290 300 pF1KE1 EVIVDETSDHTEETSPVRAISRAATKRLSQPAGGLLDSITNIFGRRSVSSFPVPQDNVDT ::::::::::::::::::::::::: :::::::::::::::: CCDS54 EVIVDETSDHTEETSPVRAISRAAT-------------------RRSVSSFPVPQDNVDT 100 110 120 130 310 320 330 340 350 360 pF1KE1 HPGSGKEVRVPATALCVFDAHDGEVNAVQFSPGSRLLATGGMDRRVKLWEVFGEKCEFKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 HPGSGKEVRVPATALCVFDAHDGEVNAVQFSPGSRLLATGGMDRRVKLWEVFGEKCEFKG 140 150 160 170 180 190 370 380 390 400 410 420 pF1KE1 SLSGSNAGITSIEFDSAGSYLLAASNDFASRIWTVDDYRLRHTLTGHSGKVLSAKFLLDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 SLSGSNAGITSIEFDSAGSYLLAASNDFASRIWTVDDYRLRHTLTGHSGKVLSAKFLLDN 200 210 220 230 240 250 430 440 450 460 470 480 pF1KE1 ARIVSGSHDRTLKLWDLRSKVCIKTVFAGSSCNDIVCTEQCVMSGHFDKKIRFWDIRSES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 ARIVSGSHDRTLKLWDLRSKVCIKTVFAGSSCNDIVCTEQCVMSGHFDKKIRFWDIRSES 260 270 280 290 300 310 490 500 510 520 530 540 pF1KE1 IVREMELLGKITALDLNPERTELLSCSRDDLLKVIDLRTNAIKQTFSAPGFKCGSDWTRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 IVREMELLGKITALDLNPERTELLSCSRDDLLKVIDLRTNAIKQTFSAPGFKCGSDWTRV 320 330 340 350 360 370 550 560 570 580 590 600 pF1KE1 VFSPDGSYVAAGSAEGSLYIWSVLTGKVEKVLSKQHSSSINAVAWSPSGSHVVSVDKGCK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 VFSPDGSYVAAGSAEGSLYIWSVLTGKVEKVLSKQHSSSINAVAWSPSGSHVVSVDKGCK 380 390 400 410 420 430 pF1KE1 AVLWAQY ::::::: CCDS54 AVLWAQY 440 >>CCDS31634.1 ATG16L2 gene_id:89849|Hs108|chr11 (619 aa) initn: 1445 init1: 1025 opt: 1059 Z-score: 622.1 bits: 125.2 E(32554): 2.5e-28 Smith-Waterman score: 1411; 38.8% identity (67.2% similar) in 616 aa overlap (4-604:5-618) 10 20 30 40 50 pF1KE1 MSSGLRAADFPRWKRHISEQLRRRDRLQRQAFEEIILQYNKLLEKSDLHSVLAQKLQAE :. .: :::::: .::: ::: :. : :.. ::.::::..: . ...::: : CCDS31 MAGPGVPGAPAARWKRHIVRQLRLRDRTQKALFLELVPAYNHLLEKAELLDKFSKKLQPE 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 KHDV-PNRHEISPGHDGTWNDNQLQEMAQLRIKHQEELTELHKKRGELAQLVIDLNNQMQ ..: :. :. .: ... ...:. .. ::.: ::: :. ::.: :.. . . CCDS31 PNSVTPTTHQ-GPWEESELDSDQVPSLVALRVKWQEEEEGLRLVCGEMAYQVVEKGAALG 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 RKDREMQMNEAKIAECLQTISDLETECLDLRTKLCDLERANQTLKDEYDALQITFTALEG . :.:. ....: ...:. . .. . . : . . :.::. :. CCDS31 TLESELQQRQSRLAALEARVAQLREARAQQAQQVEEWRAQNAVQRAAYEALRAHVGLREA 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 KLRKTTEENQELVTRWMAEKAQEANRLNAENEKDSRRRQARLQKELAEAAKEPLPV-EQD ::. :: ..:. : . .::. : . : .::. : .:::...:: .:::. . . : CCDS31 ALRRLQEEARDLLERLVQRKARAAAERNLRNERRERAKQARVSQELKKAAKRTVSISEGP 180 190 200 210 220 230 240 250 260 270 280 pF1KE1 DDIEVIVDETSDHTEETSPVRAISRAATKRLSQP-----AGGL-----LDSITNIFG--- : . . : . . . . . : .. ..: : .: .: . ... CCDS31 DTLGDGMRERRETLALAPEPEPLEKEACEKWKRPFRSASATSLTLSHCVDVVKGLLDFKK 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE1 RRSVSSFPVPQDNVDTHPGSGKEVRVPATALCVFDAHDGEVNAVQFSPGSRLLATGGMDR ::. : .:.. . : .:.:. : :.::: .:::::.:.:.: :::::: :: CCDS31 RRGHSIGGAPEQRYQIIP-VCVAARLPTRAQDVLDAHLSEVNAVRFGPNSSLLATGGADR 300 310 320 330 340 350 350 360 370 380 390 400 pF1KE1 RVKLWEVFGEKCEFKGSLSGSNAGITSIEFDSAGSYLLAASNDFASRIWTVDDYRLRHTL ..::.: : . : . .: :....:::..:: .: .:::. . :...: : . . ..:: CCDS31 LIHLWNVVGSRLEANQTLEGAGGSITSVDFDPSGYQVLAATYNQAAQLWKVGEAQSKETL 360 370 380 390 400 410 410 420 430 440 450 460 pF1KE1 TGHSGKVLSAKFLLDNARIVSGSHDRTLKLWDLRSKVCIKTVFAGSSCNDIVCTEQCVMS .::. :: .::: : . :.::.:::.: ::: : .:. . : :::.:: .. ..: CCDS31 SGHKDKVTAAKFKLTRHQAVTGSRDRTVKEWDLGRAYCSRTINVLSYCNDVVCGDHIIIS 420 430 440 450 460 470 470 480 490 500 510 520 pF1KE1 GHFDKKIRFWDIRSESIVREMELLGKITALDLNPERTELLSCSRDDLLKVIDLRTNAIKQ :: :.:::::: :. .. . . :..:.:.:. .. .:::::::. :::::::.. :.: CCDS31 GHNDQKIRFWDSRGPHCTQVIPVQGRVTSLSLSHDQLHLLSCSRDNTLKVIDLRVSNIRQ 480 490 500 510 520 530 530 540 550 560 570 580 pF1KE1 TFSAPGFKCGSDWTRVVFSPDGSYVAAGSAEGSLYIWSVLTGKVEKVLSKQHSSSINAVA .: : :::::::::..::::: ::. ::: .:.::::.: :::.:. :. : ...:::: CCDS31 VFRADGFKCGSDWTKAVFSPDRSYALAGSCDGALYIWDVDTGKLESRLQGPHCAAVNAVA 540 550 560 570 580 590 590 600 pF1KE1 WSPSGSHVVSVDKGCKAVLWAQY : ::::.::::.: :.::: CCDS31 WCYSGSHMVSVDQGRKVVLWQ 600 610 607 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 21:38:09 2016 done: Sun Nov 6 21:38:10 2016 Total Scan time: 3.780 Total Display time: 0.060 Function used was FASTA [36.3.4 Apr, 2011]