FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3589, 227 aa
  1>>>pF1KE3589 227 - 227 aa - 227 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.5833+/-0.000811; mu= 12.4429+/- 0.049
 mean_var=65.4275+/-13.013, 0's: 0 Z-trim(106.9): 28 B-trim: 5 in 1/49
 Lambda= 0.158560
 statistics sampled from 9250 (9274) to 9250 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.671), E-opt: 0.2 (0.285), width: 16
 Scan time: 2.160

The best scores are:                              opt bits  E(32554)
CCDS6455.1 AK3 gene_id:50808|Hs108|chr9   ( 227) 1485 348.2 2.5e-96
CCDS56561.1 AK3 gene_id:50808|Hs108|chr9  ( 157) 1041 246.6 6.8e-66
CCDS56562.1 AK3 gene_id:50808|Hs108|chr9  ( 187)  929 221.0 4.1e-58
CCDS629.1 AK4 gene_id:205|Hs108|chr1      ( 223)  926 220.3 7.7e-58
CCDS81340.1 AK4 gene_id:205|Hs108|chr1    ( 171)  730 175.5 1.9e-44
CCDS81296.1 AK2 gene_id:204|Hs108|chr1    ( 232)  580 141.2 5.3e-34
CCDS373.1 AK2 gene_id:204|Hs108|chr1      ( 232)  580 141.2 5.3e-34
CCDS374.1 AK2 gene_id:204|Hs108|chr1      ( 239)  580 141.2 5.5e-34
CCDS81294.1 AK2 gene_id:204|Hs108|chr1    ( 190)  406 101.4 4.3e-22
CCDS6954.1 AK8 gene_id:158067|Hs108|chr9  ( 479)  307  78.9 6.4e-15
CCDS81295.1 AK2 gene_id:204|Hs108|chr1    ( 133)  276  71.6 2.8e-13

>>CCDS6455.1 AK3 gene_id:50808|Hs108|chr9                 (227 aa)
 initn: 1485 init1: 1485 opt: 1485 Z-score: 1842.9 bits: 348.2 E(32554): 2.5e-96
Smith-Waterman score: 1485; 100.0% identity (100.0% similar) in 227 aa overlap (1-227:1-227)

               10        20        30        40        50        60
pF1KE3 MGASARLLRAVIMGAPGSGKGTVSSRITTHFELKHLSSGDLLRDNMLRGTEIGVLAKAFI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 MGASARLLRAVIMGAPGSGKGTVSSRITTHFELKHLSSGDLLRDNMLRGTEIGVLAKAFI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 DQGKLIPDDVMTRLALHELKNLTQYSWLLDGFPRTLPQAEALDRAYQIDTVINLNVPFEV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 DQGKLIPDDVMTRLALHELKNLTQYSWLLDGFPRTLPQAEALDRAYQIDTVINLNVPFEV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 IKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKPETVIKRLKAYEDQT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 IKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKPETVIKRLKAYEDQT 130 140 150 160 170 180 190 200 210 220 pF1KE3 KPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKVPQRSQKASVTP ::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 KPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKVPQRSQKASVTP 190 200 210 220 >>CCDS56561.1 AK3 gene_id:50808|Hs108|chr9 (157 aa) initn: 1041 init1: 1041 opt: 1041 Z-score: 1296.5 bits: 246.6 E(32554): 6.8e-66 Smith-Waterman score: 1041; 100.0% identity (100.0% similar) in 157 aa overlap (71-227:1-157) 50 60 70 80 90 100 pF1KE3 LLRDNMLRGTEIGVLAKAFIDQGKLIPDDVMTRLALHELKNLTQYSWLLDGFPRTLPQAE :::::::::::::::::::::::::::::: CCDS56 MTRLALHELKNLTQYSWLLDGFPRTLPQAE 10 20 30 110 120 130 140 150 160 pF1KE3 ALDRAYQIDTVINLNVPFEVIKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 ALDRAYQIDTVINLNVPFEVIKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQ 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE3 REDDKPETVIKRLKAYEDQTKPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKVPQRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 REDDKPETVIKRLKAYEDQTKPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKVPQRS 100 110 120 130 140 150 pF1KE3 QKASVTP ::::::: CCDS56 QKASVTP >>CCDS56562.1 AK3 gene_id:50808|Hs108|chr9 (187 aa) initn: 1212 init1: 910 opt: 929 Z-score: 1156.8 bits: 221.0 E(32554): 4.1e-58 Smith-Waterman score: 1136; 82.4% identity (82.4% similar) in 227 aa overlap (1-227:1-187) 10 20 30 40 50 60 pF1KE3 MGASARLLRAVIMGAPGSGKGTVSSRITTHFELKHLSSGDLLRDNMLRGTEIGVLAKAFI :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 MGASARLLRAVIMGAPGSGKGTVSSRITTHFELKHLSSGDLLRDNMLRGT---------- 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 DQGKLIPDDVMTRLALHELKNLTQYSWLLDGFPRTLPQAEALDRAYQIDTVINLNVPFEV :::::::::::::::::::::::::::::: 
CCDS56 ------------------------------GFPRTLPQAEALDRAYQIDTVINLNVPFEV 60 70 80 130 140 150 160 170 180 pF1KE3 IKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKPETVIKRLKAYEDQT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 IKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKPETVIKRLKAYEDQT 90 100 110 120 130 140 190 200 210 220 pF1KE3 KPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKVPQRSQKASVTP ::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 KPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKVPQRSQKASVTP 150 160 170 180 >>CCDS629.1 AK4 gene_id:205|Hs108|chr1 (223 aa) initn: 931 init1: 912 opt: 926 Z-score: 1151.9 bits: 220.3 E(32554): 7.7e-58 Smith-Waterman score: 926; 59.7% identity (85.1% similar) in 221 aa overlap (4-223:2-222) 10 20 30 40 50 60 pF1KE3 MGASARLLRAVIMGAPGSGKGTVSSRITTHFELKHLSSGDLLRDNMLRGTEIGVLAKAFI ...::::::.: :::::::: .::. .: :.::::: .::.:. .::.: .:: .: CCDS62 MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEVGEMAKQYI 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 DQGKLIPDDVMTRLALHELKNLTQYSWLLDGFPRTLPQAEALDRAYQIDTVINLNVPFEV ... :.:: :.::: . ::.: :::::::::: ::::::. ..: ::.::.:::. CCDS62 EKSLLVPDHVITRLMMSELENRRGQHWLLDGFPRTLGQAEALDKICEVDLVISLNIPFET 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE3 IKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKPETVIKRLKAYEDQT .:.::. ::::: ::::::..::::.. ::::.:::::.:.::::::.: ::. :.: . CCDS62 LKDRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDVA 120 130 140 150 160 170 190 200 210 220 pF1KE3 KPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKV-PQRSQKASVTP :::.: :...:::. ::::::::::::::.....:. : .:..: CCDS62 KPVIELYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQSKEAY 180 190 200 210 220 >>CCDS81340.1 AK4 gene_id:205|Hs108|chr1 (171 aa) initn: 737 init1: 718 opt: 730 Z-score: 911.4 bits: 175.5 E(32554): 1.9e-44 Smith-Waterman score: 730; 60.0% identity (85.3% similar) in 170 aa overlap (55-223:1-170) 30 40 50 60 70 80 pF1KE3 SRITTHFELKHLSSGDLLRDNMLRGTEIGVLAKAFIDQGKLIPDDVMTRLALHELKNLTQ .:: .:... :.:: :.::: . 
::.: CCDS81 MAKQYIEKSLLVPDHVITRLMMSELENRRG 10 20 30 90 100 110 120 130 140 pF1KE3 YSWLLDGFPRTLPQAEALDRAYQIDTVINLNVPFEVIKQRLTARWIHPASGRVYNIEFNP :::::::::: ::::::. ..: ::.::.:::..:.::. ::::: ::::::..::: CCDS81 QHWLLDGFPRTLGQAEALDKICEVDLVISLNIPFETLKDRLSRRWIHPPSGRVYNLDFNP 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE3 PKTVGIDDLTGEPLIQREDDKPETVIKRLKAYEDQTKPVLEYYQKKGVLETFSGTETNKI :.. ::::.:::::.:.::::::.: ::. :.: .:::.: :...:::. ::::::::: CCDS81 PHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDVAKPVIELYKSRGVLHQFSGTETNKI 100 110 120 130 140 150 210 220 pF1KE3 WPYVYAFLQTKV-PQRSQKASVTP :::::.....:. : .:..: CCDS81 WPYVYTLFSNKITPIQSKEAY 160 170 >>CCDS81296.1 AK2 gene_id:204|Hs108|chr1 (232 aa) initn: 613 init1: 299 opt: 580 Z-score: 723.9 bits: 141.2 E(32554): 5.3e-34 Smith-Waterman score: 580; 43.8% identity (74.9% similar) in 203 aa overlap (8-204:16-218) 10 20 30 40 50 pF1KE3 MGASARLLRAVIMGAPGSGKGTVSSRITTHFELKHLSSGDLLRDNMLRGTEI .:::..: ::.:::: . :.. .: . ::..::.:: . :.:. CCDS81 MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSEL 10 20 30 40 50 60 60 70 80 90 100 pF1KE3 GVLAKAFIDQGKLIPDDVMTRLALHELKN-LTQYSWLLDGFPRTLPQAEALD-----RAY : :: .: :::. :.....: ..:.. : . ..::::::::. ::: :: : CCDS81 GKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE3 QIDTVINLNVPFEVIKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKP ..:.::....: .. .:.:.: ::: ::: :. :::::: ::.::::::.: ::. CCDS81 KLDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNE 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE3 ETVIKRLKAYEDQTKPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKVPQRSQKASVT ... ::.::. :: :..:::.:.:. ......: . 
CCDS81 KALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATC 190 200 210 220 230 pF1KE3 P >>CCDS373.1 AK2 gene_id:204|Hs108|chr1 (232 aa) initn: 613 init1: 299 opt: 580 Z-score: 723.9 bits: 141.2 E(32554): 5.3e-34 Smith-Waterman score: 580; 43.8% identity (74.9% similar) in 203 aa overlap (8-204:16-218) 10 20 30 40 50 pF1KE3 MGASARLLRAVIMGAPGSGKGTVSSRITTHFELKHLSSGDLLRDNMLRGTEI .:::..: ::.:::: . :.. .: . ::..::.:: . :.:. CCDS37 MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSEL 10 20 30 40 50 60 60 70 80 90 100 pF1KE3 GVLAKAFIDQGKLIPDDVMTRLALHELKN-LTQYSWLLDGFPRTLPQAEALD-----RAY : :: .: :::. :.....: ..:.. : . ..::::::::. ::: :: : CCDS37 GKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE3 QIDTVINLNVPFEVIKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKP ..:.::....: .. .:.:.: ::: ::: :. :::::: ::.::::::.: ::. CCDS37 KLDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNE 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE3 ETVIKRLKAYEDQTKPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKVPQRSQKASVT ... ::.::. :: :..:::.:.:. ......: . CCDS37 KALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATS 190 200 210 220 230 pF1KE3 P >>CCDS374.1 AK2 gene_id:204|Hs108|chr1 (239 aa) initn: 613 init1: 299 opt: 580 Z-score: 723.7 bits: 141.2 E(32554): 5.5e-34 Smith-Waterman score: 580; 43.8% identity (74.9% similar) in 203 aa overlap (8-204:16-218) 10 20 30 40 50 pF1KE3 MGASARLLRAVIMGAPGSGKGTVSSRITTHFELKHLSSGDLLRDNMLRGTEI .:::..: ::.:::: . :.. .: . ::..::.:: . :.:. CCDS37 MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSEL 10 20 30 40 50 60 60 70 80 90 100 pF1KE3 GVLAKAFIDQGKLIPDDVMTRLALHELKN-LTQYSWLLDGFPRTLPQAEALD-----RAY : :: .: :::. :.....: ..:.. : . ..::::::::. ::: :: : CCDS37 GKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE3 QIDTVINLNVPFEVIKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKP ..:.::....: .. .:.:.: ::: ::: :. :::::: ::.::::::.: ::. 
CCDS37 KLDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNE 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE3 ETVIKRLKAYEDQTKPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKVPQRSQKASVT ... ::.::. :: :..:::.:.:. ......: . CCDS37 KALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATCKDLVMFI 190 200 210 220 230 pF1KE3 P >>CCDS81294.1 AK2 gene_id:204|Hs108|chr1 (190 aa) initn: 464 init1: 299 opt: 406 Z-score: 510.1 bits: 101.4 E(32554): 4.3e-22 Smith-Waterman score: 406; 39.9% identity (71.8% similar) in 163 aa overlap (48-204:14-176) 20 30 40 50 60 70 pF1KE3 SGKGTVSSRITTHFELKHLSSGDLLRDNMLRGTEIGVLAKAFIDQGKLIPDDVMTRLALH .: . .:. .: . :.....: . CCDS81 MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQVSDEMVVELIEK 10 20 30 40 80 90 100 110 120 130 pF1KE3 ELKN-LTQYSWLLDGFPRTLPQAEALD-----RAYQIDTVINLNVPFEVIKQRLTARWIH .:.. : . ..::::::::. ::: :: : ..:.::....: .. .:.:.: :: CCDS81 NLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKEKLDSVIEFSIPDSLLIRRITGRLIH 50 60 70 80 90 100 140 150 160 170 180 190 pF1KE3 PASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKPETVIKRLKAYEDQTKPVLEYYQKKG : ::: :. :::::: ::.::::::.: ::. ... ::.::. :: :..:::.:.: CCDS81 PKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNEKALKIRLQAYHTQTTPLIEYYRKRG 110 120 130 140 150 160 200 210 220 pF1KE3 VLETFSGTETNKIWPYVYAFLQTKVPQRSQKASVTP . ......: . CCDS81 IHSAIDASQTPDVVFASILAAFSKATS 170 180 190 >>CCDS6954.1 AK8 gene_id:158067|Hs108|chr9 (479 aa) initn: 272 init1: 120 opt: 307 Z-score: 381.3 bits: 78.9 E(32554): 6.4e-15 Smith-Waterman score: 307; 27.4% identity (60.9% similar) in 215 aa overlap (9-219:270-477) 10 20 30 pF1KE3 MGASARLLRAVIMGAPGSGKGTVSSRITTHFELKHLSS :....: ::::. .. .. ...: .. CCDS69 ISADQPCVDVFYQALTYVQSNHRTNAPFTPRVLLLGPVGSGKSLQAALLAQKYRLVNVCC 240 250 260 270 280 290 40 50 60 70 80 90 pF1KE3 GDLLRDNMLRGTEIGVLAKAFIDQGKLIPDDVMTRLALHEL--KNLTQYSWLLDGFPRTL :.::.. . : .: : . :... .::... .. ..: .. 
: .:.: : :: : CCDS69 GQLLKEAVADRTTFGELIQPFFEKEMAVPDSLLMKVLSQRLDQQDCIQKGWVLHGVPRDL 300 310 320 330 340 350 100 110 120 130 140 150 pF1KE3 PQAEALDR-AYQIDTVINLNVPFEVIKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTG ::. :.: .:. . :. :::::. : .::: : : :..:. :.. ..:: :. :. CCDS69 DQAHLLNRLGYNPNRVFFLNVPFDSIMERLTLRRIDPVTGERYHLMYKPPPTMEIQ---- 360 370 380 390 400 410 160 170 180 190 200 210 pF1KE3 EPLIQREDDKPETVIKRLKAYEDQTKPVLEYYQKKGVLETFSGTETN-KIWPYVYAFLQT :.: : : : .. . .. . . : : :..: . .. :. . . . CCDS69 ARLLQNPKDAEEQVKLKMDLFYRNSADLEQLY---GSAITLNGDQDPYTVFEYIESGIIN 420 430 440 450 460 470 220 pF1KE3 KVPQRSQKASVTP .:.. CCDS69 PLPKKIP 227 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:29:40 2016 done: Mon Nov 7 01:29:40 2016 Total Scan time: 2.160 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]
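
The E(32554) values in this report can be cross-checked against the Z-scores printed with each alignment. The sketch below assumes the usual extreme-value conversion for regression-scaled FASTA statistics, in which the Z-score is first rescaled as (Z - 50) / 10. The function name `evalue_from_zscore` and the `REPORTED` table are illustrative only and are not part of the FASTA package; the numbers in `REPORTED` are copied from the alignment headers above. Treat this as a consistency check under that assumption, not as the program's exact code path.

```python
import math

# Euler-Mascheroni constant, used in the extreme-value distribution.
EULER_GAMMA = 0.5772156649015329


def evalue_from_zscore(z_score: float, library_size: int) -> float:
    """Estimate E() from a FASTA Z-score (assumed extreme-value model).

    Assumption: Z-scores are scaled to mean 50, standard deviation 10,
    so they are rescaled to a standard normal-like variable first.
    """
    zs = (z_score - 50.0) / 10.0
    # Per-sequence tail term of the extreme-value distribution.
    tail = math.exp(-(math.pi / math.sqrt(6.0)) * zs - EULER_GAMMA)
    # P(score >= observed) = 1 - exp(-tail); expm1 keeps precision for tiny tails.
    p_value = -math.expm1(-tail)
    return library_size * p_value


# (accession, Z-score, reported E-value), copied from the report above.
REPORTED = [
    ("CCDS6455.1", 1842.9, 2.5e-96),
    ("CCDS56561.1", 1296.5, 6.8e-66),
    ("CCDS629.1", 1151.9, 7.7e-58),
    ("CCDS81294.1", 510.1, 4.3e-22),
    ("CCDS6954.1", 381.3, 6.4e-15),
]

if __name__ == "__main__":
    for accession, z_score, reported in REPORTED:
        estimate = evalue_from_zscore(z_score, 32554)
        print(f"{accession:12s} Z={z_score:7.1f} "
              f"estimated E={estimate:.2g} reported E={reported:.2g}")
```

Because the report rounds Z-scores to one decimal place and E-values to two significant figures, agreement within a few percent is all that should be expected.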