FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3620, 223 aa 1>>>pF1KE3620 223 - 223 aa - 223 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5368+/-0.000715; mu= 12.7679+/- 0.043 mean_var=70.7834+/-14.041, 0's: 0 Z-trim(109.6): 27 B-trim: 0 in 0/53 Lambda= 0.152443 statistics sampled from 10964 (10987) to 10964 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.723), E-opt: 0.2 (0.338), width: 16 Scan time: 2.250 The best scores are: opt bits E(32554) CCDS629.1 AK4 gene_id:205|Hs108|chr1 ( 223) 1494 337.2 5.1e-93 CCDS81340.1 AK4 gene_id:205|Hs108|chr1 ( 171) 1155 262.6 1.1e-70 CCDS6455.1 AK3 gene_id:50808|Hs108|chr9 ( 227) 926 212.3 2.1e-55 CCDS56561.1 AK3 gene_id:50808|Hs108|chr9 ( 157) 676 157.2 5.4e-39 CCDS56562.1 AK3 gene_id:50808|Hs108|chr9 ( 187) 612 143.2 1.1e-34 CCDS81296.1 AK2 gene_id:204|Hs108|chr1 ( 232) 522 123.4 1.2e-28 CCDS373.1 AK2 gene_id:204|Hs108|chr1 ( 232) 522 123.4 1.2e-28 CCDS374.1 AK2 gene_id:204|Hs108|chr1 ( 239) 522 123.4 1.2e-28 CCDS81294.1 AK2 gene_id:204|Hs108|chr1 ( 190) 352 86.0 1.8e-17 CCDS6954.1 AK8 gene_id:158067|Hs108|chr9 ( 479) 320 79.2 5.2e-15 CCDS81295.1 AK2 gene_id:204|Hs108|chr1 ( 133) 266 67.0 6.6e-12 CCDS676.1 AK5 gene_id:26289|Hs108|chr1 ( 536) 273 68.8 7.3e-12 CCDS675.1 AK5 gene_id:26289|Hs108|chr1 ( 562) 273 68.9 7.6e-12 CCDS6881.1 AK1 gene_id:203|Hs108|chr9 ( 194) 259 65.5 2.6e-11 CCDS83419.1 AK1 gene_id:203|Hs108|chr9 ( 210) 259 65.6 2.8e-11 >>CCDS629.1 AK4 gene_id:205|Hs108|chr1 (223 aa) initn: 1494 init1: 1494 opt: 1494 Z-score: 1783.5 bits: 337.2 E(32554): 5.1e-93 Smith-Waterman score: 1494; 100.0% identity (100.0% similar) in 223 aa overlap (1-223:1-223) 10 20 30 40 50 60 pF1KE3 MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEVGEMAKQYIEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS62 MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEVGEMAKQYIEK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 SLLVPDHVITRLMMSELENRRGQHWLLDGFPRTLGQAEALDKICEVDLVISLNIPFETLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS62 SLLVPDHVITRLMMSELENRRGQHWLLDGFPRTLGQAEALDKICEVDLVISLNIPFETLK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 DRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDVAKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS62 DRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDVAKP 130 140 150 160 170 180 190 200 210 220 pF1KE3 VIELYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQSKEAY ::::::::::::::::::::::::::::::::::::::::::: CCDS62 VIELYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQSKEAY 190 200 210 220 >>CCDS81340.1 AK4 gene_id:205|Hs108|chr1 (171 aa) initn: 1155 init1: 1155 opt: 1155 Z-score: 1382.3 bits: 262.6 E(32554): 1.1e-70 Smith-Waterman score: 1155; 100.0% identity (100.0% similar) in 171 aa overlap (53-223:1-171) 30 40 50 60 70 80 pF1KE3 QRIAQNFGLQHLSSGHFLRENIKASTEVGEMAKQYIEKSLLVPDHVITRLMMSELENRRG :::::::::::::::::::::::::::::: CCDS81 MAKQYIEKSLLVPDHVITRLMMSELENRRG 10 20 30 90 100 110 120 130 140 pF1KE3 QHWLLDGFPRTLGQAEALDKICEVDLVISLNIPFETLKDRLSRRWIHPPSGRVYNLDFNP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 QHWLLDGFPRTLGQAEALDKICEVDLVISLNIPFETLKDRLSRRWIHPPSGRVYNLDFNP 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE3 PHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDVAKPVIELYKSRGVLHQFSGTETNKI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 PHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDVAKPVIELYKSRGVLHQFSGTETNKI 100 110 120 130 140 150 210 220 pF1KE3 WPYVYTLFSNKITPIQSKEAY ::::::::::::::::::::: CCDS81 WPYVYTLFSNKITPIQSKEAY 160 170 >>CCDS6455.1 AK3 gene_id:50808|Hs108|chr9 (227 aa) initn: 929 init1: 912 opt: 926 Z-score: 1108.2 bits: 212.3 E(32554): 2.1e-55 Smith-Waterman score: 926; 59.7% identity (85.1% similar) in 221 aa overlap (2-222:4-223) 10 20 30 40 50 pF1KE3 MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEVGEMAKQYI ...::::::.: :::::::: .::. .: :.::::: .::.:. .::.: .:: .: CCDS64 MGASARLLRAVIMGAPGSGKGTVSSRITTHFELKHLSSGDLLRDNMLRGTEIGVLAKAFI 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 EKSLLVPDHVITRLMMSELENRRGQHWLLDGFPRTLGQAEALDKICEVDLVISLNIPFET ... :.:: :.::: . ::.: :::::::::: ::::::. ..: ::.::.:::. CCDS64 DQGKLIPDDVMTRLALHELKNLTQYSWLLDGFPRTLPQAEALDRAYQIDTVINLNVPFEV 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE3 LKDRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDVA .:.::. ::::: ::::::..::::.. ::::.:::::.:.::::::.: ::. :.: . CCDS64 IKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKPETVIKRLKAYEDQT 130 140 150 160 170 180 180 190 200 210 220 pF1KE3 KPVIELYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQSKEAY :::.: :...:::. ::::::::::::::.....:. : .:..: CCDS64 KPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKV-PQRSQKASVTP 190 200 210 220 >>CCDS56561.1 AK3 gene_id:50808|Hs108|chr9 (157 aa) initn: 684 init1: 665 opt: 676 Z-score: 813.5 bits: 157.2 E(32554): 5.4e-39 Smith-Waterman score: 676; 61.7% identity (85.7% similar) in 154 aa overlap (69-222:1-153) 40 50 60 70 80 90 pF1KE3 FLRENIKASTEVGEMAKQYIEKSLLVPDHVITRLMMSELENRRGQHWLLDGFPRTLGQAE .::: . ::.: :::::::::: ::: CCDS56 MTRLALHELKNLTQYSWLLDGFPRTLPQAE 10 20 30 100 110 120 130 140 150 pF1KE3 ALDKICEVDLVISLNIPFETLKDRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQ :::. ..: ::.::.:::..:.::. ::::: ::::::..::::.. ::::.:::::.: CCDS56 ALDRAYQIDTVINLNVPFEVIKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQ 40 50 60 70 80 90 160 170 180 190 200 210 pF1KE3 QEDDKPEAVAARLRQYKDVAKPVIELYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQ .::::::.: ::. :.: .:::.: :...:::. ::::::::::::::.....:. : . CCDS56 REDDKPETVIKRLKAYEDQTKPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKV-PQR 100 110 120 130 140 220 pF1KE3 SKEAY :..: CCDS56 SQKASVTP 150 >>CCDS56562.1 AK3 gene_id:50808|Hs108|chr9 (187 aa) initn: 784 init1: 603 opt: 612 Z-score: 736.3 bits: 143.2 E(32554): 1.1e-34 Smith-Waterman score: 705; 52.0% identity (72.4% similar) in 221 aa overlap (2-222:4-183) 10 20 30 40 50 pF1KE3 MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEVGEMAKQYI ...::::::.: :::::::: .::. .: :.::::: .::.: CCDS56 MGASARLLRAVIMGAPGSGKGTVSSRITTHFELKHLSSGDLLRDN--------------- 10 20 30 40 60 70 80 90 100 110 pF1KE3 EKSLLVPDHVITRLMMSELENRRGQHWLLDGFPRTLGQAEALDKICEVDLVISLNIPFET :. :: :::::: ::::::. ..: ::.::.:::. CCDS56 --------------ML------RGT-----GFPRTLPQAEALDRAYQIDTVINLNVPFEV 50 60 70 80 120 130 140 150 160 170 pF1KE3 LKDRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDVA .:.::. ::::: ::::::..::::.. ::::.:::::.:.::::::.: ::. :.: . CCDS56 IKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKPETVIKRLKAYEDQT 90 100 110 120 130 140 180 190 200 210 220 pF1KE3 KPVIELYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQSKEAY :::.: :...:::. ::::::::::::::.....:. : .:..: CCDS56 KPVLEYYQKKGVLETFSGTETNKIWPYVYAFLQTKV-PQRSQKASVTP 150 160 170 180 >>CCDS81296.1 AK2 gene_id:204|Hs108|chr1 (232 aa) initn: 518 init1: 251 opt: 522 Z-score: 627.9 bits: 123.4 E(32554): 1.2e-28 Smith-Waterman score: 522; 40.0% identity (69.8% similar) in 215 aa overlap (4-211:14-228) 10 20 30 40 50 pF1KE3 MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEV : .:::.:::::.:::: :.:.:: . ::..: .:: . ...:. CCDS81 MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSEL 10 20 30 40 50 60 60 70 80 90 100 pF1KE3 GEMAKQYIEKSLLVPDHVITRLMMSELENRRGQH-WLLDGFPRTLGQAEALDKICE---- :. : .. . :: :.....:. ..::. .. .::::::::. ::: :: . : CCDS81 GKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE3 -VDLVISLNIPFETLKDRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKP .: :: ..:: : :.. : ::: ::: :. .::::. ::.:::::... ::. CCDS81 KLDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNE 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE3 EAVAARLRQYKDVAKPVIELYKSRGVLHQFSGTET-NKIWPYVYTLFSNKITPIQSKEAY .:. ::. :. . :.:: :..::. .....: . .. . . :: CCDS81 KALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATC 190 200 210 220 230 >>CCDS373.1 AK2 gene_id:204|Hs108|chr1 (232 aa) initn: 518 init1: 251 opt: 522 Z-score: 627.9 bits: 123.4 E(32554): 1.2e-28 Smith-Waterman score: 522; 40.0% identity (69.8% similar) in 215 aa overlap (4-211:14-228) 10 20 30 40 50 pF1KE3 MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEV : .:::.:::::.:::: :.:.:: . ::..: .:: . ...:. CCDS37 MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSEL 10 20 30 40 50 60 60 70 80 90 100 pF1KE3 GEMAKQYIEKSLLVPDHVITRLMMSELENRRGQH-WLLDGFPRTLGQAEALDKICE---- :. : .. . :: :.....:. ..::. .. .::::::::. ::: :: . : CCDS37 GKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE3 -VDLVISLNIPFETLKDRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKP .: :: ..:: : :.. : ::: ::: :. .::::. ::.:::::... ::. CCDS37 KLDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNE 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE3 EAVAARLRQYKDVAKPVIELYKSRGVLHQFSGTET-NKIWPYVYTLFSNKITPIQSKEAY .:. ::. :. . :.:: :..::. .....: . .. . . :: CCDS37 KALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATS 190 200 210 220 230 >>CCDS374.1 AK2 gene_id:204|Hs108|chr1 (239 aa) initn: 518 init1: 251 opt: 522 Z-score: 627.7 bits: 123.4 E(32554): 1.2e-28 Smith-Waterman score: 522; 40.0% identity (69.8% similar) in 215 aa overlap (4-211:14-228) 10 20 30 40 50 pF1KE3 MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEV : .:::.:::::.:::: :.:.:: . ::..: .:: . ...:. CCDS37 MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSEL 10 20 30 40 50 60 60 70 80 90 100 pF1KE3 GEMAKQYIEKSLLVPDHVITRLMMSELENRRGQH-WLLDGFPRTLGQAEALDKICE---- :. : .. . :: :.....:. ..::. .. .::::::::. ::: :: . : CCDS37 GKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE3 -VDLVISLNIPFETLKDRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKP .: :: ..:: : :.. : ::: ::: :. .::::. ::.:::::... ::. CCDS37 KLDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNE 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE3 EAVAARLRQYKDVAKPVIELYKSRGVLHQFSGTET-NKIWPYVYTLFSNKITPIQSKEAY .:. ::. :. . :.:: :..::. .....: . .. . . :: CCDS37 KALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATCKDLVMFI 190 200 210 220 230 >>CCDS81294.1 AK2 gene_id:204|Hs108|chr1 (190 aa) initn: 431 init1: 251 opt: 352 Z-score: 427.1 bits: 86.0 E(32554): 1.8e-17 Smith-Waterman score: 357; 36.0% identity (57.5% similar) in 214 aa overlap (4-211:14-186) 10 20 30 40 50 pF1KE3 MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEV : .:::.:::::.:::: ..:. CCDS81 MAPSVPAAEPEYPKGIRAVLLGPPGAGKGT------------QVSD-------------- 10 20 30 60 70 80 90 100 pF1KE3 GEMAKQYIEKSLLVPDHVITRLMMSELENRRGQHWLLDGFPRTLGQAEALDKICE----- ::. . :::.: .: . : .::::::::. ::: :: . : CCDS81 -EMVVELIEKNLETP------------LCKNG--FLLDGFPRTVRQAEMLDDLMEKRKEK 40 50 60 70 110 120 130 140 150 160 pF1KE3 VDLVISLNIPFETLKDRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKPE .: :: ..:: : :.. : ::: ::: :. .::::. ::.:::::... ::. . CCDS81 LDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNEK 80 90 100 110 120 130 170 180 190 200 210 220 pF1KE3 AVAARLRQYKDVAKPVIELYKSRGVLHQFSGTET-NKIWPYVYTLFSNKITPIQSKEAY :. ::. :. . :.:: :..::. .....: . .. . . :: CCDS81 ALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATS 140 150 160 170 180 190 >>CCDS6954.1 AK8 gene_id:158067|Hs108|chr9 (479 aa) initn: 337 init1: 151 opt: 320 Z-score: 383.0 bits: 79.2 E(32554): 5.2e-15 Smith-Waterman score: 320; 28.6% identity (60.4% similar) in 217 aa overlap (7-220:270-477) 10 20 30 pF1KE3 MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSS :...::: ::::. .::.. : .. CCDS69 ISADQPCVDVFYQALTYVQSNHRTNAPFTPRVLLLGPVGSGKSLQAALLAQKYRLVNVCC 240 250 260 270 280 290 40 50 60 70 80 90 pF1KE3 GHFLRENIKASTEVGEMAKQYIEKSLLVPDHVITRLMMSELENRR--GQHWLLDGFPRTL :..:.: . : ::. . ..:: . ::: .. ... ..:... . :.: : :: : CCDS69 GQLLKEAVADRTTFGELIQPFFEKEMAVPDSLLMKVLSQRLDQQDCIQKGWVLHGVPRDL 300 310 320 330 340 350 100 110 120 130 140 150 pF1KE3 GQAEALDKIC-EVDLVISLNIPFETLKDRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTG ::. :... . . :. ::.::... .::. : : : .:. :.: ..:: . :. CCDS69 DQAHLLNRLGYNPNRVFFLNVPFDSIMERLTLRRIDPVTGERYHLMYKPPPTMEIQ---- 360 370 380 390 400 410 160 170 180 190 200 210 pF1KE3 EPLVQQEDDKPEAVAARLRQYKDVAKPVIELYKSRGVLHQFSGTETNKIWPYVYTLFSNK :.:. : : : .. . . . .:: : .:. . . .. :. :. CCDS69 ARLLQNPKDAEEQVKLKMDLFYRNSADLEQLYGSAITLN--GDQDPYTVFEYIE---SGI 420 430 440 450 460 470 220 pF1KE3 ITPIQSKEAY :.:. .: CCDS69 INPLPKKIP 223 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:12:33 2016 done: Mon Nov 7 01:12:34 2016 Total Scan time: 2.250 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]