FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2319, 333 aa
  1>>>pF1KE2319 333 - 333 aa - 333 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 8.6140+/-0.000791; mu= 2.8400+/- 0.048
 mean_var=201.0464+/-40.993, 0's: 0 Z-trim(115.3): 10  B-trim: 110 in 1/52
 Lambda= 0.090454
 statistics sampled from 15843 (15849) to 15843 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.797), E-opt: 0.2 (0.487), width:  16
 Scan time:  2.960

The best scores are:                                      opt bits E(32554)
CCDS4683.2 ATAT1 gene_id:79969|Hs108|chr6          ( 333) 2262 306.9 1.5e-83
CCDS83072.1 ATAT1 gene_id:79969|Hs108|chr6         ( 323) 2193 297.9 7.4e-81
CCDS54978.1 ATAT1 gene_id:79969|Hs108|chr6         ( 409) 1954 266.8 2.2e-71
CCDS83073.1 ATAT1 gene_id:79969|Hs108|chr6         ( 300) 1326 184.7   8e-47
CCDS59002.1 ATAT1 gene_id:79969|Hs108|chr6         ( 310) 1326 184.7 8.2e-47

>>CCDS4683.2 ATAT1 gene_id:79969|Hs108|chr6               (333 aa)
 initn: 2262 init1: 2262 opt: 2262  Z-score: 1613.6  bits: 306.9 E(32554): 1.5e-83
Smith-Waterman score: 2262; 100.0% identity (100.0% similar) in 333 aa overlap (1-333:1-333)

               10        20        30        40        50        60
pF1KE2 MEFPFDVDALFPERITVLDQHLRPPARRPGTTTPARVDLQQQIMTIIDELGKASAKAQNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 MEFPFDVDALFPERITVLDQHLRPPARRPGTTTPARVDLQQQIMTIIDELGKASAKAQNL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 SAPITSASRMQSNRHVVYILKDSSARPAGKGAIIGFIKVGYKKLFVLDDREAHNEVEPLC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 SAPITSASRMQSNRHVVYILKDSSARPAGKGAIIGFIKVGYKKLFVLDDREAHNEVEPLC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 ILDFYIHESVQRHGHGRELFQYMLQKERVEPHQLAIDRPSQKLLKFLNKHYNLETTVPQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 ILDFYIHESVQRHGHGRELFQYMLQKERVEPHQLAIDRPSQKLLKFLNKHYNLETTVPQV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 NNFVIFEGFFAHQHRPPAPSLRATRHSRAAAVDPTPAAPARKLPPKRAEGDIKPYSSSDR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 NNFVIFEGFFAHQHRPPAPSLRATRHSRAAAVDPTPAAPARKLPPKRAEGDIKPYSSSDR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE2 EFLKVAVEPPWPLNRAPRRATPPAHPPPRSSSLGNSPERGPLRPFVPEQELLRSLRLCPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 EFLKVAVEPPWPLNRAPRRATPPAHPPPRSSSLGNSPERGPLRPFVPEQELLRSLRLCPP
              250       260       270       280       290       300

              310       320       330
pF1KE2 HPTARLLLAADPGGSPAQRRRTSSLPRSEESRY
       :::::::::::::::::::::::::::::::::
CCDS46 HPTARLLLAADPGGSPAQRRRTSSLPRSEESRY
              310       320       330

>>CCDS83072.1 ATAT1 gene_id:79969|Hs108|chr6              (323 aa)
 initn: 2193 init1: 2193 opt: 2193  Z-score: 1565.2  bits: 297.9 E(32554): 7.4e-81
Smith-Waterman score: 2193; 100.0% identity (100.0% similar) in 322 aa overlap (1-322:1-322)

               10        20        30        40        50        60
pF1KE2 MEFPFDVDALFPERITVLDQHLRPPARRPGTTTPARVDLQQQIMTIIDELGKASAKAQNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 MEFPFDVDALFPERITVLDQHLRPPARRPGTTTPARVDLQQQIMTIIDELGKASAKAQNL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 SAPITSASRMQSNRHVVYILKDSSARPAGKGAIIGFIKVGYKKLFVLDDREAHNEVEPLC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 SAPITSASRMQSNRHVVYILKDSSARPAGKGAIIGFIKVGYKKLFVLDDREAHNEVEPLC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 ILDFYIHESVQRHGHGRELFQYMLQKERVEPHQLAIDRPSQKLLKFLNKHYNLETTVPQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 ILDFYIHESVQRHGHGRELFQYMLQKERVEPHQLAIDRPSQKLLKFLNKHYNLETTVPQV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 NNFVIFEGFFAHQHRPPAPSLRATRHSRAAAVDPTPAAPARKLPPKRAEGDIKPYSSSDR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 NNFVIFEGFFAHQHRPPAPSLRATRHSRAAAVDPTPAAPARKLPPKRAEGDIKPYSSSDR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE2 EFLKVAVEPPWPLNRAPRRATPPAHPPPRSSSLGNSPERGPLRPFVPEQELLRSLRLCPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 EFLKVAVEPPWPLNRAPRRATPPAHPPPRSSSLGNSPERGPLRPFVPEQELLRSLRLCPP
              250       260       270       280       290       300

              310       320       330
pF1KE2 HPTARLLLAADPGGSPAQRRRTSSLPRSEESRY
       ::::::::::::::::::::::
CCDS83 HPTARLLLAADPGGSPAQRRRTR
              310       320

>>CCDS54978.1 ATAT1 gene_id:79969|Hs108|chr6              (409 aa)
 initn: 1982 init1: 1949 opt: 1954  Z-score: 1395.2  bits: 266.8 E(32554): 2.2e-71
Smith-Waterman score: 1954; 96.3% identity (97.3% similar) in 300 aa overlap (27-326:15-314)

               10        20        30        40        50        60
pF1KE2 MEFPFDVDALFPERITVLDQHLRPPARRPGTTTPARVDLQQQIMTIIDELGKASAKAQNL
                                 :. :.     ::::::::::::::::::::::::
CCDS54             MWLTWPFCFLTITLREEGVCHLESVDLQQQIMTIIDELGKASAKAQNL
                           10        20        30        40

               70        80        90       100       110       120
pF1KE2 SAPITSASRMQSNRHVVYILKDSSARPAGKGAIIGFIKVGYKKLFVLDDREAHNEVEPLC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SAPITSASRMQSNRHVVYILKDSSARPAGKGAIIGFIKVGYKKLFVLDDREAHNEVEPLC
       50        60        70        80        90       100

              130       140       150       160       170       180
pF1KE2 ILDFYIHESVQRHGHGRELFQYMLQKERVEPHQLAIDRPSQKLLKFLNKHYNLETTVPQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ILDFYIHESVQRHGHGRELFQYMLQKERVEPHQLAIDRPSQKLLKFLNKHYNLETTVPQV
      110       120       130       140       150       160

              190       200       210       220       230       240
pF1KE2 NNFVIFEGFFAHQHRPPAPSLRATRHSRAAAVDPTPAAPARKLPPKRAEGDIKPYSSSDR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 NNFVIFEGFFAHQHRPPAPSLRATRHSRAAAVDPTPAAPARKLPPKRAEGDIKPYSSSDR
      170       180       190       200       210       220

              250       260       270       280       290       300
pF1KE2 EFLKVAVEPPWPLNRAPRRATPPAHPPPRSSSLGNSPERGPLRPFVPEQELLRSLRLCPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 EFLKVAVEPPWPLNRAPRRATPPAHPPPRSSSLGNSPERGPLRPFVPEQELLRSLRLCPP
      230       240       250       260       270       280

              310       320       330
pF1KE2 HPTARLLLAADPGGSPAQRRRTSSLPRSEESRY
       :::::::::::::::::::::: . :
CCDS54 HPTARLLLAADPGGSPAQRRRTRGTPPGLVAQSCCYSRHGGVNSSSPNTGNQDSKQGEQE
      290       300       310       320       330       340

>>CCDS83073.1 ATAT1 gene_id:79969|Hs108|chr6              (300 aa)
 initn: 2025 init1: 1300 opt: 1326  Z-score: 954.1  bits: 184.7 E(32554): 8e-47
Smith-Waterman score: 1976; 92.9% identity (92.9% similar) in 322 aa overlap (1-322:1-299)

               10        20        30        40        50        60
pF1KE2 MEFPFDVDALFPERITVLDQHLRPPARRPGTTTPARVDLQQQIMTIIDELGKASAKAQNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 MEFPFDVDALFPERITVLDQHLRPPARRPGTTTPARVDLQQQIMTIIDELGKASAKAQNL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 SAPITSASRMQSNRHVVYILKDSSARPAGKGAIIGFIKVGYKKLFVLDDREAHNEVEPLC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 SAPITSASRMQSNRHVVYILKDSSARPAGKGAIIGFIKVGYKKLFVLDDREAHNEVEPLC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 ILDFYIHESVQRHGHGRELFQYMLQKERVEPHQLAIDRPSQKLLKFLNKHYNLETTVPQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 ILDFYIHESVQRHGHGRELFQYMLQKERVEPHQLAIDRPSQKLLKFLNKHYNLETTVPQV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 NNFVIFEGFFAHQHRPPAPSLRATRHSRAAAVDPTPAAPARKLPPKRAEGDIKPYSSSDR
       :::::::::::::: :::                      ::::::::::::::::::::
CCDS83 NNFVIFEGFFAHQH-PPA----------------------RKLPPKRAEGDIKPYSSSDR
              190                              200       210

              250       260       270       280       290       300
pF1KE2 EFLKVAVEPPWPLNRAPRRATPPAHPPPRSSSLGNSPERGPLRPFVPEQELLRSLRLCPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 EFLKVAVEPPWPLNRAPRRATPPAHPPPRSSSLGNSPERGPLRPFVPEQELLRSLRLCPP
       220       230       240       250       260       270

              310       320       330
pF1KE2 HPTARLLLAADPGGSPAQRRRTSSLPRSEESRY
       ::::::::::::::::::::::
CCDS83 HPTARLLLAADPGGSPAQRRRTR
       280       290       300

>>CCDS59002.1 ATAT1 gene_id:79969|Hs108|chr6              (310 aa)
 initn: 2094 init1: 1300 opt: 1326  Z-score: 953.9  bits: 184.7 E(32554): 8.2e-47
Smith-Waterman score: 2045; 93.1% identity (93.1% similar) in 333 aa overlap (1-333:1-310)

               10        20        30        40        50        60
pF1KE2 MEFPFDVDALFPERITVLDQHLRPPARRPGTTTPARVDLQQQIMTIIDELGKASAKAQNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MEFPFDVDALFPERITVLDQHLRPPARRPGTTTPARVDLQQQIMTIIDELGKASAKAQNL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 SAPITSASRMQSNRHVVYILKDSSARPAGKGAIIGFIKVGYKKLFVLDDREAHNEVEPLC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 SAPITSASRMQSNRHVVYILKDSSARPAGKGAIIGFIKVGYKKLFVLDDREAHNEVEPLC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 ILDFYIHESVQRHGHGRELFQYMLQKERVEPHQLAIDRPSQKLLKFLNKHYNLETTVPQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 ILDFYIHESVQRHGHGRELFQYMLQKERVEPHQLAIDRPSQKLLKFLNKHYNLETTVPQV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 NNFVIFEGFFAHQHRPPAPSLRATRHSRAAAVDPTPAAPARKLPPKRAEGDIKPYSSSDR
       :::::::::::::: :::                      ::::::::::::::::::::
CCDS59 NNFVIFEGFFAHQH-PPA----------------------RKLPPKRAEGDIKPYSSSDR
              190                              200       210

              250       260       270       280       290       300
pF1KE2 EFLKVAVEPPWPLNRAPRRATPPAHPPPRSSSLGNSPERGPLRPFVPEQELLRSLRLCPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 EFLKVAVEPPWPLNRAPRRATPPAHPPPRSSSLGNSPERGPLRPFVPEQELLRSLRLCPP
       220       230       240       250       260       270

              310       320       330
pF1KE2 HPTARLLLAADPGGSPAQRRRTSSLPRSEESRY
       :::::::::::::::::::::::::::::::::
CCDS59 HPTARLLLAADPGGSPAQRRRTSSLPRSEESRY
       280       290       300       310


333 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov  6 16:52:34 2016 done: Sun Nov  6 16:52:35 2016
 Total Scan time:  2.960 Total Display time:  0.000

Function used was FASTA [36.3.4 Apr, 2011]