FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3707, 301 aa
  1>>>pF1KE3707 301 - 301 aa - 301 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 10.5931+/-0.00102; mu= -6.2587+/- 0.062
 mean_var=347.6654+/-70.497, 0's: 0 Z-trim(114.1): 31 B-trim: 0 in 0/53
 Lambda= 0.068785
 statistics sampled from 14646 (14675) to 14646 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.772), E-opt: 0.2 (0.451), width: 16
 Scan time: 2.450

The best scores are:                                  opt  bits E(32554)
CCDS31803.1 POU6F1 gene_id:5463|Hs108|chr12  ( 301)  1957 207.3  1.2e-53
CCDS81691.1 POU6F1 gene_id:5463|Hs108|chr12  ( 611)  1957 207.6    2e-53
CCDS55103.1 POU6F2 gene_id:11281|Hs108|chr7  ( 655)   926 105.3  1.3e-22
CCDS34620.2 POU6F2 gene_id:11281|Hs108|chr7  ( 691)   658  78.7  1.4e-14

>>CCDS31803.1 POU6F1 gene_id:5463|Hs108|chr12 (301 aa)
 initn: 1957 init1: 1957 opt: 1957 Z-score: 1076.7 bits: 207.3 E(32554): 1.2e-53
Smith-Waterman score: 1957; 100.0% identity (100.0% similar) in 301 aa overlap (1-301:1-301)

               10        20        30        40        50        60
pF1KE3 MPGISSQILTNAQGQVIGTLPWVVNSASVAAPAPAQSLQVQAVTPQLLLNAQGQVIATLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MPGISSQILTNAQGQVIGTLPWVVNSASVAAPAPAQSLQVQAVTPQLLLNAQGQVIATLA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 SSPLPPPVAVRKPSTPESPAKSEVQPIQPTPTVPQPAVVIASPAPAAKPSASAPIPITCS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 SSPLPPPVAVRKPSTPESPAKSEVQPIQPTPTVPQPAVVIASPAPAAKPSASAPIPITCS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 ETPTVSQLVSKPHTPSLDEDGINLEEIREFAKNFKIRRLSLGLTQTQVGQALTATEGPAY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 ETPTVSQLVSKPHTPSLDEDGINLEEIREFAKNFKIRRLSLGLTQTQVGQALTATEGPAY
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE3 SQSAICRFEKLDITPKSAQKLKPVLEKWLNEAELRNQEGQQNLMEFVGGEPSKKRKRRTS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 SQSAICRFEKLDITPKSAQKLKPVLEKWLNEAELRNQEGQQNLMEFVGGEPSKKRKRRTS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE3 FTPQAIEALNAYFEKNPLPTGQEITEIAKELNYDREVVRVWFCNRRQTLKNTSKLNVFQI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 FTPQAIEALNAYFEKNPLPTGQEITEIAKELNYDREVVRVWFCNRRQTLKNTSKLNVFQI
              250       260       270       280       290       300

pF1KE3 P
       :
CCDS31 P

>>CCDS81691.1 POU6F1 gene_id:5463|Hs108|chr12 (611 aa)
 initn: 1957 init1: 1957 opt: 1957 Z-score: 1072.7 bits: 207.6 E(32554): 2e-53
Smith-Waterman score: 1957; 100.0% identity (100.0% similar) in 301 aa overlap (1-301:311-611)

                                             10        20        30
pF1KE3                               MPGISSQILTNAQGQVIGTLPWVVNSASVA
                                     ::::::::::::::::::::::::::::::
CCDS81 GIISAASLGGQTQILGSLTTAPVITSAIPSMPGISSQILTNAQGQVIGTLPWVVNSASVA
              290       300       310       320       330       340

               40        50        60        70        80        90
pF1KE3 APAPAQSLQVQAVTPQLLLNAQGQVIATLASSPLPPPVAVRKPSTPESPAKSEVQPIQPT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 APAPAQSLQVQAVTPQLLLNAQGQVIATLASSPLPPPVAVRKPSTPESPAKSEVQPIQPT
              350       360       370       380       390       400

              100       110       120       130       140       150
pF1KE3 PTVPQPAVVIASPAPAAKPSASAPIPITCSETPTVSQLVSKPHTPSLDEDGINLEEIREF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 PTVPQPAVVIASPAPAAKPSASAPIPITCSETPTVSQLVSKPHTPSLDEDGINLEEIREF
              410       420       430       440       450       460

              160       170       180       190       200       210
pF1KE3 AKNFKIRRLSLGLTQTQVGQALTATEGPAYSQSAICRFEKLDITPKSAQKLKPVLEKWLN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 AKNFKIRRLSLGLTQTQVGQALTATEGPAYSQSAICRFEKLDITPKSAQKLKPVLEKWLN
              470       480       490       500       510       520

              220       230       240       250       260       270
pF1KE3 EAELRNQEGQQNLMEFVGGEPSKKRKRRTSFTPQAIEALNAYFEKNPLPTGQEITEIAKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 EAELRNQEGQQNLMEFVGGEPSKKRKRRTSFTPQAIEALNAYFEKNPLPTGQEITEIAKE
              530       540       550       560       570       580

              280       290       300
pF1KE3 LNYDREVVRVWFCNRRQTLKNTSKLNVFQIP
       :::::::::::::::::::::::::::::::
CCDS81 LNYDREVVRVWFCNRRQTLKNTSKLNVFQIP
              590       600       610

>>CCDS55103.1 POU6F2 gene_id:11281|Hs108|chr7 (655 aa)
 initn: 1102 init1: 896 opt: 926 Z-score: 519.4 bits: 105.3 E(32554): 1.3e-22
Smith-Waterman score: 1117; 59.3% identity (80.7% similar) in 305 aa overlap (1-294:328-631)

                                             10        20
pF1KE3                               MPGISSQILTNAQGQVIGTLPWVVNSA-SV
                                     . :...:..::::::.:::.: . : . :
CCDS55 NNPLASQAAAAAAAMSSIASSQAFGNALSSLQGVTGQLVTNAQGQIIGTIPLMPNPGPSS
       300       310       320       330       340       350

      30        40        50        60        70         80
pF1KE3 AAPAPAQSLQVQAVTPQLLLNAQGQVIATLASSPLPPPVAVRKPS-TPESPAKSEVQPIQ
        : . .:.:::: .::::: :::::.:::. .. . : . ..  . .: .:...  :: :
CCDS55 QAASGTQGLQVQPITPQLLTNAQGQIIATVIGNQILPVINTQGITLSPIKPGQQLHQPSQ
       360       370       380       390       400       410

       90          100             110       120       130
pF1KE3 PT--PTVPQPAVV-IA------SPAPAAKPSASAPIPITCSETPTVSQLVSKPHTPSLDE
        .   .. :  .. .:      : .:. . :.:.    . : . .:.::::.:.: . .
CCDS55 TSVGQAASQGNLLHLAHSQASMSQSPVRQASSSSSSS-SSSSALSVGQLVSNPQTAAGEV
       420       430       440       450        460       470

     140       150       160       170       180       190
pF1KE3 DGINLEEIREFAKNFKIRRLSLGLTQTQVGQALTATEGPAYSQSAICRFEKLDITPKSAQ
       ::.:::::::::: :::::::::::::::::::.::::::::::::::::::::::::::
CCDS55 DGVNLEEIREFAKAFKIRRLSLGLTQTQVGQALSATEGPAYSQSAICRFEKLDITPKSAQ
        480       490       500       510       520       530

     200       210       220       230       240       250
pF1KE3 KLKPVLEKWLNEAELRNQEGQQNLMEFVGGEPSKKRKRRTSFTPQAIEALNAYFEKNPLP
       :.:::::.:. ::: :.. :.::: ::.:.::::::::::::::::.: :::.::::  :
CCDS55 KIKPVLERWMAEAEARHRAGMQNLTEFIGSEPSKKRKRRTSFTPQALEILNAHFEKNTHP
        540       550       560       570       580       590

     260       270       280       290       300
pF1KE3 TGQEITEIAKELNYDREVVRVWFCNRRQTLKNTSKLNVFQIP
       .:::.::::..::::::::::::::.::.:::: :
CCDS55 SGQEMTEIAEKLNYDREVVRVWFCNKRQALKNTIKRLKQHEPATAVPLEPLTDSLEENS
        600       610       620       630       640       650

>>CCDS34620.2 POU6F2 gene_id:11281|Hs108|chr7 (691 aa)
 initn: 1102 init1: 579 opt: 658 Z-score: 375.3 bits: 78.7 E(32554): 1.4e-14
Smith-Waterman score: 977; 53.2% identity (71.3% similar) in 327 aa overlap (15-294:342-667)

                               10        20         30        40
pF1KE3                 MPGISSQILTNAQGQVIGTLPWVVNSA-SVAAPAPAQSLQVQAV
                                     :.:::.: . : . :  : . .:.:::: .
CCDS34 MSSIASSQAFGNALSSLQGVTGQLVTNAQGQIIGTIPLMPNPGPSSQAASGTQGLQVQPI
             320       330       340       350       360       370

            50        60        70         80        90
pF1KE3 TPQLLLNAQGQVIATLASSPLPPPVAVRKPS-TPESPAKSEVQPIQPT--PTVPQPAVV-
       ::::: :::::.:::. .. . : . ..  . .: .:...  :: : .   .. :  ..
CCDS34 TPQLLTNAQGQIIATVIGNQILPVINTQGITLSPIKPGQQLHQPSQTSVGQAASQGNLLH
             380       390       400       410       420       430

     100             110       120       130       140       150
pF1KE3 IA------SPAPAAKPSASAPIPITCSETPTVSQLVSKPHTPSLDEDGINLEEIREFAKN
       .:      : .:. . :.:.    . : . .:.::::.:.: . . ::.::::::::::
CCDS34 LAHSQASMSQSPVRQASSSSSSS-SSSSALSVGQLVSNPQTAAGEVDGVNLEEIREFAKA
             440       450        460       470       480       490

           160       170       180
pF1KE3 FKIRRLSLGLTQTQVGQALTATEGPAYSQSAICR--------------------------
       :::::::::::::::::::.::::::::::::::
CCDS34 FKIRRLSLGLTQTQVGQALSATEGPAYSQSAICRHTILRSHFFLPQEAQENTIASSLTAK
              500       510       520       530       540       550

                 190       200       210       220       230
pF1KE3 ----------FEKLDITPKSAQKLKPVLEKWLNEAELRNQEGQQNLMEFVGGEPSKKRKR
                 :::::::::::::.:::::.:. ::: :.. :.::: ::.:.::::::::
CCDS34 LNPGLLYPARFEKLDITPKSAQKIKPVLERWMAEAEARHRAGMQNLTEFIGSEPSKKRKR
              560       570       580       590       600       610

       240       250       260       270       280       290
pF1KE3 RTSFTPQAIEALNAYFEKNPLPTGQEITEIAKELNYDREVVRVWFCNRRQTLKNTSKLNV
       ::::::::.: :::.::::  :.:::.::::..::::::::::::::.::.:::: :
CCDS34 RTSFTPQALEILNAHFEKNTHPSGQEMTEIAEKLNYDREVVRVWFCNKRQALKNTIKRLK
              620       630       640       650       660       670

       300
pF1KE3 FQIP

CCDS34 QHEPATAVPLEPLTDSLEENS
              680       690


301 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 06:34:26 2016 done: Sun Nov 6 06:34:26 2016
 Total Scan time: 2.450 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]