FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3711, 359 aa 1>>>pF1KE3711 359 - 359 aa - 359 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.7641+/-0.000769; mu= 14.4778+/- 0.047 mean_var=223.0593+/-46.485, 0's: 0 Z-trim(117.0): 32 B-trim: 146 in 1/50 Lambda= 0.085874 statistics sampled from 17606 (17637) to 17606 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.834), E-opt: 0.2 (0.542), width: 16 Scan time: 3.420 The best scores are: opt bits E(32554) CCDS55274.1 POU5F1B gene_id:5462|Hs108|chr8 ( 359) 2518 323.9 1.3e-88 CCDS34391.1 POU5F1 gene_id:5460|Hs108|chr6 ( 360) 2413 310.9 1.1e-84 CCDS47398.2 POU5F1 gene_id:5460|Hs108|chr6 ( 190) 1251 166.6 1.6e-41 CCDS75420.1 POU5F1 gene_id:5460|Hs108|chr6 ( 164) 1086 146.0 2.1e-35 CCDS33265.1 POU3F3 gene_id:5455|Hs108|chr2 ( 500) 666 94.7 1.8e-19 CCDS59489.1 POU5F2 gene_id:134187|Hs108|chr5 ( 328) 661 93.8 2.2e-19 CCDS30679.1 POU3F1 gene_id:5453|Hs108|chr1 ( 451) 639 91.3 1.8e-18 CCDS14450.1 POU3F4 gene_id:5456|Hs108|chrX ( 361) 634 90.5 2.4e-18 CCDS5040.1 POU3F2 gene_id:5454|Hs108|chr6 ( 443) 635 90.8 2.5e-18 CCDS55655.1 POU2F1 gene_id:5451|Hs108|chr1 ( 703) 559 81.6 2.2e-15 CCDS55656.1 POU2F1 gene_id:5451|Hs108|chr1 ( 755) 559 81.7 2.3e-15 CCDS1259.2 POU2F1 gene_id:5451|Hs108|chr1 ( 766) 559 81.7 2.3e-15 CCDS8431.1 POU2F3 gene_id:25833|Hs108|chr11 ( 436) 532 78.0 1.7e-14 CCDS58190.1 POU2F3 gene_id:25833|Hs108|chr11 ( 438) 532 78.0 1.7e-14 CCDS31996.1 POU4F1 gene_id:5457|Hs108|chr13 ( 419) 489 72.6 6.6e-13 CCDS2919.1 POU1F1 gene_id:5449|Hs108|chr3 ( 291) 463 69.2 5e-12 CCDS46873.1 POU1F1 gene_id:5449|Hs108|chr3 ( 317) 463 69.3 5.3e-12 CCDS4281.1 POU4F3 gene_id:5459|Hs108|chr5 ( 338) 431 65.3 8.5e-11 >>CCDS55274.1 POU5F1B gene_id:5462|Hs108|chr8 (359 aa) initn: 2518 init1: 2518 opt: 2518 Z-score: 1704.4 bits: 323.9 E(32554): 1.3e-88 Smith-Waterman score: 2518; 99.4% identity (99.7% similar) in 359 aa overlap (1-359:1-359) 10 20 30 40 50 60 pF1KE3 MAGHLASDFAFSPPPGGGGDGPWGAEPGWVDPLTWLSFQGPPGGPGIGPGVGPGSEVWGI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MAGHLASDFAFSPPPGGGGDGPWGAEPGWVDPLTWLSFQGPPGGPGIGPGVGPGSEVWGI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 PPCPPPYELCGGMAYCGPQVGVGLVPQGGLETSQPESEAGVGVESNSNGASPEPCTVPPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 PPCPPPYELCGGMAYCGPQVGVGLVPQGGLETSQPESEAGVGVESNSNGASPEPCTVPPG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 AVKLEKEKLEQNPEKSQDIKALQKELEQFAKLLKQKRITLGYTQADVGLILGVLFGKVFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 AVKLEKEKLEQNPEKSQDIKALQKELEQFAKLLKQKRITLGYTQADVGLILGVLFGKVFS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 QTTICRFEALQLSFKNMCKLRPLLQKWVEEADNDENLQEICKAETLMQARKRKRTSIENR : :::::::::::::::::::::::::::::::.:::::::::::::::::::::::::: CCDS55 QKTICRFEALQLSFKNMCKLRPLLQKWVEEADNNENLQEICKAETLMQARKRKRTSIENR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 VRGNLENLFLQCPKPTLQISHIAQQLGLEKDVVRVWFCNRRQKGKRSSSDYAQREDFEAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 VRGNLENLFLQCPKPTLQISHIAQQLGLEKDVVRVWFCNRRQKGKRSSSDYAQREDFEAA 250 260 270 280 290 300 310 320 330 340 350 pF1KE3 GSPFSGGPVSFPPAPGPHFGTPGYGSPHFTALYSSVPFPEGEVFPPVSVITLGSPMHSN ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 GSPFSGGPVSFPPAPGPHFGTPGYGSPHFTALYSSVPFPEGEVFPPVSVITLGSPMHSN 310 320 330 340 350 >>CCDS34391.1 POU5F1 gene_id:5460|Hs108|chr6 (360 aa) initn: 1735 init1: 1735 opt: 2413 Z-score: 1634.1 bits: 310.9 E(32554): 1.1e-84 Smith-Waterman score: 2413; 95.8% identity (97.8% similar) in 360 aa overlap (1-359:1-360) 10 20 30 40 50 60 pF1KE3 MAGHLASDFAFSPPPGGGGDGPWGAEPGWVDPLTWLSFQGPPGGPGIGPGVGPGSEVWGI :::::::::::::::::::::: : ::::::: ::::::::::::::::::::::::::: CCDS34 MAGHLASDFAFSPPPGGGGDGPGGPEPGWVDPRTWLSFQGPPGGPGIGPGVGPGSEVWGI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 PPCPPPYELCGGMAYCGPQVGVGLVPQGGLETSQPESEAGVGVESNSNGASPEPCTVPPG ::::::::.:::::::::::::::::::::::::::.::::::::::.::::::::: :: CCDS34 PPCPPPYEFCGGMAYCGPQVGVGLVPQGGLETSQPEGEAGVGVESNSDGASPEPCTVTPG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 AVKLEKEKLEQNPEKSQDIKALQKELEQFAKLLKQKRITLGYTQADVGLILGVLFGKVFS ::::::::::::::.:::::::::::::::::::::::::::::::::: :::::::::: CCDS34 AVKLEKEKLEQNPEESQDIKALQKELEQFAKLLKQKRITLGYTQADVGLTLGVLFGKVFS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 QTTICRFEALQLSFKNMCKLRPLLQKWVEEADNDENLQEICKAETLMQARKRKRTSIENR :::::::::::::::::::::::::::::::::.::::::::::::.::::::::::::: CCDS34 QTTICRFEALQLSFKNMCKLRPLLQKWVEEADNNENLQEICKAETLVQARKRKRTSIENR 190 200 210 220 230 240 250 260 270 280 290 pF1KE3 VRGNLENLFLQCPKPTLQ-ISHIAQQLGLEKDVVRVWFCNRRQKGKRSSSDYAQREDFEA :::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::: CCDS34 VRGNLENLFLQCPKPTLQQISHIAQQLGLEKDVVRVWFCNRRQKGKRSSSDYAQREDFEA 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE3 AGSPFSGGPVSFPPAPGPHFGTPGYGSPHFTALYSSVPFPEGEVFPPVSVITLGSPMHSN ::::::::::::: :::::::::::::::::::::::::::::.:::::: ::::::::: CCDS34 AGSPFSGGPVSFPLAPGPHFGTPGYGSPHFTALYSSVPFPEGEAFPPVSVTTLGSPMHSN 310 320 330 340 350 360 >>CCDS47398.2 POU5F1 gene_id:5460|Hs108|chr6 (190 aa) initn: 697 init1: 697 opt: 1251 Z-score: 858.9 bits: 166.6 E(32554): 1.6e-41 Smith-Waterman score: 1251; 96.3% identity (98.4% similar) in 190 aa overlap (171-359:1-190) 150 160 170 180 190 200 pF1KE3 ALQKELEQFAKLLKQKRITLGYTQADVGLILGVLFGKVFSQTTICRFEALQLSFKNMCKL .::::::::::::::::::::::::::::: CCDS47 MGVLFGKVFSQTTICRFEALQLSFKNMCKL 10 20 30 210 220 230 240 250 pF1KE3 RPLLQKWVEEADNDENLQEICKAETLMQARKRKRTSIENRVRGNLENLFLQCPKPTLQ-I :::::::::::::.::::::::::::.::::::::::::::::::::::::::::::: : CCDS47 RPLLQKWVEEADNNENLQEICKAETLVQARKRKRTSIENRVRGNLENLFLQCPKPTLQQI 40 50 60 70 80 90 260 270 280 290 300 310 pF1KE3 SHIAQQLGLEKDVVRVWFCNRRQKGKRSSSDYAQREDFEAAGSPFSGGPVSFPPAPGPHF ::::::::::::::::::::::::::::::::::::::::::::::::::::: :::::: CCDS47 SHIAQQLGLEKDVVRVWFCNRRQKGKRSSSDYAQREDFEAAGSPFSGGPVSFPLAPGPHF 100 110 120 130 140 150 320 330 340 350 pF1KE3 GTPGYGSPHFTALYSSVPFPEGEVFPPVSVITLGSPMHSN :::::::::::::::::::::::.:::::: ::::::::: CCDS47 GTPGYGSPHFTALYSSVPFPEGEAFPPVSVTTLGSPMHSN 160 170 180 190 >>CCDS75420.1 POU5F1 gene_id:5460|Hs108|chr6 (164 aa) initn: 697 init1: 697 opt: 1086 Z-score: 749.1 bits: 146.0 E(32554): 2.1e-35 Smith-Waterman score: 1086; 96.3% identity (98.2% similar) in 164 aa overlap (197-359:1-164) 170 180 190 200 210 220 pF1KE3 VGLILGVLFGKVFSQTTICRFEALQLSFKNMCKLRPLLQKWVEEADNDENLQEICKAETL :::::::::::::::::.:::::::::::: CCDS75 MCKLRPLLQKWVEEADNNENLQEICKAETL 10 20 30 230 240 250 260 270 280 pF1KE3 MQARKRKRTSIENRVRGNLENLFLQCPKPTLQ-ISHIAQQLGLEKDVVRVWFCNRRQKGK .::::::::::::::::::::::::::::::: ::::::::::::::::::::::::::: CCDS75 VQARKRKRTSIENRVRGNLENLFLQCPKPTLQQISHIAQQLGLEKDVVRVWFCNRRQKGK 40 50 60 70 80 90 290 300 310 320 330 340 pF1KE3 RSSSDYAQREDFEAAGSPFSGGPVSFPPAPGPHFGTPGYGSPHFTALYSSVPFPEGEVFP ::::::::::::::::::::::::::: :::::::::::::::::::::::::::::.:: CCDS75 RSSSDYAQREDFEAAGSPFSGGPVSFPLAPGPHFGTPGYGSPHFTALYSSVPFPEGEAFP 100 110 120 130 140 150 350 pF1KE3 PVSVITLGSPMHSN :::: ::::::::: CCDS75 PVSVTTLGSPMHSN 160 >>CCDS33265.1 POU3F3 gene_id:5455|Hs108|chr2 (500 aa) initn: 695 init1: 477 opt: 666 Z-score: 462.9 bits: 94.7 E(32554): 1.8e-19 Smith-Waterman score: 686; 43.3% identity (60.7% similar) in 328 aa overlap (13-320:177-494) 10 20 30 pF1KE3 MAGHLASDFAFSPPPGGGGDGPWGAEPGWVD----------- ::: : : ::: . . CCDS33 PDVKGGAGRDDLHAGTALHHRGPPHLGPPPPPPHQGHPGGWGAAAAAAAAAAAAAAAAHL 150 160 170 180 190 200 40 50 60 70 80 90 pF1KE3 PLTWLSFQGPPGGPGIG-PGVGPGSEVWGIPPCPPPYELCGGMAYCGPQVGVGLVPQGGL : . : :: . . :: : : :. :: :: : : : .:: : . CCDS33 PSMAGGQQPPPQSLLYSQPG---GFTVNGMLSAPPGPGGGGGGAGGGAQ---SLVHPGLV 210 220 230 240 250 260 100 110 120 130 140 pF1KE3 ETSQPE-SEAGVGVESNSNGASPEPCTV--PP---GAVKLEKEKLEQNPEKSQDIKALQK . . :: .: . ... :.: . :: :. :... .:.. . CCDS33 RGDTPELAEHHHHHHHHAHPHPPHPHHAQGPPHHGGGGGGAGPGLNSHDPHSDEDTPTSD 270 280 290 300 310 320 150 160 170 180 190 200 pF1KE3 ELEQFAKLLKQKRITLGYTQADVGLILGVLFGKVFSQTTICRFEALQLSFKNMCKLRPLL .:::::: .::.:: ::.::::::: ::.:.:.:::::::::::::::::::::::.::: CCDS33 DLEQFAKQFKQRRIKLGFTQADVGLALGTLYGNVFSQTTICRFEALQLSFKNMCKLKPLL 330 340 350 360 370 380 210 220 230 240 250 260 pF1KE3 QKWVEEADNDENLQEICKAETLMQARKRK-RTSIENRVRGNLENLFLQCPKPTLQ-ISHI .::.::::.. . . . :.:::: ::::: :.: ::. ::.::::. : :... CCDS33 NKWLEEADSSTG-SPTSIDKIAAQGRKRKKRTSIEVSVKGALESHFLKCPKPSAQEITNL 390 400 410 420 430 270 280 290 300 310 320 pF1KE3 AQQLGLEKDVVRVWFCNRRQKGKRSSSDYAQREDFEAAGSPFSGGPVSFPPAPGPHFGTP :..: :::.:::::::::::: :: . :.. . . : : :: .: :: : CCDS33 ADSLQLEKEVVRVWFCNRRQKEKRMTPPGIQQQTPDDVYSQV--GTVS-ADTPPPHHGLQ 440 450 460 470 480 490 330 340 350 pF1KE3 GYGSPHFTALYSSVPFPEGEVFPPVSVITLGSPMHSN CCDS33 TSVQ 500 >>CCDS59489.1 POU5F2 gene_id:134187|Hs108|chr5 (328 aa) initn: 783 init1: 430 opt: 661 Z-score: 461.4 bits: 93.8 E(32554): 2.2e-19 Smith-Waterman score: 901; 47.8% identity (65.1% similar) in 341 aa overlap (1-335:1-311) 10 20 30 40 50 pF1KE3 MAGHLASDFAFSPPPGGGGDGPWGAEPGWVDPLTWLSFQGPPGG----PGIGPGVGPGSE :::: :. : : ::.:: :: : : :: ::::: :. :: :.. ::. :: . CCDS59 MAGHRPSNH-FCPLPGSGGGGPRGPMPLRVDTLTWLSTQAAPGRVMVWPAVRPGICPGPD 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 VWGIPPCPPPYELCGGMAYCGPQVGVGLVPQGGLETSQPESEAGVGVESNSNGASPEPCT :: :: : :.:. : .: : :..:. :::: .. :.:: : : CCDS59 VWRIPLGPLPHEFRGWIAPCRPRLGA--------------SEAGDWLRRPSEGALPGPYI 60 70 80 90 100 120 130 140 150 160 170 pF1KE3 VPPGAVKLEKEKLEQNPEKSQDIKALQKELEQFAKLLKQKRITLGYTQADVGLILGVLFG . . :: :: ::... :::.:.:: :.:::..:::.:::::. .:.::: CCDS59 ALRSIPKLPP------PE---DISGILKELQQLAKELRQKRLSLGYSQADVGIAVGALFG 110 120 130 140 150 180 190 200 210 220 230 pF1KE3 KVFSQTTICRFEALQLSFKNMCKLRPLLQKWVEEADNDENLQEICKAETLMQAR-KRKRT ::.:::::::::: ::: :: ::::::.::..:.. ::: .:: : ..: : .:. CCDS59 KVLSQTTICRFEAQQLSVANMWKLRPLLKKWLKEVEA-ENLLGLCKMEMILQQSGKWRRA 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE3 SIENRVRGNLENLFLQCPKPT-LQISHIAQQLGLEKDVVRVWFCNRRQKGKRSSSDYAQR : : :. ..::..: .::::: :::::: : :.:::::::: :: . :.: ..: . : CCDS59 SRERRIGNSLEKFFQRCPKPTPQQISHIAGCLQLQKDVVRVWFYNRSKMGSRPTNDASPR 220 230 240 250 260 270 300 310 320 330 340 350 pF1KE3 EDFEAAGSPFSGGPVSFPPAPGPHFGTPGYGSPHFTALYSSVPFPEGEVFPPVSVITLGS : .:: : :.:: : . .: : ::.: :::. CCDS59 EIVGTAGPPCPGAPVCFHLG----LGLP-VDIPHYTRLYSAGVAHSSAPATTLGLLRF 280 290 300 310 320 pF1KE3 PMHSN >>CCDS30679.1 POU3F1 gene_id:5453|Hs108|chr1 (451 aa) initn: 672 init1: 465 opt: 639 Z-score: 445.3 bits: 91.3 E(32554): 1.8e-18 Smith-Waterman score: 689; 43.4% identity (60.3% similar) in 343 aa overlap (2-317:134-435) 10 20 pF1KE3 MAGHLASDFAFSPPPGG-GGDGPW------- : ::. :.:: ::. :: : CCDS30 DDGGGGGGFHARLVHQGAAHAGAAWAQGSTAHHLGP--AMSPSPGASGGHQPQPLGLYAQ 110 120 130 140 150 160 30 40 50 60 70 pF1KE3 GAEPGWVDPLTWLSFQGPPGGPGIGPGVG-----PGSEVWGIPPCPPPYELCGGMAYCGP .: :: :. . :: : :::. : :. . : :::. : :. CCDS30 AAYPG--GGGGGLAGMLAAGGGGAGPGLHHALHEDGHEAQ-LEPSPPPHLGAHGHAH--- 170 180 190 200 210 80 90 100 110 120 130 pF1KE3 QVGVGLVPQGGLETSQPESEAGVGVESNSNGASPEPCTVPPGAVKLEKEKLEQNPEKSQD : . :::... . . :.: ..: : :. ... .:.: CCDS30 ----GHAHAGGLHAAAAHLHPGAGGGGSSVG-----------------EHSDEDAPSSDD 220 230 240 250 140 150 160 170 180 190 pF1KE3 IKALQKELEQFAKLLKQKRITLGYTQADVGLILGVLFGKVFSQTTICRFEALQLSFKNMC :::::: .::.:: ::.::::::: ::.:.:.::::::::::::::::::::: CCDS30 -------LEQFAKQFKQRRIKLGFTQADVGLALGTLYGNVFSQTTICRFEALQLSFKNMC 260 270 280 290 300 200 210 220 230 240 250 pF1KE3 KLRPLLQKWVEEADNDE----NLQEICKAETLMQARKRK-RTSIENRVRGNLENLFLQCP ::.:::.::.::.:.. ::..: :.:::: ::::: :.: ::. ::.:: CCDS30 KLKPLLNKWLEETDSSSGSPTNLDKIAA-----QGRKRKKRTSIEVGVKGALESHFLKCP 310 320 330 340 350 360 260 270 280 290 300 pF1KE3 KPTL-QISHIAQQLGLEKDVVRVWFCNRRQKGKR----SSSDYAQREDFEAAG--SPFSG ::. .:. .:..: :::.:::::::::::: :: ... . .: : : .: .: CCDS30 KPSAHEITGLADSLQLEKEVVRVWFCNRRQKEKRMTPAAGAGHPPMDDVYAPGELGPGGG 370 380 390 400 410 420 310 320 330 340 350 pF1KE3 G--PVSFPPAPGPHFGTPGYGSPHFTALYSSVPFPEGEVFPPVSVITLGSPMHSN : : : :: : : CCDS30 GASPPSAPPPPPPAALHHHHHHTLPGSVQ 430 440 450 >>CCDS14450.1 POU3F4 gene_id:5456|Hs108|chrX (361 aa) initn: 629 init1: 478 opt: 634 Z-score: 442.9 bits: 90.5 E(32554): 2.4e-18 Smith-Waterman score: 650; 49.2% identity (68.0% similar) in 244 aa overlap (58-294:114-343) 30 40 50 60 70 80 pF1KE3 GWVDPLTWLSFQGPPGGPGIGPGVGPGSEVWGIPPCPPPYELCGGM---AYCGPQVGV-G :: : : : .:. .: : : : CCDS14 GREDLQLGAIIHHRSPHVAHHSPHTNHPNAWGASPAPNPSITSSGQPLNVYSQPGFTVSG 90 100 110 120 130 140 90 100 110 120 130 140 pF1KE3 LVPQGGLETSQPESEAGVGVESNSNGASPEPCTV-PPGAVKLEKEKLEQNPEKSQDIKAL .. .::: . : . : .. : .: :: .: ... . ..:.. CCDS14 MLEHGGL--TPPPAAA--------SAQSLHPVLREPPDHGELGSHHCQ---DHSDEETPT 150 160 170 180 190 150 160 170 180 190 200 pF1KE3 QKELEQFAKLLKQKRITLGYTQADVGLILGVLFGKVFSQTTICRFEALQLSFKNMCKLRP . ::::::: .::.:: ::.::::::: ::.:.:.:::::::::::::::::::::::.: CCDS14 SDELEQFAKQFKQRRIKLGFTQADVGLALGTLYGNVFSQTTICRFEALQLSFKNMCKLKP 200 210 220 230 240 250 210 220 230 240 250 260 pF1KE3 LLQKWVEEADNDENLQEICKAETLMQARKRK-RTSIENRVRGNLENLFLQCPKPTLQ-IS ::.::.::::.. . . . :.:::: ::::: :.: ::. ::.::::. : :: CCDS14 LLNKWLEEADSSTG-SPTSIDKIAAQGRKRKKRTSIEVSVKGVLETHFLKCPKPAAQEIS 260 270 280 290 300 270 280 290 300 310 320 pF1KE3 HIAQQLGLEKDVVRVWFCNRRQKGKRSSSDYAQREDFEAAGSPFSGGPVSFPPAPGPHFG .:..: :::.:::::::::::: :: . :. CCDS14 SLADSLQLEKEVVRVWFCNRRQKEKRMTPPGDQQPHEVYSHTVKTDTSCHDL 310 320 330 340 350 360 330 340 350 pF1KE3 TPGYGSPHFTALYSSVPFPEGEVFPPVSVITLGSPMHSN >>CCDS5040.1 POU3F2 gene_id:5454|Hs108|chr6 (443 aa) initn: 719 init1: 479 opt: 635 Z-score: 442.7 bits: 90.8 E(32554): 2.5e-18 Smith-Waterman score: 650; 43.8% identity (62.6% similar) in 281 aa overlap (49-316:163-437) 20 30 40 50 60 70 pF1KE3 GDGPWGAEPGWVDPLTWLSFQGPPGGPGIGPGVGPGSEVWGIPPCPPPYELC-GGMAYCG :: : . . :: . ::. : CCDS50 QQQQQQQQQQQQQQQQQRPPHLVHHAANHHPGPGAWRSAAAAAHLPPSMGASNGGLLYSQ 140 150 160 170 180 190 80 90 100 110 120 pF1KE3 PQ------VGVGLVPQG----GLETSQPESEAGVGVESNSNGASPEPCTVPPGAVKLEKE :. .:.: : : ::. .. : . . . .: :: . CCDS50 PSFTVNGMLGAGGQPAGLHHHGLRDAHDEPHHADHHPHPHSHPHQQPPPPPPPQGPPGHP 200 210 220 230 240 250 130 140 150 160 170 180 pF1KE3 KLEQNPEKSQDIKALQKELEQFAKLLKQKRITLGYTQADVGLILGVLFGKVFSQTTICRF ...:....: . . .:::::: .::.:: ::.::::::: ::.:.:.:::::::::: CCDS50 GAHHDPHSDEDTPT-SDDLEQFAKQFKQRRIKLGFTQADVGLALGTLYGNVFSQTTICRF 260 270 280 290 300 310 190 200 210 220 230 240 pF1KE3 EALQLSFKNMCKLRPLLQKWVEEADNDENLQEICKAETLMQARKRK-RTSIENRVRGNLE :::::::::::::.:::.::.::::.. . . . :.:::: ::::: :.: :: CCDS50 EALQLSFKNMCKLKPLLNKWLEEADSSSG-SPTSIDKIAAQGRKRKKRTSIEVSVKGALE 320 330 340 350 360 370 250 260 270 280 290 300 pF1KE3 NLFLQCPKPTLQ-ISHIAQQLGLEKDVVRVWFCNRRQKGKRSSSDYAQREDFEAAGSPFS . ::.::::. : :. .:..: :::.:::::::::::: :: . . : . CCDS50 SHFLKCPKPSAQEITSLADSLQLEKEVVRVWFCNRRQKEKRMTPPGGTLPGAEDV----Y 380 390 400 410 420 310 320 330 340 350 pF1KE3 GGPVSFPPAPGPHFGTPGYGSPHFTALYSSVPFPEGEVFPPVSVITLGSPMHSN :: . :: : CCDS50 GGSRDTPPHHGVQTPVQ 430 440 >>CCDS55655.1 POU2F1 gene_id:5451|Hs108|chr1 (703 aa) initn: 549 init1: 335 opt: 559 Z-score: 389.7 bits: 81.6 E(32554): 2.2e-15 Smith-Waterman score: 559; 39.6% identity (62.1% similar) in 293 aa overlap (78-348:172-446) 50 60 70 80 90 100 pF1KE3 GPGVGPGSEVWGIPPCPPPYELCGGMAYCGPQVGVGLVPQGGLETSQPE-SEAGVGVESN :: ::. .: :. :. :.:.. . CCDS55 LQQQNLNLQQFVLVHPTTNLQPAQFIISQTPQGQQGLLQAQNLLTQLPQQSQANLLQSQP 150 160 170 180 190 200 110 120 130 140 150 pF1KE3 SNGASPEPCTVPP---GAVKLEKEKLEQNPEKSQDIKALQK-----ELEQFAKLLKQKRI : . .: : : .:. .. :. : : .:.. ::::::: .::.:: CCDS55 SITLTSQPAT-PTRTIAATPIQTLPQSQSTPKRIDTPSLEEPSDLEELEQFAKTFKQRRI 210 220 230 240 250 260 160 170 180 190 200 210 pF1KE3 TLGYTQADVGLILGVLFGKVFSQTTICRFEALQLSFKNMCKLRPLLQKWVEEADNDENLQ ::.::.:::: .: :.:. :::::: :::::.:::::::::.:::.::...:.: . . CCDS55 KLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNLSFKNMCKLKPLLEKWLNDAENLSSDS 270 280 290 300 310 320 220 230 240 250 260 270 pF1KE3 EICKAETL-------MQARKRKRTSIENRVRGNLENLFLQCPKPTLQ-ISHIAQQLGLEK . . .: .. :..::::::. .: ::. ::. ::: . :. ::.::..:: CCDS55 SLSSPSALNSPGIEGLSRRRKKRTSIETNIRVALEKSFLENQKPTSEEITMIADQLNMEK 330 340 350 360 370 380 280 290 300 310 320 pF1KE3 DVVRVWFCNRRQKGKRSSSDYAQREDFEAAGSPFSGGPVSFP-----PAPGPHFGTPGYG .:.:::::::::: :: . : ::: : : :.: .: CCDS55 EVIRVWFCNRRQKEKR-------------INPPSSGGTSSSPIKAIFPSPTSLVAT---- 390 400 410 420 330 340 350 pF1KE3 SPHFTALYSSVPFPEGEVFPPVSVITLGSPMHSN .: ... ... . . :.: .: CCDS55 TPSLVTSSAATTLTVSPVLPLTSAAVTNLSVTGTSDTTSNNTATVISTAPPASSAVTSPS 430 440 450 460 470 480 359 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 07:43:39 2016 done: Tue Nov 8 07:43:40 2016 Total Scan time: 3.420 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]