FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8316, 423 aa 1>>>pF1KB8316 423 - 423 aa - 423 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.6841+/-0.000848; mu= 2.9503+/- 0.051 mean_var=194.8874+/-39.389, 0's: 0 Z-trim(113.7): 22 B-trim: 0 in 0/52 Lambda= 0.091872 statistics sampled from 14302 (14319) to 14302 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.773), E-opt: 0.2 (0.44), width: 16 Scan time: 3.450 The best scores are: opt bits E(32554) CCDS48179.1 HSFX2 gene_id:100130086|Hs108|chrX ( 423) 2940 401.8 6.5e-112 CCDS44011.1 HSFX1 gene_id:100506164|Hs108|chrX ( 423) 2940 401.8 6.5e-112 CCDS35475.1 HSFY1 gene_id:86614|Hs108|chrY ( 401) 513 80.1 4.3e-15 CCDS14791.1 HSFY2 gene_id:159119|Hs108|chrY ( 401) 513 80.1 4.3e-15 >>CCDS48179.1 HSFX2 gene_id:100130086|Hs108|chrX (423 aa) initn: 2940 init1: 2940 opt: 2940 Z-score: 2122.7 bits: 401.8 E(32554): 6.5e-112 Smith-Waterman score: 2940; 99.8% identity (99.8% similar) in 423 aa overlap (1-423:1-423) 10 20 30 40 50 60 pF1KB8 MEDKRSLSMARCEERNSRGQDHGLERVPFPPQLQSETYLHPADPSPAWDDPGSTGSPNLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 MEDKRSLSMARCEERNSRGQDHGLERVPFPPQLQSETYLHPADPSPAWDDPGSTGSPNLR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 LLTEEIAFQPLAEEASFRRPHPDGDVPPQGEDNLLSLPFPQKLWRLVSSNQFSSIWWDDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 LLTEEIAFQPLAEEASFRRPHPDGDVPPQGEDNLLSLPFPQKLWRLVSSNQFSSIWWDDS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 GACRVINQKLFEKEILKRDVAHKVFATTSIKSFFRQLNLYGFRKRRQCTFRTFTRIFSAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 GACRVINQKLFEKEILKRDVAHKVFATTSIKSFFRQLNLYGFRKRRQCTFRTFTRIFSAK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 RLVSILNKLEFYCHPYFQRDSPHLLVRMKRRVGVKSAPRHQEEDKPEAAGSCLAPADTEQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 RLVSILNKLEFYCHPYFQRDSPHLLVRMKRRVGVKSAPRHQEEDKPEAAGSCLAPADTEQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 QDHTSPNENDQVTPQHREPAGPNTQIRSGSAPPATPVMVPDSAVASDNSPVTQPAGEWSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 QDHTSPNENDQVTPQHREPAGPNTQIRSGSAPPATPVMVPDSAVASDNSPVTQPAGEWSE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 GSQAHVTPVAAVPGPAALPFLYVPGSPTQMNSYGPVVALPTASRSTLAMDTTGLPAPGML :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 GSQAHVTPVAAVPGPAALPFLYVPGSPTQMNSYGPVVALPTASRSTLAMDTTGLPAPGML 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB8 PFCHLWVPVTLVAAGAAQPAASMVMFPHLPALHHHCPHSHCTSQYMPASDGPQAYPDYAD :::::::::::::::::::::::::::::::::::::::: ::::::::::::::::::: CCDS48 PFCHLWVPVTLVAAGAAQPAASMVMFPHLPALHHHCPHSHRTSQYMPASDGPQAYPDYAD 370 380 390 400 410 420 pF1KB8 QST ::: CCDS48 QST >>CCDS44011.1 HSFX1 gene_id:100506164|Hs108|chrX (423 aa) initn: 2940 init1: 2940 opt: 2940 Z-score: 2122.7 bits: 401.8 E(32554): 6.5e-112 Smith-Waterman score: 2940; 99.8% identity (99.8% similar) in 423 aa overlap (1-423:1-423) 10 20 30 40 50 60 pF1KB8 MEDKRSLSMARCEERNSRGQDHGLERVPFPPQLQSETYLHPADPSPAWDDPGSTGSPNLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MEDKRSLSMARCEERNSRGQDHGLERVPFPPQLQSETYLHPADPSPAWDDPGSTGSPNLR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 LLTEEIAFQPLAEEASFRRPHPDGDVPPQGEDNLLSLPFPQKLWRLVSSNQFSSIWWDDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 LLTEEIAFQPLAEEASFRRPHPDGDVPPQGEDNLLSLPFPQKLWRLVSSNQFSSIWWDDS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 GACRVINQKLFEKEILKRDVAHKVFATTSIKSFFRQLNLYGFRKRRQCTFRTFTRIFSAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 GACRVINQKLFEKEILKRDVAHKVFATTSIKSFFRQLNLYGFRKRRQCTFRTFTRIFSAK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 RLVSILNKLEFYCHPYFQRDSPHLLVRMKRRVGVKSAPRHQEEDKPEAAGSCLAPADTEQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 RLVSILNKLEFYCHPYFQRDSPHLLVRMKRRVGVKSAPRHQEEDKPEAAGSCLAPADTEQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 QDHTSPNENDQVTPQHREPAGPNTQIRSGSAPPATPVMVPDSAVASDNSPVTQPAGEWSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 QDHTSPNENDQVTPQHREPAGPNTQIRSGSAPPATPVMVPDSAVASDNSPVTQPAGEWSE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 GSQAHVTPVAAVPGPAALPFLYVPGSPTQMNSYGPVVALPTASRSTLAMDTTGLPAPGML :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 GSQAHVTPVAAVPGPAALPFLYVPGSPTQMNSYGPVVALPTASRSTLAMDTTGLPAPGML 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB8 PFCHLWVPVTLVAAGAAQPAASMVMFPHLPALHHHCPHSHCTSQYMPASDGPQAYPDYAD :::::::::::::::::::::::::::::::::::::::: ::::::::::::::::::: CCDS44 PFCHLWVPVTLVAAGAAQPAASMVMFPHLPALHHHCPHSHRTSQYMPASDGPQAYPDYAD 370 380 390 400 410 420 pF1KB8 QST ::: CCDS44 QST >>CCDS35475.1 HSFY1 gene_id:86614|Hs108|chrY (401 aa) initn: 496 init1: 300 opt: 513 Z-score: 384.5 bits: 80.1 E(32554): 4.3e-15 Smith-Waterman score: 563; 35.2% identity (59.6% similar) in 369 aa overlap (55-395:35-397) 30 40 50 60 70 80 pF1KB8 ERVPFPPQLQSETYLHPADPSPAWDDPGSTGSPNLRLLTEEIAFQPLAEEASFRRPHPDG :. .:: . :: ::: :.. . .. : CCDS35 SSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESPSYTV 10 20 30 40 50 60 90 100 110 120 130 140 pF1KB8 DVP-PQGEDNLLSLPFPQKLWRLVSSNQFSSIWWDDSGACRVINQKLFEKEILKRDVAHK : :. .:..::: ::.:::..: :.::.:: ::..:.: :::..::.::::. . .. CCDS35 CVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETKAPYR 70 80 90 100 110 120 150 160 170 180 190 200 pF1KB8 VFATTSIKSFFRQLNLYGFRKRRQCTFRT-FTRIF-SAKRLVSILNKLEFYCHPYFQRDS .: : .:::: :::::::: : .: :. : : : .. :.:.::.:: .: :.: CCDS35 IFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYYNPNFKRGY 130 140 150 160 170 180 210 220 230 240 pF1KB8 PHLLVRMKRRVGVKSA-PRHQ--EED---KPEAAG-------SCLAPADTEQQDHTSPNE :.::::.:::.:::.: : .:: : :: : :: :.. ... : .. CCDS35 PQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALA-AEASEESLFSASK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 NDQVTPQHREP------AGPNTQIRSGSAPPATPVMV-PDSAVASDNSPVTQPAGEWSEG : .. : :: :. .. :::: ::. . : :. .:.:. . . . CCDS35 NLNM-PLTRESSVRQIIANSSVPIRSGFPPPSPSTSVGPSEQIATDQHAILNQLT--TIH 250 260 270 280 290 300 310 320 330 340 350 pF1KB8 SQAHVTPVAA----VPGPAALPFLYVPGSPTQMNSYGPVVALPTASRSTLAMDTTG-LPA ..: : . : : .. : :: : . .: .: :.: . . ... : CCDS35 MHSHSTYMQARGHIVNFITTTTSQYHIISPLQNGYFGLTVE-PSAVPTRYPLVSVNEAPY 310 320 330 340 350 360 370 380 390 400 410 pF1KB8 PGMLPFCHLWVPVTLVAAGAAQPAASMVMFPHLPALHHHCPHSHCTSQYMPASDGPQAYP .::: . :. . .: .: : . ... : : ..: CCDS35 RNMLPAGNPWLQMPTIADRSAAPHSRLALQPS-PLDKYHPNYN 360 370 380 390 400 420 pF1KB8 DYADQST >>CCDS14791.1 HSFY2 gene_id:159119|Hs108|chrY (401 aa) initn: 496 init1: 300 opt: 513 Z-score: 384.5 bits: 80.1 E(32554): 4.3e-15 Smith-Waterman score: 563; 35.2% identity (59.6% similar) in 369 aa overlap (55-395:35-397) 30 40 50 60 70 80 pF1KB8 ERVPFPPQLQSETYLHPADPSPAWDDPGSTGSPNLRLLTEEIAFQPLAEEASFRRPHPDG :. .:: . :: ::: :.. . .. : CCDS14 SSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESPSYTV 10 20 30 40 50 60 90 100 110 120 130 140 pF1KB8 DVP-PQGEDNLLSLPFPQKLWRLVSSNQFSSIWWDDSGACRVINQKLFEKEILKRDVAHK : :. .:..::: ::.:::..: :.::.:: ::..:.: :::..::.::::. . .. CCDS14 CVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETKAPYR 70 80 90 100 110 120 150 160 170 180 190 200 pF1KB8 VFATTSIKSFFRQLNLYGFRKRRQCTFRT-FTRIF-SAKRLVSILNKLEFYCHPYFQRDS .: : .:::: :::::::: : .: :. : : : .. :.:.::.:: .: :.: CCDS14 IFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYYNPNFKRGY 130 140 150 160 170 180 210 220 230 240 pF1KB8 PHLLVRMKRRVGVKSA-PRHQ--EED---KPEAAG-------SCLAPADTEQQDHTSPNE :.::::.:::.:::.: : .:: : :: : :: :.. ... : .. CCDS14 PQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALA-AEASEESLFSASK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 NDQVTPQHREP------AGPNTQIRSGSAPPATPVMV-PDSAVASDNSPVTQPAGEWSEG : .. : :: :. .. :::: ::. . : :. .:.:. . . . CCDS14 NLNM-PLTRESSVRQIIANSSVPIRSGFPPPSPSTSVGPSEQIATDQHAILNQLT--TIH 250 260 270 280 290 300 310 320 330 340 350 pF1KB8 SQAHVTPVAA----VPGPAALPFLYVPGSPTQMNSYGPVVALPTASRSTLAMDTTG-LPA ..: : . : : .. : :: : . .: .: :.: . . ... : CCDS14 MHSHSTYMQARGHIVNFITTTTSQYHIISPLQNGYFGLTVE-PSAVPTRYPLVSVNEAPY 310 320 330 340 350 360 370 380 390 400 410 pF1KB8 PGMLPFCHLWVPVTLVAAGAAQPAASMVMFPHLPALHHHCPHSHCTSQYMPASDGPQAYP .::: . :. . .: .: : . ... : : ..: CCDS14 RNMLPAGNPWLQMPTIADRSAAPHSRLALQPS-PLDKYHPNYN 360 370 380 390 400 420 pF1KB8 DYADQST 423 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 04:41:25 2016 done: Mon Nov 7 04:41:26 2016 Total Scan time: 3.450 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]