FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7634, 375 aa 1>>>pF1KB7634 375 - 375 aa - 375 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.2542+/-0.000758; mu= 4.4673+/- 0.046 mean_var=154.9003+/-31.817, 0's: 0 Z-trim(114.0): 12 B-trim: 254 in 2/53 Lambda= 0.103050 statistics sampled from 14568 (14580) to 14568 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.773), E-opt: 0.2 (0.448), width: 16 Scan time: 3.230 The best scores are: opt bits E(32554) CCDS9724.1 TBPL2 gene_id:387332|Hs108|chr14 ( 375) 2476 379.3 3.1e-105 CCDS5315.1 TBP gene_id:6908|Hs108|chr6 ( 339) 1234 194.6 1.1e-49 CCDS55077.1 TBP gene_id:6908|Hs108|chr6 ( 319) 1217 192.1 6e-49 CCDS5168.1 TBPL1 gene_id:9519|Hs108|chr6 ( 186) 451 78.1 7.2e-15 >>CCDS9724.1 TBPL2 gene_id:387332|Hs108|chr14 (375 aa) initn: 2476 init1: 2476 opt: 2476 Z-score: 2002.8 bits: 379.3 E(32554): 3.1e-105 Smith-Waterman score: 2476; 99.7% identity (99.7% similar) in 375 aa overlap (1-375:1-375) 10 20 30 40 50 60 pF1KB7 MASAPWPERVPRLLAPRLPSYPPPPPTVGLPSMEQEETYLELYLDQCAAQDGLAPPRSPL :::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::: CCDS97 MASAPWPERVPRLLAPRLPSYPPPPPTVGLRSMEQEETYLELYLDQCAAQDGLAPPRSPL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 FSPVVPYDMYILNASNPDTAFNSNPEVKETSGDFSSVDLSFLPDEVTQENKDQPVISKHE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 FSPVVPYDMYILNASNPDTAFNSNPEVKETSGDFSSVDLSFLPDEVTQENKDQPVISKHE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 TEENSESQSPQSRLPSPSEQDVGLGLNSSSLSNSHSQLHPGDTDSVQPSPEKPNSDSLSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 TEENSESQSPQSRLPSPSEQDVGLGLNSSSLSNSHSQLHPGDTDSVQPSPEKPNSDSLSL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 ASITPMTPMTPISECCGIVPQLQNIVSTVNLACKLDLKKIALHAKNAEYNPKRFAAVIMR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 ASITPMTPMTPISECCGIVPQLQNIVSTVNLACKLDLKKIALHAKNAEYNPKRFAAVIMR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 IREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPARFLDFKIQNMVGSCD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 IREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPARFLDFKIQNMVGSCD 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 VRFPIRLEGLVLTHQQFSSYEPELFPGLIYRMVKPRIVLLIFVSGKVVLTGAKERSEIYE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 VRFPIRLEGLVLTHQQFSSYEPELFPGLIYRMVKPRIVLLIFVSGKVVLTGAKERSEIYE 310 320 330 340 350 360 370 pF1KB7 AFENIYPILKGFKKA ::::::::::::::: CCDS97 AFENIYPILKGFKKA 370 >>CCDS5315.1 TBP gene_id:6908|Hs108|chr6 (339 aa) initn: 1429 init1: 1150 opt: 1234 Z-score: 1005.5 bits: 194.6 E(32554): 1.1e-49 Smith-Waterman score: 1234; 59.4% identity (77.5% similar) in 342 aa overlap (33-374:1-337) 10 20 30 40 50 60 pF1KB7 SAPWPERVPRLLAPRLPSYPPPPPTVGLPSMEQEETYLELYLDQCAAQDGLAPPRSPLFS :.:... : : . :. .: : :.:: CCDS53 MDQNNS-LPPYAQGLASPQGAMTPGIPIFS 10 20 70 80 90 100 110 120 pF1KB7 PVVPYDMYILNASNPDTAFNSNPEVKETSGDFSSVDLSFLPDEVTQENKDQPVISKHETE :..:: . .:. :.: . .. . . .. :....: .... . CCDS53 PMMPYGTGL----TPQPIQNTNSLSILEEQQRQQQQQQQQQQQQQQQQQQQQQQQQQQQQ 30 40 50 60 70 80 130 140 150 160 170 180 pF1KB7 ENSESQSPQSRLPSPSEQDVGLGLNSSSLSNSHSQLHPGDTDSVQPSPEKPNSDSLSLAS .....:. :. . . . :. ... :.. :: ..: .. : : .. CCDS53 QQQQQQQQQQAVAAAAVQQSTSQQATQGTSGQAPQLFHSQTLTTAPLPGTTPLYPSPMTP 90 100 110 120 130 140 190 200 210 220 230 240 pF1KB7 ITPMTPMTPISECCGIVPQLQNIVSTVNLACKLDLKKIALHAKNAEYNPKRFAAVIMRIR .::.:: :: :: :::::::::::::::.:::::: :::.:.::::::::::::::::: CCDS53 MTPITPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNPKRFAAVIMRIR 150 160 170 180 190 200 250 260 270 280 290 300 pF1KB7 EPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPARFLDFKIQNMVGSCDVR :::::::::::::::::::::::::::::::::::::::::::.:::::::::::::::. CCDS53 EPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPAKFLDFKIQNMVGSCDVK 210 220 230 240 250 260 310 320 330 340 350 360 pF1KB7 FPIRLEGLVLTHQQFSSYEPELFPGLIYRMVKPRIVLLIFVSGKVVLTGAKERSEIYEAF ::::::::::::::::::::::::::::::.:::::::::::::::::::: :.:::::: CCDS53 FPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRIVLLIFVSGKVVLTGAKVRAEIYEAF 270 280 290 300 310 320 370 pF1KB7 ENIYPILKGFKKA ::::::::::.: CCDS53 ENIYPILKGFRKTT 330 >>CCDS55077.1 TBP gene_id:6908|Hs108|chr6 (319 aa) initn: 1429 init1: 1150 opt: 1217 Z-score: 992.3 bits: 192.1 E(32554): 6e-49 Smith-Waterman score: 1217; 61.8% identity (79.0% similar) in 319 aa overlap (56-374:3-317) 30 40 50 60 70 80 pF1KB7 PTVGLPSMEQEETYLELYLDQCAAQDGLAPPRSPLFSPVVPYDMYILNASNPDTAFNSNP : :.:::..:: . .:. :.: CCDS55 MTPGIPIFSPMMPYGTGL----TPQPIQNTNS 10 20 90 100 110 120 130 140 pF1KB7 EVKETSGDFSSVDLSFLPDEVTQENKDQPVISKHETEENSESQSPQSRLPSPSEQDVGLG . .. . . .. :....: .... ......:. :. . . . :. CCDS55 LSILEEQQRQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQAVAAAAVQQSTSQ 30 40 50 60 70 80 150 160 170 180 190 200 pF1KB7 LNSSSLSNSHSQLHPGDTDSVQPSPEKPNSDSLSLASITPMTPMTPISECCGIVPQLQNI ... :.. :: ..: .. : : .. .::.:: :: :: ::::::::: CCDS55 QATQGTSGQAPQLFHSQTLTTAPLPGTTPLYPSPMTPMTPITPATPASESSGIVPQLQNI 90 100 110 120 130 140 210 220 230 240 250 260 pF1KB7 VSTVNLACKLDLKKIALHAKNAEYNPKRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEE ::::::.:::::: :::.:.:::::::::::::::::::::::::::::::::::::::: CCDS55 VSTVNLGCKLDLKTIALRARNAEYNPKRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEE 150 160 170 180 190 200 270 280 290 300 310 320 pF1KB7 QSRLAARKYARVVQKLGFPARFLDFKIQNMVGSCDVRFPIRLEGLVLTHQQFSSYEPELF ::::::::::::::::::::.:::::::::::::::.::::::::::::::::::::::: CCDS55 QSRLAARKYARVVQKLGFPAKFLDFKIQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELF 210 220 230 240 250 260 330 340 350 360 370 pF1KB7 PGLIYRMVKPRIVLLIFVSGKVVLTGAKERSEIYEAFENIYPILKGFKKA :::::::.:::::::::::::::::::: :.::::::::::::::::.: CCDS55 PGLIYRMIKPRIVLLIFVSGKVVLTGAKVRAEIYEAFENIYPILKGFRKTT 270 280 290 300 310 >>CCDS5168.1 TBPL1 gene_id:9519|Hs108|chr6 (186 aa) initn: 371 init1: 371 opt: 451 Z-score: 380.4 bits: 78.1 E(32554): 7.2e-15 Smith-Waterman score: 451; 41.7% identity (75.0% similar) in 168 aa overlap (202-369:13-178) 180 190 200 210 220 230 pF1KB7 KPNSDSLSLASITPMTPMTPISECCGIVPQLQNIVSTVNLACKLDLKKIALHAKNAEYNP . :.: . :.:.:.::::.. :. :. CCDS51 MDADSDVALDILITNVVCVFRTRCHLNLRKIALEGANVIYK- 10 20 30 40 240 250 260 270 280 290 pF1KB7 KRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPARFLDFK . . :.:..:.:: :: :.::::..:::: :::.....::. :: .::::: . : ::: CCDS51 RDVGKVLMKLRKPRITATIWSSGKIICTGATSEEEAKFGARRLARSLQKLGFQVIFTDFK 50 60 70 80 90 100 300 310 320 330 340 350 pF1KB7 IQNMVGSCDVRFPIRLEGLVLTHQQFSSYEPELFPGLIYRMVKPRIVLLIFVSGKVVLTG . :... :.. : ::: .. ... .:::::: :.. ::. . : .: :: .:....:: CCDS51 VVNVLAVCNMPFEIRLPEFTKNNRPHASYEPELHPAVCYRIKSLRATLQIFSTGSITVTG 110 120 130 140 150 160 360 370 pF1KB7 AKERSEIYEAFENIYPILKGFKKA . .. . : :.:::.. CCDS51 PNVKA-VATAVEQIYPFVFESRKEIL 170 180 375 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 16:42:13 2016 done: Mon Nov 7 16:42:13 2016 Total Scan time: 3.230 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]