FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7704, 462 aa 1>>>pF1KB7704 462 - 462 aa - 462 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.3938+/-0.00115; mu= -4.7728+/- 0.070 mean_var=289.7392+/-58.259, 0's: 0 Z-trim(111.3): 15 B-trim: 0 in 0/54 Lambda= 0.075348 statistics sampled from 12298 (12304) to 12298 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.699), E-opt: 0.2 (0.378), width: 16 Scan time: 3.420 The best scores are: opt bits E(32554) CCDS35347.1 TAF7L gene_id:54457|Hs108|chrX ( 462) 2984 337.9 1.3e-92 CCDS55466.1 TAF7L gene_id:54457|Hs108|chrX ( 376) 2439 278.6 7.7e-75 CCDS4259.1 TAF7 gene_id:6879|Hs108|chr5 ( 349) 726 92.4 8.3e-19 >>CCDS35347.1 TAF7L gene_id:54457|Hs108|chrX (462 aa) initn: 2984 init1: 2984 opt: 2984 Z-score: 1776.0 bits: 337.9 E(32554): 1.3e-92 Smith-Waterman score: 2984; 99.8% identity (99.8% similar) in 462 aa overlap (1-462:1-462) 10 20 30 40 50 60 pF1KB7 MECPEGQLPISSENDSTPTVSTSEVTSQQEPQIPVDRGSETTYESSADIAGDEGTQIPAD ::::::::::::::::::::::::::::::::: :::::::::::::::::::::::::: CCDS35 MECPEGQLPISSENDSTPTVSTSEVTSQQEPQILVDRGSETTYESSADIAGDEGTQIPAD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 EDTQTDADSSAQAAAQAPENFQEGKDMSESQDEVPDEVENQFILRLPLEHACTVRNLARS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 EDTQTDADSSAQAAAQAPENFQEGKDMSESQDEVPDEVENQFILRLPLEHACTVRNLARS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 QSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTADISQM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 QSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTADISQM 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 LVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRKTQKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 LVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRKTQKK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 VPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGSIPGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 VPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGSIPGF 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 LISSGMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDKEEEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 LISSGMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDKEEEEE 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB7 DCSEEYLERQLQAEFIESGQYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQRQKDLIM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 DCSEEYLERQLQAEFIESGQYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQRQKDLIM 370 380 390 400 410 420 430 440 450 460 pF1KB7 KVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK :::::::::::::::::::::::::::::::::::::::::: CCDS35 KVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK 430 440 450 460 >>CCDS55466.1 TAF7L gene_id:54457|Hs108|chrX (376 aa) initn: 2439 init1: 2439 opt: 2439 Z-score: 1457.1 bits: 278.6 E(32554): 7.7e-75 Smith-Waterman score: 2439; 100.0% identity (100.0% similar) in 376 aa overlap (87-462:1-376) 60 70 80 90 100 110 pF1KB7 IPADEDTQTDADSSAQAAAQAPENFQEGKDMSESQDEVPDEVENQFILRLPLEHACTVRN :::::::::::::::::::::::::::::: CCDS55 MSESQDEVPDEVENQFILRLPLEHACTVRN 10 20 30 120 130 140 150 160 170 pF1KB7 LARSQSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 LARSQSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTAD 40 50 60 70 80 90 180 190 200 210 220 230 pF1KB7 ISQMLVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ISQMLVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRK 100 110 120 130 140 150 240 250 260 270 280 290 pF1KB7 TQKKVPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 TQKKVPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGS 160 170 180 190 200 210 300 310 320 330 340 350 pF1KB7 IPGFLISSGMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 IPGFLISSGMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDKE 220 230 240 250 260 270 360 370 380 390 400 410 pF1KB7 EEEEDCSEEYLERQLQAEFIESGQYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQRQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 EEEEDCSEEYLERQLQAEFIESGQYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQRQK 280 290 300 310 320 330 420 430 440 450 460 pF1KB7 DLIMKVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK :::::::::::::::::::::::::::::::::::::::::::::: CCDS55 DLIMKVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK 340 350 360 370 >>CCDS4259.1 TAF7 gene_id:6879|Hs108|chr5 (349 aa) initn: 1213 init1: 669 opt: 726 Z-score: 451.2 bits: 92.4 E(32554): 8.3e-19 Smith-Waterman score: 1288; 57.1% identity (79.1% similar) in 378 aa overlap (87-462:1-349) 60 70 80 90 100 110 pF1KB7 IPADEDTQTDADSSAQAAAQAPENFQEGKDMSESQDEVPDEVENQFILRLPLEHACTVRN ::.:.:..: :.:.::::::: :.: ::: CCDS42 MSKSKDDAPHELESQFILRLPPEYASTVRR 10 20 30 120 130 140 150 160 170 pF1KB7 LARSQSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTAD ..: :..::.: :.: :::::..:.:. ::::.::::::::.:::.:.:::::::::: CCDS42 AVQSGHVNLKDRLTIELHPDGRHGIVRVDRVPLASKLVDLPCVMESLKTIDKKTFYKTAD 40 50 60 70 80 90 180 190 200 210 220 230 pF1KB7 ISQMLVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRK : :::: :.:::.. :::.:::::. .::.. .:.: .:.:::: :::::::.:::: CCDS42 ICQMLVSTVDGDLYPPVEEPVASTDPKASKKKDKDKEKKFIWNHGITLPLKNVRKRRFRK 100 110 120 130 140 150 240 250 260 270 280 290 pF1KB7 TQKKVPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGS : :: .::::::::.:::::: .::::::::::.:::: ::: :.:: CCDS42 TAKK-------------KYIESPDVEKEVKRLLSTDAEAVSTRWEIIAEDETKEAENQG- 160 170 180 190 300 310 320 330 340 350 pF1KB7 IPGFLISS-GMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDK . ::: :::.:.::: : :.: :::.:.: :.. .:::: . .:: :: . CCDS42 ---LDISSPGMSGHRQGHDSLEHDELREIFNDLSSSS------EDEDETQHQDE-EDINI 200 210 220 230 240 360 370 380 390 400 410 pF1KB7 EEEEEDCSEEYLERQLQAEFIESG-QYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQR . ::: :::::: .. :: :.. ::::...:: :::::.. . ::.. :..:.: CCDS42 IDTEED-----LERQLQDKLNESDEQHQENEGTNQLVMGIQKQIDNMKGKLQETQDRAKR 250 260 270 280 290 300 420 430 440 450 460 pF1KB7 QKDLIMKVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK :.:::::::::.:::.::.::..:. .: ...:.: ::::.:. .:.: CCDS42 QEDLIMKVENLALKNRFQAVLDELKQKEDREKEQLSSLQEELESLLEK 310 320 330 340 462 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 17:03:34 2016 done: Mon Nov 7 17:03:35 2016 Total Scan time: 3.420 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]