FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7704, 462 aa
1>>>pF1KB7704 462 - 462 aa - 462 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 10.3938+/-0.00115; mu= -4.7728+/- 0.070
mean_var=289.7392+/-58.259, 0's: 0 Z-trim(111.3): 15 B-trim: 0 in 0/54
Lambda= 0.075348
statistics sampled from 12298 (12304) to 12298 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.699), E-opt: 0.2 (0.378), width: 16
Scan time: 3.420
The best scores are:                                       opt  bits E(32554)
CCDS35347.1 TAF7L gene_id:54457|Hs108|chrX         ( 462) 2984 337.9 1.3e-92
CCDS55466.1 TAF7L gene_id:54457|Hs108|chrX         ( 376) 2439 278.6 7.7e-75
CCDS4259.1 TAF7 gene_id:6879|Hs108|chr5            ( 349)  726  92.4 8.3e-19
>>CCDS35347.1 TAF7L gene_id:54457|Hs108|chrX (462 aa)
initn: 2984 init1: 2984 opt: 2984 Z-score: 1776.0 bits: 337.9 E(32554): 1.3e-92
Smith-Waterman score: 2984; 99.8% identity (99.8% similar) in 462 aa overlap (1-462:1-462)
               10        20        30        40        50        60
pF1KB7 MECPEGQLPISSENDSTPTVSTSEVTSQQEPQIPVDRGSETTYESSADIAGDEGTQIPAD
       ::::::::::::::::::::::::::::::::: ::::::::::::::::::::::::::
CCDS35 MECPEGQLPISSENDSTPTVSTSEVTSQQEPQILVDRGSETTYESSADIAGDEGTQIPAD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 EDTQTDADSSAQAAAQAPENFQEGKDMSESQDEVPDEVENQFILRLPLEHACTVRNLARS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 EDTQTDADSSAQAAAQAPENFQEGKDMSESQDEVPDEVENQFILRLPLEHACTVRNLARS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 QSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTADISQM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 QSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTADISQM
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 LVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRKTQKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 LVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRKTQKK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 VPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGSIPGF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 VPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGSIPGF
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 LISSGMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDKEEEEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 LISSGMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDKEEEEE
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB7 DCSEEYLERQLQAEFIESGQYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQRQKDLIM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 DCSEEYLERQLQAEFIESGQYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQRQKDLIM
              370       380       390       400       410       420

              430       440       450       460
pF1KB7 KVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK
       ::::::::::::::::::::::::::::::::::::::::::
CCDS35 KVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK
              430       440       450       460

>>CCDS55466.1 TAF7L gene_id:54457|Hs108|chrX (376 aa)
initn: 2439 init1: 2439 opt: 2439 Z-score: 1457.1 bits: 278.6 E(32554): 7.7e-75
Smith-Waterman score: 2439; 100.0% identity (100.0% similar) in 376 aa overlap (87-462:1-376)
         60        70        80        90       100       110
pF1KB7 IPADEDTQTDADSSAQAAAQAPENFQEGKDMSESQDEVPDEVENQFILRLPLEHACTVRN
                                     ::::::::::::::::::::::::::::::
CCDS55                               MSESQDEVPDEVENQFILRLPLEHACTVRN
                                             10        20        30

        120       130       140       150       160       170
pF1KB7 LARSQSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTAD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 LARSQSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTAD
               40        50        60        70        80        90

        180       190       200       210       220       230
pF1KB7 ISQMLVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 ISQMLVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRK
              100       110       120       130       140       150

        240       250       260       270       280       290
pF1KB7 TQKKVPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 TQKKVPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGS
              160       170       180       190       200       210

        300       310       320       330       340       350
pF1KB7 IPGFLISSGMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 IPGFLISSGMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDKE
              220       230       240       250       260       270

        360       370       380       390       400       410
pF1KB7 EEEEDCSEEYLERQLQAEFIESGQYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQRQK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 EEEEDCSEEYLERQLQAEFIESGQYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQRQK
              280       290       300       310       320       330

        420       430       440       450       460
pF1KB7 DLIMKVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 DLIMKVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK
              340       350       360       370

>>CCDS4259.1 TAF7 gene_id:6879|Hs108|chr5 (349 aa)
initn: 1213 init1: 669 opt: 726 Z-score: 451.2 bits: 92.4 E(32554): 8.3e-19
Smith-Waterman score: 1288; 57.1% identity (79.1% similar) in 378 aa overlap (87-462:1-349)
         60        70        80        90       100       110
pF1KB7 IPADEDTQTDADSSAQAAAQAPENFQEGKDMSESQDEVPDEVENQFILRLPLEHACTVRN
                                     ::.:.:..: :.:.::::::: :.: :::
CCDS42                               MSKSKDDAPHELESQFILRLPPEYASTVRR
                                             10        20        30

        120       130       140       150       160       170
pF1KB7 LARSQSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTAD
        ..:  :..::.: :.: :::::..:.:. ::::.::::::::.:::.:.::::::::::
CCDS42 AVQSGHVNLKDRLTIELHPDGRHGIVRVDRVPLASKLVDLPCVMESLKTIDKKTFYKTAD
               40        50        60        70        80        90

        180       190       200       210       220       230
pF1KB7 ISQMLVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRK
       : :::: :.:::..   :::.:::::.  .::.. .:.: .:.:::: :::::::.::::
CCDS42 ICQMLVSTVDGDLYPPVEEPVASTDPKASKKKDKDKEKKFIWNHGITLPLKNVRKRRFRK
              100       110       120       130       140       150

        240       250       260       270       280       290
pF1KB7 TQKKVPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGS
       : ::             .::::::::.:::::: .::::::::::.:::: ::: :.::
CCDS42 TAKK-------------KYIESPDVEKEVKRLLSTDAEAVSTRWEIIAEDETKEAENQG-
                           160       170       180       190

        300        310       320       330       340       350
pF1KB7 IPGFLISS-GMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDK
          . ::: :::.:.::: : :.: :::.:.:  :..      .:::: . .:: :: .
CCDS42 ---LDISSPGMSGHRQGHDSLEHDELREIFNDLSSSS------EDEDETQHQDE-EDINI
           200       210       220       230             240

         360       370        380       390       400       410
pF1KB7 EEEEEDCSEEYLERQLQAEFIESG-QYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQR
        . :::     :::::: .. ::  :.. ::::...:: :::::.. . ::.. :..:.:
CCDS42 IDTEED-----LERQLQDKLNESDEQHQENEGTNQLVMGIQKQIDNMKGKLQETQDRAKR
        250            260       270       280       290       300

          420       430       440       450       460
pF1KB7 QKDLIMKVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK
       :.:::::::::.:::.::.::..:. .: ...:.: ::::.:. .:.:
CCDS42 QEDLIMKVENLALKNRFQAVLDELKQKEDREKEQLSSLQEELESLLEK
             310       320       330       340

462 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 17:03:34 2016 done: Mon Nov 7 17:03:35 2016
Total Scan time: 3.420 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]