FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4120, 503 aa
1>>>pF1KE4120 503 - 503 aa - 503 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.7606+/-0.00122; mu= 2.1771+/- 0.071
mean_var=306.5992+/-68.738, 0's: 0 Z-trim(110.5): 794 B-trim: 541 in 1/49
Lambda= 0.073247
statistics sampled from 10683 (11666) to 10683 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.708), E-opt: 0.2 (0.358), width: 16
Scan time: 3.380
The best scores are: opt bits E(32554)
CCDS44278.1 ZBTB37 gene_id:84614|Hs108|chr1 ( 503) 3384 371.9 9.4e-103
CCDS1312.1 ZBTB37 gene_id:84614|Hs108|chr1 ( 361) 2244 251.2 1.4e-66
CCDS48023.1 ZBTB34 gene_id:403341|Hs108|chr9 ( 500) 1254 146.8 5.3e-35
CCDS6867.1 ZBTB43 gene_id:23099|Hs108|chr9 ( 467) 703 88.5 1.7e-17
>>CCDS44278.1 ZBTB37 gene_id:84614|Hs108|chr1 (503 aa)
initn: 3384 init1: 3384 opt: 3384 Z-score: 1958.2 bits: 371.9 E(32554): 9.4e-103
Smith-Waterman score: 3384; 100.0% identity (100.0% similar) in 503 aa overlap (1-503:1-503)
10 20 30 40 50 60
pF1KE4 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRDH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRDH
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 MSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCTQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 ILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRGQVSA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 ILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRGQVSA
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 VLDIRELSPPEESTSPQIIEPSSDVESREPILRINRAGQWYVETGVADRGGRSDDEVRVL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VLDIRELSPPEESTSPQIIEPSSDVESREPILRINRAGQWYVETGVADRGGRSDDEVRVL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 GAVHIKTENLEEWLGPENQPSGEDGSSAEEVTAMVIDTTGHGSVGQENYTLGSSGAKVAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 GAVHIKTENLEEWLGPENQPSGEDGSSAEEVTAMVIDTTGHGSVGQENYTLGSSGAKVAR
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 PTSSEVDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEESAMMGVSGYVEYLREQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PTSSEVDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEESAMMGVSGYVEYLREQ
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 EVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVCRMCGKKYTRKDQLEYHI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVCRMCGKKYTRKDQLEYHI
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 RKHTGNKPFHCHVCGKSFPFQAILNQHFRKNHPGCIPLEGPHSISPETTVTSRGQAEEES
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 RKHTGNKPFHCHVCGKSFPFQAILNQHFRKNHPGCIPLEGPHSISPETTVTSRGQAEEES
430 440 450 460 470 480
490 500
pF1KE4 PSQEETVAPGEAVQGSVSTTGPD
:::::::::::::::::::::::
CCDS44 PSQEETVAPGEAVQGSVSTTGPD
490 500
>>CCDS1312.1 ZBTB37 gene_id:84614|Hs108|chr1 (361 aa)
initn: 2244 init1: 2244 opt: 2244 Z-score: 1308.9 bits: 251.2 E(32554): 1.4e-66
Smith-Waterman score: 2244; 100.0% identity (100.0% similar) in 342 aa overlap (1-342:1-342)
10 20 30 40 50 60
pF1KE4 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRDH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRDH
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 MSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCTQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 ILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRGQVSA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 ILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRGQVSA
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 VLDIRELSPPEESTSPQIIEPSSDVESREPILRINRAGQWYVETGVADRGGRSDDEVRVL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 VLDIRELSPPEESTSPQIIEPSSDVESREPILRINRAGQWYVETGVADRGGRSDDEVRVL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 GAVHIKTENLEEWLGPENQPSGEDGSSAEEVTAMVIDTTGHGSVGQENYTLGSSGAKVAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GAVHIKTENLEEWLGPENQPSGEDGSSAEEVTAMVIDTTGHGSVGQENYTLGSSGAKVAR
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 PTSSEVDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEESAMMGVSGYVEYLREQ
::::::::::::::::::::::::::::::::::::::::::
CCDS13 PTSSEVDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVWSCGFRTALVVGGIATVY
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 EVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVCRMCGKKYTRKDQLEYHI
CCDS13 E
>>CCDS48023.1 ZBTB34 gene_id:403341|Hs108|chr9 (500 aa)
initn: 1225 init1: 648 opt: 1254 Z-score: 741.8 bits: 146.8 E(32554): 5.3e-35
Smith-Waterman score: 1292; 45.6% identity (69.7% similar) in 498 aa overlap (1-485:1-483)
10 20 30 40 50 60
pF1KE4 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRDH
:.... ::...:..:..:::.::.::.::.::::.:..::: :::::.::::::::::::
CCDS48 MDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPYFRDH
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 MSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCTQ
.:. :: .::::::::.:::::::::::::. ::: :..:.:::::::::: .::::::
CCDS48 SALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVIDKCTQ
70 80 90 100 110 120
130 140 150 160 170
pF1KE4 ILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRG-QVS
:::.:: ::.:..:. . : :. :::. . . . .: . .: .: : .
CCDS48 ILESIHSKISVGDVD-----SVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQGRQPT
130 140 150 160 170
180 190 200 210 220 230
pF1KE4 AVLDIRELSPPEESTSPQIIEPS-SDVESREPILRINRAGQWYVETGVADRGGRSDDEVR
: :.: . : .. .. : . :: : . . . .: : ..: : .
CCDS48 ASSDLRMETTPSKALRSRLQEEGHSDRGSSGSVSEY----EIQIE-GDHEQGDLLVRESQ
180 190 200 210 220 230
240 250 260 270 280 290
pF1KE4 VLGAVHIKTENLEEWLGPENQPSGEDGSSAEEVTA---MVIDTTGHGSVGQENYTLGSSG
. :..: :. .. ... :.:: .: : . ..... ..::: :. :. ....
CCDS48 IT-EVKVKMEKSDRPSCSDSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVLQHAYSYSQAA
240 250 260 270 280
300 310 320 330 340
pF1KE4 AKVARPTS-SE----VDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEES---AM
.. ::. :: .. ::: :.. . :::.. . .. .. :. : ::
CCDS48 SQ---PTNVSEAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQGSDSEAM
290 300 310 320 330 340
350 360 370 380 390 400
pF1KE4 MGVSGYVEYLREQEVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVCRMCG
:. :: ::. . .:. :: :: ::::.::::::::::::::::::::::::..::
CCDS48 MNNPGYESSPRERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGITPFVCKFCG
350 360 370 380 390 400
410 420 430 440 450 460
pF1KE4 KKYTRKDQLEYHIRKHTGNKPFHCHVCGKSFPFQAILNQHFRKNHPGCIPLEGPHSISPE
:::::::::::::: :: .:::.:..::: ::::. ::::.:::::: ... . :::
CCDS48 KKYTRKDQLEYHIRGHTDDKPFRCEICGKCFPFQGTLNQHLRKNHPGVAEVRS-RIESPE
410 420 430 440 450 460
470 480 490 500
pF1KE4 TTVTSRGQAEEESPSQEETVAPGEAVQGSVSTTGPD
: . : :.. : :
CCDS48 RTDVYVEQKLENDASASEMGLDSRMEIHTVSDAPD
470 480 490 500
>>CCDS6867.1 ZBTB43 gene_id:23099|Hs108|chr9 (467 aa)
initn: 821 init1: 451 opt: 703 Z-score: 427.5 bits: 88.5 E(32554): 1.7e-17
Smith-Waterman score: 703; 31.0% identity (60.6% similar) in 462 aa overlap (1-447:1-446)
10 20 30 40 50
pF1KE4 MEKGGN-IQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRD
:: : : ...:.::::...:..::: :.::.:::. . :::. :::::.::::::::: :
CCDS68 MEPGTNSFRVEFPDFSSTILQKLNQQRQQGQLCDVSIVVQGHIFRAHKAVLAASSPYFCD
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE4 HMSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCT
.. :.. . . . :: :::..: ::::. . .:.::::::::::: :..::::
CCDS68 QVLLKNSRRIVLPDVMNPRVFENILLSSYTGRLVMPAPEIVSYLTAASFLQMWHVVDKCT
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE4 QILEGIHFKINVAEVEAELSQTRTKHQERPPESHR-VTPNLNRSLSPRHNTPKGN--RRG
..::: : . . .:.. . :: :. .. ... . . . . ::.. : :
CCDS68 EVLEG-----NPTVLCQKLNHG-SDHQSPSSSSYNGLVESFELGSGGHTDFPKAQELRDG
130 140 150 160 170
180 190 200 210 220
pF1KE4 QVSAVLDIRELSPPEESTSPQIIEPSSDVESREPILRINRAGQWYVETGVAD-------R
. ::: . : . . .:..: . : . :.: : :..: :
CCDS68 ENEEESTKDELSS--QLTEHEYLPSNSSTEHDR--LSTEMASQ-DGEEGASDSAEFHYTR
180 190 200 210 220
230 240 250 260 270 280
pF1KE4 GGRSDDEVRVLGA-VHIKTENLEEWL-GPENQPSGEDGSSAEEVTAMVIDTTGHGSVGQE
: . . .:.: : ::. : . . . .. . .: .... . : . : .:
CCDS68 PMYSKPSIMAHKRWIHVKPERLEQACEGMDVHATYDEHQVTESINTVQTEHTVQPSGVEE
230 240 250 260 270 280
290 300 310 320 330 340
pF1KE4 NYTLGSSGAKVARPTSSEVDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEESAM
.. .: . ... ... . .. . . . .. . .: : . .: : .
CCDS68 DFHIGEKKVEAEFDEQADESNYDEQVDFYGSSMEEFSGERSDGNLIGHRQ-----EAALA
290 300 310 320 330 340
350 360 370 380 390 400
pF1KE4 MGVSGYVEYLR--EQEVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVCRM
: : .:.. ..:.:. : . .: :.:::..:.. :::: .:.:. :. : .
CCDS68 AGYSENIEMVTGIKEEASHLGFSATDKLYPCQCGKSFTHKSQRDRHMSMHLGLRPYGCGV
350 360 370 380 390 400
410 420 430 440 450 460
pF1KE4 CGKKYTRKDQLEYHIRKHTGNKPFHCHVCGKSFPFQAILNQHFRKNHPGCIPLEGPHSIS
::::. : .: :.. ::: ::..:..:.: : .. ...:
CCDS68 CGKKFKMKHHLVGHMKIHTGIKPYECNICAKRFMWRDSFHRHVTSCTKSYEAAKAEQNTT
410 420 430 440 450 460
470 480 490 500
pF1KE4 PETTVTSRGQAEEESPSQEETVAPGEAVQGSVSTTGPD
CCDS68 EAN
503 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 15:17:38 2016 done: Mon Nov 7 15:17:39 2016
Total Scan time: 3.380 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]
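
Note on reading the "The best scores are:" table in this report: each row gives the library accession, a description, the library sequence length in parentheses, and then the opt score, bit score, and E() value. The following is a minimal Python sketch of how such rows could be pulled out of the text. It is not part of FASTA; the names HIT_RE and parse_hit are hypothetical, and the pattern assumes the single-line row layout seen in this report only.

```python
import re

# Hypothetical helper, not part of the FASTA package.
# Assumed row layout (as in this report):
#   <accession> <description> ( <length>) <opt> <bits> <E-value>
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s+\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>[\d.eE+-]+)\s*$"
)

def parse_hit(line):
    """Return a dict for a best-scores row, or None if the line is not one."""
    m = HIT_RE.match(line.strip())
    if m is None:
        return None
    return {
        "acc": m.group("acc"),
        "desc": m.group("desc"),
        "length": int(m.group("length")),   # library sequence length (aa)
        "opt": int(m.group("opt")),         # optimized score
        "bits": float(m.group("bits")),     # bit score
        "evalue": float(m.group("evalue")), # expectation value E()
    }

if __name__ == "__main__":
    row = "CCDS44278.1 ZBTB37 gene_id:84614|Hs108|chr1 ( 503) 3384 371.9 9.4e-103"
    print(parse_hit(row))
```

Lines that do not fit the assumed layout, such as the ">>" alignment headers or the statistics lines, return None rather than raising, so the function can be applied to every line of a report and the results filtered.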