FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KSDA1993, 504 aa
1>>>pF1KSDA1993 504 - 504 aa - 504 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.6299+/-0.00151; mu= -3.8160+/- 0.086
mean_var=315.5695+/-72.543, 0's: 0 Z-trim(107.6): 824 B-trim: 432 in 1/52
Lambda= 0.072198
statistics sampled from 8724 (9702) to 8724 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.658), E-opt: 0.2 (0.298), width: 16
Scan time: 3.220
The best scores are:                                  opt  bits E(32554)
CCDS48023.1 ZBTB34 gene_id:403341|Hs108|chr9  ( 500) 3347 363.3 3.5e-100
CCDS44278.1 ZBTB37 gene_id:84614|Hs108|chr1   ( 503) 1254 145.3 1.5e-34
CCDS1312.1 ZBTB37 gene_id:84614|Hs108|chr1    ( 361)  679  85.3 1.3e-16
CCDS6867.1 ZBTB43 gene_id:23099|Hs108|chr9    ( 467)  630  80.3 5.2e-15
>>CCDS48023.1 ZBTB34 gene_id:403341|Hs108|chr9 (500 aa)
initn: 3347 init1: 3347 opt: 3347 Z-score: 1912.0 bits: 363.3 E(32554): 3.5e-100
Smith-Waterman score: 3347; 100.0% identity (100.0% similar) in 500 aa overlap (5-504:1-500)
10 20 30 40 50 60
pF1KSD MSVEMDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 MDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPY
10 20 30 40 50
70 80 90 100 110 120
pF1KSD FRDHSALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVID
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 FRDHSALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVID
60 70 80 90 100 110
130 140 150 160 170 180
pF1KSD KCTQILESIHSKISVGDVDSVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQGRQPTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 KCTQILESIHSKISVGDVDSVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQGRQPTA
120 130 140 150 160 170
190 200 210 220 230 240
pF1KSD SSDLRMETTPSKALRSRLQEEGHSDRGSSGSVSEYEIQIEGDHEQGDLLVRESQITEVKV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 SSDLRMETTPSKALRSRLQEEGHSDRGSSGSVSEYEIQIEGDHEQGDLLVRESQITEVKV
180 190 200 210 220 230
250 260 270 280 290 300
pF1KSD KMEKSDRPSCSDSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVLQHAYSYSQAASQPTNVS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 KMEKSDRPSCSDSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVLQHAYSYSQAASQPTNVS
240 250 260 270 280 290
310 320 330 340 350 360
pF1KSD EAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQGSDSEAMMNNPGYESSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 EAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQGSDSEAMMNNPGYESSP
300 310 320 330 340 350
370 380 390 400 410 420
pF1KSD RERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGITPFVCKFCGKKYTRKDQLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 RERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGITPFVCKFCGKKYTRKDQLE
360 370 380 390 400 410
430 440 450 460 470 480
pF1KSD YHIRGHTDDKPFRCEICGKCFPFQGTLNQHLRKNHPGVAEVRSRIESPERTDVYVEQKLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 YHIRGHTDDKPFRCEICGKCFPFQGTLNQHLRKNHPGVAEVRSRIESPERTDVYVEQKLE
420 430 440 450 460 470
490 500
pF1KSD NDASASEMGLDSRMEIHTVSDAPD
::::::::::::::::::::::::
CCDS48 NDASASEMGLDSRMEIHTVSDAPD
480 490 500
>>CCDS44278.1 ZBTB37 gene_id:84614|Hs108|chr1 (503 aa)
initn: 1225 init1: 648 opt: 1254 Z-score: 733.8 bits: 145.3 E(32554): 1.5e-34
Smith-Waterman score: 1292; 45.6% identity (69.7% similar) in 498 aa overlap (5-487:1-485)
10 20 30 40 50 60
pF1KSD MSVEMDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPY
:.... ::...:..:..:::.::.::.::.::::.:..::: :::::.::::::::
CCDS44 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPY
10 20 30 40 50
70 80 90 100 110 120
pF1KSD FRDHSALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVID
:::: .:. :: .::::::::.:::::::::::::. ::: :..:.:::::::::: .::
CCDS44 FRDHMSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIID
60 70 80 90 100 110
130 140 150 160 170
pF1KSD KCTQILESIHSKISVGDVD-----SVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQG
:::::::.:: ::.:..:. . : :. :::. . . . .: . .: .:
CCDS44 KCTQILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRG
120 130 140 150 160 170
180 190 200 210 220 230
pF1KSD RQPTASSDLRMETTPSKALRSRLQEEGHSDRGSSGSVSEY----EIQIE-GDHEQGDLLV
: .: :.: . : .. .. : . :: : . . . .: : ..:
CCDS44 -QVSAVLDIRELSPPEESTSPQIIEPS-SDVESREPILRINRAGQWYVETGVADRGGRSD
180 190 200 210 220 230
240 250 260 270 280
pF1KSD RESQIT-EVKVKMEKSDRPSCSDSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVLQHAYSY
: .. :..: :. .. ... :.:: .: : . ..... ..::: :. :.
CCDS44 DEVRVLGAVHIKTENLEEWLGPENQPSGEDGSSAEEVTA---MVIDTTGHGSVGQENYTL
240 250 260 270 280 290
290 300 310 320 330 340
pF1KSD SQAASQ---PTNVSEAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQGSD
...... ::. :: .. ::: :.. . :::.. . .. .. :. :
CCDS44 GSSGAKVARPTS-SE----VDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEES-
300 310 320 330 340
350 360 370 380 390 400
pF1KSD SEAMMNNPGYESSPRERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGITPFVC
:::. :: ::. . .:. :: :: ::::.::::::::::::::::::::::::
CCDS44 --AMMGVSGYVEYLREQEVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVC
350 360 370 380 390 400
410 420 430 440 450 460
pF1KSD KFCGKKYTRKDQLEYHIRGHTDDKPFRCEICGKCFPFQGTLNQHLRKNHPGVAEVRS-RI
..:::::::::::::::: :: .:::.:..::: ::::. ::::.:::::: ... .
CCDS44 RMCGKKYTRKDQLEYHIRKHTGNKPFHCHVCGKSFPFQAILNQHFRKNHPGCIPLEGPHS
410 420 430 440 450 460
470 480 490 500
pF1KSD ESPERTDVYVEQKLENDASASEMGLDSRMEIHTVSDAPD
::: : . : :.. : :
CCDS44 ISPETTVTSRGQAEEESPSQEETVAPGEAVQGSVSTTGPD
470 480 490 500
>>CCDS1312.1 ZBTB37 gene_id:84614|Hs108|chr1 (361 aa)
initn: 670 init1: 648 opt: 679 Z-score: 411.9 bits: 85.3 E(32554): 1.3e-16
Smith-Waterman score: 717; 40.9% identity (68.5% similar) in 337 aa overlap (5-327:1-327)
10 20 30 40 50 60
pF1KSD MSVEMDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPY
:.... ::...:..:..:::.::.::.::.::::.:..::: :::::.::::::::
CCDS13 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPY
10 20 30 40 50
70 80 90 100 110 120
pF1KSD FRDHSALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVID
:::: .:. :: .::::::::.:::::::::::::. ::: :..:.:::::::::: .::
CCDS13 FRDHMSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIID
60 70 80 90 100 110
130 140 150 160 170
pF1KSD KCTQILESIHSKISVGDVD-----SVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQG
:::::::.:: ::.:..:. . : :. :::. . . . .: . .: .:
CCDS13 KCTQILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRG
120 130 140 150 160 170
180 190 200 210 220 230
pF1KSD RQPTASSDLRMETTPSKALRSRLQEEGHSDRGSSGSVSEY----EIQIE-GDHEQGDLLV
: .: :.: . : .. .. : . :: : . . . .: : ..:
CCDS13 -QVSAVLDIRELSPPEESTSPQIIEPS-SDVESREPILRINRAGQWYVETGVADRGGRSD
180 190 200 210 220 230
240 250 260 270 280
pF1KSD RESQIT-EVKVKMEKSDRPSCSDSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVLQHAYSY
: .. :..: :. .. ... :.:: .: : . ..... ..::: :. :.
CCDS13 DEVRVLGAVHIKTENLEEWLGPENQPSGEDGSSAEEVTA---MVIDTTGHGSVGQENYTL
240 250 260 270 280 290
290 300 310 320 330 340
pF1KSD SQAASQ---PTNVSEAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQGSD
...... ::. :: .. ::: :.. . :::..
CCDS13 GSSGAKVARPTS-SE----VDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVWSCG
300 310 320 330 340
350 360 370 380 390 400
pF1KSD SEAMMNNPGYESSPRERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGITPFVC
CCDS13 FRTALVVGGIATVYE
350 360
>>CCDS6867.1 ZBTB43 gene_id:23099|Hs108|chr9 (467 aa)
initn: 754 init1: 406 opt: 630 Z-score: 382.9 bits: 80.3 E(32554): 5.2e-15
Smith-Waterman score: 631; 30.4% identity (57.6% similar) in 467 aa overlap (3-451:1-447)
10 20 30 40 50 60
pF1KSD MSVEMDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPY
.: ..:: . . :..:::.:..::. : ::.:::. . .::. ::::::::::::::
CCDS68 MEPGTNSF-RVEFPDFSSTILQKLNQQRQQGQLCDVSIVVQGHIFRAHKAVLAASSPY
10 20 30 40 50
70 80 90 100 110 120
pF1KSD FRDHSALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVID
: :. :.. . . . :: :::..: ::::. . ..::.::::::::: :.:
CCDS68 FCDQVLLKNSRRIVLPDVMNPRVFENILLSSYTGRLVMPAPEIVSYLTAASFLQMWHVVD
60 70 80 90 100 110
130 140 150 160 170
pF1KSD KCTQILESIHSKISVGDVDSVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQGR----
:::..::. . . .. . . : ::. .: ... . . : .. :
CCDS68 KCTEVLEG-NPTVLCQKLNHGSDHQSPSSSSYNGLVESFELGSGGHTDFPKAQELRDGEN
120 130 140 150 160 170
180 190 200 210 220
pF1KSD -----QPTASSDL-RMETTPSKAL--RSRLQEEGHSDRGSSGSVSEYEIQIEGDHEQGDL
. ::.: . : ::.. ..::. : :. : :. . :. : .
CCDS68 EEESTKDELSSQLTEHEYLPSNSSTEHDRLSTEMASQDGEEGASDSAEF-----HYTRPM
180 190 200 210 220 230
230 240 250 260 270 280
pF1KSD LVRESQITE---VKVKMEKSDRPSCS--DSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVL
. : ... ..:: :. .. .: : . :. :: .. :. . : :
CCDS68 YSKPSIMAHKRWIHVKPERLEQ-ACEGMDVHATYDEHQVTESINTVQTEHT-VQPSGVEE
240 250 260 270 280
290 300 310 320 330 340
pF1KSD QHAYSYSQAASQPTNVSEAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQ
. . ... .. . .. . . . : . : : :. .. : . .:.
CCDS68 DFHIGEKKVEAEFDEQADESNYDEQVDFYGSSMEEFSGERSDG----NLIGHRQEAALAA
290 300 310 320 330 340
350 360 370 380 390 400
pF1KSD G-SDSEAMMNNPGYESSPRERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGIT
: :.. :... :.: :: . :: : :::::..:.. :::: .:.:.
CCDS68 GYSENIEMVTGIKEEASHLGFSATDKLYP------C-QCGKSFTHKSQRDRHMSMHLGLR
350 360 370 380 390
410 420 430 440 450 460
pF1KSD PFVCKFCGKKYTRKDQLEYHIRGHTDDKPFRCEICGKCFPFQGTLNQHLRKNHPGVAEVR
:. : ::::. : .: :.. :: ::..:.::.: : .. ....:.
CCDS68 PYGCGVCGKKFKMKHHLVGHMKIHTGIKPYECNICAKRFMWRDSFHRHVTSCTKSYEAAK
400 410 420 430 440 450
470 480 490 500
pF1KSD SRIESPERTDVYVEQKLENDASASEMGLDSRMEIHTVSDAPD
CCDS68 AEQNTTEAN
460
504 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 07:56:28 2016 done: Thu Nov 3 07:56:28 2016
Total Scan time: 3.220 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]