FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7661, 335 aa
1>>>pF1KB7661 335 - 335 aa - 335 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.0352+/-0.000686; mu= 14.3509+/- 0.042
mean_var=98.9792+/-20.328, 0's: 0 Z-trim(113.0): 72 B-trim: 335 in 1/52
Lambda= 0.128915
statistics sampled from 13650 (13722) to 13650 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.762), E-opt: 0.2 (0.422), width: 16
Scan time: 3.070
The best scores are:                               opt  bits E(32554)
CCDS4794.1 SPDEF gene_id:25803|Hs108|chr6  ( 335) 2307 438.7 3.1e-123
CCDS59013.1 SPDEF gene_id:25803|Hs108|chr6 ( 319) 1461 281.4 6.9e-76
CCDS56424.1 ETV7 gene_id:51513|Hs108|chr6  ( 264)  392  82.5 4.2e-16
CCDS7893.1 ELF5 gene_id:2001|Hs108|chr11   ( 255)  331  71.1 1.1e-12
CCDS7892.1 ELF5 gene_id:2001|Hs108|chr11   ( 265)  331  71.1 1.1e-12
CCDS45043.1 ELF1 gene_id:1997|Hs108|chr13  ( 595)  307  66.9 4.6e-11
CCDS9374.1 ELF1 gene_id:1997|Hs108|chr13   ( 619)  307  67.0 4.7e-11
CCDS59165.1 ELK1 gene_id:2002|Hs108|chrX   (  95)  295  64.1 5.2e-11
CCDS64063.1 ELF2 gene_id:1998|Hs108|chr4   ( 504)  305  66.5 5.2e-11
CCDS64062.1 ELF2 gene_id:1998|Hs108|chr4   ( 521)  305  66.5 5.3e-11
CCDS3745.1 ELF2 gene_id:1998|Hs108|chr4    ( 533)  305  66.5 5.4e-11
CCDS3744.1 ELF2 gene_id:1998|Hs108|chr4    ( 581)  305  66.6 5.8e-11
CCDS82954.1 ELF2 gene_id:1998|Hs108|chr4   ( 593)  305  66.6 5.9e-11
CCDS14617.1 ELF4 gene_id:2000|Hs108|chrX   ( 663)  303  66.2 8.3e-11
>>CCDS4794.1 SPDEF gene_id:25803|Hs108|chr6 (335 aa)
initn: 2307 init1: 2307 opt: 2307 Z-score: 2325.9 bits: 438.7 E(32554): 3.1e-123
Smith-Waterman score: 2307; 100.0% identity (100.0% similar) in 335 aa overlap (1-335:1-335)
               10        20        30        40        50        60
pF1KB7 MGSASPGLSSVSPSHLLLPPDTVSRTGLEKAAAGAVGLERRDWSPSPPATPEQGLSAFYL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MGSASPGLSSVSPSHLLLPPDTVSRTGLEKAAAGAVGLERRDWSPSPPATPEQGLSAFYL
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB7 SYFDMLYPEDSSWAAKAPGASSREEPPEEPEQCPVIDSQAPAGSLDLVPGGLTLEEHSLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SYFDMLYPEDSSWAAKAPGASSREEPPEEPEQCPVIDSQAPAGSLDLVPGGLTLEEHSLE
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB7 QVQSMVVGEVLKDIETACKLLNITADPMDWSPSNVQKWLLWTEHQYRLPPMGKAFQELAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 QVQSMVVGEVLKDIETACKLLNITADPMDWSPSNVQKWLLWTEHQYRLPPMGKAFQELAG
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB7 KELCAMSEEQFRQRSPLGGDVLHAHLDIWKSAAWMKERTSPGAIHYCASTSEESWTDSEV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KELCAMSEEQFRQRSPLGGDVLHAHLDIWKSAAWMKERTSPGAIHYCASTSEESWTDSEV
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB7 DSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNKEKGIFKIEDSAQVARLWGIRKNRPAM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNKEKGIFKIEDSAQVARLWGIRKNRPAM
              250       260       270       280       290       300
              310       320       330
pF1KB7 NYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHPI
       :::::::::::::::::::::::::::::::::::
CCDS47 NYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHPI
              310       320       330
>>CCDS59013.1 SPDEF gene_id:25803|Hs108|chr6 (319 aa)
initn: 1456 init1: 1456 opt: 1461 Z-score: 1475.9 bits: 281.4 E(32554): 6.9e-76
Smith-Waterman score: 2145; 95.2% identity (95.2% similar) in 335 aa overlap (1-335:1-319)
               10        20        30        40        50        60
pF1KB7 MGSASPGLSSVSPSHLLLPPDTVSRTGLEKAAAGAVGLERRDWSPSPPATPEQGLSAFYL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MGSASPGLSSVSPSHLLLPPDTVSRTGLEKAAAGAVGLERRDWSPSPPATPEQGLSAFYL
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB7 SYFDMLYPEDSSWAAKAPGASSREEPPEEPEQCPVIDSQAPAGSLDLVPGGLTLEEHSLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 SYFDMLYPEDSSWAAKAPGASSREEPPEEPEQCPVIDSQAPAGSLDLVPGGLTLEEHSLE
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB7 QVQSMVVGEVLKDIETACKLLNITADPMDWSPSNVQKWLLWTEHQYRLPPMGKAFQELAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 QVQSMVVGEVLKDIETACKLLNITADPMDWSPSNVQKWLLWTEHQYRLPPMGKAFQELAG
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB7 KELCAMSEEQFRQRSPLGGDVLHAHLDIWKSAAWMKERTSPGAIHYCASTSEESWTDSEV
       ::::::::::::::::::::::::::::::::                ::::::::::::
CCDS59 KELCAMSEEQFRQRSPLGGDVLHAHLDIWKSA----------------STSEESWTDSEV
              190       200       210                       220
              250       260       270       280       290       300
pF1KB7 DSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNKEKGIFKIEDSAQVARLWGIRKNRPAM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 DSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNKEKGIFKIEDSAQVARLWGIRKNRPAM
          230       240       250       260       270       280
              310       320       330
pF1KB7 NYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHPI
       :::::::::::::::::::::::::::::::::::
CCDS59 NYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHPI
          290       300       310
>>CCDS56424.1 ETV7 gene_id:51513|Hs108|chr6 (264 aa)
initn: 387 init1: 198 opt: 392 Z-score: 402.5 bits: 82.5 E(32554): 4.2e-16
Smith-Waterman score: 392; 34.8% identity (65.7% similar) in 201 aa overlap (138-332:41-228)
110 120 130 140 150 160
pF1KB7 VPGGLTLEEHSLEQVQSMVVGEVLKDIETACKLLN-ITADPMDWSPSNVQKWLLWTEHQY
::: . . .: :: .: .:: :.:..:
CCDS56 ISPVAAMPPLGTHVQARCEAQINLLGEGGICKLPGRLRIQPALWSREDVLHWLRWAEQEY
20 30 40 50 60 70
170 180 190 200 210 220
pF1KB7 RLPPMGKAFQELAGKELCAMSEEQFRQRSPLGGDVLHAHLDIWKSAAWMKERT---SP--
:: .. :. :. :: .....::.:.: .::::. :. :. ..:. .:
CCDS56 SLPCTAEHGFEMNGRALCILTKDDFRHRAPSSGDVLYELLQYIKT----QRRALVCGPFF
80 90 100 110 120
230 240 250 260 270 280
pF1KB7 GAIHYCASTSEESWTDSEVDSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNKEKGIFKI
:.: . ...: . : .: ::... .::: . : .:.: .:. ::..
CCDS56 GGIFRLKTPTQHSPVPPE---DCR----LLWDYVYQLLLDTR-YEPYIKWEDKDAKIFRV
130 140 150 160 170
290 300 310 320 330
pF1KB7 EDSAQVARLWGIRKNRPAMNYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHPI
: .::::: .::: :.:.:.::..:.::: .::.: . .:.:...:.
CCDS56 VDPNGLARLWGNHKNRVNMTYEKMSRALRHYYKLNIIKK-EPGQKLLFRFLKTPGKMVQD
180 190 200 210 220 230
CCDS56 KHSHLEPLESQEQDRIEFKDKRPEISP
240 250 260
>>CCDS7893.1 ELF5 gene_id:2001|Hs108|chr11 (255 aa)
initn: 324 init1: 261 opt: 331 Z-score: 341.4 bits: 71.1 E(32554): 1.1e-12
Smith-Waterman score: 392; 33.3% identity (60.9% similar) in 207 aa overlap (135-331:39-243)
110 120 130 140 150 160
pF1KB7 LDLVPGGLTLEEHSLEQVQSMVVGEVLKDIETACKLLNITADPMDWSPSNVQKWLLWTEH
.::: .. : :. .: .:: .
CCDS78 TFLPNASFCDPLMSWTDLFSNEEYYPAFEHQTACDSYWTSVHPEYWTKRHVWEWLQFCCD
10 20 30 40 50 60
170 180 190 200 210
pF1KB7 QYRLPPMGKAFQE--LAGKELCAMSEEQFRQRSPLGGDVLHAHLD--------IWKSAAW
::.: .: . ..: .::.:..:.: . . : :. :. :. ....:
CCDS78 QYKLDTNCISFCNFNISGLQLCSMTQEEFVEAAGLCGEYLYFILQNIRTQGYSFFNDAEE
70 80 90 100 110 120
220 230 240 250 260 270
pF1KB7 MKERTSPGAIHYCASTSEESWTDSEVDSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNK
: . : : .:: . : . : : : :::.:...:::.:. ...: ..
CCDS78 SKATIKDYADSNCLKTSGIKSQDCHSHSRTSLQSSHLWEFVRDLLLSPEENCGILEWEDR
130 140 150 160 170 180
280 290 300 310 320 330
pF1KB7 EKGIFKIEDSAQVARLWGIRKNRPAMNYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHP
:.:::.. : .:..:: ::. :.:.::::..: ::: ::... : .::::.:
CCDS78 EQGIFRVVKSEALAKMWGQRKKNDRMTYEKLSRALRYYYKTGILERVD--RRLVYKFGKN
190 200 210 220 230 240
pF1KB7 I
CCDS78 AHGWQEDKL
250
>>CCDS7892.1 ELF5 gene_id:2001|Hs108|chr11 (265 aa)
initn: 324 init1: 261 opt: 331 Z-score: 341.2 bits: 71.1 E(32554): 1.1e-12
Smith-Waterman score: 392; 33.3% identity (60.9% similar) in 207 aa overlap (135-331:49-253)
110 120 130 140 150 160
pF1KB7 LDLVPGGLTLEEHSLEQVQSMVVGEVLKDIETACKLLNITADPMDWSPSNVQKWLLWTEH
.::: .. : :. .: .:: .
CCDS78 TFLPNASFCDPLMSWTDLFSNEEYYPAFEHQTACDSYWTSVHPEYWTKRHVWEWLQFCCD
20 30 40 50 60 70
170 180 190 200 210
pF1KB7 QYRLPPMGKAFQE--LAGKELCAMSEEQFRQRSPLGGDVLHAHLD--------IWKSAAW
::.: .: . ..: .::.:..:.: . . : :. :. :. ....:
CCDS78 QYKLDTNCISFCNFNISGLQLCSMTQEEFVEAAGLCGEYLYFILQNIRTQGYSFFNDAEE
80 90 100 110 120 130
220 230 240 250 260 270
pF1KB7 MKERTSPGAIHYCASTSEESWTDSEVDSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNK
: . : : .:: . : . : : : :::.:...:::.:. ...: ..
CCDS78 SKATIKDYADSNCLKTSGIKSQDCHSHSRTSLQSSHLWEFVRDLLLSPEENCGILEWEDR
140 150 160 170 180 190
280 290 300 310 320 330
pF1KB7 EKGIFKIEDSAQVARLWGIRKNRPAMNYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHP
:.:::.. : .:..:: ::. :.:.::::..: ::: ::... : .::::.:
CCDS78 EQGIFRVVKSEALAKMWGQRKKNDRMTYEKLSRALRYYYKTGILERVD--RRLVYKFGKN
200 210 220 230 240 250
pF1KB7 I
CCDS78 AHGWQEDKL
260
>>CCDS45043.1 ELF1 gene_id:1997|Hs108|chr13 (595 aa)
initn: 291 init1: 257 opt: 307 Z-score: 312.1 bits: 66.9 E(32554): 4.6e-11
Smith-Waterman score: 307; 43.0% identity (70.2% similar) in 114 aa overlap (218-331:155-265)
190 200 210 220 230 240
pF1KB7 EEQFRQRSPLGGDVLHAHLDIWKSAAWMKERTSPGAIHYCASTSEESWTDSEVDSSCSGQ
.:.: :.: . : .. :.. :.
CCDS45 EVMETQQVQEKYADSPGASSPEQPKRKKGRKTKPPRPDSPATTPNISVKKKNKDGK--GN
130 140 150 160 170 180
250 260 270 280 290 300
pF1KB7 PIHLWQFLKELLLKPHSYGRFIRWLNKEKGIFKIEDSAQVARLWGIRKNRPAMNYDKLSR
:.::.:: :: . ..:.: ..::::::. :: :.:::: .::.: :::. ..:
CCDS45 TIYLWEFLLALLQDKATCPKYIKWTQREKGIFKLVDSKAVSRLWGKHKNKPDMNYETMGR
190 200 210 220 230 240
310 320 330
pF1KB7 SIRQYYKKGIIRKPDISQRLVYQFVHPI
..: ::..::. : . .:::::::
CCDS45 ALRYYYQRGILAKVE-GQRLVYQFKEMPKDLIYINDEDPSSSIESSDPSLSSSATSNRNQ
250 260 270 280 290 300
>>CCDS9374.1 ELF1 gene_id:1997|Hs108|chr13 (619 aa)
initn: 291 init1: 257 opt: 307 Z-score: 311.9 bits: 67.0 E(32554): 4.7e-11
Smith-Waterman score: 307; 43.0% identity (70.2% similar) in 114 aa overlap (218-331:179-289)
190 200 210 220 230 240
pF1KB7 EEQFRQRSPLGGDVLHAHLDIWKSAAWMKERTSPGAIHYCASTSEESWTDSEVDSSCSGQ
.:.: :.: . : .. :.. :.
CCDS93 EVMETQQVQEKYADSPGASSPEQPKRKKGRKTKPPRPDSPATTPNISVKKKNKDGK--GN
150 160 170 180 190 200
250 260 270 280 290 300
pF1KB7 PIHLWQFLKELLLKPHSYGRFIRWLNKEKGIFKIEDSAQVARLWGIRKNRPAMNYDKLSR
:.::.:: :: . ..:.: ..::::::. :: :.:::: .::.: :::. ..:
CCDS93 TIYLWEFLLALLQDKATCPKYIKWTQREKGIFKLVDSKAVSRLWGKHKNKPDMNYETMGR
210 220 230 240 250 260
310 320 330
pF1KB7 SIRQYYKKGIIRKPDISQRLVYQFVHPI
..: ::..::. : . .:::::::
CCDS93 ALRYYYQRGILAKVE-GQRLVYQFKEMPKDLIYINDEDPSSSIESSDPSLSSSATSNRNQ
270 280 290 300 310 320
>>CCDS59165.1 ELK1 gene_id:2002|Hs108|chrX (95 aa)
initn: 242 init1: 242 opt: 295 Z-score: 311.2 bits: 64.1 E(32554): 5.2e-11
Smith-Waterman score: 295; 52.4% identity (79.8% similar) in 84 aa overlap (249-332:5-86)
220 230 240 250 260 270
pF1KB7 TSPGAIHYCASTSEESWTDSEVDSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNKEKGI
. ::::: .:: . .. :..: : ... :
CCDS59 MDPSVTLWQFLLQLL-REQGNGHIISWTSRDGGE
10 20 30
280 290 300 310 320 330
pF1KB7 FKIEDSAQVARLWGIRKNRPAMNYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHPI
::. :. .::::::.:::. ::::::::..: :: :.:::: . .:..::.::
CCDS59 FKLVDAEEVARLWGLRKNKTNMNYDKLSRALRYYYDKNIIRKVS-GQKFVYKFVSYPESH
40 50 60 70 80 90
CCDS59 CAP
>>CCDS64063.1 ELF2 gene_id:1998|Hs108|chr4 (504 aa)
initn: 317 init1: 263 opt: 305 Z-score: 311.1 bits: 66.5 E(32554): 5.2e-11
Smith-Waterman score: 305; 48.8% identity (77.9% similar) in 86 aa overlap (246-331:116-200)
220 230 240 250 260 270
pF1KB7 KERTSPGAIHYCASTSEESWTDSEVDSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNKE
:. .::.:: .:: .. :.:.: ..:
CCDS64 VEVSTEESEPMDTSPIPTSPDSHEPMKKKKGNTTYLWEFLLDLLQDKNTCPRYIKWTQRE
90 100 110 120 130 140
280 290 300 310 320 330
pF1KB7 KGIFKIEDSAQVARLWGIRKNRPAMNYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHPI
:::::. :: :..::: .::.: :::. ..:..: ::..::. : . .:::::::
CCDS64 KGIFKLVDSKAVSKLWGKHKNKPDMNYETMGRALRYYYQRGILAKVE-GQRLVYQFKDMP
150 160 170 180 190 200
CCDS64 KNIVVIDDDKSETCNEDLAGTTDEKSLERVSLSAESLLKAASSVRSGKNSSPINCSRAEK
210 220 230 240 250 260
>>CCDS64062.1 ELF2 gene_id:1998|Hs108|chr4 (521 aa)
initn: 293 init1: 263 opt: 305 Z-score: 310.9 bits: 66.5 E(32554): 5.3e-11
Smith-Waterman score: 305; 48.8% identity (77.9% similar) in 86 aa overlap (246-331:133-217)
220 230 240 250 260 270
pF1KB7 KERTSPGAIHYCASTSEESWTDSEVDSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNKE
:. .::.:: .:: .. :.:.: ..:
CCDS64 KVGRKPKTQQSPISNGSPELGIKKKPREGKGNTTYLWEFLLDLLQDKNTCPRYIKWTQRE
110 120 130 140 150 160
280 290 300 310 320 330
pF1KB7 KGIFKIEDSAQVARLWGIRKNRPAMNYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHPI
:::::. :: :..::: .::.: :::. ..:..: ::..::. : . .:::::::
CCDS64 KGIFKLVDSKAVSKLWGKHKNKPDMNYETMGRALRYYYQRGILAKVE-GQRLVYQFKDMP
170 180 190 200 210 220
CCDS64 KNIVVIDDDKSETCNEDLAGTTDEKSLERVSLSAESLLKAASSVRSGKNSSPINCSRAEK
230 240 250 260 270 280
335 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 09:54:25 2016 done: Sat Nov 5 09:54:26 2016
Total Scan time: 3.070 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]