FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7851, 548 aa
1>>>pF1KB7851 548 - 548 aa - 548 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 11.3532+/-0.000895; mu= -6.1561+/- 0.054
mean_var=386.1243+/-78.725, 0's: 0 Z-trim(118.3): 91 B-trim: 204 in 1/53
Lambda= 0.065270
statistics sampled from 19126 (19218) to 19126 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.834), E-opt: 0.2 (0.59), width: 16
Scan time: 4.180
The best scores are:                                opt  bits E(32554)
CCDS12600.1 ERF gene_id:2077|Hs108|chr19    ( 548) 3864 377.5 2.3e-104
CCDS77308.1 ERF gene_id:2077|Hs108|chr19    ( 473) 3320 326.2 5.4e-89
CCDS44250.1 ETV3 gene_id:2117|Hs108|chr1    ( 512)  962 104.2 4e-22
CCDS1164.1 ETV3 gene_id:2117|Hs108|chr1     ( 143)  718  80.7 1.3e-15
CCDS30893.1 ETV3L gene_id:440695|Hs108|chr1 ( 361)  704  79.8 6.3e-15
>>CCDS12600.1 ERF gene_id:2077|Hs108|chr19 (548 aa)
initn: 3864 init1: 3864 opt: 3864 Z-score: 1987.2 bits: 377.5 E(32554): 2.3e-104
Smith-Waterman score: 3864; 100.0% identity (100.0% similar) in 548 aa overlap (1-548:1-548)
               10        20        30        40        50        60
pF1KB7 MKTPADTGFAFPDWAYKPESSPGSRQIQLWHFILELLRKEEYQGVIAWQGDYGEFVIKDP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MKTPADTGFAFPDWAYKPESSPGSRQIQLWHFILELLRKEEYQGVIAWQGDYGEFVIKDP
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB7 DEVARLWGVRKCKPQMNYDKLSRALRYYYNKRILHKTKGKRFTYKFNFNKLVLVNYPFID
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 DEVARLWGVRKCKPQMNYDKLSRALRYYYNKRILHKTKGKRFTYKFNFNKLVLVNYPFID
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB7 VGLAGGAVPQSAPPVPSGGSHFRFPPSTPSEVLSPTEDPRSPPACSSSSSSLFSAVVARR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 VGLAGGAVPQSAPPVPSGGSHFRFPPSTPSEVLSPTEDPRSPPACSSSSSSLFSAVVARR
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB7 LGRGSVSDCSDGTSELEEPLGEDPRARPPGPPDLGAFRGPPLARLPHDPGVFRVYPRPRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 LGRGSVSDCSDGTSELEEPLGEDPRARPPGPPDLGAFRGPPLARLPHDPGVFRVYPRPRG
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB7 GPEPLSPFPVSPLAGPGSLLPPQLSPALPMTPTHLAYTPSPTLSPMYPSGGGGPSGSGGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 GPEPLSPFPVSPLAGPGSLLPPQLSPALPMTPTHLAYTPSPTLSPMYPSGGGGPSGSGGG
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB7 SHFSFSPEDMKRYLQAHTQSVYNYHLSPRAFLHYPGLVVPQPQRPDKCPLPPMAPETPPV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 SHFSFSPEDMKRYLQAHTQSVYNYHLSPRAFLHYPGLVVPQPQRPDKCPLPPMAPETPPV
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KB7 PSSASSSSSSSSSPFKFKLQPPPLGRRQRAAGEKAVAGADKSGGSAGGLAEGAGALAPPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PSSASSSSSSSSSPFKFKLQPPPLGRRQRAAGEKAVAGADKSGGSAGGLAEGAGALAPPP
              370       380       390       400       410       420
              430       440       450       460       470       480
pF1KB7 PPPQIKVEPISEGESEEVEVTDISDEDEEDGEVFKTPRAPPAPPKPEPGEAPGASQCMPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PPPQIKVEPISEGESEEVEVTDISDEDEEDGEVFKTPRAPPAPPKPEPGEAPGASQCMPL
              430       440       450       460       470       480
              490       500       510       520       530       540
pF1KB7 KLRFKRRWSEDCRLEGGGGPAGGFEDEGEDKKVRGEGPGEAGGPLTPRRVSSDLQHATAQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 KLRFKRRWSEDCRLEGGGGPAGGFEDEGEDKKVRGEGPGEAGGPLTPRRVSSDLQHATAQ
              490       500       510       520       530       540
pF1KB7 LSLEHRDS
       ::::::::
CCDS12 LSLEHRDS
>>CCDS77308.1 ERF gene_id:2077|Hs108|chr19 (473 aa)
initn: 3320 init1: 3320 opt: 3320 Z-score: 1711.2 bits: 326.2 E(32554): 5.4e-89
Smith-Waterman score: 3320; 100.0% identity (100.0% similar) in 473 aa overlap (76-548:1-473)
          50        60        70        80        90       100
pF1KB7 IAWQGDYGEFVIKDPDEVARLWGVRKCKPQMNYDKLSRALRYYYNKRILHKTKGKRFTYK
                                     ::::::::::::::::::::::::::::::
CCDS77                               MNYDKLSRALRYYYNKRILHKTKGKRFTYK
                                             10        20        30
         110       120       130       140       150       160
pF1KB7 FNFNKLVLVNYPFIDVGLAGGAVPQSAPPVPSGGSHFRFPPSTPSEVLSPTEDPRSPPAC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 FNFNKLVLVNYPFIDVGLAGGAVPQSAPPVPSGGSHFRFPPSTPSEVLSPTEDPRSPPAC
               40        50        60        70        80        90
         170       180       190       200       210       220
pF1KB7 SSSSSSLFSAVVARRLGRGSVSDCSDGTSELEEPLGEDPRARPPGPPDLGAFRGPPLARL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 SSSSSSLFSAVVARRLGRGSVSDCSDGTSELEEPLGEDPRARPPGPPDLGAFRGPPLARL
              100       110       120       130       140       150
         230       240       250       260       270       280
pF1KB7 PHDPGVFRVYPRPRGGPEPLSPFPVSPLAGPGSLLPPQLSPALPMTPTHLAYTPSPTLSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 PHDPGVFRVYPRPRGGPEPLSPFPVSPLAGPGSLLPPQLSPALPMTPTHLAYTPSPTLSP
              160       170       180       190       200       210
         290       300       310       320       330       340
pF1KB7 MYPSGGGGPSGSGGGSHFSFSPEDMKRYLQAHTQSVYNYHLSPRAFLHYPGLVVPQPQRP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 MYPSGGGGPSGSGGGSHFSFSPEDMKRYLQAHTQSVYNYHLSPRAFLHYPGLVVPQPQRP
              220       230       240       250       260       270
         350       360       370       380       390       400
pF1KB7 DKCPLPPMAPETPPVPSSASSSSSSSSSPFKFKLQPPPLGRRQRAAGEKAVAGADKSGGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 DKCPLPPMAPETPPVPSSASSSSSSSSSPFKFKLQPPPLGRRQRAAGEKAVAGADKSGGS
              280       290       300       310       320       330
         410       420       430       440       450       460
pF1KB7 AGGLAEGAGALAPPPPPPQIKVEPISEGESEEVEVTDISDEDEEDGEVFKTPRAPPAPPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 AGGLAEGAGALAPPPPPPQIKVEPISEGESEEVEVTDISDEDEEDGEVFKTPRAPPAPPK
              340       350       360       370       380       390
         470       480       490       500       510       520
pF1KB7 PEPGEAPGASQCMPLKLRFKRRWSEDCRLEGGGGPAGGFEDEGEDKKVRGEGPGEAGGPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 PEPGEAPGASQCMPLKLRFKRRWSEDCRLEGGGGPAGGFEDEGEDKKVRGEGPGEAGGPL
              400       410       420       430       440       450
         530       540
pF1KB7 TPRRVSSDLQHATAQLSLEHRDS
       :::::::::::::::::::::::
CCDS77 TPRRVSSDLQHATAQLSLEHRDS
              460       470
>>CCDS44250.1 ETV3 gene_id:2117|Hs108|chr1 (512 aa)
initn: 1060 init1: 480 opt: 962 Z-score: 510.8 bits: 104.2 E(32554): 4e-22
Smith-Waterman score: 1324; 45.6% identity (63.0% similar) in 551 aa overlap (2-502:10-504)
10 20 30 40 50
pF1KB7 MKTPADTGFAFPDWAYKPESSPGSRQIQLWHFILELLRKEEYQGVIAWQ-GD
: . :. ::::::: :::::::::::::::::::.:::.. ::::: :.
CCDS44 MKAGCSIVEKPEGGGGYQFPDWAYKTESSPGSRQIQLWHFILELLQKEEFRHVIAWQQGE
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 YGEFVIKDPDEVARLWGVRKCKPQMNYDKLSRALRYYYNKRILHKTKGKRFTYKFNFNKL
::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::::
CCDS44 YGEFVIKDPDEVARLWGRRKCKPQMNYDKLSRALRYYYNKRILHKTKGKRFTYKFNFNKL
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB7 VLVNYPFIDVGLAGGAVPQSAPPVPSGGSHFRFPPSTPSEVLSPTEDPRSPPACSSSSSS
:. :::::.. ..:.:::::::::...:.:.::: .. :::.: . : :.::
CCDS44 VMPNYPFINIR-SSGVVPQSAPPVPTASSRFHFPP---LDTHSPTNDVQ-PGRFSASS--
130 140 150 160 170
180 190 200 210 220 230
pF1KB7 LFSAVVARRLGRGSVSDCSDGTSELEEPLGEDPRARPPGP-PDLGAFRGPPLARLPHDPG
..: .. .: . ::::. . : : : : . .:. : ... . :
CCDS44 ----LTASGQESSNGTDRKTELSELEDGSAADWR-RGVDPVSSRNAIGGGGIGHQKRKPD
180 190 200 210 220
240 250 260 270 280
pF1KB7 V-FRVYPRPRGGPEPLSPFPVSPLAGPGSLLPPQLSPALPMTPTHLAYTPSPTLSPMYPS
. . .. :: :.: ::: :::. : :..: .:::: .::: ..:.::: :::. :
CCDS44 IMLPLFARPGMYPDPHSPFAVSPIPGRGGVLNVPISPALSLTPTIFSYSPSPGLSPFTSS
230 240 250 260 270 280
290 300 310 320 330 340
pF1KB7 GGGGPSGSGGGSHFSFSPEDMKRYLQAHTQSVYNYHLSPRAFLHYPGLVVPQPQRPDKCP
: :::.::.::.::.... ::.:::::::.: .::::.:: : .:
CCDS44 -----------SCFSFNPEEMKHYLHSQACSVFNYHLSPRTFPRYPGLMVP----PLQC-
290 300 310 320 330
350 360 370 380 390 400
pF1KB7 LPPMAPETPPVPSSASSSSSSSSSPFKFKLQPPPLGRRQRAAGEKAVAGADKSGGSAGGL
: :: :. :..::::::.::..: :.. .: . . ...
CCDS44 --QMHPEE--------------STQFSIKLQPPPVGRKNRERVESSEESAPVTTPTMASI
340 350 360 370
410 420 430 440 450
pF1KB7 AEGAGALAPPPPPPQIKVEPISEGESEEV-----EVTDISDED---------EEDGEVFK
::.::::: :: . : . : . ..:. :: : .:
CCDS44 ------------PPRIKVEPASEKDPESLRQSAREKEEHTQEEGTVPSRTIEEEKGTIFA
380 390 400 410 420
460 470 480 490
pF1KB7 TPRAPPAPP---------KP---------EPGEAPGASQ-----CMPLKLRFKRRWSEDC
: ::: : .: .::. :.: . :: :::.::::..:
CCDS44 RPAAPPIWPSVPISTPSGEPLEVTEDSEDRPGKEPSAPEKKEDALMPPKLRLKRRWNDDP
430 440 450 460 470 480
500 510 520 530 540
pF1KB7 R----------LEGGGGPAGGFEDEGEDKKVRGEGPGEAGGPLTPRRVSSDLQHATAQLS
. : .:.:: :
CCDS44 EARELSKSGKFLWNGSGPQGLATAAADA
490 500 510
>>CCDS1164.1 ETV3 gene_id:2117|Hs108|chr1 (143 aa)
initn: 719 init1: 475 opt: 718 Z-score: 394.0 bits: 80.7 E(32554): 1.3e-15
Smith-Waterman score: 718; 82.4% identity (90.4% similar) in 125 aa overlap (2-125:10-134)
10 20 30 40 50
pF1KB7 MKTPADTGFAFPDWAYKPESSPGSRQIQLWHFILELLRKEEYQGVIAWQ-GD
: . :. ::::::: :::::::::::::::::::.:::.. ::::: :.
CCDS11 MKAGCSIVEKPEGGGGYQFPDWAYKTESSPGSRQIQLWHFILELLQKEEFRHVIAWQQGE
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 YGEFVIKDPDEVARLWGVRKCKPQMNYDKLSRALRYYYNKRILHKTKGKRFTYKFNFNKL
::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::::
CCDS11 YGEFVIKDPDEVARLWGRRKCKPQMNYDKLSRALRYYYNKRILHKTKGKRFTYKFNFNKL
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB7 VLVNYPFIDVGLAGGAVPQSAPPVPSGGSHFRFPPSTPSEVLSPTEDPRSPPACSSSSSS
:. :::::.. .:
CCDS11 VMPNYPFINIRSSGKIQTLLVGN
130 140
>>CCDS30893.1 ETV3L gene_id:440695|Hs108|chr1 (361 aa)
initn: 548 init1: 460 opt: 704 Z-score: 381.5 bits: 79.8 E(32554): 6.3e-15
Smith-Waterman score: 835; 51.4% identity (63.0% similar) in 319 aa overlap (7-294:19-326)
10 20 30 40
pF1KB7 MKTPADTGFAFPDWAYKPESSPGSRQIQLWHFILELLRKEEYQGVIAW
.:.:::::::: :::::::::::::::::::.:::.. ::::
CCDS30 MHCSCLAEGIPANPGNWISGLAFPDWAYKAESSPGSRQIQLWHFILELLQKEEFRHVIAW
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB7 Q-GDYGEFVIKDPDEVARLWGVRKCKPQMNYDKLSRALRYYYNKRILHKTKGKRFTYKFN
: :.::::::::::::::::: ::::::::::::::::::::::::::::::::::::::
CCDS30 QQGEYGEFVIKDPDEVARLWGRRKCKPQMNYDKLSRALRYYYNKRILHKTKGKRFTYKFN
70 80 90 100 110 120
110 120 130 140 150
pF1KB7 FNKLVLVNYPFIDV------GLAGGAVPQSAPP-VPSGGS----H-FRFPPSTPSEVLSP
:.::..::::. .: : :: : :: : . : . : .. : :.
CCDS30 FSKLIVVNYPLWEVRAPPSPHLLLGAPALCRPALVPVGVQSELLHSMLFAHQAMVEQLTG
130 140 150 160 170 180
160 170 180 190 200
pF1KB7 TEDPRSPPACSS----SSSSLF---SAVVARRLGR----GSVSDCSDGTSELEEPLGEDP
. ::.:: :. ::::.. :: ::: :::. :.. . :: :
CCDS30 QQTPRGPPETSGDKKGSSSSVYRLGSAPGPCRLGLCCHLGSVQGELPGVASFTPPL---P
190 200 210 220 230
210 220 230 240 250
pF1KB7 RARPPGPPDLGAFRGPPLARLPHD---PGVFR---VYPRPRGGPEPLSPFPVSPL-AGPG
:: : . . :: : :: . ::.:. . : ::. : :: :: :: :
CCDS30 ---PPLPSNWTCLSGPFLPPLPSEQQLPGAFKPDILLPGPRSLPGAWH-FPGLPLLAGLG
240 250 260 270 280 290
260 270 280 290 300 310
pF1KB7 SLLPPQLSPALPMTPTHLAYTPSPTLSPMYPSGGGGPSGSGGGSHFSFSPEDMKRYLQAH
. .: : . : : :.: : .:: :
CCDS30 QGAGERLW-LLSLRPEGLEVKPAPM---MEAKGGLDPREVFCPETRRLKTGEESLTSPNL
300 310 320 330 340
320 330 340 350 360 370
pF1KB7 TQSVYNYHLSPRAFLHYPGLVVPQPQRPDKCPLPPMAPETPPVPSSASSSSSSSSSPFKF
CCDS30 ENLKAVWPLDPP
350 360
548 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 22:38:24 2016 done: Fri Nov 4 22:38:24 2016
Total Scan time: 4.180 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]
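
The "The best scores are:" lines in this report follow a regular layout: accession, free-text description, library sequence length in parentheses, then the opt score, bit score, and E() value. A minimal sketch of how such lines might be read into records is given below. It assumes Python; the names `Hit`, `parse_hit_line`, and `parse_best_scores` are hypothetical helpers introduced for illustration and are not part of the FASTA package. The pattern is inferred only from the lines shown in this report and may need adjusting for other output modes.

```python
import re
from typing import Iterable, List, NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'The best scores are:' table (hypothetical record type)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed layout, inferred from this report only:
#   <accession> <description> ( <length>) <opt> <bits> <E-value>
_HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*"
    r"\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<e>\S+)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a best-scores row, or None for any other line."""
    m = _HIT_RE.match(line.strip())
    if m is None:
        return None
    try:
        evalue = float(m.group("e"))
    except ValueError:
        return None  # last column was not a number, so not a score row
    return Hit(
        accession=m.group("acc"),
        description=m.group("desc"),
        length=int(m.group("len")),
        opt=int(m.group("opt")),
        bits=float(m.group("bits")),
        evalue=evalue,
    )


def parse_best_scores(lines: Iterable[str]) -> List[Hit]:
    """Collect every line that looks like a best-scores row."""
    hits = []
    for line in lines:
        hit = parse_hit_line(line)
        if hit is not None:
            hits.append(hit)
    return hits
```

Because the pattern uses `\s+` between columns, it accepts both single-space and column-padded rows. Alignment headers such as `>>CCDS12600.1 ... (548 aa)` are skipped because the parenthesised field must contain only digits.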