FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1890, 418 aa
1>>>pF1KE1890 418 - 418 aa - 418 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.0426+/-0.000853; mu= 14.7346+/- 0.051
mean_var=100.7444+/-19.632, 0's: 0 Z-trim(109.8): 24 B-trim: 0 in 0/50
Lambda= 0.127780
statistics sampled from 11125 (11147) to 11125 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.706), E-opt: 0.2 (0.342), width: 16
Scan time: 2.990
The best scores are: opt bits E(32554)
CCDS33211.1 SPRED2 gene_id:200734|Hs108|chr2 ( 418) 3006 564.5 6.6e-161
CCDS46308.1 SPRED2 gene_id:200734|Hs108|chr2 ( 415) 2950 554.2 8.5e-158
CCDS42560.1 SPRED3 gene_id:399473|Hs108|chr19 ( 410) 990 192.9 4.9e-49
CCDS32193.1 SPRED1 gene_id:161742|Hs108|chr15 ( 444) 937 183.1 4.6e-46
>>CCDS33211.1 SPRED2 gene_id:200734|Hs108|chr2 (418 aa)
initn: 3006 init1: 3006 opt: 3006 Z-score: 3002.2 bits: 564.5 E(32554): 6.6e-161
Smith-Waterman score: 3006; 100.0% identity (100.0% similar) in 418 aa overlap (1-418:1-418)
10 20 30 40 50 60
pF1KE1 MTEETHPDDDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKVMHPEGNGRSGFLIH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MTEETHPDDDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKVMHPEGNGRSGFLIH
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 GERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAIE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 GERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAIE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 DLIEGSTTSSSTIHNEAELGDDDVFTTATDSSSNSSQKREQPTRTISSPTSCEHRRIYTL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 DLIEGSTTSSSTIHNEAELGDDDVFTTATDSSSNSSQKREQPTRTISSPTSCEHRRIYTL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 GHLHDSYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRINPREKIWMTGYEDYRHAPVRGKY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 GHLHDSYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRINPREKIWMTGYEDYRHAPVRGKY
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 PDPSEDADSSYVRFAKGEVPKHDYNYPYVDSSDFGLGEDPKGRGGSVIKTQPSRGKSRRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 PDPSEDADSSYVRFAKGEVPKHDYNYPYVDSSDFGLGEDPKGRGGSVIKTQPSRGKSRRR
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE1 KEDGERSRCVYCRDMFNHEENRRGHCQDAPDSVRTCIRRVSCMWCADSMLYHCMSDPEGD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 KEDGERSRCVYCRDMFNHEENRRGHCQDAPDSVRTCIRRVSCMWCADSMLYHCMSDPEGD
310 320 330 340 350 360
370 380 390 400 410
pF1KE1 YTDPCSCDTSDEKFCLRWMALIALSFLAPCMCCYLPLRACYHCGVMCRCCGGKHKAAA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 YTDPCSCDTSDEKFCLRWMALIALSFLAPCMCCYLPLRACYHCGVMCRCCGGKHKAAA
370 380 390 400 410
>>CCDS46308.1 SPRED2 gene_id:200734|Hs108|chr2 (415 aa)
initn: 2950 init1: 2950 opt: 2950 Z-score: 2946.5 bits: 554.2 E(32554): 8.5e-158
Smith-Waterman score: 2950; 99.5% identity (99.8% similar) in 412 aa overlap (7-418:4-415)
10 20 30 40 50 60
pF1KE1 MTEETHPDDDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKVMHPEGNGRSGFLIH
: .:::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 MASPGSDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKVMHPEGNGRSGFLIH
10 20 30 40 50
70 80 90 100 110 120
pF1KE1 GERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAIE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 GERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAIE
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE1 DLIEGSTTSSSTIHNEAELGDDDVFTTATDSSSNSSQKREQPTRTISSPTSCEHRRIYTL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 DLIEGSTTSSSTIHNEAELGDDDVFTTATDSSSNSSQKREQPTRTISSPTSCEHRRIYTL
120 130 140 150 160 170
190 200 210 220 230 240
pF1KE1 GHLHDSYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRINPREKIWMTGYEDYRHAPVRGKY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 GHLHDSYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRINPREKIWMTGYEDYRHAPVRGKY
180 190 200 210 220 230
250 260 270 280 290 300
pF1KE1 PDPSEDADSSYVRFAKGEVPKHDYNYPYVDSSDFGLGEDPKGRGGSVIKTQPSRGKSRRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 PDPSEDADSSYVRFAKGEVPKHDYNYPYVDSSDFGLGEDPKGRGGSVIKTQPSRGKSRRR
240 250 260 270 280 290
310 320 330 340 350 360
pF1KE1 KEDGERSRCVYCRDMFNHEENRRGHCQDAPDSVRTCIRRVSCMWCADSMLYHCMSDPEGD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 KEDGERSRCVYCRDMFNHEENRRGHCQDAPDSVRTCIRRVSCMWCADSMLYHCMSDPEGD
300 310 320 330 340 350
370 380 390 400 410
pF1KE1 YTDPCSCDTSDEKFCLRWMALIALSFLAPCMCCYLPLRACYHCGVMCRCCGGKHKAAA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 YTDPCSCDTSDEKFCLRWMALIALSFLAPCMCCYLPLRACYHCGVMCRCCGGKHKAAA
360 370 380 390 400 410
>>CCDS42560.1 SPRED3 gene_id:399473|Hs108|chr19 (410 aa)
initn: 909 init1: 368 opt: 990 Z-score: 993.8 bits: 192.9 E(32554): 4.9e-49
Smith-Waterman score: 990; 39.4% identity (63.2% similar) in 419 aa overlap (13-418:1-409)
10 20 30 40 50
pF1KE1 MTEETHPDDDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKV--MHPEGNGRSG-F
.:::.::::.::::::::.: :::.:.:.::.: .:::..:.: .
CCDS42 MVRVRAVVMARDDSSGGWLPVGGGGLSQVSVCRVRGARPEGGARQGHY
10 20 30 40
60 70 80 90 100 110
pF1KE1 LIHGERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRK
.::::: .:. ..::: .. :::.:.:: ::::.. . ::::::::::.: :....
CCDS42 VIHGERLRDQKTTLECTLKPGLVYNKVNPIFHHWSLGDCKFGLTFQSPAEADEFQKSLLA
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE1 AIEDLIEGSTTSSSTIHNEAELGDDDV----FTTATDSSSNSSQKR-EQPTRTISSPTSC
:. : .:: : ::. . . : .:. .::.:.::..: : : . ..:
CCDS42 ALAALGRGSLTPSSSSSSSSPSQDTAETPCPLTSHVDSDSSSSHSRQETPPSAAAAPIIT
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE1 EHRRIYTLGHLHDSYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRINPREKIWM-TGYEDY
. . : . : . : .: ...:. .: .. . : :::::
CCDS42 MES---ASGFGPTTPPQRRRSSAQSYPPLLPFTGIPEPSEPLAGAGGLG--WGGRGYEDY
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE1 RHAPVRGKYPDPSEDADSSYVRFAK-GEVPKHDYNYPYVDSSDFGLGEDPKGRGGSVIKT
: :. : : .. ::::: : . . : . . . . : .
CCDS42 R----RSGPPAPLA-LSTCVVRFAKTGALRGAALGPPAALPAPLTEAAPPAPPARPPPGP
230 240 250 260 270
300 310 320 330 340
pF1KE1 QPSRGKSRRRKEDGERSRCVYCRDMFNHE-ENRRGHCQDAPDSVRTCIRRVSCMWCADSM
:: . .. : : .:::.:: .: .. ..: :.: .::: : .::.::.:::.:.
CCDS42 GPSSAPAKASPEAEEAARCVHCRALFRRRADGRGGRCAEAPDPGRLLVRRLSCLWCAESL
280 290 300 310 320 330
350 360 370 380 390 400
pF1KE1 LYHCMSDPEGDYTDPCSCDTSDEKFCLRWMALIALSFLAPCMCCYLPLRACYHCGVMCRC
::::.:: :::..:::.:. . . :: :: :::. .::.::: :::::. .. : :
CCDS42 LYHCLSDAEGDFSDPCACEPGHPRPAARWAALAALSLAVPCLCCYAPLRACHWVAARCGC
340 350 360 370 380 390
410
pF1KE1 --CGGKHKAAA
:::.:. ::
CCDS42 AGCGGRHEEAAR
400 410
>>CCDS32193.1 SPRED1 gene_id:161742|Hs108|chr15 (444 aa)
initn: 1368 init1: 769 opt: 937 Z-score: 940.5 bits: 183.1 E(32554): 4.6e-46
Smith-Waterman score: 1484; 53.1% identity (70.5% similar) in 458 aa overlap (1-417:1-443)
10 20 30 40 50
pF1KE1 MTEETHP-DDDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKVMHPEGNGRSGFLI
:.::: :.:. .::.:::::::::::::.: :.:.: : : :: : : :: . :.:
CCDS32 MSEETATSDNDNSYARVRAVVMTRDDSSGGWLPLGGSGLSSVTVFKVPHQEENGCADFFI
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE1 HGERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAI
.::: .::.:::::...:::.:.:..:::::::.:..::::::::::::::::::.:.::
CCDS32 RGERLRDKMVVLECMLKKDLIYNKVTPTFHHWKIDDKKFGLTFQSPADARAFDRGIRRAI
70 80 90 100 110 120
120 130 140 150 160
pF1KE1 EDLIEGSTTSSSTIHNEAELGDDDVFTTATDSSSNSSQ----KRE-----QPTRTIS-SP
::. .: :. :::: : ::. .. ::::. . ..: .: :. . :
CCDS32 EDISQGCPESK----NEAE-GADDLQANEEDSSSSLVKDHLFQQETVVTSEPYRSSNIRP
130 140 150 160 170
170 180 190 200
pF1KE1 TSCEH---RRIY--------TLGH--LHDSYPTDHYHLDQ--------------PMPRPY
. : ::.: :.:. : . . .: : :. .
CCDS32 SPFEDLNARRVYMQSQANQITFGQPGLDIQSRSMEYVQRQISKECGSLKSQNRVPL-KSI
180 190 200 210 220 230
210 220 230 240 250 260
pF1KE1 RQVSFPDDDEEIVRINPREKIWMTGYEDYRHAPVRGKYPDPSEDADSSYVRFAKGEVPKH
:.::: :.:: :::::::. : . : :::: : : .::::: ..:.: . :
CCDS32 RHVSFQDEDE-IVRINPRD-ILIRRYADYRH-PDMWKNDLERDDADSS-IQFSKPDSKKS
240 250 260 270 280 290
270 280 290 300 310
pF1KE1 DYNYPYVDSSDFGLGEDPKGRGGSVIKTQPSR---GKSRRRKEDGERSRCVYCRDMFNHE
:: : : . .. .:: . :.::::: ::.::::::::::::::.. ::::
CCDS32 DYLYSCGDETKLS---SPKD--SVVFKTQPSSLKIKKSKRRKEDGERSRCVYCQERFNHE
300 310 320 330 340
320 330 340 350 360 370
pF1KE1 ENRRGHCQDAPDSVRTCIRRVSCMWCADSMLYHCMSDPEGDYTDPCSCDTSDEKFCLRWM
:: ::.:::::: .. :: .:::: ::.::::::::: :::..:::::::::.::::::.
CCDS32 ENVRGKCQDAPDPIKRCIYQVSCMLCAESMLYHCMSDSEGDFSDPCSCDTSDDKFCLRWL
350 360 370 380 390 400
380 390 400 410
pF1KE1 ALIALSFLAPCMCCYLPLRACYHCGVMCRCCGGKHKAAA
::.::::..::::::.::: :..:: : :::::::::
CCDS32 ALVALSFIVPCMCCYVPLRMCHRCGEACGCCGGKHKAAG
410 420 430 440
418 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 12:07:46 2016 done: Sun Nov 6 12:07:47 2016
Total Scan time: 2.990 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]