FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9627, 382 aa
1>>>pF1KB9627 382 - 382 aa - 382 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.2658+/-0.000893; mu= 3.3527+/- 0.055
mean_var=293.7861+/-58.818, 0's: 0 Z-trim(117.5): 27 B-trim: 663 in 1/54
Lambda= 0.074827
statistics sampled from 18256 (18282) to 18256 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.833), E-opt: 0.2 (0.562), width: 16
Scan time: 2.810
The best scores are: opt bits E(32554)
CCDS11338.1 NEUROD2 gene_id:4761|Hs108|chr17 ( 382) 2634 296.9 2.1e-80
CCDS2283.1 NEUROD1 gene_id:4760|Hs108|chr2 ( 356) 1087 129.8 3.7e-30
CCDS5434.1 NEUROD6 gene_id:63974|Hs108|chr7 ( 337) 788 97.5 1.8e-20
CCDS8886.1 NEUROD4 gene_id:58158|Hs108|chr12 ( 331) 653 82.9 4.4e-16
>>CCDS11338.1 NEUROD2 gene_id:4761|Hs108|chr17 (382 aa)
initn: 2634 init1: 2634 opt: 2634 Z-score: 1557.1 bits: 296.9 E(32554): 2.1e-80
Smith-Waterman score: 2634; 100.0% identity (100.0% similar) in 382 aa overlap (1-382:1-382)
10 20 30 40 50 60
pF1KB9 MLTRLFSEPGLLSDVPKFASWGDGEDDEPRSDKGDAPPPPPPAPGPGAPGPARAAKPVPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MLTRLFSEPGLLSDVPKFASWGDGEDDEPRSDKGDAPPPPPPAPGPGAPGPARAAKPVPL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 RGEEGTEATLAEVKEEGELGGEEEEEEEEEEGLDEAEGERPKKRGPKKRKMTKARLERSK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 RGEEGTEATLAEVKEEGELGGEEEEEEEEEEGLDEAEGERPKKRGPKKRKMTKARLERSK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 LRRQKANARERNRMHDLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRSG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 LRRQKANARERNRMHDLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRSG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 KRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRNFLTEQGADGAGRFHGSGGPFAMHPYP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 KRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRNFLTEQGADGAGRFHGSGGPFAMHPYP
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 YPCSRLAGAQCQAAGGLGGGAAHALRTHGYCAAYETLYAAAGGGGASPDYNSSEYEGPLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 YPCSRLAGAQCQAAGGLGGGAAHALRTHGYCAAYETLYAAAGGGGASPDYNSSEYEGPLS
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB9 PPLCLNGNFSLKQDSSPDHEKSYHYSMHYSALPGSRPTGHGLVFGSSAVRGGVHSENLLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 PPLCLNGNFSLKQDSSPDHEKSYHYSMHYSALPGSRPTGHGLVFGSSAVRGGVHSENLLS
310 320 330 340 350 360
370 380
pF1KB9 YDMHLHHDRGPMYEELNAFFHN
::::::::::::::::::::::
CCDS11 YDMHLHHDRGPMYEELNAFFHN
370 380
>>CCDS2283.1 NEUROD1 gene_id:4760|Hs108|chr2 (356 aa)
initn: 1073 init1: 809 opt: 1087 Z-score: 655.0 bits: 129.8 E(32554): 3.7e-30
Smith-Waterman score: 1087; 51.2% identity (69.5% similar) in 361 aa overlap (31-382:6-356)
10 20 30 40 50 60
pF1KB9 MLTRLFSEPGLLSDVPKFASWGDGEDDEPRSDKGDAPPPPPPAPGPGAPGPARAAKPVPL
:..: : : .: :. ...
CCDS22 MTKSYSESGLMGEPQPQGP-PSWTDECLSSQDEEH
10 20 30
70 80 90 100 110
pF1KB9 RGEEGTEATLAEVKEEGEL--GGEEEEE----EEEEEGLDEAEGERPKKRGPKKRKMTKA
.... . . :: : :::::.: ::::: .: . ..::.:::::.:::::
CCDS22 EADKKEDDLETMNAEEDSLRNGGEEEDEDEDLEEEEEEEEEDDDQKPKRRGPKKKKMTKA
40 50 60 70 80 90
120 130 140 150 160 170
pF1KB9 RLERSKLRRQKANARERNRMHDLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALS
:::: ::::.::::::::::: ::::::::::::::::::::::::::::::::::::::
CCDS22 RLERFKLRRMKANARERNRMHGLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALS
100 110 120 130 140 150
180 190 200 210 220 230
pF1KB9 EILRSGKRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRNFLTEQGADGAGRFHGSGGPF
::::::: :::::.::::::::::::::::::::::: :.:: ::. : .. ... :
CCDS22 EILRSGKSPDLVSFVQTLCKGLSQPTTNLVAGCLQLNPRTFLPEQNQDMPPHLPTASASF
160 170 180 190 200 210
240 250 260 270 280 290
pF1KB9 AMHPYPYPCSRLAGAQCQAAGGLGGGAAHALRT--HGYCAAYETLYAAAGGGGASPDYNS
.::: : : : . .. . .. :.: :: : .. . .::.
CCDS22 PVHPYSYQSP---GLPSPPYGTMDSSHVFHVKPPPHAYSAALEPFFESPLTDCTSPS---
220 230 240 250 260
300 310 320 330 340 350
pF1KB9 SEYEGPLSPPLCLNGNFSLKQDSSPDHEKSYHYSMHYSALPGSRPTGHGLVF-GSSAVRG
..::::::: .:::::.:.. : . ::.: ..::: : . .:: .: :..: :
CCDS22 --FDGPLSPPLSINGNFSFKHEPSAEFEKNYAFTMHYPAATLAGAQSHGSIFSGTAAPRC
270 280 290 300 310 320
360 370 380
pF1KB9 GVHSENLLSYDMHLHHDRGPMYEELNAFFHN
. .:..:.: : ::.: : .:::.::.
CCDS22 EIPIDNIMSFDSHSHHER-VMSAQLNAIFHD
330 340 350
>>CCDS5434.1 NEUROD6 gene_id:63974|Hs108|chr7 (337 aa)
initn: 1037 init1: 696 opt: 788 Z-score: 480.8 bits: 97.5 E(32554): 1.8e-20
Smith-Waterman score: 1014; 52.0% identity (72.4% similar) in 333 aa overlap (51-382:34-337)
30 40 50 60 70 80
pF1KB9 WGDGEDDEPRSDKGDAPPPPPPAPGPGAPGPARAAKPVPLRGEEGTEATLAEVKEEGELG
: .: . :::. .: :...: :
CCDS54 LPFDESVVMPESQMCRKFSRECEDQKQIKKPESFSKQIVLRGKSIKRAPGEETEKEEE--
10 20 30 40 50 60
90 100 110 120 130 140
pF1KB9 GEEEEEEEEEEGLDEAEGERPKKRGPKKRKMTKARLERSKLRRQKANARERNRMHDLNAA
::..:::.:.:: :..:: .:.: :: :::: :.:::.:::::::::: :: :
CCDS54 -EEDREEEDENGL-------PRRRGLRKKKTTKLRLERVKFRRQEANARERNRMHGLNDA
70 80 90 100 110
150 160 170 180 190 200
pF1KB9 LDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRSGKRPDLVSYVQTLCKGLSQPT
:::::::::::::::::::::::::::::::::::::: ::::::...::.:::::::::
CCDS54 LDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRIGKRPDLLTFVQNLCKGLSQPT
120 130 140 150 160 170
210 220 230 240 250 260
pF1KB9 TNLVAGCLQLNSRNFLTEQGADGAGRFHGSGGPFAMHPYPYPCSRLAGAQCQAAGGLGGG
:::::::::::.:.:: ::...: : . .:.. :: .:. . : : .
CCDS54 TNLVAGCLQLNARSFLMGQGGEAA---HHTRSPYSTFYPPYHSPELTTP--PGHGTLDN-
180 190 200 210 220
270 280 290 300 310
pF1KB9 AAHALRTHGYCAAYETLYAAAGGGGASPDYNSSEYEGPLSPP-LCLNGNFSLKQDSSPDH
..... ..::.:::..: .. ::. : ..::::::: . :: :::::. . :.
CCDS54 -SKSMKPYNYCSAYESFYEST-----SPECASPQFEGPLSPPPINYNGIFSLKQEETLDY
230 240 250 260 270 280
320 330 340 350 360 370
pF1KB9 EKSYHYSMHYSALPGSRPTGHGLVFGSSAVRGGVHSENLLSYDMHLHHDRGPMYEELNAF
:.:.:.::: :.: : :.: .: : . ... . ::.::. . : .::::
CCDS54 GKNYNYGMHYCAVPPRGPLGQGAMF-----R--LPTDSHFPYDLHLRSQSLTMQDELNAV
290 300 310 320 330
380
pF1KB9 FHN
:::
CCDS54 FHN
>>CCDS8886.1 NEUROD4 gene_id:58158|Hs108|chr12 (331 aa)
initn: 924 init1: 642 opt: 653 Z-score: 402.2 bits: 82.9 E(32554): 4.4e-16
Smith-Waterman score: 849; 48.4% identity (67.4% similar) in 316 aa overlap (65-380:39-329)
40 50 60 70 80 90
pF1KB9 DAPPPPPPAPGPGAPGPARAAKPVPLRGEEGTEATLAEVKEEGELGGEEEEEEEEEEGLD
:: . :. . :: . . ::::::::.:
CCDS88 KEMGELVNTPSWMDKGLGSQNEVKEEESRPGTYGMLSSLTEEHD--SIEEEEEEEEDG--
10 20 30 40 50 60
100 110 120 130 140 150
pF1KB9 EAEGERPKKRGPKKRKMTKARLERSKLRRQKANARERNRMHDLNAALDNLRKVVPCYSKT
:.::.:::::.::::::::: . :: :::::::.::: :: ::::::.:.::::::
CCDS88 ----EKPKRRGPKKKKMTKARLERFRARRVKANARERTRMHGLNDALDNLRRVMPCYSKT
70 80 90 100 110 120
160 170 180 190 200 210
pF1KB9 QKLSKIETLRLAKNYIWALSEILRSGKRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRN
::::::::::::.::::::::.:..:. :. ..:. :::::::::.:::::::::. ..
CCDS88 QKLSKIETLRLARNYIWALSEVLETGQTPEGKGFVEMLCKGLSQPTSNLVAGCLQLGPQS
130 140 150 160 170 180
220 230 240 250 260 270
pF1KB9 FLTEQGADGAGRFHGSGGPFAMHPYPYPCSRLAGAQCQAAGGLGGGAAHALRTHGYCAAY
: :. : . . ...: . : : . : .: : : ..
CCDS88 VLLEKHED---KSPICDSAISVHNFNYQSPGLPSPP------YGHMETHLL--HLKPQVF
190 200 210 220
280 290 300 310 320 330
pF1KB9 ETLYAAAGGGGASPDYNSSEYEGPLSPPLCLNGNFSLKQDSSPDHEKSYHYSMHYSALPG
..: . .. :. :: .. :::::.::: ..::::::::.::: :::: . :: .
CCDS88 KSL-GESSFGSHLPDCSTPPYEGPLTPPLSISGNFSLKQDGSPDLEKSYSFMPHYPSSSL
230 240 250 260 270 280
340 350 360 370 380
pF1KB9 SRPTGHGLVFGSSAVRGGVHSENLLSYDMHLHHDRGPMYEELNAFFHN
: :. : ... : : . .::: . :: : .::. :
CCDS88 SSGHVHSTPFQAGTPRYDVPID--MSYDSYPHHGIGT---QLNTVFTE
290 300 310 320 330
382 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 17:48:04 2016 done: Fri Nov 4 17:48:05 2016
Total Scan time: 2.810 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]