FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9631, 272 aa
1>>>pF1KB9631 272 - 272 aa - 272 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.4093+/-0.000656; mu= 8.5281+/- 0.040
mean_var=162.3655+/-32.518, 0's: 0 Z-trim(117.2): 43 B-trim: 0 in 0/54
Lambda= 0.100653
statistics sampled from 17888 (17932) to 17888 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.551), width: 16
Scan time: 3.120
The best scores are:                                    opt  bits E(32554)
CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4     ( 272) 1811 273.6 1.1e-73
CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5      ( 237)  437  74.0 1.1e-13
CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10   ( 214)  414  70.6 1e-12
>>CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4 (272 aa)
initn: 1811 init1: 1811 opt: 1811 Z-score: 1436.6 bits: 273.6 E(32554): 1.1e-73
Smith-Waterman score: 1811; 100.0% identity (100.0% similar) in 272 aa overlap (1-272:1-272)
               10        20        30        40        50        60
pF1KB9 MFVKSETLELKEEEDVLVLLGSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 MFVKSETLELKEEEDVLVLLGSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAG
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB9 QGARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 QGARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNR
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB9 ERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 ERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGG
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB9 LPGALFSEAVLLSPGGASAALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 LPGALFSEAVLLSPGGASAALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSP
              190       200       210       220       230       240
              250       260       270
pF1KB9 ASPAGSDMDYWQPPPPDKHRYAPHLPIARDCI
       ::::::::::::::::::::::::::::::::
CCDS36 ASPAGSDMDYWQPPPPDKHRYAPHLPIARDCI
              250       260       270
>>CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5 (237 aa)
initn: 476 init1: 373 opt: 437 Z-score: 359.1 bits: 74.0 E(32554): 1.1e-13
Smith-Waterman score: 437; 46.2% identity (67.7% similar) in 186 aa overlap (51-233:33-215)
30 40 50 60 70
pF1KB9 GSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CRPARL
:: :..: :. : . . :: . : ...
CCDS41 ARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASAS-GPPAPARRGAPNISRASEV
10 20 30 40 50 60
80 90 100 110 120 130
pF1KB9 LGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVL
: : ..: : :. .: ... .. ....::.:::.::::::::::::::::: ::
CCDS41 PGAQDDEQER-RRRRGRTR-VRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVL
70 80 90 100 110
140 150 160 170 180 190
pF1KB9 PTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASA
:.::.:.:::::::::::.::::::.:::::::. :::. : . : :: :
CCDS41 PSFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSP
120 130 140 150 160 170
200 210 220 230 240 250
pF1KB9 ALS--SSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPD
: . : :.. . :: : .::: : . . .:
CCDS41 ASDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
180 190 200 210 220 230
260 270
pF1KB9 KHRYAPHLPIARDCI
>>CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 (214 aa)
initn: 423 init1: 372 opt: 414 Z-score: 341.7 bits: 70.6 E(32554): 1e-12
Smith-Waterman score: 427; 48.4% identity (64.7% similar) in 190 aa overlap (64-247:43-212)
40 50 60 70 80 90
pF1KB9 SSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CR--PARLLGLVHDCKRRP
::. : . :: :: : .: . ::
CCDS31 VTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRKLRA------RRG
20 30 40 50 60
100 110 120 130 140 150
pF1KB9 SRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVLPTFPEDAKLTK
.:.: :. .... ...:: :::.::::::::::.:::::: ::::::.::::::
CCDS31 GRSRP-----KSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK
70 80 90 100 110 120
160 170 180 190 200 210
pF1KB9 IETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASAALSSSGDSPSP
:::::::::::::::.:::.::: . :. .: : :::: : :: :
CCDS31 IETLRFAHNYIWALTQTLRIADHSLYALEP-PAPHCGE--LGSPGG------SPGDWGSL
130 140 150 160 170
220 230 240 250 260
pF1KB9 ASTWSCTNSPAPSSSVSSNST---SPYSCTLSPASPAGSDMDYWQPPPPDKHRYAPHLPI
: : ..: .:..:. . .: :::.: : ::
CCDS31 YSPVSQAGSLSPAASLEERPGLLGATFSACLSPGSLAFSDFL
180 190 200 210
270
pF1KB9 ARDCI
272 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 17:50:59 2016 done: Fri Nov 4 17:50:59 2016
Total Scan time: 3.120 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]