FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9632, 214 aa
1>>>pF1KB9632 214 - 214 aa - 214 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.0454+/-0.000632; mu= 13.5465+/- 0.039
mean_var=123.3933+/-24.393, 0's: 0 Z-trim(116.2): 47 B-trim: 0 in 0/52
Lambda= 0.115459
statistics sampled from 16699 (16747) to 16699 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.829), E-opt: 0.2 (0.514), width: 16
Scan time: 2.670
The best scores are:                                  opt  bits E(32554)
CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 ( 214) 1442 249.8 9.4e-67
CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5    ( 237)  474  88.6 3.5e-18
CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4   ( 272)  414  78.7 3.9e-15
>>CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 (214 aa)
initn: 1442 init1: 1442 opt: 1442 Z-score: 1311.9 bits: 249.8 E(32554): 9.4e-67
Smith-Waterman score: 1442; 99.5% identity (99.5% similar) in 214 aa overlap (1-214:1-214)
               10        20        30        40        50        60
pF1KB9 MTPQPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MTPQPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 LRARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 LRARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 KIETLRFAHNYIWALTQTLRIADHSLYALEPPAPHCGELGSPGGSPGDWGSLYSPVSQAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 KIETLRFAHNYIWALTQTLRIADHSLYALEPPAPHCGELGSPGGSPGDWGSLYSPVSQAG
              130       140       150       160       170       180

              190       200       210
pF1KB9 SLSPAASLEERPGLLGATSSACLSPGSLAFSDFL
       :::::::::::::::::: :::::::::::::::
CCDS31 SLSPAASLEERPGLLGATFSACLSPGSLAFSDFL
              190       200       210
>>CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5 (237 aa)
initn: 442 init1: 412 opt: 474 Z-score: 439.9 bits: 88.6 E(32554): 3.5e-18
Smith-Waterman score: 489; 53.7% identity (70.9% similar) in 175 aa overlap (34-193:42-213)
10 20 30 40 50 60
pF1KB9 QPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRG--NCAEAEEGGCRGAPRKL
:.::.:.: :: : ..: : ..
CCDS41 LDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPAR-RGAPNISRASEVPGAQDDEQE
20 30 40 50 60 70
70 80 90 100 110 120
pF1KB9 RARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK
: :: ::.: .:: : . ::::: ::::::::::::::.::::::.:::.::::.::::
CCDS41 RRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLPSFPDDTKLTK
80 90 100 110 120 130
130 140 150 160 170
pF1KB9 IETLRFAHNYIWALTQTLRIADHSLYA------LEPP--APHCGELGSPGGSPGDWGS--
:::::::.::::::..:::.::..: . : :: .: ::... .:::
CCDS41 IETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPASDAESWGSGA
140 150 160 170 180 190
180 190 200 210
pF1KB9 -LYSPVSQAGSLSPAAS--LEERPGLLGATSSACLSPGSLAFSDFL
::.:. .: :::: . :::
CCDS41 AAASPLSDPSS--PAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
200 210 220 230
>>CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4 (272 aa)
initn: 419 init1: 372 opt: 414 Z-score: 385.2 bits: 78.7 E(32554): 3.9e-15
Smith-Waterman score: 428; 48.4% identity (64.6% similar) in 192 aa overlap (43-211:64-239)
20 30 40 50 60
pF1KB9 VTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRKLRA------RRG
::. : . :: :: : .: . ::
CCDS36 SSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CR--PARLLGLVHDCKRRP
40 50 60 70 80 90
70 80 90 100 110 120
pF1KB9 GRSRP-----KSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK
.:.: :. .... ...:: :::.::::::::::.:::::: ::::::.::::::
CCDS36 SRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVLPTFPEDAKLTK
100 110 120 130 140 150
130 140 150 160 170
pF1KB9 IETLRFAHNYIWALTQTLRIADHSLYALEPPAPHCGELGSPGGSPGDWGS---LYSP---
:::::::::::::::.:::.::: :: :. :: :: : : ::
CCDS36 IETLRFAHNYIWALTETLRLADH-----------CG--GGGGGLPGALFSEAVLLSPGGA
160 170 180 190
180 190 200 210
pF1KB9 ---VSQAG-SLSPAA--SLEERPGLLGATSSACLSPGSLAFSDFL
.:..: : :::. : . :. ...:: :: : ..:
CCDS36 SAALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPD
200 210 220 230 240 250
CCDS36 KHRYAPHLPIARDCI
260 270
214 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 17:51:35 2016 done: Fri Nov 4 17:51:36 2016
Total Scan time: 2.670 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]
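
The "best scores" table in the report above can be read into records with a short script. The following is a minimal sketch, not part of FASTA itself: the names `Hit`, `HIT_RE` and `parse_best_scores` are hypothetical, and the regular expression assumes the row layout shown in this report (accession, description, library sequence length in parentheses, then opt, bits and E-value).

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of the 'best scores' table (hypothetical record type)."""
    accession: str
    description: str
    length: int     # library sequence length, the number in parentheses
    opt: int        # optimized score
    bits: float     # bit score
    evalue: float   # E() value for the searched library


# Assumed row layout: <accession> <description> ( <len>) <opt> <bits> <E>
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s+\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<e>[-+.\deE]+)\s*$"
)


def parse_best_scores(report: str) -> List[Hit]:
    """Collect the rows between the table header and the first '>>' block."""
    hits: List[Hit] = []
    in_table = False
    for line in report.splitlines():
        if line.startswith("The best scores are:"):
            in_table = True
            continue
        if not in_table:
            continue
        if line.startswith(">>"):
            break  # alignment section begins, table is over
        match = HIT_RE.match(line.strip())
        if match:
            hits.append(Hit(
                accession=match["acc"],
                description=match["desc"],
                length=int(match["len"]),
                opt=int(match["opt"]),
                bits=float(match["bits"]),
                evalue=float(match["e"]),
            ))
    return hits


# Rows copied from the report above.
SAMPLE = """\
The best scores are: opt bits E(32554)
CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 ( 214) 1442 249.8 9.4e-67
CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5 ( 237) 474 88.6 3.5e-18
CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4 ( 272) 414 78.7 3.9e-15
>>CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 (214 aa)
"""

hits = parse_best_scores(SAMPLE)
for hit in hits:
    print(hit.accession, hit.opt, hit.bits, hit.evalue)
```

The pattern matches on runs of whitespace rather than fixed columns, so it should accept both space-collapsed and column-aligned versions of the table; other FASTA output modes may need a different pattern.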