FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB6391, 276 aa
1>>>pF1KB6391 276 - 276 aa - 276 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1722+/-0.000722; mu= 15.4137+/- 0.043
mean_var=59.1434+/-11.706, 0's: 0 Z-trim(108.4): 20 B-trim: 0 in 0/53
Lambda= 0.166771
statistics sampled from 10169 (10182) to 10169 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.7), E-opt: 0.2 (0.313), width: 16
Scan time: 2.120
The best scores are:                                 opt  bits E(32554)
CCDS31958.1 EXOSC8 gene_id:11340|Hs108|chr13 ( 276) 1821 446.2 1.2e-125
CCDS3722.2 EXOSC9 gene_id:5393|Hs108|chr4    ( 439)  412 107.2 2.1e-23
CCDS34057.1 EXOSC9 gene_id:5393|Hs108|chr4   ( 456)  412 107.3 2.1e-23
CCDS2725.1 EXOSC7 gene_id:23016|Hs108|chr3   ( 291)  375  98.3 6.9e-21
>>CCDS31958.1 EXOSC8 gene_id:11340|Hs108|chr13 (276 aa)
initn: 1821 init1: 1821 opt: 1821 Z-score: 2369.1 bits: 446.2 E(32554): 1.2e-125
Smith-Waterman score: 1821; 100.0% identity (100.0% similar) in 276 aa overlap (1-276:1-276)
               10        20        30        40        50        60
pF1KB6 MAAGFKTVEPLEYYRRFLKENCRPDGRELGEFRTTTVNIGSISTADGSALVKLGNTTVIC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MAAGFKTVEPLEYYRRFLKENCRPDGRELGEFRTTTVNIGSISTADGSALVKLGNTTVIC
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB6 GVKAEFAAPSTDAPDKGYVVPNVDLPPLCSSRFRSGPPGEEAQVASQFIADVIENSQIIQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 GVKAEFAAPSTDAPDKGYVVPNVDLPPLCSSRFRSGPPGEEAQVASQFIADVIENSQIIQ
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB6 KEDLCISPGKLVWVLYCDLICLDYDGNILDACTFALLAALKNVQLPEVTINEETALAEVN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 KEDLCISPGKLVWVLYCDLICLDYDGNILDACTFALLAALKNVQLPEVTINEETALAEVN
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB6 LKKKSYLNIRTHPVATSFAVFDDTLLIVDPTGEEEHLATGTLTIVMDEEGKLCCLHKPGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 LKKKSYLNIRTHPVATSFAVFDDTLLIVDPTGEEEHLATGTLTIVMDEEGKLCCLHKPGG
              190       200       210       220       230       240
              250       260       270
pF1KB6 SGLTGAKLQDCMSRAVTRHKEVKKLMDEVIKSMKPK
       ::::::::::::::::::::::::::::::::::::
CCDS31 SGLTGAKLQDCMSRAVTRHKEVKKLMDEVIKSMKPK
              250       260       270
>>CCDS3722.2 EXOSC9 gene_id:5393|Hs108|chr4 (439 aa)
initn: 227 init1: 142 opt: 412 Z-score: 533.8 bits: 107.2 E(32554): 2.1e-23
Smith-Waterman score: 412; 29.2% identity (64.0% similar) in 264 aa overlap (15-272:11-271)
10 20 30 40 50
pF1KB6 MAAGFKTVEPLEYYRRFL----KENCRPDGRELGEFRTTTVNIGSISTADGSALVKLGNT
:::: .:. : :::. ..:. ...: : : .:.::.:
CCDS37 MKETPLSNCERRFLLRAIEEKKRLDGRQTYDYRNIRISFG---TDYGCCIVELGKT
10 20 30 40 50
60 70 80 90 100 110
pF1KB6 TVICGVKAEFAAPSTDAPDKGYVVPNVDLPPLCSSRFRSGPPGEEAQVASQFIADVIENS
:. :. :...:. . .: . :..: . . :. : .. .... ..::
CCDS37 RVLGQVSCELVSPKLNRATEGILFFNLELSQMAAPAFEPGRQSDLLVKLNRLMERCLRNS
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB6 QIIQKEDLCISPGKLVWVLYCDLICLDYDGNILDACTFALLAALKNVQLPEVTIN-EETA
. :. :.::. :. :: . :: :..::::.:: ..: ..:: . . :.:... .:..
CCDS37 KCIDTESLCVVAGEKVWQIRVDLHLLNHDGNIIDAASIAAIVALCHFRRPDVSVQGDEVT
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB6 LAEVNLKKKSYLNIRTHPVATSFAVFDD-TLLIVDPTGEEEHLATGTLTIVMDEEGKLCC
: . . :.:. :. .::: :.. : :.:::. .::.. : :.:.:... ..:
CCDS37 LYTPEERDPVPLSIHHMPICVSFAFFQQGTYLLVDPNEREERVMDGLLVIAMNKHREICT
180 190 200 210 220 230
240 250 260 270
pF1KB6 LHKPGGSGLTGAKLQDCMSRAVTRHKEVKKLMDEVIKSMKPK
... :: : .. : . : .. :. .:. .....
CCDS37 IQSSGGIMLLKDQVLRCSKIAGVKVAEITELILKALENDQKVRKEGGKFGFAESIANQRI
240 250 260 270 280 290
CCDS37 TAFKMEKAPIDTSDVEEKAEEIIAEAEPPSEVVSTPVLWTPGTAQIGEGVENSWGDLEDS
300 310 320 330 340 350
>>CCDS34057.1 EXOSC9 gene_id:5393|Hs108|chr4 (456 aa)
initn: 227 init1: 142 opt: 412 Z-score: 533.6 bits: 107.3 E(32554): 2.1e-23
Smith-Waterman score: 412; 29.2% identity (64.0% similar) in 264 aa overlap (15-272:11-271)
10 20 30 40 50
pF1KB6 MAAGFKTVEPLEYYRRFL----KENCRPDGRELGEFRTTTVNIGSISTADGSALVKLGNT
:::: .:. : :::. ..:. ...: : : .:.::.:
CCDS34 MKETPLSNCERRFLLRAIEEKKRLDGRQTYDYRNIRISFG---TDYGCCIVELGKT
10 20 30 40 50
60 70 80 90 100 110
pF1KB6 TVICGVKAEFAAPSTDAPDKGYVVPNVDLPPLCSSRFRSGPPGEEAQVASQFIADVIENS
:. :. :...:. . .: . :..: . . :. : .. .... ..::
CCDS34 RVLGQVSCELVSPKLNRATEGILFFNLELSQMAAPAFEPGRQSDLLVKLNRLMERCLRNS
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB6 QIIQKEDLCISPGKLVWVLYCDLICLDYDGNILDACTFALLAALKNVQLPEVTIN-EETA
. :. :.::. :. :: . :: :..::::.:: ..: ..:: . . :.:... .:..
CCDS34 KCIDTESLCVVAGEKVWQIRVDLHLLNHDGNIIDAASIAAIVALCHFRRPDVSVQGDEVT
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB6 LAEVNLKKKSYLNIRTHPVATSFAVFDD-TLLIVDPTGEEEHLATGTLTIVMDEEGKLCC
: . . :.:. :. .::: :.. : :.:::. .::.. : :.:.:... ..:
CCDS34 LYTPEERDPVPLSIHHMPICVSFAFFQQGTYLLVDPNEREERVMDGLLVIAMNKHREICT
180 190 200 210 220 230
240 250 260 270
pF1KB6 LHKPGGSGLTGAKLQDCMSRAVTRHKEVKKLMDEVIKSMKPK
... :: : .. : . : .. :. .:. .....
CCDS34 IQSSGGIMLLKDQVLRCSKIAGVKVAEITELILKALENDQKVRKEGGKFGFAESIANQRI
240 250 260 270 280 290
CCDS34 TAFKMEKAPIDTSDVEEKAEEIIAEAEPPSEVVSTPVLWTPGTAQIGEGVENSWGDLEDS
300 310 320 330 340 350
>>CCDS2725.1 EXOSC7 gene_id:23016|Hs108|chr3 (291 aa)
initn: 391 init1: 316 opt: 375 Z-score: 488.5 bits: 98.3 E(32554): 6.9e-21
Smith-Waterman score: 375; 28.6% identity (62.5% similar) in 269 aa overlap (18-276:18-283)
10 20 30 40 50 60
pF1KB6 MAAGFKTVEPLEYYRRFLKENCRPDGRELGEFRTTTVNIGSISTADGSALVKLGNTTVIC
..:. : ::: ..: . :. .:...::: ::::.: ..
CCDS27 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
10 20 30 40 50 60
70 80 90 100 110
pF1KB6 GVKAEFAAPSTDAPDKGYVVPNVDLPPLCSSRFRSGPPGEE--AQVASQFIADVIENSQI
:::::...:. . :..::. :: . .:. : :.. ...:. . ...:..
CCDS27 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFE-GRGGDDLGTEIANTLYR-IFNNKSS
70 80 90 100 110
120 130 140 150 160 170
pF1KB6 IQKEDLCISPGKLVWVLYCDLICLDYDGNILDACTFALLAALKNVQLPEVTINE-ETALA
.. . ::::: . :::: :.. :. ::..:: ..:. ::: :...:.: . : : .
CCDS27 VDLKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSK
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB6 EVNLKKKSY----LNIRTHPVATSFAVFDDTLLIVDPTGEEEHLATGTLTIVMDEEGKLC
...:. : :.... : ... . .:: : .:: . ..: . . .: .
CCDS27 DIELSDDPYDCIRLSVENVPCIVTLCKIGYRH-VVDATLQEEACSLASLLVSVTSKGVVT
180 190 200 210 220 230
240 250 260 270
pF1KB6 CLHKPGGSGLTGAKLQDCMSRAVTRHKEVKKLMDEVI---KSMKPK
:..: : ..: .. . : . : .. .. :. .:. ::
CCDS27 CMRKVGKGSLDPESIFEMMETGKRVGKVLHASLQSVVHKEESLGPKRQKVGFLG
240 250 260 270 280 290
276 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 20:46:13 2016 done: Fri Nov 4 20:46:13 2016
Total Scan time: 2.120 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]