FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KA0116, 291 aa
1>>>pF1KA0116 291 - 291 aa - 291 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.5859+/-0.000825; mu= 12.8372+/- 0.049
mean_var=53.0006+/-10.558, 0's: 0 Z-trim(104.6): 22 B-trim: 38 in 1/51
Lambda= 0.176171
statistics sampled from 7989 (7996) to 7989 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.623), E-opt: 0.2 (0.246), width: 16
Scan time: 2.250
The best scores are:                                 opt  bits  E(32554)
CCDS2725.1 EXOSC7 gene_id:23016|Hs108|chr3    ( 291) 1904 491.8 2.4e-139
CCDS31958.1 EXOSC8 gene_id:11340|Hs108|chr13  ( 276)  374 103.0 2.6e-22
CCDS3722.2 EXOSC9 gene_id:5393|Hs108|chr4     ( 439)  375 103.2 3.5e-22
CCDS34057.1 EXOSC9 gene_id:5393|Hs108|chr4    ( 456)  375 103.2 3.6e-22
>>CCDS2725.1 EXOSC7 gene_id:23016|Hs108|chr3 (291 aa)
initn: 1904 init1: 1904 opt: 1904 Z-score: 2615.1 bits: 491.8 E(32554): 2.4e-139
Smith-Waterman score: 1904; 99.7% identity (100.0% similar) in 291 aa overlap (1-291:1-291)
               10        20        30        40        50        60
pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KA0 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KA0 ELSDDPYDCIRLSVENVPCIVTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVTCMR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 ELSDDPYDCIRLSVENVPCIVTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVTCMR
              190       200       210       220       230       240
              250       260       270       280       290
pF1KA0 KVGKGSLDPESIFEMMETGKRVGKVLHASLQSVLHKEESLGPKRQKVGFLG
       :::::::::::::::::::::::::::::::::.:::::::::::::::::
CCDS27 KVGKGSLDPESIFEMMETGKRVGKVLHASLQSVVHKEESLGPKRQKVGFLG
              250       260       270       280       290
>>CCDS31958.1 EXOSC8 gene_id:11340|Hs108|chr13 (276 aa)
initn: 391 init1: 316 opt: 374 Z-score: 513.9 bits: 103.0 E(32554): 2.6e-22
Smith-Waterman score: 374; 28.9% identity (62.8% similar) in 266 aa overlap (18-276:18-276)
10 20 30 40 50 60
pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
..:. : ::: ..: . :. .:...::: ::::.: ..
CCDS31 MAAGFKTVEPLEYYRRFLKENCRPDGRELGEFRTTTVNIGSISTADGSALVKLGNTTVIC
10 20 30 40 50 60
70 80 90 100 110
pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFE-GRGGDDLGTEIANTLYR-IFNNKSS
:::::...:. . :..::. :: . .:. : :.. ...:. . ...:..
CCDS31 GVKAEFAAPSTDAPDKGYVVPNVDLPPLCSSRFRSGPPGEE--AQVASQFIADVIENSQI
70 80 90 100 110
120 130 140 150 160 170
pF1KA0 VDLKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSK
.. . ::::: . :::: :.. :. ::..:: ..:. ::: :...:.: . : : .
CCDS31 IQKEDLCISPGKLVWVLYCDLICLDYDGNILDACTFALLAALKNVQLPEVTINE-ETALA
120 130 140 150 160 170
180 190 200 210 220 230
pF1KA0 DIELSDDPYDCIRLSVENVPCIVTLCKIGYRH-VVDATLQEEACSLASLLVSVTSKGVVT
...:. : :.... : ... . .:: : .:: . ..: . . .: .
CCDS31 EVNLKKKSY----LNIRTHPVATSFAVFDDTLLIVDPTGEEEHLATGTLTIVMDEEGKLC
180 190 200 210 220 230
240 250 260 270 280 290
pF1KA0 CMRKVGKGSLDPESIFEMMETG----KRVGKVLHASLQSVLHKEESLGPKRQKVGFLG
:..: : ..: .. . : . :.: :.. ..:. :
CCDS31 CLHKPGGSGLTGAKLQDCMSRAVTRHKEVKKLMDEVIKSMKPK
240 250 260 270
>>CCDS3722.2 EXOSC9 gene_id:5393|Hs108|chr4 (439 aa)
initn: 355 init1: 262 opt: 375 Z-score: 511.7 bits: 103.2 E(32554): 3.5e-22
Smith-Waterman score: 375; 28.9% identity (60.2% similar) in 294 aa overlap (1-289:1-284)
10 20 30 40 50 60
pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
: . ::. :. .......: :.::: ::: ... .. : :.::.: .:
CCDS37 MKETPLSNCERRFLLRAIEEKKRLDGRQTYDYRNIRIS---FGTDYGCCIVELGKTRVLG
10 20 30 40 50
70 80 90 100 110 120
pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD
:. :. .:::.. .:: : : .. : :.: :: .:: ... . : . :.. .:
CCDS37 QVSCELVSPKLNRATEGILFFNLELSQMAAPAFEPGRQSDLLVKLNRLMERCLRNSKCID
60 70 80 90 100 110
130 140 150 160 170 180
pF1KA0 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI
..::. :. : . ::. ::. ::..:: :::. .:: . : : : : :: .
CCDS37 TESLCVVAGEKVWQIRVDLHLLNHDGNIIDAASIAAIVALCHFRRPDVSVQGDE-----V
120 130 140 150 160 170
190 200 210 220 230
pF1KA0 EL-SDDPYDCIRLSVENVP-CI-VTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVT
: . . : . ::....: :. .. . : .:: . .:: . .::: . .:
CCDS37 TLYTPEERDPVPLSIHHMPICVSFAFFQQGTYLLVDPNEREERV-MDGLLVIAMNKHREI
180 190 200 210 220 230
240 250 260 270 280 290
pF1KA0 C-MRKVGKGSLDPESIFEMME-TGKRVGKVLHASLQSVLHKEESLGPKRQKVGFLG
: ... : : ..... . .: .:... . :.. :...... . : ::
CCDS37 CTIQSSGGIMLLKDQVLRCSKIAGVKVAEITELILKA-LENDQKVRKEGGKFGFAESIAN
240 250 260 270 280 290
CCDS37 QRITAFKMEKAPIDTSDVEEKAEEIIAEAEPPSEVVSTPVLWTPGTAQIGEGVENSWGDL
300 310 320 330 340 350
>>CCDS34057.1 EXOSC9 gene_id:5393|Hs108|chr4 (456 aa)
initn: 355 init1: 262 opt: 375 Z-score: 511.4 bits: 103.2 E(32554): 3.6e-22
Smith-Waterman score: 375; 28.9% identity (60.2% similar) in 294 aa overlap (1-289:1-284)
10 20 30 40 50 60
pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
: . ::. :. .......: :.::: ::: ... .. : :.::.: .:
CCDS34 MKETPLSNCERRFLLRAIEEKKRLDGRQTYDYRNIRIS---FGTDYGCCIVELGKTRVLG
10 20 30 40 50
70 80 90 100 110 120
pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD
:. :. .:::.. .:: : : .. : :.: :: .:: ... . : . :.. .:
CCDS34 QVSCELVSPKLNRATEGILFFNLELSQMAAPAFEPGRQSDLLVKLNRLMERCLRNSKCID
60 70 80 90 100 110
130 140 150 160 170 180
pF1KA0 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI
..::. :. : . ::. ::. ::..:: :::. .:: . : : : : :: .
CCDS34 TESLCVVAGEKVWQIRVDLHLLNHDGNIIDAASIAAIVALCHFRRPDVSVQGDE-----V
120 130 140 150 160 170
190 200 210 220 230
pF1KA0 EL-SDDPYDCIRLSVENVP-CI-VTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVT
: . . : . ::....: :. .. . : .:: . .:: . .::: . .:
CCDS34 TLYTPEERDPVPLSIHHMPICVSFAFFQQGTYLLVDPNEREERV-MDGLLVIAMNKHREI
180 190 200 210 220 230
240 250 260 270 280 290
pF1KA0 C-MRKVGKGSLDPESIFEMME-TGKRVGKVLHASLQSVLHKEESLGPKRQKVGFLG
: ... : : ..... . .: .:... . :.. :...... . : ::
CCDS34 CTIQSSGGIMLLKDQVLRCSKIAGVKVAEITELILKA-LENDQKVRKEGGKFGFAESIAN
240 250 260 270 280 290
CCDS34 QRITAFKMEKAPIDTSDVEEKAEEIIAEAEPPSEVVSTPVLWTPGTAQIGEGVENSWGDL
300 310 320 330 340 350
291 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Wed Nov 2 18:08:56 2016 done: Wed Nov 2 18:08:56 2016
Total Scan time: 2.250 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]