FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1367, 370 aa
1>>>pF1KE1367 370 - 370 aa - 370 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.2529+/-0.000797; mu= 12.8658+/- 0.048
mean_var=91.7237+/-18.126, 0's: 0 Z-trim(109.7): 27 B-trim: 0 in 0/52
Lambda= 0.133916
statistics sampled from 11044 (11068) to 11044 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.699), E-opt: 0.2 (0.34), width: 16
Scan time: 3.100
The best scores are:                                 opt  bits E(32554)
CCDS13015.1 NSFL1C gene_id:55968|Hs108|chr20 ( 370) 2396 472.7 2.2e-133
CCDS56175.1 NSFL1C gene_id:55968|Hs108|chr20 ( 372) 2382 470.0 1.4e-132
CCDS13016.1 NSFL1C gene_id:55968|Hs108|chr20 ( 339) 1234 248.2 7.8e-66
CCDS43741.1 UBXN2B gene_id:137886|Hs108|chr8 ( 331) 1031 209.0 4.9e-54
CCDS1704.1 UBXN2A gene_id:165324|Hs108|chr2  ( 259)  502 106.7 2.3e-23
CCDS83297.1 UBXN2B gene_id:137886|Hs108|chr8 ( 196)  402  87.3 1.2e-17
>>CCDS13015.1 NSFL1C gene_id:55968|Hs108|chr20 (370 aa)
initn: 2396 init1: 2396 opt: 2396 Z-score: 2508.1 bits: 472.7 E(32554): 2.2e-133
Smith-Waterman score: 2396; 100.0% identity (100.0% similar) in 370 aa overlap (1-370:1-370)
10 20 30 40 50 60
pF1KE1 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 DLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 DLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 EDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNIQI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 EDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNIQI
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE1 RLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEANL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 RLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEANL
310 320 330 340 350 360
370
pF1KE1 LNAVIVQRLT
::::::::::
CCDS13 LNAVIVQRLT
370
>>CCDS56175.1 NSFL1C gene_id:55968|Hs108|chr20 (372 aa)
initn: 1844 init1: 1844 opt: 2382 Z-score: 2493.5 bits: 470.0 E(32554): 1.4e-132
Smith-Waterman score: 2382; 99.5% identity (99.5% similar) in 372 aa overlap (1-370:1-372)
10 20 30 40 50 60
pF1KE1 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS
10 20 30 40 50 60
70 80 90 100 110
pF1KE1 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQR--FYAGGSERSGQQIVGPPRKKSPNEL
::::::::::::::::::::::::::::::::: :::::::::::::::::::::::::
CCDS56 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRSRFYAGGSERSGQQIVGPPRKKSPNEL
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE1 VDDLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 VDDLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQ
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE1 DVHVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 DVHVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDH
190 200 210 220 230 240
240 250 260 270 280 290
pF1KE1 RDEDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 RDEDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNI
250 260 270 280 290 300
300 310 320 330 340 350
pF1KE1 QIRLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 QIRLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEA
310 320 330 340 350 360
360 370
pF1KE1 NLLNAVIVQRLT
::::::::::::
CCDS56 NLLNAVIVQRLT
370
>>CCDS13016.1 NSFL1C gene_id:55968|Hs108|chr20 (339 aa)
initn: 1222 init1: 1222 opt: 1234 Z-score: 1295.4 bits: 248.2 E(32554): 7.8e-66
Smith-Waterman score: 2114; 91.6% identity (91.6% similar) in 370 aa overlap (1-370:1-339)
10 20 30 40 50 60
pF1KE1 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 DLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDV
:::::::::::::::::::::::::::: :
CCDS13 DLFKGAKEHGAVAVERVTKSPGETSKPR-------------------------------V
130 140
190 200 210 220 230 240
pF1KE1 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD
150 160 170 180 190 200
250 260 270 280 290 300
pF1KE1 EDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNIQI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 EDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNIQI
210 220 230 240 250 260
310 320 330 340 350 360
pF1KE1 RLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEANL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 RLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEANL
270 280 290 300 310 320
370
pF1KE1 LNAVIVQRLT
::::::::::
CCDS13 LNAVIVQRLT
330
>>CCDS43741.1 UBXN2B gene_id:137886|Hs108|chr8 (331 aa)
initn: 1011 init1: 486 opt: 1031 Z-score: 1083.6 bits: 209.0 E(32554): 4.9e-54
Smith-Waterman score: 1042; 48.9% identity (75.4% similar) in 354 aa overlap (17-369:10-330)
10 20 30 40 50 60
pF1KE1 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS
: .: :. .. :::.::: .::: . .
CCDS43 MAEGGGPEPGEQERRSSGPRPPSARDLQLALAELYED-----------EVKCK
10 20 30 40
70 80 90 100 110 120
pF1KE1 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD
:... : ..: :.. . ::::.. : :: .:: : : ...:.
CCDS43 SSKSNRP---KATVFKS---------PRTPPQRFYSSEHEYSGLNIVRP----STGKIVN
50 60 70 80
130 140 150 160 170 180
pF1KE1 DLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDV
.::: :.::::: ....:.. :. .: . :.:::::::.. ..: :. ::.. :::
CCDS43 ELFKEAREHGAVPLNEATRASGD-DKSKSFTGGGYRLGSSFCKRSEYIYGENQL---QDV
90 100 110 120 130 140
190 200 210 220 230 240
pF1KE1 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD
...::::..:::::.:::: :..:.:::::::..:::.: ::.::.::::::::::::.:
CCDS43 QILLKLWSNGFSLDDGELRPYNEPTNAQFLESVKRGEIPLELQRLVHGGQVNLDMEDHQD
150 160 170 180 190 200
250 260 270 280 290
pF1KE1 EDFVKPKGAFKAFTGEGQKLGSTAPQVLST-SSPAQQAENEAKASSSILIDESEPTTNIQ
....::. ::::.:::::::: .:...:: ::: . :... .. .:::.: :::.::
CCDS43 QEYIKPRLRFKAFSGEGQKLGSLTPEIVSTPSSPEE--EDKSILNAVVLIDDSVPTTKIQ
210 220 230 240 250 260
300 310 320 330 340 350
pF1KE1 IRLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEAN
::::::.::.:.:: .::: :.: :::..:: .:: .:::.:.::::::.::: :: ::.
CCDS43 IRLADGSRLIQRFNSTHRILDVRNFIVQSRPEFAALDFILVTSFPNKELTDESLTLLEAD
270 280 290 300 310 320
360 370
pF1KE1 LLNAVIVQRLT
.::.:..:.:
CCDS43 ILNTVLLQQLK
330
>>CCDS1704.1 UBXN2A gene_id:165324|Hs108|chr2 (259 aa)
initn: 485 init1: 247 opt: 502 Z-score: 532.9 bits: 106.7 E(32554): 2.3e-23
Smith-Waterman score: 502; 43.0% identity (75.1% similar) in 193 aa overlap (178-369:59-247)
150 160 170 180 190 200
pF1KE1 RPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDVHVVLKLWKSGFSLDNGELRSYQDPSNA
..: : .::::.::.. : ..:::.: ..
CCDS17 QQSNCEYFVDSLFEEAQKVSSKCVSPAEQKKQVDVNIKLWKNGFTV-NDDFRSYSDGASQ
30 40 50 60 70 80
210 220 230 240 250 260
pF1KE1 QFLESIRRGEVPAELRRLAHGGQVNLDMEDHRDEDFVKPKGAFKAFTGEGQKLGSTAPQV
:::.::..::.:.::. . .:.. .::...: .. : .:. :.:.:..:::..:..
CCDS17 QFLNSIKKGELPSELQGIFDKEEVDVKVEDKKNEICLSTKPVFQPFSGQGHRLGSATPKI
90 100 110 120 130 140
270 280 290 300 310 320
pF1KE1 LSTSSPAQQAENEAKAS-SSILIDESEPTTNIQIRLADGGRLVQKFNHSHRISDIRLFIV
.: :.. : : : . :.. ... :: ::::: ::.: :.::::: .::.: :. ::
CCDS17 VSK---AKNIEVENKNNLSAVPLNNLEPITNIQIWLANGKRIVQKFNITHRVSHIKDFIE
150 160 170 180 190 200
330 340 350 360 370
pF1KE1 DARPAMAATSFILMTTFPNKELADESQTLKEANLLNAVIVQRLT
. .. . : : :..: .: ::. ::.::.: ::::.:::
CCDS17 KYQGSQRSPPFSLATALPVLRLLDETLTLEEADLQNAVIIQRLQKTASFRELSEH
210 220 230 240 250
>>CCDS83297.1 UBXN2B gene_id:137886|Hs108|chr8 (196 aa)
initn: 338 init1: 206 opt: 402 Z-score: 430.3 bits: 87.3 E(32554): 1.2e-17
Smith-Waterman score: 413; 39.5% identity (66.0% similar) in 200 aa overlap (17-216:10-178)
10 20 30 40 50 60
pF1KE1 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS
: .: :. .. :::.::: .::: . .
CCDS83 MAEGGGPEPGEQERRSSGPRPPSARDLQLALAELYED-----------EVKCK
10 20 30 40
70 80 90 100 110 120
pF1KE1 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD
:... : ..: :.. . ::::.. : :: .:: : : ...:.
CCDS83 SSKSNRP---KATVFKS---------PRTPPQRFYSSEHEYSGLNIVRP----STGKIVN
50 60 70 80
130 140 150 160 170 180
pF1KE1 DLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDV
.::: :.::::: ....:.. :. .: . :.:::::::.. ..: :. ::.. :::
CCDS83 ELFKEAREHGAVPLNEATRASGD-DKSKSFTGGGYRLGSSFCKRSEYIYGENQL---QDV
90 100 110 120 130 140
190 200 210 220 230 240
pF1KE1 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD
...::::..:::::.:::: :..:.:::::::..::
CCDS83 QILLKLWSNGFSLDDGELRPYNEPTNAQFLESVKRGVTLIACMPEIQQLMLEIF
150 160 170 180 190
250 260 270 280 290 300
pF1KE1 EDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNIQI
370 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 23:24:38 2016 done: Sun Nov 6 23:24:39 2016
Total Scan time: 3.100 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]