FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1367, 370 aa 1>>>pF1KE1367 370 - 370 aa - 370 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.2529+/-0.000797; mu= 12.8658+/- 0.048 mean_var=91.7237+/-18.126, 0's: 0 Z-trim(109.7): 27 B-trim: 0 in 0/52 Lambda= 0.133916 statistics sampled from 11044 (11068) to 11044 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.699), E-opt: 0.2 (0.34), width: 16 Scan time: 3.100 The best scores are: opt bits E(32554) CCDS13015.1 NSFL1C gene_id:55968|Hs108|chr20 ( 370) 2396 472.7 2.2e-133 CCDS56175.1 NSFL1C gene_id:55968|Hs108|chr20 ( 372) 2382 470.0 1.4e-132 CCDS13016.1 NSFL1C gene_id:55968|Hs108|chr20 ( 339) 1234 248.2 7.8e-66 CCDS43741.1 UBXN2B gene_id:137886|Hs108|chr8 ( 331) 1031 209.0 4.9e-54 CCDS1704.1 UBXN2A gene_id:165324|Hs108|chr2 ( 259) 502 106.7 2.3e-23 CCDS83297.1 UBXN2B gene_id:137886|Hs108|chr8 ( 196) 402 87.3 1.2e-17 >>CCDS13015.1 NSFL1C gene_id:55968|Hs108|chr20 (370 aa) initn: 2396 init1: 2396 opt: 2396 Z-score: 2508.1 bits: 472.7 E(32554): 2.2e-133 Smith-Waterman score: 2396; 100.0% identity (100.0% similar) in 370 aa overlap (1-370:1-370) 10 20 30 40 50 60 pF1KE1 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 DLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 DLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 EDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNIQI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 EDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNIQI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 RLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEANL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 RLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEANL 310 320 330 340 350 360 370 pF1KE1 LNAVIVQRLT :::::::::: CCDS13 LNAVIVQRLT 370 >>CCDS56175.1 NSFL1C gene_id:55968|Hs108|chr20 (372 aa) initn: 1844 init1: 1844 opt: 2382 Z-score: 2493.5 bits: 470.0 E(32554): 1.4e-132 Smith-Waterman score: 2382; 99.5% identity (99.5% similar) in 372 aa overlap (1-370:1-372) 10 20 30 40 50 60 pF1KE1 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQR--FYAGGSERSGQQIVGPPRKKSPNEL ::::::::::::::::::::::::::::::::: ::::::::::::::::::::::::: CCDS56 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRSRFYAGGSERSGQQIVGPPRKKSPNEL 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE1 VDDLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 VDDLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQ 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE1 DVHVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 DVHVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDH 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE1 RDEDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 RDEDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNI 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE1 QIRLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 QIRLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEA 310 320 330 340 350 360 360 370 pF1KE1 NLLNAVIVQRLT :::::::::::: CCDS56 NLLNAVIVQRLT 370 >>CCDS13016.1 NSFL1C gene_id:55968|Hs108|chr20 (339 aa) initn: 1222 init1: 1222 opt: 1234 Z-score: 1295.4 bits: 248.2 E(32554): 7.8e-66 Smith-Waterman score: 2114; 91.6% identity (91.6% similar) in 370 aa overlap (1-370:1-339) 10 20 30 40 50 60 pF1KE1 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 DLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDV :::::::::::::::::::::::::::: : CCDS13 DLFKGAKEHGAVAVERVTKSPGETSKPR-------------------------------V 130 140 190 200 210 220 230 240 pF1KE1 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD 150 160 170 180 190 200 250 260 270 280 290 300 pF1KE1 EDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNIQI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 EDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNIQI 210 220 230 240 250 260 310 320 330 340 350 360 pF1KE1 RLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEANL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 RLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEANL 270 280 290 300 310 320 370 pF1KE1 LNAVIVQRLT :::::::::: CCDS13 LNAVIVQRLT 330 >>CCDS43741.1 UBXN2B gene_id:137886|Hs108|chr8 (331 aa) initn: 1011 init1: 486 opt: 1031 Z-score: 1083.6 bits: 209.0 E(32554): 4.9e-54 Smith-Waterman score: 1042; 48.9% identity (75.4% similar) in 354 aa overlap (17-369:10-330) 10 20 30 40 50 60 pF1KE1 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS : .: :. .. :::.::: .::: . . CCDS43 MAEGGGPEPGEQERRSSGPRPPSARDLQLALAELYED-----------EVKCK 10 20 30 40 70 80 90 100 110 120 pF1KE1 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD :... : ..: :.. . ::::.. : :: .:: : : ...:. CCDS43 SSKSNRP---KATVFKS---------PRTPPQRFYSSEHEYSGLNIVRP----STGKIVN 50 60 70 80 130 140 150 160 170 180 pF1KE1 DLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDV .::: :.::::: ....:.. :. .: . :.:::::::.. ..: :. ::.. ::: CCDS43 ELFKEAREHGAVPLNEATRASGD-DKSKSFTGGGYRLGSSFCKRSEYIYGENQL---QDV 90 100 110 120 130 140 190 200 210 220 230 240 pF1KE1 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD ...::::..:::::.:::: :..:.:::::::..:::.: ::.::.::::::::::::.: CCDS43 QILLKLWSNGFSLDDGELRPYNEPTNAQFLESVKRGEIPLELQRLVHGGQVNLDMEDHQD 150 160 170 180 190 200 250 260 270 280 290 pF1KE1 EDFVKPKGAFKAFTGEGQKLGSTAPQVLST-SSPAQQAENEAKASSSILIDESEPTTNIQ ....::. ::::.:::::::: .:...:: ::: . :... .. .:::.: :::.:: CCDS43 QEYIKPRLRFKAFSGEGQKLGSLTPEIVSTPSSPEE--EDKSILNAVVLIDDSVPTTKIQ 210 220 230 240 250 260 300 310 320 330 340 350 pF1KE1 IRLADGGRLVQKFNHSHRISDIRLFIVDARPAMAATSFILMTTFPNKELADESQTLKEAN ::::::.::.:.:: .::: :.: :::..:: .:: .:::.:.::::::.::: :: ::. CCDS43 IRLADGSRLIQRFNSTHRILDVRNFIVQSRPEFAALDFILVTSFPNKELTDESLTLLEAD 270 280 290 300 310 320 360 370 pF1KE1 LLNAVIVQRLT .::.:..:.: CCDS43 ILNTVLLQQLK 330 >>CCDS1704.1 UBXN2A gene_id:165324|Hs108|chr2 (259 aa) initn: 485 init1: 247 opt: 502 Z-score: 532.9 bits: 106.7 E(32554): 2.3e-23 Smith-Waterman score: 502; 43.0% identity (75.1% similar) in 193 aa overlap (178-369:59-247) 150 160 170 180 190 200 pF1KE1 RPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDVHVVLKLWKSGFSLDNGELRSYQDPSNA ..: : .::::.::.. : ..:::.: .. CCDS17 QQSNCEYFVDSLFEEAQKVSSKCVSPAEQKKQVDVNIKLWKNGFTV-NDDFRSYSDGASQ 30 40 50 60 70 80 210 220 230 240 250 260 pF1KE1 QFLESIRRGEVPAELRRLAHGGQVNLDMEDHRDEDFVKPKGAFKAFTGEGQKLGSTAPQV :::.::..::.:.::. . .:.. .::...: .. : .:. :.:.:..:::..:.. CCDS17 QFLNSIKKGELPSELQGIFDKEEVDVKVEDKKNEICLSTKPVFQPFSGQGHRLGSATPKI 90 100 110 120 130 140 270 280 290 300 310 320 pF1KE1 LSTSSPAQQAENEAKAS-SSILIDESEPTTNIQIRLADGGRLVQKFNHSHRISDIRLFIV .: :.. : : : . :.. ... :: ::::: ::.: :.::::: .::.: :. :: CCDS17 VSK---AKNIEVENKNNLSAVPLNNLEPITNIQIWLANGKRIVQKFNITHRVSHIKDFIE 150 160 170 180 190 200 330 340 350 360 370 pF1KE1 DARPAMAATSFILMTTFPNKELADESQTLKEANLLNAVIVQRLT . .. . : : :..: .: ::. ::.::.: ::::.::: CCDS17 KYQGSQRSPPFSLATALPVLRLLDETLTLEEADLQNAVIIQRLQKTASFRELSEH 210 220 230 240 250 >>CCDS83297.1 UBXN2B gene_id:137886|Hs108|chr8 (196 aa) initn: 338 init1: 206 opt: 402 Z-score: 430.3 bits: 87.3 E(32554): 1.2e-17 Smith-Waterman score: 413; 39.5% identity (66.0% similar) in 200 aa overlap (17-216:10-178) 10 20 30 40 50 60 pF1KE1 MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS : .: :. .. :::.::: .::: . . CCDS83 MAEGGGPEPGEQERRSSGPRPPSARDLQLALAELYED-----------EVKCK 10 20 30 40 70 80 90 100 110 120 pF1KE1 VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD :... : ..: :.. . ::::.. : :: .:: : : ...:. CCDS83 SSKSNRP---KATVFKS---------PRTPPQRFYSSEHEYSGLNIVRP----STGKIVN 50 60 70 80 130 140 150 160 170 180 pF1KE1 DLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDV .::: :.::::: ....:.. :. .: . :.:::::::.. ..: :. ::.. ::: CCDS83 ELFKEAREHGAVPLNEATRASGD-DKSKSFTGGGYRLGSSFCKRSEYIYGENQL---QDV 90 100 110 120 130 140 190 200 210 220 230 240 pF1KE1 HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD ...::::..:::::.:::: :..:.:::::::..:: CCDS83 QILLKLWSNGFSLDDGELRPYNEPTNAQFLESVKRGVTLIACMPEIQQLMLEIF 150 160 170 180 190 250 260 270 280 290 300 pF1KE1 EDFVKPKGAFKAFTGEGQKLGSTAPQVLSTSSPAQQAENEAKASSSILIDESEPTTNIQI 370 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 23:24:38 2016 done: Sun Nov 6 23:24:39 2016 Total Scan time: 3.100 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]