FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5625, 493 aa 1>>>pF1KE5625 493 - 493 aa - 493 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0232+/-0.000925; mu= 14.9819+/- 0.056 mean_var=80.6599+/-15.576, 0's: 0 Z-trim(106.7): 12 B-trim: 0 in 0/50 Lambda= 0.142806 statistics sampled from 9124 (9132) to 9124 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.658), E-opt: 0.2 (0.281), width: 16 Scan time: 3.280 The best scores are: opt bits E(32554) CCDS10859.1 DUS2 gene_id:54920|Hs108|chr16 ( 493) 3260 681.4 5.9e-196 CCDS61970.1 DUS2 gene_id:54920|Hs108|chr16 ( 458) 2455 515.6 4.7e-146 CCDS5745.1 DUS4L gene_id:11062|Hs108|chr7 ( 317) 386 89.2 7.1e-18 CCDS32775.1 DUS1L gene_id:64118|Hs108|chr17 ( 473) 340 79.8 7.1e-15 >>CCDS10859.1 DUS2 gene_id:54920|Hs108|chr16 (493 aa) initn: 3260 init1: 3260 opt: 3260 Z-score: 3631.5 bits: 681.4 E(32554): 5.9e-196 Smith-Waterman score: 3260; 100.0% identity (100.0% similar) in 493 aa overlap (1-493:1-493) 10 20 30 40 50 60 pF1KE5 MILNSLSLCYHNKLILAPMVRVGTLPMRLLALDYGADIVYCEELIDLKMIQCKRVVNEVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MILNSLSLCYHNKLILAPMVRVGTLPMRLLALDYGADIVYCEELIDLKMIQCKRVVNEVL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 STVDFVAPDDRVVFRTCEREQNRVVFQMGTSDAERALAVARLVENDVAGIDVNMGCPKQY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 STVDFVAPDDRVVFRTCEREQNRVVFQMGTSDAERALAVARLVENDVAGIDVNMGCPKQY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 STKGGMGAALLSDPDKIEKILSTLVKGTRRPVTCKIRILPSLEDTLSLVKRIERTGIAAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 STKGGMGAALLSDPDKIEKILSTLVKGTRRPVTCKIRILPSLEDTLSLVKRIERTGIAAI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 AVHGRKREERPQHPVSCEVIKAIADTLSIPVIANGGSHDHIQQYSDIEDFRQATAASSVM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AVHGRKREERPQHPVSCEVIKAIADTLSIPVIANGGSHDHIQQYSDIEDFRQATAASSVM 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 VARAAMWNPSIFLKEGLRPLEEVMQKYIRYAVQYDNHYTNTKYCLCQMLREQLESPQGRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VARAAMWNPSIFLKEGLRPLEEVMQKYIRYAVQYDNHYTNTKYCLCQMLREQLESPQGRL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 LHAAQSSREICEAFGLGAFYEETTQELDAQQARLSAKTSEQTGEPAEDTSGVIKMAVKFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LHAAQSSREICEAFGLGAFYEETTQELDAQQARLSAKTSEQTGEPAEDTSGVIKMAVKFD 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 RRAYPAQITPKMCLLEWCRREKLAQPVYETVQRPLDRLFSSIVTVAEQKYQSTLWDKSKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 RRAYPAQITPKMCLLEWCRREKLAQPVYETVQRPLDRLFSSIVTVAEQKYQSTLWDKSKK 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE5 LAEQAAAIVCLRSQGLPEGRLGEESPSLHKRKREAPDQDPGGPRAQELAQPGDLCKKPFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LAEQAAAIVCLRSQGLPEGRLGEESPSLHKRKREAPDQDPGGPRAQELAQPGDLCKKPFV 430 440 450 460 470 480 490 pF1KE5 ALGSGEESPLEGW ::::::::::::: CCDS10 ALGSGEESPLEGW 490 >>CCDS61970.1 DUS2 gene_id:54920|Hs108|chr16 (458 aa) initn: 2455 init1: 2455 opt: 2455 Z-score: 2735.7 bits: 515.6 E(32554): 4.7e-146 Smith-Waterman score: 2955; 92.9% identity (92.9% similar) in 493 aa overlap (1-493:1-458) 10 20 30 40 50 60 pF1KE5 MILNSLSLCYHNKLILAPMVRVGTLPMRLLALDYGADIVYCEELIDLKMIQCKRVVNEVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 MILNSLSLCYHNKLILAPMVRVGTLPMRLLALDYGADIVYCEELIDLKMIQCKRVVNEVL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 STVDFVAPDDRVVFRTCEREQNRVVFQMGTSDAERALAVARLVENDVAGIDVNMGCPKQY :::::::::::::::::::::::::::: CCDS61 STVDFVAPDDRVVFRTCEREQNRVVFQM-------------------------------- 70 80 130 140 150 160 170 180 pF1KE5 STKGGMGAALLSDPDKIEKILSTLVKGTRRPVTCKIRILPSLEDTLSLVKRIERTGIAAI ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 ---GGMGAALLSDPDKIEKILSTLVKGTRRPVTCKIRILPSLEDTLSLVKRIERTGIAAI 90 100 110 120 130 140 190 200 210 220 230 240 pF1KE5 AVHGRKREERPQHPVSCEVIKAIADTLSIPVIANGGSHDHIQQYSDIEDFRQATAASSVM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 AVHGRKREERPQHPVSCEVIKAIADTLSIPVIANGGSHDHIQQYSDIEDFRQATAASSVM 150 160 170 180 190 200 250 260 270 280 290 300 pF1KE5 VARAAMWNPSIFLKEGLRPLEEVMQKYIRYAVQYDNHYTNTKYCLCQMLREQLESPQGRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 VARAAMWNPSIFLKEGLRPLEEVMQKYIRYAVQYDNHYTNTKYCLCQMLREQLESPQGRL 210 220 230 240 250 260 310 320 330 340 350 360 pF1KE5 LHAAQSSREICEAFGLGAFYEETTQELDAQQARLSAKTSEQTGEPAEDTSGVIKMAVKFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 LHAAQSSREICEAFGLGAFYEETTQELDAQQARLSAKTSEQTGEPAEDTSGVIKMAVKFD 270 280 290 300 310 320 370 380 390 400 410 420 pF1KE5 RRAYPAQITPKMCLLEWCRREKLAQPVYETVQRPLDRLFSSIVTVAEQKYQSTLWDKSKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 RRAYPAQITPKMCLLEWCRREKLAQPVYETVQRPLDRLFSSIVTVAEQKYQSTLWDKSKK 330 340 350 360 370 380 430 440 450 460 470 480 pF1KE5 LAEQAAAIVCLRSQGLPEGRLGEESPSLHKRKREAPDQDPGGPRAQELAQPGDLCKKPFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 LAEQAAAIVCLRSQGLPEGRLGEESPSLHKRKREAPDQDPGGPRAQELAQPGDLCKKPFV 390 400 410 420 430 440 490 pF1KE5 ALGSGEESPLEGW ::::::::::::: CCDS61 ALGSGEESPLEGW 450 >>CCDS5745.1 DUS4L gene_id:11062|Hs108|chr7 (317 aa) initn: 322 init1: 119 opt: 386 Z-score: 434.4 bits: 89.2 E(32554): 7.1e-18 Smith-Waterman score: 413; 28.8% identity (59.2% similar) in 299 aa overlap (15-310:30-307) 10 20 30 40 pF1KE5 MILNSLSLCYHNKLILAPMVRVGTLPMRLLALDYGADIVYCEELI . ::::: . : .: :. :. :. : .. CCDS57 MKSDCMQTTICQERKKDPIEMFHSGQLVKVCAPMVRYSKLAFRTLVRKYSCDLCYTPMIV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE5 DLKMIQCKRVVNEVLSTVDFVAPDDRVVFRTCEREQNRVVFQMGTSDAERALAVARLVEN ... .. . ..: . : .. :....::. .::.: CCDS57 AADFVKSIKARDSEFTTNQGDCP---------------LIVQFAANDARLLSDAARIVCP 70 80 90 100 110 120 130 140 150 160 pF1KE5 DVAGIDVNMGCPKQYSTKGGMGAALLSDPDKIEKILSTLVKGTRRP---VTCKIRILPSL . :::.: :::.... :.:: :.. :. .. ... . . .. : :. :::: .: CCDS57 YANGIDINCGCPQRWAMAEGYGACLINKPELVQDMVKQVRNQVETPGFSVSIKIRIHDDL 110 120 130 140 150 160 170 180 190 200 210 220 pF1KE5 EDTLSLVKRIERTGIAAIAVHGRKREERPQHPVSCEVIKAIADTLSIPVIANGGSHDHIQ . :..: .. : ::.. :.:::: ::: : :: . :: : ...:::::::: :. CCDS57 KRTVDLCQKAEATGVSWITVHGRTAEERHQ-PVHYDSIKIIKENMSIPVIANGD----IR 170 180 190 200 210 220 230 240 250 260 270 280 pF1KE5 QYSDIEDFRQATAASSVMVARAAMWNPSIFLKEGLRPLEEVMQKYIRYAVQYDNHYTNTK . .. :. . :....:::::. . ::..: ::. . . .. :.. . : . CCDS57 SLKEAENVWRITGTDGVMVARGLLANPAMFAGYEETPLKCIWD-WVDIALELGTPYMCFH 230 240 250 260 270 290 300 310 320 330 340 pF1KE5 YCLCQMLREQLESPQGRLLHAAQSSREICEAFGLGAFYEETTQELDAQQARLSAKTSEQT : :... . :...: .:. : CCDS57 QHLMYMMEKITSRQEKRVFNALSSTSAIIDYLTDHYGI 280 290 300 310 >>CCDS32775.1 DUS1L gene_id:64118|Hs108|chr17 (473 aa) initn: 353 init1: 207 opt: 340 Z-score: 380.5 bits: 79.8 E(32554): 7.1e-15 Smith-Waterman score: 373; 27.8% identity (56.5% similar) in 338 aa overlap (15-348:20-327) 10 20 30 40 50 pF1KE5 MILNSLSLCYHNKLILAPMVRVGTLPMRLLALDYGADIVYCEELIDLKMIQCKRV ..:::: . : :::. .::.. : : ... CCDS32 MPKLQGFEFWSRTLRGARHVVAPMVDQSELAWRLLSRRHGAQLCYTPMLHAQVFVRDANY 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 VNEVLSTVDFVAPDDRVVFRTCEREQNRVVFQMGTSDAERALAVARLVENDVAGIDVNMG .: : : :.:: . . :. ..: : . .: :... .::.:.: CCDS32 RKENLYCE--VCPEDRPL-----------IVQFCANDPEVFVQAALLAQDYCDAIDLNLG 70 80 90 100 120 130 140 150 160 170 pF1KE5 CPKQYSTKGGMGAALLSDPDKIEKILSTLVKGTRRPVTCKIRILPSLEDTLSLVKRIERT ::.. . .: .:: : .. : ..... . :::::::..: .. :. .. .:.. CCDS32 CPQMIAKRGHYGAFLQDEWDLLQRMILLAHEKLSVPVTCKIRVFPEIDKTVRYAQMLEKA 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE5 GIAAIAVHGRKREER-P-QHPVSCEVIKAIADTLSIPVIANGGSHDHIQQYSDIEDFRQA : ..:::: .:.. : . .: : :::. ...:::.::: .:: .:.: . CCDS32 GCQLLTVHGRTKEQKGPLSGAASWEHIKAVRKAVAIPVFANG----NIQCLQDVERCLRD 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE5 TAASSVMVARAAMWNPSIFLKEGLRP-LEEVMQKYIRYAVQYDNHYTNTKYCLCQMLREQ :....:: :.. . ::..: :: : . :. ..:. . .. : ...: . CCDS32 TGVQGVMSAEGNLHNPALF--EGRSPAVWELAEEYLDIVREHP--------CPLSYVRAH 230 240 250 260 270 300 310 320 330 340 350 pF1KE5 LESPQGRLLHAAQSSREICEAFGLGAFYEETTQELDAQQARLSAKTSEQTG-EPAEDTSG : . . :.. : :: ..::: . : . . :.: : .:. : CCDS32 LFKLWHHTLQVHQELREELAKVKTLEGIAAVSQEL---KLRCQEEISRQEGAKPTGDLPF 280 290 300 310 320 330 360 370 380 390 400 410 pF1KE5 VIKMAVKFDRRAYPAQITPKMCLLEWCRREKLAQPVYETVQRPLDRLFSSIVTVAEQKYQ CCDS32 HWICQPYIRPGPREGSKEKAGARSKRALEEEEGGTEVLSKNKQKKQLRNPHKTFDPSLKP 340 350 360 370 380 390 493 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 05:17:37 2016 done: Tue Nov 8 05:17:38 2016 Total Scan time: 3.280 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]