FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1306, 411 aa
  1>>>pF1KE1306 411 - 411 aa - 411 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.5210+/-0.000894; mu= 15.4842+/- 0.054
 mean_var=59.3109+/-11.924, 0's: 0 Z-trim(105.6): 12 B-trim: 0 in 0/48
 Lambda= 0.166536
 statistics sampled from 8496 (8503) to 8496 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.643), E-opt: 0.2 (0.261), width: 16
 Scan time: 2.470

The best scores are:                                       opt  bits E(32554)
CCDS10982.1 DPEP1 gene_id:1800|Hs108|chr16         ( 411) 2776 675.4 2.7e-194
CCDS10857.1 DPEP2 gene_id:64174|Hs108|chr16        ( 486) 1225 302.8 4.7e-82
CCDS10856.1 DPEP3 gene_id:64180|Hs108|chr16        ( 513) 1161 287.4 2.1e-77

>>CCDS10982.1 DPEP1 gene_id:1800|Hs108|chr16             (411 aa)
 initn: 2776 init1: 2776 opt: 2776 Z-score: 3601.8 bits: 675.4 E(32554): 2.7e-194
Smith-Waterman score: 2776; 100.0% identity (100.0% similar) in 411 aa overlap (1-411:1-411)

               10        20        30        40        50        60
pF1KE1 MWSGWWLWPLVAVCTADFFRDEAERIMRDSPVIDGHNDLPWQLLDMFNNRLQDERANLTT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MWSGWWLWPLVAVCTADFFRDEAERIMRDSPVIDGHNDLPWQLLDMFNNRLQDERANLTT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 LAGTHTNIPKLRAGFVGGQFWSVYTPCDTQNKDAVRRTLEQMDVVHRMCRMYPETFLYVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LAGTHTNIPKLRAGFVGGQFWSVYTPCDTQNKDAVRRTLEQMDVVHRMCRMYPETFLYVT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 SSAGIRQAFREGKVASLIGVEGGHSIDSSLGVLRALYQLGMRYLTLTHSCNTPWADNWLV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 SSAGIRQAFREGKVASLIGVEGGHSIDSSLGVLRALYQLGMRYLTLTHSCNTPWADNWLV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 DTGDSEPQSQGLSPFGQRVVKELNRLGVLIDLAHVSVATMKATLQLSRAPVIFSHSSAYS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 DTGDSEPQSQGLSPFGQRVVKELNRLGVLIDLAHVSVATMKATLQLSRAPVIFSHSSAYS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE1 VCASRRNVPDDVLRLVKQTDSLVMVNFYNNYISCTNKANLSQVADHLDHIKEVAGARAVG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 VCASRRNVPDDVLRLVKQTDSLVMVNFYNNYISCTNKANLSQVADHLDHIKEVAGARAVG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE1 FGGDFDGVPRVPEGLEDVSKYPDLIAELLRRNWTEAEVKGALADNLLRVFEAVEQASNLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FGGDFDGVPRVPEGLEDVSKYPDLIAELLRRNWTEAEVKGALADNLLRVFEAVEQASNLT
              310       320       330       340       350       360

              370       380       390       400       410
pF1KE1 QAPEEEPIPLDQLGGSCRTHYGYSSGASSLHRHWGLLLASLAPLVLCLSLL
       :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 QAPEEEPIPLDQLGGSCRTHYGYSSGASSLHRHWGLLLASLAPLVLCLSLL
              370       380       390       400       410

>>CCDS10857.1 DPEP2 gene_id:64174|Hs108|chr16            (486 aa)
 initn: 1217 init1: 843 opt: 1225 Z-score: 1586.6 bits: 302.8 E(32554): 4.7e-82
Smith-Waterman score: 1225; 50.7% identity (80.2% similar) in 363 aa overlap (19-379:72-428)

                           10        20        30        40
pF1KE1             MWSGWWLWPLVAVCTADFFRDEAERIMRDSPVIDGHNDLPWQLLDMFN
                                     ....:. .::: :..:::::::  : ....
CCDS10 ALTTLGAPRAHTMPGTYAPSTTLSSPSTQGLQEQARALMRDFPLVDGHNDLPLVLRQVYQ
              50        60        70        80        90       100

       50        60        70        80        90       100
pF1KE1 NRLQDERANLTTLAGTHTNIPKLRAGFVGGQFWSVYTPCDTQNKDAVRRTLEQMDVVHRM
       . :::  .:: ...  .:.. .:: :.::.::::.:.::.::..::.: ::::.:...::
CCDS10 KGLQD--VNLRNFSYGQTSLDRLRDGLVGAQFWSAYVPCQTQDRDALRLTLEQIDLIRRM
               110       120       130       140       150

      110       120       130       140       150       160
pF1KE1 CRMYPETFLYVTSSAGIRQAFREGKVASLIGVEGGHSIDSSLGVLRALYQLGMRYLTLTH
       :  : :  : :::. .. ..    :.: :::::::::.:.::..::..:.::.:::::::
CCDS10 CASYSELEL-VTSAKALNDT---QKLACLIGVEGGHSLDNSLSILRTFYMLGVRYLTLTH
     160        170          180       190       200       210

      170       180       190       200       210       220
pF1KE1 SCNTPWADNWLVDTGDSEPQSQGLSPFGQRVVKELNRLGVLIDLAHVSVATMKATLQLSR
       .::::::..    . .   . .::. ::..:: :.::::...::.::: :. . .:..:.
CCDS10 TCNTPWAESSAKGVHSFYNNISGLTDFGEKVVAEMNRLGMMVDLSHVSDAVARRALEVSQ
         220       230       240       250       260       270

      230       240       250       260       270       280
pF1KE1 APVIFSHSSAYSVCASRRNVPDDVLRLVKQTDSLVMVNFYNNYISCTNKANLSQVADHLD
       ::::::::.: .:: : ::::::.:.:.:.. ..:::..  . :.:. .::.: ::::.:
CCDS10 APVIFSHSAARGVCNSARNVPDDILQLLKKNGGVVMVSLSMGVIQCNPSANVSTVADHFD
         280       290       300       310       320       330

      290       300       310       320       330       340
pF1KE1 HIKEVAGARAVGFGGDFDGVPRVPEGLEDVSKYPDLIAELLRRNWTEAEVKGALADNLLR
       ::: : :.. .:.:::.::. . :.:::::: :: :: ::: :.:.: :..:.:  ::::
CCDS10 HIKAVIGSKFIGIGGDYDGAGKFPQGLEDVSTYPVLIEELLSRGWSEEELQGVLRGNLLR
         340       350       360       370       380       390

      350         360       370       380       390       400
pF1KE1 VFEAVE--QASNLTQAPEEEPIPLDQLGGSCRTHYGYSSGASSLHRHWGLLLASLAPLVL
       ::. ::  :  :  :.: :. .: .::..::..
CCDS10 VFRQVEKVQEENKWQSPLEDKFPDEQLSSSCHSDLSRLRQRQSLTSGQELTEIPIHWTAK
         400       410       420       430       440       450

        410
pF1KE1 CLSLL

CCDS10 LPAKWSVSESSPHMAPVLAVVATFPVLILWL
         460       470       480

>>CCDS10856.1 DPEP3 gene_id:64180|Hs108|chr16            (513 aa)
 initn: 1113 init1: 790 opt: 1161 Z-score: 1503.2 bits: 287.4 E(32554): 2.1e-77
Smith-Waterman score: 1161; 47.7% identity (76.5% similar) in 375 aa overlap (19-391:105-473)

                           10        20        30        40
pF1KE1             MWSGWWLWPLVAVCTADFFRDEAERIMRDSPVIDGHNDLPWQLLDMFN
                                     .: .:. .::. :..:::::::  : . ..
CCDS10 GSPSLFTTPGVPSALTTPGLTTPGTPKTLDLRGRAQALMRSFPLVDGHNDLPQVLRQRYK
           80        90       100       110       120       130

       50        60        70        80        90       100
pF1KE1 NRLQDERANLTTLAGTHTNIPKLRAGFVGGQFWSVYTPCDTQNKDAVRRTLEQMDVVHRM
       : :::  .:: ...  .:.. .:: :.::.::::. . :..:.. ::: .:::.:..:::
CCDS10 NVLQD--VNLRNFSHGQTSLDRLRDGLVGAQFWSASVSCQSQDQTAVRLALEQIDLIHRM
            140       150       160       170       180       190

      110       120       130       140       150       160
pF1KE1 CRMYPETFLYVTSSAGIRQAFREGKVASLIGVEGGHSIDSSLGVLRALYQLGMRYLTLTH
       :  : :  : :::. :. ..    :.: :::::::::.::::.:::..: ::.::::::
CCDS10 CASYSELEL-VTSAEGLNSS---QKLACLIGVEGGHSLDSSLSVLRSFYVLGVRYLTLTF
            200        210          220       230       240

      170       180       190       200       210       220
pF1KE1 SCNTPWADNWLVDTGDSEPQSQGLSPFGQRVVKELNRLGVLIDLAHVSVATMKATLQLSR
       .:.::::..          . .::. ::..::.::::::..:::...: . .. .:..:.
CCDS10 TCSTPWAESSTKFRHHMYTNVSGLTSFGEKVVEELNRLGMMIDLSYASDTLIRRVLEVSQ
      250       260       270       280       290       300

      230       240       250       260       270       280
pF1KE1 APVIFSHSSAYSVCASRRNVPDDVLRLVKQTDSLVMVNFYNNYISCTNKANLSQVADHLD
       ::::::::.: .:: .  :::::.:.:.:.. ..:::..  . ..:.  ::.: ::::.:
CCDS10 APVIFSHSAARAVCDNLLNVPDDILQLLKKNGGIVMVTLSMGVLQCNLLANVSTVADHFD
      310       320       330       340       350       360

      290       300       310       320       330       340
pF1KE1 HIKEVAGARAVGFGGDFDGVPRVPEGLEDVSKYPDLIAELLRRNWTEAEVKGALADNLLR
       ::. : :.. .:.::..::. : :.:::::: :: :: ::: :.:.: :..:.:  ::::
CCDS10 HIRAVIGSEFIGIGGNYDGTGRFPQGLEDVSTYPVLIEELLSRSWSEEELQGVLRGNLLR
      370       380       390       400       410       420

      350       360         370       380       390       400
pF1KE1 VFEAVEQASNLT--QAPEEEPIPLDQLGGSCRTHYGYSSGASSLHRHWGLLLASLAPLVL
       ::. ::.. . .  :.: :  .:  ::. ::..:   ..: .. :
CCDS10 VFRQVEKVREESRAQSPVEAEFPYGQLSTSCHSHLVPQNGHQATHLEVTKQPTNRVPWRS
      430       440       450       460       470       480

        410
pF1KE1 CLSLL

CCDS10 SNASPYLVPGLVAAATIPTFTQWLC
      490       500       510


411 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 21:44:39 2016 done: Sun Nov 6 21:44:40 2016
 Total Scan time: 2.470 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]