FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1940, 486 aa 1>>>pF1KE1940 486 - 486 aa - 486 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.9297+/-0.000796; mu= 15.8269+/- 0.048 mean_var=84.3440+/-16.446, 0's: 0 Z-trim(108.9): 8 B-trim: 0 in 0/53 Lambda= 0.139652 statistics sampled from 10546 (10550) to 10546 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.684), E-opt: 0.2 (0.324), width: 16 Scan time: 3.470 The best scores are: opt bits E(32554) CCDS10857.1 DPEP2 gene_id:64174|Hs108|chr16 ( 486) 3223 659.1 3e-189 CCDS10856.1 DPEP3 gene_id:64180|Hs108|chr16 ( 513) 2179 448.8 6.6e-126 CCDS10982.1 DPEP1 gene_id:1800|Hs108|chr16 ( 411) 1225 256.5 4e-68 >>CCDS10857.1 DPEP2 gene_id:64174|Hs108|chr16 (486 aa) initn: 3223 init1: 3223 opt: 3223 Z-score: 3511.0 bits: 659.1 E(32554): 3e-189 Smith-Waterman score: 3223; 100.0% identity (100.0% similar) in 486 aa overlap (1-486:1-486) 10 20 30 40 50 60 pF1KE1 MQPSGLEGPGTFGRWPLLSLLLLLLLLQPVTCAYTTPGPPRALTTLGAPRAHTMPGTYAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MQPSGLEGPGTFGRWPLLSLLLLLLLLQPVTCAYTTPGPPRALTTLGAPRAHTMPGTYAP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 STTLSSPSTQGLQEQARALMRDFPLVDGHNDLPLVLRQVYQKGLQDVNLRNFSYGQTSLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 STTLSSPSTQGLQEQARALMRDFPLVDGHNDLPLVLRQVYQKGLQDVNLRNFSYGQTSLD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 RLRDGLVGAQFWSAYVPCQTQDRDALRLTLEQIDLIRRMCASYSELELVTSAKALNDTQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 RLRDGLVGAQFWSAYVPCQTQDRDALRLTLEQIDLIRRMCASYSELELVTSAKALNDTQK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 LACLIGVEGGHSLDNSLSILRTFYMLGVRYLTLTHTCNTPWAESSAKGVHSFYNNISGLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LACLIGVEGGHSLDNSLSILRTFYMLGVRYLTLTHTCNTPWAESSAKGVHSFYNNISGLT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 DFGEKVVAEMNRLGMMVDLSHVSDAVARRALEVSQAPVIFSHSAARGVCNSARNVPDDIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 DFGEKVVAEMNRLGMMVDLSHVSDAVARRALEVSQAPVIFSHSAARGVCNSARNVPDDIL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 QLLKKNGGVVMVSLSMGVIQCNPSANVSTVADHFDHIKAVIGSKFIGIGGDYDGAGKFPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 QLLKKNGGVVMVSLSMGVIQCNPSANVSTVADHFDHIKAVIGSKFIGIGGDYDGAGKFPQ 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 GLEDVSTYPVLIEELLSRGWSEEELQGVLRGNLLRVFRQVEKVQEENKWQSPLEDKFPDE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 GLEDVSTYPVLIEELLSRGWSEEELQGVLRGNLLRVFRQVEKVQEENKWQSPLEDKFPDE 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 QLSSSCHSDLSRLRQRQSLTSGQELTEIPIHWTAKLPAKWSVSESSPHMAPVLAVVATFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 QLSSSCHSDLSRLRQRQSLTSGQELTEIPIHWTAKLPAKWSVSESSPHMAPVLAVVATFP 430 440 450 460 470 480 pF1KE1 VLILWL :::::: CCDS10 VLILWL >>CCDS10856.1 DPEP3 gene_id:64180|Hs108|chr16 (513 aa) initn: 2157 init1: 1926 opt: 2179 Z-score: 2373.9 bits: 448.8 E(32554): 6.6e-126 Smith-Waterman score: 2179; 69.6% identity (86.0% similar) in 494 aa overlap (1-486:26-512) 10 20 30 pF1KE1 MQPSGLEGPGTFGRWPLLSLLLLLLLL---QPVTC :::.: :: ...: : :::::::: :::: CCDS10 MIRTPLSASAHRLLLPGSRGRPPRNMQPTGREGSRALSRRYLRRLLLLLLLLLLRQPVTR 10 20 30 40 50 60 40 50 60 70 80 pF1KE1 AYTTPGPPRALTTLGAPRAHTMPGTYA----PS-TTLSSPSTQGLQEQARALMRDFPLVD : :::: ::::.:::.: : ::. . :. :: ..:.: :. .:.::::.::::: CCDS10 AETTPGAPRALSTLGSPSLFTTPGVPSALTTPGLTTPGTPKTLDLRGRAQALMRSFPLVD 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE1 GHNDLPLVLRQVYQKGLQDVNLRNFSYGQTSLDRLRDGLVGAQFWSAYVPCQTQDRDALR :::::: :::: :.. ::::::::::.:::::::::::::::::::: : ::.::. :.: CCDS10 GHNDLPQVLRQRYKNVLQDVNLRNFSHGQTSLDRLRDGLVGAQFWSASVSCQSQDQTAVR 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE1 LTLEQIDLIRRMCASYSELELVTSAKALNDTQKLACLIGVEGGHSLDNSLSILRTFYMLG :.:::::::.:::::::::::::::..::..::::::::::::::::.:::.::.::.:: CCDS10 LALEQIDLIHRMCASYSELELVTSAEGLNSSQKLACLIGVEGGHSLDSSLSVLRSFYVLG 190 200 210 220 230 240 210 220 230 240 250 260 pF1KE1 VRYLTLTHTCNTPWAESSAKGVHSFYNNISGLTDFGEKVVAEMNRLGMMVDLSHVSDAVA ::::::: ::.:::::::.: : .:.:.::::.:::::: :.::::::.:::..::.. CCDS10 VRYLTLTFTCSTPWAESSTKFRHHMYTNVSGLTSFGEKVVEELNRLGMMIDLSYASDTLI 250 260 270 280 290 300 270 280 290 300 310 320 pF1KE1 RRALEVSQAPVIFSHSAARGVCNSARNVPDDILQLLKKNGGVVMVSLSMGVIQCNPSANV ::.::::::::::::::::.::.. :::::::::::::::.:::.:::::.::: ::: CCDS10 RRVLEVSQAPVIFSHSAARAVCDNLLNVPDDILQLLKKNGGIVMVTLSMGVLQCNLLANV 310 320 330 340 350 360 330 340 350 360 370 380 pF1KE1 STVADHFDHIKAVIGSKFIGIGGDYDGAGKFPQGLEDVSTYPVLIEELLSRGWSEEELQG ::::::::::.:::::.::::::.:::.:.:::::::::::::::::::::.:::::::: CCDS10 STVADHFDHIRAVIGSEFIGIGGNYDGTGRFPQGLEDVSTYPVLIEELLSRSWSEEELQG 370 380 390 400 410 420 390 400 410 420 430 440 pF1KE1 VLRGNLLRVFRQVEKVQEENKWQSPLEDKFPDEQLSSSCHSDLSRLRQRQSLTSGQELTE ::::::::::::::::.::.. :::.: .:: :::.:::: : . : .. :.:. CCDS10 VLRGNLLRVFRQVEKVREESRAQSPVEAEFPYGQLSTSCHSHL--VPQNGHQATHLEVTK 430 440 450 460 470 450 460 470 480 pF1KE1 IPIHWTAKLPAKWSVSESSPHMAPVLAVVATFPVLILWL : : ..: : :..::...: :...::.:.. :: CCDS10 QP---TNRVP--WRSSNASPYLVPGLVAAATIPTFTQWLC 480 490 500 510 >>CCDS10982.1 DPEP1 gene_id:1800|Hs108|chr16 (411 aa) initn: 1217 init1: 843 opt: 1225 Z-score: 1336.5 bits: 256.5 E(32554): 4e-68 Smith-Waterman score: 1225; 50.7% identity (80.2% similar) in 363 aa overlap (72-428:19-379) 50 60 70 80 90 100 pF1KE1 ALTTLGAPRAHTMPGTYAPSTTLSSPSTQGLQEQARALMRDFPLVDGHNDLPLVLRQVYQ ....:. .::: :..::::::: : .... CCDS10 MWSGWWLWPLVAVCTADFFRDEAERIMRDSPVIDGHNDLPWQLLDMFN 10 20 30 40 110 120 130 140 150 pF1KE1 KGLQD--VNLRNFSYGQTSLDRLRDGLVGAQFWSAYVPCQTQDRDALRLTLEQIDLIRRM . ::: .:: ... .:.. .:: :.::.::::.:.::.::..::.: ::::.:...:: CCDS10 NRLQDERANLTTLAGTHTNIPKLRAGFVGGQFWSVYTPCDTQNKDAVRRTLEQMDVVHRM 50 60 70 80 90 100 160 170 180 190 200 210 pF1KE1 CASYSELEL-VTSAKALNDT---QKLACLIGVEGGHSLDNSLSILRTFYMLGVRYLTLTH : : : : :::. .. .. :.: :::::::::.:.::..::..:.::.::::::: CCDS10 CRMYPETFLYVTSSAGIRQAFREGKVASLIGVEGGHSIDSSLGVLRALYQLGMRYLTLTH 110 120 130 140 150 160 220 230 240 250 260 270 pF1KE1 TCNTPWAESSAKGVHSFYNNISGLTDFGEKVVAEMNRLGMMVDLSHVSDAVARRALEVSQ .::::::.. . . . .::. ::..:: :.::::...::.::: :. . .:..:. CCDS10 SCNTPWADNWLVDTGDSEPQSQGLSPFGQRVVKELNRLGVLIDLAHVSVATMKATLQLSR 170 180 190 200 210 220 280 290 300 310 320 330 pF1KE1 APVIFSHSAARGVCNSARNVPDDILQLLKKNGGVVMVSLSMGVIQCNPSANVSTVADHFD ::::::::.: .:: : ::::::.:.:.:.. ..:::.. . :.:. .::.: ::::.: CCDS10 APVIFSHSSAYSVCASRRNVPDDVLRLVKQTDSLVMVNFYNNYISCTNKANLSQVADHLD 230 240 250 260 270 280 340 350 360 370 380 390 pF1KE1 HIKAVIGSKFIGIGGDYDGAGKFPQGLEDVSTYPVLIEELLSRGWSEEELQGVLRGNLLR ::: : :.. .:.:::.::. . :.:::::: :: :: ::: :.:.: :..:.: :::: CCDS10 HIKEVAGARAVGFGGDFDGVPRVPEGLEDVSKYPDLIAELLRRNWTEAEVKGALADNLLR 290 300 310 320 330 340 400 410 420 430 440 450 pF1KE1 VFRQVEKVQEENKWQSPLEDKFPDEQLSSSCHSDLSRLRQRQSLTSGQELTEIPIHWTAK ::. :: : : :.: :. .: .::..::.. CCDS10 VFEAVE--QASNLTQAPEEEPIPLDQLGGSCRTHYGYSSGASSLHRHWGLLLASLAPLVL 350 360 370 380 390 400 460 470 480 pF1KE1 LPAKWSVSESSPHMAPVLAVVATFPVLILWL CCDS10 CLSLL 410 486 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:04:13 2016 done: Sun Nov 6 14:04:14 2016 Total Scan time: 3.470 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]