FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1957, 512 aa 1>>>pF1KE1957 512 - 512 aa - 512 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.3446+/-0.000855; mu= 14.0625+/- 0.052 mean_var=94.2312+/-18.666, 0's: 0 Z-trim(109.2): 10 B-trim: 0 in 0/51 Lambda= 0.132123 statistics sampled from 10748 (10755) to 10748 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.683), E-opt: 0.2 (0.33), width: 16 Scan time: 3.210 The best scores are: opt bits E(32554) CCDS10856.1 DPEP3 gene_id:64180|Hs108|chr16 ( 513) 3339 646.7 1.9e-185 CCDS10857.1 DPEP2 gene_id:64174|Hs108|chr16 ( 486) 2161 422.1 6.9e-118 CCDS10982.1 DPEP1 gene_id:1800|Hs108|chr16 ( 411) 1147 228.8 9.2e-60 >>CCDS10856.1 DPEP3 gene_id:64180|Hs108|chr16 (513 aa) initn: 2190 init1: 2190 opt: 3339 Z-score: 3443.1 bits: 646.7 E(32554): 1.9e-185 Smith-Waterman score: 3339; 99.8% identity (99.8% similar) in 513 aa overlap (1-512:1-513) 10 20 30 40 50 60 pF1KE1 MIRTPLSASAHRLLLPGSRGRPPRNMQPTGREGSRALSRRYLRRLLLLLLLLLLRQPVTR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MIRTPLSASAHRLLLPGSRGRPPRNMQPTGREGSRALSRRYLRRLLLLLLLLLLRQPVTR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 AETTPGAPRALSTLGSPSLFTTPGVPSALTTPGLTTPGTPKTLDLRGRAQALMRSFPLVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AETTPGAPRALSTLGSPSLFTTPGVPSALTTPGLTTPGTPKTLDLRGRAQALMRSFPLVD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GHNDLPQVLRQRYKNVLQDVNLRNFSHGQTSLDRLRDGLVGAQFWSASVSCQSQDQTAVR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 GHNDLPQVLRQRYKNVLQDVNLRNFSHGQTSLDRLRDGLVGAQFWSASVSCQSQDQTAVR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 LALEQIDLIHRMCASYSELELVTSAEGLNSSQKLACLIGVEGGHSLDSSLSVLRSFYVLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LALEQIDLIHRMCASYSELELVTSAEGLNSSQKLACLIGVEGGHSLDSSLSVLRSFYVLG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 VRYLTLTFTCSTPWAESSTKFRHHMYTNVSGLTSFGEKVVEELNRLGMMIDLSYASDTLI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VRYLTLTFTCSTPWAESSTKFRHHMYTNVSGLTSFGEKVVEELNRLGMMIDLSYASDTLI 250 260 270 280 290 300 310 320 330 340 350 pF1KE1 RRVLEVSQAPVIFSHSAARAVCDNLLNVPDDILQLLK-NGGIVMVTLSMGVLQCNLLANV ::::::::::::::::::::::::::::::::::::: :::::::::::::::::::::: CCDS10 RRVLEVSQAPVIFSHSAARAVCDNLLNVPDDILQLLKKNGGIVMVTLSMGVLQCNLLANV 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE1 STVADHFDHIRAVIGSEFIGIGGNYDGTGRFPQGLEDVSTYPVLIEELLSRSWSEEELQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 STVADHFDHIRAVIGSEFIGIGGNYDGTGRFPQGLEDVSTYPVLIEELLSRSWSEEELQG 370 380 390 400 410 420 420 430 440 450 460 470 pF1KE1 VLRGNLLRVFRQVEKVREESRAQSPVEAEFPYGQLSTSCHSHLVPQNGHQATHLEVTKQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VLRGNLLRVFRQVEKVREESRAQSPVEAEFPYGQLSTSCHSHLVPQNGHQATHLEVTKQP 430 440 450 460 470 480 480 490 500 510 pF1KE1 TNRVPWRSSNASPYLVPGLVAAATIPTFTQWLC ::::::::::::::::::::::::::::::::: CCDS10 TNRVPWRSSNASPYLVPGLVAAATIPTFTQWLC 490 500 510 >>CCDS10857.1 DPEP2 gene_id:64174|Hs108|chr16 (486 aa) initn: 2068 init1: 1236 opt: 2161 Z-score: 2229.9 bits: 422.1 E(32554): 6.9e-118 Smith-Waterman score: 2161; 69.4% identity (85.8% similar) in 494 aa overlap (26-511:1-486) 10 20 30 40 50 60 pF1KE1 MIRTPLSASAHRLLLPGSRGRPPRNMQPTGREGSRALSRRYLRRLLLLLLLLLLRQPVTR :::.: :: ...: : :::::::: :::: CCDS10 MQPSGLEGPGTFGRWPLLSLLLLLLLL---QPVTC 10 20 30 70 80 90 100 110 120 pF1KE1 AETTPGAPRALSTLGSPSLFTTPGVPSALTTPGLTTPGTPKTLDLRGRAQALMRSFPLVD : :::: ::::.:::.: : ::. . :. :: ..:.: :. .:.::::.::::: CCDS10 AYTTPGPPRALTTLGAPRAHTMPGTYA----PS-TTLSSPSTQGLQEQARALMRDFPLVD 40 50 60 70 80 130 140 150 160 170 180 pF1KE1 GHNDLPQVLRQRYKNVLQDVNLRNFSHGQTSLDRLRDGLVGAQFWSASVSCQSQDQTAVR :::::: :::: :.. ::::::::::.:::::::::::::::::::: : ::.::. :.: CCDS10 GHNDLPLVLRQVYQKGLQDVNLRNFSYGQTSLDRLRDGLVGAQFWSAYVPCQTQDRDALR 90 100 110 120 130 140 190 200 210 220 230 240 pF1KE1 LALEQIDLIHRMCASYSELELVTSAEGLNSSQKLACLIGVEGGHSLDSSLSVLRSFYVLG :.:::::::.:::::::::::::::..::..::::::::::::::::.:::.::.::.:: CCDS10 LTLEQIDLIRRMCASYSELELVTSAKALNDTQKLACLIGVEGGHSLDNSLSILRTFYMLG 150 160 170 180 190 200 250 260 270 280 290 300 pF1KE1 VRYLTLTFTCSTPWAESSTKFRHHMYTNVSGLTSFGEKVVEELNRLGMMIDLSYASDTLI ::::::: ::.:::::::.: : .:.:.::::.:::::: :.::::::.:::..::.. CCDS10 VRYLTLTHTCNTPWAESSAKGVHSFYNNISGLTDFGEKVVAEMNRLGMMVDLSHVSDAVA 210 220 230 240 250 260 310 320 330 340 350 pF1KE1 RRVLEVSQAPVIFSHSAARAVCDNLLNVPDDILQLLK-NGGIVMVTLSMGVLQCNLLANV ::.::::::::::::::::.::.. ::::::::::: :::.:::.:::::.::: ::: CCDS10 RRALEVSQAPVIFSHSAARGVCNSARNVPDDILQLLKKNGGVVMVSLSMGVIQCNPSANV 270 280 290 300 310 320 360 370 380 390 400 410 pF1KE1 STVADHFDHIRAVIGSEFIGIGGNYDGTGRFPQGLEDVSTYPVLIEELLSRSWSEEELQG ::::::::::.:::::.::::::.:::.:.:::::::::::::::::::::.:::::::: CCDS10 STVADHFDHIKAVIGSKFIGIGGDYDGAGKFPQGLEDVSTYPVLIEELLSRGWSEEELQG 330 340 350 360 370 380 420 430 440 450 460 470 pF1KE1 VLRGNLLRVFRQVEKVREESRAQSPVEAEFPYGQLSTSCHSHL--VPQNGHQATHLEVTK ::::::::::::::::.::.. :::.: .:: :::.:::: : . : .. :.:. CCDS10 VLRGNLLRVFRQVEKVQEENKWQSPLEDKFPDEQLSSSCHSDLSRLRQRQSLTSGQELTE 390 400 410 420 430 440 480 490 500 510 pF1KE1 QP---TNRVP--WRSSNASPYLVPGLVAAATIPTFTQWLC : : ..: : :..::...: :...::.:.. :: CCDS10 IPIHWTAKLPAKWSVSESSPHMAPVLAVVATFPVLILWL 450 460 470 480 >>CCDS10982.1 DPEP1 gene_id:1800|Hs108|chr16 (411 aa) initn: 1096 init1: 443 opt: 1147 Z-score: 1186.4 bits: 228.8 E(32554): 9.2e-60 Smith-Waterman score: 1147; 47.7% identity (76.3% similar) in 375 aa overlap (105-472:19-391) 80 90 100 110 120 130 pF1KE1 GSPSLFTTPGVPSALTTPGLTTPGTPKTLDLRGRAQALMRSFPLVDGHNDLPQVLRQRYK .: .:. .::. :..::::::: : . .. CCDS10 MWSGWWLWPLVAVCTADFFRDEAERIMRDSPVIDGHNDLPWQLLDMFN 10 20 30 40 140 150 160 170 180 190 pF1KE1 NVLQD--VNLRNFSHGQTSLDRLRDGLVGAQFWSASVSCQSQDQTAVRLALEQIDLIHRM : ::: .:: ... .:.. .:: :.::.::::. . :..:.. ::: .:::.:..::: CCDS10 NRLQDERANLTTLAGTHTNIPKLRAGFVGGQFWSVYTPCDTQNKDAVRRTLEQMDVVHRM 50 60 70 80 90 100 200 210 220 230 240 pF1KE1 CASYSELEL-VTSAEGLNSS---QKLACLIGVEGGHSLDSSLSVLRSFYVLGVRYLTLTF : : : : :::. :. .. :.: :::::::::.::::.:::..: ::.:::::: CCDS10 CRMYPETFLYVTSSAGIRQAFREGKVASLIGVEGGHSIDSSLGVLRALYQLGMRYLTLTH 110 120 130 140 150 160 250 260 270 280 290 300 pF1KE1 TCSTPWAESSTKFRHHMYTNVSGLTSFGEKVVEELNRLGMMIDLSYASDTLIRRVLEVSQ .:.::::.. . .::. ::..::.::::::..:::...: . .. .:..:. CCDS10 SCNTPWADNWLVDTGDSEPQSQGLSPFGQRVVKELNRLGVLIDLAHVSVATMKATLQLSR 170 180 190 200 210 220 310 320 330 340 350 360 pF1KE1 APVIFSHSAARAVCDNLLNVPDDILQLLKN-GGIVMVTLSMGVLQCNLLANVSTVADHFD ::::::::.: .:: . :::::.:.:.:. ..:::.. . ..:. ::.: ::::.: CCDS10 APVIFSHSSAYSVCASRRNVPDDVLRLVKQTDSLVMVNFYNNYISCTNKANLSQVADHLD 230 240 250 260 270 280 370 380 390 400 410 420 pF1KE1 HIRAVIGSEFIGIGGNYDGTGRFPQGLEDVSTYPVLIEELLSRSWSEEELQGVLRGNLLR ::. : :.. .:.::..::. : :.:::::: :: :: ::: :.:.: :..:.: :::: CCDS10 HIKEVAGARAVGFGGDFDGVPRVPEGLEDVSKYPDLIAELLRRNWTEAEVKGALADNLLR 290 300 310 320 330 340 430 440 450 460 470 480 pF1KE1 VFRQVEKVREESRAQSPVEAEFPYGQLSTSCHSHLVPQNGHQATHLEVTKQPTNRVPWRS ::. ::.. . . :.: : .: ::. ::..: ..: .. : CCDS10 VFEAVEQASNLT--QAPEEEPIPLDQLGGSCRTHYGYSSGASSLHRHWGLLLASLAPLVL 350 360 370 380 390 400 490 500 510 pF1KE1 SNASPYLVPGLVAAATIPTFTQWLC CCDS10 CLSLL 410 512 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 21:50:43 2016 done: Sun Nov 6 21:50:43 2016 Total Scan time: 3.210 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]