FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4528, 577 aa 1>>>pF1KE4528 577 - 577 aa - 577 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.0258+/-0.000936; mu= 20.0156+/- 0.056 mean_var=60.9549+/-12.843, 0's: 0 Z-trim(104.9): 22 B-trim: 0 in 0/47 Lambda= 0.164274 statistics sampled from 8117 (8125) to 8117 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.615), E-opt: 0.2 (0.25), width: 16 Scan time: 3.230 The best scores are: opt bits E(32554) CCDS74436.1 HAS1 gene_id:3036|Hs108|chr19 ( 577) 3932 940.7 0 CCDS12838.1 HAS1 gene_id:3036|Hs108|chr19 ( 578) 3920 937.9 0 CCDS10871.1 HAS3 gene_id:3038|Hs108|chr16 ( 553) 1677 406.3 5.1e-113 CCDS6335.1 HAS2 gene_id:3037|Hs108|chr8 ( 552) 1671 404.9 1.4e-112 CCDS10870.1 HAS3 gene_id:3038|Hs108|chr16 ( 281) 357 93.3 4.4e-19 >>CCDS74436.1 HAS1 gene_id:3036|Hs108|chr19 (577 aa) initn: 3932 init1: 3932 opt: 3932 Z-score: 5030.5 bits: 940.7 E(32554): 0 Smith-Waterman score: 3932; 99.7% identity (99.8% similar) in 577 aa overlap (1-577:1-577) 10 20 30 40 50 60 pF1KE4 MRQDAPKPTPAARRCSGLARRVLTIAFALLILGLMTWAYAAGVPLASDRYGLLAFGLYGA :::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 MRQDAPKPTPAACRCSGLARRVLTIAFALLILGLMTWAYAAGVPLASDRYGLLAFGLYGA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 FLSAHLVAQSLFAYLEHRRVAAAARGPLDAATARSVALTISAYQEDPAYLRQCLASARAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 FLSAHLVAQSLFAYLEHRRVAAAARGPLDAATARSVALTISAYQEDPAYLRQCLASARAL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 LYPRARLRVLMVVDGNRAEDLYMVDMFREVFADEDPATYVWDGNYHQPWEPAAAGAVGAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 LYPRARLRVLMVVDGNRAEDLYMVDMFREVFADEDPATYVWDGNYHQPWEPAAAGAVGAG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 AYREVEAEDPGRLAVEALVRTRRCVCVAQRWGGKREVMYTAFKALGDSMDYVQVCDSDTR ::::::::::::::::::::::::::::::::::::::::::::::::.::::::::::: CCDS74 AYREVEAEDPGRLAVEALVRTRRCVCVAQRWGGKREVMYTAFKALGDSVDYVQVCDSDTR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 LDPMALLELVRVLDEDPRVGAVGGDVRILNPLDSWVSFLSSLRYWVAFNVERACQSYFHC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 LDPMALLELVRVLDEDPRVGAVGGDVRILNPLDSWVSFLSSLRYWVAFNVERACQSYFHC 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 VSCISGPLGLYRNNLLQQFLEAWYNQKFLGTHCTFGDDRHLTNRMLSMGYATKYTSRSRC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 VSCISGPLGLYRNNLLQQFLEAWYNQKFLGTHCTFGDDRHLTNRMLSMGYATKYTSRSRC 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 YSETPSSFLRWLSQQTRWSKSYFREWLYNALWWHRHHAWMTYEAVVSGLFPFFVAATVLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 YSETPSSFLRWLSQQTRWSKSYFREWLYNALWWHRHHAWMTYEAVVSGLFPFFVAATVLR 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 LFYAGRPWALLWVLLCVQGVALAKAAFAAWLRGCLRMVLLSLYAPLYMCGLLPAKFLALV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 LFYAGRPWALLWVLLCVQGVALAKAAFAAWLRGCLRMVLLSLYAPLYMCGLLPAKFLALV 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE4 TMNQSGWGTSGRRKLAANYVPLLPLALWALLLLGGLVRSVAHEARADWSGPSRAAEAYHL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 TMNQSGWGTSGRRKLAANYVPLLPLALWALLLLGGLVRSVAHEARADWSGPSRAAEAYHL 490 500 510 520 530 540 550 560 570 pF1KE4 AAGAGAYVGYWVAMLTLYWVGVRRLCRRRTGGYRVQV ::::::::::::::::::::::::::::::::::::: CCDS74 AAGAGAYVGYWVAMLTLYWVGVRRLCRRRTGGYRVQV 550 560 570 >>CCDS12838.1 HAS1 gene_id:3036|Hs108|chr19 (578 aa) initn: 3919 init1: 3919 opt: 3920 Z-score: 5015.1 bits: 937.9 E(32554): 0 Smith-Waterman score: 3920; 99.5% identity (99.7% similar) in 578 aa overlap (1-577:1-578) 10 20 30 40 50 pF1KE4 MRQ-DAPKPTPAARRCSGLARRVLTIAFALLILGLMTWAYAAGVPLASDRYGLLAFGLYG ::: ::::::::: :::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MRQQDAPKPTPAACRCSGLARRVLTIAFALLILGLMTWAYAAGVPLASDRYGLLAFGLYG 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 AFLSAHLVAQSLFAYLEHRRVAAAARGPLDAATARSVALTISAYQEDPAYLRQCLASARA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 AFLSAHLVAQSLFAYLEHRRVAAAARGPLDAATARSVALTISAYQEDPAYLRQCLASARA 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 LLYPRARLRVLMVVDGNRAEDLYMVDMFREVFADEDPATYVWDGNYHQPWEPAAAGAVGA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LLYPRARLRVLMVVDGNRAEDLYMVDMFREVFADEDPATYVWDGNYHQPWEPAAAGAVGA 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 GAYREVEAEDPGRLAVEALVRTRRCVCVAQRWGGKREVMYTAFKALGDSMDYVQVCDSDT :::::::::::::::::::::::::::::::::::::::::::::::::.:::::::::: CCDS12 GAYREVEAEDPGRLAVEALVRTRRCVCVAQRWGGKREVMYTAFKALGDSVDYVQVCDSDT 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE4 RLDPMALLELVRVLDEDPRVGAVGGDVRILNPLDSWVSFLSSLRYWVAFNVERACQSYFH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 RLDPMALLELVRVLDEDPRVGAVGGDVRILNPLDSWVSFLSSLRYWVAFNVERACQSYFH 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE4 CVSCISGPLGLYRNNLLQQFLEAWYNQKFLGTHCTFGDDRHLTNRMLSMGYATKYTSRSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 CVSCISGPLGLYRNNLLQQFLEAWYNQKFLGTHCTFGDDRHLTNRMLSMGYATKYTSRSR 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE4 CYSETPSSFLRWLSQQTRWSKSYFREWLYNALWWHRHHAWMTYEAVVSGLFPFFVAATVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 CYSETPSSFLRWLSQQTRWSKSYFREWLYNALWWHRHHAWMTYEAVVSGLFPFFVAATVL 370 380 390 400 410 420 420 430 440 450 460 470 pF1KE4 RLFYAGRPWALLWVLLCVQGVALAKAAFAAWLRGCLRMVLLSLYAPLYMCGLLPAKFLAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 RLFYAGRPWALLWVLLCVQGVALAKAAFAAWLRGCLRMVLLSLYAPLYMCGLLPAKFLAL 430 440 450 460 470 480 480 490 500 510 520 530 pF1KE4 VTMNQSGWGTSGRRKLAANYVPLLPLALWALLLLGGLVRSVAHEARADWSGPSRAAEAYH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 VTMNQSGWGTSGRRKLAANYVPLLPLALWALLLLGGLVRSVAHEARADWSGPSRAAEAYH 490 500 510 520 530 540 540 550 560 570 pF1KE4 LAAGAGAYVGYWVAMLTLYWVGVRRLCRRRTGGYRVQV :::::::::::::::::::::::::::::::::::::: CCDS12 LAAGAGAYVGYWVAMLTLYWVGVRRLCRRRTGGYRVQV 550 560 570 >>CCDS10871.1 HAS3 gene_id:3038|Hs108|chr16 (553 aa) initn: 2044 init1: 1619 opt: 1677 Z-score: 2142.5 bits: 406.3 E(32554): 5.1e-113 Smith-Waterman score: 2081; 55.6% identity (77.7% similar) in 556 aa overlap (20-573:10-546) 10 20 30 40 50 60 pF1KE4 MRQDAPKPTPAARRCSGLARRVLTIAFALLILGLMTWAYAAGVPLASDRYGLLAFGLYGA : : : ::: .:: . ::..: . . :.:::::: CCDS10 MPVQLTTALRVVGTSLFALAVLGGILAAYVTGYQFIHTEKHYLSFGLYGA 10 20 30 40 50 70 80 90 100 110 pF1KE4 FLSAHLVAQSLFAYLEHRRVAAAARG-PLDAATARSVALTISAYQEDPAYLRQCLASARA .:. ::. :::::.:::::. :... : . :::: :.:::::: :::.:: ::. CCDS10 ILGLHLLIQSLFAFLEHRRMRRAGQALKLPSPRRGSVALCIAAYQEDPDYLRKCLRSAQR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 LLYPRARLRVLMVVDGNRAEDLYMVDMFREVFADEDPATY-VWDGNYHQPWEPAAAGAVG . .: :.:.::::::: :: ::.:.:.::.. . : . :: .:.:. : . ... CCDS10 ISFPD--LKVVMVVDGNRQEDAYMLDIFHEVLGGTEQAGFFVWRSNFHEAGEGETEASLQ 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 AGAYREVEAEDPGRLAVEALVRTRRCVCVAQRWGGKREVMYTAFKALGDSMDYVQVCDSD : : :. .::. :. :.::::::::::::::::::.::.:::::: CCDS10 EGMDR-----------VRDVVRASTFSCIMQKWGGKREVMYTAFKALGDSVDYIQVCDSD 170 180 190 200 210 240 250 260 270 280 290 pF1KE4 TRLDPMALLELVRVLDEDPRVGAVGGDVRILNPLDSWVSFLSSLRYWVAFNVERACQSYF : ::: .:..:::.:::.::.:::::.::: :::.:::::.:::.:::::::::::: CCDS10 TVLDPACTIEMLRVLEEDPQVGGVGGDVQILNKYDSWISFLSSVRYWMAFNVERACQSYF 220 230 240 250 260 270 300 310 320 330 340 350 pF1KE4 HCVSCISGPLGLYRNNLLQQFLEAWYNQKFLGTHCTFGDDRHLTNRMLSMGYATKYTSRS ::.:::::::.:::.::::::: ::.:::::..:.::::::::::.::.:: ::::.:: CCDS10 GCVQCISGPLGMYRNSLLQQFLEDWYHQKFLGSKCSFGDDRHLTNRVLSLGYRTKYTARS 280 290 300 310 320 330 360 370 380 390 400 410 pF1KE4 RCYSETPSSFLRWLSQQTRWSKSYFREWLYNALWWHRHHAWMTYEAVVSGLFPFFVAATV .: .:::...::::.::::::::::::::::.::.:.:: :::::.::.:.::::. ::: CCDS10 KCLTETPTKYLRWLNQQTRWSKSYFREWLYNSLWFHKHHLWMTYESVVTGFFPFFLIATV 340 350 360 370 380 390 420 430 440 450 460 470 pF1KE4 LRLFYAGRPWALLWVLLCVQGVALAKAAFAAWLRGCLRMVLLSLYAPLYMCGLLPAKFLA ..::: :: : .: :: :: :.. ::..: .::: .:...:::. ::: .:::::..: CCDS10 IQLFYRGRIWNILLFLLTVQLVGIIKATYACFLRGNAEMIFMSLYSLLYMSSLLPAKIFA 400 410 420 430 440 450 480 490 500 510 520 530 pF1KE4 LVTMNQSGWGTSGRRKLAANYVPLLPLALWALLLLGGLVRSVAHEARADWSGPSRAAEAY ..:.:.::::::::. ...:.. :.:...:. .:::::. .. . : . .. : CCDS10 IATINKSGWGTSGRKTIVVNFIGLIPVSIWVAVLLGGLAYTAYCQ---DLFSETELA--- 460 470 480 490 500 510 540 550 560 570 pF1KE4 HLAAGAGAYVGYWVAMLTLYWVGVRRLCRRRTGGYRVQV :..:: : ::::.: :: . . : : .. : CCDS10 FLVSGAILYGCYWVALLMLYLAIIARRCGKKPEQYSLAFAEV 520 530 540 550 >>CCDS6335.1 HAS2 gene_id:3037|Hs108|chr8 (552 aa) initn: 2058 init1: 1603 opt: 1671 Z-score: 2134.8 bits: 404.9 E(32554): 1.4e-112 Smith-Waterman score: 2081; 53.9% identity (78.5% similar) in 553 aa overlap (20-571:11-543) 10 20 30 40 50 pF1KE4 MRQDAPKPTPAARRCSGLARRVLTIAFALLILGLMTWAYAAGVP-LASDRYGLLAFGLYG : . : :.. .: .: :: .: . .: : ..::::: CCDS63 MHCERFLCILRIIGTTLFGVSLLLGITAAYIVGYQFIQTDNY-YFSFGLYG 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 AFLSAHLVAQSLFAYLEHRRVAAAARGPLDAATARSVALTISAYQEDPAYLRQCLASARA :::..::. :::::.::::.. . . :. ..::: :.:::::: :::.:: :.. CCDS63 AFLASHLIIQSLFAFLEHRKMKKSLETPI--KLNKTVALCIAAYQEDPDYLRKCLQSVKR 60 70 80 90 100 120 130 140 150 160 170 pF1KE4 LLYPRARLRVLMVVDGNRAEDLYMVDMFREVFADEDPATYVWDGNYHQPWEPAAAGAVGA : :: ..:.::.::: .::::.:.: ::.. . :::.: .:.:. : CCDS63 LTYPG--IKVVMVIDGNSEDDLYMMDIFSEVMGRDKSATYIWKNNFHEK---------GP 110 120 130 140 150 180 190 200 210 220 230 pF1KE4 GAYREVEAEDPGRLAVEALVRTRRCVCVAQRWGGKREVMYTAFKALGDSMDYVQVCDSDT : : . :. . : :: . . .:. :.::::::::::::.::: :.:::::::::: CCDS63 GETDESHKESSQH--VTQLVLSNKSICIMQKWGGKREVMYTAFRALGRSVDYVQVCDSDT 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE4 RLDPMALLELVRVLDEDPRVGAVGGDVRILNPLDSWVSFLSSLRYWVAFNVERACQSYFH ::: . .:.:.::.::: ::.:::::.::: :::.:::::.:::.:::.:::::::: CCDS63 MLDPASSVEMVKVLEEDPMVGGVGGDVQILNKYDSWISFLSSVRYWMAFNIERACQSYFG 220 230 240 250 260 270 300 310 320 330 340 350 pF1KE4 CVSCISGPLGLYRNNLLQQFLEAWYNQKFLGTHCTFGDDRHLTNRMLSMGYATKYTSRSR ::.:::::::.:::.::..:.: ::::.:.:..:.::::::::::.::.:::::::.::. CCDS63 CVQCISGPLGMYRNSLLHEFVEDWYNQEFMGNQCSFGDDRHLTNRVLSLGYATKYTARSK 280 290 300 310 320 330 360 370 380 390 400 410 pF1KE4 CYSETPSSFLRWLSQQTRWSKSYFREWLYNALWWHRHHAWMTYEAVVSGLFPFFVAATVL : .::: .::::.:::::::::::::::::.:.:.:: ::::::...:.::::. :::. CCDS63 CLTETPIEYLRWLNQQTRWSKSYFREWLYNAMWFHKHHLWMTYEAIITGFFPFFLIATVI 340 350 360 370 380 390 420 430 440 450 460 470 pF1KE4 RLFYAGRPWALLWVLLCVQGVALAKAAFAAWLRGCLRMVLLSLYAPLYMCGLLPAKFLAL .::: :. : .: :: :: :.: :..::. ::: . ::..:::. ::: .:::::..:. CCDS63 QLFYRGKIWNILLFLLTVQLVGLIKSSFASCLRGNIVMVFMSLYSVLYMSSLLPAKMFAI 400 410 420 430 440 450 480 490 500 510 520 530 pF1KE4 VTMNQSGWGTSGRRKLAANYVPLLPLALWALLLLGGLVRSVAHEARADWSGPSRAAEAYH .:.:..:::::::. ...:.. :.:...: .::::.. .. .:.. .: .... CCDS63 ATINKAGWGTSGRKTIVVNFIGLIPVSVWFTILLGGVIFTIYKESKRPFSESKQTV---- 460 470 480 490 500 510 540 550 560 570 pF1KE4 LAAGAGAYVGYWVAMLTLYWVGVRRLCRRRTGGYRVQV : .:. :. ::: .:::: : . . ::. : CCDS63 LIVGTLLYACYWVMLLTLYVVLINKCGRRKKGQQYDMVLDV 520 530 540 550 >>CCDS10870.1 HAS3 gene_id:3038|Hs108|chr16 (281 aa) initn: 727 init1: 355 opt: 357 Z-score: 456.2 bits: 93.3 E(32554): 4.4e-19 Smith-Waterman score: 761; 50.4% identity (70.8% similar) in 250 aa overlap (20-267:10-246) 10 20 30 40 50 60 pF1KE4 MRQDAPKPTPAARRCSGLARRVLTIAFALLILGLMTWAYAAGVPLASDRYGLLAFGLYGA : : : ::: .:: . ::..: . . :.:::::: CCDS10 MPVQLTTALRVVGTSLFALAVLGGILAAYVTGYQFIHTEKHYLSFGLYGA 10 20 30 40 50 70 80 90 100 110 pF1KE4 FLSAHLVAQSLFAYLEHRRVAAAARG-PLDAATARSVALTISAYQEDPAYLRQCLASARA .:. ::. :::::.:::::. :... : . :::: :.:::::: :::.:: ::. CCDS10 ILGLHLLIQSLFAFLEHRRMRRAGQALKLPSPRRGSVALCIAAYQEDPDYLRKCLRSAQR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 LLYPRARLRVLMVVDGNRAEDLYMVDMFREVFADEDPATY-VWDGNYHQPWEPAAAGAVG . .: :.:.::::::: :: ::.:.:.::.. . : . :: .:.:. : . ... CCDS10 ISFP--DLKVVMVVDGNRQEDAYMLDIFHEVLGGTEQAGFFVWRSNFHEAGEGETEASLQ 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 AGAYREVEAEDPGRLAVEALVRTRRCVCVAQRWGGKREVMYTAFKALGDSMDYVQVCDSD : : :. .::. :. :.::::::::::::::::::.::.:::::: CCDS10 EGMDR-----------VRDVVRASTFSCIMQKWGGKREVMYTAFKALGDSVDYIQVCDSD 170 180 190 200 210 240 250 260 270 280 290 pF1KE4 TRLDPMALLELVRVLDEDPRVGAVGGDVRILNPLDSWVSFLSSLRYWVAFNVERACQSYF : ::: .:..:::.:::.::.:::::. CCDS10 TVLDPACTIEMLRVLEEDPQVGGVGGDVQPPGKGMAVEDDQVQAAQVRATEAWSVHQRHV 220 230 240 250 260 270 577 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:14:17 2016 done: Sun Nov 6 00:14:18 2016 Total Scan time: 3.230 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]