FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6341, 424 aa
  1>>>pF1KE6341 424 - 424 aa - 424 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.0626+/-0.000351; mu= 10.4234+/- 0.022
 mean_var=134.6001+/-27.654, 0's: 0 Z-trim(118.4): 37 B-trim: 636 in 2/52
 Lambda= 0.110548
 statistics sampled from 31242 (31279) to 31242 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.717), E-opt: 0.2 (0.367), width: 16
 Scan time: 7.760

The best scores are:                                       opt  bits E(85289)
XP_016882632 (OMIM: 610190,616265) PREDICTED: carb ( 424) 2898 473.5 4.6e-133
NP_071912 (OMIM: 610190,616265) carbohydrate sulfo ( 424) 2898 473.5 4.6e-133
XP_011525526 (OMIM: 610190,616265) PREDICTED: carb ( 424) 2898 473.5 4.6e-133
NP_001121368 (OMIM: 610190,616265) carbohydrate su ( 424) 2898 473.5 4.6e-133
XP_011525524 (OMIM: 610190,616265) PREDICTED: carb ( 424) 2898 473.5 4.6e-133
NP_001121367 (OMIM: 610190,616265) carbohydrate su ( 424) 2898 473.5 4.6e-133
XP_005258419 (OMIM: 610191) PREDICTED: carbohydrat ( 358) 1304 219.2 1.4e-56
XP_016881523 (OMIM: 610191) PREDICTED: carbohydrat ( 386) 1304 219.2 1.4e-56
XP_016881522 (OMIM: 610191) PREDICTED: carbohydrat ( 443) 1304 219.2 1.6e-56
NP_113610 (OMIM: 610191) carbohydrate sulfotransfe ( 443) 1304 219.2 1.6e-56
NP_001167453 (OMIM: 610128) carbohydrate sulfotran ( 347)  737 128.7 2.2e-29
NP_060883 (OMIM: 610128) carbohydrate sulfotransfe ( 352)  737 128.7 2.2e-29
XP_016874858 (OMIM: 610128) PREDICTED: carbohydrat ( 375)  737 128.8 2.3e-29
NP_690849 (OMIM: 610124) carbohydrate sulfotransfe ( 341)  627 111.2 4.2e-24
NP_569735 (OMIM: 601776,608429) carbohydrate sulfo ( 376)  571 102.3 2.2e-21
XP_011510513 (OMIM: 606376) PREDICTED: carbohydrat ( 356)  562 100.8 5.7e-21
XP_011510512 (OMIM: 606376) PREDICTED: carbohydrat ( 356)  562 100.8 5.7e-21
XP_016860870 (OMIM: 606376) PREDICTED: carbohydrat ( 356)  562 100.8 5.7e-21
XP_011510514 (OMIM: 606376) PREDICTED: carbohydrat ( 356)  562 100.8 5.7e-21
XP_011510510 (OMIM: 606376) PREDICTED: carbohydrat ( 356)  562 100.8 5.7e-21
NP_004845 (OMIM: 606376) carbohydrate sulfotransfe ( 356)  562 100.8 5.7e-21
XP_016860869 (OMIM: 606376) PREDICTED: carbohydrat ( 356)  562 100.8 5.7e-21
XP_016860871 (OMIM: 606376) PREDICTED: carbohydrat ( 356)  562 100.8 5.7e-21
XP_016860872 (OMIM: 606376) PREDICTED: carbohydrat ( 356)  562 100.8 5.7e-21
XP_011510509 (OMIM: 606376) PREDICTED: carbohydrat ( 356)  562 100.8 5.7e-21
XP_011513745 (OMIM: 610129) PREDICTED: carbohydrat ( 414)  476  87.2 8.6e-17
XP_011513746 (OMIM: 610129) PREDICTED: carbohydrat ( 414)  476  87.2 8.6e-17
NP_001230723 (OMIM: 610129) carbohydrate sulfotran ( 414)  476  87.2 8.6e-17
NP_061111 (OMIM: 610129) carbohydrate sulfotransfe ( 414)  476  87.2 8.6e-17
NP_001230724 (OMIM: 610129) carbohydrate sulfotran ( 414)  476  87.2 8.6e-17

>>XP_016882632 (OMIM: 610190,616265) PREDICTED: carbohyd (424 aa)
initn: 2898 init1: 2898 opt: 2898 Z-score: 2509.9 bits: 473.5 E(85289): 4.6e-133
Smith-Waterman score: 2898; 100.0% identity (100.0% similar) in 424 aa overlap (1-424:1-424)

               10        20        30        40        50        60
pF1KE6 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE6 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE6 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF
              370       380       390       400       410       420

pF1KE6 ADLY
       ::::
XP_016 ADLY

>>NP_071912 (OMIM: 610190,616265) carbohydrate sulfotran (424 aa)
initn: 2898 init1: 2898 opt: 2898 Z-score: 2509.9 bits: 473.5 E(85289): 4.6e-133
Smith-Waterman score: 2898; 100.0% identity (100.0% similar) in 424 aa overlap (1-424:1-424)

               10        20        30        40        50        60
pF1KE6 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE6 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE6 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF
              370       380       390       400       410       420

pF1KE6 ADLY
       ::::
NP_071 ADLY

>>XP_011525526 (OMIM: 610190,616265) PREDICTED: carbohyd (424 aa)
initn: 2898 init1: 2898 opt: 2898 Z-score: 2509.9 bits: 473.5 E(85289): 4.6e-133
Smith-Waterman score: 2898; 100.0% identity (100.0% similar) in 424 aa overlap (1-424:1-424)

               10        20        30        40        50        60
pF1KE6 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE6 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE6 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF
              370       380       390       400       410       420

pF1KE6 ADLY
       ::::
XP_011 ADLY

>>NP_001121368 (OMIM: 610190,616265) carbohydrate sulfot (424 aa)
initn: 2898 init1: 2898 opt: 2898 Z-score: 2509.9 bits: 473.5 E(85289): 4.6e-133
Smith-Waterman score: 2898; 100.0% identity (100.0% similar) in 424 aa overlap (1-424:1-424)

               10        20        30        40        50        60
pF1KE6 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE6 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE6 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF
              370       380       390       400       410       420

pF1KE6 ADLY
       ::::
NP_001 ADLY

>>XP_011525524 (OMIM: 610190,616265) PREDICTED: carbohyd (424 aa)
initn: 2898 init1: 2898 opt: 2898 Z-score: 2509.9 bits: 473.5 E(85289): 4.6e-133
Smith-Waterman score: 2898; 100.0% identity (100.0% similar) in 424 aa overlap (1-424:1-424)

               10        20        30        40        50        60
pF1KE6 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE6 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE6 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF
              370       380       390       400       410       420

pF1KE6 ADLY
       ::::
XP_011 ADLY

>>NP_001121367 (OMIM: 610190,616265) carbohydrate sulfot (424 aa)
initn: 2898 init1: 2898 opt: 2898 Z-score: 2509.9 bits: 473.5 E(85289): 4.6e-133
Smith-Waterman score: 2898; 100.0% identity (100.0% similar) in 424 aa overlap (1-424:1-424)

               10        20        30        40        50        60
pF1KE6 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDLPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRH
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6
VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREAL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPF 370 380 390 400 410 420 pF1KE6 ADLY :::: NP_001 ADLY >>XP_005258419 (OMIM: 610191) PREDICTED: carbohydrate su (358 aa) initn: 1296 init1: 1296 opt: 1304 Z-score: 1137.0 bits: 219.2 E(85289): 1.4e-56 Smith-Waterman score: 1304; 59.8% identity (83.0% similar) in 311 aa overlap (115-420:47-357) 90 100 110 120 130 140 pF1KE6 RGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMPAAATIPANSSDAPF---IRPGPGT ::.: .: :. . :. . :.: . XP_005 ERSTRLLTKTSHSQGGDQALSKSTGSPTEKLIEKRQGAKTVFNKFSNMNWPVDIHPLNKS 20 30 40 50 60 70 150 160 170 180 190 pF1KE6 L--DGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRHVSRIFVEDRHRVLYCEVPK : :..: . ...:..:. .:: : :: . : ... . ::::.:::.:..::::::: XP_005 LVKDNKWKKTEETQEKRRSFLQEFCKKYGGVSHHQSHLFHTVSRIYVEDKHKILYCEVPK 80 90 100 110 120 130 200 210 220 230 240 250 pF1KE6 AGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFDRQGILHRLSTYTKMLFVRE ::::::::.:::: :::::. .:.::.::::. ::.::.:: .:: ::.:::: .:::. 
XP_005 AGCSNWKRILMVLNGLASSAYNISHNAVHYGKHLKKLDSFDLKGIYTRLNTYTKAVFVRD 140 150 160 170 180 190 260 270 280 290 300 310 pF1KE6 PFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREALRTGSGVRFPEFVQYLLDVH :.::::::::::::::::::::::::::. .:: :: .::: .::::.: ::..:::: : XP_005 PMERLVSAFRDKFEHPNSYYHPVFGKAIIKKYRPNACEEALINGSGVKFKEFIHYLLDSH 200 210 220 230 240 250 320 330 340 350 360 370 pF1KE6 RPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLSLIRAPRNLTFPRFKDRHSQ :::::::::..::.:: ::::.::::::::..:.:::.::..: ::..: :: ::::::. XP_005 RPVGMDIHWEKVSKLCYPCLINYDFVGKFETLEEDANYFLQMIGAPKELKFPNFKDRHSS 260 270 280 290 300 310 380 390 400 410 420 pF1KE6 EARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPFADLY . ::.:....::. .:. .:: :::::.:::::::. :: XP_005 DERTNAQVVRQYLKDLTRTERQLIYDFYYLDYLMFNYTTPFL 320 330 340 350 >>XP_016881523 (OMIM: 610191) PREDICTED: carbohydrate su (386 aa) initn: 1296 init1: 1296 opt: 1304 Z-score: 1136.6 bits: 219.2 E(85289): 1.4e-56 Smith-Waterman score: 1304; 59.8% identity (83.0% similar) in 311 aa overlap (115-420:75-385) 90 100 110 120 130 140 pF1KE6 RGRNLPAPDQPQPPLQRGTRLRLRQRRRRLLIKKMPAAATIPANSSDAPF---IRPGPGT ::.: .: :. . :. . :.: . XP_016 ERSTRLLTKTSHSQGGDQALSKSTGSPTEKLIEKRQGAKTVFNKFSNMNWPVDIHPLNKS 50 60 70 80 90 100 150 160 170 180 190 pF1KE6 L--DGRWVSLHRSQQERKRVMQEACAKYRASSSRRAVTPRHVSRIFVEDRHRVLYCEVPK : :..: . ...:..:. .:: : :: . : ... . ::::.:::.:..::::::: XP_016 LVKDNKWKKTEETQEKRRSFLQEFCKKYGGVSHHQSHLFHTVSRIYVEDKHKILYCEVPK 110 120 130 140 150 160 200 210 220 230 240 250 pF1KE6 AGCSNWKRVLMVLAGLASSTADIQHNTVHYGSALKRLDTFDRQGILHRLSTYTKMLFVRE ::::::::.:::: :::::. .:.::.::::. ::.::.:: .:: ::.:::: .:::. XP_016 AGCSNWKRILMVLNGLASSAYNISHNAVHYGKHLKKLDSFDLKGIYTRLNTYTKAVFVRD 170 180 190 200 210 220 260 270 280 290 300 310 pF1KE6 PFERLVSAFRDKFEHPNSYYHPVFGKAILARYRANASREALRTGSGVRFPEFVQYLLDVH :.::::::::::::::::::::::::::. 
.:: :: .::: .::::.: ::..:::: : XP_016 PMERLVSAFRDKFEHPNSYYHPVFGKAIIKKYRPNACEEALINGSGVKFKEFIHYLLDSH 230 240 250 260 270 280 320 330 340 350 360 370 pF1KE6 RPVGMDIHWDHVSRLCSPCLIDYDFVGKFESMEDDANFFLSLIRAPRNLTFPRFKDRHSQ :::::::::..::.:: ::::.::::::::..:.:::.::..: ::..: :: ::::::. XP_016 RPVGMDIHWEKVSKLCYPCLINYDFVGKFETLEEDANYFLQMIGAPKELKFPNFKDRHSS 290 300 310 320 330 340 380 390 400 410 420 pF1KE6 EARTTARIAHQYFAQLSALQRQRTYDFYYMDYLMFNYSKPFADLY . ::.:....::. .:. .:: :::::.:::::::. :: XP_016 DERTNAQVVRQYLKDLTRTERQLIYDFYYLDYLMFNYTTPFL 350 360 370 380 >>XP_016881522 (OMIM: 610191) PREDICTED: carbohydrate su (443 aa) initn: 1368 init1: 1296 opt: 1304 Z-score: 1135.7 bits: 219.2 E(85289): 1.6e-56 Smith-Waterman score: 1331; 48.1% identity (71.3% similar) in 449 aa overlap (3-420:1-442) 10 20 30 40 50 pF1KE6 MTLRPGTMRLA--CMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDL ..:. : . .: :.:.::.::::::. :: : .: : .. :. .. XP_016 MQPSEMVMNPKQVFLSVLIFGVAGLLLFMYLQVWIE---EQHTG---RVEKRREQKVT 10 20 30 40 50 60 70 80 90 100 pF1KE6 PPGGSQDGDLKEP----TERVTRDLSSGAPRGRNLPAP--DQPQPPL---QRGTRLRLRQ : : ::.. . ... :. ..: .. . : .:.::: . XP_016 SGWGPVKYLRPVPRIMSTEKIQEHITNQNPK-FHMPEDVREKKENLLLNSERSTRLLTKT 60 70 80 90 100 110 110 120 130 140 pF1KE6 RRRR---------------LLIKKMPAAATIPANSSDAPF---IRPGPGTL--DGRWVSL . . ::.: .: :. . :. . :.: .: :..: . XP_016 SHSQGGDQALSKSTGSPTEKLIEKRQGAKTVFNKFSNMNWPVDIHPLNKSLVKDNKWKKT 120 130 140 150 160 170 150 160 170 180 190 200 pF1KE6 HRSQQERKRVMQEACAKYRASSSRRAVTPRHVSRIFVEDRHRVLYCEVPKAGCSNWKRVL ...:..:. .:: : :: . : ... . ::::.:::.:..:::::::::::::::.: XP_016 EETQEKRRSFLQEFCKKYGGVSHHQSHLFHTVSRIYVEDKHKILYCEVPKAGCSNWKRIL 180 190 200 210 220 230 210 220 230 240 250 260 pF1KE6 MVLAGLASSTADIQHNTVHYGSALKRLDTFDRQGILHRLSTYTKMLFVREPFERLVSAFR ::: :::::. .:.::.::::. 
::.::.:: .:: ::.:::: .:::.:.:::::::: XP_016 MVLNGLASSAYNISHNAVHYGKHLKKLDSFDLKGIYTRLNTYTKAVFVRDPMERLVSAFR 240 250 260 270 280 290 270 280 290 300 310 320 pF1KE6 DKFEHPNSYYHPVFGKAILARYRANASREALRTGSGVRFPEFVQYLLDVHRPVGMDIHWD ::::::::::::::::::. .:: :: .::: .::::.: ::..:::: ::::::::::. XP_016 DKFEHPNSYYHPVFGKAIIKKYRPNACEEALINGSGVKFKEFIHYLLDSHRPVGMDIHWE 300 310 320 330 340 350 330 340 350 360 370 380 pF1KE6 HVSRLCSPCLIDYDFVGKFESMEDDANFFLSLIRAPRNLTFPRFKDRHSQEARTTARIAH .::.:: ::::.::::::::..:.:::.::..: ::..: :: ::::::.. ::.:.... XP_016 KVSKLCYPCLINYDFVGKFETLEEDANYFLQMIGAPKELKFPNFKDRHSSDERTNAQVVR 360 370 380 390 400 410 390 400 410 420 pF1KE6 QYFAQLSALQRQRTYDFYYMDYLMFNYSKPFADLY ::. .:. .:: :::::.:::::::. :: XP_016 QYLKDLTRTERQLIYDFYYLDYLMFNYTTPFL 420 430 440 >>NP_113610 (OMIM: 610191) carbohydrate sulfotransferase (443 aa) initn: 1368 init1: 1296 opt: 1304 Z-score: 1135.7 bits: 219.2 E(85289): 1.6e-56 Smith-Waterman score: 1331; 48.1% identity (71.3% similar) in 449 aa overlap (3-420:1-442) 10 20 30 40 50 pF1KE6 MTLRPGTMRLA--CMFSSILLFGAAGLLLFISLQDPTELAPQQVPGIKFNIRPRQPHHDL ..:. : . .: :.:.::.::::::. :: : .: : .. :. .. NP_113 MQPSEMVMNPKQVFLSVLIFGVAGLLLFMYLQVWIE---EQHTG---RVEKRREQKVT 10 20 30 40 50 60 70 80 90 100 pF1KE6 PPGGSQDGDLKEP----TERVTRDLSSGAPRGRNLPAP--DQPQPPL---QRGTRLRLRQ : : ::.. . ... :. ..: .. . : .:.::: . NP_113 SGWGPVKYLRPVPRIMSTEKIQEHITNQNPK-FHMPEDVREKKENLLLNSERSTRLLTKT 60 70 80 90 100 110 110 120 130 140 pF1KE6 RRRR---------------LLIKKMPAAATIPANSSDAPF---IRPGPGTL--DGRWVSL . . ::.: .: :. . :. . :.: .: :..: . NP_113 SHSQGGDQALSKSTGSPTEKLIEKRQGAKTVFNKFSNMNWPVDIHPLNKSLVKDNKWKKT 120 130 140 150 160 170 150 160 170 180 190 200 pF1KE6 HRSQQERKRVMQEACAKYRASSSRRAVTPRHVSRIFVEDRHRVLYCEVPKAGCSNWKRVL ...:..:. .:: : :: . : ... . ::::.:::.:..:::::::::::::::.: NP_113 EETQEKRRSFLQEFCKKYGGVSHHQSHLFHTVSRIYVEDKHKILYCEVPKAGCSNWKRIL 180 190 200 210 220 230 210 220 230 240 250 260 pF1KE6 MVLAGLASSTADIQHNTVHYGSALKRLDTFDRQGILHRLSTYTKMLFVREPFERLVSAFR ::: :::::. .:.::.::::. 
::.::.:: .:: ::.:::: .:::.:.:::::::: NP_113 MVLNGLASSAYNISHNAVHYGKHLKKLDSFDLKGIYTRLNTYTKAVFVRDPMERLVSAFR 240 250 260 270 280 290 270 280 290 300 310 320 pF1KE6 DKFEHPNSYYHPVFGKAILARYRANASREALRTGSGVRFPEFVQYLLDVHRPVGMDIHWD ::::::::::::::::::. .:: :: .::: .::::.: ::..:::: ::::::::::. NP_113 DKFEHPNSYYHPVFGKAIIKKYRPNACEEALINGSGVKFKEFIHYLLDSHRPVGMDIHWE 300 310 320 330 340 350 330 340 350 360 370 380 pF1KE6 HVSRLCSPCLIDYDFVGKFESMEDDANFFLSLIRAPRNLTFPRFKDRHSQEARTTARIAH .::.:: ::::.::::::::..:.:::.::..: ::..: :: ::::::.. ::.:.... NP_113 KVSKLCYPCLINYDFVGKFETLEEDANYFLQMIGAPKELKFPNFKDRHSSDERTNAQVVR 360 370 380 390 400 410 390 400 410 420 pF1KE6 QYFAQLSALQRQRTYDFYYMDYLMFNYSKPFADLY ::. .:. .:: :::::.:::::::. :: NP_113 QYLKDLTRTERQLIYDFYYLDYLMFNYTTPFL 420 430 440 424 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:16:44 2016 done: Tue Nov 8 12:16:45 2016 Total Scan time: 7.760 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]