FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB4483, 533 aa 1>>>pF1KB4483 533 - 533 aa - 533 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.3261+/-0.000333; mu= 14.5332+/- 0.021 mean_var=104.5097+/-20.601, 0's: 0 Z-trim(117.5): 43 B-trim: 215 in 1/56 Lambda= 0.125457 statistics sampled from 29568 (29611) to 29568 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.695), E-opt: 0.2 (0.347), width: 16 Scan time: 9.920 The best scores are: opt bits E(85289) XP_016874631 (OMIM: 601873) PREDICTED: beta-1,4 N- ( 533) 3561 655.2 1.4e-187 NP_001469 (OMIM: 601873) beta-1,4 N-acetylgalactos ( 533) 3561 655.2 1.4e-187 XP_011536450 (OMIM: 601873) PREDICTED: beta-1,4 N- ( 539) 3539 651.2 2.3e-186 NP_001263397 (OMIM: 601873) beta-1,4 N-acetylgalac ( 478) 2681 495.9 1.2e-139 XP_005268830 (OMIM: 601873) PREDICTED: beta-1,4 N- ( 544) 2525 467.7 4.1e-131 XP_011536449 (OMIM: 601873) PREDICTED: beta-1,4 N- ( 550) 2503 463.7 6.5e-130 XP_016874629 (OMIM: 601873) PREDICTED: beta-1,4 N- ( 578) 2215 411.6 3.3e-114 XP_016874630 (OMIM: 601873) PREDICTED: beta-1,4 N- ( 578) 2215 411.6 3.3e-114 NP_001263398 (OMIM: 601873) beta-1,4 N-acetylgalac ( 328) 1568 294.3 3.8e-79 XP_016879662 (OMIM: 111730) PREDICTED: beta-1,4 N- ( 480) 1189 225.8 2.3e-58 NP_001152860 (OMIM: 111730) beta-1,4 N-acetylgalac ( 480) 1189 225.8 2.3e-58 NP_001152859 (OMIM: 111730) beta-1,4 N-acetylgalac ( 506) 1189 225.8 2.4e-58 NP_703147 (OMIM: 111730) beta-1,4 N-acetylgalactos ( 566) 1189 225.9 2.6e-58 XP_011522616 (OMIM: 111730) PREDICTED: beta-1,4 N- ( 320) 378 78.9 2.5e-14 >>XP_016874631 (OMIM: 601873) PREDICTED: beta-1,4 N-acet (533 aa) initn: 3561 init1: 3561 opt: 3561 Z-score: 3488.5 bits: 655.2 E(85289): 1.4e-187 Smith-Waterman score: 3561; 100.0% identity (100.0% similar) in 533 aa overlap (1-533:1-533) 10 20 30 40 50 60 pF1KB4 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLWVDDDFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLWVDDDFV 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB4 FTARTRLERLVDVLERTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLRQRRGFH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 FTARTRLERLVDVLERTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLRQRRGFH 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB4 HELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 HELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVV 430 440 450 460 470 480 490 500 510 520 530 pF1KB4 VDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ ::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 VDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ 490 500 510 520 530 >>NP_001469 (OMIM: 601873) beta-1,4 N-acetylgalactosamin (533 aa) initn: 3561 init1: 3561 opt: 3561 Z-score: 3488.5 bits: 655.2 E(85289): 1.4e-187 Smith-Waterman score: 3561; 100.0% identity (100.0% similar) in 533 aa overlap (1-533:1-533) 10 20 30 40 50 60 pF1KB4 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLWVDDDFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLWVDDDFV 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB4 FTARTRLERLVDVLERTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLRQRRGFH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FTARTRLERLVDVLERTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLRQRRGFH 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB4 HELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVV 430 440 450 460 470 480 490 500 510 520 530 pF1KB4 VDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ ::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ 490 500 510 520 530 >>XP_011536450 (OMIM: 601873) PREDICTED: beta-1,4 N-acet (539 aa) initn: 3547 init1: 1993 opt: 3539 Z-score: 3466.9 bits: 651.2 E(85289): 2.3e-186 Smith-Waterman score: 3539; 98.9% identity (98.9% similar) in 539 aa overlap (1-533:1-539) 10 20 30 40 50 60 pF1KB4 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL 130 140 150 160 170 180 190 200 210 220 230 pF1KB4 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADT--- ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTECK 190 200 210 220 230 240 240 250 260 270 280 290 pF1KB4 ---VRFSTEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRA ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 FHKVRFSTEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRA 250 260 270 280 290 300 300 310 320 330 340 350 pF1KB4 LITSIRRFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 LITSIRRFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLW 310 320 330 340 350 360 360 370 380 390 400 410 pF1KB4 VDDDFVFTARTRLERLVDVLERTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 VDDDFVFTARTRLERLVDVLERTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLR 370 380 390 400 410 420 420 430 440 450 460 470 pF1KB4 QRRGFHHELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 QRRGFHHELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVG 430 440 450 460 470 480 480 490 500 510 520 530 pF1KB4 SCSDVVVDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 SCSDVVVDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ 490 500 510 520 530 >>NP_001263397 (OMIM: 601873) beta-1,4 N-acetylgalactosa (478 aa) initn: 3178 init1: 2681 opt: 2681 Z-score: 2628.4 bits: 495.9 E(85289): 1.2e-139 Smith-Waterman score: 3072; 89.7% identity (89.7% similar) in 533 aa overlap (1-533:1-478) 10 20 30 40 50 60 pF1KB4 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ ::::::::::::: NP_001 AHIPVRIKEQVVG----------------------------------------------- 70 130 140 150 160 170 180 pF1KB4 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL :::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 --------SQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL 80 90 100 110 120 190 200 210 220 230 240 pF1KB4 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF 130 140 150 160 170 180 250 260 270 280 290 300 pF1KB4 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR 190 200 210 220 230 240 310 320 330 340 350 360 pF1KB4 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLWVDDDFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLWVDDDFV 250 260 270 280 290 300 370 380 390 400 410 420 pF1KB4 FTARTRLERLVDVLERTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLRQRRGFH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FTARTRLERLVDVLERTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLRQRRGFH 310 320 330 340 350 360 430 440 450 460 470 480 pF1KB4 HELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVV 370 380 390 400 410 420 490 500 510 520 530 pF1KB4 VDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ ::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ 430 440 450 460 470 >>XP_005268830 (OMIM: 601873) PREDICTED: beta-1,4 N-acet (544 aa) initn: 3547 init1: 2525 opt: 2525 Z-score: 2475.0 bits: 467.7 E(85289): 4.1e-131 Smith-Waterman score: 3529; 98.0% identity (98.0% similar) in 544 aa overlap (1-533:1-544) 10 20 30 40 50 60 pF1KB4 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLWVDDDFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLWVDDDFV 310 320 330 340 350 360 370 380 390 400 pF1KB4 FTARTRLERLVDVLERTPLDL-----------VGGAVREISGFATTYRQLLSVEPGAPGL ::::::::::::::::::::: :::::::::::::::::::::::::::: XP_005 FTARTRLERLVDVLERTPLDLARLEATSLSLQVGGAVREISGFATTYRQLLSVEPGAPGL 370 380 390 400 410 420 410 420 430 440 450 460 pF1KB4 GNCLRQRRGFHHELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 GNCLRQRRGFHHELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLG 430 440 450 460 470 480 470 480 490 500 510 520 pF1KB4 SLRVGSCSDVVVDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 SLRVGSCSDVVVDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQC 490 500 510 520 530 540 530 pF1KB4 MTSQ :::: XP_005 MTSQ >>XP_011536449 (OMIM: 601873) PREDICTED: beta-1,4 N-acet (550 aa) initn: 3533 init1: 1568 opt: 2503 Z-score: 2453.4 bits: 463.7 E(85289): 6.5e-130 Smith-Waterman score: 3507; 96.9% identity (96.9% similar) in 550 aa overlap (1-533:1-550) 10 20 30 40 50 60 pF1KB4 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL 130 140 150 160 170 180 190 200 210 220 230 pF1KB4 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADT--- ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTECK 190 200 210 220 230 240 240 250 260 270 280 290 pF1KB4 ---VRFSTEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRA ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 FHKVRFSTEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRA 250 260 270 280 290 300 300 310 320 330 340 350 pF1KB4 LITSIRRFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 LITSIRRFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLW 310 320 330 340 350 360 360 370 380 390 400 pF1KB4 VDDDFVFTARTRLERLVDVLERTPLDL-----------VGGAVREISGFATTYRQLLSVE ::::::::::::::::::::::::::: :::::::::::::::::::::: XP_011 VDDDFVFTARTRLERLVDVLERTPLDLARLEATSLSLQVGGAVREISGFATTYRQLLSVE 370 380 390 400 410 420 410 420 430 440 450 460 pF1KB4 PGAPGLGNCLRQRRGFHHELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 PGAPGLGNCLRQRRGFHHELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEF 430 440 450 460 470 480 470 480 490 500 510 520 pF1KB4 FLDGLGSLRVGSCSDVVVDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 FLDGLGSLRVGSCSDVVVDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFF 490 500 510 520 530 540 530 pF1KB4 KHRLQCMTSQ :::::::::: XP_011 KHRLQCMTSQ 550 >>XP_016874629 (OMIM: 601873) PREDICTED: beta-1,4 N-acet (578 aa) initn: 2215 init1: 2215 opt: 2215 Z-score: 2171.4 bits: 411.6 E(85289): 3.3e-114 Smith-Waterman score: 3368; 92.0% identity (92.0% similar) in 565 aa overlap (1-520:1-565) 10 20 30 40 50 60 pF1KB4 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR 250 260 270 280 290 300 310 320 330 pF1KB4 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGK-------------------------- :::::::::::::::::::::::::::::::::: XP_016 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKLLAFRKHSKVWRRIQAEKGRGRERKV 310 320 330 340 350 360 340 350 360 370 pF1KB4 -------------------GWFAGRNLAVSQVTTKYVLWVDDDFVFTARTRLERLVDVLE ::::::::::::::::::::::::::::::::::::::::: XP_016 AEGHVLFLQPPQPHTPFCQGWFAGRNLAVSQVTTKYVLWVDDDFVFTARTRLERLVDVLE 370 380 390 400 410 420 380 390 400 410 420 430 pF1KB4 RTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLRQRRGFHHELVGFPGCVVTDGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 RTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLRQRRGFHHELVGFPGCVVTDGV 430 440 450 460 470 480 440 450 460 470 480 490 pF1KB4 VNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVVVDHASKLKLPWTSRD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 VNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVVVDHASKLKLPWTSRD 490 500 510 520 530 540 500 510 520 530 pF1KB4 AGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ ::::::::::::::::::::::::: XP_016 AGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ 550 560 570 >>XP_016874630 (OMIM: 601873) PREDICTED: beta-1,4 N-acet (578 aa) initn: 2215 init1: 2215 opt: 2215 Z-score: 2171.4 bits: 411.6 E(85289): 3.3e-114 Smith-Waterman score: 3368; 92.0% identity (92.0% similar) in 565 aa overlap (1-520:1-565) 10 20 30 40 50 60 pF1KB4 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR 250 260 270 280 290 300 310 320 330 pF1KB4 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGK-------------------------- :::::::::::::::::::::::::::::::::: XP_016 RFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKLLAFRKHSKVWRRIQAEKGRGRERKV 310 320 330 340 350 360 340 350 360 370 pF1KB4 -------------------GWFAGRNLAVSQVTTKYVLWVDDDFVFTARTRLERLVDVLE ::::::::::::::::::::::::::::::::::::::::: XP_016 AEGHVLFLQPPQPHTPFCQGWFAGRNLAVSQVTTKYVLWVDDDFVFTARTRLERLVDVLE 370 380 390 400 410 420 380 390 400 410 420 430 pF1KB4 RTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLRQRRGFHHELVGFPGCVVTDGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 RTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLRQRRGFHHELVGFPGCVVTDGV 430 440 450 460 470 480 440 450 460 470 480 490 pF1KB4 VNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVVVDHASKLKLPWTSRD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 VNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVVVDHASKLKLPWTSRD 490 500 510 520 530 540 500 510 520 530 pF1KB4 AGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ ::::::::::::::::::::::::: XP_016 AGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ 550 560 570 >>NP_001263398 (OMIM: 601873) beta-1,4 N-acetylgalactosa (328 aa) initn: 1590 init1: 1568 opt: 1568 Z-score: 1542.0 bits: 294.3 E(85289): 3.8e-79 Smith-Waterman score: 1568; 100.0% identity (100.0% similar) in 237 aa overlap (1-237:1-237) 10 20 30 40 50 60 pF1KB4 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 EFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRF ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTGAR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 STEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIR NP_001 PGWRDGQAGQTEKNQKGWSGQMAEGMGGIWAMARAVQPHNGCFNWTSRARGRKGAFVHLG 250 260 270 280 290 300 >>XP_016879662 (OMIM: 111730) PREDICTED: beta-1,4 N-acet (480 aa) initn: 1078 init1: 510 opt: 1189 Z-score: 1168.9 bits: 225.8 E(85289): 2.3e-58 Smith-Waterman score: 1261; 44.5% identity (69.0% similar) in 494 aa overlap (45-529:13-478) 20 30 40 50 60 70 pF1KB4 LACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRYAHIPVRIKEQVVGL ::. :::: .: : .. . .:.. .: XP_016 MFGSMFLQAVFSSPK-PELP--SPAPGVQKLKLLPEERLRNL 10 20 30 80 90 100 110 120 pF1KB4 LA----W----NNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQEFQAFL .. : :.:.::.. :. . .. :. ..: :..: :. ::. : XP_016 FSYDGIWLFPKNQCKCEAN---------KEQGGYNFQDAYGQSDLPAVKARRQAEFEHFQ 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB4 SRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNLTASLGT : : :.. : :. ::..::::.::... .:::.... .. ::.:.::::::: XP_016 RREGLPRPLPLLVQPNLPFGYPVHGVEVMPLHTVPIPGLQFEGPDAP-VYEVTLTASLGT 100 110 120 130 140 190 200 210 220 230 240 pF1KB4 WDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRFSTEGHE .. ..: .. :.:: .: . . :. :: :::.: .:: . .: : . ... XP_016 LNTLADVPDSVVQGRGQKQLIISTSDRKLLKFILQHVTYTSTGYQHQKVDIVSLESRSSV 150 160 170 180 190 200 250 260 270 280 290 300 pF1KB4 AAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIRRFYPTV : : . :::: :.:: :: :. : ::::::::::: .: .. :::..:: . XP_016 AKFPVTIRHPVIPKLYDPG--PERKLRN---LVTIATKTFLRPHKLMIMLRSIREYYPDL 210 220 230 240 250 260 310 320 330 340 350 360 pF1KB4 TVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLWVDDDFVFTARTR ::..::::.:: ... .::.: ::::::::::::::.:::::::::::::::.:. .:. XP_016 TVIVADDSQKPLEIKDNHVEYYTMPFGKGWFAGRNLAISQVTTKYVLWVDDDFLFNEETK 270 280 290 300 310 320 370 380 390 400 410 420 pF1KB4 LERLVDVLERTPLDLVGGAVREISGFATTYRQLL-SVEPGAPGLGNCLRQRRGFHHELVG .: ::::::.: ::.:::.: : . .. :: . : :: ::..: :: . : : XP_016 IEVLVDVLEKTELDVVGGSVL---GNVFQFKLLLEQSENGA-----CLHKRMGFFQPLDG 330 340 350 360 370 430 440 450 460 470 480 pF1KB4 FPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVVVDHAS ::.::::.::::::::.:.....:::::::.:::: :::.::::.: :::: .:.. : : XP_016 FPSCVVTSGVVNFFLAHTERLQRVGFDPRLQRVAHSEFFIDGLGTLLVGSCPEVIIGHQS 380 390 400 410 420 430 490 500 510 520 530 pF1KB4 KLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ . . . : .:: :: ..: . :. : : .::..::: XP_016 RSPVVDSELAALEKTYNTYR-SNTLTRVQF-KLALHYFKNHLQCAA 440 450 460 470 480 533 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 20:37:02 2016 done: Mon Nov 7 20:37:04 2016 Total Scan time: 9.920 Total Display time: 0.080 Function used was FASTA [36.3.4 Apr, 2011]