FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6332, 415 aa 1>>>pF1KE6332 415 - 415 aa - 415 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2181+/-0.000771; mu= 17.5503+/- 0.046 mean_var=63.2381+/-12.557, 0's: 0 Z-trim(107.2): 12 B-trim: 0 in 0/53 Lambda= 0.161282 statistics sampled from 9438 (9445) to 9438 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.672), E-opt: 0.2 (0.29), width: 16 Scan time: 2.170 The best scores are: opt bits E(32554) CCDS8136.1 B4GAT1 gene_id:11041|Hs108|chr11 ( 415) 2820 664.8 4.3e-191 CCDS13912.1 LARGE1 gene_id:9215|Hs108|chr22 ( 756) 359 92.3 1.7e-18 CCDS76399.1 LARGE2 gene_id:120071|Hs108|chr11 ( 690) 316 82.3 1.6e-15 CCDS31473.1 LARGE2 gene_id:120071|Hs108|chr11 ( 721) 316 82.3 1.7e-15 >>CCDS8136.1 B4GAT1 gene_id:11041|Hs108|chr11 (415 aa) initn: 2820 init1: 2820 opt: 2820 Z-score: 3544.2 bits: 664.8 E(32554): 4.3e-191 Smith-Waterman score: 2820; 100.0% identity (100.0% similar) in 415 aa overlap (1-415:1-415) 10 20 30 40 50 60 pF1KE6 MQMSYAIRCAFYQLLLAALMLVAMLQLLYLSLLSGLHGQEEQDQYFEFFPPSPRSVDQVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 MQMSYAIRCAFYQLLLAALMLVAMLQLLYLSLLSGLHGQEEQDQYFEFFPPSPRSVDQVK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 AQLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 AQLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 SVSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 SVSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 SCQEVFDKLARVAQPGINYALGTNVSYPNNLLRNLAREGANYALVIDVDMVPSEGLWRGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 SCQEVFDKLARVAQPGINYALGTNVSYPNNLLRNLAREGANYALVIDVDMVPSEGLWRGL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 REMLDQSNQWGGTALVVPAFEIRRARRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 REMLDQSNQWGGTALVVPAFEIRRARRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTN 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 YSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQACELHVAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 YSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQACELHVAG 310 320 330 340 350 360 370 380 390 400 410 pF1KE6 FDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKILYRQFKQELKAKYPNSPRRC ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 FDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKILYRQFKQELKAKYPNSPRRC 370 380 390 400 410 >>CCDS13912.1 LARGE1 gene_id:9215|Hs108|chr22 (756 aa) initn: 339 init1: 123 opt: 359 Z-score: 445.5 bits: 92.3 E(32554): 1.7e-18 Smith-Waterman score: 395; 28.7% identity (54.6% similar) in 328 aa overlap (91-408:470-742) 70 80 90 100 110 120 pF1KE6 AQLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPL : .:: :... :.: : : .. ..::::. CCDS13 DDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDRLQMLEAICKHWEGPI 440 450 460 470 480 490 130 140 150 160 170 180 pF1KE6 SVSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLR :.... . : :. : :: .:. : :..:.: . :.: CCDS13 SLALYLSDAEAQQF---LRYAQGSEVLMSRHNVGYHIV------------YKEGQF---- 500 510 520 530 540 190 200 210 220 230 pF1KE6 SCQEVFDKLARVAQPGINYALGTNVSYPNNLLRNLAREGAN--YALVIDVDMVPSEGLWR :: :::::.: . . : .. :.:..: ::.. CCDS13 --------------------------YPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYE 550 560 570 240 250 260 270 280 290 pF1KE6 GLRE---MLDQSNQWGGTALVVPAFEIRRAR-RMPMNKNELVQLYQVGEVRPFYYGLCTP ::. .:: .: :..::::: : : .: .: ::... ..: . : : . : CCDS13 YLRKSVIQLDLANT--KKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTK 580 590 600 610 620 630 300 310 320 330 340 350 pF1KE6 CQAPTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQAC .::::...: .. : : : :. .::. :. : .:.:: .:.:.... CCDS13 GHAPTNFAKW-----RTATTP-YRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNKVAHIM 640 650 660 670 680 360 370 380 390 400 410 pF1KE6 ELHVAGFDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKI----LYRQFKQELKAKYPN :: : ..: :: .....: . .: .: : :.. .: : ..:.:... .: CCDS13 ELDVQEYEFIVLPNAYMIH--MPHAPSFDITKFRSNKQYRICLKTLKEEFQQDMSRRYGF 690 700 710 720 730 740 pF1KE6 SPRRC CCDS13 AALKYLTAENNS 750 >>CCDS76399.1 LARGE2 gene_id:120071|Hs108|chr11 (690 aa) initn: 292 init1: 119 opt: 316 Z-score: 392.1 bits: 82.3 E(32554): 1.6e-15 Smith-Waterman score: 339; 28.9% identity (51.6% similar) in 322 aa overlap (92-408:398-662) 70 80 90 100 110 120 pF1KE6 QLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPLS :.:: :... :.: : : .: ..: ::.: CCDS76 EDPCFEFRQQQLTVHRVHVTFLPHEPPPPRPHDVTLVAQLSMDRLQMLEALCRHWPGPMS 370 380 390 400 410 420 130 140 150 160 170 180 pF1KE6 VSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLRS .... . : :. : .. .: : ::.:.: : :: : . CCDS76 LALYLTDAEAQQF---LHFVEASPVLAARQDVAYHVV----Y-------RE-GPL----- 430 440 450 460 190 200 210 220 230 pF1KE6 CQEVFDKLARVAQPGINYALGTNVSYPNNLLRN--LAREGANYALVIDVDMVPSEGLWRG :: : ::: ::. . :... :.:..:. .:. CCDS76 -------------------------YPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDY 470 480 490 500 240 250 260 270 280 290 pF1KE6 LREMLDQSNQWG--GTALVVPAFEIRRAR-RMPMNKNELVQLYQVGEVRPFYYGLCTPCQ :: ..: . . .:::::::: : : .: .: ::. : ..: . : : . CCDS76 LRASIEQLGLGSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 510 520 530 540 550 560 300 310 320 330 340 350 pF1KE6 APTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQACEL :::.:.:: .:. . : : : .::. :. : .: :: .:.:.... :: CCDS76 APTDYARW----REA--QAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNKVAHIVEL 570 580 590 600 610 360 370 380 390 400 410 pF1KE6 HVAGFDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKILYRQFKQELKAKYPNSPRRC . ... :: :.: .: : :. . ... ::. : :: .. CCDS76 DAQEYELLVLPEAFTIH------LPHAPSLDISRFRSSPTYRDCLQALKDEFHQDLSRHH 620 630 640 650 660 670 CCDS76 GAAALKYLPALQQPQSPARG 680 690 >>CCDS31473.1 LARGE2 gene_id:120071|Hs108|chr11 (721 aa) initn: 292 init1: 119 opt: 316 Z-score: 391.8 bits: 82.3 E(32554): 1.7e-15 Smith-Waterman score: 339; 28.9% identity (51.6% similar) in 322 aa overlap (92-408:429-693) 70 80 90 100 110 120 pF1KE6 QLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPLS :.:: :... :.: : : .: ..: ::.: CCDS31 EDPCFEFRQQQLTVHRVHVTFLPHEPPPPRPHDVTLVAQLSMDRLQMLEALCRHWPGPMS 400 410 420 430 440 450 130 140 150 160 170 180 pF1KE6 VSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLRS .... . : :. : .. .: : ::.:.: : :: : . CCDS31 LALYLTDAEAQQF---LHFVEASPVLAARQDVAYHVV----Y-------RE-GPL----- 460 470 480 490 190 200 210 220 230 pF1KE6 CQEVFDKLARVAQPGINYALGTNVSYPNNLLRN--LAREGANYALVIDVDMVPSEGLWRG :: : ::: ::. . :... :.:..:. .:. CCDS31 -------------------------YPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDY 500 510 520 530 240 250 260 270 280 290 pF1KE6 LREMLDQSNQWG--GTALVVPAFEIRRAR-RMPMNKNELVQLYQVGEVRPFYYGLCTPCQ :: ..: . . .:::::::: : : .: .: ::. : ..: . : : . CCDS31 LRASIEQLGLGSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 540 550 560 570 580 590 300 310 320 330 340 350 pF1KE6 APTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQACEL :::.:.:: .:. . : : : .::. :. : .: :: .:.:.... :: CCDS31 APTDYARW----REA--QAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNKVAHIVEL 600 610 620 630 640 360 370 380 390 400 410 pF1KE6 HVAGFDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKILYRQFKQELKAKYPNSPRRC . ... :: :.: .: : :. . ... ::. : :: .. CCDS31 DAQEYELLVLPEAFTIH------LPHAPSLDISRFRSSPTYRDCLQALKDEFHQDLSRHH 650 660 670 680 690 700 CCDS31 GAAALKYLPALQQPQSPARG 710 720 415 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:11:32 2016 done: Tue Nov 8 12:11:32 2016 Total Scan time: 2.170 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]