FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4488, 510 aa 1>>>pF1KE4488 510 - 510 aa - 510 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.0117+/-0.00092; mu= 11.1584+/- 0.055 mean_var=131.8774+/-26.370, 0's: 0 Z-trim(109.9): 17 B-trim: 0 in 0/52 Lambda= 0.111683 statistics sampled from 11223 (11228) to 11223 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.703), E-opt: 0.2 (0.345), width: 16 Scan time: 3.370 The best scores are: opt bits E(32554) CCDS11974.1 LMAN1 gene_id:3998|Hs108|chr18 ( 510) 3454 568.0 8.5e-162 CCDS10270.1 LMAN1L gene_id:79748|Hs108|chr15 ( 526) 1000 172.7 9.3e-43 CCDS4417.1 LMAN2 gene_id:10960|Hs108|chr5 ( 356) 570 103.2 4.9e-22 CCDS2023.1 LMAN2L gene_id:81562|Hs108|chr2 ( 348) 528 96.5 5.3e-20 >>CCDS11974.1 LMAN1 gene_id:3998|Hs108|chr18 (510 aa) initn: 3454 init1: 3454 opt: 3454 Z-score: 3018.2 bits: 568.0 E(32554): 8.5e-162 Smith-Waterman score: 3454; 99.8% identity (100.0% similar) in 510 aa overlap (1-510:1-510) 10 20 30 40 50 60 pF1KE4 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEYKYSFKGPHLVQS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEYKYSFKGPHLVQS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 DGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEVTFRVTGRGRIGA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 DGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEVTFRVTGRGRIGA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 DGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNNPAIVIIGNNGQIHYDHQN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 DGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNNPAIVIIGNNGQIHYDHQN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 DGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPDKNDYEFCAKVENMIIPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 DGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPDKNDYEFCAKVENMIIPA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 QGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKEISEKEKEKYQEEFEHFQQEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKEISEKEKEKYQEEFEHFQQEL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 DKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQNRIHLEIKQLNRQLDMILDEQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 DKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQNRIHLEIKQLNRQLDMILDEQR 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 RYVSSLTEEISKRGAGMPGQHGQITQQELDTVVKTQHEILRQVNEMKNSLSETVRLVSGM :::::::::::::::::::::::::::::::::::::::::::::::::.:::::::::: CCDS11 RYVSSLTEEISKRGAGMPGQHGQITQQELDTVVKTQHEILRQVNEMKNSMSETVRLVSGM 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 QHPGSAGGVYETTQHFIDIKEHLHIVKRDIDNLVQRNMPSNEKPKCPELPPFPSCLSTVH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QHPGSAGGVYETTQHFIDIKEHLHIVKRDIDNLVQRNMPSNEKPKCPELPPFPSCLSTVH 430 440 450 460 470 480 490 500 510 pF1KE4 FIIFVVVQTVLFIGYIMYRSQQEAAAKKFF :::::::::::::::::::::::::::::: CCDS11 FIIFVVVQTVLFIGYIMYRSQQEAAAKKFF 490 500 510 >>CCDS10270.1 LMAN1L gene_id:79748|Hs108|chr15 (526 aa) initn: 900 init1: 435 opt: 1000 Z-score: 881.1 bits: 172.7 E(32554): 9.3e-43 Smith-Waterman score: 1000; 35.5% identity (66.3% similar) in 501 aa overlap (15-501:9-486) 10 20 30 40 50 60 pF1KE4 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEYKYSFKGPHLVQS :::: ::: : : : : :::::: :::::.:. CCDS10 MPAVSGPGPLFCLLLLLLDPHSPETGC---P----PLRRFEYKLSFKGPRLALP 10 20 30 40 70 80 90 100 110 120 pF1KE4 DGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEVTFRVTGRGRIGA . .:::.: :.:: . ...:..::.... :.::..... : ::::: .:::: :: :: CCDS10 GAGIPFWSHHGDAILGLEEVRLTPSMRNRSGAVWSRASVPFSAWEVEVQMRVTGLGRRGA 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE4 DGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNNPAIVIIGNNGQIHYDHQN .:.:.::....: : :.:. :.:.:::::: .: ...::: .....:.: .. . CCDS10 QGMAVWYTRGRGHVGSVLGGLASWDGIGIFFDSPAED-TQDSPAIRVLASDGHIPSEQPG 110 120 130 140 150 160 190 200 210 220 230 240 pF1KE4 DGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPDKNDYEFCAKVENMIIPA :::::.:.::. ::::.:.: ::.:::. . : . .:.:.::. . :::. : ... CCDS10 DGASQGLGSCHWDFRNRPHPFRARITYWGQRLRMSLNSGLTPS-DPGEFCVDVGPLLLVP 170 180 190 200 210 220 250 260 270 280 290 pF1KE4 QGHFGISAATGGLADDHDVLSFLTFQLTEPGKE-PPTPDKEISEKEKEKYQEEFEHFQQE : ::.::::: :::::::::::::.:.::. : :: : :. .. . : : . CCDS10 GGFFGVSAATGTLADDHDVLSFLTFSLSEPSPEVPPQPFLEM-QQLRLARQLEGLWARLG 230 240 250 260 270 280 300 310 320 330 340 350 pF1KE4 LDKKKEEFQKGHPDLQGQPAEEIF---ESVG-DRELRQVFEGQNRIHLEIKQLNRQLDMI : ... :. . ::. .:..: :..: :.. :...: .. .. : .:: CCDS10 LGTREDVTPKSDSEAQGE-GERLFDLEETLGRHRRILQALRGLSK---QLAQAERQWKKQ 290 300 310 320 330 340 360 370 380 390 400 pF1KE4 LDE--QRRYVSSLTEEISKRGAGMPGQHGQITQQ------ELDTVVKTQHEILRQVNEMK : : : .. . . : . . ::. :..... .. .... : .:. ..::. CCDS10 LGPPGQARPDGGWALDASCQIPSTPGRGGHLSMSLNKDSAKVGALLHGQWTLLQALQEMR 350 360 370 380 390 400 410 420 430 440 450 460 pF1KE4 NSLSETVRLVSGMQHPGSAGGVYETTQHFIDIKEHLHIVKRDIDNLVQRNMPSNEKPKCP .. .::... : :. .::... . : ...... . .. . . :. : CCDS10 DA---AVRMAAEAQVSYLPVGI---EHHFLELDHILGLLQEELRGPAK---AAAKAPRPP 410 420 430 440 450 470 480 490 500 510 pF1KE4 ELPPFPS-CLSTVHFIIFVVVQTVLFIGYIMYRSQQEAAAKKFF :: : ::. :......::: :.::. .:.. CCDS10 GQPPRASSCLQPGIFLFYLLIQTVGFFGYVHFRQELNKSLQECLSTGSLPLGPAPHTPRA 460 470 480 490 500 510 CCDS10 LGILRRQPLPASMPA 520 >>CCDS4417.1 LMAN2 gene_id:10960|Hs108|chr5 (356 aa) initn: 408 init1: 160 opt: 570 Z-score: 509.1 bits: 103.2 E(32554): 4.9e-22 Smith-Waterman score: 570; 33.2% identity (64.8% similar) in 301 aa overlap (15-310:32-316) 10 20 30 40 pF1KE4 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHR ::: ::: :: : .: . :. : CCDS44 AAEGWIWRWGWGRRCLGRPGLLGPGPGPTTPLF--LLLLLGS-VTADITDGNS----EHL 10 20 30 40 50 50 60 70 80 90 100 pF1KE4 RFEYKYSFKGPHLVQSDGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENW . :. :. :. .....:.: :... .:. .:..:. .:..::.:.. ...: CCDS44 KREH--SLIKPYQGVGSSSMPLWDFQGSTMLTSQYVRLTPDERSKEGSIWNHQPCFLKDW 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE4 EVEVTFRVTGRGR--IGADGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDND--GKK :..: :.: : :. . .::.:.::.... . :::::: : ..:..::.:.. :: .. CCDS44 EMHVHFKVHGTGKKNLHGDGIALWYTRDRLVPGPVFGSKDNFHGLAIFLDTYPNDETTER 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE4 NNPAIVIIGNNGQIHYDHQNDGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGF : : .. :::.. :::..:: ::.: ::::. . . . : .. :::: . CCDS44 VFPYISVMVNNGSLSYDHSKDGRWTELAGCTADFRNRDHDTFLAVRYSRGRLTVMTD--- 180 190 200 210 220 230 240 250 260 270 280 pF1KE4 TPDKNDYEFCAKVENMIIPAQGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKE :::... : . .. .:. .:: ::.:: :.:.::..:. ::: :::.: CCDS44 LEDKNEWKNCIDITGVRLPTGYYFGASAGTGDLSDNHDIISMKLFQLMV----EHTPDEE 230 240 250 260 270 280 290 300 310 320 330 pF1KE4 ISEKEKEKYQEEF-EHFQQELDKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQN . : . . .: . ....: .:..: CCDS44 SIDWTKIEPSVNFLKSPKDNVDDPTGNFRSGPLTGWRVFLLLLCALLGIVVCAVVGAVVF 290 300 310 320 330 340 340 350 360 370 380 390 pF1KE4 RIHLEIKQLNRQLDMILDEQRRYVSSLTEEISKRGAGMPGQHGQITQQELDTVVKTQHEI CCDS44 QKRQERNKRFY 350 >>CCDS2023.1 LMAN2L gene_id:81562|Hs108|chr2 (348 aa) initn: 265 init1: 180 opt: 528 Z-score: 472.6 bits: 96.5 E(32554): 5.3e-20 Smith-Waterman score: 528; 33.6% identity (62.7% similar) in 271 aa overlap (6-268:15-275) 10 20 30 40 pF1KE4 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEY--- .: : :: . ::: :: :.: : . . ::: CCDS20 MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGS---GQG----PQQVGAGQTFEYLKR 10 20 30 40 50 50 60 70 80 90 100 pF1KE4 KYSFKGPHLVQSDGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEV ..:.. :. . :. .: :::. .. ::..:...:..:..:... ...::..: CCDS20 EHSLSKPYQGVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQV 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE4 TFRVTGRGR--IGADGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNN---P :.. :.:. . .:::::::.... :::::. : . :.:.: :.. :. :... : CCDS20 HFKIHGQGKKNLHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQERVFP 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE4 AIVIIGNNGQIHYDHQNDGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPD : . :::.. :::. :: :..: :: : . : : . ::.:.. CCDS20 YISAMVNNGSLSYDHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMD---IDG 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE4 KNDYEFCAKVENMIIPAQGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKEISE :.... : .: .. .: .:: :. :: :.:.:::.:. :.:: CCDS20 KHEWRDCIEVPGVRLPRGYYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDVF 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE4 KEKEKYQEEFEHFQQELDKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQNRIHL CCDS20 LPSVDNMKLPEMTAPLPPLSGLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY 300 310 320 330 340 510 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:48:31 2016 done: Sun Nov 6 00:48:31 2016 Total Scan time: 3.370 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]