FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1935, 474 aa 1>>>pF1KE1935 474 - 474 aa - 474 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8560+/-0.000945; mu= 15.3898+/- 0.057 mean_var=77.9760+/-15.692, 0's: 0 Z-trim(106.2): 16 B-trim: 177 in 1/49 Lambda= 0.145243 statistics sampled from 8863 (8870) to 8863 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.647), E-opt: 0.2 (0.272), width: 16 Scan time: 3.220 The best scores are: opt bits E(32554) CCDS3550.1 GC gene_id:2638|Hs108|chr4 ( 474) 3171 674.2 8.4e-194 CCDS56332.1 GC gene_id:2638|Hs108|chr4 ( 493) 3171 674.2 8.7e-194 CCDS3555.1 ALB gene_id:213|Hs108|chr4 ( 609) 595 134.4 3.2e-31 >>CCDS3550.1 GC gene_id:2638|Hs108|chr4 (474 aa) initn: 3171 init1: 3171 opt: 3171 Z-score: 3592.9 bits: 674.2 E(32554): 8.4e-194 Smith-Waterman score: 3171; 99.6% identity (100.0% similar) in 474 aa overlap (1-474:1-474) 10 20 30 40 50 60 pF1KE1 MKRVLVLLLAVAFGHALERGRDYEKNKVCKEFSHLGKEDFTSLSLVLYSRKFPSGTFEQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 MKRVLVLLLAVAFGHALERGRDYEKNKVCKEFSHLGKEDFTSLSLVLYSRKFPSGTFEQV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 SQLVKEVVSLTEACCAEGADPDCYDTRTSALSAKSCESNSPFPVHPGTAECCTKEGLERK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 SQLVKEVVSLTEACCAEGADPDCYDTRTSALSAKSCESNSPFPVHPGTAECCTKEGLERK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 LCMAALKHQPQEFPTYVEPTNDEICEAFRKDPKEYANQFMWEYSTNYGQAPLSLLVSYTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 LCMAALKHQPQEFPTYVEPTNDEICEAFRKDPKEYANQFMWEYSTNYGQAPLSLLVSYTK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 SYLSMVGSCCTSASPTVCFLKERLQLKHLSLLTTLSNRVCSQYAAYGEKKSRLSNLIKLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 SYLSMVGSCCTSASPTVCFLKERLQLKHLSLLTTLSNRVCSQYAAYGEKKSRLSNLIKLA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 QKVPTADLEDVLPLAEDITNILSKCCESASEDCMAKELPEHTVKLCDNLSTKNSKFEDCC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 QKVPTADLEDVLPLAEDITNILSKCCESASEDCMAKELPEHTVKLCDNLSTKNSKFEDCC 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 QEKTAMDVFVCTYFMPAAQLPELPDVELPTNKDVCDPGNTKVMDKYTFELSRRTHLPEVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 QEKTAMDVFVCTYFMPAAQLPELPDVELPTNKDVCDPGNTKVMDKYTFELSRRTHLPEVF 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 LSKVLEPTLKSLGECCDVEDSTTCFNAKGPLLKKELSSFIDKGQELCADYSENTFTEYKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 LSKVLEPTLKSLGECCDVEDSTTCFNAKGPLLKKELSSFIDKGQELCADYSENTFTEYKK 370 380 390 400 410 420 430 440 450 460 470 pF1KE1 KLAERLKAKLPEATPTELAKLVNKRSDFASNCCSINSPPLYCDSEIDAELKNIL :::::::::::.::::::::::::.::::::::::::::::::::::::::::: CCDS35 KLAERLKAKLPDATPTELAKLVNKHSDFASNCCSINSPPLYCDSEIDAELKNIL 430 440 450 460 470 >>CCDS56332.1 GC gene_id:2638|Hs108|chr4 (493 aa) initn: 3171 init1: 3171 opt: 3171 Z-score: 3592.6 bits: 674.2 E(32554): 8.7e-194 Smith-Waterman score: 3171; 99.6% identity (100.0% similar) in 474 aa overlap (1-474:20-493) 10 20 30 40 pF1KE1 MKRVLVLLLAVAFGHALERGRDYEKNKVCKEFSHLGKEDFT ::::::::::::::::::::::::::::::::::::::::: CCDS56 MLWSWSEERGGAARLSGRKMKRVLVLLLAVAFGHALERGRDYEKNKVCKEFSHLGKEDFT 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE1 SLSLVLYSRKFPSGTFEQVSQLVKEVVSLTEACCAEGADPDCYDTRTSALSAKSCESNSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 SLSLVLYSRKFPSGTFEQVSQLVKEVVSLTEACCAEGADPDCYDTRTSALSAKSCESNSP 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE1 FPVHPGTAECCTKEGLERKLCMAALKHQPQEFPTYVEPTNDEICEAFRKDPKEYANQFMW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 FPVHPGTAECCTKEGLERKLCMAALKHQPQEFPTYVEPTNDEICEAFRKDPKEYANQFMW 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE1 EYSTNYGQAPLSLLVSYTKSYLSMVGSCCTSASPTVCFLKERLQLKHLSLLTTLSNRVCS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 EYSTNYGQAPLSLLVSYTKSYLSMVGSCCTSASPTVCFLKERLQLKHLSLLTTLSNRVCS 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE1 QYAAYGEKKSRLSNLIKLAQKVPTADLEDVLPLAEDITNILSKCCESASEDCMAKELPEH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 QYAAYGEKKSRLSNLIKLAQKVPTADLEDVLPLAEDITNILSKCCESASEDCMAKELPEH 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE1 TVKLCDNLSTKNSKFEDCCQEKTAMDVFVCTYFMPAAQLPELPDVELPTNKDVCDPGNTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 TVKLCDNLSTKNSKFEDCCQEKTAMDVFVCTYFMPAAQLPELPDVELPTNKDVCDPGNTK 310 320 330 340 350 360 350 360 370 380 390 400 pF1KE1 VMDKYTFELSRRTHLPEVFLSKVLEPTLKSLGECCDVEDSTTCFNAKGPLLKKELSSFID :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 VMDKYTFELSRRTHLPEVFLSKVLEPTLKSLGECCDVEDSTTCFNAKGPLLKKELSSFID 370 380 390 400 410 420 410 420 430 440 450 460 pF1KE1 KGQELCADYSENTFTEYKKKLAERLKAKLPEATPTELAKLVNKRSDFASNCCSINSPPLY ::::::::::::::::::::::::::::::.::::::::::::.:::::::::::::::: CCDS56 KGQELCADYSENTFTEYKKKLAERLKAKLPDATPTELAKLVNKHSDFASNCCSINSPPLY 430 440 450 460 470 480 470 pF1KE1 CDSEIDAELKNIL ::::::::::::: CCDS56 CDSEIDAELKNIL 490 >>CCDS3555.1 ALB gene_id:213|Hs108|chr4 (609 aa) initn: 424 init1: 302 opt: 595 Z-score: 674.0 bits: 134.4 E(32554): 3.2e-31 Smith-Waterman score: 598; 24.9% identity (58.1% similar) in 470 aa overlap (1-453:1-462) 10 20 30 40 50 pF1KE1 MKRVLVLLLAVAFGHALERG---RDYEKNKVCKEFSHLGKEDFTSLSLVLYSRKFPSGTF :: : . : :. : :: :: .:..: ..:. ::.:.: .: :. ... . . : CCDS35 MKWVTFISLLFLFSSAYSRGVFRRDAHKSEVAHRFKDLGEENFKALVLIAFAQYLQQCPF 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 EQVSQLVKEVVSLTEACCAEGADPDCYDTRTSALSAKSCESNSPFPVHPGTAECCTKEGL :. .::.::. ....: :. . .: . . .. : : . .. :.::.:. CCDS35 EDHVKLVNEVTEFAKTCVADESAENCDKSLHTLFGDKLCTVATLRETYGEMADCCAKQEP 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE1 ERKLCMAALKHQPQEFPTYVEPTNDEICEAFRKDPKEYANQFMWEYSTNYGQAPLSLLVS ::. :. : . ..: :.: : .: ::. . . . .....: . . :. CCDS35 ERNECFLQHKDDNPNLPRLVRPEVDVMCTAFHDNEETFLKKYLYEIARRHPYFYAPELLF 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE1 YTKSYLSMVGSCCTSASPTVCFLKERLQLKHLSLLTTLSNRV-CSQYAAYGEKKSRLSNL ..: : . :: .:. ..:.: . .:. . .. ..:. :.. .::. . . CCDS35 FAKRYKAAFTECCQAADKAACLLPKLDELRDEGKASSAKQRLKCASLQKFGERAFKAWAV 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE1 IKLAQKVPTADLEDVLPLAEDITNILSKCCESASEDCMAKELPEHTVKLCDNLSTKNSKF .:.:. : :.. .: :. :.:.. ..::.. .: : . . . .:.: .. .::. CCDS35 ARLSQRFPKAEFAEVSKLVTDLTKVHTECCHGDLLEC-ADDRADLAKYICENQDSISSKL 250 260 270 280 290 300 310 320 330 340 350 pF1KE1 EDCCQEKTAMDVFVCTYFMPAAQLP-ELPDV--ELPTNKDVC-DPGNTK--VMDKYTFEL ..:: :: .. : . ..: .::.. .. .:::: . ...: . . .: CCDS35 KECC-EKPLLEKSHCIAEVENDEMPADLPSLAADFVESKDVCKNYAEAKDVFLGMFLYEY 300 310 320 330 340 350 360 370 380 390 400 pF1KE1 SRRTHLPE---VFLSKVLEPTLKSLGECCDVEDSTTC----FNAKGPLLKKELSSFIDKG .:: : :. :.: .. . .: .:: . : : :. ::.. : ...: .. CCDS35 ARR-H-PDYSVVLLLRLAKTYETTLEKCCAAADPHECYAKVFDEFKPLVE-EPQNLIKQN 360 370 380 390 400 410 410 420 430 440 450 460 pF1KE1 QELCADYSENTFTEYKKKLAERLKAKLPEATPTELAKLVNKRSDFASNCCSINSPPLYCD :: . .: : .. : : :.:... :... . . .:.:: CCDS35 CELFEQLGEYKF---QNALLVRYTKKVPQVSTPTLVEVSRNLGKVGSKCCKHPEAKRMPC 420 430 440 450 460 470 470 pF1KE1 SEIDAELKNIL CCDS35 AEDYLSVVLNQLCVLHEKTPVSDRVTKCCTESLVNRRPCFSALEVDETYVPKEFNAETFT 480 490 500 510 520 530 474 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 10:02:11 2016 done: Sun Nov 6 10:02:12 2016 Total Scan time: 3.220 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]