FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5716, 607 aa 1>>>pF1KE5716 607 - 607 aa - 607 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.2896+/-0.000825; mu= 10.8627+/- 0.050 mean_var=139.5336+/-27.761, 0's: 0 Z-trim(112.3): 13 B-trim: 0 in 0/52 Lambda= 0.108576 statistics sampled from 13083 (13094) to 13083 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.746), E-opt: 0.2 (0.402), width: 16 Scan time: 2.850 The best scores are: opt bits E(32554) CCDS4680.1 GNL1 gene_id:2794|Hs108|chr6 ( 607) 4131 658.7 6.1e-189 CCDS33922.1 LSG1 gene_id:55341|Hs108|chr3 ( 658) 426 78.4 3.3e-14 >>CCDS4680.1 GNL1 gene_id:2794|Hs108|chr6 (607 aa) initn: 4131 init1: 4131 opt: 4131 Z-score: 3505.5 bits: 658.7 E(32554): 6.1e-189 Smith-Waterman score: 4131; 100.0% identity (100.0% similar) in 607 aa overlap (1-607:1-607) 10 20 30 40 50 60 pF1KE5 MPRKKPFSVKQKKKQLQDKRERKRGLQDGLRSSSNSRSGSRERREEQTDTSDGESVTHHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 MPRKKPFSVKQKKKQLQDKRERKRGLQDGLRSSSNSRSGSRERREEQTDTSDGESVTHHI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 RRLNQQPSQGLGPRGYDPNRYRLHFERDSREEVERRKRAAREQVLQPVSAELLELDIREV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 RRLNQQPSQGLGPRGYDPNRYRLHFERDSREEVERRKRAAREQVLQPVSAELLELDIREV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 YQPGSVLDFPRRPPWSYEMSKEQLMSQEERSFQDYLGKIHGAYSSEKLSYFEHNLETWRQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 YQPGSVLDFPRRPPWSYEMSKEQLMSQEERSFQDYLGKIHGAYSSEKLSYFEHNLETWRQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 LWRVLEMSDIVLLITDIRHPVVNFPPALYEYVTGELGLALVLVLNKVDLAPPALVVAWKH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LWRVLEMSDIVLLITDIRHPVVNFPPALYEYVTGELGLALVLVLNKVDLAPPALVVAWKH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 YFHQHYPQLHVVLFTSFPRDPRTPQDPSSVLKKSRRRGRGWTRALGPEQLLRACEAITVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 YFHQHYPQLHVVLFTSFPRDPRTPQDPSSVLKKSRRRGRGWTRALGPEQLLRACEAITVG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 KVDLSSWREKIARDVAGATWGNGSGEEEEEEDGPAVLVEQQTDSAMEPTGPTQERYKDGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 KVDLSSWREKIARDVAGATWGNGSGEEEEEEDGPAVLVEQQTDSAMEPTGPTQERYKDGV 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 VTIGCVGFPNVGKSSLINGLVGRKVVSVSRTPGHTRYFQTYFLTPSVKLCDCPGLIFPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 VTIGCVGFPNVGKSSLINGLVGRKVVSVSRTPGHTRYFQTYFLTPSVKLCDCPGLIFPSL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE5 LPRQLQVLAGIYPIAQIQEPYTAVGYLASRIPVQALLHLRHPEAEDPSAEHPWCAWDICE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LPRQLQVLAGIYPIAQIQEPYTAVGYLASRIPVQALLHLRHPEAEDPSAEHPWCAWDICE 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE5 AWAEKRGYKTAKAARNDVYRAANSLLRLAVDGRLSLCFHPPGYSEQKGTWESHPETTELV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 AWAEKRGYKTAKAARNDVYRAANSLLRLAVDGRLSLCFHPPGYSEQKGTWESHPETTELV 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE5 VLQGRVGPAGDEEEEEEEELSSSCEEEGEEDRDADEEGEGDEETPTSAPGSSLAGRNPYA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 VLQGRVGPAGDEEEEEEEELSSSCEEEGEEDRDADEEGEGDEETPTSAPGSSLAGRNPYA 550 560 570 580 590 600 pF1KE5 LLGEDEC ::::::: CCDS46 LLGEDEC >>CCDS33922.1 LSG1 gene_id:55341|Hs108|chr3 (658 aa) initn: 559 init1: 302 opt: 426 Z-score: 368.5 bits: 78.4 E(32554): 3.3e-14 Smith-Waterman score: 602; 31.0% identity (55.2% similar) in 449 aa overlap (127-522:111-552) 100 110 120 130 140 150 pF1KE5 KRAAREQVLQPVSAELLELDIREVYQPGSVLDFPRRPPWSYEMSKEQLMSQEERSFQDYL : .:::: :. . . :.: . :. .: .. CCDS33 KFVPAEARTGLLSFEESQRIKKLHEENKQFLCIPRRPNWNQNTTPEELKQAEKDNFLEWR 90 100 110 120 130 140 160 170 180 190 200 210 pF1KE5 GKIHGAYSSEKL--SYFEHNLETWRQLWRVLEMSDIVLLITDIRHPVVNFPPALYEYVTG .. .:: . ::.::. :::::::.: ::::. :.: :.:.. : :: CCDS33 RQLVRLEEEQKLILTPFERNLDFWRQLWRVIERSDIVVQIVDARNPLLFRCEDLECYVKE 150 160 170 180 190 200 220 230 240 250 260 270 pF1KE5 -ELGLALVLVLNKVDLAPPALVVAWKHYFHQHYPQLHVVLFTSFPRDPRTPQDPSSVLKK . . :...::.:: :: ::... ...:...... : . .: . CCDS33 MDANKENVILINKADLLTAEQRSAWAMYFEKE--DVKVIFWSALA--GAIPLNGDSEEEA 210 220 230 240 250 280 290 300 310 320 pF1KE5 SRRRGRGWTRALGPEQLLRA----CEAITVGKVDLSSWREKIARDVAGATWGNGSGEEEE .: .. : .: .. .: :. . : : :. . : . . . :::. CCDS33 NRDDRQSNTTKFGHSSFDQAEISHSESEHLPARDSPSLSENPTTDEDDSEYEDCPEEEED 260 270 280 290 300 310 330 340 350 pF1KE5 ------EEDGP---------------------------------AVLVEQQTDSAMEPTG ::::: . :: .: . CCDS33 DWQTCSEEDGPKEEDCSQDWKESSTADSEARSRKTPQKRQIHNFSHLVSKQELLELFKEL 320 330 340 350 360 370 360 370 380 390 400 410 pF1KE5 PTQERYKDGVVTIGCVGFPNVGKSSLINGLVGRKVVSVSRTPGHTRYFQTYFLTPSVKLC : .. ::: .:.: ::.::::::: :: ..: : :::: :::::..::: .. :.. :: CCDS33 HTGRKVKDGQLTVGLVGYPNVGKSSTINTIMGNKKVSVSATPGHTKHFQTLYVEPGLCLC 380 390 400 410 420 430 420 430 440 450 460 pF1KE5 DCPGLIFPSLLPRQLQVL-AGIYPIAQIQEPYTAVGYLASRIPVQAL-----LHLRHP-E :::::..::.. . .. .:: :: :... :. . . :: ..: ... : : CCDS33 DCPGLVMPSFVSTKAEMTCSGILPIDQMRDHVPPVSLVCQNIPRHVLEATYGINIITPRE 440 450 460 470 480 490 470 480 490 500 510 520 pF1KE5 AEDPSAEHPWCAWDICEAWAEKRGYKTAKAARNDVYRAANSLLRLAVDGRLSLCFHPPGY ::: ..: . .. :.. ::. ::.. . : :.: .:. :.:.: : ::: CCDS33 DEDP--HRPPTSEELLTAYGYMRGFMTAHG-QPDQPRSARYILKDYVSGKLLYCHPPPGR 500 510 520 530 540 550 530 540 550 560 570 580 pF1KE5 SEQKGTWESHPETTELVVLQGRVGPAGDEEEEEEEELSSSCEEEGEEDRDADEEGEGDEE CCDS33 DPVTFQHQHQRLLENKMNSDEIKMQLGRNKKAKQIENIVDKTFFHQENVRALTKGVQAVM 560 570 580 590 600 610 607 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 06:00:59 2016 done: Tue Nov 8 06:01:00 2016 Total Scan time: 2.850 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]