FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6471, 474 aa
  1>>>pF1KE6471 474 - 474 aa - 474 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.3813+/-0.000947; mu= 17.3317+/- 0.056
 mean_var=69.5515+/-14.432, 0's: 0 Z-trim(104.9): 61 B-trim: 0 in 0/49
 Lambda= 0.153787
 statistics sampled from 8091 (8137) to 8091 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.605), E-opt: 0.2 (0.25), width: 16
 Scan time: 2.540

The best scores are:                              opt   bits   E(32554)
CCDS2063.1 MFSD9 gene_id:84804|Hs108|chr2     ( 474) 3025  680.6   9.9e-196
CCDS7740.1 SLC22A18 gene_id:5002|Hs108|chr11  ( 424)  262   67.5   3.1e-11

>>CCDS2063.1 MFSD9 gene_id:84804|Hs108|chr2 (474 aa)
 initn: 3025 init1: 3025 opt: 3025 Z-score: 3627.5 bits: 680.6 E(32554): 9.9e-196
Smith-Waterman score: 3025; 100.0% identity (100.0% similar) in 474 aa overlap (1-474:1-474)

               10        20        30        40        50        60
pF1KE6 MELGGHWDMNSAPRLVSETAERKQEQKTGTEAEAADSGAVGARRFLLCLYLVGFLDLFGV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 MELGGHWDMNSAPRLVSETAERKQEQKTGTEAEAADSGAVGARRFLLCLYLVGFLDLFGV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 SMVVPLLSLHVKSLGASPTVAGIVGSSYGILQLFSSTLVGCWSDVVGRRSSLLACILLSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 SMVVPLLSLHVKSLGASPTVAGIVGSSYGILQLFSSTLVGCWSDVVGRRSSLLACILLSA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 LGYLLLGAATNVFLFVLARVPAGIFKHTLSISRALLSDVVPEKERPLVIGHFNTASGVGF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 LGYLLLGAATNVFLFVLARVPAGIFKHTLSISRALLSDVVPEKERPLVIGHFNTASGVGF
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 ILGPVVGGYLTELEDGFYLTAFICFLVFILNAGLVWFFPWREAKPGSTEKGLPLRKTHVL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 ILGPVVGGYLTELEDGFYLTAFICFLVFILNAGLVWFFPWREAKPGSTEKGLPLRKTHVL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 LGRSHDTVQEAATSRRARASKKTAQPWVEVVLALRNMKNLLFSEMWDIFLVRLLMAMAVM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 LGRSHDTVQEAATSRRARASKKTAQPWVEVVLALRNMKNLLFSEMWDIFLVRLLMAMAVM
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE6 LYYSNFVLALEERFGVRPKVTGYLISYSSMLGAVAGLALGPILRLYKHNSQALLLHSSIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 LYYSNFVLALEERFGVRPKVTGYLISYSSMLGAVAGLALGPILRLYKHNSQALLLHSSIL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE6 TCTLLLLYSLAPTMGAVVLSSTLLSFSTAIGRTCITDLQLTVGGAQASGTLIGVGQSVTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 TCTLLLLYSLAPTMGAVVLSSTLLSFSTAIGRTCITDLQLTVGGAQASGTLIGVGQSVTA
              370       380       390       400       410       420

              430       440       450       460       470
pF1KE6 VGRIIAPLLSGVAQEVSPCGPPSLGAVLALVAIFIMSLNKRHSSGDGNSKLKSE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 VGRIIAPLLSGVAQEVSPCGPPSLGAVLALVAIFIMSLNKRHSSGDGNSKLKSE
              430       440       450       460       470

>>CCDS7740.1 SLC22A18 gene_id:5002|Hs108|chr11 (424 aa)
 initn: 264 init1: 115 opt: 262 Z-score: 315.1 bits: 67.5 E(32554): 3.1e-11
Smith-Waterman score: 354; 25.9% identity (55.1% similar) in 432 aa overlap (39-456:19-408)

       10        20        30        40        50        60
pF1KE6 MNSAPRLVSETAERKQEQKTGTEAEAADSGAVGARRFLLCLYLVGFLDLFGVSM---VVP
                                     :.:    .:  :...  .:  . :   .::
CCDS77             MQGARAPRDQGRSPGRMSALGRSSVILLTYVLAATELTCLFMQFSIVP
                           10        20        30        40

          70        80        90       100       110       120
pF1KE6 LLSLHVKSLGASPTVAGIVGSSYGILQLFSSTLVGCWSDVVGRRSSLLACILLSALGYLL
        ::   ..:: .  . : . ...:.:::... . : ..:  : :..:   .: .   :::
CCDS77 YLS---RKLGLDSIAFGYLQTTFGVLQLLGGPVFGRFADQRGARAALTLSFLAALALYLL
       50           60        70        80        90       100

         130            140       150       160       170       180
pF1KE6 LGAATN-----VFLFVLARVPAGIFKHTLSISRALLSDVVPEKERPLVIGHFNTASGVGF
       :.::..     :.:.  .:.: : . :::  .. ...:.   .::: ..:...   :::
CCDS77 LAAASSPALPGVYLLFASRLP-GALMHTLPAAQMVITDLSAPEERPAALGRLGLCFGVGV
         110       120        130       140       150       160

              190       200       210       220       230       240
pF1KE6 ILGPVVGGYLTELEDGFYLTAFICFLVFILNAGLVWFFPWREAKPGSTEKGLPLRKTHVL
       ::: ..:: :.    :.   :..  :. .:.: : .        :.::.
CCDS77 ILGSLLGGTLVSAY-GIQCPAILAALATLLGAVLSF-----TCIPASTK-----------
          170        180       190            200

              250       260       270       280        290
pF1KE6 LGRSHDTVQEAATSRRARASKKTAQPWVEVVLALRNMKNLL-FSEMWDIFLVRLLMAMAV
        : . :.  .:      :::          :. :. . .:: . ..  ::::..     .
CCDS77 -GAKTDA--QAPLPGGPRAS----------VFDLKAIASLLRLPDVPRIFLVKVASNCPT
        210         220                 230       240       250

     300       310       320       330        340       350
pF1KE6 MLYYSNFVLALEERFGVRPKVTGYLISYSSMLGAVA-GLALGPILRLYKHNSQALLLHSS
        :..  : .   . : ..   .:::.:. ..:  :. ::..:   .: .: :. .::..:
CCDS77 GLFMVMFSIISMDFFQLEAAQAGYLMSFFGLLQMVTQGLVIG---QLSSHFSEEVLLRAS
          260       270       280       290          300       310

      360       370       380           390       400       410
pF1KE6 ILTCTLLLLYSLAPTMGAVVLSSTLLS----FSTAIGRTCITDLQLTVGGAQASGTLIGV
       .:   .... .:: .  . :.   ::     ::     .   .. . . ... .::..:.
CCDS77 VL---VFIVVGLAMAWMSSVFHFCLLVPGLVFSLCTLNVVTDSMLIKAVSTSDTGTMLGL
                320       330       340       350       360

          420       430       440       450       460       470
pF1KE6 GQSVTAVGRIIAPLLSGVAQEVSPCGPPSLGAVLALVAIFIMSLNKRHSSGDGNSKLKSE
         ::  . : ..: ..:.  .    : : .: : . .  ...
CCDS77 CASVQPLLRTLGPTVGGLLYR--SFGVPVFGHVQVAINTLVLLVLWRKPMPQRKDKVR
      370       380         390       400       410       420

474 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 13:37:19 2016 done: Tue Nov 8 13:37:19 2016
 Total Scan time: 2.540 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]