FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1836, 354 aa 1>>>pF1KE1836 354 - 354 aa - 354 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0040+/-0.000988; mu= 14.6234+/- 0.059 mean_var=129.4029+/-25.360, 0's: 0 Z-trim(109.4): 46 B-trim: 50 in 1/50 Lambda= 0.112746 statistics sampled from 10844 (10884) to 10844 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.705), E-opt: 0.2 (0.334), width: 16 Scan time: 2.740 The best scores are: opt bits E(32554) CCDS14166.1 VEGFD gene_id:2277|Hs108|chrX ( 354) 2546 425.4 3.7e-119 CCDS43285.1 VEGFC gene_id:7424|Hs108|chr4 ( 419) 865 152.0 8.4e-37 >>CCDS14166.1 VEGFD gene_id:2277|Hs108|chrX (354 aa) initn: 2546 init1: 2546 opt: 2546 Z-score: 2252.8 bits: 425.4 E(32554): 3.7e-119 Smith-Waterman score: 2546; 100.0% identity (100.0% similar) in 354 aa overlap (1-354:1-354) 10 20 30 40 50 60 pF1KE1 MYREWVVVNVFMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MYREWVVVNVFMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 DWKLWRCRLRLKSFTSMDSRSASHRSTRFAATFYDIETLKVIDEEWQRTQCSPRETCVEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 DWKLWRCRLRLKSFTSMDSRSASHRSTRFAATFYDIETLKVIDEEWQRTQCSPRETCVEV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 ASELGKSTNTFFKPPCVNVFRCGGCCNEESLICMNTSTSYISKQLFEISVPLTSVPELVP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 ASELGKSTNTFFKPPCVNVFRCGGCCNEESLICMNTSTSYISKQLFEISVPLTSVPELVP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 VKVANHTGCKCLPTAPRHPYSIIRRSIQIPEEDRCSHSKKLCPIDMLWDSNKCKCVLQEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 VKVANHTGCKCLPTAPRHPYSIIRRSIQIPEEDRCSHSKKLCPIDMLWDSNKCKCVLQEE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 NPLAGTEDHSHLQEPALCGPHMMFDEDRCECVCKTPCPKDLIQHPKNCSCFECKESLETC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 NPLAGTEDHSHLQEPALCGPHMMFDEDRCECVCKTPCPKDLIQHPKNCSCFECKESLETC 250 260 270 280 290 300 310 320 330 340 350 pF1KE1 CQKHKLFHPDTCSCEDRCPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 CQKHKLFHPDTCSCEDRCPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP 310 320 330 340 350 >>CCDS43285.1 VEGFC gene_id:7424|Hs108|chr4 (419 aa) initn: 826 init1: 508 opt: 865 Z-score: 774.2 bits: 152.0 E(32554): 8.4e-37 Smith-Waterman score: 891; 39.4% identity (62.8% similar) in 360 aa overlap (41-341:57-404) 20 30 40 50 60 70 pF1KE1 FMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSEDWKLWRCRLR :.:.:..::..::. . . : ::...:.:: CCDS43 AAAAAFESGLDLSDAEPDAGEATAYASKDLEEQLRSVSSVDELMTVLYPEYWKMYKCQLR 30 40 50 60 70 80 80 90 100 110 120 pF1KE1 LKSF------TSMDSRSASHRSTRFAATFYDIETLKVIDEEWQRTQCSPRETCVEVASEL .. ....::. ... .:::. :. : :: ::.::..::: :::.:..:..:. CCDS43 KGGWQHNREQANLNSRT--EETIKFAAAHYNTEILKSIDNEWRKTQCMPREVCIDVGKEF 90 100 110 120 130 140 130 140 150 160 170 180 pF1KE1 GKSTNTFFKPPCVNVFRCGGCCNEESLICMNTSTSYISKQLFEISVPLTSVPELVPVKVA : .::::::::::.:.::::::: :.: ::::::::.:: ::::.:::.. :. : .. : CCDS43 GVATNTFFKPPCVSVYRCGGCCNSEGLQCMNTSTSYLSKTLFEITVPLSQGPKPVTISFA 150 160 170 180 190 200 190 200 210 220 230 240 pF1KE1 NHTGCKCLPTAP--RHPYSIIRRSIQ--IPEEDRCSHSKKLCPIDMLWDSNKCKCVLQEE :::.:.:. :. .::::::. .:. :. ..: :: ...:... :.:. ::. CCDS43 NHTSCRCMSKLDVYRQVHSIIRRSLPATLPQ---CQAANKTCPTNYMWNNHICRCLAQED 210 220 230 240 250 260 250 260 pF1KE1 ---NPLAGTE--DHSH--------LQE------------PALCGPHMM------------ . :: . : : :.: :: :::: CCDS43 FMFSSDAGDDSTDGFHDICGPNKELDEETCQCVCRAGLRPASCGPHKELDRNSCQCVCKN 270 280 290 300 310 320 270 280 290 300 310 pF1KE1 ------------FDEDRCECVCKTPCPKDLIQHPKNCSCFECKESLETCCQKHKLFHPDT :::. :.:::: ::.. .: .:.: :: :: . : : : :: .: CCDS43 KLFPSQCGANREFDENTCQCVCKRTCPRNQPLNPGKCAC-ECTESPQKCLLKGKKFHHQT 330 340 350 360 370 380 320 330 340 350 pF1KE1 CSCEDRCPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP ::: . :::.. . :: . .: CCDS43 CSC------YRRPCTNRQKACEPGFSYSEEVCRCVPSYWKRPQMS 390 400 410 354 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 12:52:05 2016 done: Sun Nov 6 12:52:05 2016 Total Scan time: 2.740 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]