FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1426, 419 aa 1>>>pF1KE1426 419 - 419 aa - 419 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.1324+/-0.000918; mu= 14.5491+/- 0.055 mean_var=139.5706+/-26.388, 0's: 0 Z-trim(110.4): 71 B-trim: 39 in 1/52 Lambda= 0.108562 statistics sampled from 11526 (11596) to 11526 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.714), E-opt: 0.2 (0.356), width: 16 Scan time: 2.750 The best scores are: opt bits E(32554) CCDS43285.1 VEGFC gene_id:7424|Hs108|chr4 ( 419) 3029 486.1 2.7e-137 CCDS14166.1 VEGFD gene_id:2277|Hs108|chrX ( 354) 865 147.1 2.6e-35 >>CCDS43285.1 VEGFC gene_id:7424|Hs108|chr4 (419 aa) initn: 3029 init1: 3029 opt: 3029 Z-score: 2578.3 bits: 486.1 E(32554): 2.7e-137 Smith-Waterman score: 3029; 100.0% identity (100.0% similar) in 419 aa overlap (1-419:1-419) 10 20 30 40 50 60 pF1KE1 MHLLGFFSVACSLLAAALLPGPREAPAAAAAFESGLDLSDAEPDAGEATAYASKDLEEQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MHLLGFFSVACSLLAAALLPGPREAPAAAAAFESGLDLSDAEPDAGEATAYASKDLEEQL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RSVSSVDELMTVLYPEYWKMYKCQLRKGGWQHNREQANLNSRTEETIKFAAAHYNTEILK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 RSVSSVDELMTVLYPEYWKMYKCQLRKGGWQHNREQANLNSRTEETIKFAAAHYNTEILK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 SIDNEWRKTQCMPREVCIDVGKEFGVATNTFFKPPCVSVYRCGGCCNSEGLQCMNTSTSY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 SIDNEWRKTQCMPREVCIDVGKEFGVATNTFFKPPCVSVYRCGGCCNSEGLQCMNTSTSY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 LSKTLFEITVPLSQGPKPVTISFANHTSCRCMSKLDVYRQVHSIIRRSLPATLPQCQAAN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 LSKTLFEITVPLSQGPKPVTISFANHTSCRCMSKLDVYRQVHSIIRRSLPATLPQCQAAN 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 KTCPTNYMWNNHICRCLAQEDFMFSSDAGDDSTDGFHDICGPNKELDEETCQCVCRAGLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 KTCPTNYMWNNHICRCLAQEDFMFSSDAGDDSTDGFHDICGPNKELDEETCQCVCRAGLR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 PASCGPHKELDRNSCQCVCKNKLFPSQCGANREFDENTCQCVCKRTCPRNQPLNPGKCAC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 PASCGPHKELDRNSCQCVCKNKLFPSQCGANREFDENTCQCVCKRTCPRNQPLNPGKCAC 310 320 330 340 350 360 370 380 390 400 410 pF1KE1 ECTESPQKCLLKGKKFHHQTCSCYRRPCTNRQKACEPGFSYSEEVCRCVPSYWKRPQMS ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 ECTESPQKCLLKGKKFHHQTCSCYRRPCTNRQKACEPGFSYSEEVCRCVPSYWKRPQMS 370 380 390 400 410 >>CCDS14166.1 VEGFD gene_id:2277|Hs108|chrX (354 aa) initn: 826 init1: 508 opt: 865 Z-score: 747.4 bits: 147.1 E(32554): 2.6e-35 Smith-Waterman score: 892; 39.2% identity (63.3% similar) in 357 aa overlap (57-407:41-337) 30 40 50 60 70 80 pF1KE1 AAAAAFESGLDLSDAEPDAGEATAYASKDLEEQLRSVSSVDELMTVLYPEYWKMYKCQLR :.:.:..::..::. . . : ::...:.:: CCDS14 FMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSEDWKLWRCRLR 20 30 40 50 60 70 90 100 110 120 130 140 pF1KE1 KGGWQHNREQANLNSRT--EETIKFAAAHYNTEILKSIDNEWRKTQCMPREVCIDVGKEF .. ....::. ... .:::. :. : :: ::.::..::: :::.:..:..:. CCDS14 LKSF------TSMDSRSASHRSTRFAATFYDIETLKVIDEEWQRTQCSPRETCVEVASEL 80 90 100 110 120 150 160 170 180 190 200 pF1KE1 GVATNTFFKPPCVSVYRCGGCCNSEGLQCMNTSTSYLSKTLFEITVPLSQGPKPVTISFA : .::::::::::.:.::::::: :.: ::::::::.:: ::::.:::.. :. : .. : CCDS14 GKSTNTFFKPPCVNVFRCGGCCNEESLICMNTSTSYISKQLFEISVPLTSVPELVPVKVA 130 140 150 160 170 180 210 220 230 240 250 260 pF1KE1 NHTSCRCMSKLDVYRQVHSIIRRSLPATLPQ---CQAANKTCPTNYMWNNHICRCLAQED :::.:.:. . :. .::::::. .:. :. ..: :: ...:... :.:. ::. CCDS14 NHTGCKCLPT--APRHPYSIIRRSI--QIPEEDRCSHSKKLCPIDMLWDSNKCKCVLQEE 190 200 210 220 230 240 270 280 290 300 310 320 pF1KE1 FMFSSDAGDDSTDGFHDICGPNKELDEETCQCVCRAGLRPASCGPHKELDRNSCQCVCKN . : .: ...:.: :: :::: CCDS14 ---------NPLAGTED----HSHLQE------------PALCGPHMM------------ 250 260 330 340 350 360 370 380 pF1KE1 KLFPSQCGANREFDENTCQCVCKRTCPRNQPLNPGKCAC-ECTESPQKCLLKGKKFHHQT :::. :.:::: ::.. .: .:.: :: :: . : : : :: .: CCDS14 ------------FDEDRCECVCKTPCPKDLIQHPKNCSCFECKESLETCCQKHKLFHPDT 270 280 290 300 310 390 400 410 pF1KE1 CSCYRRPCTNRQKACEPGFSYSEEVCRCVPSYWKRPQMS ::: : : . . : : . . :: CCDS14 CSCEDR-CPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP 320 330 340 350 419 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 23:54:30 2016 done: Sun Nov 6 23:54:30 2016 Total Scan time: 2.750 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]