FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1836, 354 aa
1>>>pF1KE1836 354 - 354 aa - 354 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.0040+/-0.000988; mu= 14.6234+/- 0.059
mean_var=129.4029+/-25.360, 0's: 0 Z-trim(109.4): 46 B-trim: 50 in 1/50
Lambda= 0.112746
statistics sampled from 10844 (10884) to 10844 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.705), E-opt: 0.2 (0.334), width: 16
Scan time: 2.740
The best scores are: opt bits E(32554)
CCDS14166.1 VEGFD gene_id:2277|Hs108|chrX ( 354) 2546 425.4 3.7e-119
CCDS43285.1 VEGFC gene_id:7424|Hs108|chr4 ( 419) 865 152.0 8.4e-37
>>CCDS14166.1 VEGFD gene_id:2277|Hs108|chrX (354 aa)
initn: 2546 init1: 2546 opt: 2546 Z-score: 2252.8 bits: 425.4 E(32554): 3.7e-119
Smith-Waterman score: 2546; 100.0% identity (100.0% similar) in 354 aa overlap (1-354:1-354)
10 20 30 40 50 60
pF1KE1 MYREWVVVNVFMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MYREWVVVNVFMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 DWKLWRCRLRLKSFTSMDSRSASHRSTRFAATFYDIETLKVIDEEWQRTQCSPRETCVEV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 DWKLWRCRLRLKSFTSMDSRSASHRSTRFAATFYDIETLKVIDEEWQRTQCSPRETCVEV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 ASELGKSTNTFFKPPCVNVFRCGGCCNEESLICMNTSTSYISKQLFEISVPLTSVPELVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 ASELGKSTNTFFKPPCVNVFRCGGCCNEESLICMNTSTSYISKQLFEISVPLTSVPELVP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 VKVANHTGCKCLPTAPRHPYSIIRRSIQIPEEDRCSHSKKLCPIDMLWDSNKCKCVLQEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 VKVANHTGCKCLPTAPRHPYSIIRRSIQIPEEDRCSHSKKLCPIDMLWDSNKCKCVLQEE
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 NPLAGTEDHSHLQEPALCGPHMMFDEDRCECVCKTPCPKDLIQHPKNCSCFECKESLETC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 NPLAGTEDHSHLQEPALCGPHMMFDEDRCECVCKTPCPKDLIQHPKNCSCFECKESLETC
250 260 270 280 290 300
310 320 330 340 350
pF1KE1 CQKHKLFHPDTCSCEDRCPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP
::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 CQKHKLFHPDTCSCEDRCPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP
310 320 330 340 350
>>CCDS43285.1 VEGFC gene_id:7424|Hs108|chr4 (419 aa)
initn: 826 init1: 508 opt: 865 Z-score: 774.2 bits: 152.0 E(32554): 8.4e-37
Smith-Waterman score: 891; 39.4% identity (62.8% similar) in 360 aa overlap (41-341:57-404)
20 30 40 50 60 70
pF1KE1 FMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSEDWKLWRCRLR
:.:.:..::..::. . . : ::...:.::
CCDS43 AAAAAFESGLDLSDAEPDAGEATAYASKDLEEQLRSVSSVDELMTVLYPEYWKMYKCQLR
30 40 50 60 70 80
80 90 100 110 120
pF1KE1 LKSF------TSMDSRSASHRSTRFAATFYDIETLKVIDEEWQRTQCSPRETCVEVASEL
.. ....::. ... .:::. :. : :: ::.::..::: :::.:..:..:.
CCDS43 KGGWQHNREQANLNSRT--EETIKFAAAHYNTEILKSIDNEWRKTQCMPREVCIDVGKEF
90 100 110 120 130 140
130 140 150 160 170 180
pF1KE1 GKSTNTFFKPPCVNVFRCGGCCNEESLICMNTSTSYISKQLFEISVPLTSVPELVPVKVA
: .::::::::::.:.::::::: :.: ::::::::.:: ::::.:::.. :. : .. :
CCDS43 GVATNTFFKPPCVSVYRCGGCCNSEGLQCMNTSTSYLSKTLFEITVPLSQGPKPVTISFA
150 160 170 180 190 200
190 200 210 220 230 240
pF1KE1 NHTGCKCLPTAP--RHPYSIIRRSIQ--IPEEDRCSHSKKLCPIDMLWDSNKCKCVLQEE
:::.:.:. :. .::::::. .:. :. ..: :: ...:... :.:. ::.
CCDS43 NHTSCRCMSKLDVYRQVHSIIRRSLPATLPQ---CQAANKTCPTNYMWNNHICRCLAQED
210 220 230 240 250 260
250 260
pF1KE1 ---NPLAGTE--DHSH--------LQE------------PALCGPHMM------------
. :: . : : :.: :: ::::
CCDS43 FMFSSDAGDDSTDGFHDICGPNKELDEETCQCVCRAGLRPASCGPHKELDRNSCQCVCKN
270 280 290 300 310 320
270 280 290 300 310
pF1KE1 ------------FDEDRCECVCKTPCPKDLIQHPKNCSCFECKESLETCCQKHKLFHPDT
:::. :.:::: ::.. .: .:.: :: :: . : : : :: .:
CCDS43 KLFPSQCGANREFDENTCQCVCKRTCPRNQPLNPGKCAC-ECTESPQKCLLKGKKFHHQT
330 340 350 360 370 380
320 330 340 350
pF1KE1 CSCEDRCPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP
::: . :::.. . :: . .:
CCDS43 CSC------YRRPCTNRQKACEPGFSYSEEVCRCVPSYWKRPQMS
390 400 410
354 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 12:52:05 2016 done: Sun Nov 6 12:52:05 2016
Total Scan time: 2.740 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]