FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5659, 308 aa
1>>>pF1KB5659 308 - 308 aa - 308 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1800+/-0.000836; mu= 17.3670+/- 0.051
mean_var=89.7140+/-17.057, 0's: 0 Z-trim(109.4): 58 B-trim: 22 in 1/50
Lambda= 0.135408
statistics sampled from 10801 (10859) to 10801 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.706), E-opt: 0.2 (0.334), width: 16
Scan time: 2.610
The best scores are:                                opt  bits E(32554)
CCDS2998.1 FSTL1 gene_id:11167|Hs108|chr3   ( 308) 2146 429.0 2.3e-120
CCDS47157.1 FSTL5 gene_id:56884|Hs108|chr4  ( 837)  304  69.6 9.5e-12
CCDS47158.1 FSTL5 gene_id:56884|Hs108|chr4  ( 846)  304  69.6 9.6e-12
CCDS3802.1 FSTL5 gene_id:56884|Hs108|chr4   ( 847)  304  69.6 9.6e-12
CCDS34238.1 FSTL4 gene_id:23105|Hs108|chr5  ( 842)  299  68.6 1.9e-11
>>CCDS2998.1 FSTL1 gene_id:11167|Hs108|chr3 (308 aa)
initn: 2146 init1: 2146 opt: 2146 Z-score: 2274.6 bits: 429.0 E(32554): 2.3e-120
Smith-Waterman score: 2146; 100.0% identity (100.0% similar) in 308 aa overlap (1-308:1-308)
10 20 30 40 50 60
pF1KB5 MWKRWLALALALVAVAWVRAEEELRSKSKICANVFCGAGRECAVTEKGEPTCLCIEQCKP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 MWKRWLALALALVAVAWVRAEEELRSKSKICANVFCGAGRECAVTEKGEPTCLCIEQCKP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 HKRPVCGSNGKTYLNHCELHRDACLTGSKIQVDYDGHCKEKKSVSPSASPVVCYQSNRDE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 HKRPVCGSNGKTYLNHCELHRDACLTGSKIQVDYDGHCKEKKSVSPSASPVVCYQSNRDE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 LRRRIIQWLEAEIIPDGWFSKGSNYSEILDKYFKNFDNGDSRLDSSEFLKFVEQNETAIN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 LRRRIIQWLEAEIIPDGWFSKGSNYSEILDKYFKNFDNGDSRLDSSEFLKFVEQNETAIN
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 ITTYPDQENNKLLRGLCVDALIELSDENADWKLSFQEFLKCLNPSFNPPEKKCALEDETY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 ITTYPDQENNKLLRGLCVDALIELSDENADWKLSFQEFLKCLNPSFNPPEKKCALEDETY
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB5 ADGAETEVDCNRCVCACGNWVCTAMTCDGKNQKGAQTQTEEEMTRYVQELQKHQETAEKT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 ADGAETEVDCNRCVCACGNWVCTAMTCDGKNQKGAQTQTEEEMTRYVQELQKHQETAEKT
250 260 270 280 290 300
pF1KB5 KRVSTKEI
::::::::
CCDS29 KRVSTKEI
>>CCDS47157.1 FSTL5 gene_id:56884|Hs108|chr4 (837 aa)
initn: 280 init1: 216 opt: 304 Z-score: 324.4 bits: 69.6 E(32554): 9.5e-12
Smith-Waterman score: 322; 30.5% identity (60.0% similar) in 220 aa overlap (31-244:64-265)
10 20 30 40 50
pF1KB5 MWKRWLALALALVAVAWVRAEEELRSKSKICANVFCGAGRECAVT-EKGEPTCLCIEQCK
: : .:: ::.:... : :. : :.. ::
CCDS47 QPLMRLRHKEKNQESSRVKGFMIQDGPFGSCENKYCGLGRHCVTSRETGQAECACMDLCK
40 50 60 70 80 90
60 70 80 90 100 110
pF1KB5 PHKRPVCGSNGKTYLNHCELHRDACLTGSKIQVDYDGHC--KEKKSVSPSASPVVCYQSN
: .:::::.:. : ::::.:: ::: .:: . .. : : : . : . . .
CCDS47 RHYKPVCGSDGEFYENHCEVHRAACLKKQKITIVHNEDCFFKGDKCKTTEYSKMKNMLLD
100 110 120 130 140 150
120 130 140 150 160 170
pF1KB5 RDELRRRIIQWLEAEIIPDGWFSKGSNYSEILDKYFKNFD-NGDSRLDSSEFLKFVEQNE
.. .. :.: : : :.: . : . ..:..:: :: .... .: .:. . ..:.:
CCDS47 LQN-QKYIMQ--ENEN-PNG--DDISRKKLLVDQMFKYFDADSNGLVDINELTQVIKQEE
160 170 180 190 200
180 190 200 210 220 230
pF1KB5 TAINITTYPDQENNKLLRGLCVDALIELSDENADWKLSFQEFLKCLNP-SFNPPE-KKCA
. : : . .:.. .: ::: .:...:: . .. ... :: .: .
CCDS47 LG------------KDLFDCTLYVLLKYDDFNADKHLALEEFYRAFQVIQLSLPEDQKLS
210 220 230 240 250
240 250 260 270 280 290
pF1KB5 LEDETYADGAETEVDCNRCVCACGNWVCTAMTCDGKNQKGAQTQTEEEMTRYVQELQKHQ
. : ...:
CCDS47 ITAATVGQSAVLSCAIQGTLRPPIIWKRNNIILNNLDLEDINDFGDDGSLYITKVTTTHV
260 270 280 290 300 310
>>CCDS47158.1 FSTL5 gene_id:56884|Hs108|chr4 (846 aa)
initn: 280 init1: 216 opt: 304 Z-score: 324.3 bits: 69.6 E(32554): 9.6e-12
Smith-Waterman score: 322; 30.5% identity (60.0% similar) in 220 aa overlap (31-244:64-265)
10 20 30 40 50
pF1KB5 MWKRWLALALALVAVAWVRAEEELRSKSKICANVFCGAGRECAVT-EKGEPTCLCIEQCK
: : .:: ::.:... : :. : :.. ::
CCDS47 QPLMRLRHKEKNQESSRVKGFMIQDGPFGSCENKYCGLGRHCVTSRETGQAECACMDLCK
40 50 60 70 80 90
60 70 80 90 100 110
pF1KB5 PHKRPVCGSNGKTYLNHCELHRDACLTGSKIQVDYDGHC--KEKKSVSPSASPVVCYQSN
: .:::::.:. : ::::.:: ::: .:: . .. : : : . : . . .
CCDS47 RHYKPVCGSDGEFYENHCEVHRAACLKKQKITIVHNEDCFFKGDKCKTTEYSKMKNMLLD
100 110 120 130 140 150
120 130 140 150 160 170
pF1KB5 RDELRRRIIQWLEAEIIPDGWFSKGSNYSEILDKYFKNFD-NGDSRLDSSEFLKFVEQNE
.. .. :.: : : :.: . : . ..:..:: :: .... .: .:. . ..:.:
CCDS47 LQN-QKYIMQ--ENEN-PNG--DDISRKKLLVDQMFKYFDADSNGLVDINELTQVIKQEE
160 170 180 190 200
180 190 200 210 220 230
pF1KB5 TAINITTYPDQENNKLLRGLCVDALIELSDENADWKLSFQEFLKCLNP-SFNPPE-KKCA
. : : . .:.. .: ::: .:...:: . .. ... :: .: .
CCDS47 LG------------KDLFDCTLYVLLKYDDFNADKHLALEEFYRAFQVIQLSLPEDQKLS
210 220 230 240 250
240 250 260 270 280 290
pF1KB5 LEDETYADGAETEVDCNRCVCACGNWVCTAMTCDGKNQKGAQTQTEEEMTRYVQELQKHQ
. : ...:
CCDS47 ITAATVGQSAVLSCAIQGTLRPPIIWKRNNIILNNLDLEDINDFGDDGSLYITKVTTTHV
260 270 280 290 300 310
>>CCDS3802.1 FSTL5 gene_id:56884|Hs108|chr4 (847 aa)
initn: 280 init1: 216 opt: 304 Z-score: 324.3 bits: 69.6 E(32554): 9.6e-12
Smith-Waterman score: 322; 30.5% identity (60.0% similar) in 220 aa overlap (31-244:65-266)
10 20 30 40 50
pF1KB5 MWKRWLALALALVAVAWVRAEEELRSKSKICANVFCGAGRECAVT-EKGEPTCLCIEQCK
: : .:: ::.:... : :. : :.. ::
CCDS38 PLMRLRHKQEKNQESSRVKGFMIQDGPFGSCENKYCGLGRHCVTSRETGQAECACMDLCK
40 50 60 70 80 90
60 70 80 90 100 110
pF1KB5 PHKRPVCGSNGKTYLNHCELHRDACLTGSKIQVDYDGHC--KEKKSVSPSASPVVCYQSN
: .:::::.:. : ::::.:: ::: .:: . .. : : : . : . . .
CCDS38 RHYKPVCGSDGEFYENHCEVHRAACLKKQKITIVHNEDCFFKGDKCKTTEYSKMKNMLLD
100 110 120 130 140 150
120 130 140 150 160 170
pF1KB5 RDELRRRIIQWLEAEIIPDGWFSKGSNYSEILDKYFKNFD-NGDSRLDSSEFLKFVEQNE
.. .. :.: : : :.: . : . ..:..:: :: .... .: .:. . ..:.:
CCDS38 LQN-QKYIMQ--ENEN-PNG--DDISRKKLLVDQMFKYFDADSNGLVDINELTQVIKQEE
160 170 180 190 200
180 190 200 210 220 230
pF1KB5 TAINITTYPDQENNKLLRGLCVDALIELSDENADWKLSFQEFLKCLNP-SFNPPE-KKCA
. : : . .:.. .: ::: .:...:: . .. ... :: .: .
CCDS38 LG------------KDLFDCTLYVLLKYDDFNADKHLALEEFYRAFQVIQLSLPEDQKLS
210 220 230 240 250
240 250 260 270 280 290
pF1KB5 LEDETYADGAETEVDCNRCVCACGNWVCTAMTCDGKNQKGAQTQTEEEMTRYVQELQKHQ
. : ...:
CCDS38 ITAATVGQSAVLSCAIQGTLRPPIIWKRNNIILNNLDLEDINDFGDDGSLYITKVTTTHV
260 270 280 290 300 310
>>CCDS34238.1 FSTL4 gene_id:23105|Hs108|chr5 (842 aa)
initn: 232 init1: 232 opt: 299 Z-score: 319.1 bits: 68.6 E(32554): 1.9e-11
Smith-Waterman score: 328; 28.8% identity (61.5% similar) in 226 aa overlap (18-231:49-254)
10 20 30 40
pF1KB5 MWKRWLALALALVAVAWVRAEEELRSKSKI---CANVFCGAGRECAV
: .: : :.... :.. ::. : .:..
CCDS34 AALGWMDPGTSRGPDVGVGESQAEEPRSFEVTRREGLSSHNELLASCGKKFCSRGSRCVL
20 30 40 50 60 70
50 60 70 80 90 100
pF1KB5 TEK-GEPTCLCIEQCKPHKRPVCGSNGKTYLNHCELHRDACLTGSKIQVDYDGHCKEKKS
..: ::: : :.: :.: :::::.:. : :::.::: ::: :..: : .. : : .
CCDS34 SRKTGEPECQCLEACRPSYVPVCGSDGRFYENHCKLHRAACLLGKRITVIHSKDCFLKGD
80 90 100 110 120 130
110 120 130 140 150
pF1KB5 VSPSASPVVCYQSNRDELRRRIIQWLEAEIIP----DGWFSKGSNYSEILDKYFKNFD-N
. : ... .:. .. :.... : :. . .:. .... :...: .
CCDS34 T--------CTMAGYARLKN-VLLALQTRLQPLQEGDSRQDPASQKRLLVESLFRDLDAD
140 150 160 170 180
160 170 180 190 200 210
pF1KB5 GDSRLDSSEFLKFVEQNETAINITTYPDQENNKLLRGLCVDALIELSDENADWKLSFQEF
:...:.:::. . : .. :. .. : : :....: :.: .:...::
CCDS34 GNGHLSSSELAQHVLKK-----------QDLDEDLLGCSPGDLLRFDDYNSDSSLTLREF
190 200 210 220 230
220 230 240 250 260 270
pF1KB5 ---LKCLNPSFNPPEKKCALEDETYADGAETEVDCNRCVCACGNWVCTAMTCDGKNQKGA
.. .. :. : ..
CCDS34 YMAFQVVQLSLAPEDRVSVTTVTVGLSTVLTCAVHGDLRPPIIWKRNGLTLNFLDLEDIN
240 250 260 270 280 290
308 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 13:11:01 2016 done: Sat Nov 5 13:11:01 2016
Total Scan time: 2.610 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]