FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4326, 328 aa
1>>>pF1KE4326 328 - 328 aa - 328 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.2746+/-0.00063; mu= 14.9078+/- 0.039
mean_var=148.0931+/-29.679, 0's: 0 Z-trim(117.5): 25 B-trim: 161 in 2/54
Lambda= 0.105392
statistics sampled from 18184 (18210) to 18184 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.842), E-opt: 0.2 (0.559), width: 16
Scan time: 2.860
The best scores are: opt bits E(32554)
CCDS42815.1 IGFBP2 gene_id:3485|Hs108|chr2 ( 325) 2331 365.0 4.6e-101
CCDS82570.1 IGFBP2 gene_id:3485|Hs108|chr2 ( 181) 1261 202.0 2.9e-52
CCDS5505.1 IGFBP3 gene_id:3486|Hs108|chr7 ( 291) 503 87.0 2e-17
CCDS34632.1 IGFBP3 gene_id:3486|Hs108|chr7 ( 297) 456 79.9 2.9e-15
>>CCDS42815.1 IGFBP2 gene_id:3485|Hs108|chr2 (325 aa)
initn: 2227 init1: 2227 opt: 2331 Z-score: 1927.9 bits: 365.0 E(32554): 4.6e-101
Smith-Waterman score: 2331; 99.1% identity (99.1% similar) in 328 aa overlap (1-328:1-325)
10 20 30 40 50 60
pF1KE4 MLPRVGCPALPLPPPPLLPLLPLLLLLLGASGGGGGARAEVLFRCPPCTPERLAACGPPP
::::::::::::::::::::: ::::::::::::::::::::::::::::::::::::
CCDS42 MLPRVGCPALPLPPPPLLPLL---LLLLGASGGGGGARAEVLFRCPPCTPERLAACGPPP
10 20 30 40 50
70 80 90 100 110 120
pF1KE4 VAPPAAVAAVAGGARMPCAELVREPGCGCCSVCARLEGEACGVYTPRCGQGLRCYPHPGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 VAPPAAVAAVAGGARMPCAELVREPGCGCCSVCARLEGEACGVYTPRCGQGLRCYPHPGS
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE4 ELPLQALVMGEGTCEKRRDAEYGASPEQVADNGDDHSEGGLVENHVDSTMNMLGGGGSAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 ELPLQALVMGEGTCEKRRDAEYGASPEQVADNGDDHSEGGLVENHVDSTMNMLGGGGSAG
120 130 140 150 160 170
190 200 210 220 230 240
pF1KE4 RKPLKSGMKELAVFREKVTEQHRQMGKGGKHHLGLEEPKKLRPPPARTPCQQELDQVLER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 RKPLKSGMKELAVFREKVTEQHRQMGKGGKHHLGLEEPKKLRPPPARTPCQQELDQVLER
180 190 200 210 220 230
250 260 270 280 290 300
pF1KE4 ISTMRLPDERGPLEHLYSLHIPNCDKHGLYNLKQCKMSLNGQRGECWCVNPNTGKLIQGA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 ISTMRLPDERGPLEHLYSLHIPNCDKHGLYNLKQCKMSLNGQRGECWCVNPNTGKLIQGA
240 250 260 270 280 290
310 320
pF1KE4 PTIRGDPECHLFYNEQQEARGVHTQRMQ
::::::::::::::::::::::::::::
CCDS42 PTIRGDPECHLFYNEQQEARGVHTQRMQ
300 310 320
>>CCDS82570.1 IGFBP2 gene_id:3485|Hs108|chr2 (181 aa)
initn: 1261 init1: 1261 opt: 1261 Z-score: 1051.7 bits: 202.0 E(32554): 2.9e-52
Smith-Waterman score: 1261; 99.4% identity (100.0% similar) in 178 aa overlap (151-328:4-181)
130 140 150 160 170 180
pF1KE4 ELPLQALVMGEGTCEKRRDAEYGASPEQVADNGDDHSEGGLVENHVDSTMNMLGGGGSAG
.:::::::::::::::::::::::::::::
CCDS82 MPCNNGDDHSEGGLVENHVDSTMNMLGGGGSAG
10 20 30
190 200 210 220 230 240
pF1KE4 RKPLKSGMKELAVFREKVTEQHRQMGKGGKHHLGLEEPKKLRPPPARTPCQQELDQVLER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 RKPLKSGMKELAVFREKVTEQHRQMGKGGKHHLGLEEPKKLRPPPARTPCQQELDQVLER
40 50 60 70 80 90
250 260 270 280 290 300
pF1KE4 ISTMRLPDERGPLEHLYSLHIPNCDKHGLYNLKQCKMSLNGQRGECWCVNPNTGKLIQGA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 ISTMRLPDERGPLEHLYSLHIPNCDKHGLYNLKQCKMSLNGQRGECWCVNPNTGKLIQGA
100 110 120 130 140 150
310 320
pF1KE4 PTIRGDPECHLFYNEQQEARGVHTQRMQ
::::::::::::::::::::::::::::
CCDS82 PTIRGDPECHLFYNEQQEARGVHTQRMQ
160 170 180
>>CCDS5505.1 IGFBP3 gene_id:3486|Hs108|chr7 (291 aa)
initn: 542 init1: 303 opt: 503 Z-score: 426.4 bits: 87.0 E(32554): 2e-17
Smith-Waterman score: 539; 33.2% identity (57.6% similar) in 304 aa overlap (21-312:13-286)
10 20 30 40 50
pF1KE4 MLPRVGCPALPLPPPPLLPLLPLLLLLLG---ASGGGGGARAEVLFRCPPCTPERLAACG
: ::.:: : : .:...: . :: :: . :: :.
CCDS55 MQRARPTLWAAALTLLVLLRGPPVARAGASSAGLGPVVRCEPCDARALAQCA
10 20 30 40 50
60 70 80 90 100 110
pF1KE4 PPPVAPPAAVAAVAGGARMPCAELVREPGCGCCSVCARLEGEACGVYTPRCGQGLRCYPH
:::.. ::::::::::::: .:: ::. ::.:: :::.:::: :
CCDS55 PPPAV---------------CAELVREPGCGCCLTCALSEGQPCGIYTERCGSGLRCQPS
60 70 80 90
120 130 140 150 160
pF1KE4 PGSELPLQALVMGEGTC------EKRRDAEYGASPE--QVADNGDDHSEGGLVENHVDST
: :::::. :.: : . : : : ..... .:.: :.. :.::
CCDS55 PDEARPLQALLDGRGLCVNASAVSRLRAYLLPAPPAPGNASESEEDRSAGSVESPSVSST
100 110 120 130 140 150
170 180 190 200 210 220
pF1KE4 MNMLGGGGSAGRKPLKSGMKELAVFREKVTEQHRQMGKGGKHHLGLEE-PKKLRPPPART
. .. .::.: : . . . .. ...: .. .. .. .
CCDS55 HRV----SDPKFHPLHS--KIIIIKKGHAKDSQRYKVDYESQSTDTQNFSSESKRETEYG
160 170 180 190 200 210
230 240 250 260 270 280
pF1KE4 PCQQELDQVLERISTMRLPDERGPLEHLYSLHIPNCDKHGLYNLKQCKMSLNGQRGECWC
::..:....:.... . . . :: .:::::::.:.:. :::. : . .:: :::
CCDS55 PCRREMEDTLNHLKFLNVLSPRG-------VHIPNCDKKGFYKKKQCRPSKGRKRGFCWC
220 230 240 250 260
290 300 310 320
pF1KE4 VNPNTGKLIQGAPTIRGDPECHLFYNEQQEARGVHTQRMQ
:. . :. . : : .: . : .
CCDS55 VD-KYGQPLPGYTT-KGKEDVHCYSMQSK
270 280 290
>>CCDS34632.1 IGFBP3 gene_id:3486|Hs108|chr7 (297 aa)
initn: 542 init1: 303 opt: 456 Z-score: 387.6 bits: 79.9 E(32554): 2.9e-15
Smith-Waterman score: 527; 32.6% identity (56.5% similar) in 310 aa overlap (21-312:13-292)
10 20 30 40 50
pF1KE4 MLPRVGCPALPLPPPPLLPLLPLLLLLLG---ASGGGGGARAEVLFRCPPCTPERLAACG
: ::.:: : : .:...: . :: :: . :: :.
CCDS34 MQRARPTLWAAALTLLVLLRGPPVARAGASSAGLGPVVRCEPCDARALAQCA
10 20 30 40 50
60 70 80 90 100 110
pF1KE4 PPPVAPPAAVAAVAGGARMPCAELVREPGCGCCSVCARLEGEACGVYTPRCGQGLRCYPH
:::.. ::::::::::::: .:: ::. ::.:: :::.:::: :
CCDS34 PPPAV---------------CAELVREPGCGCCLTCALSEGQPCGIYTERCGSGLRCQPS
60 70 80 90
120 130 140 150 160
pF1KE4 PGSELPLQALVMGEGTC------EKRRDAEYGASPE--------QVADNGDDHSEGGLVE
: :::::. :.: : . : : : ..... .:.: :..
CCDS34 PDEARPLQALLDGRGLCVNASAVSRLRAYLLPAPPAPGEPPAPGNASESEEDRSAGSVES
100 110 120 130 140 150
170 180 190 200 210 220
pF1KE4 NHVDSTMNMLGGGGSAGRKPLKSGMKELAVFREKVTEQHRQMGKGGKHHLGLEE-PKKLR
:.:: . .. .::.: : . . . .. ...: .. .. .. .
CCDS34 PSVSSTHRV----SDPKFHPLHS--KIIIIKKGHAKDSQRYKVDYESQSTDTQNFSSESK
160 170 180 190 200 210
230 240 250 260 270 280
pF1KE4 PPPARTPCQQELDQVLERISTMRLPDERGPLEHLYSLHIPNCDKHGLYNLKQCKMSLNGQ
::..:....:.... . . . :: .:::::::.:.:. :::. : . .
CCDS34 RETEYGPCRREMEDTLNHLKFLNVLSPRG-------VHIPNCDKKGFYKKKQCRPSKGRK
220 230 240 250 260
290 300 310 320
pF1KE4 RGECWCVNPNTGKLIQGAPTIRGDPECHLFYNEQQEARGVHTQRMQ
:: ::::. . :. . : : .: . : .
CCDS34 RGFCWCVD-KYGQPLPGYTT-KGKEDVHCYSMQSK
270 280 290
328 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 23:09:30 2016 done: Sat Nov 5 23:09:31 2016
Total Scan time: 2.860 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]