FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9475, 343 aa
1>>>pF1KB9475 343 - 343 aa - 343 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.0811+/-0.000783; mu= 13.3852+/- 0.047
mean_var=92.3811+/-18.367, 0's: 0 Z-trim(109.7): 8 B-trim: 7 in 1/52
Lambda= 0.133439
statistics sampled from 11094 (11096) to 11094 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.718), E-opt: 0.2 (0.341), width: 16
Scan time: 2.940
The best scores are: opt bits E(32554)
CCDS83050.1 UIMC1 gene_id:51720|Hs108|chr5 ( 553) 2037 402.1 5.6e-112
CCDS4408.1 UIMC1 gene_id:51720|Hs108|chr5 ( 719) 2032 401.2 1.3e-111
>>CCDS83050.1 UIMC1 gene_id:51720|Hs108|chr5 (553 aa)
initn: 2039 init1: 1224 opt: 2037 Z-score: 2123.8 bits: 402.1 E(32554): 5.6e-112
Smith-Waterman score: 2037; 92.2% identity (95.1% similar) in 344 aa overlap (6-343:211-553)
10 20 30
pF1KB9 MLPLPDLDLWPLDRLPSPIKRKPQTLGSLKSSQGI
.... .:: . .. : :. .:::::
CCDS83 EPWDHTEKTEEEPVSGSSGSWDQSSQPVFENVNVKSFDRCTGHSAEHTQC-GKPQSSQGI
190 200 210 220 230
40 50 60 70 80 90
pF1KB9 VEETSEEGNSVPASQSVAALTSKRSLVLMPESSAEEITVCPETQLSSSETFDLEREVSPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 VEETSEEGNSVPASQSVAALTSKRSLVLMPESSAEEITVCPETQLSSSETFDLEREVSPG
240 250 260 270 280 290
100 110 120 130 140 150
pF1KB9 SRDILDGVRIIMADKEVGNKEDAEKEVAISTFSSSNQVSCPLCDQCFPPTKIERHAMYCN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 SRDILDGVRIIMADKEVGNKEDAEKEVAISTFSSSNQVSCPLCDQCFPPTKIERHAMYCN
300 310 320 330 340 350
160 170 180 190 200 210
pF1KB9 GLMEEDTVLTRRQKEAKTKSDSGTAAQTSLDIDKNEKCYLCKSLVPFREYQCHVDSCLQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 GLMEEDTVLTRRQKEAKTKSDSGTAAQTSLDIDKNEKCYLCKSLVPFREYQCHVDSCLQL
360 370 380 390 400 410
220 230 240 250 260
pF1KB9 AKA------EGSGRACSTVEGKWQQRLKNPKEKGHSEGRLLSFLEQSEHKTSDADIKSSE
::: :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 AKADQGDGPEGSGRACSTVEGKWQQRLKNPKEKGHSEGRLLSFLEQSEHKTSDADIKSSE
420 430 440 450 460 470
270 280 290 300 310 320
pF1KB9 TGAFRVPSPGMEEAGCSREMQSSFTRRDLNESPVKSFVSISEATDCLVDFKKQVTVQPGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 TGAFRVPSPGMEEAGCSREMQSSFTRRDLNESPVKSFVSISEATDCLVDFKKQVTVQPGS
480 490 500 510 520 530
330 340
pF1KB9 RTRTKAGRGRRRKF
::::::::::::::
CCDS83 RTRTKAGRGRRRKF
540 550
>>CCDS4408.1 UIMC1 gene_id:51720|Hs108|chr5 (719 aa)
initn: 2039 init1: 1224 opt: 2032 Z-score: 2117.0 bits: 401.2 E(32554): 1.3e-111
Smith-Waterman score: 2032; 95.2% identity (97.0% similar) in 331 aa overlap (19-343:390-719)
10 20 30 40
pF1KB9 MLPLPDLDLWPLDRLPSPIKRKPQTLGSLKSSQGIVEETSEEGNSVPA
....: : . .::::::::::::::::::
CCDS44 ERQESRASDWHSKTKDFQESSIKSLKEKLLLEEEPTTSHG-QSSQGIVEETSEEGNSVPA
360 370 380 390 400 410
50 60 70 80 90 100
pF1KB9 SQSVAALTSKRSLVLMPESSAEEITVCPETQLSSSETFDLEREVSPGSRDILDGVRIIMA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SQSVAALTSKRSLVLMPESSAEEITVCPETQLSSSETFDLEREVSPGSRDILDGVRIIMA
420 430 440 450 460 470
110 120 130 140 150 160
pF1KB9 DKEVGNKEDAEKEVAISTFSSSNQVSCPLCDQCFPPTKIERHAMYCNGLMEEDTVLTRRQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 DKEVGNKEDAEKEVAISTFSSSNQVSCPLCDQCFPPTKIERHAMYCNGLMEEDTVLTRRQ
480 490 500 510 520 530
170 180 190 200 210 220
pF1KB9 KEAKTKSDSGTAAQTSLDIDKNEKCYLCKSLVPFREYQCHVDSCLQLAKA------EGSG
:::::::::::::::::::::::::::::::::::::::::::::::::: ::::
CCDS44 KEAKTKSDSGTAAQTSLDIDKNEKCYLCKSLVPFREYQCHVDSCLQLAKADQGDGPEGSG
540 550 560 570 580 590
230 240 250 260 270 280
pF1KB9 RACSTVEGKWQQRLKNPKEKGHSEGRLLSFLEQSEHKTSDADIKSSETGAFRVPSPGMEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 RACSTVEGKWQQRLKNPKEKGHSEGRLLSFLEQSEHKTSDADIKSSETGAFRVPSPGMEE
600 610 620 630 640 650
290 300 310 320 330 340
pF1KB9 AGCSREMQSSFTRRDLNESPVKSFVSISEATDCLVDFKKQVTVQPGSRTRTKAGRGRRRK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 AGCSREMQSSFTRRDLNESPVKSFVSISEATDCLVDFKKQVTVQPGSRTRTKAGRGRRRK
660 670 680 690 700 710
pF1KB9 F
:
CCDS44 F
343 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 23:44:23 2016 done: Thu Nov 3 23:44:24 2016
Total Scan time: 2.940 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]