FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB0431, 476 aa
1>>>pF1KB0431 476 - 476 aa - 476 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.8537+/-0.00101; mu= -1.4384+/- 0.060
mean_var=325.4706+/-66.479, 0's: 0 Z-trim(114.5): 46 B-trim: 250 in 1/52
Lambda= 0.071092
statistics sampled from 15059 (15096) to 15059 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.78), E-opt: 0.2 (0.464), width: 16
Scan time: 3.390
The best scores are:                               opt  bits E(32554)
CCDS9543.1 UPF3A gene_id:65110|Hs108|chr13 ( 476) 3186 340.4 2.5e-93
CCDS9544.1 UPF3A gene_id:65110|Hs108|chr13 ( 443) 2039 222.7 6.1e-58
CCDS14587.1 UPF3B gene_id:65109|Hs108|chrX ( 470) 1291 146.1 7.9e-35
CCDS14588.1 UPF3B gene_id:65109|Hs108|chrX ( 483) 1070 123.4 5.3e-28
>>CCDS9543.1 UPF3A gene_id:65110|Hs108|chr13 (476 aa)
initn: 3186 init1: 3186 opt: 3186 Z-score: 1789.1 bits: 340.4 E(32554): 2.5e-93
Smith-Waterman score: 3186; 100.0% identity (100.0% similar) in 476 aa overlap (1-476:1-476)
               10        20        30        40        50        60
pF1KB0 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGAGKPR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGAGKPR
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB0 EEKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHLYSRAYINFRN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 EEKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHLYSRAYINFRN
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB0 PDDILLFRDRFDGYIFLDSKGLEYPAVVEFAPFQKIAKKKLRKKDAKTGSIEDDPEYKKF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 PDDILLFRDRFDGYIFLDSKGLEYPAVVEFAPFQKIAKKKLRKKDAKTGSIEDDPEYKKF
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB0 LETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQRIREEKREERR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 LETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQRIREEKREERR
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB0 RRELEKKRLREEEKRRRREEERCKKKETDKQKKIAEKEVRIKLLKKPEKGEEPTTEKPKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 RRELEKKRLREEEKRRRREEERCKKKETDKQKKIAEKEVRIKLLKKPEKGEEPTTEKPKE
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB0 RGEEIDTGGGKQESCAPGAVVKARPMEGSLEEPQETSHSGSDKEHRDVERSQEQESEAQR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 RGEEIDTGGGKQESCAPGAVVKARPMEGSLEEPQETSHSGSDKEHRDVERSQEQESEAQR
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KB0 YHVDDGRRHRAHHEPERLSRRSEDEQRWGKGPGQDRGKKGSQDSGAPGEAMERLGRAQRC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 YHVDDGRRHRAHHEPERLSRRSEDEQRWGKGPGQDRGKKGSQDSGAPGEAMERLGRAQRC
              370       380       390       400       410       420
              430       440       450       460       470
pF1KB0 DDSPAPRKERLANKDRPALQLYDPGARFRARECGGNRRICKAEGSGTGPEKREEAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 DDSPAPRKERLANKDRPALQLYDPGARFRARECGGNRRICKAEGSGTGPEKREEAE
              430       440       450       460       470
>>CCDS9544.1 UPF3A gene_id:65110|Hs108|chr13 (443 aa)
initn: 2039 init1: 2039 opt: 2039 Z-score: 1153.7 bits: 222.7 E(32554): 6.1e-58
Smith-Waterman score: 2900; 93.1% identity (93.1% similar) in 476 aa overlap (1-476:1-443)
               10        20        30        40        50        60
pF1KB0 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGAGKPR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGAGKPR
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB0 EEKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHLYSRAYINFRN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 EEKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHLYSRAYINFRN
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB0 PDDILLFRDRFDGYIFLDSKGLEYPAVVEFAPFQKIAKKKLRKKDAKTGSIEDDPEYKKF
       ::::::::::::::::::::                                 :::::::
CCDS95 PDDILLFRDRFDGYIFLDSK---------------------------------DPEYKKF
              130       140
              190       200       210       220       230       240
pF1KB0 LETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQRIREEKREERR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 LETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQRIREEKREERR
       150       160       170       180       190       200
              250       260       270       280       290       300
pF1KB0 RRELEKKRLREEEKRRRREEERCKKKETDKQKKIAEKEVRIKLLKKPEKGEEPTTEKPKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 RRELEKKRLREEEKRRRREEERCKKKETDKQKKIAEKEVRIKLLKKPEKGEEPTTEKPKE
       210       220       230       240       250       260
              310       320       330       340       350       360
pF1KB0 RGEEIDTGGGKQESCAPGAVVKARPMEGSLEEPQETSHSGSDKEHRDVERSQEQESEAQR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 RGEEIDTGGGKQESCAPGAVVKARPMEGSLEEPQETSHSGSDKEHRDVERSQEQESEAQR
       270       280       290       300       310       320
              370       380       390       400       410       420
pF1KB0 YHVDDGRRHRAHHEPERLSRRSEDEQRWGKGPGQDRGKKGSQDSGAPGEAMERLGRAQRC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 YHVDDGRRHRAHHEPERLSRRSEDEQRWGKGPGQDRGKKGSQDSGAPGEAMERLGRAQRC
       330       340       350       360       370       380
              430       440       450       460       470
pF1KB0 DDSPAPRKERLANKDRPALQLYDPGARFRARECGGNRRICKAEGSGTGPEKREEAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS95 DDSPAPRKERLANKDRPALQLYDPGARFRARECGGNRRICKAEGSGTGPEKREEAE
       390       400       410       420       430       440
>>CCDS14587.1 UPF3B gene_id:65109|Hs108|chrX (470 aa)
initn: 1120 init1: 880 opt: 1291 Z-score: 738.8 bits: 146.1 E(32554): 7.9e-35
Smith-Waterman score: 1336; 46.3% identity (74.0% similar) in 473 aa overlap (30-475:3-459)
10 20 30 40 50
pF1KB0 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGA----
: . :: .... . :....: :::.
CCDS14 MKEEKEHRPKEKRVTLLTPAGATGSGGGTSGDS
10 20 30
60 70 80 90 100 110
pF1KB0 --GKPRE----EKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHL
:. .. ::. :::::::::::: ::::::.:.:.:.: :::::::. : :::::.
CCDS14 SKGEDKQDRNKEKKEALSKVVIRRLPPTLTKEQLQEHLQPMPEHDYFEFFSNDTSLYPHM
40 50 60 70 80 90
120 130 140 150 160 170
pF1KB0 YSRAYINFRNPDDILLFRDRFDGYIFLDSKGLEYPAVVEFAPFQKIAKKKLRKKDAKTGS
:.::::::.: .::.:::::::::.:::.:: ::::.:::::::: :::: .:.:.:.:.
CCDS14 YARAYINFKNQEDIILFRDRFDGYVFLDNKGQEYPAIVEFAPFQKAAKKKTKKRDTKVGT
100 110 120 130 140 150
180 190 200 210 220 230
pF1KB0 IEDDPEYKKFLETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQR
:.:::::.::::.: ...:: ...::::: :.:::.:::::..::::: ..:: :::
CCDS14 IDDDPEYRKFLESYATDNEKMTSTPETLLEEIEAKNRELIAKKTTPLLSFLKN----KQR
160 170 180 190 200
240 250 260 270 280
pF1KB0 IREEKREERRRRELEKKRLREEEKRRRREEERCKKKETDKQKKIA--------EKEVRIK
.::::::::::::.:.:: ::::.:. .:::. :.:. .: ::: . : .::
CCDS14 MREEKREERRRREIERKRQREEERRKWKEEEKRKRKDIEKLKKIDRIPERDKLKDEPKIK
210 220 230 240 250 260
290 300 310 320 330
pF1KB0 LLKKPEKGEEPTTEKPKERGEEIDTGGGKQESCAPGAVVKARPMEGSL--EEPQET-SHS
::::::::.: .: .:.....: . ..: . . . . .. : :.:.. ..:
CCDS14 LLKKPEKGDEKELDK-REKAKKLDKENLSDERASGQSCTLPKRSDSELKDEKPKRPEDES
270 280 290 300 310 320
340 350 360 370 380 390
pF1KB0 GSD--KEHRDVERSQEQ---ESEAQRYHVDDGRRHRAHHEPERLSRRSEDEQRWGKGPGQ
: : ...:. ::.::. : : . . .. ::.. ..: :. .:.:.:.. : .
CCDS14 GRDYREREREYERDQERILRERERLKRQEEERRRQKERYEKEKTFKRKEEEMKKEKDTLR
330 340 350 360 370 380
400 410 420 430 440 450
pF1KB0 DRGKKGSQDSGAPGEAMERLGRAQRCDDSP-APRKERLANKDRPALQLYDPGARFRAREC
:.:::. :. : .: ... . . . ...:. ::::::.:::.:::: : : :
CCDS14 DKGKKA--------ESTESIGSSEKTEKKEEVVKRDRIRNKDRPAMQLYQPGARSRNRLC
390 400 410 420 430 440
460 470
pF1KB0 GGNRRICKAEGSGTGPEKREEAE
. ..... .. :...:.
CCDS14 PPDD---STKSGDSAAERKQESGISHRKEGGEE
450 460 470
>>CCDS14588.1 UPF3B gene_id:65109|Hs108|chrX (483 aa)
initn: 1261 init1: 875 opt: 1070 Z-score: 616.1 bits: 123.4 E(32554): 5.3e-28
Smith-Waterman score: 1300; 45.1% identity (72.0% similar) in 486 aa overlap (30-475:3-472)
10 20 30 40 50
pF1KB0 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGA----
: . :: .... . :....: :::.
CCDS14 MKEEKEHRPKEKRVTLLTPAGATGSGGGTSGDS
10 20 30
60 70 80 90 100 110
pF1KB0 --GKPRE----EKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHL
:. .. ::. :::::::::::: ::::::.:.:.:.: :::::::. : :::::.
CCDS14 SKGEDKQDRNKEKKEALSKVVIRRLPPTLTKEQLQEHLQPMPEHDYFEFFSNDTSLYPHM
40 50 60 70 80 90
120 130 140 150 160 170
pF1KB0 YSRAYINFRNPDDILLFRDRFDGYIFLDSKGLEYPAVVEFAPFQKIAKKKLRKKDAKTGS
:.::::::.: .::.:::::::::.:::.:: ::::.:::::::: :::: .:.:.:.:.
CCDS14 YARAYINFKNQEDIILFRDRFDGYVFLDNKGQEYPAIVEFAPFQKAAKKKTKKRDTKVGT
100 110 120 130 140 150
180 190 200 210 220 230
pF1KB0 IEDDPEYKKFLETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQR
:.:::::.::::.: ...:: ...::::: :.:::.:::::..::::: ..:: :::
CCDS14 IDDDPEYRKFLESYATDNEKMTSTPETLLEEIEAKNRELIAKKTTPLLSFLKN----KQR
160 170 180 190 200
240 250 260 270 280
pF1KB0 IREEKREERRRRELEKKRLREEEKRRRREEERCKKKETDKQKKIA--------EKEVRIK
.::::::::::::.:.:: ::::.:. .:::. :.:. .: ::: . : .::
CCDS14 MREEKREERRRREIERKRQREEERRKWKEEEKRKRKDIEKLKKIDRIPERDKLKDEPKIK
210 220 230 240 250 260
290 300 310 320
pF1KB0 -------------LLKKPEKGEEPTTEKPKERGEEIDTGGGKQESCAPGAVVKARPMEGS
::::::::.: .: .:.....: . ..: . . . . ..
CCDS14 VHRFLLQAVNQKNLLKKPEKGDEKELDK-REKAKKLDKENLSDERASGQSCTLPKRSDSE
270 280 290 300 310 320
330 340 350 360 370 380
pF1KB0 L--EEPQET-SHSGSD--KEHRDVERSQEQ---ESEAQRYHVDDGRRHRAHHEPERLSRR
: :.:.. ..:: : ...:. ::.::. : : . . .. ::.. ..: :. .:
CCDS14 LKDEKPKRPEDESGRDYREREREYERDQERILRERERLKRQEEERRRQKERYEKEKTFKR
330 340 350 360 370 380
390 400 410 420 430 440
pF1KB0 SEDEQRWGKGPGQDRGKKGSQDSGAPGEAMERLGRAQRCDDSP-APRKERLANKDRPALQ
.:.:.. : .:.:::. :. : .: ... . . . ...:. ::::::.:
CCDS14 KEEEMKKEKDTLRDKGKKA--------ESTESIGSSEKTEKKEEVVKRDRIRNKDRPAMQ
390 400 410 420 430 440
450 460 470
pF1KB0 LYDPGARFRARECGGNRRICKAEGSGTGPEKREEAE
::.:::: : : : . ..... .. :...:.
CCDS14 LYQPGARSRNRLCPPDD---STKSGDSAAERKQESGISHRKEGGEE
450 460 470 480
476 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 14:31:25 2016 done: Sun Nov 6 14:31:26 2016
Total Scan time: 3.390 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]