FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB0431, 476 aa 1>>>pF1KB0431 476 - 476 aa - 476 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.8537+/-0.00101; mu= -1.4384+/- 0.060 mean_var=325.4706+/-66.479, 0's: 0 Z-trim(114.5): 46 B-trim: 250 in 1/52 Lambda= 0.071092 statistics sampled from 15059 (15096) to 15059 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.78), E-opt: 0.2 (0.464), width: 16 Scan time: 3.390 The best scores are: opt bits E(32554) CCDS9543.1 UPF3A gene_id:65110|Hs108|chr13 ( 476) 3186 340.4 2.5e-93 CCDS9544.1 UPF3A gene_id:65110|Hs108|chr13 ( 443) 2039 222.7 6.1e-58 CCDS14587.1 UPF3B gene_id:65109|Hs108|chrX ( 470) 1291 146.1 7.9e-35 CCDS14588.1 UPF3B gene_id:65109|Hs108|chrX ( 483) 1070 123.4 5.3e-28 >>CCDS9543.1 UPF3A gene_id:65110|Hs108|chr13 (476 aa) initn: 3186 init1: 3186 opt: 3186 Z-score: 1789.1 bits: 340.4 E(32554): 2.5e-93 Smith-Waterman score: 3186; 100.0% identity (100.0% similar) in 476 aa overlap (1-476:1-476) 10 20 30 40 50 60 pF1KB0 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGAGKPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGAGKPR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 EEKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHLYSRAYINFRN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 EEKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHLYSRAYINFRN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 PDDILLFRDRFDGYIFLDSKGLEYPAVVEFAPFQKIAKKKLRKKDAKTGSIEDDPEYKKF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 PDDILLFRDRFDGYIFLDSKGLEYPAVVEFAPFQKIAKKKLRKKDAKTGSIEDDPEYKKF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB0 LETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQRIREEKREERR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 LETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQRIREEKREERR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB0 RRELEKKRLREEEKRRRREEERCKKKETDKQKKIAEKEVRIKLLKKPEKGEEPTTEKPKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 RRELEKKRLREEEKRRRREEERCKKKETDKQKKIAEKEVRIKLLKKPEKGEEPTTEKPKE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB0 RGEEIDTGGGKQESCAPGAVVKARPMEGSLEEPQETSHSGSDKEHRDVERSQEQESEAQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 RGEEIDTGGGKQESCAPGAVVKARPMEGSLEEPQETSHSGSDKEHRDVERSQEQESEAQR 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB0 YHVDDGRRHRAHHEPERLSRRSEDEQRWGKGPGQDRGKKGSQDSGAPGEAMERLGRAQRC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 YHVDDGRRHRAHHEPERLSRRSEDEQRWGKGPGQDRGKKGSQDSGAPGEAMERLGRAQRC 370 380 390 400 410 420 430 440 450 460 470 pF1KB0 DDSPAPRKERLANKDRPALQLYDPGARFRARECGGNRRICKAEGSGTGPEKREEAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 DDSPAPRKERLANKDRPALQLYDPGARFRARECGGNRRICKAEGSGTGPEKREEAE 430 440 450 460 470 >>CCDS9544.1 UPF3A gene_id:65110|Hs108|chr13 (443 aa) initn: 2039 init1: 2039 opt: 2039 Z-score: 1153.7 bits: 222.7 E(32554): 6.1e-58 Smith-Waterman score: 2900; 93.1% identity (93.1% similar) in 476 aa overlap (1-476:1-443) 10 20 30 40 50 60 pF1KB0 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGAGKPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGAGKPR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 EEKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHLYSRAYINFRN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 EEKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHLYSRAYINFRN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 PDDILLFRDRFDGYIFLDSKGLEYPAVVEFAPFQKIAKKKLRKKDAKTGSIEDDPEYKKF :::::::::::::::::::: ::::::: CCDS95 PDDILLFRDRFDGYIFLDSK---------------------------------DPEYKKF 130 140 190 200 210 220 230 240 pF1KB0 LETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQRIREEKREERR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 LETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQRIREEKREERR 150 160 170 180 190 200 250 260 270 280 290 300 pF1KB0 RRELEKKRLREEEKRRRREEERCKKKETDKQKKIAEKEVRIKLLKKPEKGEEPTTEKPKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 RRELEKKRLREEEKRRRREEERCKKKETDKQKKIAEKEVRIKLLKKPEKGEEPTTEKPKE 210 220 230 240 250 260 310 320 330 340 350 360 pF1KB0 RGEEIDTGGGKQESCAPGAVVKARPMEGSLEEPQETSHSGSDKEHRDVERSQEQESEAQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 RGEEIDTGGGKQESCAPGAVVKARPMEGSLEEPQETSHSGSDKEHRDVERSQEQESEAQR 270 280 290 300 310 320 370 380 390 400 410 420 pF1KB0 YHVDDGRRHRAHHEPERLSRRSEDEQRWGKGPGQDRGKKGSQDSGAPGEAMERLGRAQRC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 YHVDDGRRHRAHHEPERLSRRSEDEQRWGKGPGQDRGKKGSQDSGAPGEAMERLGRAQRC 330 340 350 360 370 380 430 440 450 460 470 pF1KB0 DDSPAPRKERLANKDRPALQLYDPGARFRARECGGNRRICKAEGSGTGPEKREEAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 DDSPAPRKERLANKDRPALQLYDPGARFRARECGGNRRICKAEGSGTGPEKREEAE 390 400 410 420 430 440 >>CCDS14587.1 UPF3B gene_id:65109|Hs108|chrX (470 aa) initn: 1120 init1: 880 opt: 1291 Z-score: 738.8 bits: 146.1 E(32554): 7.9e-35 Smith-Waterman score: 1336; 46.3% identity (74.0% similar) in 473 aa overlap (30-475:3-459) 10 20 30 40 50 pF1KB0 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGA---- : . :: .... . :....: :::. CCDS14 MKEEKEHRPKEKRVTLLTPAGATGSGGGTSGDS 10 20 30 60 70 80 90 100 110 pF1KB0 --GKPRE----EKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHL :. .. ::. :::::::::::: ::::::.:.:.:.: :::::::. : :::::. CCDS14 SKGEDKQDRNKEKKEALSKVVIRRLPPTLTKEQLQEHLQPMPEHDYFEFFSNDTSLYPHM 40 50 60 70 80 90 120 130 140 150 160 170 pF1KB0 YSRAYINFRNPDDILLFRDRFDGYIFLDSKGLEYPAVVEFAPFQKIAKKKLRKKDAKTGS :.::::::.: .::.:::::::::.:::.:: ::::.:::::::: :::: .:.:.:.:. CCDS14 YARAYINFKNQEDIILFRDRFDGYVFLDNKGQEYPAIVEFAPFQKAAKKKTKKRDTKVGT 100 110 120 130 140 150 180 190 200 210 220 230 pF1KB0 IEDDPEYKKFLETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQR :.:::::.::::.: ...:: ...::::: :.:::.:::::..::::: ..:: ::: CCDS14 IDDDPEYRKFLESYATDNEKMTSTPETLLEEIEAKNRELIAKKTTPLLSFLKN----KQR 160 170 180 190 200 240 250 260 270 280 pF1KB0 IREEKREERRRRELEKKRLREEEKRRRREEERCKKKETDKQKKIA--------EKEVRIK .::::::::::::.:.:: ::::.:. .:::. :.:. .: ::: . : .:: CCDS14 MREEKREERRRREIERKRQREEERRKWKEEEKRKRKDIEKLKKIDRIPERDKLKDEPKIK 210 220 230 240 250 260 290 300 310 320 330 pF1KB0 LLKKPEKGEEPTTEKPKERGEEIDTGGGKQESCAPGAVVKARPMEGSL--EEPQET-SHS ::::::::.: .: .:.....: . ..: . . . . .. : :.:.. ..: CCDS14 LLKKPEKGDEKELDK-REKAKKLDKENLSDERASGQSCTLPKRSDSELKDEKPKRPEDES 270 280 290 300 310 320 340 350 360 370 380 390 pF1KB0 GSD--KEHRDVERSQEQ---ESEAQRYHVDDGRRHRAHHEPERLSRRSEDEQRWGKGPGQ : : ...:. ::.::. : : . . .. ::.. ..: :. .:.:.:.. : . CCDS14 GRDYREREREYERDQERILRERERLKRQEEERRRQKERYEKEKTFKRKEEEMKKEKDTLR 330 340 350 360 370 380 400 410 420 430 440 450 pF1KB0 DRGKKGSQDSGAPGEAMERLGRAQRCDDSP-APRKERLANKDRPALQLYDPGARFRAREC :.:::. :. : .: ... . . . ...:. ::::::.:::.:::: : : : CCDS14 DKGKKA--------ESTESIGSSEKTEKKEEVVKRDRIRNKDRPAMQLYQPGARSRNRLC 390 400 410 420 430 440 460 470 pF1KB0 GGNRRICKAEGSGTGPEKREEAE . ..... .. :...:. CCDS14 PPDD---STKSGDSAAERKQESGISHRKEGGEE 450 460 470 >>CCDS14588.1 UPF3B gene_id:65109|Hs108|chrX (483 aa) initn: 1261 init1: 875 opt: 1070 Z-score: 616.1 bits: 123.4 E(32554): 5.3e-28 Smith-Waterman score: 1300; 45.1% identity (72.0% similar) in 486 aa overlap (30-475:3-472) 10 20 30 40 50 pF1KB0 MRSEKEGAGGLRAAVAARGPSGREKLSALEVQFHRDSQQQEAETPPTSSSGCGGGA---- : . :: .... . :....: :::. CCDS14 MKEEKEHRPKEKRVTLLTPAGATGSGGGTSGDS 10 20 30 60 70 80 90 100 110 pF1KB0 --GKPRE----EKRTALSKVVIRRLPPGLTKEQLEEQLRPLPAHDYFEFFAADLSLYPHL :. .. ::. :::::::::::: ::::::.:.:.:.: :::::::. : :::::. CCDS14 SKGEDKQDRNKEKKEALSKVVIRRLPPTLTKEQLQEHLQPMPEHDYFEFFSNDTSLYPHM 40 50 60 70 80 90 120 130 140 150 160 170 pF1KB0 YSRAYINFRNPDDILLFRDRFDGYIFLDSKGLEYPAVVEFAPFQKIAKKKLRKKDAKTGS :.::::::.: .::.:::::::::.:::.:: ::::.:::::::: :::: .:.:.:.:. CCDS14 YARAYINFKNQEDIILFRDRFDGYVFLDNKGQEYPAIVEFAPFQKAAKKKTKKRDTKVGT 100 110 120 130 140 150 180 190 200 210 220 230 pF1KB0 IEDDPEYKKFLETYCVEEEKTSANPETLLGEMEAKTRELIARRTTPLLEYIKNRKLEKQR :.:::::.::::.: ...:: ...::::: :.:::.:::::..::::: ..:: ::: CCDS14 IDDDPEYRKFLESYATDNEKMTSTPETLLEEIEAKNRELIAKKTTPLLSFLKN----KQR 160 170 180 190 200 240 250 260 270 280 pF1KB0 IREEKREERRRRELEKKRLREEEKRRRREEERCKKKETDKQKKIA--------EKEVRIK .::::::::::::.:.:: ::::.:. .:::. :.:. .: ::: . : .:: CCDS14 MREEKREERRRREIERKRQREEERRKWKEEEKRKRKDIEKLKKIDRIPERDKLKDEPKIK 210 220 230 240 250 260 290 300 310 320 pF1KB0 -------------LLKKPEKGEEPTTEKPKERGEEIDTGGGKQESCAPGAVVKARPMEGS ::::::::.: .: .:.....: . ..: . . . . .. CCDS14 VHRFLLQAVNQKNLLKKPEKGDEKELDK-REKAKKLDKENLSDERASGQSCTLPKRSDSE 270 280 290 300 310 320 330 340 350 360 370 380 pF1KB0 L--EEPQET-SHSGSD--KEHRDVERSQEQ---ESEAQRYHVDDGRRHRAHHEPERLSRR : :.:.. ..:: : ...:. ::.::. : : . . .. ::.. ..: :. .: CCDS14 LKDEKPKRPEDESGRDYREREREYERDQERILRERERLKRQEEERRRQKERYEKEKTFKR 330 340 350 360 370 380 390 400 410 420 430 440 pF1KB0 SEDEQRWGKGPGQDRGKKGSQDSGAPGEAMERLGRAQRCDDSP-APRKERLANKDRPALQ .:.:.. : .:.:::. :. : .: ... . . . ...:. ::::::.: CCDS14 KEEEMKKEKDTLRDKGKKA--------ESTESIGSSEKTEKKEEVVKRDRIRNKDRPAMQ 390 400 410 420 430 440 450 460 470 pF1KB0 LYDPGARFRARECGGNRRICKAEGSGTGPEKREEAE ::.:::: : : : . ..... .. :...:. CCDS14 LYQPGARSRNRLCPPDD---STKSGDSAAERKQESGISHRKEGGEE 450 460 470 480 476 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:31:25 2016 done: Sun Nov 6 14:31:26 2016 Total Scan time: 3.390 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]