FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8942, 301 aa
1>>>pF1KB8942 301 - 301 aa - 301 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.6748+/-0.000769; mu= 7.4082+/- 0.047
mean_var=191.9625+/-39.968, 0's: 0 Z-trim(115.7): 174 B-trim: 919 in 1/51
Lambda= 0.092569
statistics sampled from 16056 (16247) to 16056 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.809), E-opt: 0.2 (0.499), width: 16
Scan time: 3.050
The best scores are:                              opt  bits E(32554)
CCDS4387.1 5 gene_id:1482|Hs108|chr5      ( 324)  718 107.4 1.5e-23
CCDS41558.1 3 gene_id:159296|Hs108|chr10  ( 364)  656  99.2 5.1e-21
CCDS13145.1 2 gene_id:4821|Hs108|chr20    ( 273)  506  79.0 4.4e-15
>>CCDS4387.1 5 gene_id:1482|Hs108|chr5 (324 aa)
initn: 652 init1: 424 opt: 718 Z-score: 536.3 bits: 107.4 E(32554): 1.5e-23
Smith-Waterman score: 759; 45.2% identity (64.8% similar) in 330 aa overlap (1-301:1-324)
10 20 30 40 50
pF1KB8 MLLSP-VTSTPFSVKDILRLERE-RSCPAA---SPHPRVRKSPENFQYLRMDAEPRGSEV
:. :: .: ::::::::: ::.. :: :: : . .. .: . . . : ..
CCDS43 MFPSPALTPTPFSVKDILNLEQQQRSLAAAGELSARLEATLAPSSCMLAAFKPEAYAGPE
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB8 HNAGGGGGDRKLDGSEPPGGPCEAVLEMDA----ERMGEPQPGLNAASPLGGGTRVPERG
: : : : : . : ... . ...:.: : .: . ..
CCDS43 AAAPGLPELRAELGRAPSPAKCASAFPAAPAFYPRAYSDPDP---AKDPRAEKKELCALQ
70 80 90 100 110
120 130 140 150 160 170
pF1KB8 VGNSGDSVRGGRSEQPKARQRRKPRVLFSQAQVLALERRFKQQRYLSAPEREHLASALQL
. ..... .:.:.::.::::::::::::: ::::::::::::::::..:::.:.:
CCDS43 KAVELEKTEADNAERPRARRRRKPRVLFSQAQVYELERRFKQQRYLSAPERDQLASVLKL
120 130 140 150 160 170
180 190 200 210 220
pF1KB8 TSTQVKIWFQNRRYKCKRQRQDKSLELAGHPLTP----RRVAVPVLVRDGKPCLGPG-PG
::::::::::::::::::::::..:::.: : : ::.:::::::::::::: . :
CCDS43 TSTQVKIWFQNRRYKCKRQRQDQTLELVGLPPPPPPPARRIAVPVLVRDGKPCLGDSAPY
180 190 200 210 220 230
230 240 250 260 270
pF1KB8 APAFP---SPYSAAVSPYSCYGGYSGAPYGAGYGTCYAGAPSGPAPHTPLASAG------
:::. .:: . . : : ::.:: . :: .: :. :.::.: : ..:.
CCDS43 APAYGVGLNPY--GYNAYPAYPGYGGAACSPGY-SCTAAYPAGPSPAQPATAAANNNFVN
240 250 260 270 280 290
280 290 300
pF1KB8 FGHGGQNAT-----PQGHLA-ATLQGVRAW
:: : ::. ::.. . .::.:.:::
CCDS43 FGVGDLNAVQSPGIPQSNSGVSTLHGIRAW
300 310 320
>>CCDS41558.1 3 gene_id:159296|Hs108|chr10 (364 aa)
initn: 671 init1: 510 opt: 656 Z-score: 490.9 bits: 99.2 E(32554): 5.1e-21
Smith-Waterman score: 722; 46.7% identity (68.3% similar) in 306 aa overlap (1-289:2-298)
10 20 30 40
pF1KB8 MLLSPVTSTPFSVKDILRLERERS------CPAASPH-----PRVRKSPENFQYL---R
:: :::::::::::::: ::.... : : : . . :. :. .
CCDS41 MMLPSPVTSTPFSVKDILNLEQQHQHFHGAHLQADLEHHFHSAPCMLAAAEGTQFSDGGE
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB8 MDAEPRGSEVHNAGG-GGGDRKLDGSEPPGGPCEAVLEMDAERMGEPQPGLNAASPLGGG
: : .: .. .. ...: . :.. : : ..::. . . : . ... .
CCDS41 EDEEDEGEKLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRDRSQK
70 80 90 100 110 120
110 120 130 140 150 160
pF1KB8 TRVPERGVGNSGDSVRGGRSEQPKARQRRKPRVLFSQAQVLALERRFKQQRYLSAPEREH
. .... ..:: . .::.:: :.:::::::::::::. ::::::::::::::::::
CCDS41 SCQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQAQVFELERRFKQQRYLSAPEREH
130 140 150 160 170 180
170 180 190 200 210 220
pF1KB8 LASALQLTSTQVKIWFQNRRYKCKRQRQDKSLELAGH--PLTPRRVAVPVLVRDGKPCLG
:::.:.::::::::::::::::::::::::::::..: : ::::::::::::::::.
CCDS41 LASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPPPPPRRVAVPVLVRDGKPCVT
190 200 210 220 230 240
230 240 250 260 270 280
pF1KB8 PGPGAPAFPSPYSAAVSPYSCYGGYSGAPYGAGYGTCYAGAPSGPAPHTPLASAGFGHGG
: .: :. .:::...: :: :.. : . :::. :.: .. : . :.:... .
CCDS41 P--SAQAYGAPYSVGASAYS----YNSFP-AYGYGNSAAAAAAAAA--AAAAAAAYSSSY
250 260 270 280 290
290 300
pF1KB8 QNATPQGHLAATLQGVRAW
: : :
CCDS41 GCAYPAGGGGGGGGTSAATTAMQPACSAAGGGPFVNVSNLGGFGSGGSAQPLHQGTAAGA
300 310 320 330 340 350
>>CCDS13145.1 2 gene_id:4821|Hs108|chr20 (273 aa)
initn: 484 init1: 349 opt: 506 Z-score: 384.2 bits: 79.0 E(32554): 4.4e-15
Smith-Waterman score: 509; 40.4% identity (61.0% similar) in 282 aa overlap (7-276:6-268)
10 20 30 40 50 60
pF1KB8 MLLSPVTSTPFSVKDILRLERERSCPAASPHPRVRKSPENFQYLRMDAEPRGSEVHNAGG
:.: ::::::: : . . . : ..::. : .: : . .:
CCDS13 MSLTNTKTGFSVKDILDLP-----DTNDEEGSVAEGPEE--------ENEGPEPAKRAG
10 20 30 40
70 80 90 100 110
pF1KB8 GGGDRKLDG--SEPPGGPCEAVLEMDAERMGEPQPGLN--------AASPLGGGTRVPER
:. ::. : : .: . : ::. .: : .... ::
CCDS13 PLGQGALDAVQSLPLKNPFYDSSDNPYTRWLASTEGLQYSLHGLAAGAPPQDSSSKSPEP
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB8 GVGNSGDSVRGGRSEQPKARQRRKPRVLFSQAQVLALERRFKQQRYLSAPEREHLASALQ
.. .: :. . . : ..:: :::::.::. :::::.::::::::::::::: ..
CCDS13 SADESPDNDKETPGGGGDAGKKRKRRVLFSKAQTYELERRFRQQRYLSAPEREHLASLIR
110 120 130 140 150 160
180 190 200 210 220 230
pF1KB8 LTSTQVKIWFQNRRYKCKRQRQDKSLELAGHPLTPRRVAVPVLVRDGKPCLGPGPGAPAF
:: :::::::::.::: :: : .:..:.. : .:::::::::::::::: . :
CCDS13 LTPTQVKIWFQNHRYKMKRARAEKGMEVTPLP-SPRRVAVPVLVRDGKPCHALKAQDLA-
170 180 190 200 210 220
240 250 260 270 280
pF1KB8 PSPYSAAVSPYSCYGGYS--GAPYGAGYGTCYAGAPSGPAPHTPLASAGFGHGGQNATPQ
. ..:.. :.: :.. : :.: :.. :..:. :. : ::..:
CCDS13 AATFQAGI-PFSAYSAQSLQHMQYNAQYSS--ASTPQYPTAH-PLVQAQQWTW
230 240 250 260 270
290 300
pF1KB8 GHLAATLQGVRAW
301 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 16:36:22 2016 done: Fri Nov 4 16:36:23 2016
Total Scan time: 3.050 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]