FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3520, 293 aa
1>>>pF1KE3520 293 - 293 aa - 293 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.9712+/-0.000686; mu= 10.6928+/- 0.042
mean_var=163.9746+/-34.347, 0's: 0 Z-trim(116.5): 171 B-trim: 1781 in 2/51
Lambda= 0.100158
statistics sampled from 16959 (17148) to 16959 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.832), E-opt: 0.2 (0.527), width: 16
Scan time: 2.600
The best scores are:                                opt  bits E(32554)
CCDS41558.1 3 gene_id:159296|Hs108|chr10    ( 364) 1860 279.7 2.1e-75
CCDS4387.1 5 gene_id:1482|Hs108|chr5        ( 324)  724 115.5 5.1e-26
CCDS13145.1 2 gene_id:4821|Hs108|chr20      ( 273)  491  81.8 6.2e-16
CCDS42855.1 4 gene_id:644524|Hs108|chr20    ( 354)  474  79.5 4.1e-15
CCDS9660.1 8 gene_id:26257|Hs108|chr14      ( 239)  428  72.6 3.1e-13
CCDS3410.1 2 gene_id:579|Hs108|chr4         ( 333)  374  65.0 8.7e-11
>>CCDS41558.1 3 gene_id:159296|Hs108|chr10 (364 aa)
initn: 1860 init1: 1860 opt: 1860 Z-score: 1467.1 bits: 279.7 E(32554): 2.1e-75
Smith-Waterman score: 1860; 100.0% identity (100.0% similar) in 272 aa overlap (1-272:1-272)
10 20 30 40 50 60
pF1KE3 MMLPSPVTSTPFSVKDILNLEQQHQHFHGAHLQADLEHHFHSAPCMLAAAEGTQFSDGGE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MMLPSPVTSTPFSVKDILNLEQQHQHFHGAHLQADLEHHFHSAPCMLAAAEGTQFSDGGE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 EDEEDEGEKLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRDRSQK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EDEEDEGEKLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRDRSQK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE3 SCQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQAQVFELERRFKQQRYLSAPEREH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SCQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQAQVFELERRFKQQRYLSAPEREH
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE3 LASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPPPPPRRVAVPVLVRDGKPCVT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPPPPPRRVAVPVLVRDGKPCVT
190 200 210 220 230 240
250 260 270 280 290
pF1KE3 PSAQAYGAPYSVGASAYSYNSFPAYGYGNSAAGPPRRPLPCSPPAARPEAAPL
::::::::::::::::::::::::::::::::
CCDS41 PSAQAYGAPYSVGASAYSYNSFPAYGYGNSAAAAAAAAAAAAAAAAYSSSYGCAYPAGGG
250 260 270 280 290 300
CCDS41 GGGGGTSAATTAMQPACSAAGGGPFVNVSNLGGFGSGGSAQPLHQGTAAGAACAQGTLQG
310 320 330 340 350 360
>>CCDS4387.1 5 gene_id:1482|Hs108|chr5 (324 aa)
initn: 611 init1: 491 opt: 724 Z-score: 580.6 bits: 115.5 E(32554): 5.1e-26
Smith-Waterman score: 837; 51.5% identity (69.1% similar) in 301 aa overlap (2-291:1-286)
10 20 30 40 50
pF1KE3 MMLPSP-VTSTPFSVKDILNLEQQHQHFHGA-HLQADLEHHFHSAPCMLAAAEGTQFSDG
:.::: .: ::::::::::::::.. . .: .:.: :: . . ::::: . ..
CCDS43 MFPSPALTPTPFSVKDILNLEQQQRSLAAAGELSARLEATLAPSSCMLAAFKPEAYA--
10 20 30 40 50
60 70 80 90 100 110
pF1KE3 GEEDEEDEGEKLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRD-R
: : : : : : :.. : : . . :. . . :. ..: :
CCDS43 GPEAAAP-G-----LPELRAELGRAPS---PAKCASAFPAAPAFYPRAYSD-PDPAKDPR
60 70 80 90 100
120 130 140 150 160 170
pF1KE3 SQKS--CQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQAQVFELERRFKQQRYLSA
..:. : :.:..: . :...:::. : :::::::::::::.::::::::::::::
CCDS43 AEKKELCALQKAVEL--EKTEADNAERPRARRRRKPRVLFSQAQVYELERRFKQQRYLSA
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE3 PEREHLASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLEL-GAHAPPPPP-RRVAVPVLVR
:::..::: :::::::::::::::::::::::::..::: : ::::: ::.:::::::
CCDS43 PERDQLASVLKLTSTQVKIWFQNRRYKCKRQRQDQTLELVGLPPPPPPPARRIAVPVLVR
170 180 190 200 210 220
240 250 260 270 280
pF1KE3 DGKPCVTPSAQAYGAPYSVGASAYSYNSFPAY-GYGNSAAGPPRR---PLPCSPPAARPE
:::::. :: :. :.:: . :.::..::: :::..: .: : .: :.:
CCDS43 DGKPCLGDSAP-YAPAYGVGLNPYGYNAYPAYPGYGGAACSPGYSCTAAYPAGPSPAQPA
230 240 250 260 270 280
290
pF1KE3 AAPL
.:
CCDS43 TAAANNNFVNFGVGDLNAVQSPGIPQSNSGVSTLHGIRAW
290 300 310 320
>>CCDS13145.1 2 gene_id:4821|Hs108|chr20 (273 aa)
initn: 477 init1: 334 opt: 491 Z-score: 399.5 bits: 81.8 E(32554): 6.2e-16
Smith-Waterman score: 495; 40.5% identity (60.9% similar) in 284 aa overlap (8-278:6-260)
10 20 30 40 50 60
pF1KE3 MMLPSPVTSTPFSVKDILNLEQQHQHFHGAHLQADLEHHFHSAPCMLAAAEGTQFSDGGE
:.: :::::::.: . . : : ..::: . . :
CCDS13 MSLTNTKTGFSVKDILDLPDTN----------DEEG---------SVAEGPEEENEGP
10 20 30
70 80 90 100 110
pF1KE3 EDEEDEGE----KLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRD
: . : :. ..:: . ::. : :.. . . .: .. . ..
CCDS13 EPAKRAGPLGQGALDAVQSLPLKNPFYDSSDNP--YTRWL---ASTEGLQYSLHGLAAGA
40 50 60 70 80 90
120 130 140 150 160 170
pF1KE3 RSQKSCQLKKSLETAGDCKAAEESERP----KPRSRRKPRVLFSQAQVFELERRFKQQRY
: : . :: : ..: . ...: : ..:: :::::.::..::::::.::::
CCDS13 PPQDSSS--KSPEPSADESPDNDKETPGGGGDAGKKRKRRVLFSKAQTYELERRFRQQRY
100 110 120 130 140 150
180 190 200 210 220 230
pF1KE3 LSAPEREHLASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPPPPPRRVAVPVLV
::::::::::: ..:: :::::::::.::: :: : .:..:. .: : ::::::::::
CCDS13 LSAPEREHLASLIRLTPTQVKIWFQNHRYKMKRARAEKGMEV---TPLPSPRRVAVPVLV
160 170 180 190 200
240 250 260 270 280
pF1KE3 RDGKPCVTPSAQAYGAP-YSVGA--SAYSYNSFPAYGYGN--SAAGPPRRPLPCSPPAAR
:::::: . .:: .: ...: :::: .:. . :. :.:. :. :
CCDS13 RDGKPCHALKAQDLAAATFQAGIPFSAYSAQSLQHMQYNAQYSSASTPQYPTAHPLVQAQ
210 220 230 240 250 260
290
pF1KE3 PEAAPL
CCDS13 QWTW
270
>>CCDS42855.1 4 gene_id:644524|Hs108|chr20 (354 aa)
initn: 519 init1: 342 opt: 474 Z-score: 384.9 bits: 79.5 E(32554): 4.1e-15
Smith-Waterman score: 474; 51.5% identity (66.7% similar) in 171 aa overlap (129-291:174-336)
100 110 120 130 140 150
pF1KE3 DSCSEPKEHEEEPEVVRDRSQKSCQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQA
.. : .:: . : ::: :::::::
CCDS42 RYSSISRFMGPSAGVNVAGMGSLTGIADAAKSLGPLHAAAAAAAP----RRKRRVLFSQA
150 160 170 180 190
160 170 180 190 200 210
pF1KE3 QVFELERRFKQQRYLSAPEREHLASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLE-----
::.:::::::::.:::::::::::: ..:: :::::::::.::: ::: .::. .
CCDS42 QVYELERRFKQQKYLSAPEREHLASMIHLTPTQVKIWFQNHRYKMKRQAKDKAAQQLQQE
200 210 220 230 240 250
220 230 240 250 260 270
pF1KE3 --LGAHAPPPP-PRRVAVPVLVRDGKPCVTPSAQAYGAPYSVGASAYSYNSFPAYGYGNS
:: :::: ::::::::::.::::: .. . .: ..: . . . :: .
CCDS42 GGLGPPPPPPPSPRRVAVPVLVKDGKPC--QNGASTPTPGQAGPQPPAPT--PAPELEEL
260 270 280 290 300 310
280 290
pF1KE3 AAGPPRRPLPCSPPAARPEAAPL
. .:: : . :: ::
CCDS42 SPSPPALHGPGGGLAALDAAAGEYSGGVLGANLLYGRTW
320 330 340 350
>>CCDS9660.1 8 gene_id:26257|Hs108|chr14 (239 aa)
initn: 390 init1: 293 opt: 428 Z-score: 351.1 bits: 72.6 E(32554): 3.1e-13
Smith-Waterman score: 446; 45.9% identity (63.9% similar) in 205 aa overlap (103-283:31-232)
80 90 100 110 120
pF1KE3 LNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVV---RDRSQKSCQLKKSLE
::. . .: .. .:.. . ..:::
CCDS96 MATSGRLSFTVRSLLDLPEQDAQHLPRREPEPRAPQPDPCAAWLDSERGHYPSSDESSLE
10 20 30 40 50 60
130 140 150 160 170 180
pF1KE3 TAGDCKAAEESERP-KPRS----RRKPRVLFSQAQVFELERRFKQQRYLSAPEREHLASS
:. .. . : :: .: : :.: :::::.::..::::::.:::::::::::.:::
CCDS96 TSPPDSSQRPSARPASPGSDAEKRKKRRVLFSKAQTLELERRFRQQRYLSAPEREQLASL
70 80 90 100 110 120
190 200 210 220 230
pF1KE3 LKLTSTQVKIWFQNRRYKCKRQRQD---KSLELGA----HAPPPPPRRVAVPVLVRDGKP
:.:: :::::::::.::: :: : .: .:.: :: : :::.::::::::.:
CCDS96 LRLTPTQVKIWFQNHRYKLKRARAPGAAESPDLAASAELHAAPGLLRRVVVPVLVRDGQP
130 140 150 160 170 180
240 250 260 270 280
pF1KE3 CV--------TPSAQAY-GAPYSVGASAYSYNSFPAYGYGNSAAGPPRRPLPCSPPAARP
: : .:: ::: :.: ..::.: :.. . : ::
CCDS96 CGGGGGGEVGTAAAQEKCGAP---PAAACPLPGYPAFGPGSALGLFPAYQHLASPALVSW
190 200 210 220 230
290
pF1KE3 EAAPL
CCDS96 NW
>>CCDS3410.1 2 gene_id:579|Hs108|chr4 (333 aa)
initn: 348 init1: 295 opt: 374 Z-score: 307.1 bits: 65.0 E(32554): 8.7e-11
Smith-Waterman score: 374; 48.3% identity (67.6% similar) in 145 aa overlap (131-272:190-328)
110 120 130 140 150 160
pF1KE3 CSEPKEHEEEPEVVRDRSQKSCQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQAQV
:: . :: ::::..:. :. ::.:::
CCDS34 PRTEDDGVGPRGAHVSALCSGAGGGGGSGPAGVAEEEEEPAAPKPRKKRS-RAAFSHAQV
160 170 180 190 200 210
170 180 190 200 210 220
pF1KE3 FELERRFKQQRYLSAPEREHLASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPP
:::::::..:::::.::: ::.::::: ::::::::::::: ::... .: :
CCDS34 FELERRFNHQRYLSGPERADLAASLKLTETQVKIWFQNRRYKTKRRQMAADLL----ASA
220 230 240 250 260 270
230 240 250 260 270
pF1KE3 PPPRRVAVPVLVRDGKPCVTPSAQAYGAPYSVGASA---YSYNSFPAYGYGNSAAGPPRR
: ..::: ::::: . :. .. : . . : : .:... .. ::
CCDS34 PAAKKVAVKVLVRDDQRQYLPG-EVLRPPSLLPLQPSYYYPYYCLPGWALSTCAAAAGTQ
280 290 300 310 320 330
280 290
pF1KE3 PLPCSPPAARPEAAPL
293 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 22:33:24 2016 done: Sat Nov 5 22:33:25 2016
Total Scan time: 2.600 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]
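
The "The best scores are:" table in this report can be read programmatically. The following is a minimal Python sketch, not part of the FASTA package: the names `Hit`, `HIT_LINE`, and `parse_best_scores` are hypothetical, and the regular expression assumes each summary line has the layout `<accession> <description> ( <length>) <opt> <bits> <E>` seen above. Reports produced with other FASTA options may need a different pattern.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of the best-scores table (hypothetical container)."""
    accession: str
    description: str
    length: int      # library sequence length, from "( 364)"
    opt: int         # optimized score
    bits: float      # bit score
    evalue: float    # E() value for the searched library


# Assumed row layout: accession, free-text description, "( len)", opt, bits, E.
HIT_LINE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<e>\S+)\s*$"
)


def parse_best_scores(report: str) -> List[Hit]:
    """Collect rows between the table header and the first '>>' alignment."""
    hits: List[Hit] = []
    in_table = False
    for line in report.splitlines():
        if line.startswith("The best scores are:"):
            in_table = True
            continue
        if not in_table:
            continue
        if line.startswith(">>"):
            break  # the per-hit alignment section begins here
        m = HIT_LINE.match(line.strip())
        if m:
            hits.append(Hit(m["acc"], m["desc"], int(m["len"]),
                            int(m["opt"]), float(m["bits"]), float(m["e"])))
    return hits


# Sample rows copied from the report above.
sample = """\
The best scores are: opt bits E(32554)
CCDS41558.1 3 gene_id:159296|Hs108|chr10 ( 364) 1860 279.7 2.1e-75
CCDS4387.1 5 gene_id:1482|Hs108|chr5 ( 324) 724 115.5 5.1e-26
CCDS3410.1 2 gene_id:579|Hs108|chr4 ( 333) 374 65.0 8.7e-11
>>CCDS41558.1 3 gene_id:159296|Hs108|chr10 (364 aa)
"""

hits = parse_best_scores(sample)
for hit in hits:
    print(hit.accession, hit.length, hit.opt, hit.bits, hit.evalue)
```

The parser stops at the first line that starts with `>>`, so the alignment blocks are never mistaken for table rows. Matching on whitespace rather than fixed columns keeps it usable whether or not the column padding of the report has been preserved.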