FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1775, 282 aa
1>>>pF1KE1775 282 - 282 aa - 282 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.0848+/-0.000697; mu= 17.6832+/- 0.042
mean_var=75.3109+/-15.521, 0's: 0 Z-trim(111.6): 15 B-trim: 1063 in 2/50
Lambda= 0.147790
statistics sampled from 12476 (12487) to 12476 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.744), E-opt: 0.2 (0.384), width: 16
Scan time: 2.670
The best scores are:                               opt  bits E(32554)
CCDS11343.1 PNMT gene_id:5409|Hs108|chr17  ( 282) 1928 419.7 1.2e-117
CCDS5430.1 INMT gene_id:11185|Hs108|chr7   ( 263)  602 136.9 1.4e-32
CCDS56479.1 INMT gene_id:11185|Hs108|chr7  ( 262)  593 135.0 5.5e-32
CCDS8368.1 NNMT gene_id:4837|Hs108|chr11   ( 264)  592 134.8 6.4e-32
>>CCDS11343.1 PNMT gene_id:5409|Hs108|chr17 (282 aa)
initn: 1928 init1: 1928 opt: 1928 Z-score: 2225.7 bits: 419.7 E(32554): 1.2e-117
Smith-Waterman score: 1928; 100.0% identity (100.0% similar) in 282 aa overlap (1-282:1-282)
               10        20        30        40        50        60
pF1KE1 MSGADRSPNAGAAPDSAPGQAAVASAYQRFEPRAYLRNNYAPPRGDLCNPNGVGPWKLRC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MSGADRSPNAGAAPDSAPGQAAVASAYQRFEPRAYLRNNYAPPRGDLCNPNGVGPWKLRC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 LAQTFATGEVSGRTLIDIGSGPTVYQLLSACSHFEDITMTDFLEVNRQELGRWLQEEPGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 LAQTFATGEVSGRTLIDIGSGPTVYQLLSACSHFEDITMTDFLEVNRQELGRWLQEEPGA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 FNWSMYSQHACLIEGKGECWQDKERQLRARVKRVLPIDVHQPQPLGAGSPAPLPADALVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 FNWSMYSQHACLIEGKGECWQDKERQLRARVKRVLPIDVHQPQPLGAGSPAPLPADALVS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 AFCLEAVSPDLASFQRALDHITTLLRPGGHLLLIGALEESWYLAGEARLTVVPVSEEEVR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 AFCLEAVSPDLASFQRALDHITTLLRPGGHLLLIGALEESWYLAGEARLTVVPVSEEEVR
              190       200       210       220       230       240

              250       260       270       280
pF1KE1 EALVRSGYKVRDLRTYIMPAHLQTGVDDVKGVFFAWAQKVGL
       ::::::::::::::::::::::::::::::::::::::::::
CCDS11 EALVRSGYKVRDLRTYIMPAHLQTGVDDVKGVFFAWAQKVGL
              250       260       270       280
>>CCDS5430.1 INMT gene_id:11185|Hs108|chr7 (263 aa)
initn: 566 init1: 429 opt: 602 Z-score: 698.1 bits: 136.9 E(32554): 1.4e-32
Smith-Waterman score: 602; 39.2% identity (70.6% similar) in 255 aa overlap (27-279:11-259)
10 20 30 40 50
pF1KE1 MSGADRSPNAGAAPDSAPGQAAVASAYQR-FEPRAYLRNNYAPPRGDLCNPNGVGPWKLR
::. : :: :: . :. :. . ..:.
CCDS54 MKGGFTGGDEYQKHFLPRDYLATYYSFD-GSPSPEAEMLKFNLE
10 20 30 40
60 70 80 90 100 110
pF1KE1 CLAQTFATGEVSGRTLIDIGSGPTVYQLLSACSHFEDITMTDFLEVNRQELGRWLQEEPG
:: .::. : ..: ::::::::::.::.:.::. :.:::..:: . ::.:: .::..:::
CCDS54 CLHKTFGPGGLQGDTLIDIGSGPTIYQVLAACDSFQDITLSDFTDRNREELEKWLKKEPG
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE1 AFNWSMYSQHACLIEGKGECWQDKERQLRARVKRVLPIDVHQPQPLGAGSPAPLP-ADAL
:..:. . :: .::.. :..::..::: ::::: ::: .:: .:: :: :: .
CCDS54 AYDWTPAVKFACELEGNSGRWEEKEEKLRAAVKRVLKCDVHLGNPL---APAVLPLADCV
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE1 VSAFCLEAVSPDLASFQRALDHITTLLRPGGHLLLIGALEESWYLAGEARLTVVPVSEEE
.. . .: . .: ... :: ....::.:::::. .:. :..:. ... : . .::
CCDS54 LTLLAMECACCSLDAYRAALCNLASLLKPGGHLVTTVTLRLPSYMVGKREFSCVALEKEE
170 180 190 200 210 220
240 250 260 270 280
pF1KE1 VREALVRSGYKVRDLRTYIMPAHLQTGVDDVKGVFFAWAQKVGL
:..:.. .:. ...: . . ..... :: : :.:
CCDS54 VEQAVLDAGFDIEQLLHSPQSYSVTNAANN--GVCFIVARKKPGP
230 240 250 260
>>CCDS56479.1 INMT gene_id:11185|Hs108|chr7 (262 aa)
initn: 525 init1: 388 opt: 593 Z-score: 687.8 bits: 135.0 E(32554): 5.5e-32
Smith-Waterman score: 593; 39.2% identity (70.6% similar) in 255 aa overlap (27-279:11-258)
10 20 30 40 50
pF1KE1 MSGADRSPNAGAAPDSAPGQAAVASAYQR-FEPRAYLRNNYAPPRGDLCNPNGVGPWKLR
::. : :: :: . :. :. . ..:.
CCDS56 MKGGFTGGDEYQKHFLPRDYLATYYSFD-GSPSPEAEMLKFNLE
10 20 30 40
60 70 80 90 100 110
pF1KE1 CLAQTFATGEVSGRTLIDIGSGPTVYQLLSACSHFEDITMTDFLEVNRQELGRWLQEEPG
:: .::. : ..: ::::::::::.::.:.::. :.:::..:: . ::.:: .::..:::
CCDS56 CLHKTFGPG-LQGDTLIDIGSGPTIYQVLAACDSFQDITLSDFTDRNREELEKWLKKEPG
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE1 AFNWSMYSQHACLIEGKGECWQDKERQLRARVKRVLPIDVHQPQPLGAGSPAPLP-ADAL
:..:. . :: .::.. :..::..::: ::::: ::: .:: .:: :: :: .
CCDS56 AYDWTPAVKFACELEGNSGRWEEKEEKLRAAVKRVLKCDVHLGNPL---APAVLPLADCV
110 120 130 140 150
180 190 200 210 220 230
pF1KE1 VSAFCLEAVSPDLASFQRALDHITTLLRPGGHLLLIGALEESWYLAGEARLTVVPVSEEE
.. . .: . .: ... :: ....::.:::::. .:. :..:. ... : . .::
CCDS56 LTLLAMECACCSLDAYRAALCNLASLLKPGGHLVTTVTLRLPSYMVGKREFSCVALEKEE
160 170 180 190 200 210
240 250 260 270 280
pF1KE1 VREALVRSGYKVRDLRTYIMPAHLQTGVDDVKGVFFAWAQKVGL
:..:.. .:. ...: . . ..... :: : :.:
CCDS56 VEQAVLDAGFDIEQLLHSPQSYSVTNAANN--GVCFIVARKKPGP
220 230 240 250 260
>>CCDS8368.1 NNMT gene_id:4837|Hs108|chr11 (264 aa)
initn: 583 init1: 345 opt: 592 Z-score: 686.6 bits: 134.8 E(32554): 6.4e-32
Smith-Waterman score: 592; 39.7% identity (68.7% similar) in 252 aa overlap (30-280:15-260)
10 20 30 40 50
pF1KE1 MSGADRSPNAGAAPDSAPGQAAVASAYQRFEPRAYLRNNYAPPRGDLCNPNG-VGPWKLR
:.:: ::.. : :. . .. . :.
CCDS83 MESGFTSKDTYLSHFNPRDYLEKYY--KFGSRHSAESQILKHLLK
10 20 30 40
60 70 80 90 100 110
pF1KE1 CLAQTFATGEVSGRTLIDIGSGPTVYQLLSACSHFEDITMTDFLEVNRQELGRWLQEEPG
: . : :.: :::::::::.::::::: :..:..::. . : ::: .::..::
CCDS83 NLFKIFCLDGVKGDLLIDIGSGPTIYQLLSACESFKEIVVTDYSDQNLQELEKWLKKEPE
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE1 AFNWSMYSQHACLIEGKGECWQDKERQLRARVKRVLPIDVHQPQPLGAGSPAPLPADALV
::.:: ..: .::. .::..:: ::.:: :: : ::::: : : ::: ..
CCDS83 AFDWSPVVTYVCDLEGNRVKGPEKEEKLRQAVKQVLKCDVTQSQPLGA-VPLP-PADCVL
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE1 SAFCLEAVSPDLASFQRALDHITTLLRPGGHLLLIGALEESWYLAGEARLTVVPVSEEEV
:..::.:. ::: .. ::: .. .::.::: :... ::. :.:. :: ... .:...: :
CCDS83 STLCLDAACPDLPTYCRALRNLGSLLKPGGFLVIMDALKSSYYMIGEQKFSSLPLGREAV
170 180 190 200 210 220
240 250 260 270 280
pF1KE1 REALVRSGYKVRDLRTYIMPAHLQTGVDDVKGVFFAWAQKVGL
. :. ..:: .. .. .. .. . . .:.: :.:.
CCDS83 EAAVKEAGYTIEWFE--VISQSYSSTMANNEGLFSLVARKLSRPL
230 240 250 260
282 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 13:51:02 2016 done: Sun Nov 6 13:51:02 2016
Total Scan time: 2.670 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]