FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6666, 290 aa
1>>>pF1KE6666 290 - 290 aa - 290 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2823+/-0.000863; mu= 15.1211+/- 0.051
mean_var=59.6192+/-12.026, 0's: 0 Z-trim(105.1): 16 B-trim: 148 in 1/51
Lambda= 0.166104
statistics sampled from 8227 (8232) to 8227 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.638), E-opt: 0.2 (0.253), width: 16
Scan time: 2.240
The best scores are: opt bits E(32554)
CCDS6008.1 NAT2 gene_id:10|Hs108|chr8 ( 290) 1936 472.3 1.8e-133
CCDS6007.1 NAT1 gene_id:9|Hs108|chr8 ( 290) 1594 390.4 8.5e-109
CCDS55205.1 NAT1 gene_id:9|Hs108|chr8 ( 352) 1594 390.4 1e-108
>>CCDS6008.1 NAT2 gene_id:10|Hs108|chr8 (290 aa)
initn: 1936 init1: 1936 opt: 1936 Z-score: 2509.7 bits: 472.3 E(32554): 1.8e-133
Smith-Waterman score: 1936; 99.3% identity (100.0% similar) in 290 aa overlap (1-290:1-290)
10 20 30 40 50 60
pF1KE6 MDIEAYFERIGYKNSRNKLDLETLTDILEHQIRAVPFENLNMHCGQAMELGLEAIFDHIV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 MDIEAYFERIGYKNSRNKLDLETLTDILEHQIRAVPFENLNMHCGQAMELGLEAIFDHIV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 RRNRGGWCLQVNQLLYWALTTIGFQTTMLGGYFYIPPVNKYSTGMVHLLLQVTIDGRNYI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 RRNRGGWCLQVNQLLYWALTTIGFQTTMLGGYFYIPPVNKYSTGMVHLLLQVTIDGRNYI
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 VDAGSGSSSQMWQPLELISGKDQPQVPCIFCLTEERGIWYLDQIRREQYITNKEFLNSHL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 VDAGSGSSSQMWQPLELISGKDQPQVPCIFCLTEERGIWYLDQIRREQYITNKEFLNSHL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 LPKKKHQKIYLFTLEPQTIEDFESMNTYLQTSPTSSFITTSFCSLQTPEGVYCLVGFILT
::::::::::::::::.:::::::::::::::::::::::::::::::::::::::::::
CCDS60 LPKKKHQKIYLFTLEPRTIEDFESMNTYLQTSPTSSFITTSFCSLQTPEGVYCLVGFILT
190 200 210 220 230 240
250 260 270 280 290
pF1KE6 YRKFNYKDNTDLVEFKTLTEEEVEEVLKNIFKISLGRNLVPKPGDGSLTI
:::::::::::::::::::::::::::.::::::::::::::::::::::
CCDS60 YRKFNYKDNTDLVEFKTLTEEEVEEVLRNIFKISLGRNLVPKPGDGSLTI
250 260 270 280 290
>>CCDS6007.1 NAT1 gene_id:9|Hs108|chr8 (290 aa)
initn: 1594 init1: 1594 opt: 1594 Z-score: 2066.8 bits: 390.4 E(32554): 8.5e-109
Smith-Waterman score: 1594; 80.7% identity (92.8% similar) in 290 aa overlap (1-290:1-290)
10 20 30 40 50 60
pF1KE6 MDIEAYFERIGYKNSRNKLDLETLTDILEHQIRAVPFENLNMHCGQAMELGLEAIFDHIV
::::::.::::::.::::::::::::::.::::::::::::.:::.::.::::::::..:
CCDS60 MDIEAYLERIGYKKSRNKLDLETLTDILQHQIRAVPFENLNIHCGDAMDLGLEAIFDQVV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 RRNRGGWCLQVNQLLYWALTTIGFQTTMLGGYFYIPPVNKYSTGMVHLLLQVTIDGRNYI
::::::::::::.:::::::::::.::::::: : :..::::::.::::::::::::::
CCDS60 RRNRGGWCLQVNHLLYWALTTIGFETTMLGGYVYSTPAKKYSTGMIHLLLQVTIDGRNYI
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 VDAGSGSSSQMWQPLELISGKDQPQVPCIFCLTEERGIWYLDQIRREQYITNKEFLNSHL
:::: : : :::::::::::::::::::.: :::: :.:::::::::::: :.:::.: :
CCDS60 VDAGFGRSYQMWQPLELISGKDQPQVPCVFRLTEENGFWYLDQIRREQYIPNEEFLHSDL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 LPKKKHQKIYLFTLEPQTIEDFESMNTYLQTSPTSSFITTSFCSLQTPEGVYCLVGFILT
: .:..::: :::.:.::::::::::::::::.: : . ::::::::.::.::::: ::
CCDS60 LEDSKYRKIYSFTLKPRTIEDFESMNTYLQTSPSSVFTSKSFCSLQTPDGVHCLVGFTLT
190 200 210 220 230 240
250 260 270 280 290
pF1KE6 YRKFNYKDNTDLVEFKTLTEEEVEEVLKNIFKISLGRNLVPKPGDGSLTI
.:.:::::::::.:::::.:::.:.::::::.::: :.:::: :: .::
CCDS60 HRRFNYKDNTDLIEFKTLSEEEIEKVLKNIFNISLQRKLVPKHGDRFFTI
250 260 270 280 290
>>CCDS55205.1 NAT1 gene_id:9|Hs108|chr8 (352 aa)
initn: 1594 init1: 1594 opt: 1594 Z-score: 2065.4 bits: 390.4 E(32554): 1e-108
Smith-Waterman score: 1594; 80.7% identity (92.8% similar) in 290 aa overlap (1-290:63-352)
10 20 30
pF1KE6 MDIEAYFERIGYKNSRNKLDLETLTDILEH
::::::.::::::.::::::::::::::.:
CCDS55 SGIQARKKQQSVFWIKTEDQPTFNLLRKGIMDIEAYLERIGYKKSRNKLDLETLTDILQH
40 50 60 70 80 90
40 50 60 70 80 90
pF1KE6 QIRAVPFENLNMHCGQAMELGLEAIFDHIVRRNRGGWCLQVNQLLYWALTTIGFQTTMLG
:::::::::::.:::.::.::::::::..:::::::::::::.:::::::::::.:::::
CCDS55 QIRAVPFENLNIHCGDAMDLGLEAIFDQVVRRNRGGWCLQVNHLLYWALTTIGFETTMLG
100 110 120 130 140 150
100 110 120 130 140 150
pF1KE6 GYFYIPPVNKYSTGMVHLLLQVTIDGRNYIVDAGSGSSSQMWQPLELISGKDQPQVPCIF
:: : :..::::::.:::::::::::::::::: : : :::::::::::::::::::.:
CCDS55 GYVYSTPAKKYSTGMIHLLLQVTIDGRNYIVDAGFGRSYQMWQPLELISGKDQPQVPCVF
160 170 180 190 200 210
160 170 180 190 200 210
pF1KE6 CLTEERGIWYLDQIRREQYITNKEFLNSHLLPKKKHQKIYLFTLEPQTIEDFESMNTYLQ
:::: :.:::::::::::: :.:::.: :: .:..::: :::.:.:::::::::::::
CCDS55 RLTEENGFWYLDQIRREQYIPNEEFLHSDLLEDSKYRKIYSFTLKPRTIEDFESMNTYLQ
220 230 240 250 260 270
220 230 240 250 260 270
pF1KE6 TSPTSSFITTSFCSLQTPEGVYCLVGFILTYRKFNYKDNTDLVEFKTLTEEEVEEVLKNI
:::.: : . ::::::::.::.::::: ::.:.:::::::::.:::::.:::.:.:::::
CCDS55 TSPSSVFTSKSFCSLQTPDGVHCLVGFTLTHRRFNYKDNTDLIEFKTLSEEEIEKVLKNI
280 290 300 310 320 330
280 290
pF1KE6 FKISLGRNLVPKPGDGSLTI
:.::: :.:::: :: .::
CCDS55 FNISLQRKLVPKHGDRFFTI
340 350
290 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 15:14:57 2016 done: Tue Nov 8 15:14:57 2016
Total Scan time: 2.240 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]