FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KSDA0174, 364 aa
1>>>pF1KSDA0174 364 - 364 aa - 364 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.1782+/-0.000992; mu= 0.5002+/- 0.060
mean_var=258.0967+/-52.282, 0's: 0 Z-trim(113.5): 13 B-trim: 0 in 0/53
Lambda= 0.079833
statistics sampled from 14168 (14175) to 14168 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.762), E-opt: 0.2 (0.435), width: 16
Scan time: 2.520
The best scores are: opt bits E(32554)
CCDS59272.1 IST1 gene_id:9798|Hs108|chr16 ( 366) 2393 288.4 6.7e-78
CCDS59271.1 IST1 gene_id:9798|Hs108|chr16 ( 379) 2393 288.4 6.9e-78
CCDS10905.1 IST1 gene_id:9798|Hs108|chr16 ( 360) 1863 227.3 1.6e-59
CCDS59273.1 IST1 gene_id:9798|Hs108|chr16 ( 335) 1654 203.2 2.6e-52
CCDS59274.1 IST1 gene_id:9798|Hs108|chr16 ( 218) 1468 181.6 5.4e-46
>>CCDS59272.1 IST1 gene_id:9798|Hs108|chr16 (366 aa)
initn: 1586 init1: 1550 opt: 2393 Z-score: 1512.0 bits: 288.4 E(32554): 6.7e-78
Smith-Waterman score: 2393; 99.5% identity (99.5% similar) in 366 aa overlap (1-364:1-366)
10 20 30 40 50 60
pF1KSD MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII
10 20 30 40 50 60
70 80 90 100 110 120
pF1KSD REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KSD ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD
130 140 150 160 170 180
190 200 210 220 230
pF1KSD SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMP--S
::::::::::::::::::::::::::::::::::::::::::::::::::::::::: :
CCDS59 SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPMPS
190 200 210 220 230 240
240 250 260 270 280 290
pF1KSD ANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDDINADKNISSAQIV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 ANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDDINADKNISSAQIV
250 260 270 280 290 300
300 310 320 330 340 350
pF1KSD GPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRRFE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 GPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRRFE
310 320 330 340 350 360
360
pF1KSD ELKKKT
::::::
CCDS59 ELKKKT
>>CCDS59271.1 IST1 gene_id:9798|Hs108|chr16 (379 aa)
initn: 1586 init1: 1550 opt: 2393 Z-score: 1511.8 bits: 288.4 E(32554): 6.9e-78
Smith-Waterman score: 2393; 99.5% identity (99.5% similar) in 366 aa overlap (1-364:14-379)
10 20 30 40
pF1KSD MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAG
:::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MVFKLKTKEEQHSMLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAG
10 20 30 40 50 60
50 60 70 80 90 100
pF1KSD KDERARIRVEHIIREDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 KDERARIRVEHIIREDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAA
70 80 90 100 110 120
110 120 130 140 150 160
pF1KSD PRLQSEVAELKIVADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 PRLQSEVAELKIVADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLI
130 140 150 160 170 180
170 180 190 200 210 220
pF1KSD EIAKNYNVPYEPDSVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 EIAKNYNVPYEPDSVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGT
190 200 210 220 230 240
230 240 250 260 270 280
pF1KSD VPMPMPMPMP--SANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDD
:::::::::: ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 VPMPMPMPMPMPSANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDD
250 260 270 280 290 300
290 300 310 320 330 340
pF1KSD INADKNISSAQIVGPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 INADKNISSAQIVGPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSAS
310 320 330 340 350 360
350 360
pF1KSD EDIDFDDLSRRFEELKKKT
:::::::::::::::::::
CCDS59 EDIDFDDLSRRFEELKKKT
370
>>CCDS10905.1 IST1 gene_id:9798|Hs108|chr16 (360 aa)
initn: 1572 init1: 1550 opt: 1863 Z-score: 1182.2 bits: 227.3 E(32554): 1.6e-59
Smith-Waterman score: 1863; 92.6% identity (94.5% similar) in 310 aa overlap (1-307:1-310)
10 20 30 40 50 60
pF1KSD MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII
10 20 30 40 50 60
70 80 90 100 110 120
pF1KSD REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KSD ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD
130 140 150 160 170 180
190 200 210 220 230
pF1KSD SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMP--S
::::::::::::::::::::::::::::::::::::::::::::::::::::::::: :
CCDS10 SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPMPS
190 200 210 220 230 240
240 250 260 270 280 290
pF1KSD ANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDDINADKNISSAQIV
::::::::::::::::::::::::::::::::::::::::::::. . .. .:
CCDS10 ANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESMTLMLIRISLLHRLLV
250 260 270 280 290 300
300 310 320 330 340 350
pF1KSD -GPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRRF
:. :: :
CCDS10 LDPSQKPLQSFLPDLQITMTTLSYQSCHLCQTHYQLHLLVPAPQHLKTLTLMIFPGGLKS
310 320 330 340 350 360
>>CCDS59273.1 IST1 gene_id:9798|Hs108|chr16 (335 aa)
initn: 1753 init1: 1545 opt: 1654 Z-score: 1052.5 bits: 203.2 E(32554): 2.6e-52
Smith-Waterman score: 2094; 91.0% identity (91.0% similar) in 366 aa overlap (1-364:1-335)
10 20 30 40 50 60
pF1KSD MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII
10 20 30 40 50 60
70 80 90 100 110 120
pF1KSD REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KSD ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD
130 140 150 160 170 180
190 200 210 220 230
pF1KSD SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMP--S
::::::::::::::::::::::::::::::::::::::::::::::::::::::::: :
CCDS59 SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPMPS
190 200 210 220 230 240
240 250 260 270 280 290
pF1KSD ANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDDINADKNISSAQIV
::::::::::::: ::::::::::::::::
CCDS59 ANTPFSYPLPKGP-------------------------------VDDINADKNISSAQIV
250 260
300 310 320 330 340 350
pF1KSD GPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRRFE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 GPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRRFE
270 280 290 300 310 320
360
pF1KSD ELKKKT
::::::
CCDS59 ELKKKT
330
>>CCDS59274.1 IST1 gene_id:9798|Hs108|chr16 (218 aa)
initn: 941 init1: 941 opt: 1468 Z-score: 939.2 bits: 181.6 E(32554): 5.4e-46
Smith-Waterman score: 1468; 99.1% identity (99.1% similar) in 218 aa overlap (149-364:1-218)
120 130 140 150 160 170
pF1KSD IVADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYE
::::::::::::::::::::::::::::::
CCDS59 MHKLSVEAPPKILVERYLIEIAKNYNVPYE
10 20 30
180 190 200 210 220 230
pF1KSD PDSVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMP-
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 PDSVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPM
40 50 60 70 80 90
240 250 260 270 280 290
pF1KSD -SANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDDINADKNISSAQ
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 PSANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDDINADKNISSAQ
100 110 120 130 140 150
300 310 320 330 340 350
pF1KSD IVGPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 IVGPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRR
160 170 180 190 200 210
360
pF1KSD FEELKKKT
::::::::
CCDS59 FEELKKKT
364 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 00:28:50 2016 done: Thu Nov 3 00:28:50 2016
Total Scan time: 2.520 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]