FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5231, 225 aa
1>>>pF1KE5231 225 - 225 aa - 225 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.2872+/-0.000813; mu= 11.1033+/- 0.049
mean_var=129.7660+/-26.664, 0's: 0 Z-trim(111.4): 194 B-trim: 0 in 0/51
Lambda= 0.112588
statistics sampled from 12127 (12335) to 12127 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.741), E-opt: 0.2 (0.379), width: 16
Scan time: 1.810
The best scores are:                                      opt  bits E(32554)
CCDS14106.1 SHOX gene_id:6473|Hs108|chrX          ( 225) 1528 258.7 2.2e-69
CCDS14106.1 SHOX gene_id:6473|Hs108|chrY          ( 225) 1528 258.7 2.2e-69
CCDS14107.1 SHOX gene_id:6473|Hs108|chrX          ( 292) 1417 240.8   7e-64
CCDS14107.1 SHOX gene_id:6473|Hs108|chrY          ( 292) 1417 240.8   7e-64
CCDS43164.1 SHOX2 gene_id:6474|Hs108|chr3         ( 331)  730 129.3   3e-30
CCDS54664.1 SHOX2 gene_id:6474|Hs108|chr3         ( 319)  717 127.1 1.2e-29
CCDS33884.2 SHOX2 gene_id:6474|Hs108|chr3         ( 355)  710 126.0   3e-29
>>CCDS14106.1 SHOX gene_id:6473|Hs108|chrX (225 aa)
initn: 1528 init1: 1528 opt: 1528 Z-score: 1359.3 bits: 258.7 E(32554): 2.2e-69
Smith-Waterman score: 1528; 100.0% identity (100.0% similar) in 225 aa overlap (1-225:1-225)
               10        20        30        40        50        60
pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE5 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE5 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH
              130       140       150       160       170       180
              190       200       210       220
pF1KE5 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA
       :::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA
              190       200       210       220
>>CCDS14106.1 SHOX gene_id:6473|Hs108|chrY (225 aa)
initn: 1528 init1: 1528 opt: 1528 Z-score: 1359.3 bits: 258.7 E(32554): 2.2e-69
Smith-Waterman score: 1528; 100.0% identity (100.0% similar) in 225 aa overlap (1-225:1-225)
               10        20        30        40        50        60
pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE5 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE5 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH
              130       140       150       160       170       180
              190       200       210       220
pF1KE5 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA
       :::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA
              190       200       210       220
>>CCDS14107.1 SHOX gene_id:6473|Hs108|chrX (292 aa)
initn: 1465 init1: 1417 opt: 1417 Z-score: 1260.4 bits: 240.8 E(32554): 7e-64
Smith-Waterman score: 1417; 99.1% identity (100.0% similar) in 213 aa overlap (1-213:1-213)
               10        20        30        40        50        60
pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE5 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE5 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH
              130       140       150       160       170       180
              190       200       210       220
pF1KE5 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA
       :::::::::::::::::::::::::::::::..
CCDS14 KGVILGTANHLDACRVAPYVNMGALRMPFQQVQAQLQLEGVAHAHPHLHPHLAAHAPYLM
              190       200       210       220       230       240
CCDS14 FPPPPFGLPIASLAESASAAAVVAAAAKSNSKNSSIADLRLKARKHAEALGL
              250       260       270       280       290
>>CCDS14107.1 SHOX gene_id:6473|Hs108|chrY (292 aa)
initn: 1465 init1: 1417 opt: 1417 Z-score: 1260.4 bits: 240.8 E(32554): 7e-64
Smith-Waterman score: 1417; 99.1% identity (100.0% similar) in 213 aa overlap (1-213:1-213)
               10        20        30        40        50        60
pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE5 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE5 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH
              130       140       150       160       170       180
              190       200       210       220
pF1KE5 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA
       :::::::::::::::::::::::::::::::..
CCDS14 KGVILGTANHLDACRVAPYVNMGALRMPFQQVQAQLQLEGVAHAHPHLHPHLAAHAPYLM
              190       200       210       220       230       240
CCDS14 FPPPPFGLPIASLAESASAAAVVAAAAKSNSKNSSIADLRLKARKHAEALGL
              250       260       270       280       290
>>CCDS43164.1 SHOX2 gene_id:6474|Hs108|chr3 (331 aa)
initn: 880 init1: 676 opt: 730 Z-score: 656.6 bits: 129.3 E(32554): 3e-30
Smith-Waterman score: 800; 55.7% identity (69.0% similar) in 255 aa overlap (1-219:1-242)
10 20 30 40 50
pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARS-------RELGTSD
::::::::::::::: :. ::..:::::::::: :. : : .:
CCDS43 MEELTAFVSKSFDQKVKE----------KKEAITYREVLESGPLRGAKEPTGCTEAGRDD
10 20 30 40 50
60 70 80
pF1KE5 SSLQDITE-----------------------------GGGHCPVHLFKDHVDNDKEKLKE
: . :::. ::. . : . :. .:
CCDS43 RSSPAVRAAGGGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVREL-DM--GAAERSRE
60 70 80 90 100
90 100 110 120 130 140
pF1KE5 FGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPDA
:. :..: : :...::.:. ...::::.:::::::::::::::::::::::::::::
CCDS43 PGSPRLTEVSPELKDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPDA
110 120 130 140 150 160
150 160 170 180 190 200
pF1KE5 FMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMGA
::::::::::::::::::::::::::::::::::.::::..:.:....:::::::::.::
CCDS43 FMREELSQRLGLSEARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVGA
170 180 190 200 210 220
210 220
pF1KE5 LRMPFQQMEFCSCRPGWSIMA
::::::: :. :
CCDS43 LRMPFQQDSHCNVTPLSFQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPLA
230 240 250 260 270 280
>>CCDS54664.1 SHOX2 gene_id:6474|Hs108|chr3 (319 aa)
initn: 865 init1: 661 opt: 717 Z-score: 645.4 bits: 127.1 E(32554): 1.2e-29
Smith-Waterman score: 787; 56.2% identity (70.3% similar) in 249 aa overlap (1-213:1-236)
10 20 30 40 50
pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARS-------RELGTSD
::::::::::::::: :. ::..:::::::::: :. : : .:
CCDS54 MEELTAFVSKSFDQKVKE----------KKEAITYREVLESGPLRGAKEPTGCTEAGRDD
10 20 30 40 50
60 70 80
pF1KE5 SSLQDITE-----------------------------GGGHCPVHLFKDHVDNDKEKLKE
: . :::. ::. . : . :. .:
CCDS54 RSSPAVRAAGGGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVREL-DM--GAAERSRE
60 70 80 90 100
90 100 110 120 130 140
pF1KE5 FGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPDA
:. :..: : :...::.:. ...::::.:::::::::::::::::::::::::::::
CCDS54 PGSPRLTEVSPELKDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPDA
110 120 130 140 150 160
150 160 170 180 190 200
pF1KE5 FMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMGA
::::::::::::::::::::::::::::::::::.::::..:.:....:::::::::.::
CCDS54 FMREELSQRLGLSEARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVGA
170 180 190 200 210 220
210 220
pF1KE5 LRMPFQQMEFCSCRPGWSIMA
:::::::..
CCDS54 LRMPFQQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPLATLAADSASAASV
230 240 250 260 270 280
>>CCDS33884.2 SHOX2 gene_id:6474|Hs108|chr3 (355 aa)
initn: 880 init1: 676 opt: 710 Z-score: 638.7 bits: 126.0 E(32554): 3e-29
Smith-Waterman score: 726; 61.4% identity (77.2% similar) in 202 aa overlap (19-219:80-266)
10 20 30 40
pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLA-RSR
:.:.:::.:: .. . :: :. : : :::
CCDS33 DRSSPAVRAAGGGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPV--RE-LDMGAAERSR
50 60 70 80 90 100
50 60 70 80 90 100
pF1KE5 ELGTSDSSLQDITEGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSE
: :. .::: . :. : .: . : : : : :...::.:.
CCDS33 EPGSPR-----LTEGRRK-PT---KAEV---QATLLLPGEAFRFLVSPELKDRKEDAKGM
110 120 130 140 150
110 120 130 140 150 160
pF1KE5 DEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQN
...::::.::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 EDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQN
160 170 180 190 200 210
170 180 190 200 210 220
pF1KE5 RRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA
:::::::::::.::::..:.:....:::::::::.::::::::: :. :
CCDS33 RRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVGALRMPFQQDSHCNVTPLSFQVQAQ
220 230 240 250 260 270
CCDS33 LQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPLATLAADSASAASVVAAAAAAKTTS
280 290 300 310 320 330
225 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 22:43:18 2016 done: Mon Nov 7 22:43:18 2016
Total Scan time: 1.810 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]