FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5231, 225 aa 1>>>pF1KE5231 225 - 225 aa - 225 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.2872+/-0.000813; mu= 11.1033+/- 0.049 mean_var=129.7660+/-26.664, 0's: 0 Z-trim(111.4): 194 B-trim: 0 in 0/51 Lambda= 0.112588 statistics sampled from 12127 (12335) to 12127 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.741), E-opt: 0.2 (0.379), width: 16 Scan time: 1.810 The best scores are: opt bits E(32554) CCDS14106.1 SHOX gene_id:6473|Hs108|chrX ( 225) 1528 258.7 2.2e-69 CCDS14106.1 SHOX gene_id:6473|Hs108|chrY ( 225) 1528 258.7 2.2e-69 CCDS14107.1 SHOX gene_id:6473|Hs108|chrX ( 292) 1417 240.8 7e-64 CCDS14107.1 SHOX gene_id:6473|Hs108|chrY ( 292) 1417 240.8 7e-64 CCDS43164.1 SHOX2 gene_id:6474|Hs108|chr3 ( 331) 730 129.3 3e-30 CCDS54664.1 SHOX2 gene_id:6474|Hs108|chr3 ( 319) 717 127.1 1.2e-29 CCDS33884.2 SHOX2 gene_id:6474|Hs108|chr3 ( 355) 710 126.0 3e-29 >>CCDS14106.1 SHOX gene_id:6473|Hs108|chrX (225 aa) initn: 1528 init1: 1528 opt: 1528 Z-score: 1359.3 bits: 258.7 E(32554): 2.2e-69 Smith-Waterman score: 1528; 100.0% identity (100.0% similar) in 225 aa overlap (1-225:1-225) 10 20 30 40 50 60 pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH 130 140 150 160 170 180 190 200 210 220 pF1KE5 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA ::::::::::::::::::::::::::::::::::::::::::::: CCDS14 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA 190 200 210 220 >>CCDS14106.1 SHOX gene_id:6473|Hs108|chrY (225 aa) initn: 1528 init1: 1528 opt: 1528 Z-score: 1359.3 bits: 258.7 E(32554): 2.2e-69 Smith-Waterman score: 1528; 100.0% identity (100.0% similar) in 225 aa overlap (1-225:1-225) 10 20 30 40 50 60 pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH 130 140 150 160 170 180 190 200 210 220 pF1KE5 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA ::::::::::::::::::::::::::::::::::::::::::::: CCDS14 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA 190 200 210 220 >>CCDS14107.1 SHOX gene_id:6473|Hs108|chrX (292 aa) initn: 1465 init1: 1417 opt: 1417 Z-score: 1260.4 bits: 240.8 E(32554): 7e-64 Smith-Waterman score: 1417; 99.1% identity (100.0% similar) in 213 aa overlap (1-213:1-213) 10 20 30 40 50 60 pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH 130 140 150 160 170 180 190 200 210 220 pF1KE5 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA :::::::::::::::::::::::::::::::.. CCDS14 KGVILGTANHLDACRVAPYVNMGALRMPFQQVQAQLQLEGVAHAHPHLHPHLAAHAPYLM 190 200 210 220 230 240 CCDS14 FPPPPFGLPIASLAESASAAAVVAAAAKSNSKNSSIADLRLKARKHAEALGL 250 260 270 280 290 >>CCDS14107.1 SHOX gene_id:6473|Hs108|chrY (292 aa) initn: 1465 init1: 1417 opt: 1417 Z-score: 1260.4 bits: 240.8 E(32554): 7e-64 Smith-Waterman score: 1417; 99.1% identity (100.0% similar) in 213 aa overlap (1-213:1-213) 10 20 30 40 50 60 pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDIT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 EGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 RTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMH 130 140 150 160 170 180 190 200 210 220 pF1KE5 KGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA :::::::::::::::::::::::::::::::.. CCDS14 KGVILGTANHLDACRVAPYVNMGALRMPFQQVQAQLQLEGVAHAHPHLHPHLAAHAPYLM 190 200 210 220 230 240 CCDS14 FPPPPFGLPIASLAESASAAAVVAAAAKSNSKNSSIADLRLKARKHAEALGL 250 260 270 280 290 >>CCDS43164.1 SHOX2 gene_id:6474|Hs108|chr3 (331 aa) initn: 880 init1: 676 opt: 730 Z-score: 656.6 bits: 129.3 E(32554): 3e-30 Smith-Waterman score: 800; 55.7% identity (69.0% similar) in 255 aa overlap (1-219:1-242) 10 20 30 40 50 pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARS-------RELGTSD ::::::::::::::: :. ::..:::::::::: :. : : .: CCDS43 MEELTAFVSKSFDQKVKE----------KKEAITYREVLESGPLRGAKEPTGCTEAGRDD 10 20 30 40 50 60 70 80 pF1KE5 SSLQDITE-----------------------------GGGHCPVHLFKDHVDNDKEKLKE : . :::. ::. . : . :. .: CCDS43 RSSPAVRAAGGGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVREL-DM--GAAERSRE 60 70 80 90 100 90 100 110 120 130 140 pF1KE5 FGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPDA :. :..: : :...::.:. ...::::.::::::::::::::::::::::::::::: CCDS43 PGSPRLTEVSPELKDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPDA 110 120 130 140 150 160 150 160 170 180 190 200 pF1KE5 FMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMGA ::::::::::::::::::::::::::::::::::.::::..:.:....:::::::::.:: CCDS43 FMREELSQRLGLSEARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVGA 170 180 190 200 210 220 210 220 pF1KE5 LRMPFQQMEFCSCRPGWSIMA ::::::: :. : CCDS43 LRMPFQQDSHCNVTPLSFQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPLA 230 240 250 260 270 280 >>CCDS54664.1 SHOX2 gene_id:6474|Hs108|chr3 (319 aa) initn: 865 init1: 661 opt: 717 Z-score: 645.4 bits: 127.1 E(32554): 1.2e-29 Smith-Waterman score: 787; 56.2% identity (70.3% similar) in 249 aa overlap (1-213:1-236) 10 20 30 40 50 pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARS-------RELGTSD ::::::::::::::: :. ::..:::::::::: :. : : .: CCDS54 MEELTAFVSKSFDQKVKE----------KKEAITYREVLESGPLRGAKEPTGCTEAGRDD 10 20 30 40 50 60 70 80 pF1KE5 SSLQDITE-----------------------------GGGHCPVHLFKDHVDNDKEKLKE : . :::. ::. . : . :. .: CCDS54 RSSPAVRAAGGGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVREL-DM--GAAERSRE 60 70 80 90 100 90 100 110 120 130 140 pF1KE5 FGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPDA :. :..: : :...::.:. ...::::.::::::::::::::::::::::::::::: CCDS54 PGSPRLTEVSPELKDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPDA 110 120 130 140 150 160 150 160 170 180 190 200 pF1KE5 FMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMGA ::::::::::::::::::::::::::::::::::.::::..:.:....:::::::::.:: CCDS54 FMREELSQRLGLSEARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVGA 170 180 190 200 210 220 210 220 pF1KE5 LRMPFQQMEFCSCRPGWSIMA :::::::.. CCDS54 LRMPFQQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPLATLAADSASAASV 230 240 250 260 270 280 >>CCDS33884.2 SHOX2 gene_id:6474|Hs108|chr3 (355 aa) initn: 880 init1: 676 opt: 710 Z-score: 638.7 bits: 126.0 E(32554): 3e-29 Smith-Waterman score: 726; 61.4% identity (77.2% similar) in 202 aa overlap (19-219:80-266) 10 20 30 40 pF1KE5 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLA-RSR :.:.:::.:: .. . :: :. : : ::: CCDS33 DRSSPAVRAAGGGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPV--RE-LDMGAAERSR 50 60 70 80 90 100 50 60 70 80 90 100 pF1KE5 ELGTSDSSLQDITEGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSE : :. .::: . :. : .: . : : : : :...::.:. CCDS33 EPGSPR-----LTEGRRK-PT---KAEV---QATLLLPGEAFRFLVSPELKDRKEDAKGM 110 120 130 140 150 110 120 130 140 150 160 pF1KE5 DEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQN ...::::.:::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 EDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQN 160 170 180 190 200 210 170 180 190 200 210 220 pF1KE5 RRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA :::::::::::.::::..:.:....:::::::::.::::::::: :. : CCDS33 RRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVGALRMPFQQDSHCNVTPLSFQVQAQ 220 230 240 250 260 270 CCDS33 LQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPLATLAADSASAASVVAAAAAAKTTS 280 290 300 310 320 330 225 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:43:18 2016 done: Mon Nov 7 22:43:18 2016 Total Scan time: 1.810 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]