FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7912, 331 aa 1>>>pF1KB7912 331 - 331 aa - 331 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.3300+/-0.00129; mu= 6.3271+/- 0.078 mean_var=380.4803+/-79.082, 0's: 0 Z-trim(112.4): 145 B-trim: 0 in 0/50 Lambda= 0.065752 statistics sampled from 13000 (13136) to 13000 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.732), E-opt: 0.2 (0.404), width: 16 Scan time: 3.060 The best scores are: opt bits E(32554) CCDS43164.1 SHOX2 gene_id:6474|Hs108|chr3 ( 331) 2199 222.3 4.3e-58 CCDS54664.1 SHOX2 gene_id:6474|Hs108|chr3 ( 319) 1614 166.8 2.1e-41 CCDS33884.2 SHOX2 gene_id:6474|Hs108|chr3 ( 355) 1479 154.0 1.6e-37 CCDS14106.1 SHOX gene_id:6473|Hs108|chrX ( 225) 730 82.7 3.1e-16 CCDS14106.1 SHOX gene_id:6473|Hs108|chrY ( 225) 730 82.7 3.1e-16 CCDS14107.1 SHOX gene_id:6473|Hs108|chrY ( 292) 722 82.1 6e-16 CCDS14107.1 SHOX gene_id:6473|Hs108|chrX ( 292) 722 82.1 6e-16 >>CCDS43164.1 SHOX2 gene_id:6474|Hs108|chr3 (331 aa) initn: 2199 init1: 2199 opt: 2199 Z-score: 1156.4 bits: 222.3 E(32554): 4.3e-58 Smith-Waterman score: 2199; 100.0% identity (100.0% similar) in 331 aa overlap (1-331:1-331) 10 20 30 40 50 60 pF1KB7 MEELTAFVSKSFDQKVKEKKEAITYREVLESGPLRGAKEPTGCTEAGRDDRSSPAVRAAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MEELTAFVSKSFDQKVKEKKEAITYREVLESGPLRGAKEPTGCTEAGRDDRSSPAVRAAG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 GGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVRELDMGAAERSREPGSPRLTEVSPEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 GGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVRELDMGAAERSREPGSPRLTEVSPEL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 KDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 KDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 EARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVGALRMPFQQDSHCNV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 EARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVGALRMPFQQDSHCNV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 TPLSFQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPLATLAADSASAASVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 TPLSFQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPLATLAADSASAASVV 250 260 270 280 290 300 310 320 330 pF1KB7 AAAAAAKTTSKNSSIADLRLKAKKHAAALGL ::::::::::::::::::::::::::::::: CCDS43 AAAAAAKTTSKNSSIADLRLKAKKHAAALGL 310 320 330 >>CCDS54664.1 SHOX2 gene_id:6474|Hs108|chr3 (319 aa) initn: 1573 init1: 1573 opt: 1614 Z-score: 856.7 bits: 166.8 E(32554): 2.1e-41 Smith-Waterman score: 2077; 96.4% identity (96.4% similar) in 331 aa overlap (1-331:1-319) 10 20 30 40 50 60 pF1KB7 MEELTAFVSKSFDQKVKEKKEAITYREVLESGPLRGAKEPTGCTEAGRDDRSSPAVRAAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MEELTAFVSKSFDQKVKEKKEAITYREVLESGPLRGAKEPTGCTEAGRDDRSSPAVRAAG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 GGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVRELDMGAAERSREPGSPRLTEVSPEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVRELDMGAAERSREPGSPRLTEVSPEL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 KDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 KDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 EARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVGALRMPFQQDSHCNV :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 EARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVGALRMPFQQ------ 190 200 210 220 230 250 260 270 280 290 300 pF1KB7 TPLSFQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPLATLAADSASAASVV :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 ------VQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPLATLAADSASAASVV 240 250 260 270 280 310 320 330 pF1KB7 AAAAAAKTTSKNSSIADLRLKAKKHAAALGL ::::::::::::::::::::::::::::::: CCDS54 AAAAAAKTTSKNSSIADLRLKAKKHAAALGL 290 300 310 >>CCDS33884.2 SHOX2 gene_id:6474|Hs108|chr3 (355 aa) initn: 1426 init1: 1426 opt: 1479 Z-score: 787.0 bits: 154.0 E(32554): 1.6e-37 Smith-Waterman score: 2141; 93.2% identity (93.2% similar) in 355 aa overlap (1-331:1-355) 10 20 30 40 50 60 pF1KB7 MEELTAFVSKSFDQKVKEKKEAITYREVLESGPLRGAKEPTGCTEAGRDDRSSPAVRAAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MEELTAFVSKSFDQKVKEKKEAITYREVLESGPLRGAKEPTGCTEAGRDDRSSPAVRAAG 10 20 30 40 50 60 70 80 90 100 110 pF1KB7 GGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVRELDMGAAERSREPGSPRLTE----- ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 GGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVRELDMGAAERSREPGSPRLTEGRRKP 70 80 90 100 110 120 120 130 140 150 pF1KB7 -------------------VSPELKDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELE ::::::::::::::::::::::::::::::::::::::::: CCDS33 TKAEVQATLLLPGEAFRFLVSPELKDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELE 130 140 150 160 170 180 160 170 180 190 200 210 pF1KB7 RLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 RLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEA 190 200 210 220 230 240 220 230 240 250 260 270 pF1KB7 CRVAPYVNVGALRMPFQQDSHCNVTPLSFQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 CRVAPYVNVGALRMPFQQDSHCNVTPLSFQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMM 250 260 270 280 290 300 280 290 300 310 320 330 pF1KB7 FPAPPFGLPLATLAADSASAASVVAAAAAAKTTSKNSSIADLRLKAKKHAAALGL ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 FPAPPFGLPLATLAADSASAASVVAAAAAAKTTSKNSSIADLRLKAKKHAAALGL 310 320 330 340 350 >>CCDS14106.1 SHOX gene_id:6473|Hs108|chrX (225 aa) initn: 880 init1: 676 opt: 730 Z-score: 404.9 bits: 82.7 E(32554): 3.1e-16 Smith-Waterman score: 800; 55.5% identity (69.1% similar) in 256 aa overlap (1-242:1-219) 10 20 30 40 50 pF1KB7 MEELTAFVSKSFDQKVKE----------KKEAITYREVLESGPLRGAKEPTGCTEAGRDD ::::::::::::::: :. ::..:::::::::: :. : : .: CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARS-------RELGTSD 10 20 30 40 50 60 70 80 90 100 pF1KB7 RSSPAVRAAGGGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVR----ELDMGAAERSR : . :::. ::. ..: . :. . CCDS14 SSLQDITE-----------------------------GGGHCPVHLFKDHVD-NDKEKLK 60 70 80 110 120 130 140 150 160 pF1KB7 EPGSPRLTEVSPELKDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPD : :. :..: : :...::.:. ...::::.:::::::::::::::::::::::::::: CCDS14 EFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPD 90 100 110 120 130 140 170 180 190 200 210 220 pF1KB7 AFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVG :::::::::::::::::::::::::::::::::::.::::..:.:....:::::::::.: CCDS14 AFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMG 150 160 170 180 190 200 230 240 250 260 270 280 pF1KB7 ALRMPFQQDSHCNVTPLSFQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPL :::::::: :. : CCDS14 ALRMPFQQMEFCSCRPGWSIMA 210 220 >>CCDS14106.1 SHOX gene_id:6473|Hs108|chrY (225 aa) initn: 880 init1: 676 opt: 730 Z-score: 404.9 bits: 82.7 E(32554): 3.1e-16 Smith-Waterman score: 800; 55.5% identity (69.1% similar) in 256 aa overlap (1-242:1-219) 10 20 30 40 50 pF1KB7 MEELTAFVSKSFDQKVKE----------KKEAITYREVLESGPLRGAKEPTGCTEAGRDD ::::::::::::::: :. ::..:::::::::: :. : : .: CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARS-------RELGTSD 10 20 30 40 50 60 70 80 90 100 pF1KB7 RSSPAVRAAGGGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVR----ELDMGAAERSR : . :::. ::. ..: . :. . CCDS14 SSLQDITE-----------------------------GGGHCPVHLFKDHVD-NDKEKLK 60 70 80 110 120 130 140 150 160 pF1KB7 EPGSPRLTEVSPELKDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPD : :. :..: : :...::.:. ...::::.:::::::::::::::::::::::::::: CCDS14 EFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPD 90 100 110 120 130 140 170 180 190 200 210 220 pF1KB7 AFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVG :::::::::::::::::::::::::::::::::::.::::..:.:....:::::::::.: CCDS14 AFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMG 150 160 170 180 190 200 230 240 250 260 270 280 pF1KB7 ALRMPFQQDSHCNVTPLSFQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPL :::::::: :. : CCDS14 ALRMPFQQMEFCSCRPGWSIMA 210 220 >>CCDS14107.1 SHOX gene_id:6473|Hs108|chrY (292 aa) initn: 1174 init1: 676 opt: 722 Z-score: 399.7 bits: 82.1 E(32554): 6e-16 Smith-Waterman score: 1164; 60.3% identity (73.0% similar) in 345 aa overlap (1-331:1-292) 10 20 30 40 50 pF1KB7 MEELTAFVSKSFDQKVKE----------KKEAITYREVLESGPLRGAKEPTGCTEAGRDD ::::::::::::::: :. ::..:::::::::: :. : : .: CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARS-------RELGTSD 10 20 30 40 50 60 70 80 90 100 pF1KB7 RSSPAVRAAGGGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVR----ELDMGAAERSR : . :::. ::. ..: . :. . CCDS14 SSLQDITE-----------------------------GGGHCPVHLFKDHVD-NDKEKLK 60 70 80 110 120 130 140 150 160 pF1KB7 EPGSPRLTEVSPELKDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPD : :. :..: : :...::.:. ...::::.:::::::::::::::::::::::::::: CCDS14 EFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPD 90 100 110 120 130 140 170 180 190 200 210 220 pF1KB7 AFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVG :::::::::::::::::::::::::::::::::::.::::..:.:....:::::::::.: CCDS14 AFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMG 150 160 170 180 190 200 230 240 250 260 270 280 pF1KB7 ALRMPFQQDSHCNVTPLSFQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPL :::::::: :::::::.. ::::: ::::::::::::.::: ::::::. CCDS14 ALRMPFQQ------------VQAQLQLEG-VAHAHPHLHPHLAAHAPYLMFPPPPFGLPI 210 220 230 240 250 290 300 310 320 330 pF1KB7 ATLAADSASAASVVAAAAAAKTTSKNSSIADLRLKAKKHAAALGL :.:: .:::::.:::::: :..:::::::::::::.::: :::: CCDS14 ASLA-ESASAAAVVAAAA--KSNSKNSSIADLRLKARKHAEALGL 260 270 280 290 >>CCDS14107.1 SHOX gene_id:6473|Hs108|chrX (292 aa) initn: 1174 init1: 676 opt: 722 Z-score: 399.7 bits: 82.1 E(32554): 6e-16 Smith-Waterman score: 1164; 60.3% identity (73.0% similar) in 345 aa overlap (1-331:1-292) 10 20 30 40 50 pF1KB7 MEELTAFVSKSFDQKVKE----------KKEAITYREVLESGPLRGAKEPTGCTEAGRDD ::::::::::::::: :. ::..:::::::::: :. : : .: CCDS14 MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARS-------RELGTSD 10 20 30 40 50 60 70 80 90 100 pF1KB7 RSSPAVRAAGGGGGGGGGGGGGGGGGGVGGGGAGGGAGGGRSPVR----ELDMGAAERSR : . :::. ::. ..: . :. . CCDS14 SSLQDITE-----------------------------GGGHCPVHLFKDHVD-NDKEKLK 60 70 80 110 120 130 140 150 160 pF1KB7 EPGSPRLTEVSPELKDRKEDAKGMEDEGQTKIKQRRSRTNFTLEQLNELERLFDETHYPD : :. :..: : :...::.:. ...::::.:::::::::::::::::::::::::::: CCDS14 EFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPD 90 100 110 120 130 140 170 180 190 200 210 220 pF1KB7 AFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQLHKGVLIGAASQFEACRVAPYVNVG :::::::::::::::::::::::::::::::::::.::::..:.:....:::::::::.: CCDS14 AFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMG 150 160 170 180 190 200 230 240 250 260 270 280 pF1KB7 ALRMPFQQDSHCNVTPLSFQVQAQLQLDSAVAHAHHHLHPHLAAHAPYMMFPAPPFGLPL :::::::: :::::::.. ::::: ::::::::::::.::: ::::::. CCDS14 ALRMPFQQ------------VQAQLQLEG-VAHAHPHLHPHLAAHAPYLMFPPPPFGLPI 210 220 230 240 250 290 300 310 320 330 pF1KB7 ATLAADSASAASVVAAAAAAKTTSKNSSIADLRLKAKKHAAALGL :.:: .:::::.:::::: :..:::::::::::::.::: :::: CCDS14 ASLA-ESASAAAVVAAAA--KSNSKNSSIADLRLKARKHAEALGL 260 270 280 290 331 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 05:53:16 2016 done: Sun Nov 6 05:53:16 2016 Total Scan time: 3.060 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]