FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1614, 126 aa 1>>>pF1KE1614 126 - 126 aa - 126 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.6028+/-0.000281; mu= 13.5396+/- 0.018 mean_var=50.4079+/- 9.973, 0's: 0 Z-trim(116.6): 40 B-trim: 55 in 1/54 Lambda= 0.180645 statistics sampled from 27904 (27944) to 27904 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.736), E-opt: 0.2 (0.328), width: 16 Scan time: 4.180 The best scores are: opt bits E(85289) NP_005331 (OMIM: 137200,601314) histidine triad nu ( 126) 865 232.5 1.4e-61 NP_115982 (OMIM: 609997) histidine triad nucleotid ( 163) 543 148.6 3.1e-36 XP_016864899 (OMIM: 137200,601314) PREDICTED: hist ( 103) 481 132.4 1.6e-31 XP_016870327 (OMIM: 208920,606350) PREDICTED: apra ( 204) 157 48.1 7.3e-06 XP_011516240 (OMIM: 208920,606350) PREDICTED: apra ( 254) 157 48.2 8.7e-06 XP_006716855 (OMIM: 208920,606350) PREDICTED: apra ( 254) 157 48.2 8.7e-06 XP_011516241 (OMIM: 208920,606350) PREDICTED: apra ( 254) 157 48.2 8.7e-06 NP_001182181 (OMIM: 208920,606350) aprataxin isofo ( 284) 157 48.2 9.6e-06 NP_001182183 (OMIM: 208920,606350) aprataxin isofo ( 288) 157 48.2 9.7e-06 XP_016870326 (OMIM: 208920,606350) PREDICTED: apra ( 288) 157 48.2 9.7e-06 NP_001182180 (OMIM: 208920,606350) aprataxin isofo ( 292) 157 48.2 9.8e-06 NP_001182179 (OMIM: 208920,606350) aprataxin isofo ( 302) 157 48.2 1e-05 NP_778239 (OMIM: 208920,606350) aprataxin isoform ( 306) 157 48.2 1e-05 XP_006716854 (OMIM: 208920,606350) PREDICTED: apra ( 342) 157 48.2 1.1e-05 NP_001182178 (OMIM: 208920,606350) aprataxin isofo ( 342) 157 48.2 1.1e-05 NP_778243 (OMIM: 208920,606350) aprataxin isoform ( 342) 157 48.2 1.1e-05 NP_001182177 (OMIM: 208920,606350) aprataxin isofo ( 356) 157 48.2 1.2e-05 NP_612638 (OMIM: 609998) histidine triad nucleotid ( 182) 152 46.8 1.6e-05 XP_016870325 (OMIM: 208920,606350) PREDICTED: apra ( 308) 122 39.1 0.0057 XP_016870324 (OMIM: 208920,606350) PREDICTED: apra ( 315) 122 39.1 0.0058 XP_016870323 (OMIM: 208920,606350) PREDICTED: apra ( 325) 122 39.1 0.006 XP_016870322 (OMIM: 208920,606350) PREDICTED: apra ( 338) 122 39.1 0.0062 NP_002003 (OMIM: 601153) bis(5'-adenosyl)-triphosp ( 147) 118 37.9 0.0063 NP_001307828 (OMIM: 601153) bis(5'-adenosyl)-triph ( 147) 118 37.9 0.0063 NP_001159715 (OMIM: 601153) bis(5'-adenosyl)-triph ( 147) 118 37.9 0.0063 NP_001307829 (OMIM: 601153) bis(5'-adenosyl)-triph ( 147) 118 37.9 0.0063 XP_016870321 (OMIM: 208920,606350) PREDICTED: apra ( 369) 122 39.1 0.0066 XP_016870320 (OMIM: 208920,606350) PREDICTED: apra ( 423) 122 39.2 0.0074 >>NP_005331 (OMIM: 137200,601314) histidine triad nucleo (126 aa) initn: 865 init1: 865 opt: 865 Z-score: 1226.6 bits: 232.5 E(85289): 1.4e-61 Smith-Waterman score: 865; 100.0% identity (100.0% similar) in 126 aa overlap (1-126:1-126) 10 20 30 40 50 60 pF1KE1 MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAPTHFLVIPKKHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAPTHFLVIPKKHI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 SQISVAEDDDESLLGHLMIVGKKCAADLGLNKGYRMVVNEGSDGGQSVYHVHLHVLGGRQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 SQISVAEDDDESLLGHLMIVGKKCAADLGLNKGYRMVVNEGSDGGQSVYHVHLHVLGGRQ 70 80 90 100 110 120 pF1KE1 MHWPPG :::::: NP_005 MHWPPG >>NP_115982 (OMIM: 609997) histidine triad nucleotide-bi (163 aa) initn: 544 init1: 496 opt: 543 Z-score: 771.4 bits: 148.6 E(85289): 3.1e-36 Smith-Waterman score: 543; 59.5% identity (86.5% similar) in 126 aa overlap (3-126:38-163) 10 20 30 pF1KE1 MADEIAKAQVARPGG--DTIFGKIIRKEIPAK .:.:::: : ::: :::..:. : .:: NP_115 AAGLRAARRAVAATGVRGGQVRGAAGVTDGNEVAKAQQATPGGAAPTIFSRILDKSLPAD 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE1 IIFEDDRCLAFHDISPQAPTHFLVIPKKHISQISVAEDDDESLLGHLMIVGKKCAADLGL :..::..::.:.:..::::.:::::::: : .:: ::..:..:::::..:.:. : :: NP_115 ILYEDQQCLVFRDVAPQAPVHFLVIPKKPIPRISQAEEEDQQLLGHLLLVAKQTAKAEGL 70 80 90 100 110 120 100 110 120 pF1KE1 NKGYRMVVNEGSDGGQSVYHVHLHVLGGRQMHWPPG . :::.:.:.:. :.:::::.:.:::::::..:::: NP_115 GDGYRLVINDGKLGAQSVYHLHIHVLGGRQLQWPPG 130 140 150 160 >>XP_016864899 (OMIM: 137200,601314) PREDICTED: histidin (103 aa) initn: 481 init1: 481 opt: 481 Z-score: 687.1 bits: 132.4 E(85289): 1.6e-31 Smith-Waterman score: 481; 97.3% identity (100.0% similar) in 74 aa overlap (1-74:1-74) 10 20 30 40 50 60 pF1KE1 MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAPTHFLVIPKKHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAPTHFLVIPKKHI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 SQISVAEDDDESLLGHLMIVGKKCAADLGLNKGYRMVVNEGSDGGQSVYHVHLHVLGGRQ ::::::::::::.. XP_016 SQISVAEDDDESVITKEKPEKPLGLQLPSCFPKLLHHFVSHQQ 70 80 90 100 >>XP_016870327 (OMIM: 208920,606350) PREDICTED: aprataxi (204 aa) initn: 102 init1: 69 opt: 157 Z-score: 226.3 bits: 48.1 E(85289): 7.3e-06 Smith-Waterman score: 157; 29.9% identity (61.9% similar) in 97 aa overlap (20-116:82-176) 10 20 30 40 pF1KE1 MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAP : : . : ...:.. ....: :.: XP_016 PGSNSGQCSVPLKKGKDAPIKKESLGHWSQGLKISMQDPKMQVYKDEQVVVIKDKYPKAR 60 70 80 90 100 110 50 60 70 80 90 100 pF1KE1 THFLVIPKKHISQISVAEDDDESLLGHLMIVGKKCAADLGLNKGYRMVVNEGSDGGQSVY :.::.: ::..... . :: :. ::.: .:.. .. :. . : . :. XP_016 YHWLVLPWTSISSLKAVAREHLELLKHMHTVGEKVIVDFAGSSKLRFRL--GYHAIPSMS 120 130 140 150 160 110 120 pF1KE1 HVHLHVLGGRQMHWPPG ::::::. XP_016 HVHLHVISQDFDSPCLKNKKHWNSFNTEYFLESQE 170 180 190 200 >>XP_011516240 (OMIM: 208920,606350) PREDICTED: aprataxi (254 aa) initn: 102 init1: 69 opt: 157 Z-score: 224.9 bits: 48.2 E(85289): 8.7e-06 Smith-Waterman score: 157; 29.9% identity (61.9% similar) in 97 aa overlap (20-116:82-176) 10 20 30 40 pF1KE1 MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAP : : . : ...:.. ....: :.: XP_011 PGSNSGQCSVPLKKGKDAPIKKESLGHWSQGLKISMQDPKMQVYKDEQVVVIKDKYPKAR 60 70 80 90 100 110 50 60 70 80 90 100 pF1KE1 THFLVIPKKHISQISVAEDDDESLLGHLMIVGKKCAADLGLNKGYRMVVNEGSDGGQSVY :.::.: ::..... . :: :. ::.: .:.. .. :. . : . :. XP_011 YHWLVLPWTSISSLKAVAREHLELLKHMHTVGEKVIVDFAGSSKLRFRL--GYHAIPSMS 120 130 140 150 160 110 120 pF1KE1 HVHLHVLGGRQMHWPPG ::::::. XP_011 HVHLHVISQDFDSPCLKNKKHWNSFNTEYFLESQAVIEMVQEAGRVTVRDGMPELLKLPL 170 180 190 200 210 220 >>XP_006716855 (OMIM: 208920,606350) PREDICTED: aprataxi (254 aa) initn: 102 init1: 69 opt: 157 Z-score: 224.9 bits: 48.2 E(85289): 8.7e-06 Smith-Waterman score: 157; 29.9% identity (61.9% similar) in 97 aa overlap (20-116:82-176) 10 20 30 40 pF1KE1 MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAP : : . : ...:.. ....: :.: XP_006 PGSNSGQCSVPLKKGKDAPIKKESLGHWSQGLKISMQDPKMQVYKDEQVVVIKDKYPKAR 60 70 80 90 100 110 50 60 70 80 90 100 pF1KE1 THFLVIPKKHISQISVAEDDDESLLGHLMIVGKKCAADLGLNKGYRMVVNEGSDGGQSVY :.::.: ::..... . :: :. ::.: .:.. .. :. . : . :. XP_006 YHWLVLPWTSISSLKAVAREHLELLKHMHTVGEKVIVDFAGSSKLRFRL--GYHAIPSMS 120 130 140 150 160 110 120 pF1KE1 HVHLHVLGGRQMHWPPG ::::::. XP_006 HVHLHVISQDFDSPCLKNKKHWNSFNTEYFLESQAVIEMVQEAGRVTVRDGMPELLKLPL 170 180 190 200 210 220 >>XP_011516241 (OMIM: 208920,606350) PREDICTED: aprataxi (254 aa) initn: 102 init1: 69 opt: 157 Z-score: 224.9 bits: 48.2 E(85289): 8.7e-06 Smith-Waterman score: 157; 29.9% identity (61.9% similar) in 97 aa overlap (20-116:82-176) 10 20 30 40 pF1KE1 MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAP : : . : ...:.. ....: :.: XP_011 PGSNSGQCSVPLKKGKDAPIKKESLGHWSQGLKISMQDPKMQVYKDEQVVVIKDKYPKAR 60 70 80 90 100 110 50 60 70 80 90 100 pF1KE1 THFLVIPKKHISQISVAEDDDESLLGHLMIVGKKCAADLGLNKGYRMVVNEGSDGGQSVY :.::.: ::..... . :: :. ::.: .:.. .. :. . : . :. XP_011 YHWLVLPWTSISSLKAVAREHLELLKHMHTVGEKVIVDFAGSSKLRFRL--GYHAIPSMS 120 130 140 150 160 110 120 pF1KE1 HVHLHVLGGRQMHWPPG ::::::. XP_011 HVHLHVISQDFDSPCLKNKKHWNSFNTEYFLESQAVIEMVQEAGRVTVRDGMPELLKLPL 170 180 190 200 210 220 >>NP_001182181 (OMIM: 208920,606350) aprataxin isoform i (284 aa) initn: 102 init1: 69 opt: 157 Z-score: 224.1 bits: 48.2 E(85289): 9.6e-06 Smith-Waterman score: 157; 29.9% identity (61.9% similar) in 97 aa overlap (20-116:112-206) 10 20 30 40 pF1KE1 MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAP : : . : ...:.. ....: :.: NP_001 IDSVVIGKDQEVKLQPGQVLHMESLGHWSQGLKISMQDPKMQVYKDEQVVVIKDKYPKAR 90 100 110 120 130 140 50 60 70 80 90 100 pF1KE1 THFLVIPKKHISQISVAEDDDESLLGHLMIVGKKCAADLGLNKGYRMVVNEGSDGGQSVY :.::.: ::..... . :: :. ::.: .:.. .. :. . : . :. NP_001 YHWLVLPWTSISSLKAVAREHLELLKHMHTVGEKVIVDFAGSSKLRFRL--GYHAIPSMS 150 160 170 180 190 110 120 pF1KE1 HVHLHVLGGRQMHWPPG ::::::. NP_001 HVHLHVISQDFDSPCLKNKKHWNSFNTEYFLESQAVIEMVQEAGRVTVRDGMPELLKLPL 200 210 220 230 240 250 >>NP_001182183 (OMIM: 208920,606350) aprataxin isoform h (288 aa) initn: 102 init1: 69 opt: 157 Z-score: 224.0 bits: 48.2 E(85289): 9.7e-06 Smith-Waterman score: 157; 29.9% identity (61.9% similar) in 97 aa overlap (20-116:116-210) 10 20 30 40 pF1KE1 MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAP : : . : ...:.. ....: :.: NP_001 PGSNSGQCSVPLKKGKDAPIKKESLGHWSQGLKISMQDPKMQVYKDEQVVVIKDKYPKAR 90 100 110 120 130 140 50 60 70 80 90 100 pF1KE1 THFLVIPKKHISQISVAEDDDESLLGHLMIVGKKCAADLGLNKGYRMVVNEGSDGGQSVY :.::.: ::..... . :: :. ::.: .:.. .. :. . : . :. NP_001 YHWLVLPWTSISSLKAVAREHLELLKHMHTVGEKVIVDFAGSSKLRFRL--GYHAIPSMS 150 160 170 180 190 200 110 120 pF1KE1 HVHLHVLGGRQMHWPPG ::::::. NP_001 HVHLHVISQDFDSPCLKNKKHWNSFNTEYFLESQAVIEMVQEAGRVTVRDGMPELLKLPL 210 220 230 240 250 260 >>XP_016870326 (OMIM: 208920,606350) PREDICTED: aprataxi (288 aa) initn: 102 init1: 69 opt: 157 Z-score: 224.0 bits: 48.2 E(85289): 9.7e-06 Smith-Waterman score: 157; 29.9% identity (61.9% similar) in 97 aa overlap (20-116:116-210) 10 20 30 40 pF1KE1 MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAP : : . : ...:.. ....: :.: XP_016 PGSNSGQCSVPLKKGKDAPIKKESLGHWSQGLKISMQDPKMQVYKDEQVVVIKDKYPKAR 90 100 110 120 130 140 50 60 70 80 90 100 pF1KE1 THFLVIPKKHISQISVAEDDDESLLGHLMIVGKKCAADLGLNKGYRMVVNEGSDGGQSVY :.::.: ::..... . :: :. ::.: .:.. .. :. . : . :. XP_016 YHWLVLPWTSISSLKAVAREHLELLKHMHTVGEKVIVDFAGSSKLRFRL--GYHAIPSMS 150 160 170 180 190 200 110 120 pF1KE1 HVHLHVLGGRQMHWPPG ::::::. XP_016 HVHLHVISQDFDSPCLKNKKHWNSFNTEYFLESQAVIEMVQEAGRVTVRDGMPELLKLPL 210 220 230 240 250 260 126 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 15:33:47 2016 done: Mon Nov 7 15:33:48 2016 Total Scan time: 4.180 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]