FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1884, 410 aa 1>>>pF1KE1884 410 - 410 aa - 410 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.2413+/-0.000354; mu= 10.2514+/- 0.022 mean_var=152.3233+/-30.456, 0's: 0 Z-trim(118.6): 4 B-trim: 44 in 1/57 Lambda= 0.103918 statistics sampled from 31712 (31716) to 31712 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.729), E-opt: 0.2 (0.372), width: 16 Scan time: 9.400 The best scores are: opt bits E(85289) NP_003202 (OMIM: 601423) G/T mismatch-specific thy ( 410) 2774 427.4 3.1e-119 XP_011537016 (OMIM: 601423) PREDICTED: G/T mismatc ( 267) 1846 288.2 1.7e-77 XP_005269182 (OMIM: 601423) PREDICTED: G/T mismatc ( 206) 1216 193.6 3.8e-49 XP_011537017 (OMIM: 601423) PREDICTED: G/T mismatc ( 206) 1216 193.6 3.8e-49 >>NP_003202 (OMIM: 601423) G/T mismatch-specific thymine (410 aa) initn: 2774 init1: 2774 opt: 2774 Z-score: 2261.7 bits: 427.4 E(85289): 3.1e-119 Smith-Waterman score: 2774; 100.0% identity (100.0% similar) in 410 aa overlap (1-410:1-410) 10 20 30 40 50 60 pF1KE1 MEAENAGSYSLQQAQAFYTFPFQQLMAEAPNMAVVNEQQMPEEVPAPAPAQEPVQEAPKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 MEAENAGSYSLQQAQAFYTFPFQQLMAEAPNMAVVNEQQMPEEVPAPAPAQEPVQEAPKG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RKRKPRTTEPKQPVEPKKPVESKKSGKSAKSKEKQEKITDTFKVKRKVDRFNGVSEAELL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 RKRKPRTTEPKQPVEPKKPVESKKSGKSAKSKEKQEKITDTFKVKRKVDRFNGVSEAELL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 TKTLPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWKCLFMSGLSEVQLNHMDDHT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 TKTLPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWKCLFMSGLSEVQLNHMDDHT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 LPGKYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQKLQKYQPRIAVFNGKCIYEIFSK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 LPGKYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQKLQKYQPRIAVFNGKCIYEIFSK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 EVFGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDKVHYYIKLKDLRDQLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 EVFGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDKVHYYIKLKDLRDQLK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 GIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKYDPGYEAAYGGAYGENPCSSEPCGFSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 GIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKYDPGYEAAYGGAYGENPCSSEPCGFSS 310 320 330 340 350 360 370 380 390 400 410 pF1KE1 NGLIESVELRGESAFSGIPNGQWMTQSFTDQIPSFSNHCGTQEQEEESHA :::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 NGLIESVELRGESAFSGIPNGQWMTQSFTDQIPSFSNHCGTQEQEEESHA 370 380 390 400 410 >>XP_011537016 (OMIM: 601423) PREDICTED: G/T mismatch-sp (267 aa) initn: 1846 init1: 1846 opt: 1846 Z-score: 1512.3 bits: 288.2 E(85289): 1.7e-77 Smith-Waterman score: 1846; 100.0% identity (100.0% similar) in 267 aa overlap (144-410:1-267) 120 130 140 150 160 170 pF1KE1 VSEAELLTKTLPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWKCLFMSGLSEVQL :::::::::::::::::::::::::::::: XP_011 MAAYKGHHYPGPGNHFWKCLFMSGLSEVQL 10 20 30 180 190 200 210 220 230 pF1KE1 NHMDDHTLPGKYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQKLQKYQPRIAVFNGKC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 NHMDDHTLPGKYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQKLQKYQPRIAVFNGKC 40 50 60 70 80 90 240 250 260 270 280 290 pF1KE1 IYEIFSKEVFGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDKVHYYIKLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 IYEIFSKEVFGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDKVHYYIKLK 100 110 120 130 140 150 300 310 320 330 340 350 pF1KE1 DLRDQLKGIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKYDPGYEAAYGGAYGENPCSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 DLRDQLKGIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKYDPGYEAAYGGAYGENPCSS 160 170 180 190 200 210 360 370 380 390 400 410 pF1KE1 EPCGFSSNGLIESVELRGESAFSGIPNGQWMTQSFTDQIPSFSNHCGTQEQEEESHA ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 EPCGFSSNGLIESVELRGESAFSGIPNGQWMTQSFTDQIPSFSNHCGTQEQEEESHA 220 230 240 250 260 >>XP_005269182 (OMIM: 601423) PREDICTED: G/T mismatch-sp (206 aa) initn: 1215 init1: 1215 opt: 1216 Z-score: 1003.4 bits: 193.6 E(85289): 3.8e-49 Smith-Waterman score: 1216; 90.5% identity (93.0% similar) in 201 aa overlap (210-410:6-206) 180 190 200 210 220 230 pF1KE1 TLPGKYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQKLQKYQPRIAVFNGKCIYEIFS .:. .: :: :. . :::::: XP_005 MITLYQGSMVLDLPTWWKGPRPAAKISPGIYEIFS 10 20 30 240 250 260 270 280 290 pF1KE1 KEVFGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDKVHYYIKLKDLRDQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 KEVFGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDKVHYYIKLKDLRDQL 40 50 60 70 80 90 300 310 320 330 340 350 pF1KE1 KGIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKYDPGYEAAYGGAYGENPCSSEPCGFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 KGIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKYDPGYEAAYGGAYGENPCSSEPCGFS 100 110 120 130 140 150 360 370 380 390 400 410 pF1KE1 SNGLIESVELRGESAFSGIPNGQWMTQSFTDQIPSFSNHCGTQEQEEESHA ::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 SNGLIESVELRGESAFSGIPNGQWMTQSFTDQIPSFSNHCGTQEQEEESHA 160 170 180 190 200 >>XP_011537017 (OMIM: 601423) PREDICTED: G/T mismatch-sp (206 aa) initn: 1215 init1: 1215 opt: 1216 Z-score: 1003.4 bits: 193.6 E(85289): 3.8e-49 Smith-Waterman score: 1216; 90.5% identity (93.0% similar) in 201 aa overlap (210-410:6-206) 180 190 200 210 220 230 pF1KE1 TLPGKYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQKLQKYQPRIAVFNGKCIYEIFS .:. .: :: :. . :::::: XP_011 MITLYQGSMVLDLPTWWKGPRPAAKISPGIYEIFS 10 20 30 240 250 260 270 280 290 pF1KE1 KEVFGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDKVHYYIKLKDLRDQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 KEVFGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDKVHYYIKLKDLRDQL 40 50 60 70 80 90 300 310 320 330 340 350 pF1KE1 KGIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKYDPGYEAAYGGAYGENPCSSEPCGFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 KGIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKYDPGYEAAYGGAYGENPCSSEPCGFS 100 110 120 130 140 150 360 370 380 390 400 410 pF1KE1 SNGLIESVELRGESAFSGIPNGQWMTQSFTDQIPSFSNHCGTQEQEEESHA ::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 SNGLIESVELRGESAFSGIPNGQWMTQSFTDQIPSFSNHCGTQEQEEESHA 160 170 180 190 200 410 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:43:13 2016 done: Sun Nov 6 14:43:14 2016 Total Scan time: 9.400 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]