FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3120, 299 aa 1>>>pF1KE3120 299 - 299 aa - 299 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1237+/-0.000361; mu= 16.5184+/- 0.023 mean_var=65.2298+/-13.053, 0's: 0 Z-trim(113.2): 27 B-trim: 0 in 0/55 Lambda= 0.158800 statistics sampled from 22360 (22379) to 22360 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.654), E-opt: 0.2 (0.262), width: 16 Scan time: 7.570 The best scores are: opt bits E(85289) NP_005435 (OMIM: 612054) cell differentiation prot ( 299) 1944 454.1 1.5e-127 XP_016860737 (OMIM: 612054) PREDICTED: cell differ ( 245) 1600 375.2 6.9e-104 NP_001258564 (OMIM: 612054) cell differentiation p ( 258) 1572 368.8 6.1e-102 XP_011510440 (OMIM: 612054) PREDICTED: cell differ ( 246) 1507 353.9 1.8e-97 XP_016860738 (OMIM: 612054) PREDICTED: cell differ ( 205) 1135 268.7 7e-72 NP_001258563 (OMIM: 612054) cell differentiation p ( 331) 1025 243.6 4e-64 >>NP_005435 (OMIM: 612054) cell differentiation protein (299 aa) initn: 1944 init1: 1944 opt: 1944 Z-score: 2410.8 bits: 454.1 E(85289): 1.5e-127 Smith-Waterman score: 1944; 100.0% identity (100.0% similar) in 299 aa overlap (1-299:1-299) 10 20 30 40 50 60 pF1KE3 MHSLATAAPVPTTLAQVDREKIYQWINELSSPETRENALLELSKKRESVPDLAPMLWHSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 MHSLATAAPVPTTLAQVDREKIYQWINELSSPETRENALLELSKKRESVPDLAPMLWHSF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 GTIAALLQEIVNIYPSINPPTLTAHQSNRVCNALALLQCVASHPETRSAFLAAHIPLFLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 GTIAALLQEIVNIYPSINPPTLTAHQSNRVCNALALLQCVASHPETRSAFLAAHIPLFLY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 PFLHTVSKTRPFEYLRLTSLGVIGALVKTDEQEVINFLLTTEIIPLCLRIMESGSELSKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 PFLHTVSKTRPFEYLRLTSLGVIGALVKTDEQEVINFLLTTEIIPLCLRIMESGSELSKT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 VATFILQKILLDDTGLAYICQTYERFSHVAMILGKMVLQLSKEPSARLLKHVVRCYLRLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 VATFILQKILLDDTGLAYICQTYERFSHVAMILGKMVLQLSKEPSARLLKHVVRCYLRLS 190 200 210 220 230 240 250 260 270 280 290 pF1KE3 DNPRAREALRQCLPDQLKDTTFAQVLKDDTTTKRWLAQLVKNLQEGQVTDPRGIPLPPQ ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 DNPRAREALRQCLPDQLKDTTFAQVLKDDTTTKRWLAQLVKNLQEGQVTDPRGIPLPPQ 250 260 270 280 290 >>XP_016860737 (OMIM: 612054) PREDICTED: cell differenti (245 aa) initn: 1600 init1: 1600 opt: 1600 Z-score: 1986.1 bits: 375.2 E(85289): 6.9e-104 Smith-Waterman score: 1600; 100.0% identity (100.0% similar) in 245 aa overlap (55-299:1-245) 30 40 50 60 70 80 pF1KE3 WINELSSPETRENALLELSKKRESVPDLAPMLWHSFGTIAALLQEIVNIYPSINPPTLTA :::::::::::::::::::::::::::::: XP_016 MLWHSFGTIAALLQEIVNIYPSINPPTLTA 10 20 30 90 100 110 120 130 140 pF1KE3 HQSNRVCNALALLQCVASHPETRSAFLAAHIPLFLYPFLHTVSKTRPFEYLRLTSLGVIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 HQSNRVCNALALLQCVASHPETRSAFLAAHIPLFLYPFLHTVSKTRPFEYLRLTSLGVIG 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE3 ALVKTDEQEVINFLLTTEIIPLCLRIMESGSELSKTVATFILQKILLDDTGLAYICQTYE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 ALVKTDEQEVINFLLTTEIIPLCLRIMESGSELSKTVATFILQKILLDDTGLAYICQTYE 100 110 120 130 140 150 210 220 230 240 250 260 pF1KE3 RFSHVAMILGKMVLQLSKEPSARLLKHVVRCYLRLSDNPRAREALRQCLPDQLKDTTFAQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 RFSHVAMILGKMVLQLSKEPSARLLKHVVRCYLRLSDNPRAREALRQCLPDQLKDTTFAQ 160 170 180 190 200 210 270 280 290 pF1KE3 VLKDDTTTKRWLAQLVKNLQEGQVTDPRGIPLPPQ ::::::::::::::::::::::::::::::::::: XP_016 VLKDDTTTKRWLAQLVKNLQEGQVTDPRGIPLPPQ 220 230 240 >>NP_001258564 (OMIM: 612054) cell differentiation prote (258 aa) initn: 1572 init1: 1572 opt: 1572 Z-score: 1951.1 bits: 368.8 E(85289): 6.1e-102 Smith-Waterman score: 1572; 100.0% identity (100.0% similar) in 244 aa overlap (1-244:1-244) 10 20 30 40 50 60 pF1KE3 MHSLATAAPVPTTLAQVDREKIYQWINELSSPETRENALLELSKKRESVPDLAPMLWHSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MHSLATAAPVPTTLAQVDREKIYQWINELSSPETRENALLELSKKRESVPDLAPMLWHSF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 GTIAALLQEIVNIYPSINPPTLTAHQSNRVCNALALLQCVASHPETRSAFLAAHIPLFLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GTIAALLQEIVNIYPSINPPTLTAHQSNRVCNALALLQCVASHPETRSAFLAAHIPLFLY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 PFLHTVSKTRPFEYLRLTSLGVIGALVKTDEQEVINFLLTTEIIPLCLRIMESGSELSKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PFLHTVSKTRPFEYLRLTSLGVIGALVKTDEQEVINFLLTTEIIPLCLRIMESGSELSKT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 VATFILQKILLDDTGLAYICQTYERFSHVAMILGKMVLQLSKEPSARLLKHVVRCYLRLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VATFILQKILLDDTGLAYICQTYERFSHVAMILGKMVLQLSKEPSARLLKHVVRCYLRLS 190 200 210 220 230 240 250 260 270 280 290 pF1KE3 DNPRAREALRQCLPDQLKDTTFAQVLKDDTTTKRWLAQLVKNLQEGQVTDPRGIPLPPQ :::: NP_001 DNPRFSDLTFCWSSFQRK 250 >>XP_011510440 (OMIM: 612054) PREDICTED: cell differenti (246 aa) initn: 1507 init1: 1507 opt: 1507 Z-score: 1871.0 bits: 353.9 E(85289): 1.8e-97 Smith-Waterman score: 1507; 99.6% identity (100.0% similar) in 232 aa overlap (68-299:15-246) 40 50 60 70 80 90 pF1KE3 ALLELSKKRESVPDLAPMLWHSFGTIAALLQEIVNIYPSINPPTLTAHQSNRVCNALALL .::::::::::::::::::::::::::::: XP_011 MTGGGRWKVEYKDHKEIVNIYPSINPPTLTAHQSNRVCNALALL 10 20 30 40 100 110 120 130 140 150 pF1KE3 QCVASHPETRSAFLAAHIPLFLYPFLHTVSKTRPFEYLRLTSLGVIGALVKTDEQEVINF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 QCVASHPETRSAFLAAHIPLFLYPFLHTVSKTRPFEYLRLTSLGVIGALVKTDEQEVINF 50 60 70 80 90 100 160 170 180 190 200 210 pF1KE3 LLTTEIIPLCLRIMESGSELSKTVATFILQKILLDDTGLAYICQTYERFSHVAMILGKMV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 LLTTEIIPLCLRIMESGSELSKTVATFILQKILLDDTGLAYICQTYERFSHVAMILGKMV 110 120 130 140 150 160 220 230 240 250 260 270 pF1KE3 LQLSKEPSARLLKHVVRCYLRLSDNPRAREALRQCLPDQLKDTTFAQVLKDDTTTKRWLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 LQLSKEPSARLLKHVVRCYLRLSDNPRAREALRQCLPDQLKDTTFAQVLKDDTTTKRWLA 170 180 190 200 210 220 280 290 pF1KE3 QLVKNLQEGQVTDPRGIPLPPQ :::::::::::::::::::::: XP_011 QLVKNLQEGQVTDPRGIPLPPQ 230 240 >>XP_016860738 (OMIM: 612054) PREDICTED: cell differenti (205 aa) initn: 1135 init1: 1135 opt: 1135 Z-score: 1411.5 bits: 268.7 E(85289): 7e-72 Smith-Waterman score: 1135; 99.4% identity (100.0% similar) in 177 aa overlap (68-244:15-191) 40 50 60 70 80 90 pF1KE3 ALLELSKKRESVPDLAPMLWHSFGTIAALLQEIVNIYPSINPPTLTAHQSNRVCNALALL .::::::::::::::::::::::::::::: XP_016 MTGGGRWKVEYKDHKEIVNIYPSINPPTLTAHQSNRVCNALALL 10 20 30 40 100 110 120 130 140 150 pF1KE3 QCVASHPETRSAFLAAHIPLFLYPFLHTVSKTRPFEYLRLTSLGVIGALVKTDEQEVINF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 QCVASHPETRSAFLAAHIPLFLYPFLHTVSKTRPFEYLRLTSLGVIGALVKTDEQEVINF 50 60 70 80 90 100 160 170 180 190 200 210 pF1KE3 LLTTEIIPLCLRIMESGSELSKTVATFILQKILLDDTGLAYICQTYERFSHVAMILGKMV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 LLTTEIIPLCLRIMESGSELSKTVATFILQKILLDDTGLAYICQTYERFSHVAMILGKMV 110 120 130 140 150 160 220 230 240 250 260 270 pF1KE3 LQLSKEPSARLLKHVVRCYLRLSDNPRAREALRQCLPDQLKDTTFAQVLKDDTTTKRWLA ::::::::::::::::::::::::::: XP_016 LQLSKEPSARLLKHVVRCYLRLSDNPRFSDLTFCWSSFQRK 170 180 190 200 >>NP_001258563 (OMIM: 612054) cell differentiation prote (331 aa) initn: 1025 init1: 1025 opt: 1025 Z-score: 1272.3 bits: 243.6 E(85289): 4e-64 Smith-Waterman score: 1870; 90.3% identity (90.3% similar) in 331 aa overlap (1-299:1-331) 10 20 30 40 50 60 pF1KE3 MHSLATAAPVPTTLAQVDREKIYQWINELSSPETRENALLELSKKRESVPDLAPMLWHSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MHSLATAAPVPTTLAQVDREKIYQWINELSSPETRENALLELSKKRESVPDLAPMLWHSF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 GTIAALLQEIVNIYPSINPPTLTAHQSNRVCNALALLQCVASHPETRSAFLAAHIPLFLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GTIAALLQEIVNIYPSINPPTLTAHQSNRVCNALALLQCVASHPETRSAFLAAHIPLFLY 70 80 90 100 110 120 130 140 pF1KE3 PFLHTVSKTRPFEYLRLTSLGVI--------------------------------GALVK ::::::::::::::::::::::: ::::: NP_001 PFLHTVSKTRPFEYLRLTSLGVIVETGFHHVGQADLELPTSSDLPASASQSAGITGALVK 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE3 TDEQEVINFLLTTEIIPLCLRIMESGSELSKTVATFILQKILLDDTGLAYICQTYERFSH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TDEQEVINFLLTTEIIPLCLRIMESGSELSKTVATFILQKILLDDTGLAYICQTYERFSH 190 200 210 220 230 240 210 220 230 240 250 260 pF1KE3 VAMILGKMVLQLSKEPSARLLKHVVRCYLRLSDNPRAREALRQCLPDQLKDTTFAQVLKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VAMILGKMVLQLSKEPSARLLKHVVRCYLRLSDNPRAREALRQCLPDQLKDTTFAQVLKD 250 260 270 280 290 300 270 280 290 pF1KE3 DTTTKRWLAQLVKNLQEGQVTDPRGIPLPPQ ::::::::::::::::::::::::::::::: NP_001 DTTTKRWLAQLVKNLQEGQVTDPRGIPLPPQ 310 320 330 299 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 12:05:49 2016 done: Sun Nov 6 12:05:50 2016 Total Scan time: 7.570 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]