FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6632, 247 aa 1>>>pF1KE6632 247 - 247 aa - 247 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.1732+/-0.000378; mu= 20.6655+/- 0.023 mean_var=58.5102+/-12.651, 0's: 0 Z-trim(111.6): 22 B-trim: 897 in 1/53 Lambda= 0.167671 statistics sampled from 20190 (20199) to 20190 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.6), E-opt: 0.2 (0.237), width: 16 Scan time: 6.410 The best scores are: opt bits E(85289) NP_004861 (OMIM: 604041,609180) mannose-P-dolichol ( 247) 1589 392.6 3.3e-109 XP_006721661 (OMIM: 604041,609180) PREDICTED: mann ( 259) 1083 270.2 2.4e-72 XP_011522383 (OMIM: 604041,609180) PREDICTED: mann ( 138) 884 221.8 4.8e-58 NP_001317002 (OMIM: 604041,609180) mannose-P-dolic ( 192) 850 213.8 1.8e-55 XP_006721660 (OMIM: 604041,609180) PREDICTED: mann ( 260) 838 211.0 1.7e-54 >>NP_004861 (OMIM: 604041,609180) mannose-P-dolichol uti (247 aa) initn: 1589 init1: 1589 opt: 1589 Z-score: 2081.5 bits: 392.6 E(85289): 3.3e-109 Smith-Waterman score: 1589; 99.6% identity (100.0% similar) in 247 aa overlap (1-247:1-247) 10 20 30 40 50 60 pF1KE6 MAAEADGPLKRLLVPILLPEKCYDQLFVQWDLLHVPCLKILLSKGLGLGIVAGSLLVKLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 MAAEADGPLKRLLVPILLPEKCYDQLFVQWDLLHVPCLKILLSKGLGLGIVAGSLLVKLP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 QVFKILGAKSAEGLSLQSVMLELVALTGTMVYSITNNFPFSSWGEALFLMLQTITICFLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 QVFKILGAKSAEGLSLQSVMLELVALTGTMVYSITNNFPFSSWGEALFLMLQTITICFLV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 MHYRGQTVKGVAFLACYGLVLLVLLSPLTPLTVVTLLQASNVPAVVVGRLLQAATNYHNG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 MHYRGQTVKGVAFLACYGLVLLVLLSPLTPLTVVTLLQASNVPAVVVGRLLQAATNYHNG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 HTGQLSAITVFLLFGGSLARIFTSIQETGDPLMAGTFVVSSLCNGLIATQLLFYWNAKPP ::::::::::::::::::::::::::::::::::::::::::::::::.::::::::::: NP_004 HTGQLSAITVFLLFGGSLARIFTSIQETGDPLMAGTFVVSSLCNGLIAAQLLFYWNAKPP 190 200 210 220 230 240 pF1KE6 HKQKKAQ ::::::: NP_004 HKQKKAQ >>XP_006721661 (OMIM: 604041,609180) PREDICTED: mannose- (259 aa) initn: 1083 init1: 1083 opt: 1083 Z-score: 1419.7 bits: 270.2 E(85289): 2.4e-72 Smith-Waterman score: 1083; 99.4% identity (100.0% similar) in 170 aa overlap (1-170:1-170) 10 20 30 40 50 60 pF1KE6 MAAEADGPLKRLLVPILLPEKCYDQLFVQWDLLHVPCLKILLSKGLGLGIVAGSLLVKLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 MAAEADGPLKRLLVPILLPEKCYDQLFVQWDLLHVPCLKILLSKGLGLGIVAGSLLVKLP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 QVFKILGAKSAEGLSLQSVMLELVALTGTMVYSITNNFPFSSWGEALFLMLQTITICFLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 QVFKILGAKSAEGLSLQSVMLELVALTGTMVYSITNNFPFSSWGEALFLMLQTITICFLV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 MHYRGQTVKGVAFLACYGLVLLVLLSPLTPLTVVTLLQASNVPAVVVGRLLQAATNYHNG :::::::::::::::::::::::::::::::::::::::::::::::::. XP_006 MHYRGQTVKGVAFLACYGLVLLVLLSPLTPLTVVTLLQASNVPAVVVGRVGTRSKGQDVV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 HTGQLSAITVFLLFGGSLARIFTSIQETGDPLMAGTFVVSSLCNGLIATQLLFYWNAKPP XP_006 GAGWGGRVEDQSVGVLYLGEHGKSSGDRAKGLNSSPSFSRQPPTTTTGTQASSQPSQSSC 190 200 210 220 230 240 >>XP_011522383 (OMIM: 604041,609180) PREDICTED: mannose- (138 aa) initn: 884 init1: 884 opt: 884 Z-score: 1163.0 bits: 221.8 E(85289): 4.8e-58 Smith-Waterman score: 884; 99.3% identity (100.0% similar) in 138 aa overlap (110-247:1-138) 80 90 100 110 120 130 pF1KE6 MLELVALTGTMVYSITNNFPFSSWGEALFLMLQTITICFLVMHYRGQTVKGVAFLACYGL :::::::::::::::::::::::::::::: XP_011 MLQTITICFLVMHYRGQTVKGVAFLACYGL 10 20 30 140 150 160 170 180 190 pF1KE6 VLLVLLSPLTPLTVVTLLQASNVPAVVVGRLLQAATNYHNGHTGQLSAITVFLLFGGSLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 VLLVLLSPLTPLTVVTLLQASNVPAVVVGRLLQAATNYHNGHTGQLSAITVFLLFGGSLA 40 50 60 70 80 90 200 210 220 230 240 pF1KE6 RIFTSIQETGDPLMAGTFVVSSLCNGLIATQLLFYWNAKPPHKQKKAQ :::::::::::::::::::::::::::::.:::::::::::::::::: XP_011 RIFTSIQETGDPLMAGTFVVSSLCNGLIAAQLLFYWNAKPPHKQKKAQ 100 110 120 130 >>NP_001317002 (OMIM: 604041,609180) mannose-P-dolichol (192 aa) initn: 846 init1: 846 opt: 850 Z-score: 1116.8 bits: 213.8 E(85289): 1.8e-55 Smith-Waterman score: 850; 83.4% identity (87.6% similar) in 169 aa overlap (1-164:1-169) 10 20 30 40 50 60 pF1KE6 MAAEADGPLKRLLVPILLPEKCYDQLFVQWDLLHVPCLKILLSKGLGLGIVAGSLLVKLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAAEADGPLKRLLVPILLPEKCYDQLFVQWDLLHVPCLKILLSKGLGLGIVAGSLLVKLP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 QVFKILGAKSAEGLSLQSVMLELVALTGTMVYSITNNFPFSSWGEALFLMLQTITICFLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 QVFKILGAKSAEGLSLQSVMLELVALTGTMVYSITNNFPFSSWGEALFLMLQTITICFLV 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 MHYRGQTVKGVAFLA----CYGLVLLVLLS-PLTPLTVVTLLQASNVPAVVVGRLLQAAT ::::::::::.. : : . :: : :..: :::. :. NP_001 MHYRGQTVKGAGDLPKSRLCGSWEPCWELSFSRQPPTTTTGTQASSQPSQSSCCLGAPWP 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE6 NYHNGHTGQLSAITVFLLFGGSLARIFTSIQETGDPLMAGTFVVSSLCNGLIATQLLFYW NP_001 ESSLPFRKPEIP 190 >>XP_006721660 (OMIM: 604041,609180) PREDICTED: mannose- (260 aa) initn: 838 init1: 838 opt: 838 Z-score: 1099.4 bits: 211.0 E(85289): 1.7e-54 Smith-Waterman score: 838; 100.0% identity (100.0% similar) in 129 aa overlap (1-129:1-129) 10 20 30 40 50 60 pF1KE6 MAAEADGPLKRLLVPILLPEKCYDQLFVQWDLLHVPCLKILLSKGLGLGIVAGSLLVKLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 MAAEADGPLKRLLVPILLPEKCYDQLFVQWDLLHVPCLKILLSKGLGLGIVAGSLLVKLP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 QVFKILGAKSAEGLSLQSVMLELVALTGTMVYSITNNFPFSSWGEALFLMLQTITICFLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 QVFKILGAKSAEGLSLQSVMLELVALTGTMVYSITNNFPFSSWGEALFLMLQTITICFLV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 MHYRGQTVKGVAFLACYGLVLLVLLSPLTPLTVVTLLQASNVPAVVVGRLLQAATNYHNG ::::::::: XP_006 MHYRGQTVKASPGSHQLPQRAHRPALSHHSLPAVWGLPGPNLHFHSGNRRSPDGWDLCGL 130 140 150 160 170 180 247 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:57:14 2016 done: Tue Nov 8 14:57:15 2016 Total Scan time: 6.410 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]