FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3628, 180 aa 1>>>pF1KE3628 180 - 180 aa - 180 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5612+/-0.000342; mu= 11.6764+/- 0.021 mean_var=66.9881+/-13.221, 0's: 0 Z-trim(114.2): 9 B-trim: 0 in 0/54 Lambda= 0.156702 statistics sampled from 23979 (23986) to 23979 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.672), E-opt: 0.2 (0.281), width: 16 Scan time: 4.950 The best scores are: opt bits E(85289) NP_002045 (OMIM: 138030) glucagon preproprotein [H ( 180) 1196 278.9 3.1e-75 XP_006715625 (OMIM: 192320) PREDICTED: VIP peptide ( 169) 174 47.8 1.1e-05 NP_003372 (OMIM: 192320) VIP peptides isoform 1 pr ( 170) 172 47.4 1.5e-05 XP_005267192 (OMIM: 192320) PREDICTED: VIP peptide ( 168) 163 45.3 5.9e-05 NP_919416 (OMIM: 192320) VIP peptides isoform 2 pr ( 169) 161 44.9 8.1e-05 NP_004114 (OMIM: 137240) gastric inhibitory polype ( 153) 143 40.8 0.0012 >>NP_002045 (OMIM: 138030) glucagon preproprotein [Homo (180 aa) initn: 1196 init1: 1196 opt: 1196 Z-score: 1471.6 bits: 278.9 E(85289): 3.1e-75 Smith-Waterman score: 1196; 100.0% identity (100.0% similar) in 180 aa overlap (1-180:1-180) 10 20 30 40 50 60 pF1KE3 MKSIYFVAGLFVMLVQGSWQRSLQDTEEKSRSFSASQADPLSDPDQMNEDKRHSQGTFTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 MKSIYFVAGLFVMLVQGSWQRSLQDTEEKSRSFSASQADPLSDPDQMNEDKRHSQGTFTS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 DYSKYLDSRRAQDFVQWLMNTKRNRNNIAKRHDEFERHAEGTFTSDVSSYLEGQAAKEFI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 DYSKYLDSRRAQDFVQWLMNTKRNRNNIAKRHDEFERHAEGTFTSDVSSYLEGQAAKEFI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 AWLVKGRGRRDFPEEVAIVEELGRRHADGSFSDEMNTILDNLAARDFINWLIQTKITDRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 AWLVKGRGRRDFPEEVAIVEELGRRHADGSFSDEMNTILDNLAARDFINWLIQTKITDRK 130 140 150 160 170 180 >>XP_006715625 (OMIM: 192320) PREDICTED: VIP peptides is (169 aa) initn: 192 init1: 93 opt: 174 Z-score: 223.4 bits: 47.8 E(85289): 1.1e-05 Smith-Waterman score: 174; 29.8% identity (65.3% similar) in 121 aa overlap (20-136:51-167) 10 20 30 40 pF1KE3 MKSIYFVAGLFVMLVQGSWQRSLQDTEEKSRSFSASQADPLSDPDQMNE : ::.. . .. : . : : ... XP_006 SQTSAWPLYRAPSALRLGDRIPFEGANEPDQVSLKEDIDMLQNALAENDTPYYD---VSR 30 40 50 60 70 50 60 70 80 90 100 pF1KE3 DKRHSQGTFTSDYSKYLDSRRAQDFVQWLMNTKRNRNNIAKRHDEFERHAEGTFTSDVSS . ::..:.::::.:: : . :. ... ::. :: .::.. .::....::.. . XP_006 NARHADGVFTSDFSKLLGQLSAKKYLESLMG-KRVSSNISEDPVPVKRHSDAVFTDNYTR 80 90 100 110 120 130 110 120 130 140 150 160 pF1KE3 YLEGQAAKEFIAWLVKGR---GRR-DFPEEVAIVEELGRRHADGSFSDEMNTILDNLAAR . .:.:... ...:. :. :::::. XP_006 LRKQMAVKKYLNSILNGKRSEGESPDFPEELEK 140 150 160 170 180 pF1KE3 DFINWLIQTKITDRK >>NP_003372 (OMIM: 192320) VIP peptides isoform 1 prepro (170 aa) initn: 192 init1: 93 opt: 172 Z-score: 220.9 bits: 47.4 E(85289): 1.5e-05 Smith-Waterman score: 172; 29.5% identity (64.8% similar) in 122 aa overlap (20-136:51-168) 10 20 30 40 pF1KE3 MKSIYFVAGLFVMLVQGSWQRSLQDTEEKSRSFSASQADPLSDPDQMNE : ::.. . .. : . : : ... NP_003 SQTSAWPLYRAPSALRLGDRIPFEGANEPDQVSLKEDIDMLQNALAENDTPYYD---VSR 30 40 50 60 70 50 60 70 80 90 100 pF1KE3 DKRHSQGTFTSDYSKYLDSRRAQDFVQWLMNTKRNRNNIAKRHDEFERHAEGTFTSDVSS . ::..:.::::.:: : . :. ... ::. :: .::.. .::....::.. . NP_003 NARHADGVFTSDFSKLLGQLSAKKYLESLMG-KRVSSNISEDPVPVKRHSDAVFTDNYTR 80 90 100 110 120 130 110 120 130 140 150 160 pF1KE3 YLEGQAAKEFIAWLVKGR----GRR-DFPEEVAIVEELGRRHADGSFSDEMNTILDNLAA . .:.:... ...:. :. :::::. NP_003 LRKQMAVKKYLNSILNGKRSSEGESPDFPEELEK 140 150 160 170 170 180 pF1KE3 RDFINWLIQTKITDRK >>XP_005267192 (OMIM: 192320) PREDICTED: VIP peptides is (168 aa) initn: 201 init1: 89 opt: 163 Z-score: 210.0 bits: 45.3 E(85289): 5.9e-05 Smith-Waterman score: 163; 29.8% identity (65.3% similar) in 121 aa overlap (20-136:51-166) 10 20 30 40 pF1KE3 MKSIYFVAGLFVMLVQGSWQRSLQDTEEKSRSFSASQADPLSDPDQMNE : ::.. . .. : . : : ... XP_005 SQTSAWPLYRAPSALRLGDRIPFEGANEPDQVSLKEDIDMLQNALAENDTPYYD---VSR 30 40 50 60 70 50 60 70 80 90 100 pF1KE3 DKRHSQGTFTSDYSKYLDSRRAQDFVQWLMNTKRNRNNIAKRHDEFERHAEGTFTSDVSS . ::..:.::::.:: : . :. ... ::. :: .::.. .::....::.. . XP_005 NARHADGVFTSDFSKLLGQLSAKKYLESLMG-KRV-SNISEDPVPVKRHSDAVFTDNYTR 80 90 100 110 120 130 110 120 130 140 150 160 pF1KE3 YLEGQAAKEFIAWLVKGR---GRR-DFPEEVAIVEELGRRHADGSFSDEMNTILDNLAAR . .:.:... ...:. :. :::::. XP_005 LRKQMAVKKYLNSILNGKRSEGESPDFPEELEK 140 150 160 170 180 pF1KE3 DFINWLIQTKITDRK >>NP_919416 (OMIM: 192320) VIP peptides isoform 2 prepro (169 aa) initn: 188 init1: 89 opt: 161 Z-score: 207.5 bits: 44.9 E(85289): 8.1e-05 Smith-Waterman score: 161; 29.5% identity (64.8% similar) in 122 aa overlap (20-136:51-167) 10 20 30 40 pF1KE3 MKSIYFVAGLFVMLVQGSWQRSLQDTEEKSRSFSASQADPLSDPDQMNE : ::.. . .. : . : : ... NP_919 SQTSAWPLYRAPSALRLGDRIPFEGANEPDQVSLKEDIDMLQNALAENDTPYYD---VSR 30 40 50 60 70 50 60 70 80 90 100 pF1KE3 DKRHSQGTFTSDYSKYLDSRRAQDFVQWLMNTKRNRNNIAKRHDEFERHAEGTFTSDVSS . ::..:.::::.:: : . :. ... ::. :: .::.. .::....::.. . NP_919 NARHADGVFTSDFSKLLGQLSAKKYLESLMG-KRV-SNISEDPVPVKRHSDAVFTDNYTR 80 90 100 110 120 130 110 120 130 140 150 160 pF1KE3 YLEGQAAKEFIAWLVKGR----GRR-DFPEEVAIVEELGRRHADGSFSDEMNTILDNLAA . .:.:... ...:. :. :::::. NP_919 LRKQMAVKKYLNSILNGKRSSEGESPDFPEELEK 140 150 160 170 180 pF1KE3 RDFINWLIQTKITDRK >>NP_004114 (OMIM: 137240) gastric inhibitory polypeptid (153 aa) initn: 111 init1: 111 opt: 143 Z-score: 186.2 bits: 40.8 E(85289): 0.0012 Smith-Waterman score: 143; 29.5% identity (64.2% similar) in 95 aa overlap (27-117:22-114) 10 20 30 40 50 pF1KE3 MKSIYFVAGLFVMLVQGSWQRSLQDTEEKSRSFSASQADPLSDPDQMNEDK----RHSQG :.: ::: . :... ... . :...: NP_004 MVATKTFALLLLSLFLAVGLGEKKEGHFSALPSLPVGSHAKVSSPQPRGPRYAEG 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 TFTSDYSKYLDSRRAQDFVQWLMNTKRNRNNIAKRHDEFERHAEGTFTSDVSSYLEGQAA :: :::: .:. . ::::.::. : ..:. .:. .:.:.. .. .. : .:. NP_004 TFISDYSIAMDKIHQQDFVNWLLAQKGKKNDW--KHNITQREARALELASQANRKEEEAV 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 KEFIAWLVKGRGRRDFPEEVAIVEELGRRHADGSFSDEMNTILDNLAARDFINWLIQTKI . NP_004 EPQSSPAKNPSDEDLLRDLLIQELLACLLDQTNLCRLRSR 120 130 140 150 180 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 02:22:30 2016 done: Mon Nov 7 02:22:30 2016 Total Scan time: 4.950 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]