GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:01:50 Sequence gi568815597f:28136184_28337975 : 201792 bp : 46.54% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 177 172 6 1.05 1.01 Sngl - 14838 13810 1029 0 0 85 37 1671 0.802 158.80 1.00 Prom - 20150 20111 40 -4.96 2.15 PlyA - 20373 20368 6 1.05 2.14 Term - 28281 28111 171 0 0 69 33 110 0.245 1.43 2.13 Intr - 37940 37914 27 2 0 96 70 41 0.286 1.41 2.12 Intr - 39418 39285 134 2 2 103 54 23 0.169 0.76 2.11 Intr - 40685 40583 103 2 1 110 25 72 0.256 2.85 2.10 Intr - 61551 61466 86 1 2 71 80 86 0.489 5.64 2.09 Intr - 65187 65073 115 0 1 98 92 171 0.987 18.52 2.08 Intr - 67639 67564 76 0 1 92 84 82 0.998 7.72 2.07 Intr - 69166 69075 92 1 2 66 83 50 0.989 1.09 2.06 Intr - 72230 72159 72 2 0 60 105 57 0.945 4.20 2.05 Intr - 73883 73789 95 0 2 5 93 137 0.971 5.58 2.04 Intr - 74454 74388 67 0 1 90 63 52 0.950 1.38 2.03 Intr - 78813 78757 57 0 0 63 92 44 0.452 1.38 2.02 Intr - 92840 92739 102 2 0 71 83 20 0.127 0.17 2.01 Init - 96815 96738 78 2 0 92 51 66 0.268 4.37 2.00 Prom - 97263 97224 40 -5.76 3.00 Prom + 97719 97758 40 -10.05 3.01 Init + 100001 100087 87 1 0 87 101 163 0.914 16.29 3.02 Intr + 100178 100269 92 1 2 93 93 120 0.913 11.79 3.03 Term + 101654 101795 142 2 1 65 42 183 0.999 8.80 3.04 PlyA + 101898 101903 6 1.05 4.00 Prom + 104654 104693 40 -9.16 4.01 Init + 108455 108504 50 1 2 91 95 6 0.401 2.10 4.02 Intr + 117226 117367 142 1 1 62 73 53 0.499 1.56 4.03 Intr + 123301 123552 252 0 0 25 -30 316 0.944 11.13 4.04 Intr + 123560 123754 195 1 0 1 99 281 0.835 20.11 4.05 Intr + 131675 131899 225 1 0 116 75 81 0.865 7.78 4.06 Intr + 133000 133065 66 0 0 94 99 76 0.992 8.40 4.07 Intr + 135491 135688 198 1 0 92 89 248 0.988 24.85 4.08 Intr + 136101 136283 183 2 0 83 61 266 0.789 23.38 4.09 Intr + 136398 136610 213 2 0 149 98 225 0.999 28.51 4.10 Intr + 137175 137325 151 2 1 47 92 344 0.999 30.44 4.11 Intr + 137857 137975 119 2 2 110 91 133 0.996 15.98 4.12 Intr + 138642 138832 191 2 2 68 99 316 0.968 29.08 4.13 Intr + 142842 143058 217 0 1 119 86 188 0.726 20.21 4.14 Term + 144533 144619 87 1 0 109 41 180 0.999 13.06 4.15 PlyA + 146279 146284 6 1.05 5.00 Prom + 189013 189052 40 -3.16 5.01 Init + 192759 192812 54 2 0 79 68 54 0.778 3.80 5.02 Intr + 194414 194552 139 1 1 69 110 137 0.788 14.04 5.03 Term + 198234 198787 554 1 2 122 42 553 0.994 48.58 5.04 PlyA + 199647 199652 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:28136184_28337975|GENSCAN_predicted_peptide_1|342_aa MEPHDSSHMDSEFRYTLFPIVYSIIFVLGVIANGYVLWVFARLYPCKKFNEIKIFMVNLT MADMLFLITLPLWIVYYQNQGNWILPKFLCNVAGCLFFINTYCSVAFLGVITYNRFQAVT RPIKTAQANTRKRGISLSLVIWVAIVGAASYFLILDSTNTVPDSAGSGNVTRCFEHYEKG SVPVLIIHIFIVFSFFLVFLIILFCNLVIIRTLLMQPVQQQRNAEVKRRALWMVCTVLAV FIICFVPHHVVQLPWTLAELGFQDSKFHQAINDAHQVTLCLLSTNCVLDPVIYCFLTKKF RKHLTEKFYSMRSSRKCSRATTDTVTEVVVPFNQIPGNSLKN >gi568815597f:28136184_28337975|GENSCAN_predicted_CDS_1|1029_bp atggagccacatgactcctcccacatggactctgagttccgatacactctcttcccgatt gtttacagcatcatctttgtgctcggggtcattgctaatggctacgtgctgtgggtcttt gcccgcctgtacccttgcaagaaattcaatgagataaagatcttcatggtgaacctcacc atggcggacatgctcttcttgatcaccctgccactttggattgtctactaccaaaaccag ggcaactggatactccccaaattcctgtgcaacgtggctggctgccttttcttcatcaac acctactgctctgtggccttcctgggcgtcatcacttataaccgcttccaggcagtaact cggcccatcaagactgctcaggccaacacccgcaagcgtggcatctctttgtccttggtc atctgggtggccattgtgggagctgcatcctacttcctcatcctggactccaccaacaca gtgcccgacagtgctggctcaggcaacgtcactcgctgctttgagcattacgagaagggc agcgtgccagtcctcatcatccacatcttcatcgtgttcagcttcttcctggtcttcctc atcatcctcttctgcaacctggtcatcatccgtaccttgctcatgcagccggtgcagcag cagcgcaacgctgaagtcaagcgccgggcgctgtggatggtgtgcacggtcttggcggtg ttcatcatctgcttcgtgccccaccacgtggtgcagctgccctggacccttgctgagctg ggcttccaggacagcaaattccaccaggccattaatgatgcacatcaggtcaccctctgc ctccttagcaccaactgtgtcttagaccctgttatctactgtttcctcaccaagaagttc cgcaagcacctcaccgaaaagttctacagcatgcgcagtagccggaaatgctcccgggcc accacggatacggtcactgaagtggttgtgccattcaaccagatccctggcaattccctc aaaaattag >gi568815597f:28136184_28337975|GENSCAN_predicted_peptide_2|424_aa MAASGESGTSGGGGSTEEAFMTFYSEVKQIEKRDSVLTSKNQIERLTRPGSSYFNLNPFE VLQIDPEVTDEEIKKRFRQLSILVHPDKNQDDADRAQKAFEAVDKAYKLLLDQEQKKRAL DVIQAGKEYVEHTVKERKKQLKKEGKPTIVEEDDPELFKQAVYKQTMKLFAELEIKRKER EAKEMHERKRQREEEIEAQEKAKREREWQKNFEESRDGRVDSWRNFQANTKGKKEKKNRT FLRPPKVKMEQLEQIGSPAVEPTAGKRTFVGRGEVEGASQLLASGAVYTPPHPSLFEKPL CSAQVLAAQLGSSCGGPDMGLCTHSLVKYDRTSRLSPHTSGCSPLALTEYPAVFQVFGVH DESGEKPLSKEYLDPRECAHLECGVPPPSPLLVLALLVNAPSGLRGCKELLVQELWMNLD VSYL >gi568815597f:28136184_28337975|GENSCAN_predicted_CDS_2|1275_bp atggcggcttcaggagagagcgggacttcaggcggcggaggcagcaccgaggaagcattt atgaccttctacagtgaggtgaaacaaatagagaagagagactcggttctaacttcgaaa aatcagattgaaagactgacccgtcctggttcctcttacttcaatttgaacccatttgag gttcttcagatagatcctgaagttacagatgaagaaataaaaaagaggtttcggcagtta tccatcttggtgcatcctgacaaaaatcaagatgatgctgacagagcacaaaaggctttt gaagctgtggacaaagcttacaagttgctactggatcaggagcaaaagaagagggccctg gatgtaattcaggcaggaaaagaatacgtggaacacactgtgaaagagcgaaaaaaacaa ttaaagaaggaaggaaaacctacaattgtagaggaggatgatcctgagctgttcaaacaa gctgtatataaacagacaatgaaactctttgcagagctggaaattaaaaggaaagagaga gaagccaaagagatgcatgaaaggaaacgacaaagggaagaagagattgaagctcaagaa aaagccaaacgggaaagagagtggcagaaaaactttgaggaaagtcgagatggtcgtgtg gacagctggcgaaacttccaagccaatacgaaggggaagaaagagaagaaaaatcggacc ttcctgagaccaccgaaagtaaaaatggagcaactggagcagataggctccccagctgtg gagccaactgctggcaagaggacctttgtgggcagaggtgaggtggaaggagccagccag ctcctggcctcgggcgctgtctacaccccaccccatccctcgctctttgagaagcctctg tgcagtgcccaggtcctggcggcccagcttggaagttcctgtggaggtccagacatgggg ctgtgcactcactcacttgtgaaatatgacagaaccagcaggctctctccccacacctct ggctgctctcctttagctctgactgaatacccagctgtgttccaggtgtttggggtgcat gatgagagcggggagaagcctctgagcaaggagtacttggatccaagagagtgtgcccac ttggaatgtggtgttcccccgccttcaccacttctagtccttgccctgttggtgaatgca ccctcaggcctgcggggctgcaaggaactgctggtgcaggagctgtggatgaacttggat gtatcttatttatag >gi568815597f:28136184_28337975|GENSCAN_predicted_peptide_3|106_aa MAVTALAARTWLGVWGVRTMQARGFGSDQSENVDRGAGSIREAGGAFGKREQAEEERYFR AQSREQLAALKKHHEEEIVHHKKEIERLQKEIERHKQKIKMLKHDD >gi568815597f:28136184_28337975|GENSCAN_predicted_CDS_3|321_bp atggcagtgacggcgttggcggcgcggacgtggcttggcgtgtggggcgtgaggaccatg caagcccgaggcttcggctcggatcagtccgagaatgtcgaccggggcgcgggctccatc cgggaagccggtggggccttcggaaagagagagcaggctgaagaggaacgatatttccga gcacagagtagagaacaactggcagctttgaaaaaacaccatgaagaagaaatcgttcat cataagaaggagattgagcgtctgcagaaagaaattgagcgccataagcagaagatcaaa atgctaaaacatgatgattaa >gi568815597f:28136184_28337975|GENSCAN_predicted_peptide_4|762_aa MTLSNSFELKLLQGPGRKAERREEAAVEKLKPSIVCFMRLKERSLLNNRKIQGEAAHADG EAAARAELLVSEPGERWQFRGGDAEERWVREQPWPLRTSEAVKTPALRPFPGPRGRSPFP KPDWGKSPAPKRPFSDSGAFWSPERRPGKLPGGAQSRLHSGVPPKPTRVHGSSASRDRVL ARTMIVADSECRAELKDYLRFAPGGVGDSGPGETSMNDLALLSTKCDLVSGKQNRQFDDS VFRSKSPGNFGVIGEGQKKVPRLLWFSFAKGVTTGTGPVVSGSPWEGEEQRESRARRGPR GPSAFIPVEEVLREGAESLEQHLGLEALMSSGRVDNLAVVMGLHPDYFTSFWRLHYLLLH TDGPLASSWRHYIAIMAAARHQCSYLVGSHMAEFLQTGGDPEWLLGLHRAPEKLRKLSEI NKLLAHRPWLITKEHIQALLKTGEHTWSLAELIQALVLLTHCHSLSSFVFGCGILPEGDA DGSPAPQAPTPPSEQSSPPSRDPLNNSGGFESARDVEALMERMQQLQESLLRDEGTSQEE MESRFELEKSESLLVTPSADILEPSPHPDMLCFVEDPTFGYEDFTRRGAQAPPTFRAQDY TWEDHGYSLIQRLYPEGGQLLDEKFQAAYSLTYNTIAMHSGVDTSVLRRAIWNYIHCVFG IRVGLRGGSCLWEESMSNAAGTFPSRYDDYDYGEVNQLLERNLKVYIKTVACYPEKTTRR MYNLFWRHFRHSEKVHVNLLLLEARMQAALLYALRAITRYMT >gi568815597f:28136184_28337975|GENSCAN_predicted_CDS_4|2289_bp atgaccctgtccaacagttttgaactcaagttgctacaggggccaggcaggaaagctgag agacgtgaggaagctgctgtagaaaagcttaaacctagcatagtttgcttcatgaggttg aaagaaagaagccttctcaataacagaaaaatacaaggtgaagcagcccatgctgatgga gaagctgctgcacgggctgaactgctggtgtcagagcccggcgagcgctggcagttccgc ggcggggatgctgaggagcgctgggtccgggagcagccctggcccctgcggacttccgag gccgtgaaaacccctgcgctgcggcccttcccaggcccccgaggccgttcgccgttcccg aagcccgactgggggaagagtccagcaccaaagcggccgttctcggattccggagcgttc tggagccccgagagacgccccgggaagctccccggcggcgcccagtcccggcttcattcg ggcgtccctccgaaacccactcgggtgcacgggtcgtcggcgagccgcgaccgggtcctg gcgcgcaccatgatcgtggcggactccgagtgccgcgcagagctcaaggactacctgcgg ttcgccccgggcggcgtcggcgactcgggccccggagagactagtatgaatgacctggcg ctgctctccactaaatgtgatctggtgtcagggaaacaaaacagacaatttgatgacagt gtttttagaagcaaaagccctgggaattttggggtcataggtgagggccagaaaaaggtt cctagactattgtggtttagctttgcaaaaggagtgaccactgggactgggccagttgta tctggaagcccatgggagggggaggagcagagggagagccgggctcggcgaggccctcga gggcccagcgccttcatccccgtggaggaggtccttcgggagggggctgagagcctcgag cagcacctggggctggaggcactgatgtcctctgggcgagtagacaacctggcagtggtg atgggcctgcaccctgactactttaccagcttctggcgcctgcactacctgctgctgcac acggatggtcccttggccagctcctggcgccactacattgccatcatggctgccgcccgc catcagtgttcttacctggtaggctcccacatggccgagtttctgcagactggtggtgac cctgagtggctgctgggcctccaccgggcccccgagaagctgcgcaaactcagcgagatc aacaagttgctggcgcatcggccatggctcatcaccaaggaacacatccaggccttgctg aagaccggcgagcacacttggtccctggccgagctcattcaggctctggtcctgctcacc cactgccactcgctctcctccttcgtgtttggctgtggcatcctccctgagggggatgca gatggcagccctgccccccaggcacctacaccccctagtgaacagagcagccccccaagc agggacccgttgaacaactctgggggctttgagtctgcccgcgacgtggaggcgctgatg gagcgcatgcagcagctgcaggagagcctgctgcgggatgaggggacgtcccaggaggag atggagagccgctttgagctggagaagtcagagagcctgctggtgaccccctcagctgac atcctggagccctctccacacccagacatgctgtgctttgtggaagaccctactttcgga tatgaggacttcactcggagaggggctcaggcaccccctaccttccgggcccaggattat acctgggaagaccatggctactcgctgatccagcggctttaccctgagggtgggcagctg ctggatgagaagttccaggcagcctatagcctcacctacaataccatcgccatgcacagt ggtgtggacacctccgtgctccgcagggccatctggaactatatccactgcgtctttggc atcagggtggggttgcggggagggagctgcctttgggaagaaagcatgagtaatgctgct gggacctttccatccagatatgatgactatgattatggggaggtgaaccagctcctggag cggaacctcaaggtctatatcaagacagtggcctgctacccagagaagaccacccgaaga atgtacaacctcttctggaggcacttccgccactcagagaaggtccacgtgaacttgctg ctcctggaggcgcgcatgcaagccgctctgctgtacgccctccgtgccatcacccgctac atgacctga >gi568815597f:28136184_28337975|GENSCAN_predicted_peptide_5|248_aa MKLFGLLGTHTFGNGSFEVYPVPYLTGGSESSCVVFNLDTMEAPPVTMMPVTGGTINMME YLLQGSVLDHSLESLIHRLRGLCDNMEPETFLDHEMVFLLKGQQASPFVLRARRSMDRAG APWHLRYLGQPEMGDKNRHALVRNCVDIATSENLTDFLMEMGFRMDHEFVAKGHLFRKGI MKIMVYKIFRILVPGNTDSTEALSLSYLVELSVVAPAGQDMVSDDMKNFAEQLKPLVHLE KIDPKRLM >gi568815597f:28136184_28337975|GENSCAN_predicted_CDS_5|747_bp atgaagctcttcggccttctcggcacccatacgtttgggaacggcagtttcgaggtatat cccgtgccttacctgactgggggctctgagtccagttgtgttgtcttcaacttagacacc atggaggcacctccagtcaccatgatgcctgtcactgggggcaccattaacatgatggag tacctgttgcagggaagtgttttagatcacagtttggaaagcctcatccaccgccttcgt ggtttgtgtgacaacatggaacctgagactttccttgaccatgagatggtattcctcctt aagggccagcaagccagcccatttgttctcagggcccgacgctctatggacagggcaggg gcaccctggcatctgcgctacctgggacagccagaaatgggagacaagaaccgccatgcc ctggtgcgaaactgcgtggacattgccacatctgagaacctcaccgacttcttgatggaa atgggcttccgcatggaccatgagtttgttgctaagggacatttgttccgtaagggcatc atgaagattatggtgtacaagattttccgcatcctggtgccagggaacacagacagcact gaggccttgtcactctcctatctcgtggaattaagtgtggtagcacccgctgggcaggac atggtctctgatgacatgaagaacttcgcagaacagctaaaacctctggttcacctagag aaaatagaccccaagaggctcatgtga