GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:15:45 Sequence gi568815580f:63384429_63593221 : 208793 bp : 40.07% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.10 PlyA - 577 572 6 1.05 1.09 Term - 9121 8975 147 1 0 61 44 175 0.883 7.32 1.08 Intr - 12825 12606 220 2 1 72 115 170 0.979 15.48 1.07 Intr - 14895 14814 82 1 1 76 97 72 0.947 4.68 1.06 Intr - 19398 19279 120 1 0 84 61 107 0.952 7.15 1.05 Intr - 23071 23004 68 0 2 56 4 77 0.320 -6.27 1.04 Intr - 26018 25862 157 0 1 69 71 128 0.759 7.35 1.03 Intr - 27150 27039 112 0 1 75 54 111 0.873 5.43 1.02 Intr - 31302 31188 115 2 1 36 23 170 0.527 4.83 1.01 Init - 31593 31478 116 0 2 45 21 124 0.347 1.13 1.00 Prom - 36177 36138 40 -6.55 2.00 Prom + 36830 36869 40 -9.55 2.01 Init + 37798 37908 111 0 0 45 44 96 0.762 1.18 2.02 Intr + 38398 38543 146 0 2 102 77 100 0.876 8.46 2.03 Term + 39367 39475 109 1 1 111 45 65 0.887 1.50 2.04 PlyA + 40553 40558 6 1.05 3.06 PlyA - 40713 40708 6 1.05 3.05 Term - 47102 46996 107 0 2 73 48 107 0.960 2.79 3.04 Intr - 51320 51152 169 2 1 -81 87 203 0.044 1.90 3.03 Intr - 56127 55777 351 0 0 56 13 184 0.006 2.29 3.02 Intr - 62182 61983 200 2 2 65 100 112 0.627 8.25 3.01 Init - 71617 71575 43 1 1 62 81 37 0.196 1.03 3.00 Prom - 71772 71733 40 -5.55 4.00 Prom + 72183 72222 40 -4.95 4.01 Init + 80816 80872 57 1 0 74 72 49 0.317 3.30 4.02 Intr + 83325 83514 190 2 1 18 41 208 0.027 7.34 4.03 Intr + 92554 92617 64 2 1 57 115 65 0.001 2.96 4.04 Intr + 99994 100168 175 1 1 112 115 123 0.084 16.42 4.05 Intr + 102518 102655 138 1 0 94 98 152 0.999 16.54 4.06 Intr + 104919 105000 82 2 1 91 77 119 0.966 9.39 4.07 Intr + 105822 105947 126 1 0 54 117 104 0.965 9.63 4.08 Intr + 108525 108667 143 1 2 65 95 170 0.964 14.55 4.09 Intr + 111493 111544 52 0 1 101 107 10 0.961 1.76 4.10 Term + 112656 112972 317 1 2 72 48 213 0.920 9.82 4.11 PlyA + 113217 113222 6 1.05 5.00 Prom + 114493 114532 40 -7.65 5.01 Init + 114722 114859 138 1 0 89 116 83 0.562 11.39 5.02 Term + 118902 119294 393 2 0 115 42 303 0.996 22.45 5.03 PlyA + 119384 119389 6 1.05 6.00 Prom + 120507 120546 40 -5.05 6.01 Init + 126769 127060 292 0 1 67 38 138 0.198 4.06 6.02 Intr + 134197 134270 74 2 2 96 66 47 0.076 1.51 6.03 Intr + 134636 134756 121 1 1 29 110 114 0.085 6.85 6.04 Term + 138949 139097 149 2 2 23 42 152 0.785 1.18 6.05 PlyA + 139120 139125 6 1.05 7.00 Prom + 145091 145130 40 -5.55 7.01 Init + 165261 165344 84 2 0 36 91 88 0.851 4.77 7.02 Intr + 171714 171899 186 2 0 99 94 105 0.961 11.16 7.03 Intr + 173924 174058 135 1 0 68 66 135 0.993 9.14 7.04 Intr + 175150 175290 141 0 0 102 109 114 0.999 14.53 7.05 Intr + 176657 176774 118 1 1 34 116 40 0.697 0.52 7.06 Intr + 179550 179692 143 1 2 76 14 134 0.587 3.95 7.07 Intr + 181017 181184 168 2 0 85 84 163 0.998 14.82 7.08 Intr + 182179 182474 296 0 2 111 38 280 0.711 20.18 7.09 Intr + 184271 184319 49 2 1 105 66 40 0.266 1.26 7.10 Term + 196561 196734 174 0 0 22 44 213 0.487 7.08 7.11 PlyA + 198094 198099 6 1.05 8.00 Prom + 198196 198235 40 -4.05 8.01 Init + 203829 204108 280 2 1 64 6 188 0.650 5.47 8.02 Intr + 204223 204404 182 2 2 114 31 206 0.995 16.17 8.03 Intr + 205228 205287 60 0 0 104 83 60 0.550 5.11 8.04 Intr + 207920 208048 129 1 0 48 108 79 0.981 5.87 8.05 Intr + 208426 208554 129 0 0 86 55 98 0.737 6.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 51322 51152 171 2 0 -41 87 202 0.908 6.19 S.002 Intr - 56127 55821 307 0 1 56 52 170 0.908 5.30 S.003 Term - 86488 86352 137 2 2 75 49 119 0.870 3.90 S.004 Init + 95454 95576 123 2 0 80 64 110 0.906 8.02 S.005 Term + 96427 96576 150 0 0 70 39 118 0.890 2.03 S.006 Sngl - 99376 99143 234 1 0 62 46 213 0.862 9.35 S.007 Init + 100001 100168 168 1 0 86 115 95 0.916 11.58 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580f:63384429_63593221|GENSCAN_predicted_peptide_1|378_aa MIGGLDKQIKDITEVIEQPVKHSKLFEALGITQPRVVLLSISSSQLEGGSGGDSEVLHIV LELLNQLDSFEAPKKIKKAIDLASKAAQEDKAGNYEEALQLYQHAVQYFLHVVKYEAQGD KAKQSIRAKCTEYLDRAEKLKEYLKNKEKKAQKPVKEGQPSPADEKGNDSDGEGESDDPE KKKLQNQLQGAIVIERPNVKWSDVAGLEGAKEALKEAVILPIKFPHLFTGVGVDNDGILV LGATNIPWVLDSAIRRRFEKRIYIPLPEPHARAAMFKLHLGTTQNSLTEADFRELGRKTD GYSGADISIIVRDALMQPVRKVQSATHFKKVRGPSRADPNHLVDDLLTPCSPGDPGAIEM TWMDVPGDKLLEPVVSMV >gi568815580f:63384429_63593221|GENSCAN_predicted_CDS_1|1137_bp atgattggtggactggacaagcagattaaggacatcacagaagtgatcgagcagcctgtt aagcattccaagctctttgaagcactgggtatcacacaacccagggtggtactgctctcc atcagctcctcgcagctggaggggggttctggaggggacagtgaagtgctgcacatcgtg ctggaactgctcaaccaactggacagctttgaggcccccaagaagatcaagaaagcgata gatctggctagcaaagcagcgcaagaagacaaggctgggaactacgaagaagcccttcag ctctatcagcatgctgtgcagtattttcttcatgtcgttaaatatgaagcacagggtgat aaagccaagcaaagtatcagggcaaagtgtacagaatatcttgatagagcagaaaaacta aaggagtacctgaaaaataaagagaaaaaagcacagaagccagtgaaagaaggacagccg agtccagcagatgagaaggggaatgacagtgatggggaaggagaatctgatgatcctgaa aaaaagaaactacagaatcaacttcaaggtgccattgttatagaacgaccaaatgtgaaa tggagtgacgttgctggacttgaaggagccaaagaagcactgaaagaggctgtgatactg cctattaaatttcctcatctttttacaggggttggtgtagacaatgatggaattttggtt ctgggagctacaaatataccctgggttctggattctgccattaggcgaagatttgagaaa cgaatttatattcccttgccggaaccccatgcccgagcagcaatgtttaaactgcaccta gggaccactcagaacagtctcacggaagcagactttcgggaacttgggaggaaaacagat ggttattcaggggcagatataagtatcattgtacgtgatgcccttatgcagcctgttagg aaagtacagtcagctactcattttaaaaaggttcgcggaccttcccgagctgatcctaac catcttgtagatgatctgctaacaccttgctctccaggtgaccctggtgccattgaaatg acatggatggatgtccctggagataaacttttggagccagttgtttccatggtttga >gi568815580f:63384429_63593221|GENSCAN_predicted_peptide_2|121_aa MIPGGWAKWMTWRSSQAVPKGTRGEESQQQQRRSAHGLLSEKVVHFQDPPTGGFGPVTAP SLRVLCGMGAITVLVSVVNNSGGSARIVLPNLYHPGSKHRALSLPCIYPSFLPTCEFAFA K >gi568815580f:63384429_63593221|GENSCAN_predicted_CDS_2|366_bp atgatacctggaggttgggcgaagtggatgacatggcggagttcccaggcggttcccaag ggaacgaggggcgaggagagccaacagcagcaacgtcgaagcgcgcacgggcttttgagt gaaaaagtcgtccatttccaagacccgccgactgggggctttgggcctgtgactgcgcct tcactccgtgtcctctgtggaatgggggcgatcaccgtcctcgtctcggtggtgaataat tcaggagggagtgcgcgaatagtattgccaaatctgtatcatccaggctcgaagcacaga gccctctctctgccctgcatctacccgtcctttcttcccacttgtgaatttgccttcgca aaatga >gi568815580f:63384429_63593221|GENSCAN_predicted_peptide_3|289_aa MEYYSATKTNEVPTVYRCRWTFPRQQETYSFINTYPMAREHKALSGQLQAGRPQGSMLIG QTLVTCPSPDQLVLSKSVTNETPRERERKASQKCDQQSEGVSVFLGPGQSRPVRDDWTCA AEGYSHPESPVLPELGAEMQSSPAKRHSPTQQTGTWPPKPELCSYFTATISINITADLEA VNVCKLWIHNVNPPSRVLEPEKKDGEKHQAEEQLYSIGRISLTWSLSELTRLHGEALGIK CQNSLQDLSERPYKDLRVDLVTAILALQILPIQKTTGMSYSGEEALSGQ >gi568815580f:63384429_63593221|GENSCAN_predicted_CDS_3|870_bp atggagtattactcagccacaaaaacaaatgaagtaccgacagtctaccgctgcaggtgg actttccccaggcaacaggagacctacagcttcatcaacacctaccccatggccagagag cacaaagctctatccggccagctccaggcaggcagacctcaaggcagcatgctgattggc caaactctggtcacatgtccatctccagaccaattagtattgtccaagtctgttaccaat gaaaccccaagggaaagggaaagaaaagcaagccaaaagtgtgaccagcagtccgaagga gtgtcagtgttcctgggcccaggccaaagcaggcctgtcagggatgactggacatgtgct gctgagggttacagccatcctgagagcccagtcttgccagagttgggagcagaaatgcag tccagcccagcaaagagacacagcccaacccagcaaacaggaacttggcccccaaagcca gagctgtgctcatatttcactgccacaatctccatcaatataactgctgatctggaagct gtgaatgtttgtaagctctggatacataatgtcaatccaccttccagagttctggagccc gaaaagaaggatggagaaaaacaccaagcagaggagcagctttactcaatcggtagaata tccctaacttggagcctatcagagctgacccgactacatggagaagccttgggcattaag tgtcagaattcactgcaagatctgagtgagagaccttacaaagatctgcgtgttgattta gtgacagcaatcttggcactccaaattttgccaatccaaaagacaactggaatgagttac tctggagaagaggcactcagtggacaataa >gi568815580f:63384429_63593221|GENSCAN_predicted_peptide_4|447_aa MRDSRKASTMDTSHALLRGPDQLQTRGELSIEVLNKWDQDPEAFFEELLQEVQHGFTSMT LNAKHHHSNGYQETEVGQRKWTGLCAPRLPVPFPRIFQDNCDSRPAMDALQLANSAFAVD LFKQLCEKEPLGNVLFSPICLSTSLSLAQVGAKGDTANEIGQVLHFENVKDVPFGFQTVT SDVNKLSSFYSLKLIKRLYVDKSLNLSTEFISSTKRPYAKELETVDFKDKLEETKAPHAG RSSDSPGVQEAPRFFLRLSASPRRLPRRTPRPSLKIAGHFENILADNSVNDQTKILVVNA AYFVGKWMKKFSESETKECPFRVNKNFLSDITKPFLMPRGVSAQSIHLNSLPTAAPHFRC GQTGVLDQVVSWDIDTAAKFIGAEADTVGVVGSGASIGIAFGSLIIGYSRNPSLKQQLFS YAILGFALSEAMGLFCLMVAFLILFAM >gi568815580f:63384429_63593221|GENSCAN_predicted_CDS_4|1344_bp atgagggactccaggaaggcatctaccatggacaccagtcatgctctcctcagggggcca gatcagctgcagacaagaggagagctttcaattgaagttttaaacaagtgggatcaagat cctgaagcattctttgaagaactgttacaggaagtgcaacatggctttaccagtatgacc ctgaatgcaaagcaccatcacagcaatggctaccaagagacagaagtgggccagcgaaag tggactggtctttgtgctcctcgcttgcctgttccttttccacgcattttccaggataac tgtgactccaggcccgcaatggatgccctgcaactagcaaattcggcttttgccgttgat ctgttcaaacaactatgtgaaaaggagccactgggcaatgtcctcttctctccaatctgt ctctccacctctctgtcacttgctcaagtgggtgctaaaggtgacactgcaaatgaaatt ggacaggttcttcattttgaaaatgtcaaagatgtaccctttggatttcaaacagtaaca tcggatgtaaacaaacttagttccttttactcactgaaactaatcaagcggctctacgta gacaaatctctgaatctttctacagagttcatcagctctacgaagagaccgtatgcaaag gaattggaaactgttgacttcaaagataaattggaagaaacgaaagctccccatgctggt cgttcctctgatagccccggggtccaagaagctcctaggttctttctccggctcagtgct tcccccaggcgtctccctcgtcggactcctcgtccctcccttaaaattgccggccacttt gagaacattttagctgacaacagtgtgaacgaccagaccaaaatccttgtggttaatgct gcctactttgttggcaagtggatgaagaaattttctgaatcagaaacaaaagaatgtcct ttcagagtcaacaagaactttttgagtgacattactaaaccttttcttatgccccgtggt gtctccgcccagagcattcatctaaacagccttcctacagcagctccccactttaggtgt ggccagacaggagttctagaccaggttgtctcctgggacattgacaccgcagccaagttt attggtgctgaggcagacacagttggtgtggttggttcaggggctagcattggaatagcg tttggcagcttgatcattggttattccaggaacccatctctcaagcagcagctcttctcc tatgcgattctgggctttgccctgtctgaggccatggggctcttctgtttgatggtcgcc ttccttatcctcttcgccatgtga >gi568815580f:63384429_63593221|GENSCAN_predicted_peptide_5|176_aa MEATFCMGNIDSINCKIIELPFQNKHLSMFILLPKDVEDESTGLEKIEKQLNSESLSQWT NPSTMANAKVKLSIPKFKVEKMIDPKACLENLGLKHIFSEDTSDFSGMSETKGVALSNVI HKVCLEITEDGGDSIEVPGARILQHKDELNADHPFIYIIRHNKTRNIIFFGKFCSP >gi568815580f:63384429_63593221|GENSCAN_predicted_CDS_5|531_bp atggaggccacgttctgtatgggaaacattgacagtatcaattgtaagatcatagagctt ccttttcaaaataagcatctcagcatgttcatcctactacccaaggatgtggaggatgag tccacaggcttggagaagattgaaaaacaactcaactcagagtcactgtcacagtggact aatcccagcaccatggccaatgccaaggtcaaactctccattccaaaatttaaggtggaa aagatgattgatcccaaggcttgtctggaaaatctagggctgaaacatatcttcagtgaa gacacatctgatttctctggaatgtcagagaccaagggagtggccctatcaaatgttatc cacaaagtgtgcttagaaataactgaagatggtggggattccatagaggtgccaggagca cggatcctgcagcacaaggatgaattgaatgctgaccatccctttatttacatcatcagg cacaacaaaactcgaaacattattttctttggcaaattctgttctccttaa >gi568815580f:63384429_63593221|GENSCAN_predicted_peptide_6|211_aa MSEGRWYQLIYCKEQSDSRDDMLRPDWSQPGVQKESLPSLVPSFLREDAWIKGAGCLRTK AWVLERWTLVHSTQINKPSFLCQELLSIAKFSQPLEGDGAVRHIITVLLRAWNGQLLQLF FKFRVGTLKYEAHRVWWYWVLVSRSKQGTAQKHFYPYEHPSTGQLLKGKETNLLQRDCFR IWEACLDNLEVKPDEKVLEFSQAQEESVPKL >gi568815580f:63384429_63593221|GENSCAN_predicted_CDS_6|636_bp atgagtgaggggcgctggtatcagctgatttattgcaaggagcagtctgatagccgtgat gacatgctgagacctgactggagtcaacctggggtccaaaaggagtcgttgccatctcta gttccatctttccttagggaggatgcctggataaagggagctggatgcctaagaaccaaa gcctgggtactggagagatggacacttgtgcacagcacccaaataaacaagccttctttc ctttgtcaagagctgctgtcgatagctaaatttagtcagccgctagagggagatggcgct gtccgtcacattataacagtgctgctcagggcctggaacggtcagttgctacagcttttc tttaagttcagggtggggacactgaaatacgaggctcatagagtctggtggtactgggtc ttggtgagtaggagcaagcaaggcactgcccaaaagcacttctacccgtatgagcatccg tccacaggccaattactaaaaggtaaagaaacaaacctccttcagcgtgattgcttccgc atatgggaagcttgtttagataatctggaagtcaaacctgatgaaaaggtgcttgaattc agtcaagcacaggaagagtctgtgcccaagttatga >gi568815580f:63384429_63593221|GENSCAN_predicted_peptide_7|497_aa MCLPEREFTAVLRWTLSARRSFLQPVVLIVISFTMDSLVTANTKFCFDLFQEIGKDDRHK NIFFSPLSLSAALGMVRLGARSDSAHQIDEVLHFNEFSQNESKEPDPCLKSNKQKVLADS SLEGQKKTTEPLDQQAGSLNNESGLVSCYFGQLLSKLDRIKTDYTLSIANRLYGEQEFPI CQEYLDGVIQFYHTTIESVDFQKNPEKSRQEINFWVECQSQGKIKELFSKDAINAETVLV LVNAVYFKAKWETYFDHENTVDAPFCLNANENKSVKMMTQKGLYRIGFIEEVKAQILEMR YTKGKLSMFVLLPSHSKDNLKGLEELERKITYEKMVAWSSSENMSEESVVLSFPRFTLED SYDLNSILQDMGITDIFDETRADLTGISPSPNLYLSKIIHKTFVEVDENGTQAAAATGAV VSESCHQSFSVATQFFCRRQPLNDYTQYSMESIANAVEENPSRKNTLEMWKDYTTEDAII VIEKAMTAVKPETINSC >gi568815580f:63384429_63593221|GENSCAN_predicted_CDS_7|1494_bp atgtgccttccagagagggagttcacggccgtcctgagatggactctcagtgctaggagg tcctttctacaaccagtggtcttgatcgttataagttttacaatggactctcttgttaca gcaaacaccaaattttgctttgatctttttcaagagataggcaaagatgatcgtcataaa aacatatttttctctcccctgagcctctcagctgcccttggtatggtacgcttgggtgct agaagtgacagtgcacatcagattgatgaggtactacacttcaacgaattttcccagaat gaaagcaaagaacctgacccttgtctgaaaagcaacaaacaaaaagtgctggctgacagc tctctggaggggcagaaaaaaacgacagagcctctggatcagcaggctgggtccttaaac aatgagagcggactggtcagctgctactttgggcagcttctctccaaattagacaggatc aagactgattacacactgagtattgccaacaggctttatggagagcaggaattcccaatc tgtcaggaatacttagatggtgtgattcaattttaccacacgacgattgaaagtgttgat ttccaaaaaaaccctgaaaaatccagacaagagattaacttctgggttgaatgtcaatcc caaggtaaaatcaaggaactcttcagcaaggacgctattaatgctgagactgtgctggta ctggtgaatgctgtttacttcaaggccaaatgggaaacatactttgaccatgaaaacacg gtggatgcacctttctgtctaaatgcgaatgaaaacaagagtgtgaagatgatgacgcaa aaaggcctctacagaattggcttcatagaggaggtgaaggcacagatcctggaaatgagg tacaccaaggggaagctcagcatgttcgtgctgctgccatctcactctaaagataacctg aagggtctggaagagcttgaaaggaaaatcacctatgaaaaaatggtggcctggagcagc tcagaaaacatgtcagaagaatcggtggtcctgtccttcccccggttcaccctggaagac agctatgatctcaattccattttacaagacatgggcattacggatatctttgatgaaacg agggctgatcttactggaatctctccaagtcccaatttgtacttgtcaaaaattatccac aaaacctttgtggaggtggatgaaaacggtacccaggcagctgcagccactggggctgtt gtctcggaaagctgccaccagtcgttcagtgtggccacgcagtttttttgcagaaggcag cctctaaatgattacacacagtactccatggaaagtattgccaacgctgtagaagagaac cccagtagaaagaacactctggaaatgtggaaggattataccactgaagatgccatcatt gttatagaaaaagccatgacagctgtcaagcctgaaacaataaattcctgctag >gi568815580f:63384429_63593221|GENSCAN_predicted_peptide_8|260_aa MGPTSGQTGVAWTRAGSTWGSFHEEEAVERQRQAGSDREQENKAIKTLTLLRGEIRTEHH RMTEDSSRLMDVSQQEKAKRGREKQAAATFHQKGLAKIIMDSLGAVSTRLGFDLFKELKK TNDGNIFFSPVGILTAIGMVLLGTRGATASQLEEVFHSEKETKSSRIKAEEKEVIENTEA VHQQFQKFLTEISKLTNDYELNITNRLFGEKTYLFLQKYLDYVEKYYHASLEPVDFVNAA DESRKKINSWVESKTNGRVW >gi568815580f:63384429_63593221|GENSCAN_predicted_CDS_8|780_bp atgggccccacatctggacaaactggtgtcgcctggacaagggctgggtctacctgggga agcttccatgaagaggaggctgtggagaggcagagacaggcagggtcagacagagagcaa gagaataaagccattaaaacattaaccctgctccgcggggaaataagaactgagcaccac cggatgacggaagactccagtagattgatggatgtctcccagcaagagaaggccaagaga ggacgtgagaagcaggcagcagcgacctttcaccaaaagggtctcgctaaaatcatcatg gattcacttggcgccgtcagcactcgacttgggtttgatcttttcaaagagctgaagaaa acaaatgatggcaacatcttcttttcccctgtgggcatcttgactgcaattggcatggtc ctcctggggacccgaggagccaccgcttcccagttggaggaggtgtttcactctgaaaaa gagacgaagagctcaagaataaaggctgaagaaaaagaggtgattgagaacacagaagca gtacatcaacaattccaaaagtttttgactgaaataagcaaactcactaatgattatgaa ctgaacataaccaacaggctgtttggagaaaaaacatacctcttccttcaaaaatactta gattatgttgaaaaatattatcatgcatctctggaacctgttgattttgtaaatgcagcc gatgaaagtcgaaagaagattaattcctgggttgaaagcaaaacaaatggtagagtatgg