GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:43:57

Sequence gi568815597r:56829407_57065948 : 236542 bp : 42.14% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2541   2690  150  2  0   87   70    77 0.260   5.89
 1.02 Intr +   6653   6774  122  1  2  100   73    64 0.497   4.47
 1.03 Intr +  12125  12286  162  2  0   30   75   101 0.260   1.17
 1.04 Intr +  15533  15689  157  2  1   34   81   115 0.496   4.49
 1.05 Term +  17182  17484  303  0  0   29   48   158 0.187   0.09
 1.06 PlyA +  19095  19100    6                               1.05

 2.00 Prom +  19527  19566   40                              -9.25
 2.01 Init +  23095  23167   73  0  1   80   67   140 0.095  12.38
 2.02 Intr +  35898  36079  182  1  2   50  -26   142 0.178  -2.33
 2.03 Intr +  38203  38296   94  0  1   91  100    58 0.814   5.92
 2.04 Intr +  41095  41192   98  2  2   89   92    74 0.955   6.71
 2.05 Intr +  45543  45687  145  2  1   -8  109   165 0.804   7.93
 2.06 Intr +  46656  46803  148  1  1   45   60   222 0.936  13.47
 2.07 Intr +  52039  52228  190  1  1   47  100   266 0.928  22.37
 2.08 Intr +  54075  54206  132  2  0   99   60   158 0.512  14.02
 2.09 Intr +  56521  56761  241  0  1   36  111   215 0.808  14.80
 2.10 Intr +  77261  77374  114  0  0  121   23    49 0.067   1.20
 2.11 Intr +  78550  78707  158  2  2   51  107    72 0.034   4.21
 2.12 Intr +  82997  83224  228  1  0  110   73   205 0.674  18.34
 2.13 Term +  88159  88392  234  0  0   69   48   229 0.602  12.34
 2.14 PlyA +  88576  88581    6                               1.05

 3.14 PlyA -  88843  88838    6                               1.05
 3.13 Term - 100152  99991  162  0  0   98   43   180 0.986  11.45
 3.12 Intr - 101166 101099   68  1  2   81   81     0 0.865  -3.69
 3.11 Intr - 102472 102404   69  2  0  112   89    82 0.958   8.94
 3.10 Intr - 104082 103929  154  0  1  124   76   135 0.963  14.62
 3.09 Intr - 111606 111443  164  1  2   85   81   263 0.618  24.07
 3.08 Intr - 114418 114290  129  2  0  102  113    85 0.977  12.15
 3.07 Intr - 116655 116415  241  0  1   39   59   304 0.970  18.80
 3.06 Intr - 120346 120149  198  1  0  113   93   105 0.941  12.13
 3.05 Intr - 122774 122642  133  1  1  126  110    48 0.987  10.53
 3.04 Intr - 125421 125280  142  1  1   88  106   103 0.997  10.59
 3.03 Intr - 127504 127363  142  1  1   78   75    81 0.887   4.81
 3.02 Intr - 129669 129454  216  0  0   -5   74   140 0.273   1.08
 3.01 Init - 130781 130614  168  2  0   64   98   107 0.670   8.88
 3.00 Prom - 130848 130809   40                             -14.06

 4.00 Prom + 130933 130972   40                              -9.45
 4.01 Init + 131143 131233   91  0  1   87   20    66 0.238   0.40
 4.02 Intr + 131857 132068  212  2  2   95   72   178 0.893  14.61
 4.03 Intr + 133706 133894  189  1  0   79   57   132 0.793   8.06
 4.04 Intr + 134573 134651   79  1  1   71  110    23 0.349   1.01
 4.05 Intr + 141767 141946  180  0  0   95   32   145 0.403   8.52
 4.06 Term + 143383 143513  131  2  2   31   49   125 0.557   0.26
 4.07 PlyA + 144522 144527    6                               1.05

 5.00 Prom + 158567 158606   40                              -4.65
 5.01 Init + 168627 168741  115  2  1   71   59   135 0.837   7.33
 5.02 Intr + 172522 172632  111  2  0   54   56   111 0.688   3.93
 5.03 Term + 181003 181241  239  2  2   13   49   234 0.002   7.35
 5.04 PlyA + 181549 181554    6                               1.05

 6.12 PlyA - 183504 183499    6                               1.05
 6.11 Term - 183633 183524  110  1  2   35   43    86 0.178  -3.51
 6.10 Intr - 186025 185477  549  2  0  115   94   374 0.089  32.61
 6.09 Intr - 190242 190128  115  0  1   49   87    43 0.151  -0.60
 6.08 Intr - 190877 190736  142  1  1   37   75    46 0.144  -2.37
 6.07 Intr - 194002 193882  121  2  1   98   69    86 0.448   6.33
 6.06 Intr - 194233 194125  109  1  1  106  115   109 0.991  14.34
 6.05 Intr - 196637 196575   63  2  0  116  105    31 0.804   5.60
 6.04 Intr - 204027 203980   48  0  0   80  116    39 0.897   3.86
 6.03 Intr - 204167 204117   51  2  0  101  111    81 0.959   9.89
 6.02 Intr - 231169 231086   84  1  0   70   87    46 0.070   1.80
 6.01 Intr - 233537 233484   54  2  0   94   95    46 0.210   4.16

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 23095 23313 219 0 0 80 48 219 0.890 12.01 S.002 Term + 180991 181241 251 2 2 99 49 244 0.985 16.48 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:56829407_57065948|GENSCAN_predicted_peptide_1|297_aa MTGKNKSHFSVYLGEKELIQMQGLPLQVHLRKPQSCTMTTRGSARPIPAQVFQSENTIGH ISLRAELSIFGHIITGVWSGVQPSAFWGLPGKIIIPADVVVESTEAMHVKSSELASLPDA EAVPSRCYFIIIGRAWDNSIPLMGSATSQRPVKSETQSVLKKWRHKCTVIVFLATVYVGG INIHPGAAEIRSTRTRLLINIPKTGKKKRFNWTYSSTWLGRLQNHGRRQKALLTRQWQEK MRKMQKQKPLIKPSDLVRLIHYHKNSMGGTALMIQIISYRVLPTTRGNYGSTIQDPI >gi568815597r:56829407_57065948|GENSCAN_predicted_CDS_1|894_bp atgactgggaagaacaaatcacatttctctgtgtacttaggggagaaagaattaattcaa atgcagggtctgccgcttcaggtccatttacggaagccgcaatcctgcactatgaccacc agggggagcgctaggcccatccctgcacaggttttccagagtgaaaatacgattggccat atttctctccgggcagaactttccatattcggtcacatcataactggcgtttggtcaggt gtccagcctagtgcattctgggggctgccagggaaaataatcatcccagctgatgtagtt gtagaaagcaccgaggcaatgcatgtgaagagctcagagctcgcttcattgcctgatgca gaggcagtgccaagcagatgctactttattattattgggagagcatgggataatagtatt cccctcatgggcagtgctacatcacaaagacctgtgaaaagtgaaacccagagcgttctc aagaaatggaggcacaaatgcacagttattgtttttcttgcaactgtgtacgtggggggc attaacattcacccaggagctgctgaaatccgctcaaccagaacaaggctgttgataaat atacccaagactgggaagaaaaagaggtttaattggacttatagttccacatggctgggg agacttcagaatcatggcaggaggcaaaaggcacttcttacacggcagtggcaagagaaa atgaggaagatgcaaaagcagaaacccctgataaaaccatcagatcttgtgagacttatt cactaccataagaacagtatggggggaaccgccctcatgattcaaattatctcctaccgg gtccttcccacaacacgtgggaattatgggagtacaattcaagatccgatctga >gi568815597r:56829407_57065948|GENSCAN_predicted_peptide_2|678_aa MTDEDSSDGRPSVSKATAAAKHEEVLETGKSKIKALADSVSGEGLLLRLTGDRLLAVSSR GGRDVGGPCDLFNEGTNSIHEGFTLESKTGSYTRSSYLPAEQLVRVDRLLSVPGQKDTVM TLSAIQTIQGNILISETLIMSAMAGFPNKYRHRSLLQPNKFGGTICSGDIWDQASCSSST TCVRQAQCGQDFQCKETGRCLKRHLVCNGDQDCLDGSDEDDCEDVRAIDEDCSQYEPIPG SQKAALGYNILTQEDAQSVYDASYYGGQCETVYNGEWRELRYDSTCERLYYGDDEKYFRK 
PYNFLKYHFEALADTGISSEFYDNANDLLSKVKKDKSDSFGVTIGIGPAGSPLLKFIFTR IFTKVQTAHFKMRKDDIMLDEGMLQSLMELPDQYNYGMYAKFINDYGTHYITSGSMGGIY EYILVIDKAKMESLGITSRDITTCFGGSLGIQYEDKINVGGGLSGDHCKKFGERARKAMA VEDIISRVRGGSSGWSGGLAQNRSTITYRSWGRSLKYNPVVIDFEMQPIHEVLRHTSLGP LEAKRQNLRRALDQYLMEFNACRCGPCFNNGVPILEGTSCRCQCRLGSLGAACEQTQTEG KEPKQMGAGVAGAPGLYAEQASRKGEESVTIQHLRMEGPRVQGGKYRRRLAEGLWTQAGP DAVDVDPCTDYWIKTSFN >gi568815597r:56829407_57065948|GENSCAN_predicted_CDS_2|2037_bp atgacggatgaagatagttcagatggaaggcccagtgtgagcaaagccacagcagcagca aagcatgaggaagttctggagactgggaagtccaagatcaaggcactggcagattcagtg tctggtgagggcctacttcttcggttaacaggtgatcgtcttctagctgtgtcctcacgt ggtggaagggatgtgggaggtccctgtgatctctttaatgagggcactaattccattcat gagggcttcaccctcgagagtaagacgggcagctacacccgcagcagttacctgccagct gagcaactggtcagagtggacagattgctttccgtgccaggacaaaaagacactgtgatg acacttagtgctatccagacaatccagggtaatatcctcatctcagaaactttaataatg tctgcaatggccggctttccaaataagtaccgacaccggagcctcttgcagccaaacaag tttgggggaaccatctgcagtggtgacatctgggatcaagccagctgctccagttctaca acttgtgtaaggcaagcacagtgtggacaggatttccagtgtaaggagacaggtcgctgc ctgaaacgccaccttgtgtgtaatggagaccaggactgccttgatggctctgatgaggac gactgtgaagatgtcagggccattgacgaagactgcagccagtatgaaccaattccagga tcacagaaggcagccttggggtacaatatcctgacccaggaagatgctcagagtgtgtac gatgccagttattatgggggccagtgtgagacggtatacaatggggaatggagggagctt cgatatgactccacctgtgaacgtctctactatggagatgatgagaaatactttcggaaa ccctacaactttctgaagtaccactttgaagccctggcagatactggaatctcctcagag ttttatgataatgcaaatgaccttctttccaaagttaaaaaagacaagtctgactcattt ggagtgaccatcggcataggcccagccggcagccctttattgaaattcattttcacaaga atcttcacaaaggtgcagactgcacattttaagatgaggaaggatgacattatgctggat gaaggaatgctgcagtcattaatggagcttccagatcagtacaattatggcatgtatgcc aagttcatcaatgactatggcacccattacatcacatctggatccatgggtggcatttat gaatatatcctggtgattgacaaagcaaaaatggaatcccttggtattaccagcagagat atcacgacatgttttggaggctccttgggcattcaatatgaagacaaaataaatgttggt ggaggtttatcaggagaccattgtaaaaaatttggagaaagggccaggaaggccatggct gtggaagacattatttctcgggtgcgaggtggcagttctggctggagcggtggcttggca 
cagaacaggagcaccattacataccgttcctgggggaggtcattaaagtataatcctgtt gttatcgattttgagatgcagcctatccacgaggtgctgcggcacacaagcctggggcct ctggaggccaagcgccagaacctgcgccgcgccttggaccagtatctgatggaattcaat gcctgccgatgtgggccttgcttcaacaatggggtgcccatcctcgagggcaccagctgc aggtgccagtgccgcctgggtagcttgggtgctgcctgtgagcaaacacagacagaaggt aaggagccaaagcagatgggagctggagttgctggagctcctggtctgtatgcagagcag gcatccaggaaaggagaagagagtgtgacaatccagcacctcagaatggaggggcctcgt gtccagggcggaaagtacagacgcaggcttgctgagggcctctggacacaggctggacca gatgctgtggatgtcgacccctgcactgactattggataaagacttctttcaactaa >gi568815597r:56829407_57065948|GENSCAN_predicted_peptide_3|661_aa MTFRGERPHSFGSNAVNKSFAKSRQMRSVDVTLMPIDCELSSWSSWTTCDPCQKKRATVQ YSLYRVLPVNTLNFRNICHSASLVIGGLTQDPTSENNRYAREKRMVAISPDPWETKPAMG LDSKEAVEYRYAYLLQPSQFHGEPCNFSDKEVEDCVTNRPCGSQVRCEGFVCAQTGRCVN RRLLCNGDNDCGDQSDEANCRRIYKKCQHEMDQYWGIGSLASGINLFTNSFEGPVLDHRY YAGGCSPHYILNTRFRKPYNVESYTPQTQGKYEFILKEYESYSDFERNVTEKMASKSGFS FGFKIPGIFELGISSQSDRGKHYIRRTKRFSHTKSVFLHARSDLEVAHYKLKPRSLMLHY EFLQRVKRLPLEYSYGEYRDLFRDFGTHYITEAVLGGIYEYTLVMNKEAMERGDYTLNNV HACAKNDFKIGGAIEEVYVSLGVSVGKCRGILNEIKDRNKRDTMVEDLVVLVRGGASEHI TTLAYQELPTADLMQEWGDAVQYNPAIIKVKVEPLYELVTATDFAYSSTVRQNMKQALEE FQKEVSSCHCAPCQGNGVPVLKGSRCDCICPVGSQGLACEVSYRKIKHSRLSTLSSCTST MPLAVIFPIPPLMGSGIAGQIGLHALEDVRQDKGSVTIHLLKMGVAPVQALLQKHLTAPS R >gi568815597r:56829407_57065948|GENSCAN_predicted_CDS_3|1986_bp atgacttttagaggtgaaaggccacattcctttgggtcaaatgcagtcaacaagagcttt gctaagagcagacagatgcggagtgtggatgttaccctgatgcccattgattgtgagctg tctagttggtcctcttggaccacatgtgacccctgtcagaagaaaagggctactgtccag tattccttgtatagggttcttccagtgaacacactcaatttccgcaatatctgtcattca gcctccttggtcataggagggctgacccaggatcctacttctgagaacaatagatatgcc agagaaaagagaatggttgctatttccccagatccttgggaaaccaaaccagccatgggc ttagactctaaagaggctgtggaatacaggtatgcctacttgctccagccctctcagttc catggggaaccgtgcaacttctctgacaaggaagtcgaagactgtgttaccaacagacca tgcggaagtcaagtgcgatgtgaaggctttgtgtgtgcacagacaggaaggtgtgtaaac cgcagacttctttgcaatggggacaatgactgtggagaccagtcagatgaagcaaactgt 
agaaggatttataaaaaatgtcagcatgaaatggaccaatactggggaattggcagtctg gccagtgggataaatttgttcacaaacagttttgagggcccagttcttgatcacaggtat tatgcaggtggatgctccccgcattacatcctgaacacgaggtttaggaagccctacaat gtggaaagctacacgccacagacccaaggcaaatacgaattcatattaaaagagtatgaa tcatactcagattttgaacgcaatgtcacagagaaaatggcaagcaagtctggtttcagt tttggttttaaaatacctggaatatttgaacttggcatcagtagtcaaagtgatcgaggc aaacactatattaggagaaccaaacgattctctcatactaaaagcgtatttctgcatgca cgctctgaccttgaagtagcacattacaagctgaaacccagaagcctcatgctccattac gagttccttcagagagttaagcggctgcccctggagtacagctacggggaatacagagat ctcttccgtgattttgggacccactacatcacagaggctgtgcttgggggcatttatgaa tacaccctcgttatgaacaaagaggccatggagagaggagattatactcttaacaacgtc catgcctgtgccaaaaatgattttaaaattggtggtgccattgaagaggtctacgtcagt ctgggtgtgtctgtaggcaaatgcagaggtattctgaatgaaataaaagacagaaacaag agggacaccatggtggaggacttggtggtcctggtacgaggaggggcaagtgagcacatc accaccctggcataccaggagctgccgacggcggacctgatgcaggagtggggagacgct gtgcagtacaacccagccatcatcaaagttaaggtggagcctctgtatgaactagtgaca gccacagattttgcctattccagcacagtgaggcagaacatgaagcaggcactggaggag ttccagaaggaagttagttcctgccactgtgctccctgccaaggaaatggagtccctgtc ctgaaaggatcacgctgtgactgcatctgtcctgttggatcccaaggcctagcctgtgag gtctcctatcggaagataaaacactcaaggttgtccaccctctcatcttgcacctcaacc atgcccctagcagtcatcttcccaatacccccattgatgggaagtggaattgctggtcaa attggtcttcatgctctggaagacgtaagacaagacaaaggcagtgtaacaatccacctc ctcaaaatgggggtagcccctgttcaggccctgcttcagaaacacttgactgctcctagc agatga >gi568815597r:56829407_57065948|GENSCAN_predicted_peptide_4|293_aa MAGAVQAQWESGEPGSRETRKRLAAAVPLSGHKKRWEDKRNGGLLSTYDRHVKAVLNGPA SPTLSSSIFTAMLEDSCGALPPGQMREAAQRDRDTSTVMQLDWQGGVFPGDAENAVASIR THLTGPRSPTLFLTGVSDQNKDGSIQYSRIGSAVNCKPSAYIDGMEFKYLSCSRQPHGDY RWMSELYPDSGSPSLDPGGQNLSPSCCSRALISVWEMPTSTVSTVDFSLRPGEPDFQKSL ADCCDNYITEGGCKRCSKCAPRRGVHQSAERRADLADVVSIIWDVLAKCKRAD >gi568815597r:56829407_57065948|GENSCAN_predicted_CDS_4|882_bp atggctggagctgtgcaggcccagtgggagtcaggagagcctggtagtagggaaacaagg aagaggctggcagcagcagtcccactgagtgggcacaaaaagagatgggaagacaaaagg 
aatggaggtttgctgagcacctatgacagacatgtcaaggctgtgctcaatggcccagca tcacccacgctatcttcttctattttcacagcaatgctggaagacagctgtggagcactg ccacctgggcagatgagggaagcggcccagagagacagggacacatccacggtcatgcag ctggactggcaaggaggtgtctttcctggggatgcagaaaatgctgtagccagcattagg acacatttaactgggcccaggtctcccactcttttcctaacaggcgtctctgatcaaaat aaagatggatccatccagtacagcaggattgggtcagcagtaaattgcaagccatcagct tatatagatgggatggaattcaaatacctgtcgtgttcaagacagccccatggggattat aggtggatgtcagaactatacccagattcaggctcaccttccctggatcctggaggacag aatctatctccctcctgttgttctagagccttaatcagtgtctgggaaatgccgacgtct actgtttccactgtagatttttccttgcggcctggggaacctgattttcaaaagtcctta gctgactgctgtgataactacatcaccgaaggtggatgcaaaaggtgctccaagtgtgct ccaaggcgaggggttcaccagagtgctgagaggagagctgacctcgcagatgttgtttcc atcatttgggatgtgctcgccaaatgcaagagggcggactga >gi568815597r:56829407_57065948|GENSCAN_predicted_peptide_5|154_aa MLMSMPVLSTHHELASMLFLRSVDLRVQRLWLLWIKEQTCSTLSSTGPCMSVSRILNEAW NMGGASTNNMKNEDMCHASMRKGKVERRHQAKGHGADTGDIAQARRNSANIRNTVQGRSD GANISNTAQARRHGANIEDVASEVKNNQKGKQIA >gi568815597r:56829407_57065948|GENSCAN_predicted_CDS_5|465_bp atgctgatgtccatgccagtcttgagcacccaccacgagctcgcatcgatgctatttctt aggtctgttgatctgcgcgtacagaggctctggctcctgtggataaaggaacaaacatgt tctacactgagctccactgggccttgcatgtcagtatcccgaattctgaatgaagcttgg aacatgggaggtgcatccactaacaacatgaagaatgaagacatgtgtcatgcatccatg agaaaaggcaaagtagaacgtagacaccaggcaaagggacatggtgcagacactggtgat atagctcaggcaagaagaaatagtgcaaacatcaggaatacagttcagggaaggagcgat ggtgcaaacatcagcaatacagctcaggcaaggagacatggtgcaaacattgaggatgtg gcttcagaagtgaagaacaaccaaaagggaaagcagatagcttaa >gi568815597r:56829407_57065948|GENSCAN_predicted_peptide_6|481_aa VPTSQKKEGVYDVPKSQPHFFPITSLNVLGSSYYQMQAHGLVYESQNGYSFEDFEERFAA ATPNRNLPTDFDEIFEATKAVTQLELFGDMSTPPDITSPPTPATPGDAFIPSSSQTLPAS ADVFSSVPFGTAAVPSGDIEKLFRGGGGVGVQSSGMLLAWNSLMSFTQTAQDSMLLRSLG SAIHTVCSCIKSFPHGVAISPIQIKEIAITLTPEACGKDHLHKNCHHCPAADVSLRSCTR PCGQTALTHRPRRASLQGTPRKSYVAMGAVLPSFWGQQPLVQQQMVMGAQPPVAQVMPGA QPIAWGQPGLFPATQQPWPTVAGQFPPAAFMPTQTVMPLPAAMFQGPLTPLATVPGTSDS 
TRSSPQTDKPRQKMGKETFKDFQMAQPPPVPSRKPDQPSLTCTSEAFSSYFNKVGVAQDT DDCDDFDISQLNLTPVTSTTPSTNSLSRKMTPSIAKCTPGMKLPQTENHCFRVLAASYKA F >gi568815597r:56829407_57065948|GENSCAN_predicted_CDS_6|1446_bp gttcccaccagccaaaagaaggaaggtgtttatgatgtgccaaaaagtcaacctcacttt ttccctattacttccttgaatgtcctggggtcctcgtattaccagatgcaagcccatggc ctggtatatgaaagccagaatggctattcgtttgaggattttgaagaacggtttgctgca gccaccccgaacagaaacctgcccacagactttgatgagatttttgaggcaacgaaggct gtgacccaattagaactttttggggacatgtccacaccccctgatataacctctcccccc actcctgcaactccaggtgatgcctttatcccatcttcatctcagacccttccagcgagt gcagatgtgtttagttctgtacctttcggcactgctgctgtaccctcaggtgacattgag aagctatttagaggaggaggtggtgttggggtccagagctctgggatgttactggcctgg aattcactgatgtctttcacccagacagctcaagattctatgttactcaggagccttgga tctgctattcatacagtgtgtagctgcatcaaatctttccctcatggagttgcaatttct cctatacaaattaaagagatagcaatcacactcaccccagaggcttgtggaaaggatcac cttcacaaaaattgccaccactgtcctgctgctgacgtgtcccttcgctcatgcacaaga ccatgtggccagacagccttaactcacaggccgaggagagccagcctgcagggaacccct aggaaaagttacgttgcaatgggcgctgtcctcccgtccttctggggtcagcagcccctc gtccaacagcagatggtcatgggtgcccagccaccagtcgctcaggtgatgccgggggct cagcccatcgcatggggccagccgggtctctttcctgccactcagcagccctggccaact gtggccgggcagtttccgccagccgccttcatgcccacacaaactgttatgcctttgcca gctgccatgttccaaggtcccctcaccccccttgccaccgtcccaggcacgagtgactcc accaggtcaagtccacagaccgacaagcccaggcagaaaatgggcaaagaaacgtttaag gatttccagatggcccagcctccgcccgtgccctcccgcaaacccgaccagccctccctc acctgtacctcagaggccttctccagttacttcaacaaagtcggggtggcacaggataca gacgactgtgatgactttgacatctcccagttgaatttgacccctgtgacttctaccaca ccatcgaccaactcactgtcgagaaaaatgactccaagcattgccaaatgtaccccaggg atgaaattgccccagactgagaaccactgctttagagtattagcagcatcctataaagcc ttctaa