GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:43:34 Sequence gi568815595r:47462603_47677929 : 215327 bp : 45.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 569 564 6 1.05 1.02 Term - 6378 5751 628 2 1 68 49 160 0.034 3.93 1.01 Init - 13416 13370 47 0 2 35 65 104 0.015 0.96 1.00 Prom - 22439 22400 40 -3.06 2.15 PlyA - 24579 24574 6 1.05 2.14 Term - 33595 33467 129 1 0 115 43 96 0.985 5.98 2.13 Intr - 35864 35684 181 1 1 76 85 172 0.712 15.57 2.12 Intr - 39249 38992 258 2 0 81 42 230 0.703 14.28 2.11 Intr - 41846 41728 119 2 2 83 82 100 0.053 8.16 2.10 Intr - 51419 51068 352 1 1 58 65 143 0.006 4.33 2.09 Intr - 79818 79723 96 2 0 46 80 108 0.210 4.92 2.08 Intr - 80144 79842 303 1 0 55 47 117 0.091 0.01 2.07 Intr - 87559 87355 205 2 1 45 67 295 0.759 21.36 2.06 Intr - 100159 100048 112 1 1 122 55 92 0.036 9.25 2.05 Intr - 104521 104450 72 1 0 78 70 37 0.553 0.50 2.04 Intr - 106625 106550 76 1 1 81 73 78 0.700 5.12 2.03 Intr - 110272 110084 189 0 0 74 89 414 0.393 38.80 2.02 Intr - 115326 114231 1096 1 1 119 110 1013 0.988 95.63 2.01 Init - 116091 115995 97 0 1 103 77 215 0.981 20.37 2.00 Prom - 118898 118859 40 -2.26 3.11 PlyA - 119183 119178 6 1.05 3.10 Term - 128235 128051 185 1 2 98 38 75 0.012 1.21 3.09 Intr - 147725 147464 262 2 1 109 110 203 0.989 21.66 3.08 Intr - 159739 159605 135 1 0 43 87 120 0.267 8.16 3.07 Intr - 160106 159966 141 2 0 88 62 27 0.158 0.65 3.06 Intr - 172742 172588 155 0 2 29 47 138 0.900 3.59 3.05 Intr - 173534 173420 115 2 1 108 91 90 0.927 11.32 3.04 Intr - 176178 176123 56 1 2 68 98 77 0.643 5.40 3.03 Intr - 198853 198692 162 2 0 30 97 175 0.369 12.55 3.02 Intr - 199990 199732 259 1 1 95 47 159 0.346 9.54 3.01 Intr - 212986 212873 114 1 0 58 86 85 0.533 5.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 47652 47543 110 1 2 83 48 131 0.912 7.27 S.002 Init - 51250 51068 183 1 0 60 65 154 0.880 8.75 S.003 Sngl - 58695 58543 153 0 0 91 44 151 0.947 5.30 S.004 Init - 89886 89736 151 0 1 71 100 26 0.854 2.30 S.005 Term - 100159 99998 162 1 0 122 42 127 0.959 9.44 S.006 Term - 167559 167416 144 0 0 84 48 110 0.921 4.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:47462603_47677929|GENSCAN_predicted_peptide_1|224_aa MRTPLGARAEGGGAGGMFSLFYIETIDPGAICRKLKLDPFLTPYTKINSRWIKDLNVRPK TIKTLEENLGNTIQAIGMGKDFMTKTPKAMATKAKIDKWDLIKLKSFYTAKETTIRLNRQ PTEWEKIFTIYPSDKGLISRIYKELKQIYKKKIKQPYQKVGEAYEQTLLKRRHLCSQQTH EKMLVVTGHQRNANQNHNEIPSHQLEWGSLKSQETTGAGEDVEK >gi568815595r:47462603_47677929|GENSCAN_predicted_CDS_1|675_bp atgcgcacgccgctgggcgcacgtgcggagggcggcggggcgggaggaatgttttcatta ttctacatagaaactatcgatccaggagccatatgtagaaagctgaaactggatcccttc cttacaccttatacaaaaatcaattcaagatggattaaagacttaaatgttagacctaaa accataaaaaccctagaagaaaacctaggcaataccattcaggccataggcatgggcaag gacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggat ctaattaaactaaagagcttctacacagcaaaagaaactaccatcagactgaacaggcaa cctacagagtgggagaaaatttttacaatctacccatctgacaaagggctaatatccaga atctacaaagaacttaaacagatttacaagaaaaaaatcaaacaaccctatcaaaaagtg ggtgaagcatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacat gaaaaaatgcttgtcgtcactggccatcagagaaatgcaaatcaaaaccacaatgagata ccctcacaccagttagaatggggatcattaaaaagtcaggaaacaacaggtgctggagag gatgtggagaaatag >gi568815595r:47462603_47677929|GENSCAN_predicted_peptide_2|1094_aa MGRAGGGGPGRGPPPLLLFLGAALVLASGAVPAREAGSAVEAEELVKGSPAWEPPANDTR EEAGPPAAGEDEASWTAPGGELAGPEEVLQESAAVTGTAWLEADSPGLGGVTAEAGSGDA QALPATLQAPHEVLGQSIMPPAIPEATEASGPPSPTPGDKLSPASELPKESPLEVWLNLG GSTPDPQGPELTYPFQGTLEPQPASDIIDIDYFEGLDGEGRGADLGSFPGSPGTSENHPD TEGETPSWSLLDLYDDFTPFDESDFYPTTSFYDDLDEEEEEEEDDKDAVGGGDLEDENEL LVPTGKPGLGPGTGQPTSRWHAVPPQHTLGSVPGSSIALRPRPGEPGRDLASSENGTECR SGFVRHNGSCRSVCDLFPSYCHNGGQCYLVENIGAFCRCNTQDYIWHKGMRCESIITDFQ VMCVAVGSAALVLLLLFMMTVFFAKKLYLLKTENTKLRRTNKFRTPSELHNDNFSLSTIA EGSHPNLQTPFFFSEGEKKELLLEGLATQPDDPSAPHKIQEVLKSCLKEEESFNIQNSMS PKLEGGKEQDPDDEEEEGEGKEKEKEKKKKKKKKKEKEKEKEKEKEKEKEKEKEKEKGRR RRRRRRRRRRRRRRRRRTARFLVRPALWPARAEVHPCSTDGETETLGQTGSMSKATTRLP GVTTRKPSAMALLLTLLATGMSSHRKKDLPLAPLASQALPNARHRTLASGIHRELSIDPA NLQKSGGEHDQQEHTAGPSDTERDKLVTSPSSQECGAYRRRLTATCGEKEPGAGQRARPR EEEYSVAWSLGGPLGGLGRHADLRGMSRQRPLAVAQRSPSFPTTWDSTDVYARAQRSHRG TRLIGCGWLAKVALRMRASLREPGCRGVSLTMARERGQLVFLEGLKSAVDVVFQAQKEPH PLQFLREANAGNLKPLFEFVREALKPVDSGEARWTYPVLLVDDLSVLLSLGMGAVAVLDF IHYCRATVCWELKVLMELPSLRPPVTSKNSDSKEQASILYLMQGNMVVLVHDSGDAEDEE NDILLNGLSHQSHLILRAEGLATGFCRDVHGQLRILWRRPSQPAVHRDQSFTYQYKIQDK SVSFFAKGMSPAVL >gi568815595r:47462603_47677929|GENSCAN_predicted_CDS_2|3285_bp atggggcgagccgggggcgggggcccgggccgggggccgccgccactgctgctgtttctg ggggccgcgctggtcctggcctctggggccgtgccggcgcgtgaggcgggcagcgcggtt gaggccgaagagctggtgaagggcagcccggcgtgggagccgcctgccaacgacacgcgg gaagaagccggcccaccagcggctggggaagatgaggcgtcgtggacggcgcccggtggc gagctggccgggccagaagaggtgctgcaggagtcggctgcggtgaccggcaccgcctgg ctggaagctgacagcccaggcctgggaggagtgaccgcagaggcgggcagcggcgatgcc caggcccttccagctacgctccaggctccccacgaggtcctcgggcagtcaatcatgccc cctgccattcctgaggctacagaggccagcgggccaccctcccccacccccggcgacaag ctgagcccagcttctgaactccccaaggagagccccttggaggtttggctgaacctgggg ggcagcacacccgaccctcaagggccagagctgacttacccatttcagggcaccctggag ccccaaccggcatcagatatcattgacatcgactacttcgaaggactggatggtgagggt cgtggcgcagatctggggagcttcccagggtcaccaggaacctcagagaaccaccctgat actgagggagagaccccttcctggagcctgcttgacttatacgatgatttcacccccttc gatgaatctgatttctaccccaccacatccttttatgatgacttggatgaagaggaggag gaagaggaggatgacaaagatgcagtaggaggtggagacctagaagatgaaaatgagctt ctagtgcccactgggaagcctggtctggggcccgggacaggccagcccaccagtcggtgg catgctgtccctccacagcacactctggggtcggtccccggcagcagcatcgccctcagg ccccgcccaggagagccaggcagggacttggcctccagtgaaaatggcactgagtgccgc agtggctttgtgcggcataacggctcctgccggtcagtgtgcgacctcttcccaagttac tgtcacaatggcggccagtgctacctggtggagaacataggggccttctgcaggtgcaac acgcaggactacatctggcacaaggggatgcgctgcgagtccatcatcaccgacttccag gtgatgtgcgtggccgtgggctcggctgccctcgtcctgctcctgctcttcatgatgacg gtgttctttgccaagaagctctacctgctcaagacggagaataccaagctgcgtaggacc aacaaattccggaccccatctgagctccacaatgataacttctccctctccaccattgcc gagggctctcacccaaatcttcagacacctttcttcttctcagagggggaaaagaaagag ctcctcttggagggcttggctacacagccggatgatcctagtgctccccacaaaatccag gaggttctcaagtcctgcctgaaagaggaggagtcatttaacatccagaactccatgtcg cccaaacttgagggtggcaaagagcaagacccggatgatgaagaagaagaaggagaaggg aaggagaaggagaaggagaagaagaagaagaagaagaagaagaaggagaaggagaaggag aaggagaaggagaaggagaaggagaaggagaaggagaaggagaaggagaagggaaggaga aggagaaggaggagaaggagaaggagaaggagaaggagaaggagaaggaggacagcacgc ttcctggtccggccggctctatggccagccagggcagaagttcacccttgttctacagat ggagaaacagaaaccctggggcagaccgggtccatgtccaaggccacgacaaggttgcca ggagtaaccaccagaaagccaagtgccatggctttgctcctcactctgctggcaactgga atgagttcccacaggaaaaaagacctccctttggctcccctggcttcccaggccctgcca aatgccagacaccggactctagcttctggcattcatagagaactatccatagaccctgcc aacctgcaaaaaagtggaggagagcatgaccagcaggagcacactgcaggcccctctgac acggagcgggacaagctggtaaccagcccaagctcccaggagtgtggggcatacagaagg cgcttgacagccacgtgtggagaaaaggagcccggggctgggcagagagcgaggccaagg gaagaagaatactcggtcgcgtggagccttgggggcccactgggagggctcgggcgccac gcagatctgcgcggaatgagccgccagaggccgctggcagtcgcacagagaagcccatcg tttcctaccacctgggactcgacggacgtctacgcgcgtgcgcagcggtcccatcgaggc acccgcctcattggctgtggctggctcgcaaaagtggcattgcgcatgcgcgcttccttg cgcgagccgggctgtcggggtgtcagcctgaccatggcgcgggagcgtgggcagcttgtg ttccttgagggactcaagtctgcagtggacgtcgtcttccaggctcaaaaggagccacac cccctgcagtttctcagggaggctaatgctgggaacttgaaaccattgtttgagtttgta cgggaggccctgaagccagtagacagtggagaggctcggtggacgtacccggtgctgttg gtggacgacctcagtgtgctcctgagcctgggcatgggggcggtggctgtgctagacttc attcactactgcagagccaccgtgtgctgggaactaaaggtactcatggaactgccttct ctgcgcccacctgtgacctccaagaactcagacagtaaggagcaggcttccatcttgtat ctgatgcagggaaacatggtggtccttgtgcacgacagtggagatgcggaggatgaggag aatgacatcctgctgaatggcctcagtcatcagagccatctgatactgcgggctgagggc ctggccactggcttctgcagggatgtgcacgggcagctgaggatcctgtggaggagacca tcgcagcccgcagtccaccgggatcagagcttcacttaccagtataagatacaggacaaa agcgtgtccttttttgccaaaggaatgtctcctgctgttctgtga >gi568815595r:47462603_47677929|GENSCAN_predicted_peptide_3|527_aa VPAAQQMLNFPEKNKEKPVDLQNFGLRTDIYSKKTLAKALEMYKDDWNKVSEHVGSRTQD ECILHFLRLPIEDPYLENSDASLGPLAYQPVPFSQSGNPVMSTVAFLASVVDPRVASAAA KAALEEFSRVREEVPLELVEAHVKKVQEAARASGKVDPTYGLESSCIAGTGPDEPEKLEG AEEEKMEADPDGQQPEKAENKVENETDEGDKAQDGENEKNSEKEQDSEVSEDTKSEEKET EENKELTDTCKERESDTGKKKVEHEISEGNVATAAAAALASAATKAKKSFREERIVRGQT ECQDGVVGGKVPPVLIFGPSGSWFALSFLIALLKHLAAVEERKIKSLVALLVETQMKKLE IKLRHFEELETIMDREKEALEQQRQQLLTERQNFHMEQLKYAELRARQQMEQQQHGQNPQ QAHQHSGGPGLAPLGAAGHPGMMPHQQPPPYPLMHHQMPPPHPPQPGQIPGPGSMMPGQH MPGRMIPTVAANIHPSGSGPTPPGMPPMPGNILGPRVPLTAPNGMCK >gi568815595r:47462603_47677929|GENSCAN_predicted_CDS_3|1584_bp gttcctgctgctcaacagatgctaaattttcctgagaaaaacaaggaaaaaccagttgat ttgcagaactttggtctccgtactgacatttactccaagaaaacattagcaaaggccctg gagatgtacaaggatgattggaacaaagtgtcggaacatgttggaagtcgtactcaggat gaatgcatcctccactttttgagacttcccattgaggacccataccttgagaattcagat gcttcccttgggcctttggcctaccagcctgtccccttcagtcagtcaggaaatccagtt atgagtactgttgcttttttggcatctgtggtggaccctcgcgtggcatctgctgcagca aaagcggctttggaggagttttctcgggtccgggaggaggtaccactggaattggttgaa gctcatgtcaagaaagtacaagaagcagcacgagcctctgggaaagtggatcccacctac ggtctggagagcagctgcattgcaggcacagggcccgatgagccagagaagcttgaagga gctgaagaggaaaaaatggaagccgaccctgatggtcagcagcctgaaaaggcagaaaat aaagtggaaaatgaaacggatgaaggtgataaagcacaagatggagaaaatgaaaaaaat agtgaaaaggaacaggatagtgaagtgagtgaggataccaaatcagaagaaaaggagact gaagagaacaaagaactcactgatacatgtaaagaaagagaaagtgatactgggaagaag aaagtagaacatgaaatttccgaaggaaatgttgccacagccgcagcagctgctcttgcc tcagcggctaccaaagccaagaaaagttttagggaggaaaggatagtgaggggacagacc gagtgccaggatggggtggtaggtggcaaagtacctcctgtcctcatctttggcccatca gggtcctggtttgccctctccttcttgatagcccttttgaagcacctggctgcagtggaa gaaagaaagatcaagtccctggtagctctcttggttgagacacaaatgaagaaactagag atcaaacttcgacattttgaagagctggaaactatcatggacagagagaaagaagctcta gaacaacagaggcagcagttgcttactgaacgccaaaacttccacatggaacagctgaag tatgctgaattacgagcacgacagcaaatggaacagcagcagcatggccagaaccctcaa caggcacaccagcactcaggaggacctggcctggccccacttggagcagcagggcaccct ggcatgatgcctcatcaacagccccctccctaccctctgatgcaccaccagatgccacca cctcatccaccccagccaggtcagataccaggcccaggttccatgatgcccgggcagcac atgccaggccgcatgattcccactgttgcagccaacatccacccctctgggagtggccct acccctcctggcatgccaccaatgccaggaaacatcttaggaccccgggtacccctgaca gcacctaacggcatgtgtaagtaa