GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:24:27 Sequence gi568815595r:48757713_48998794 : 241082 bp : 45.56% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 7366 7292 75 1 0 88 109 113 0.601 13.11 1.07 Intr - 15396 15243 154 2 1 109 105 91 0.998 12.97 1.06 Intr - 25380 25274 107 0 2 82 70 135 0.080 10.11 1.05 Intr - 36337 36285 53 2 2 74 90 69 0.158 4.33 1.04 Intr - 39958 39892 67 1 1 106 19 46 0.100 -2.02 1.03 Intr - 49972 49937 36 1 0 138 91 44 0.981 8.16 1.02 Intr - 84733 84285 449 2 2 61 86 95 0.170 -0.33 1.01 Init - 89884 89623 262 1 1 91 102 429 0.765 41.83 1.00 Prom - 93303 93264 40 -3.86 2.10 PlyA - 95164 95159 6 1.05 2.09 Term - 100060 99998 63 1 0 94 54 45 0.838 -0.41 2.08 Intr - 100919 100795 125 0 2 84 101 120 0.949 13.20 2.07 Intr - 101489 101380 110 1 2 70 105 114 0.988 11.23 2.06 Intr - 101915 101843 73 0 1 90 78 -13 0.777 -3.64 2.05 Intr - 104947 104830 118 1 1 139 76 133 0.790 17.24 2.04 Intr - 121736 121646 91 1 1 81 110 14 0.822 2.90 2.03 Intr - 126412 126285 128 1 2 103 109 142 0.948 17.28 2.02 Intr - 134360 134268 93 2 0 105 73 3 0.566 0.66 2.01 Init - 141082 140978 105 1 0 96 86 241 0.998 24.92 2.00 Prom - 155414 155375 40 -4.36 3.02 PlyA - 155951 155946 6 1.05 3.01 Sngl - 161437 160565 873 1 0 47 41 242 0.877 11.37 3.00 Prom - 164211 164172 40 -7.46 4.00 Prom + 165006 165045 40 -5.56 4.01 Init + 169847 170101 255 1 0 56 105 447 0.646 40.43 4.02 Intr + 178129 178178 50 0 2 39 78 42 0.017 -4.22 4.03 Intr + 187424 187469 46 2 1 89 119 27 0.094 4.31 4.04 Intr + 203900 203967 68 1 2 138 110 11 0.996 6.00 4.05 Intr + 207207 207270 64 0 1 126 84 34 0.999 5.52 4.06 Intr + 209413 209563 151 0 1 129 92 66 0.993 10.84 4.07 Intr + 210822 210943 122 1 2 57 64 72 0.854 1.91 4.08 Intr + 212883 212992 110 2 2 77 99 125 0.995 11.58 4.09 Intr + 215987 216104 118 2 1 39 77 81 0.951 2.57 4.10 Intr + 217105 217155 51 0 0 101 89 45 0.967 5.00 4.11 Intr + 217246 217267 22 0 1 94 113 8 0.978 1.02 4.12 Intr + 221770 221921 152 2 2 109 73 147 0.891 15.18 4.13 Intr + 222641 222784 144 1 0 89 65 190 0.994 17.28 4.14 Intr + 223948 224016 69 0 0 103 95 70 0.997 8.58 4.15 Intr + 225184 225304 121 0 1 82 80 62 0.413 4.87 4.16 Intr + 231959 232249 291 0 0 16 10 236 0.022 5.71 4.17 Intr + 232444 232898 455 2 2 34 100 558 0.030 44.58 4.18 Intr + 233121 233202 82 2 1 98 123 127 0.996 16.41 4.19 Term + 236494 236585 92 2 2 73 43 70 0.126 -1.12 4.20 PlyA + 236882 236887 6 1.05 5.03 PlyA - 237771 237766 6 1.05 5.02 Term - 239822 239506 317 0 2 75 50 232 0.919 13.20 5.01 Intr - 240642 240470 173 2 2 84 84 252 0.937 23.99 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 232545 232898 354 2 0 100 100 558 0.948 53.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:48757713_48998794|GENSCAN_predicted_peptide_1|401_aa MSHIQIPPGLTELLQGYTVEVLRQQPPDLVEFAVEYFTRLREARAPASVLPAATPRQSLG HPPPEPGPDRVADAKGDSESEEDEDLEVLEVVARAIRQEKEIKAIQLGKEEVKLSLFADD MIVYLENPIVSAPNLLKLISNFSKVSGYKINAQKSQAFLYTNNRKTESQVMSELPFTIAS KRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIVKMAILSKFQF LADLIDEYQDIGSLLGTWYRWHYCTSGMGHTICAETYNPDEEEEDTDPREQLSQVLDAMF ERIVKADEHVIDQGDDGDNFYVIERGTYDILVTKDNQTRSVGQYDNRGSFGELALMYNTP RAATIVATSEGSLWGLVSERMKIVDVIGEKIYKDGERIITQ >gi568815595r:48757713_48998794|GENSCAN_predicted_CDS_1|1203_bp atgagccacatccagatcccgccggggctcacggagctgctgcagggctacacggtggag gtgctgcgacagcagccgcctgacctcgtcgaattcgcagtggagtacttcacccgcctg cgcgaggcccgcgccccagcctcagtcctgcccgccgccaccccacgccagagcctgggc caccccccgccagaacccggcccggaccgtgtcgccgacgccaaaggggacagcgagtcg gaggaggacgaggacttggaagtgttggaagttgtggccagggcaattaggcaggagaag gaaataaaggctattcagttaggaaaagaggaagtcaaattgtccctgtttgcagatgac atgattgtatatctagaaaaccccattgtctcagccccaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgcacaaaaatcacaagcattcttatat accaataacagaaaaacagagagccaagtcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggag aactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattcca tgctcatggataggaagaatcaatatagtgaaaatggccatactgtccaagttccagttc ctagcagatttaatagacgagtatcaggatatagggtccctgttgggcacttggtatcga tggcattactgcacaagtggcatgggccacacaatctgtgctgagacctataaccctgat gaggaagaggaagatacagatccaagggaacagctttctcaagttctcgatgccatgttt gaaaggatagtcaaagctgatgagcatgtcattgaccaaggagatgatggagacaacttt tatgtcatagaacggggaacttatgacattttagtaacaaaagataatcaaacccgctct gttggtcaatatgacaaccgtggcagttttggagaactagctctgatgtacaacaccccg agagctgctaccattgttgctacctcagaaggctccctttggggactggtgtcagaacga atgaagattgtggatgtaataggagagaagatctataaggatggagaacgcataatcact cag >gi568815595r:48757713_48998794|GENSCAN_predicted_peptide_2|301_aa MADQPKPISPLKNLLAGGFGGVCLVFVGHPLDTVKVRLQTQPPSLPGQPPMYSGTFDCFR KTLFREGITGLYRGMAAPIIGVTPMFAVCFFGFGLGKKLQQKHPEDVLSYPQLFAAGMLS GVFTTGIMTPGERIKCLLQIQASSGESKYTGTLDCAKKLYQEFGIRGIYKGTVLTLMRDV PASGMYFMTYEWLKNIFTPEGKRVSELSAPRILVAGGIAGIFNWAVAIPPDVLKSRFQTA PPGKYPNGFRDVLRELIRDEGVTSLYKGFNAVMIRAFPANAACFLGFEVAMKFLNWATPN L >gi568815595r:48757713_48998794|GENSCAN_predicted_CDS_2|906_bp atggccgaccagccaaaacccatcagcccgctcaagaacctgctggccggcggctttggc ggcgtgtgcctggtgttcgtcggtcaccctctggacacggtcaaggtccgactgcagaca cagccaccgagtttgcctggacaacctcccatgtactctgggacctttgactgtttccgg aagactctttttagagagggcatcacggggctatatcggggaatggctgcccctatcatc ggggtcactcccatgtttgccgtgtgcttctttgggtttggtttggggaagaaactacaa cagaaacacccagaagatgtgctcagctatccccagctttttgcagctgggatgttatct ggcgtattcaccacaggaatcatgactcctggagaacggatcaagtgcttattacagatt caggcttcttcaggagaaagcaagtacactggtaccttggactgtgcaaagaagctgtac caggagtttgggatccgaggcatctacaaagggactgtgcttacccttatgcgagatgtc ccagctagtggaatgtatttcatgacatatgaatggctgaaaaatatcttcactccggag ggaaagagggtcagtgagctcagtgcccctcggatcttggtggctgggggcattgcaggg atcttcaactgggctgtggcaatccccccagatgtgctcaagtctcgattccagactgca cctcctgggaaatatcctaatggtttcagagatgtgctgagggagctgatccgggatgaa ggagtcacatccttgtacaaagggttcaatgcagtgatgatccgagccttcccagccaat gcggcctgtttccttggctttgaagttgccatgaagttccttaattgggccacccccaac ttgtga >gi568815595r:48757713_48998794|GENSCAN_predicted_peptide_3|290_aa MLGQRAGDGERPGLPGDGEGGVPARPGRRAERPPQRPAKVNKAVTCAAHLPGAAASRPLS PNKPDRVRPGQRDRIGAKRQRRRRADAGQARAASSRRVVPTAPEVLGAVASLPDRGRPTV ARVATGSRLEGLFSAASLKLSALTQSLTRVRQAPTASGATIRLPASPVEMFLTSAFLTGF SFHCLYSGIGHGEDILASVEQITIVSRPLSGQRGAGPGNSAYTPRRSQGGPRAATTPGFR FPCRGLVRRAVLRLTVTVQDCILTALLAVSFHSIGVVIMTSSYLLGPVVK >gi568815595r:48757713_48998794|GENSCAN_predicted_CDS_3|873_bp atgctcggacaacgtgccggcgacggggagcgcccgggcctcccgggcgacggcgaaggc ggagtcccggcccggccagggaggcgcgcggagaggcccccccagcggccagccaaggta aacaaggccgtgacgtgcgccgcgcacttaccgggagctgcggcctcgcggccgctgagc ccgaacaagccagaccgggtcaggccgggccagagggaccggattggggcgaagcggcag cggaggcggcgggccgacgccggtcaagcccgcgctgcttcctccagaagagtcgtcccc acagctccggaagtgcttggcgccgttgcgtcacttccggatcggggtcgacccacggtc gctcgggtcgcgacaggctcccggctagagggcctgtttagcgccgcctccttgaaactt agcgctctgacccagagtctgaccagggtacggcaggcgccgaccgcgtctggagccact attcgcctaccagcgtctcccgtcgagatgtttttaaccagcgcgtttctcaccggcttc tcatttcactgtttgtactccgggatcgggcacggagaagacatcctggcgtcagtggag cagataaccattgtttctcggccgctatctggtcagaggggagctgggccgggaaattct gcctacaccccgaggcggtcgcagggtggccccagagccgcaaccacaccaggctttcgc ttcccctgccgaggcctcgttcgccgcgcagttctccgacttacggtcaccgtgcaagat tgcatcttaactgccttgcttgcagtttcttttcacagtataggagttgtcatcatgact tccagttaccttctgggaccggttgtcaaatga >gi568815595r:48757713_48998794|GENSCAN_predicted_peptide_4|820_aa MSVDMNSQGSDSNEEDYDPNCEEEEEEEEDDPGDIEDYYVGVASDVEQQGADAFDPEEYQ FTCLTYKESEGALNEHMTSLASVLKGRSLHIEDGSGQCIADWPLMLTAISCILPMALVSH SVAKLILVNFHWQVSEILDRYKSNSAQLLVEARVQPNPSKHVPTSHPPHHCAVCMQFVRK ENLLSLACQHQFCRSCWEQHCSVLVKDGVGVGVSCMAQDCPLRTPEDFVFPLLPNEELRE KYRRYLFRDYVESHYQLQLCPGADCPMVIRVQEPRARRVQCNRCNEVFCFKCRQMYHAPT DCATIRKWLTKCADDSETANYISAHTKDCPKCNICIEKNGGCNHMQCSKCKHDFCWMCLG DWKTHGSEYYECSRYKENPDIVNQSQQAQAREALKKYLFYFERWENHNKSLQLEAQTYQR IHEKIQERVMNNLGTWIDWQYLQNAAKLLAKCRYTLQYTYPYAYYMESGPRKKLFEYQQA QLEAEIENLSWKVERADSYDRGVSAYCPLGFYIAGLPDSGETSREIRTDRGTELQECLRG GCGSYPIAAYAVPASAAPPNAALLSHLSAASSALRSPPPLLEQEDADDTFPGGKRASSRR HREAPTSVCLNVTAALAGLAAVVRATAQRGHPRGPSPGRARDLGAMAAAAVTGQRPETAA AEEASRPQWAPPDHCQAQAAAGLGDGEDAPVRPLCKPRGICSRAYFLVLMVFVHLYLGNV LALLLFVHYSNGDESSDPGPQHRAQGPGPEPTLGPLTRLEGIKVGHERKVQLVTDRDHFI RTLSLKPLLFVILNGQAGPKKSGPAIDSAPSAMLFLDLNL >gi568815595r:48757713_48998794|GENSCAN_predicted_CDS_4|2463_bp atgtcagtggacatgaatagccaggggtctgacagcaatgaagaggactatgacccaaat tgtgaggaagaggaagaagaagaagaagacgaccctggggacatagaggactattacgtg ggagtagccagcgatgtggagcagcagggggctgatgcctttgatcccgaggagtaccag ttcacttgcttgacctacaaggaatctgagggtgccctcaatgagcacatgaccagctta gcttctgtcctaaagggtaggagcctacacattgaggatggttctgggcaatgtattgct gactggccgttgatgctcactgccatcagctgcatcctccctatggcactggtatctcat tcagttgctaaacttatattagttaatttccactggcaagtttcagagatattggacaga tacaagtccaattctgctcaactgcttgttgaggctcgagttcagcctaatccatcaaaa catgttcccacatcccatccccctcaccactgtgcagtgtgtatgcagtttgtgcgaaag gaaaacctactctctctggcctgtcagcaccagttttgccgcagctgctgggagcagcac tgctcagttctcgtcaaggacggcgtgggcgtgggagtctcttgcatggctcaggactgt ccactccgtacaccagaggactttgtgtttccattgcttcccaatgaagaattgagagag aaatacaggcgctacctcttcagggactatgtggagagtcattaccagctccagctgtgc cctggtgcagactgccccatggttattcgggtacaggagcctagagctcgccgagtacag tgcaatcggtgcaacgaggtcttctgtttcaagtgtcgtcagatgtatcacgcacccaca gactgtgccacaatccggaaatggctcacgaagtgtgcagacgactctgaaacagccaac tacattagtgctcacactaaagactgtcccaagtgcaacatctgcattgagaagaatgga ggctgcaatcacatgcaatgctccaaatgtaaacacgacttctgctggatgtgtctagga gattggaagactcatggcagtgaatactatgagtgcagtcgttacaaggagaatcctgac atcgtgaaccagagccaacaagcccaggcgagggaagccctcaagaagtacttattctac tttgagaggtgggaaaaccacaataaaagcttgcagctagaggcacagacataccagcgg attcacgagaagattcaggagagggtcatgaacaatctggggacatggatcgactggcag tacctacagaatgctgccaagctcttggccaagtgtcgatacaccctgcaatacacctac ccatatgcatattacatggagtccggacccaggaagaagctgtttgaataccagcaggct cagctggaggctgagatcgaaaacctctcatggaaagtggagcgtgcagacagctatgac agaggggtaagtgcctactgtcctcttggattctatattgcaggcctgcctgactccggc gagacttcccgggagatccggaccgaccgcggaaccgagctccaagaatgcctgcgagga ggctgtgggtcctaccctatcgccgcttacgcagtcccagcctccgcggcgcccccaaat gccgcactgttgagtcatctcagcgctgcctcctccgcgctgcgctccccacccccgctc ctggagcaagaagatgcagacgatacttttcccggaggcaagagggcgtcttcacgcagg caccgagaagctcccactagtgtatgccttaatgtgaccgcggcgctcgcgggcctggcg gccgttgtccgggcgactgcgcagcgcgggcacccccgcggcccctcccctgggcgcgcg cgcgacctgggtgccatggcggcagcggcggtgacaggccagcggcctgagaccgcggcg gccgaggaggcctcgaggccgcagtgggcgccgccagaccactgccaggctcaggcggcg gccgggctgggcgacggcgaggacgcaccggtgcgtccgctgtgcaagccccgcggcatc tgctcgcgcgcctacttcctggtgctgatggtgttcgtgcacctgtacctgggtaacgtg ctggcgctgctgctcttcgtgcactacagcaacggcgacgaaagcagcgatcccgggccc caacaccgtgcccagggccccgggcccgagcccaccttaggtcccctcacccggctggag ggcatcaaggtggggcacgagcgtaaggtccagctggtcaccgacagggatcacttcatc cgaaccctcagcctcaagccgctgctcttcgttattctaaatgggcaagcagggcccaag aagagtggacctgctatagattctgccccctctgccatgctgttcttggatctaaacctg tga >gi568815595r:48757713_48998794|GENSCAN_predicted_peptide_5|163_aa XVIYPSYSLCEAMQASCEPIMACYSYPWPAILHCGRFPLGHDLCITAVANNGSRLDRPSV KVRFSRTNSTSSGPWDCNLYSRLEVLKTGPLPPAELEPTLQRWLQLDVTCVYNLFRGRHT GTYVLSGKMEACWLLVTTAYAWSQGHQNFRLAVRHWHPHQCQE >gi568815595r:48757713_48998794|GENSCAN_predicted_CDS_5|492_bp nnggtcatctacccctcctacagcctgtgcgaggccatgcaggccagctgcgagcccatc atggcctgctacagctacccctggccagccatcctgcattgtggacgcttccccttgggc catgatctctgtatcactgctgtggccaacaacggctcccgcctggaccgcccatcagtg aaggtcaggttctccaggaccaacagtacttcctctggcccctgggactgcaacctgtac tccagactagaggtgctgaagacaggaccactgccccctgcagagctggagcccaccctg caacgctggctgcagctggatgtcacatgtgtgtacaatctttttcgaggaaggcacaca gggacctatgtgttgagtgggaagatggaggcctgctggctgttggtaaccacggcctat gcctggagccagggccaccaaaattttcgactggctgtgcgccattggcaccctcaccag tgccaagaatag