GENSCAN 1.0    Date run: 4-Nov-116    Time: 23:33:48

Sequence gi568815597r:202371866_202572117 : 200252 bp : 38.79% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  18087  18219  133  2  1   54   69   133 0.778   8.35
 1.02 Term +  19700  20442  743  0  2   21   44   240 0.458   5.53
 1.03 PlyA +  22035  22040    6                               1.05

 2.00 Prom +  39208  39247   40                              -6.35
 2.01 Init +  42337  42372   36  0  0   72  101    25 0.818   2.26
 2.02 Intr +  44922  45052  131  2  2   76   70   154 0.926  10.97
 2.03 Intr +  50755  50873  119  1  2  100   86   169 0.990  17.09
 2.04 Intr +  53701  53860  160  2  1   61   72   206 0.995  14.42
 2.05 Intr +  55175  55319  145  2  1   83   53   148 0.982  10.16
 2.06 Intr +  56990  57064   75  1  0   96   89   104 0.984  10.09
 2.07 Intr +  58866  58945   80  2  2  110   63    56 0.973   2.73
 2.08 Intr +  59615  59754  140  2  2   35   95   241 0.970  18.69
 2.09 Intr +  62791  62903  113  2  2   72   96    53 0.994   3.68
 2.10 Intr +  65956  66159  204  0  0   70   56   193 0.630  12.77
 2.11 Intr +  66598  66871  274  0  1  -13   35   377 0.556  18.49
 2.12 Intr +  67014  67234  221  1  2   48  -23   293 0.205  11.30
 2.13 Intr +  67338  67682  345  2  0   17  -12   330 0.154  10.56
 2.14 Intr +  68841  68923   83  2  2   90   89    80 0.892   5.82
 2.15 Intr +  70582  70707  126  1  0   68   76    97 0.940   5.37
 2.16 Intr +  73181  73363  183  2  0   62  110    90 0.901   6.58
 2.17 Intr +  77124  77306  183  0  0   84   87   188 0.741  16.28
 2.18 Intr + 116668 116758   91  1  1  117   89   162 0.995  18.28
 2.19 Intr + 121249 121452  204  0  0   69   83   199 0.982  15.97
 2.20 Intr + 123428 123617  190  1  1   43   80    74 0.889   0.34
 2.21 Intr + 123705 123817  113  1  2   56   95   194 0.877  16.08
 2.22 Intr + 124916 124957   42  1  0  127  106    70 0.892  10.22
 2.23 Intr + 158345 158449  105  1  0   85  103    21 0.251   2.79
 2.24 Term + 159975 160076  102  2  0   67   39    71 0.237  -2.60
 2.25 PlyA + 160108 160113    6                               1.05

 3.00 Prom + 163995 164034   40                              -6.35
 3.01 Init + 168275 168421  147  1  0   89   75   103 0.342   9.24
 3.02 Intr + 192578 192682  105  1  0  106  110   111 0.940  14.59
 3.03 Intr + 195913 195966   54  0  0   48   67    85 0.400   0.66
 3.04 Intr + 197282 197332   51  1  0   65   86   111 0.275   6.79

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  30242  30189   54  2  0   62  115    65 0.895   7.93
S.002 Init + 189569 189586   18  1  0   83  103    12 0.899   2.53

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:202371866_202572117|GENSCAN_predicted_peptide_1|291_aa
MELQGTQNSQDNPEKKNKVGRLTLPDFKTYYKAMVTKTVWYQHKGVLKINELSSGVLKIN
ELSSGVLKINELSSGVLKINELSSGVLKINELSSGVLKINELSSGVLKINELSSGVLKIN
ELSSGVLKINELSSGVLKINELSSGVLKINELSSGVLKINELSSGVLKINELSSGVLKIN
ELSSGVLKINELSSGVLKINELSSGVLKINELSSGVLKINELSSGVLKINELSSGVLKIN
ENMCPYNTYTKMLMAALFIITKGEHNPNVHQLMDNQNVYPYNGILFSHKKE

>gi568815597r:202371866_202572117|GENSCAN_predicted_CDS_1|876_bp
atggaattgcaagggacccagaatagccaagacaatcctgaaaagaagaacaaagtagga
agactcacacttcctgattttaaaacttactacaaagcaatggtaaccaagacagtgtgg
tatcagcacaaaggtgtattgaagataaatgaattatcttcaggtgtactgaagataaat
gaattatcttcaggtgtactgaagataaatgaattatcttcaggtgtactgaagataaat
gaattatcttcaggtgtactgaagataaatgaattatcttcaggtgtactgaagataaat
gaattatcttcaggtgtactgaagataaatgaattatcttcaggtgtactgaagataaat
gaattatcttcaggtgtactgaagataaatgaattatcttcaggtgtactgaagataaat
gaattatcttcaggtgtactgaagataaatgaattatcttcaggtgtactgaagataaat
gaattatcttcaggtgtactgaagataaatgaattatcttcaggtgtactgaagataaat
gaattatcttcaggtgtactgaagataaatgaattatcttcaggtgtactgaagataaat
gaattatcttcaggtgtactgaagataaatgaattatcttcaggtgtactgaagataaat
gaattatcttcaggtgtactgaagataaatgaattatcttcaggtgtactgaagataaat
gaaaacatgtgtccatacaacacttacacaaaaatgttaatggcagcattattcataata
accaaaggtgaacacaacccaaatgttcatcagcttatggataatcaaaatgtgtatcca
tacaatggaatattatttagccataaaaaggaatga

>gi568815597r:202371866_202572117|GENSCAN_predicted_peptide_2|1154_aa
MSHHTRPDSFLNACIDENLDMVKFLVENRANVNQQDNEGWTPLHAAASCGYLNIAEYFIN
HGASVGIVNSEGEVPSDLAEEPAMKDLLLEQVKKQGVDLEQSRKEEEQQMLQDARQWLNS
GKIEDVRQARSGATALHVAAAKGYSEVLRLLIQAGYELNVQDYDGWTPLHAAAHWGVKEA
CSILAEALCDMDIRNKLGQTPFDVADEGLVEHLELLQKKQNVLRSEKETRNKLIESDLNS
KIQSGFFKNKEKMLYEEETPKSQEMEEENKESSSSSSEEEEGEDEASESETEKEADKKPE
AFVNHSNSESKSSITEQIPAPAQNTFSASSARRFSSGLFNKPEEPKDESPSSWRLGLRKT
GSHNMLSEVANSREPIRDRGSSIYRSSSSPRISALLDNKDKDFKVLDGVEDAEGEEEDDL
KELPPLGMGQPPVEEAEQPGTLALEFLAAMEPEPAPSLAPEAAVEEEDACPRAAGFKPPS
QGPGGYRAFCRRRQEPIHPPHAALCLEVTPKTAVDWPDLEMLTGQERMALANRKQECGSA
HYQWADFILAANSYDLAIKTITSSAKPCAPAASLEHHPDNIKAVLHRGKILAQQGEYSEA
ISILRAALKLEPSNKTIHAELLKLVKKHAAQRSTETALYWKMLGDPSWLPAKCPGKGAWS
SPWKWLFGATAVALGGVALSVERENKSYISSLAPRKLNSTSDIEEKENRESAVNLVRSGS
YTRQLWRDEAKGNEIPQTIAPSTYVSTYLKSASFGRSSDPTSPYISANRNSSPATSPITI
GSSTSRGSQWQPASSCPAPISANTTASVHHGRTPHKSQADTTAEKTADNVSSSTPLCVIT
NRPLPSTANGVTATPVLSITGTDSSVEAREKRRSYLTPVRDEEAESLRKARSRQARQTRR
STQGVTLTDLQEAERTFSRSRAERQAQEQPREKPTDTEGLEGSPEKHEPSAVPATEAGEG
QQPWGRSLDEEPICHRLRCPAQPDKPTTPASPSTSRPSLYTSSHLLWTNRFSVPDSESSE
TTTNTTTAKEMDKNENEEADLDEQSSKRLSIRERRRPKERRRGTGINFWTKDEDETDGSE
EVKETWNSKHSASLSLAFEMPVAPSNHCDNQCPHIFSKALVDLVVLLLEKQSDLDPKKGF
LDLTQERIQGKSIQ

>gi568815597r:202371866_202572117|GENSCAN_predicted_CDS_2|3465_bp
atgagccaccatacccgacctgatagtttcttaaatgcatgtattgatgaaaatttggac
atggtgaagtttctggtggagaacagagccaatgtaaaccagcaagacaacgagggctgg
acaccccttcatgcagcagcttcctgtggctatctcaacatagcagagtatttcattaat
cacggagccagtgtaggtattgtcaatagtgaaggtgaagttccctctgaccttgcagaa
gagccagccatgaaggatcttcttctggagcaagtaaagaagcaaggagttgatctagag
cagtcaagaaaagaagaagagcagcagatgttgcaggatgcccgccagtggctcaacagt
gggaaaatagaggatgtgaggcaggctcgctcaggggctacagcccttcatgtggctgct
gccaagggctactctgaagtcctcagacttttaattcaggctggctatgaactcaatgtt
caggattatgatggctggactcccctccatgctgctgcacactggggagtgaaggaggct
tgctccatcctggcagaagcactttgtgacatggatattcgaaataaactgggccagaca
ccatttgatgtggctgatgagggtctcgtggagcatttggagttgctccagaagaagcag
aatgtgcttcgaagtgaaaaggagacacggaataaactcattgagtcagatctgaacagc
aagattcagagtgggttctttaagaacaaagagaagatgctctatgaggaggagacacct
aagtcccaagaaatggaggaagaaaataaagaatctagtagctccagctcagaggaggag
gaaggtgaagatgaagcttctgagtcagaaactgagaaggaggcagataaaaagccagaa
gcctttgtcaatcattccaactctgaaagcaagagtagtatcacagagcagataccagca
ccagctcaaaataccttctctgcctcttctgctaggaggttctcttctggcctttttaac
aagccagaagagcccaaagatgaatctccttcttcatggagattgggactgagaaaaact
ggcagccacaacatgctgagtgaggtggccaattccagggaacctataagggaccgaggc
tcttccatctatcgctcctcttcaagccctcggatttctgctctactggacaacaaagat
aaggacttcaaggtgctggatggggtggaggatgcagagggcgaggaggaagatgacctg
aaagagctgcccccgctgggcatgggacagcccccggtggaagaggctgaacagcctggg
acattggccctagagttcctcgctgccatggagcctgagcccgccccgtccctggccccg
gaagcagctgttgaggaagaagacgcttgtcccagggccgccgggttcaagccgcccagt
caagggccaggtggttaccgtgcattttgcagacgtcggcaggagcccatacatccaccg
cacgcggccctgtgcctggaggtgaccccgaagacggccgtggactggcctgacctggag
atgctcacagggcaggagcgcatggccctggccaaccggaagcaggagtgcggcagtgcc
cactaccagtgggcggacttcatcctggccgccaactcctatgaccttgccatcaagacc
atcacctccagcgccaaaccctgtgctcctgcagcctcgttggagcaccatccagacaat
atcaaggctgtcctccacagaggcaagatactggcccagcagggtgagtacagtgaggcc
atctccatcctgagggcagccctgaagctggaaccttccaacaagacgatccacgcagaa
ctcttgaagctggtgaagaagcacgcggcacagcggagcacggagaccgccctgtactgg
aaaatgctgggcgaccccagctggctgcccgccaagtgccccggcaagggtgcctggtcc
agcccatggaagtggctgtttggggcgactgctgttgccttggggggtgtggcgctctct
gtggagagagaaaacaaaagctatattagttcactagcaccccggaagctcaacagcaca
agtgatattgaagaaaaggagaacagagaatcagctgttaatctagtgaggagtggctcc
tatacccggcagctatggagggatgaagcaaaaggaaatgaaatcccacagacaattgct
ccctccacctatgtatcaacttacttgaaaagtgcttcatttggtagaagtagtgacccc
acaagtccctacatttcagccaatcgcaattcatctcctgctacctcacccattaccatt
ggttcatctacctctcggggcagccagtggcaacctgcctcttcttgccctgcaccaatc
agtgcaaacactactgcatctgtacatcatggcaggactcctcacaaatcccaggccgac
acaacagcagagaaaacagcagacaatgtctcttctagcaccccgctctgtgtgatcacc
aatcgccctcttcctagcactgccaatggggttacagctactcctgtgctctccattact
ggaacagattcctctgtggaagccagggagaagaggaggtcctatctgactcctgtacgg
gatgaggaagcagagtctttacggaaagcacgctccagacaagctcggcagacacgaagg
tctactcaaggtgtcaccctaacagaccttcaagaagcagaaaggacattcagccggtcg
agggcagagaggcaagctcaggagcagcctcgtgagaagcccacagacactgaagggctt
gaggggagccctgagaagcatgagccctcagcagttccagcaacagaagctggggagggc
cagcagccctggggcaggagtctggatgaagagcctatctgtcatcgcctgaggtgccca
gctcagccagacaaacccaccacgccagcatctccttctacgtcaagaccctcactctac
accagttcccacctgctatggacaaatagattttcagtccctgattctgagagttcagag
actaccacaaacactacaactgcaaaggaaatggacaaaaatgagaatgaagaagcagat
ttggatgagcagtcctctaagaggctgtccatccgagagaggaggcggcccaaggaacga
cgaagaggcacaggcatcaatttctggacaaaggatgaggatgaaactgatggctctgaa
gaggtcaaagaaacgtggaatagcaagcattcagcatccctgagcctggcatttgaaatg
ccagtagcaccctccaatcattgtgacaatcagtgtccccacatattttcaaaagctctt
gtggaccttgtggttttgttactagaaaagcagtccgatctggaccccaagaaagggttc
ttagatctcacgcaagaaagaattcagggcaagtccatacagtaa

>gi568815597r:202371866_202572117|GENSCAN_predicted_peptide_3|119_aa
MSSLYTRSKEFTRNRKSQSDSPPASPSPTAKTLRVSSIFYLRMFIAVSKLYESALTENQK
LKTKLQEAQLELADIKSKLEKVAQQKQEKTSDRSSVLEMEKRERRALERKMSEMEEEMK

>gi568815597r:202371866_202572117|GENSCAN_predicted_CDS_3|357_bp
atgtcctctttatatacccgaagtaaagaattcactcggaataggaaatctcagtctgat
tctcccccagcatctccctccccgactgccaagacgctccgagtaagcagcattttttat
ttacgtatgttcatagctgtgtcaaagctctatgagagtgctctgactgaaaaccaaaaa
ctgaaaacaaaacttcaggaagcccagctagagctagcagatataaagtccaagcttgag
aaggtggcccagcagaaacaagaaaagacctctgaccgatcatcagtgctggagatggag
aaacgggagaggcgagccttggagcgcaaaatgtcagaaatggaggaagaaatgaag