GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:28:43 Sequence gi568815586r:53611947_53825242 : 213296 bp : 45.74% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 1581 1576 6 1.05 1.01 Sngl - 21209 20832 378 2 0 79 44 340 0.999 24.86 1.00 Prom - 40693 40654 40 -3.26 2.08 PlyA - 41251 41246 6 1.05 2.07 Term - 53482 53368 115 0 1 95 49 143 0.991 9.14 2.06 Intr - 57395 57202 194 2 2 96 110 97 0.808 10.99 2.05 Intr - 58002 57925 78 0 0 99 67 23 0.527 1.05 2.04 Intr - 60685 60630 56 2 2 74 97 42 0.184 2.40 2.03 Intr - 64295 64183 113 1 2 44 14 120 0.115 0.22 2.02 Intr - 86393 86087 307 0 1 27 -82 629 0.129 35.31 2.01 Init - 90493 90406 88 1 1 88 92 17 0.598 2.99 2.00 Prom - 98144 98105 40 -6.66 3.15 PlyA - 99236 99231 6 1.05 3.14 Term - 100175 99998 178 1 1 103 39 130 0.971 6.76 3.13 Intr - 101260 101154 107 1 2 59 109 123 0.915 10.51 3.12 Intr - 101954 101755 200 0 2 109 97 88 0.883 10.87 3.11 Intr - 102747 102615 133 0 1 76 94 183 0.637 17.92 3.10 Intr - 103379 103254 126 2 0 87 81 148 0.999 14.88 3.09 Intr - 104101 103847 255 1 0 30 65 212 0.938 10.74 3.08 Intr - 104469 104314 156 0 0 101 89 169 0.998 18.51 3.07 Intr - 107883 107793 91 2 1 67 94 110 0.999 9.50 3.06 Intr - 109669 109521 149 1 2 24 94 229 0.995 16.13 3.05 Intr - 110237 110079 159 2 0 95 81 213 0.998 21.48 3.04 Intr - 111837 111647 191 1 2 72 94 263 0.985 24.60 3.03 Intr - 112801 112699 103 1 1 96 82 51 0.706 5.05 3.02 Intr - 113320 113141 180 1 0 72 65 154 0.837 11.56 3.01 Init - 113525 113400 126 2 0 83 68 62 0.495 3.86 3.00 Prom - 116057 116018 40 -5.86 4.05 PlyA - 116464 116459 6 1.05 4.04 Term - 127962 127756 207 0 0 118 48 155 0.939 11.84 4.03 Intr - 128586 128542 45 0 0 87 88 52 0.793 3.71 4.02 Intr - 131898 131802 97 2 1 103 11 115 0.710 5.31 4.01 Init - 132713 132640 74 2 2 43 53 73 0.666 -0.16 4.00 Prom - 133546 133507 40 -4.06 5.00 Prom + 138041 138080 40 -4.46 5.01 Init + 138239 138394 156 1 0 44 99 28 0.280 -0.59 5.02 Intr + 144620 144679 60 1 0 94 92 63 0.584 6.23 5.03 Term + 184247 184339 93 1 0 114 48 103 0.923 6.73 5.04 PlyA + 184354 184359 6 1.05 6.03 PlyA - 185087 185082 6 1.05 6.02 Term - 187204 187112 93 1 0 77 44 119 0.623 4.23 6.01 Init - 207894 207832 63 0 0 81 116 9 0.611 4.15 6.00 Prom - 211880 211841 40 -1.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:53611947_53825242|GENSCAN_predicted_peptide_1|125_aa MARKEGWREEKGLFCHQQGGDREYTVNLHRCIHGVGFKKRSPWACKEIWKFAMKEMGTPD VRIDTRLNKAVWAKGIRNVQDLIRMRLSRKHNEDEDSPNKLYTLVTNVPVTAFKNLQAMW MRTNC >gi568815586r:53611947_53825242|GENSCAN_predicted_CDS_1|378_bp atggcccgcaaagaagggtggcgagaagaaaaagggctgttctgtcatcaacaaggtggt gaccgagaatacaccgtcaaccttcacaggtgcatccacggagtgggcttcaagaagcgt tccccttgggcatgcaaagagatctggaaatttgccatgaaggagatgggaactccagat gtgcgcattgatactagactcaacaaagctgtctgggccaaaggaataaggaatgtccaa gaccttatccgtatgcggttgtccagaaaacataatgaggatgaagattcaccaaataag ctctatactttggttaccaatgtacccgttaccgctttcaaaaatctacaggcaatgtgg atgagaactaactgctga >gi568815586r:53611947_53825242|GENSCAN_predicted_peptide_2|316_aa MRELQGLEGPKILRQMLTWTLRQERMVKTDWATERDSISKKKKEEEEKRKKKKEKKKKMR RRRRKKKKEKKKKKKKKKKKKKKKKKKKKKKKEEEQEQEQEQEEQEEQEEEQEEDNRDED RGLQPSSGIWRQWSNRPDTATALAGGAVMPELILYVAITLSVAERLVGPAPHPLKMFACS KFVSTPSLVKSTSQLLSRPLSAVVLKRPEILTDESLSSLAVSCPLTSLVSSRSFQTSAIS RDIDTAAKFIGAGAATVGVAGSGAGIGTVFGSLIIGYARNPSLKQQLFSYAILGFALSEA MGLFCLMVAFLILFAM >gi568815586r:53611947_53825242|GENSCAN_predicted_CDS_2|951_bp atgagggagctacagggccttgagggtccgaagatcctgagacagatgcttacatggacg ttgagacaggaaagaatggtcaagacagactgggcgacagagcgagactccatctcaaaa aaaaagaaggaggaggaggagaagaggaagaagaagaaggagaagaagaagaagatgagg aggaggaggaggaagaagaagaaggagaagaagaagaagaagaagaagaagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaaggaagaagaacaagaacaagaacaa gaacaagaagaacaagaagaacaagaagaagaacaagaagaagacaacagagatgaggac aggggactccaaccatcctcgggcatctggaggcagtggagcaaccggccggataccgcc acagccctggcaggcggcgctgtgatgcctgagctgatcctgtatgttgcaatcactcta tccgtggctgagcgactcgttggcccggctcctcaccccctgaaaatgttcgcctgctcc aagtttgtctccactccctccttggtcaagagcacctcacagctgctgagccgtccgcta tctgcagtggtgctgaaacgaccggagatactgacagatgagagcctcagcagcttggca gtctcatgtccccttacctcacttgtctctagccgcagcttccaaaccagcgccatttca agggacatcgacacagcagccaagttcattggagctggggctgccacagttggggtggct ggttctggggctgggattggaactgtgtttgggagcctcatcattggttatgccaggaac ccttctctgaagcaacagctcttctcctacgccattctgggctttgccctctcggaggcc atggggctcttttgtctgatggtagcctttctcatcctctttgccatgtga >gi568815586r:53611947_53825242|GENSCAN_predicted_peptide_3|717_aa MGTDLEGQTYVLETKSDTSKMSRNPQGLETRYALSPGVVAGAEDRYLKARMEESPLSRAP SRGGVNFLNVARTYIPNTKVECHYTLPPGTMPSASDWIGIFKVEAACVRDYHTFVWSSVP ESTTDGSPIHTSVQFQASYLPKPGAQLYQFRYVNRQGQVCGQSPPFQFREPRPMDELVTL EEADGGSDILLVVPKATVLQNQLDESQQERNDLMQLKLQLEGQVTELRSRVQELERALAT ARQEHTELMEQYKGISRSHGEITEERDILSRQQGDHVARILELEDDIQTISEKVLTKEVE LDRLRDTVKALTREQEKLLGQLKEVQADKEQSEAELQVAQQENHHLNLDLKEAKSWQEEQ SAQAQRLKDKVAQMKDTLGQAQQRVAELEPLKEQLRGAQELAASSQQKATLLGEELASAA AARDRTIAELHRSRLEVAEVNGRLAELGLHLKEEKCQWSKERAGLLQSVEAEKDKILKLS AEILRLEKAVQEERTQNQVFKTELAREKDSSLVQLSESKRELTELRSALRVLQKEKEQLQ EEKQVSTHNPGPVDATGCPAALTDSEDESPEDMRLPPYGLCERGDPGSSPAGPREASPLV VISQPAPISPHLSGPAEDSSSDSEAEDEKSVLMAAVQSGGEEANLLLPELGSAFYDMASG FTVGTLSETSTGGPATPTWKECPICKERFPAESDKDALEDHMDGHFFFSTQDPFTFE >gi568815586r:53611947_53825242|GENSCAN_predicted_CDS_3|2154_bp atggggacggacttggaaggccagacctacgtcctagagactaagagtgatacctcaaaa atgtcccgtaaccctcagggactagaaaccaggtatgccttatctccaggtgtagtggca ggggctgaggacagatatctcaaggccaggatggaagaatcaccactaagccgggcacca tcccgtggtggagtcaactttctcaatgtagcccggacctacatccccaacaccaaggtg gaatgtcactacacccttcccccaggcaccatgcccagtgccagtgactggattggcatc ttcaaggtggaggctgcctgtgttcgggattaccacacatttgtgtggtcttccgtgcct gaaagtacaactgatggttcccccattcacaccagtgtccagttccaagccagctacctg cccaaaccaggagctcagctctaccagttccgatatgtgaaccgccagggccaggtgtgt gggcagagcccccctttccagttccgagagccaaggcccatggatgaactggtgaccctg gaggaggctgatgggggctctgacatcctgctggttgtccccaaggcaactgtgttacag aaccagctcgatgagagccagcaagaacggaatgacctgatgcagctgaagctacagctg gagggacaggtgacagagctgaggagccgagtgcaggagctcgagagggctctggcaact gccaggcaggagcacacggagctgatggaacagtacaaggggatttcccggtcccatggg gagatcacagaagagagggacatcctgagccggcaacagggagaccatgtggcacgcatc ctggagctagaggatgacatccagaccatcagtgagaaagtgctgacgaaggaagtggag ctggacaggcttagagacacagtgaaggccctgactcgggaacaagagaagctccttggg caactgaaagaagtacaagcagacaaggagcaaagtgaggctgagctccaagtggcacaa caggagaaccatcacttaaatttggacctgaaggaggcgaagagctggcaagaggagcag agtgctcaggctcagcgactgaaagacaaggtggcccagatgaaggacaccctaggccag gcccagcagcgggtggccgagctggagcccttgaaggagcagcttcgaggggcccaggag cttgcagcctcaagccagcagaaagccacccttcttggggaggagttggccagtgcagca gcagccagggaccgcaccatagccgaactacaccgcagccgcctggaagtggctgaagtt aacggcaggctggctgagctcggtttgcacttgaaggaagaaaaatgccaatggagcaag gagcgggcagggctgctgcagagtgtggaggcagagaaggacaagatcctgaagctgagt gcagagatacttcgattggagaaggcagttcaggaggagaggacccaaaaccaagtgttc aagactgagctggcccgggagaaggattctagcctggtacagttgtcagaaagtaagcgg gagctgacagagctgcggtcagccctgcgtgtgctccagaaggaaaaggagcagttacag gaggagaaacaggtgagcacccataacccaggccccgtggatgccacaggctgcccggca gctctgacagactcagaggacgagtccccagaagacatgaggctcccaccctatggcctt tgtgagcgtggagacccaggctcctctcctgctgggcctcgagaggcttctccccttgtt gtcatcagccagccggctcccatttctcctcacctctctgggccagctgaggacagtagc tctgactcggaggctgaagatgagaagtcagtcctgatggcagctgtgcagagtgggggt gaggaggccaacttactgcttcctgaactgggcagtgccttctatgacatggccagtggc tttacagtgggtaccctgtcagaaaccagcactgggggccctgccacccccacatggaag gagtgtcctatctgtaaggagcgctttcctgctgagagtgacaaggatgccctggaggac cacatggatggacacttctttttcagcacccaggaccccttcacctttgagtga >gi568815586r:53611947_53825242|GENSCAN_predicted_peptide_4|140_aa MVGTQKFRFPPVPMNPGAQVERILQRSHFQRRDPNPAVLRICHKVAAKASIDPELDLISG PLVQITEMEPAKVAFSVITFNAAGHGPNVRDGVLGPAHQMDDEVLIIPQGAVPRKVVINF SELHQFRDILYGQTVRETVI >gi568815586r:53611947_53825242|GENSCAN_predicted_CDS_4|423_bp atggttggaactcagaagttccgctttcccccagtacctatgaatcctggagcccaagtg gagagaattctccaaagatcccactttcagcgcagggaccccaatcctgcagttctccgt atctgccacaaggtggcagccaaagcatcgattgatccagagctagacctgatctctgga cctctggtccagatcacggaaatggagcctgcaaaggtcgcgttttccgtaataaccttt aatgcggctggccacggcccaaatgttcgcgatggagtgctgggccccgcgcatcagatg gacgatgaagtgttaatcatcccccagggagccgtcccgcggaaagtcgtcattaatttc tctgaattacaccaattccgagacattctttatggacagacagtgcgggagacagttata tga >gi568815586r:53611947_53825242|GENSCAN_predicted_peptide_5|102_aa MTRANSESKPVSSSGKAARSGGVREVCKEKPNRACNRLGEEQEYLWLMVLRQVSDEEMEA LERAGNLLKVMYLLQGARELGVNDNLEKQPQPEDLRGSKEDG >gi568815586r:53611947_53825242|GENSCAN_predicted_CDS_5|309_bp atgactagggcaaattcagaatccaagccagttagttcatcagggaaggctgctaggagt gggggagttagagaggtgtgcaaagagaaacccaacagggcatgcaatagattaggagaa gagcaggagtacctctggctgatggtgctgaggcaggtttcagatgaggaaatggaggcc ttggagagggcaggtaacctgctcaaggttatgtatctgctgcaaggagctagagagctt ggagtgaatgacaatttggagaaacagccgcagcctgaagatctaagaggctctaaagaa gatggataa >gi568815586r:53611947_53825242|GENSCAN_predicted_peptide_6|51_aa MALSKFVVLKAREKRTVSVVKPLTDIVKKLGDSDLSNTMAKESPGNSSTST >gi568815586r:53611947_53825242|GENSCAN_predicted_CDS_6|156_bp atggcactgtccaaatttgtggttttaaaagcaagggaaaagaggacagtctctgtggtt aagcccctcacagatattgtgaaaaagcttggggacagcgacctgagcaacacaatggcc aaggaaagcccaggaaattctagcacctccacatga