GENSCAN 1.0 Date run: 6-Nov-116 Time: 13:04:23 Sequence gi568815578f:44514557_44718361 : 203805 bp : 46.86% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1103 1098 6 1.05 1.02 Term - 11722 11661 62 2 2 112 42 20 0.345 -2.23 1.01 Init - 17644 17131 514 1 1 53 51 284 0.800 16.27 1.00 Prom - 22867 22828 40 -2.06 2.00 Prom + 34471 34510 40 -2.16 2.01 Init + 55610 55625 16 1 1 83 79 5 0.454 -0.53 2.02 Term + 62360 62421 62 0 2 104 49 88 0.868 4.47 2.03 PlyA + 64135 64140 6 1.05 3.03 PlyA - 66111 66106 6 1.05 3.02 Term - 76284 76034 251 1 2 68 48 185 0.184 8.37 3.01 Init - 95879 95792 88 2 1 61 94 72 0.948 5.90 3.00 Prom - 97416 97377 40 -7.76 4.00 Prom + 97492 97531 40 -3.46 4.01 Init + 100004 100151 148 1 1 96 66 226 0.631 21.45 4.02 Term + 101963 102078 116 0 2 112 44 18 0.407 -1.47 4.03 PlyA + 102290 102295 6 1.05 5.14 PlyA - 104070 104065 6 1.05 5.13 Term - 104723 104710 14 0 2 80 42 17 0.504 -5.34 5.12 Intr - 105845 105743 103 2 1 81 96 108 0.999 10.65 5.11 Intr - 106591 106462 130 0 1 123 105 227 0.999 28.60 5.10 Intr - 108096 108032 65 0 2 72 97 47 0.998 1.52 5.09 Intr - 108374 108273 102 2 0 105 99 86 0.999 11.77 5.08 Intr - 108522 108451 72 0 0 110 78 67 0.959 7.50 5.07 Intr - 109773 109646 128 1 2 55 94 189 0.986 16.60 5.06 Intr - 111128 111013 116 1 2 95 81 171 0.999 17.19 5.05 Intr - 112043 111900 144 1 0 102 61 202 0.998 18.30 5.04 Intr - 114613 114491 123 0 0 99 77 180 0.998 17.70 5.03 Intr - 120329 120186 144 1 0 33 119 35 0.162 0.50 5.02 Intr - 121732 121671 62 1 2 152 111 39 0.326 10.13 5.01 Init - 137051 137019 33 2 0 120 61 75 0.372 7.87 5.00 Prom - 146997 146958 40 -4.36 6.00 Prom + 147119 147158 40 -7.36 6.01 Init + 148967 149000 34 1 1 83 99 51 0.599 5.73 6.02 Intr + 157464 157598 135 1 0 100 80 24 0.455 3.34 6.03 Intr + 161245 161358 114 2 0 76 30 108 0.581 4.22 6.04 Intr + 165471 165524 54 1 0 82 93 32 0.262 2.05 6.05 Term + 175373 175542 170 0 2 91 47 94 0.348 3.64 6.06 PlyA + 176806 176811 6 1.05 7.04 PlyA - 177466 177461 6 1.05 7.03 Term - 187474 187453 22 0 1 118 43 31 0.124 -0.62 7.02 Intr - 196777 196697 81 0 0 84 72 58 0.079 2.55 7.01 Init - 197282 197239 44 2 2 81 64 47 0.164 1.49 7.00 Prom - 198076 198037 40 -3.26 8.03 PlyA - 198128 198123 6 1.05 8.02 Term - 201990 201811 180 0 0 107 48 157 0.997 11.21 8.01 Intr - 202102 202069 34 0 1 81 105 20 0.723 1.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 131524 131603 80 1 2 83 53 49 0.938 0.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:44514557_44718361|GENSCAN_predicted_peptide_1|191_aa MATNIINSRKRTLPERGAGSKGVGGAVPGAAEGQRSGRSLPPRDPRRLSPSPPATCRRQA RSVGAATGRPAPAHLRSRRLSVPGPPVRSASPGQPSARSPQPAAEPVALATWAAELPRHS RYGGSDGTRTVRVPFRSRGGGRGGTKWESRAELRRRDPGWNLARGRYGKGDSIFMACFLG SLMSVLQCQAL >gi568815578f:44514557_44718361|GENSCAN_predicted_CDS_1|576_bp atggctacaaatataataaacagccgaaaacggactctccctgaacgcggggcggggtca aagggcgtcgggggcgccgtccccggcgcggctgagggacaaagatcgggccgcagcctc cctccccgggatccccggcggctcagcccctcgccccctgcgacgtgtcgacgccaggcc cggagcgttggggccgcaaccggccgcccggctcctgctcacctgcggtctcgccgcctc tccgtgcctgggccgccggtccgcagcgcctccccggggcagcctagcgcccgcagcccg caacccgcagcggagcccgttgccttggcgacctgggctgccgaactcccgcggcactcg cgctacggcggctcggatgggaccaggacggttcgcgtccccttccgcagccgcggaggg ggcagaggagggacgaagtgggagtcgagggctgagctgcgaaggagggatccgggttgg aacttggcccggggaagatacggaaagggggacagtatctttatggcttgcttcctcggt tccctcatgtctgtgctccagtgtcaagccctctaa >gi568815578f:44514557_44718361|GENSCAN_predicted_peptide_2|25_aa MDCEPDTPNQQSKGDHKVDPQDIEP >gi568815578f:44514557_44718361|GENSCAN_predicted_CDS_2|78_bp atggactgtgagccagacacaccaaatcagcagtccaaaggggaccacaaagtggatcct caggacatcgaaccctga >gi568815578f:44514557_44718361|GENSCAN_predicted_peptide_3|112_aa MEGKAHIPPQHLAQKKHLKNELGALKDQTAFSPHTILSLWTDAQWIRQALQSDALIPRLF SLFVKIPAMEKVQVGVGQSRIAFNSDTLTTTSMDADGCLVGEKISVDPVLPT >gi568815578f:44514557_44718361|GENSCAN_predicted_CDS_3|339_bp atggagggcaaggcccacatccccccacaacacctagctcagaagaaacacctaaagaat gaattaggggccctcaaagatcagacagctttcagtcctcacaccatccttagcttatgg acagatgcacagtggattcggcaagctcttcagagcgatgcgctcatccccaggctcttt agcctctttgtgaaaattccagctatggagaaggtccaggtgggtgtaggccaatctaga atagccttcaattctgataccttaacaaccacatccatggatgctgatggatgcctggtg ggtgaaaagatctctgtggacccagtacttccaacttga >gi568815578f:44514557_44718361|GENSCAN_predicted_peptide_4|87_aa MEVESSYSDFISCDRTGRRNAVPDIQGDSEAVSVRKLAGDMGELALEGAAVVAAGPCAQA LLTPVFPDSDSLKYGSSGYMTFTCQGD >gi568815578f:44514557_44718361|GENSCAN_predicted_CDS_4|264_bp atggaggtcgagtcctcctactcggacttcatctcctgtgaccggacaggccgtcggaat gcggtccctgacatccagggagactcagaggctgtgagcgtgaggaagctggctggagac atgggcgagctggcactcgagggggcagcagttgtagctgctggcccatgtgcccaggct ctcctaacacctgttttccctgactcagattccctaaagtatgggtcctctggctacatg acattcacctgccagggtgactga >gi568815578f:44514557_44718361|GENSCAN_predicted_peptide_5|411_aa MAQTPAFDKPKVELHVHLDGSIKPETILYYGSSGPKFEHFCSISQNPVIQRPALMERMRD PGGLPPTLGSISTNSLRDFWRRGIALPANTAEGLLNVIGMDKPLTLPDFLAKFDYYMPAI AGCREAIKRIAYEFVEMKAKEGVVYVEVRYSPHLLANSKVEPIPWNQAEGDLTPDEVVAL VGQGLQEGERDFGVKARSILCCMRHQPNWSPKVVELCKKYQQQTVVAIDLAGDETIPGSS LLPGHVQAYQEAVKSGIHRTVHAGEVGSAEVVKEAVDILKTERLGHGYHTLEDQALYNRL RQENMHFEICPWSSYLTGAWKPDTEHAVIRLKNDQANYSLNTDDPLIFKSTLDTDYQMTK RDMGFTEEEFKRLNINAAKSSFLPEDEKRELLDLLYKAYGMPPSASAGFKE >gi568815578f:44514557_44718361|GENSCAN_predicted_CDS_5|1236_bp atggcccagacgcccgccttcgacaagcccaaagtggaactgcatgtccacctagacgga tccatcaagcctgaaaccatcttatactatggcagctcgggacccaaatttgaacatttc tgctccataagccagaatcctgttattcagaggcctgccctcatggagagaatgagggat cccggggggttgcccccaactctcgggagcatctccaccaactccctgagagatttctgg aggagagggatcgccctcccagctaacacagcagaggggctgctgaacgtcattggcatg gacaagccgctcacccttccagacttcctggccaagtttgactactacatgcctgctatc gcgggctgccgggaggctatcaaaaggatcgcctatgagtttgtagagatgaaggccaaa gagggcgtggtgtatgtggaggtgcggtacagtccgcacctgctggccaactccaaagtg gagccaatcccctggaaccaggctgaaggggacctcaccccagacgaggtggtggcccta gtgggccagggcctgcaggagggggagcgagacttcggggtcaaggcccggtccatcctg tgctgcatgcgccaccagcccaactggtcccccaaggtggtggagctgtgtaagaagtac cagcagcagaccgtggtagccattgacctggctggagatgagaccatcccaggaagcagc ctcttgcctggacatgtccaggcctaccaggaggctgtgaagagcggcattcaccgtact gtccacgccggggaggtgggctcggccgaagtagtaaaagaggctgtggacatactcaag acagagcggctgggacacggctaccacaccctggaagaccaggccctttataacaggctg cggcaggaaaacatgcacttcgagatctgcccctggtccagctacctcactggtgcctgg aagccggacacggagcatgcagtcattcggctcaaaaatgaccaggctaactactcgctc aacacagatgacccgctcatcttcaagtccaccctggacactgattaccagatgaccaaa cgggacatgggctttactgaagaggagtttaaaaggctgaacatcaatgcggccaaatct agtttcctcccagaagatgaaaagagggagcttctcgacctgctctataaagcctatggg atgccaccttcagcctctgcaggttttaaggagtaa >gi568815578f:44514557_44718361|GENSCAN_predicted_peptide_6|168_aa MEDGKQAIKSSDERERVFSLAQSHADNRRLHEPDLQEGSRAVPREDPQWNYQADSPGKME RTNGLFKTHLTKLSLQLKKEDSVKDTAQKLNNQAKTLYLSAVSPAADPPLAPGPPGGWSG LSSKKLELDVGFLRAEEGLARDLQKEKCIPPRVKSEGALQQWPSVGHL >gi568815578f:44514557_44718361|GENSCAN_predicted_CDS_6|507_bp atggaggatgggaagcaggccatcaagagttctgatgaacgggaaagagttttttctcta gcccaatctcacgctgataaccgccggcttcatgaacctgacctccaggaaggcagtaga gcagttccccgagaggacccccaatggaactatcaggcagattccccaggaaagatggaa cggactaatggtcttttcaagacacacctcaccaagctcagcctccaacttaaaaaggag gactctgtcaaggatacagcccaaaaactcaacaaccaagcaaaaaccctgtacctatca gcagtcagtcctgcagccgacccccctctggccccagggcctcctggagggtggagtggt cttagcagcaagaagctggagttggatgttggattcctgagggccgaggaaggacttgca agggacctccagaaggagaagtgcatcccccccagggtcaagagtgagggagccctgcag caatggccatctgtgggacatctgtga >gi568815578f:44514557_44718361|GENSCAN_predicted_peptide_7|48_aa MSIAEQGDLDNEEVRRKHKLRSQQVRASNPGSRYRLTGLGPGLDRVLL >gi568815578f:44514557_44718361|GENSCAN_predicted_CDS_7|147_bp atgtccatagcagagcaaggagaccttgacaatgaggaggtgaggaggaagcacaagctc agaagtcaacaggtccgagcttcaaatcccggctcccgctaccgacttacagggctggga ccagggttggaccgcgtgctcctctga >gi568815578f:44514557_44718361|GENSCAN_predicted_peptide_8|71_aa XPCFLGPGKNGPSPKSPIPTTQATHTGIMPYPSPLEEQTAGQDASVVLRLPRHEDLEPEG EKLLPVLLKGA >gi568815578f:44514557_44718361|GENSCAN_predicted_CDS_8|216_bp nntccctgcttcctgggaccagggaagaatggtcccagccccaagtcccctattccaact acccaagccacccacactggcatcatgccctacccatcccctttggaagaacaaacagca ggacaggatgccagcgtggttctcaggcttcccaggcacgaagacttggagcctgaggga gagaagctcctgcccgtgctcctcaaaggagcgtga