GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:33:37 Sequence gi568815578r:44520298_44736285 : 215988 bp : 47.13% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1993 1988 6 1.05 1.02 Term - 5981 5920 62 0 2 112 42 20 0.343 -2.23 1.01 Init - 11903 11390 514 2 1 53 51 284 0.786 16.27 1.00 Prom - 17126 17087 40 -2.06 2.00 Prom + 28730 28769 40 -2.16 2.01 Init + 49869 49884 16 2 1 83 79 5 0.454 -0.53 2.02 Term + 56619 56680 62 1 2 104 49 88 0.868 4.47 2.03 PlyA + 58394 58399 6 1.05 3.03 PlyA - 60370 60365 6 1.05 3.02 Term - 70543 70293 251 2 2 68 48 185 0.184 8.37 3.01 Init - 90138 90051 88 0 1 61 94 72 0.948 5.90 3.00 Prom - 91675 91636 40 -7.76 4.00 Prom + 91751 91790 40 -3.46 4.01 Init + 94263 94410 148 2 1 96 66 226 0.631 21.45 4.02 Term + 96222 96337 116 1 2 112 44 18 0.407 -1.47 4.03 PlyA + 96549 96554 6 1.05 5.14 PlyA - 98329 98324 6 1.05 5.13 Term - 98982 98969 14 1 2 80 42 17 0.504 -5.34 5.12 Intr - 100104 100002 103 0 1 81 96 108 0.999 10.65 5.11 Intr - 100850 100721 130 1 1 123 105 227 0.999 28.60 5.10 Intr - 102355 102291 65 1 2 72 97 47 0.998 1.52 5.09 Intr - 102633 102532 102 0 0 105 99 86 0.999 11.77 5.08 Intr - 102781 102710 72 1 0 110 78 67 0.959 7.50 5.07 Intr - 104032 103905 128 2 2 55 94 189 0.986 16.60 5.06 Intr - 105387 105272 116 2 2 95 81 171 0.999 17.19 5.05 Intr - 106302 106159 144 2 0 102 61 202 0.998 18.30 5.04 Intr - 108872 108750 123 1 0 99 77 180 0.998 17.70 5.03 Intr - 114588 114445 144 2 0 33 119 35 0.162 0.50 5.02 Intr - 115991 115930 62 2 2 152 111 39 0.326 10.13 5.01 Init - 131310 131278 33 0 0 120 61 75 0.372 7.87 5.00 Prom - 141256 141217 40 -4.36 6.00 Prom + 141378 141417 40 -7.36 6.01 Init + 143226 143259 34 2 1 83 99 51 0.599 5.73 6.02 Intr + 151723 151857 135 2 0 100 80 24 0.455 3.34 6.03 Intr + 155504 155617 114 0 0 76 30 108 0.581 4.22 6.04 Intr + 159730 159783 54 2 0 82 93 32 0.262 2.05 6.05 Term + 169632 169801 170 1 2 91 47 94 0.348 3.64 6.06 PlyA + 171065 171070 6 1.05 7.00 Prom + 173705 173744 40 -3.16 7.01 Init + 195094 195153 60 0 0 76 116 78 0.956 8.66 7.02 Intr + 199600 199816 217 0 1 91 58 202 0.940 15.58 7.03 Intr + 204441 204695 255 1 0 82 105 296 0.621 28.12 7.04 Term + 206790 207010 221 1 2 151 43 87 0.961 7.70 7.05 PlyA + 208676 208681 6 1.05 8.02 PlyA - 211819 211814 6 1.05 8.01 Term - 214329 214224 106 2 1 86 48 87 0.279 2.38 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 125783 125862 80 2 2 83 53 49 0.938 0.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:44520298_44736285|GENSCAN_predicted_peptide_1|191_aa MATNIINSRKRTLPERGAGSKGVGGAVPGAAEGQRSGRSLPPRDPRRLSPSPPATCRRQA RSVGAATGRPAPAHLRSRRLSVPGPPVRSASPGQPSARSPQPAAEPVALATWAAELPRHS RYGGSDGTRTVRVPFRSRGGGRGGTKWESRAELRRRDPGWNLARGRYGKGDSIFMACFLG SLMSVLQCQAL >gi568815578r:44520298_44736285|GENSCAN_predicted_CDS_1|576_bp atggctacaaatataataaacagccgaaaacggactctccctgaacgcggggcggggtca aagggcgtcgggggcgccgtccccggcgcggctgagggacaaagatcgggccgcagcctc cctccccgggatccccggcggctcagcccctcgccccctgcgacgtgtcgacgccaggcc cggagcgttggggccgcaaccggccgcccggctcctgctcacctgcggtctcgccgcctc tccgtgcctgggccgccggtccgcagcgcctccccggggcagcctagcgcccgcagcccg caacccgcagcggagcccgttgccttggcgacctgggctgccgaactcccgcggcactcg cgctacggcggctcggatgggaccaggacggttcgcgtccccttccgcagccgcggaggg ggcagaggagggacgaagtgggagtcgagggctgagctgcgaaggagggatccgggttgg aacttggcccggggaagatacggaaagggggacagtatctttatggcttgcttcctcggt tccctcatgtctgtgctccagtgtcaagccctctaa >gi568815578r:44520298_44736285|GENSCAN_predicted_peptide_2|25_aa MDCEPDTPNQQSKGDHKVDPQDIEP >gi568815578r:44520298_44736285|GENSCAN_predicted_CDS_2|78_bp atggactgtgagccagacacaccaaatcagcagtccaaaggggaccacaaagtggatcct caggacatcgaaccctga >gi568815578r:44520298_44736285|GENSCAN_predicted_peptide_3|112_aa MEGKAHIPPQHLAQKKHLKNELGALKDQTAFSPHTILSLWTDAQWIRQALQSDALIPRLF SLFVKIPAMEKVQVGVGQSRIAFNSDTLTTTSMDADGCLVGEKISVDPVLPT >gi568815578r:44520298_44736285|GENSCAN_predicted_CDS_3|339_bp atggagggcaaggcccacatccccccacaacacctagctcagaagaaacacctaaagaat gaattaggggccctcaaagatcagacagctttcagtcctcacaccatccttagcttatgg acagatgcacagtggattcggcaagctcttcagagcgatgcgctcatccccaggctcttt agcctctttgtgaaaattccagctatggagaaggtccaggtgggtgtaggccaatctaga atagccttcaattctgataccttaacaaccacatccatggatgctgatggatgcctggtg ggtgaaaagatctctgtggacccagtacttccaacttga >gi568815578r:44520298_44736285|GENSCAN_predicted_peptide_4|87_aa MEVESSYSDFISCDRTGRRNAVPDIQGDSEAVSVRKLAGDMGELALEGAAVVAAGPCAQA LLTPVFPDSDSLKYGSSGYMTFTCQGD >gi568815578r:44520298_44736285|GENSCAN_predicted_CDS_4|264_bp atggaggtcgagtcctcctactcggacttcatctcctgtgaccggacaggccgtcggaat gcggtccctgacatccagggagactcagaggctgtgagcgtgaggaagctggctggagac atgggcgagctggcactcgagggggcagcagttgtagctgctggcccatgtgcccaggct ctcctaacacctgttttccctgactcagattccctaaagtatgggtcctctggctacatg acattcacctgccagggtgactga >gi568815578r:44520298_44736285|GENSCAN_predicted_peptide_5|411_aa MAQTPAFDKPKVELHVHLDGSIKPETILYYGSSGPKFEHFCSISQNPVIQRPALMERMRD PGGLPPTLGSISTNSLRDFWRRGIALPANTAEGLLNVIGMDKPLTLPDFLAKFDYYMPAI AGCREAIKRIAYEFVEMKAKEGVVYVEVRYSPHLLANSKVEPIPWNQAEGDLTPDEVVAL VGQGLQEGERDFGVKARSILCCMRHQPNWSPKVVELCKKYQQQTVVAIDLAGDETIPGSS LLPGHVQAYQEAVKSGIHRTVHAGEVGSAEVVKEAVDILKTERLGHGYHTLEDQALYNRL RQENMHFEICPWSSYLTGAWKPDTEHAVIRLKNDQANYSLNTDDPLIFKSTLDTDYQMTK RDMGFTEEEFKRLNINAAKSSFLPEDEKRELLDLLYKAYGMPPSASAGFKE >gi568815578r:44520298_44736285|GENSCAN_predicted_CDS_5|1236_bp atggcccagacgcccgccttcgacaagcccaaagtggaactgcatgtccacctagacgga tccatcaagcctgaaaccatcttatactatggcagctcgggacccaaatttgaacatttc tgctccataagccagaatcctgttattcagaggcctgccctcatggagagaatgagggat cccggggggttgcccccaactctcgggagcatctccaccaactccctgagagatttctgg aggagagggatcgccctcccagctaacacagcagaggggctgctgaacgtcattggcatg gacaagccgctcacccttccagacttcctggccaagtttgactactacatgcctgctatc gcgggctgccgggaggctatcaaaaggatcgcctatgagtttgtagagatgaaggccaaa gagggcgtggtgtatgtggaggtgcggtacagtccgcacctgctggccaactccaaagtg gagccaatcccctggaaccaggctgaaggggacctcaccccagacgaggtggtggcccta gtgggccagggcctgcaggagggggagcgagacttcggggtcaaggcccggtccatcctg tgctgcatgcgccaccagcccaactggtcccccaaggtggtggagctgtgtaagaagtac cagcagcagaccgtggtagccattgacctggctggagatgagaccatcccaggaagcagc ctcttgcctggacatgtccaggcctaccaggaggctgtgaagagcggcattcaccgtact gtccacgccggggaggtgggctcggccgaagtagtaaaagaggctgtggacatactcaag acagagcggctgggacacggctaccacaccctggaagaccaggccctttataacaggctg cggcaggaaaacatgcacttcgagatctgcccctggtccagctacctcactggtgcctgg aagccggacacggagcatgcagtcattcggctcaaaaatgaccaggctaactactcgctc aacacagatgacccgctcatcttcaagtccaccctggacactgattaccagatgaccaaa cgggacatgggctttactgaagaggagtttaaaaggctgaacatcaatgcggccaaatct agtttcctcccagaagatgaaaagagggagcttctcgacctgctctataaagcctatggg atgccaccttcagcctctgcaggttttaaggagtaa >gi568815578r:44520298_44736285|GENSCAN_predicted_peptide_6|168_aa MEDGKQAIKSSDERERVFSLAQSHADNRRLHEPDLQEGSRAVPREDPQWNYQADSPGKME RTNGLFKTHLTKLSLQLKKEDSVKDTAQKLNNQAKTLYLSAVSPAADPPLAPGPPGGWSG LSSKKLELDVGFLRAEEGLARDLQKEKCIPPRVKSEGALQQWPSVGHL >gi568815578r:44520298_44736285|GENSCAN_predicted_CDS_6|507_bp atggaggatgggaagcaggccatcaagagttctgatgaacgggaaagagttttttctcta gcccaatctcacgctgataaccgccggcttcatgaacctgacctccaggaaggcagtaga gcagttccccgagaggacccccaatggaactatcaggcagattccccaggaaagatggaa cggactaatggtcttttcaagacacacctcaccaagctcagcctccaacttaaaaaggag gactctgtcaaggatacagcccaaaaactcaacaaccaagcaaaaaccctgtacctatca gcagtcagtcctgcagccgacccccctctggccccagggcctcctggagggtggagtggt cttagcagcaagaagctggagttggatgttggattcctgagggccgaggaaggacttgca agggacctccagaaggagaagtgcatcccccccagggtcaagagtgagggagccctgcag caatggccatctgtgggacatctgtga >gi568815578r:44520298_44736285|GENSCAN_predicted_peptide_7|250_aa MRGTPKTHLLAFSLLCLLSKVRTQLCPTPCTCPWPPPRCPLGVPLVLDGCGCCRVCARRL GEPCDQLHVCDASQGLVCQPGAGPGGRGALCLLAEDDSSCEVNGRLYREGETFQPHCSIR CRCEDGGFTCVPLCSEDVRLPSWDCPHPRRVEVLGKCCPEWVCGQGGGLGTQPLPAQGPQ FSGLVSSLPPGVPCPEWSTAWGPCSTTCGLGMATRVSNQNRFCRLETQRRLCLSRPCPPS RGRSPQNSAF >gi568815578r:44520298_44736285|GENSCAN_predicted_CDS_7|753_bp atgagaggcacaccgaagacccacctcctggccttctccctcctctgcctcctctcaaag gtgcgtacccagctgtgcccgacaccatgtacctgcccctggccacctccccgatgcccg ctgggagtacccctggtgctggatggctgtggctgctgccgggtatgtgcacggcggctg ggggagccctgcgaccaactccacgtctgcgacgccagccagggcctggtctgccagccc ggggcaggacccggtggccggggggccctgtgcctcttggcagaggacgacagcagctgt gaggtgaacggccgcctgtatcgggaaggggagaccttccagccccactgcagcatccgc tgccgctgcgaggacggcggcttcacctgcgtgccgctgtgcagcgaggatgtgcggctg cccagctgggactgcccccaccccaggagggtcgaggtcctgggcaagtgctgccctgag tgggtgtgcggccaaggagggggactggggacccagccccttccagcccaaggaccccag ttttctggccttgtctcttccctgccccctggtgtcccctgcccagaatggagcacggcc tggggaccctgctcgaccacctgtgggctgggcatggccacccgggtgtccaaccagaac cgcttctgccgactggagacccagcgccgcctgtgcctgtccaggccctgcccaccctcc aggggtcgcagtccacaaaacagtgccttctag >gi568815578r:44520298_44736285|GENSCAN_predicted_peptide_8|35_aa XAGEKLLKRFTLGYWPSNRDWMKRPGPAKTLKPFS >gi568815578r:44520298_44736285|GENSCAN_predicted_CDS_8|108_bp nnggcaggggagaagctgctgaagaggttcacgttaggttattggccatccaacagagac tggatgaaaagacctggcccagcgaagacgttaaaacctttttcatga