GENSCAN 1.0	Date run: 6-Nov-116	Time: 08:09:33

Sequence gi568815579f:33700455_33911551 : 211097 bp : 49.73% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1969   2119  151  1  1   90   -5    96 0.252   0.66
 1.02 Intr +   5269   5466  198  0  0  130   33    97 0.275   7.95
 1.03 Intr +  12793  12937  145  0  1   71   97    20 0.015   1.06
 1.04 Intr +  23952  23994   43  1  1  103   71    18 0.051  -1.10
 1.05 Intr +  24094  24196  103  1  1   48   71    86 0.177   3.08
 1.06 Intr +  26603  26678   76  1  1  126   96    13 0.317   4.99
 1.07 Term +  30018  30181  164  1  2   84   48    86 0.313   2.30
 1.08 PlyA +  31822  31827    6                               1.05

 2.00 Prom +  33873  33912   40                              -3.46
 2.01 Init +  39325  39396   72  0  0   37   30    86 0.331  -1.23
 2.02 Intr +  39626  39813  188  1  2   35   76   156 0.472   7.59
 2.03 Intr +  49397  49467   71  2  2   39   96   112 0.265   5.93
 2.04 Intr +  51183  51309  127  1  1   62   70    64 0.811   1.74
 2.05 Intr +  55297  55542  246  1  0   37   74   148 0.555   4.87
 2.06 Intr +  62017  62186  170  1  2   -6   58   155 0.430   2.69
 2.07 Intr +  62994  63149  156  1  0   70   64    74 0.026   3.18
 2.08 Intr +  68565  68674  110  1  2   34   98     2 0.001  -4.20
 2.09 Term +  71503  72609 1107  0  0  117   55  1951 0.989 186.77
 2.10 PlyA +  73031  73036    6                               1.05

 3.00 Prom +  73291  73330   40                             -16.75
 3.01 Init +  73644  73759  116  2  2   74   47    61 0.848   0.28
 3.02 Intr +  73973  74197  225  2  0   86   49   253 0.961  18.30
 3.03 Intr +  74694  74841  148  0  1   58   80    62 0.490   2.64
 3.04 Intr +  76504  76639  136  0  1  118   36    14 0.573  -0.66
 3.05 Intr +  76734  76864  131  1  2  119   62    22 0.515   3.11
 3.06 Intr +  82067  82354  288  1  0   76   43   124 0.230   4.14
 3.07 Intr +  87081  87209  129  2  0   42   81    65 0.272   2.09
 3.08 Intr +  94607  94663   57  1  0  107   52    38 0.059   1.18
 3.09 Term +  94965  95012   48  2  0   83   50    53 0.049  -1.70
 3.10 PlyA +  97388  97393    6                               1.05

 4.00 Prom + 100243 100282   40                              -2.86
 4.01 Init + 100725 100888  164  2  2   85  110   327 0.991  31.80
 4.02 Intr + 106409 106553  145  2  1  100  115   304 0.999  34.58
 4.03 Intr + 110793 111098  306  2  0   87   94   691 0.965  66.25
 4.04 Term + 112336 112494  159  0  0  136   47   197 0.999  18.34
 4.05 PlyA + 114730 114735    6                               1.05

 5.07 PlyA - 114742 114737    6                               1.05
 5.06 Term - 117780 117725   56  1  2   72   39   101 0.173   1.32
 5.05 Intr - 132489 132340  150  1  0   52   94    45 0.001   1.63
 5.04 Intr - 140387 140346   42  0  0   93   92    30 0.000   2.11
 5.03 Intr - 162748 162665   84  2  0  105   78    24 0.293   2.89
 5.02 Intr - 166629 166458  172  0  1  108   75    32 0.131   3.42
 5.01 Init - 195692 195621   72  2  0   92   55    87 0.863   4.87

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  70959  70996   38  1  2  115  113   -13 0.984   1.88

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:33700455_33911551|GENSCAN_predicted_peptide_1|293_aa
XTFLANTQQIVATVSSPNRSPKALPLVELIKPTTKTPSEISVQLENWVSHYGTMAATLSK
GPFGEGPGLAQQGTLQIRLPCCLCVTAMSPVLGEGPKSPESIQGGGIQGCGMGNDDGQSV
EMPHLPFKICDLLMEEGGLAPLVGHHHLGDAMVPSQGSQLCNGMLDEDTPGPHPGSAKEP
LEDTPSLTECGRKPGQYCLGLHLPRKALEGHKDQALSVPKMHFIHSRPGATSGHASCSPG
KKWHMPHCLFESLAYAAAYSSSLDVLLERGPNPDPKRGLLDLTQERIQGEFIK

>gi568815579f:33700455_33911551|GENSCAN_predicted_CDS_1|882_bp
nngacatttcttgccaacacccagcagattgtggcaactgtcagctccccaaaccgaagc
cccaaggctcttccattggttgaattaataaaacctacaacaaagacaccttctgaaatt
tcagtccaactggaaaactgggtctcccattatggcaccatggcagccacactcagcaag
ggtccctttggagaggggcctggcctggcccagcaagggactctgcagattcgccttccc
tgctgcctctgtgtcacagcgatgtccccagtactaggagagggacccaagagtcctgag
agcattcaaggaggtggcatccaggggtgtggtatgggcaatgatgatggtcagtctgtg
gagatgccacatcttcctttcaagatctgcgatctcctaatggaggaaggaggattggct
cccctggtaggccaccatcacctgggagatgcaatggtgccctcccaggggagccaatta
tgcaatggcatgcttgatgaggatacccccggcccccatccgggctcagccaaggagccc
ctggaggacacgccctccctcactgagtgtgggcgcaagcctggacagtattgcctgggg
ctgcatctgcctcggaaggccctggaggggcacaaagaccaggccctttctgtgccaaag
atgcacttcatccacagtcgcccgggagccacttctggccacgcgtcctgctccccagga
aagaagtggcacatgccccactgccttttcgagagtctggcttatgctgctgcctactcg
agttcattggatgtgttactggaaaggggtcccaatccagaccccaagagagggttattg
gatctcacgcaagaaagaattcagggcgaattcatcaagtga

>gi568815579f:33700455_33911551|GENSCAN_predicted_peptide_2|748_aa
MRALFSDIQALGISQKEMVKESDEDACKFSWNLMQIRGYAKAIQQAVGAPSGETQTSSQE
QTSCSSLQGYITRGLWCFFVEVPKLLGWRTKHPAHERVRVVALIVQQPLAGSIREWLQQG
HGIRQLGPGPTTLPAYLGSSVDEGDPLNLFETQAAALVGLQRKQENSSDIFFSSPFTVTP
DALPTAITWEHIPFAKLAGLIAGPLVEMCRQRLSKEFEALKGEFRDLGHCLPGAQRGNRI
TKRNKCGQSRQALIGQRQEDAGSAPLQMHPSVAALGAGAALREIQPLQREPELSSGPRNS
RLLCWGSPATWNPTYLSRVLGQQVAVTVTEAGLQAVPWGPSREFNAKGSSSASIRVGQPQ
KLRLRVQRSRRQCPPVQSSQDLPPGGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQ
PPLQRGTRLRLRQRRRRLLIKKMPAAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQER
KRVMQEACAKYRASSSRRAVTPRHVSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLA
SSTADIQHNTVHYGSALKRLDTFDRQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPN
SYYHPVFGKAILARYRANASREALRTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRLCS
PCLIDYDFVGKFESMEDDANFFLSLIRAPRNLTFPRFKDRHSQEARTTARIAHQYFAQLS
ALQRQRTYDFYYMDYLMFNYSKPFADLY

>gi568815579f:33700455_33911551|GENSCAN_predicted_CDS_2|2247_bp
atgagagctctcttctctgacatccaggctctgggcatctctcaaaaagagatggtcaaa
gaatctgatgaggatgcctgcaagttttcctggaatttgatgcaaatacgaggctatgct
aaagccattcagcaagctgtaggagccccaagtggagaaactcaaactagttcacaggag
caaacatcatgcagctccctccaaggctacatcactcgtgggttgtggtgtttctttgtg
gaagtccccaagctcttaggctggcgcaccaagcacccagctcatgaaagggtccgggtc
gtggccctcatcgtgcagcagccattggcaggcagcatcagagagtggttacagcaagga
catggaatcagacagctcgggcctggacccaccactctgccagcctacctgggcagcagt
gtggatgagggtgacccacttaacctctttgagactcaagcagctgcgctggttggactt
caaaggaagcaggaaaacagctcagatatcttctttagctctcctttcacggtgacccca
gacgccctaccaacagccattacatgggagcacattccgtttgcaaagctggcgggtcta
attgcagggcctttggtggagatgtgcaggcagaggctaagcaaagagtttgaggccttg
aaaggggaattcagggacctcgggcactgtcttccaggagcccagcgagggaacagaatc
actaaacgaaacaagtgcggtcagagccgtcaggcgctcatcggccagagacaggaagat
gcaggctccgctcctctgcagatgcaccccagtgtcgctgcgctgggggcaggagctgcg
ctgcgggagattcagcccctgcaaagggaaccagagctgtcatcagggcccaggaacagc
cggctcctgtgctggggcagccctgccacctggaaccccacgtacctgtcccgtgtccta
gggcagcaggtggccgtgactgtgacagaggctggtctccaggctgtgccctggggaccc
agcagagaatttaatgccaagggctctagctcagcgagcattagagtgggacagccccag
aagcttaggctcagagtccagaggtccaggagacagtgtccccctgtccagtcctcacag
gacctcccaccaggcggctcccaggatggtgacttgaaggaacccacagagagggtcact
cgggacttatccagtggggccccgaggggccgcaacctgccagcgcctgaccagcctcaa
cccccgctgcagaggggaacccgtctgcggctccgccagcgccgtcgccgtctgctcatc
aagaaaatgccagctgcggcgaccatcccggccaacagctcggacgcgcccttcatccgg
ccgggacccgggacgctggatggccgctgggtcagcctgcaccggagccagcaggagcgc
aagcgggtgatgcaggaggcctgcgccaagtaccgggcgagcagcagccgccgggccgtc
acgccccgccacgtgtcccgtatcttcgtggaggaccgccaccgcgtgctctactgcgag
gtgcccaaggccggctgctccaattggaagcgggtgctcatggtgctggccggcctggcc
tcgtccactgccgacatccagcacaacaccgtccactatggcagcgctctcaagcgcctg
gacaccttcgaccgccagggtatcttgcaccgtctcagcacctacaccaagatgctcttt
gtccgcgagcccttcgagaggctggtgtccgccttccgcgacaagtttgagcaccccaac
agctactatcacccggtcttcggcaaggccatcctggcccggtaccgcgccaatgcctct
cgggaggccctgcggaccggctctggggtgcgttttcccgagttcgtccagtacctgctg
gacgtgcaccggcccgtggggatggacattcactgggaccatgtcagccggctctgcagc
ccctgcctcatcgactacgatttcgtaggcaagttcgagagcatggaggacgatgccaac
ttcttcctgagcctcatccgcgcgccgcggaacctgaccttcccccggttcaaggaccgg
cactcgcaggaggcgcggaccacagcgaggatcgcccaccagtacttcgcccaactctcg
gccctgcaaaggcagcgcacctacgacttctactacatggattacctgatgttcaactat
tccaagccctttgcagatctgtactga

>gi568815579f:33700455_33911551|GENSCAN_predicted_peptide_3|425_aa
MHMLWRRRGEAIEETSPKAYMFTPERAPEVSAVNVVLGRGDAADSPRGGATGSLGGPPGG
GPADSQVEKLPDPQVEELLDPQIEELLDLQVEELLDPQMEEMLDPQVEELLIPRKVSLVM
GAFGEARGSAPMASAQQGTPELSADTASGSSLETVRMTQPLPWGLVAHTGPVQGAQDKVR
QAVDRGSPIISPYSQAGFANFILRAEPLGSTRGEPSEHPAGVPHSSTALQAQGTQSWAQE
AVLNHGIQKWAKLRTELSIEDLGPQRSGRSLPQGPLPPRFIKERGLIDSQFHMAGDASGN
LRSWQKGKQTRLSSHGSRREKCRAKEEKRLIKPSDLVELTHYYKNSTGQCKNGLIQPALP
KPAHSSTPLQGIQEFQTEPGLPGHYEGIFIKEQLKIHKLRKLTDGACGIFLQADPERPPP
RECGR

>gi568815579f:33700455_33911551|GENSCAN_predicted_CDS_3|1278_bp
atgcacatgctctggaggaggagaggagaagccatagaggaaacttctccaaaagcatac
atgttcacaccagagagggctccagaagtgtcagcagtcaatgtggttctgggaagggga
gatgctgctgattctccacgtggaggtgccactggatccctgggtggacccccaggtgga
ggccctgctgattcccaggtggagaaactgccggatccccaggtggaggaactgctggat
ccccagatagaggaactgctggatctccaggtagaggaactgctggatccccagatggag
gaaatgctggatccccaagtggaggaattgctgatccccaggaaagtgtccctggtcatg
ggtgcatttggtgaagccagaggcagtgcccctatggcatctgctcagcagggaacacct
gaactctccgctgacacagcatctgggagcagcttggagacagtgaggatgacacagccc
ctcccctggggtttggtggcccatacagggcctgtccaaggggcccaggacaaagtcagg
caggcagtggacagggggtcccccatcatctccccctactctcaggcagggtttgccaat
ttcatcctcagggctgagccccttgggagcactcgtggtgaacccagcgaacatcctgct
ggcgtccctcactcctccactgctctacaagcccaggggacccagagctgggcacaagag
gctgtgttgaaccacgggatccagaaatgggcaaagctcagaacagagctgagcatcgag
gatctgggaccacagagaagtgggaggagtctcccacaaggtcctctgccacccagattt
ataaaggaaagaggtttaatcgactcacagttccacatggctggggatgcttcaggaaac
ttacgatcatggcagaaagggaagcaaacacgtctttcttcacatggcagcaggagagag
aagtgccgagcaaaggaggaaaagcgtcttataaaaccatcagatcttgtggaactcact
cactattacaagaacagcacggggcaatgcaagaatggactaatacagccagctcttccc
aagccagcacattccagcaccccactccaaggaatccaagaattccagaccgaacccggg
cttccaggtcattatgaaggtatattcatcaaggagcagctcaaaatccataaattaagg
aaattaactgatggggcctgcggcatctttctccaggccgaccccgagcgccccccaccc
cgggagtgcggccgctag

>gi568815579f:33700455_33911551|GENSCAN_predicted_peptide_4|257_aa
MSRLSLTRSPVSPLAAQGIPLPAQLTKSNAPVHIDVGGHMYTSSLATLTKYPDSRISRLF
NGTEPIVLDSLKQHYFIDRDGEIFRYVLSFLRTSKLLLPDDFKDFSLLYEEARYYQLQPM
VRELERWQQEQEQRRRSRACDCLVVRVTPDLGERIALSGEKALIEEVFPETGDVMCNSVN
AGWNQDPTHVIRFPLNGYCRLNSVQVLERLFQRGFSVAASCGGGVDSSQFSEYVLCREER
RPQPTPTAVRIKQEPLD

>gi568815579f:33700455_33911551|GENSCAN_predicted_CDS_4|774_bp
atgtcccggctgtctctcacccggtcgcctgtgtctcccctggctgcccagggcatcccc
ctgccagcccagctcaccaagtccaatgcacctgtgcacatcgatgtgggcggccacatg
tacaccagcagcctggccacgctcaccaagtaccctgactccaggataagccgcctcttc
aatggcactgaacccatcgtcctggacagtttgaagcaacattatttcattgaccgggat
ggggagattttccgctacgtcctgagcttcctgcggacgtccaagctgctgcttccggat
gactttaaggacttcagtctgctgtacgaggaggcgcgctactatcagctccagcccatg
gtgcgcgagctggagcgctggcagcaggagcaggagcagcggcgccgcagccgggcctgt
gactgcctggtggtgcgcgtcacgcccgacttgggcgagcggatcgcactcagcggcgag
aaggccctcatcgaggaggtcttccccgagaccggagacgtcatgtgcaactccgtcaac
gccggctggaaccaggaccccacgcacgtcatccgcttcccgctcaatggctactgccgg
ctcaactcggtacaggtcctggagcggctgttccagaggggtttcagcgtggctgcgtcc
tgtgggggcggtgtggactcctcccagttcagcgagtatgtgctttgccgggaggagcgg
cggccgcagcccacccccactgctgttcgaatcaagcaggaacccctggactag

>gi568815579f:33700455_33911551|GENSCAN_predicted_peptide_5|191_aa
MARARWLTPVIPALWETEAGRLLEQELWSPRPSLWPPPLEVPASTGLIFVDLVFQPSVAC
LLADPGFSLSVSAMQVPVGFSGPCSSLPGLQLHPSAVNLAMPSGQCLLGVSTHIYCEIME
LVLCRRHFPELLGVCGLHSSAKWFRGPGSSYATAWRAGRHRAKLTLPLQAVIAVHKALPY
LTGPPENIGAA

>gi568815579f:33700455_33911551|GENSCAN_predicted_CDS_5|576_bp
atggcccgggcgcggtggctcacgcctgtaatcccagcactttgggagactgaggcgggc
cgattactcgagcaggagctgtggagtcctaggccttcactttggcctcctcctctggag
gtacccgcaagcacaggactgatatttgtggacctagtgttccagccctctgttgcctgc
ctgctcgccgaccctgggttctcactaagtgtctctgccatgcaagtgcctgttggcttc
tcaggcccctgctccagcctgcctggcctccagctgcacccgtcagctgtcaaccttgct
atgccttctggtcagtgtcttctcggagtatctactcacatctattgtgagataatggag
ctcgtgttatgcagaaggcacttccctgagctgcttggtgtttgcgggctacattcttct
gctaaatggtttagagggcctgggtcttcttatgccactgcatggagggcaggtagacac
agagctaagctgacgctgcccctgcaggctgtgattgcagtccacaaagcattaccttac
ctcaccggtcctccagagaacatcggggctgcctag