GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:31:39 Sequence gi568815584r:94668494_94870015 : 201522 bp : 47.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 10630 10641 12 0 0 99 92 8 0.289 2.53 1.02 Intr + 20879 21290 412 1 1 109 91 159 0.926 12.06 1.03 Term + 22237 22325 89 2 2 101 37 61 0.617 0.12 1.04 PlyA + 24084 24089 6 1.05 2.03 PlyA - 25403 25398 6 1.05 2.02 Term - 27139 27036 104 2 2 83 55 58 0.636 0.44 2.01 Init - 29895 29844 52 0 1 66 94 87 0.966 8.43 2.00 Prom - 31791 31752 40 -0.56 3.00 Prom + 38751 38790 40 -5.86 3.01 Init + 39172 39234 63 0 0 89 81 26 0.554 3.24 3.02 Intr + 42218 42348 131 1 2 38 80 125 0.389 6.19 3.03 Term + 57782 58577 796 2 1 48 32 637 0.055 46.86 3.04 PlyA + 58732 58737 6 1.05 4.07 PlyA - 59130 59125 6 1.05 4.06 Term - 69181 68975 207 1 0 130 50 74 0.794 5.14 4.05 Intr - 69323 69227 97 1 1 43 70 23 0.047 -3.99 4.04 Intr - 82711 82609 103 2 1 112 84 4 0.009 1.63 4.03 Intr - 100156 100027 130 1 1 91 93 232 0.756 24.27 4.02 Intr - 100724 100465 260 0 2 107 80 560 0.849 54.28 4.01 Init - 101522 101168 355 2 1 62 91 640 0.582 59.00 4.00 Prom - 114188 114149 40 -4.46 5.04 PlyA - 115474 115469 6 -0.45 5.03 Term - 119209 119153 57 1 0 77 50 81 0.788 0.89 5.02 Intr - 119587 119482 106 0 1 99 46 49 0.806 2.02 5.01 Init - 121110 121046 65 0 2 62 75 102 0.820 7.01 5.00 Prom - 125828 125789 40 -2.66 6.02 PlyA - 126122 126117 6 1.05 6.01 Sngl - 132970 132740 231 1 0 99 49 238 0.869 16.11 6.00 Prom - 133346 133307 40 -7.36 7.04 PlyA - 133667 133662 6 1.05 7.03 Term - 141258 141026 233 1 2 89 54 124 0.510 5.74 7.02 Intr - 149531 149373 159 0 0 45 32 117 0.336 1.76 7.01 Init - 151989 151917 73 0 1 109 96 -9 0.188 3.24 7.00 Prom - 155754 155715 40 -2.46 8.00 Prom + 173824 173863 40 -4.06 8.01 Init + 175912 175934 23 0 2 108 97 -23 0.616 0.04 8.02 Intr + 178402 178579 178 1 1 82 86 118 0.834 10.92 8.03 Term + 185359 185412 54 0 0 59 48 75 0.234 -1.84 8.04 PlyA + 185644 185649 6 1.05 9.00 Prom + 186371 186410 40 -4.06 9.01 Init + 188261 188336 76 1 1 52 78 91 0.555 4.09 9.02 Term + 191573 191604 32 0 2 126 44 30 0.675 0.42 9.03 PlyA + 191814 191819 6 1.05 10.02 PlyA - 192487 192482 6 1.05 10.01 Term - 200683 200510 174 1 0 95 42 97 0.739 3.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 104589 104766 178 0 1 73 47 167 0.983 8.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:94668494_94870015|GENSCAN_predicted_peptide_1|170_aa MAYETRKQQGRLNSGLPGGLFPFGVEVSMRGSKEPKIRTPRKLSRGRGPSETVKKHTSED GGKCSVVGFSKYKAERATRGGPSLGAAGRGLRAGSREAPSPHPPRGGARAASTARASRQA WGCERGRGDKPGLQPGFLQRPGKRQSFSSGSGEPRADADTSMCGTLIRQS >gi568815584r:94668494_94870015|GENSCAN_predicted_CDS_1|513_bp atggcctatgagacgagaaaacagcaaggaaggctcaactcgggcttgcccggggggctc ttccccttcggcgtggaagtttcaatgcgaggaagcaaagagcctaaaatacgaacaccc aggaaattaagcaggggaaggggcccttctgagacagtaaagaaacatacatccgaagac ggtggaaagtgttcagttgttggcttcagcaaatataaagcagaaagagccacgaggggt ggacccagcctgggcgcggctggccggggcctccgcgctggatccagggaagcgccgtcc ccgcacccgccacgaggcggcgccagagcagcgtccacagcccgggcctctcggcaggct tggggctgcgagaggggccgcggagacaaaccaggcctccaaccaggcttcctgcaaaga ccagggaagcggcagagcttctcctcggggagcggtgaaccgagggccgacgcagacacc tccatgtgtgggacactgattagacaatcatag >gi568815584r:94668494_94870015|GENSCAN_predicted_peptide_2|51_aa MTAVALELSAVWQLEEARWNAGLCALLQHATERRETPSRVSSDARSPYSEH >gi568815584r:94668494_94870015|GENSCAN_predicted_CDS_2|156_bp atgactgccgtagcccttgagctgtctgctgtatggcagctggaggaagcacgctggaac gcaggcctgtgcgccctgctccaacatgccactgagcgcagggagacaccaagcagagtg agttcagatgcccggtccccctatagtgaacattga >gi568815584r:94668494_94870015|GENSCAN_predicted_peptide_3|329_aa MLEAVLEESPREEAAFSWVMKGNLAAGKTIQAQDRDAVGILSSRTGESMENLQKNLLPKQ RRRSRETFTMSGALDVLQMKEEDVLKFLVAGTHLGGTNLDFQMEQYIYKRKSDGIYIINL KRTWEKLLLAARAIIAIENPADVSVISSRNTGQRAVLKFAAATEATPIAGHFTPGTFTNQ IQADFWEPRLLVVTDPRADHQPLTEASYVNLPTIALCNTDSPLRYVDIAIPCNNQGARSV GLMWWTLAREVLRMRGTISREHLWEVMPDLYFYRDPEEIEKEEQAATEKAVTKEEFQGEW TAPAPEFTATKPEVADWSEGVQVPSVPIQ >gi568815584r:94668494_94870015|GENSCAN_predicted_CDS_3|990_bp atgcttgaggcagtgctggaagagagccctcgggaggaggcggctttctcctgggtcatg aaggggaatctggctgcaggaaaaaccatccaagcgcaagacagagatgctgtgggcatc ctcagttccagaactggggaatccatggaaaacctgcagaaaaacttgctgcccaagcag aggaggcgaagcagggaaactttcacaatgtctggagcgcttgatgtcctgcaaatgaag gaggaggatgtccttaagttccttgtagcaggaacccacttaggtggcaccaatcttgac ttccagatggaacagtacatctataaaaggaaaagtgatggcatctatatcataaatctg aagaggacctgggagaagcttctgctggcagctcgtgctattattgctattgaaaaccct gctgatgtcagtgttatatcctccaggaatactggccagagggctgtgctgaagtttgct gctgccactgaagccactccaattgctggtcacttcactcctggaaccttcactaaccag atccaggcagacttctgggagccaaggcttcttgtggttactgaccccagggctgaccac cagcctctcacggaggcatcttatgttaacctacctaccattgctctgtgtaacacagat tctcctctgcgctatgtggacattgccattccatgcaacaaccagggagctcgctcagtg ggtttgatgtggtggacgctggctcgggaagttctgcgcatgcgtggcaccatttctcgt gaacacctatgggaggtcatgcctgatctctacttctacagagatcctgaagagattgaa aaagaagagcaggctgctactgaaaaggcagtgaccaaggaggaatttcagggtgaatgg actgctccagctcctgagttcactgctactaagcctgaggttgcagactggtctgaaggt gtacaggtgccctctgtgcctattcagtag >gi568815584r:94668494_94870015|GENSCAN_predicted_peptide_4|383_aa MPASMFSIDNILAARPRCKDSVLPVAHSAAAPVVFPALHGDSLYGASGGASSDYGAFYPR PVAPGGAGLPAAVSGSRLGYNNYFYGQLHVQAAPVGPACCGAVPPLGAQQCSCVPTPPGY EGPGSVLVSPVPHQMLPYMNVGTLSRTELQLLNQLHCRRKRRHRTIFTDEQLEALENLFQ ETKYPDVGTREQLARKVHLREEKVEVWFKNRRAKWRRQKRSSSEESENAEKWNKTSSSKA SPEKREEEGAGDTKMSRTEPSAQGDPSQGGEQAQRHEPQDSGRKQFPGTGTPGPSQTIFS TAECNGECLAQSDHRGQGQDEQSDQACVTPSLGSELSICFLGLLQPSPADCGHHKQQQLT LSQSGGWKPEIKVSEGLCLFWWL >gi568815584r:94668494_94870015|GENSCAN_predicted_CDS_4|1152_bp atgcccgccagcatgttcagcatcgacaacatcctagccgcccggccgcgctgcaaggac tcggtgttgccggtggcgcacagcgcggcggctcccgtcgtcttcccggccctgcacggg gactcgctctacggcgccagcggcggcgcctcctcggactatggcgccttctacccgcgc cccgtggcccccggcggcgcgggcctcccggccgcggtcagcggctcccgcctcggctac aacaactacttctacgggcagctgcacgtgcaggcggcgcccgtgggcccggcctgctgc ggggccgtgccgccgctgggcgcccagcagtgctcctgcgtcccgacgcccccaggctac gagggccccggttcggtgctggtgtccccggtaccgcaccagatgctgccctacatgaac gtgggcacgctgtcgcgcaccgagctgcagcttctcaaccagctgcactgtcggcggaag cggcggcaccgcaccatcttcactgacgagcagctcgaagctctcgagaacctcttccag gagaccaagtacccggacgtgggcacgcgcgagcagctggcccggaaagtgcacctccgc gaggagaaagtggaggtctggtttaagaaccgccgcgccaaatggaggcggcagaagcgg tcctcatcagaggagtcggagaacgcggagaagtggaacaagacgtcgtcgtcgaaggcg tcaccggagaagagggaagaggaaggtgctggggacacgaagatgagcaggacagagccc tcggctcaaggagaccccagtcagggaggtgaacaggctcagagacatgaaccccaagac agtggcagaaaacagtttcctggcacagggacacccggcccatctcagaccatttttagc acagctgagtgcaatggtgagtgcttggcccagagcgaccacagaggtcaaggtcaagac gaacagtcagatcaggcctgtgtgaccccctctttaggctcagagctcagcatctgtttc ctggggctgctgcaaccaagtcccgcagactgcgggcatcataagcaacagcaacttact ctctctcagtctggaggctggaagcctgagatcaaggtgtcagaagggctgtgcttgttc tggtggctctag >gi568815584r:94668494_94870015|GENSCAN_predicted_peptide_5|75_aa MAAFADIVVPISSPGTALGMARKSQPMKEEEMQMATQAMGQYWMAVRPCWDPDSHRKPQK PVISSSIEGCLHPKA >gi568815584r:94668494_94870015|GENSCAN_predicted_CDS_5|228_bp atggcagcttttgctgacatcgtggttcctatcagctcccctggcactgcgctggggatg gccaggaaatctcagcctatgaaagaggaagaaatgcaaatggctactcaggctatgggt cagtactggatggccgtcagaccctgctgggaccccgattcacaccggaagccacagaag ccggtcatcagcagctccatagagggctgcctacatccaaaggcatag >gi568815584r:94668494_94870015|GENSCAN_predicted_peptide_6|76_aa MGAYKYIQELWRKKQSDVMRFLLRVHCWQCVSSLLFTVLPAPPGLIKHTDWATRPNKVTL YVGFVFAVVAENTQFP >gi568815584r:94668494_94870015|GENSCAN_predicted_CDS_6|231_bp atgggtgcatacaagtacatccaggagctatggagaaagaagcagtctgacgtcatgcgc tttcttctgagggtccactgctggcagtgtgtcagctctctgctcttcacagtgctccct gccccacctggcctgataaagcacaccgactgggctacaaggccaaacaaggttacatta tatgtaggattcgtgttcgccgtggtggccgaaaacacccagttcccttag >gi568815584r:94668494_94870015|GENSCAN_predicted_peptide_7|154_aa MESASTSSLGEKVLESGCHRSIEKVDFPNGITKSTAAPWTDRCLWMSPTASICQGSCTIA SEYKWKALKIYYQTPGLGPPDAPWGRRGGHICPPRDILSTELNGKLKGTCPRDSQVHAAV QNETGFWGNTYTREEKLLQVGQRLFWCQETEGHN >gi568815584r:94668494_94870015|GENSCAN_predicted_CDS_7|465_bp atggagtctgcctccacctccagccttggagagaaggttctggaatctggttgccatagg agtatagagaaagtagacttccccaatggaatcaccaagtccacagctgctccctggaca gacagatgcctgtggatgtctccaactgcctccatctgccagggcagctgcaccatcgct tcagagtataagtggaaagcattgaaaatttattatcagactcctggattaggccctcca gatgccccatggggtcgaagggggggccacatctgccctccccgtgacatcctttccaca gagctcaatgggaagctgaaagggacttgccctagagacagccaggttcatgctgctgtt caaaatgaaacaggattctggggaaatacctacacaagagaggaaaagctgctacaagtt ggacagagactcttctggtgtcaggaaacagaggggcacaactga >gi568815584r:94668494_94870015|GENSCAN_predicted_peptide_8|84_aa MESGNRARGLFKPHLVKTTTWFTENGRHLYYKRVLHTVREPSSKIPYRTGATGAATRISS FLFTKAPNSAEMTVHQTQDQPLED >gi568815584r:94668494_94870015|GENSCAN_predicted_CDS_8|255_bp atggagtccggtaacagagccagagggctttttaaacctcatttggtgaagacgacaaca tggttcaccgagaatggcagacacctgtactacaaaagggtcctacacactgttagggag ccgtctagcaagattccataccgcacaggggccaccggggctgctacacggatttctagt ttcctcttcaccaaggcacccaactcagcagaaatgactgtgcaccagacccaggaccag cctttagaggactga >gi568815584r:94668494_94870015|GENSCAN_predicted_peptide_9|35_aa MERASVLLQKELAMALLRLVPAVTLSKMETRIVPV >gi568815584r:94668494_94870015|GENSCAN_predicted_CDS_9|108_bp atggaaagagcaagtgttctgctgcagaaagaacttgccatggccctgctccgccttgtg ccagcagtgactctaagtaaaatggagacacgtatagtgcccgtgtag >gi568815584r:94668494_94870015|GENSCAN_predicted_peptide_10|57_aa RDSVLWDLQYCTADLLQSSGPRYAKDLVPNLANSRCFFHGDERQNFMVAKRVDFGVK >gi568815584r:94668494_94870015|GENSCAN_predicted_CDS_10|174_bp cgtgattccgttctctgggacctgcagtactgtactgcggacctgttgcagagctcaggg ccaagatatgcaaaggacttggtccctaatctagcaaatagtaggtgcttcttccatgga gatgagaggcagaatttcatggtagccaagcgtgtggattttggagtcaaatga