GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:50:40

Sequence gi568815588f:122293201_122529935 : 236735 bp : 44.13% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Init +   3296   3396  101  1  2   66   88    37 0.489   1.23
1.02  Intr +   4568   4637   70  2  1   70  100   116 0.939  10.18
1.03  Intr +   5804   5934  131  1  2   72   71   121 0.667   8.29
1.04  Intr +   8234   8397  164  2  2   81   59    66 0.127   2.62
1.05  Intr +  13269  13329   61  1  1   59   81    48 0.002  -1.01
1.06  Intr +  36280  36371   92  1  2   34  121    77 0.399   5.24
1.07  Intr +  37976  38058   83  0  2   97   73    90 0.931   7.76
1.08  Intr +  39236  39313   78  1  0   81   84    39 0.783   2.55
1.09  Intr +  41681  41779   99  1  0   93  110    15 0.955   4.51
1.10  Intr +  43294  43494  201  0  0   75  -43   235 0.285   8.58
1.11  Intr +  44817  44852   36  2  0   88   92    20 0.206   0.86
1.12  Term +  51535  51696  162  0  0  101   47   120 0.983   7.14
1.13  PlyA +  52479  52484    6                               1.05

2.00  Prom +  54045  54084   40                              -3.56
2.01  Init +  54285  54450  166  2  1   61   84    67 0.491   3.40
2.02  Term +  75448  75611  164  2  2   49   42   132 0.475   2.80
2.03  PlyA +  78899  78904    6                               1.05

3.02  PlyA -  78974  78969    6                               1.05
3.01  Sngl -  82107  81466  642  0  0   68   50   368 0.881  25.18
3.00  Prom -  90929  90890   40                              -0.96

4.00  Prom +  98232  98271   40                              -4.06
4.01  Init + 100001 100083   83  1  2   63   75    87 0.049   5.34
4.02  Intr + 107899 107981   83  1  2   75   55    75 0.042   2.18
4.03  Intr + 113376 113473   98  1  2   63   98    79 0.098   6.13
4.04  Intr + 119720 119845  126  1  0   82   91    58 0.987   6.38
4.05  Intr + 122659 122802  144  0  0   69   89    92 0.995   7.88
4.06  Intr + 124700 124768   69  1  0   55   84    61 0.750   1.78
4.07  Intr + 130999 131063   65  0  2  104  116    33 0.950   5.22
4.08  Intr + 131696 131759   64  2  1   70  110    13 0.933   0.42
4.09  Term + 136424 136738  315  1  0  115   54   191 0.911  13.34
4.10  PlyA + 138579 138584    6                               1.05

5.04  PlyA - 139147 139142    6                               1.05
5.03  Term - 141012 140854  159  0  0   28   44   139 0.391   1.44
5.02  Intr - 143432 143300  133  1  1   72   -1   124 0.284   2.55
5.01  Init - 156494 156454   41  2  2  105   87    39 0.570   5.21
5.00  Prom - 156809 156770   40                              -4.96

6.00  Prom + 163666 163705   40                              -3.86
6.01  Init + 168453 168924  472  2  1   99   73   990 0.448  92.29
6.02  Term + 174393 174457   65  1  2   69   49    71 0.156  -0.65
6.03  PlyA + 174575 174580    6                              -0.45

7.04  PlyA - 174866 174861    6                               1.05
7.03  Term - 175252 175169   84  1  0   55   43   242 0.962  14.05
7.02  Intr - 180076 180015   62  2  2   68   94    16 0.468  -1.35
7.01  Init - 183451 183241  211  1  1   44   59   167 0.522   8.35
7.00  Prom - 183765 183726   40                              -3.16

8.00  Prom + 189417 189456   40                              -4.36
8.01  Init + 190149 190212   64  2  1   38   97    21 0.316  -0.69
8.02  Intr + 195702 195801  100  1  1   99   42   118 0.926   7.47
8.03  Intr + 196222 196426  205  1  1   68  110   364 0.913  35.80
8.04  Intr + 197543 197597   55  1  1   46   46    13 0.276  -8.55
8.05  Intr + 198844 199004  161  2  2   75   85   118 0.712   9.91
8.06  Intr + 213491 213685  195  1  0   59   66   323 0.953  26.81
8.07  Intr + 215456 215570  115  1  1  117  102   101 0.999  14.42
8.08  Intr + 216896 216953   58  0  1  104   94    97 0.686   9.84
8.09  Intr + 218770 218865   96  1  0   35   75   113 0.823   3.82
8.10  Term + 220991 221159  169  2  1   83   48   339 0.997  26.75
8.11  PlyA + 221685 221690    6                              -1.95

9.05  PlyA - 221726 221721    6                              -3.24
9.04  Term - 222711 222595  117  0  0   78   38   161 0.951   8.64
9.03  Intr - 223270 223174   97  0  1   48  106    37 0.754   1.51
9.02  Intr - 226192 226060  133  2  1   72   56    89 0.790   3.80
9.01  Intr - 226392 226249  144  1  0  100   48    37 0.569   1.15

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  14648  14501  148  0  1   35   85   122 0.889   5.89
S.002 Term -  94471  94415   57  1  0   68   49    76 0.846  -0.61
S.003 Init -  97215  97123   93  0  0   74  115    37 0.848   5.38
S.004 Init + 100001 100141  141  1  0   63   72   152 0.910  11.23

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:122293201_122529935|GENSCAN_predicted_peptide_1|425_aa
MSLRLCFRMRSQWLLSGALLNEVNILQSNFAAERCVDVMIARLKPSTIKKFYEAGCKYKE
EQLTTGCEKWLEMNLVPLGGTQIHLHKIPQDLLHKVLKSPRSCLHRQSRAITEGKRALCG
FRHVISHVPTLTTPQALPTSSEVDLQWLPLLKLEGDEDIAAAGQLSNSQEVPQPVCFPEN
CCFLDRDIGRSLRPLFLCLRLHGITKGKDLEVLRHLNFFPESWLDQVTVNHYHALENGGD
MVHLKDLNTQAVRFGLLFNQENTTYSKTIALYGFFFKIKGLKHDTTSYSFYMQRIKHTDL
ESPSAVYEHNHVSLRAARLVKYEIRAEALVDGKWQEFRTNQIKQKFGLTTSSCKSHASVH
TLKIQTVGIPIYRSTNSQAVCPESAKPRFKCSVDSMASGCPVGLADVGQRAPMAAPHDFE
LCDFE
>gi568815588f:122293201_122529935|GENSCAN_predicted_CDS_1|1278_bp
atgagtttaaggttgtgcttccgtatgagatcccagtggctactctcgggtgccctgctc
aatgaagtgaacatacttcaaagcaattttgctgcagagaggtgcgtggatgtgatgata
gccagactcaagccaagcaccatcaagaaattctacgaggccggctgcaagtacaaggaa
gagcagctcaccaccggctgcgagaagtggctggaaatgaacttggttcctctagggggg
acgcagatccacctccacaaaatcccacaggacctgctccacaaagtgctgaagtccccc
aggagctgtctgcacagacagtccagggctatcactgaagggaaaagagccctgtgtggc
ttcaggcatgtcatcagccatgtgcccacgctgaccactccccaggctctgcccacatct
tctgaggttgatctccagtggctgcctttgttaaagcttgagggagatgaggacattgca
gctgcggggcagttaagcaactcacaggaggttccacagccagtctgctttcctgagaac
tgttgctttctggaccgggacataggacggagcttgaggccgctcttcctctgcttgcgt
ctgcacggcatcaccaaaggcaaggatctggaggtgctgcggcaccttaacttcttccca
gagtcatggctcgaccaggttacagtcaaccattaccacgcactggagaatgggggcgac
atggtccacctgaaagatcttaacacccaggctgtgagatttgggctgctctttaaccag
gagaatacaacttattcgaaaacgattgctctatatggattcttctttaagataaaggga
ctcaaacatgatactacctcttatagtttttacatgcagagaataaagcacacagacctg
gaatctccctctgcggtctacgagcacaaccacgtcagcctgcgagcggcacgcctggtg
aagtatgagatcagagcagaggccctggttgacggcaagtggcaggagttcaggacaaac
cagatcaagcagaagtttgggttgaccacgtcatcctgcaaaagccatgcaagtgttcac
accttgaaaatccaaactgtgggcatcccaatctataggagcactaactcacaggcggtg
tgcccagagtcggccaagccgcgtttcaagtgctccgtggactcaatggctagtggctgc
cctgtgggcttggcagatgtgggacaaagggctcctatggctgctcctcatgactttgag
ctctgtgactttgaatga
>gi568815588f:122293201_122529935|GENSCAN_predicted_peptide_2|109_aa
MVAFSSLELNVPCSFCSGLLESWDRVVQGPLLTCWRERGHVEENLSPTTSPNCQTSPRKP
RQGVRKAVIRITCNGTGQIIVNGIRACDFWFSQLRMGGITMGILSIVQR
>gi568815588f:122293201_122529935|GENSCAN_predicted_CDS_2|330_bp
atggtagcattcagtagtttggagctcaatgtgccttgcagcttctgctctggcctcttg
gagtcttgggaccgtgtagtgcaggggcctctactaacctgctggagggaacgcggccat
gtggaggagaacctcagcccgaccaccagccccaattgccagacatcccctcgaaagcca
cgccaaggggtcaggaaagccgtgattcgcataacttgcaatgggactgggcaaataatt
gtgaatggaattcgtgcttgtgacttctggttctcacagctgcggatgggggggatcacc
atgggaattctgagtattgttcagcgctga
>gi568815588f:122293201_122529935|GENSCAN_predicted_peptide_3|213_aa
MPGDARPAGTALEGGAPTGATGNPRAAARTAPRAGFLGGRRRRGARRPRDRRGPPALPRR
APLRADVGKTIGDPREEDAAPSRCPKPGQVSRPGGLQVLWRVARTCAEGAPAPHSPAPTA
PLRTPPRSPQREPRSSSPAAPQPPPPRAWPRPPPLLQRGRVLQAKLYRGLGPEHPRRRGP
GSHRGSAARSGVRDGQPAHCARVPTLPQPSDGG
>gi568815588f:122293201_122529935|GENSCAN_predicted_CDS_3|642_bp
atgcccggggacgcgcgccccgcgggcacagccctggagggaggcgctccgaccggcgcc
accgggaatccccgggcagctgcgcgcaccgcgccccgagcgggatttctcggggggcgg
agacgccgaggagctcgtcggccccgcgaccgccgcgggcctcccgccctcccccgccgg
gcgccgctgagggctgacgtcgggaaaacaatcggggacccacgggaagaagacgccgcc
ccctcccgctgccccaagccagggcaggtgagccggcccggcggcctccaggtgctgtgg
cgcgtggcgcggacgtgcgcagagggggcgccggctccgcactcaccagcacccaccgcc
cctctgcggacgcctccccgctcgccacagcgggaaccgcgctccagcagcccggcagcc
ccgcagccgccgccacctcgcgcctggccccgtccaccacccctgctccagcgcggccgc
gttcttcaagctaaactttaccgcggcctcggcccggagcacccgcgccgccgcggcccc
ggctcccacagaggctcggctgcccggagcggcgtccgcgacggccagcccgcgcactgc
gcacgcgtccccacgctgcctcagccctcagacggcggctga
>gi568815588f:122293201_122529935|GENSCAN_predicted_peptide_4|348_aa
MPYVDRQNRICGFLDIEENENSGKFLRREKAGGLHAEACGEKSVVSLELNTLKLEVMNAG
MRKYFLQANDQQDLVEWVNVLNKAIKITVPKQSDSQPNSDNLSRHGECGKKQVSYRTDIV
GGVPIITPTQKEEVNECGESIDRNNLKRSQSHLPYFTPKPPQDSAVIKAGYCVKQGAVMK
NWKRRYFQLDENTIGYFKSELEKEPLRVIPLKEVHKVQECKQSDIMMRDNLFEIVTTSRT
FYVQEHPPGPSESKHAFRPTNAATATSHSTASRSNSLVSTFTMEKRGFYESLAKVKPGNF
KVQTVSPREPASKVTEQALLRPQSKNGPQEKDCDLVDLDDASLPVSDV
>gi568815588f:122293201_122529935|GENSCAN_predicted_CDS_4|1047_bp
atgccttatgtggatcgtcagaatcgcatttgtggttttctagacattgaagaaaatgaa
aacagtgggaaatttcttcgaagagaaaaggcaggaggactacatgcagaggcttgtggg
gagaagagcgtggtgtctctggagctgaacactctgaagctagaggttatgaatgcagga
atgaggaagtacttcctacaagccaatgatcagcaggacctagtggaatgggtaaatgtg
ttaaacaaagctataaaaattacagtaccaaagcagtcagactcacagcctaattctgat
aacctaagtcgccatggtgaatgtgggaaaaagcaagtgtcttacagaactgatattgtt
ggtggcgtacccatcattactcccactcagaaagaagaagtaaatgaatgtggtgaaagt
attgacagaaataatctgaaacggtcacaaagccatcttccttactttactcctaaacca
cctcaagatagtgcggttatcaaagctggatattgtgtaaaacaaggagcagtgatgaaa
aactggaagagaagatattttcaattggatgaaaacacaataggctacttcaaatctgaa
ctggaaaaggaacctcttcgtgtaataccacttaaagaggttcataaagtccaggaatgt
aagcaaagcgacataatgatgagggacaacctctttgaaattgtaacaacgtctcgaact
ttctatgtgcaggagcatccccccggtccttcagaatccaaacacgctttccgtcctacc
aacgcagccaccgccacctcacattccacagcctctcgcagcaactctttggtctcaacc
tttaccatggagaagcgaggattttacgagtctcttgccaaggtcaagccagggaacttc
aaggtccagactgtctctccaagagaaccagcttccaaagtgactgaacaagctctgtta
agacctcaaagtaaaaatggccctcaggaaaaagattgtgacctagtagacttggacgat
gcgagccttccggtcagtgacgtgtga
>gi568815588f:122293201_122529935|GENSCAN_predicted_peptide_5|110_aa
MVNPESMLEMKEARFLVDIFVVFLGLLCAACRTLKTPIDPTPSGQSFSTWKKPVSLYQKT
QLHIWSSINKTIAAVDKHHGNNQVSKTQWERKEHRNISHFQGSRLLDKYD
>gi568815588f:122293201_122529935|GENSCAN_predicted_CDS_5|333_bp
atggtgaaccctgaaagcatgctggaaatgaaagaagccagattcctcgtggacatcttt
gtcgtcttcttaggtctcctgtgtgctgcatgtcgtacactgaagacaccaattgaccca
actccatcaggtcaaagcttctccacatggaaaaaacctgttagcctataccaaaaaaca
cagctacatatttggagcagcatcaacaagaccatagcagcagtggacaaacaccatggt
aacaatcaggtgtccaagacccaatgggaaagaaaggaacaccggaacatcagtcacttt
caaggaagcaggctcctggacaagtatgactga
>gi568815588f:122293201_122529935|GENSCAN_predicted_peptide_6|178_aa
MQIPRAALLPLLLLLLAAPASAQLSRAGRSAPLAAGCPDRCEPARCPPQPEHCEGGRARD
ACGCCEVCGAPEGAACGLQEGPCGEGLQCVVPFGVPASATVRRRAQAGLCVCASSEPVCG
SDANTYANLCQLRAASRRSERLHRPPVIVLQRGACGQALYWLLKSPGDHRGDMFISKR
>gi568815588f:122293201_122529935|GENSCAN_predicted_CDS_6|537_bp
atgcagatcccgcgcgccgctcttctcccgctgctgctgctgctgctggcggcgcccgcc
tcggcgcagctgtcccgggccggccgctcggcgcctttggccgccgggtgcccagaccgc
tgcgagccggcgcgctgcccgccgcagccggagcactgcgagggcggccgggcccgggac
gcgtgcggctgctgcgaggtgtgcggcgcgcccgagggcgccgcgtgcggcctgcaggag
ggcccgtgcggcgaggggctgcagtgcgtggtgcccttcggggtgccagcctcggccacg
gtgcggcggcgcgcgcaggccggcctctgtgtgtgcgccagcagcgagccggtgtgcggc
agcgacgccaacacctacgccaacctgtgccagctgcgcgccgccagccgccgctccgag
aggctgcaccggccgccggtcatcgtcctgcagcgcggagcctgcggccaagctttgtac
tggctgctgaagtccccgggagaccacaggggtgacatgttcatctccaagagatga
>gi568815588f:122293201_122529935|GENSCAN_predicted_peptide_7|118_aa
MQGLQAQDNWDRVPANGQSLGGRKRGPGILIFTLNAGSPPLNFTCQDWNVGSTPLDEPQA
PEGKLDACSRCCSPETRVLEGQADKHPSPSQCLEDDDDDDDDDDDDDDNDNTLILPEF
>gi568815588f:122293201_122529935|GENSCAN_predicted_CDS_7|357_bp
atgcaagggctgcaggcccaggacaactgggaccgagtgcctgcaaacggccagtcgctg
ggtggcaggaagagaggacctggaatattgatctttacactaaatgcagggtccccgcca
ctgaactttacctgtcaggactggaatgtgggctccacccctctggacgagccacaggca
cctgagggcaagttggatgcctgcagcaggtgctgctcgcctgagacacgagtgttagaa
ggacaagccgacaaacacccaagcccctcccagtgccttgaagatgatgatgatgatgat
gatgatgatgatgatgatgatgacaatgacaacacgctaatattgcctgagttctag
>gi568815588f:122293201_122529935|GENSCAN_predicted_peptide_8|405_aa
MLAALNGPFVVLKISNEKYLLGQEDPNSLRHKYNFIADVVEKIAPAVVHIELFRKLPFSK
REVPVASGSGFIVSEDGLIVTNAHVVTNKHRVKVELKNGATYEAKIKDVDEKADIALIKI
DHQWHAGYTSFWKQLCVCCLGGITKRMKEIRKTEKAEQNQQKLCLRNENHLHGSLSYETV
AQPPAKPFKSNPVKLGKLPVLLLGRSSELRPGEFVVAIGSPFSLQNTVTTGIVSTTQRGG
KELGLRNSDMDYIQTDAIINDGEVIGINTLKVTAGISFAIPSDKIKKFLTESHDRQAKGK
AITKKKYIGIRMMSLTSSKAKELKDRHRDFPDVISGAYIIEVIPDTPAEAGGLKENDVII
SINGQSVVSANDVSDVIKRESTLNMVVRRGNEDIMITVIPEEIDP
>gi568815588f:122293201_122529935|GENSCAN_predicted_CDS_8|1218_bp
atgttggctgctcttaatgggccatttgttgtgttaaaaatcagtaatgagaaatattta
ctagggcaggaagatcccaacagtttgcgccataaatataactttatcgcggacgtggtg
gagaagatcgcccctgccgtggttcatatcgaattgtttcgcaagcttccgttttctaaa
cgagaggtgccggtggctagtgggtctgggtttattgtgtcggaagatggactgatcgtg
acaaatgcccacgtggtgaccaacaagcaccgggtcaaagttgagctgaagaacggtgcc
acttacgaagccaaaatcaaggatgtggatgagaaagcagacatcgcactcatcaaaatt
gaccaccagtggcacgctggttacacctccttctggaaacaactctgcgtgtgctgtttg
ggtgggataacaaagaggatgaaagagatcaggaaaaccgagaaggcagaacagaaccag
cagaaactgtgcttgaggaatgaaaatcacctacacggctccttgtcatatgagactgtg
gcccagcctcctgcaaagccatttaagagtaacccagtgaagctgggcaagctgcctgtc
ctgctgcttggccgctcctcagagctgcggccgggagagttcgtggtcgccatcggaagc
ccgttttcccttcaaaacacagtcaccaccgggatcgtgagcaccacccagcgaggcggc
aaagagctggggctccgcaactcagacatggactacatccagaccgacgccatcatcaac
gacggtgaagtgattggaattaacactttgaaagtgacagctggaatctcctttgcaatc
ccatctgataagattaaaaagttcctcacggagtcccatgaccgacaggccaaaggaaaa
gccatcaccaagaagaagtatattggtatccgaatgatgtcactcacgtccagcaaagcc
aaagagctgaaggaccggcaccgggacttcccagacgtgatctcaggagcgtatataatt
gaagtaattcctgataccccagcagaagctggtggtctcaaggaaaacgacgtcataatc
agcatcaatggacagtccgtggtctccgccaatgatgtcagcgacgtcattaaaagggaa
agcaccctgaacatggtggtccgcaggggtaatgaagatatcatgatcacagtgattccc
gaagaaattgacccatag
>gi568815588f:122293201_122529935|GENSCAN_predicted_peptide_9|163_aa
XSPPPHTPTPNMPSTVLGALWLCKKMDSMKECVLRGFFPPVADVTVQGGWAQPYGLNDGL
GPVELQPQSESRQGNRQLGQHMASVRAPGRGVRLKQAEETTILEVFKCEVGVPGQAAPMG
HRVSWVIQPESEETCQCCNGPQAAQPPSTNTDEGCAQLDRPCI
>gi568815588f:122293201_122529935|GENSCAN_predicted_CDS_9|492_bp
ncctcaccccctcctcacacccccactcccaatatgcccagcacagtcctgggtgcactg
tggttatgcaaaaagatggattccatgaaagaatgcgtgctaagaggattctttcctcca
gtggcagatgtgactgtccagggaggctgggctcagccctatggcctcaatgatggactt
ggccctgtggagctgcagccccagagtgagtcccgacaagggaacaggcagttaggacag
cacatggccagcgtgcgagctcctggcaggggagtccgcctaaagcaagcagaagaaaca
actattctggaggtcttcaaatgtgaggtgggggttccaggacaagcagcccccatggga
cacagggtctcatgggtcattcaaccagagagcgaagagacgtgccagtgttgcaacggg
ccacaagctgctcagccgccgagcaccaacactgatgagggctgtgctcaactggaccgc
ccctgcatttaa