GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:31:52 Sequence gi568815596r:38645136_38850196 : 205061 bp : 44.41% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4591 4598 8 0 2 103 91 0 0.079 2.30 1.02 Intr + 10233 10298 66 0 0 56 67 85 0.037 1.22 1.03 Intr + 21044 21216 173 2 2 33 98 158 0.012 10.89 1.04 Intr + 30777 30931 155 1 2 119 106 137 0.995 18.39 1.05 Intr + 36145 36351 207 0 0 50 111 241 0.632 21.87 1.06 Intr + 44678 44759 82 1 1 89 77 50 0.369 3.21 1.07 Term + 57529 57575 47 2 2 112 50 9 0.063 -3.13 1.08 PlyA + 58885 58890 6 1.05 2.00 Prom + 62503 62542 40 -7.96 2.01 Init + 66529 66754 226 0 1 57 97 488 0.997 45.23 2.02 Intr + 84421 84562 142 2 1 91 77 97 0.976 8.31 2.03 Intr + 86600 86774 175 2 1 113 95 119 0.999 15.04 2.04 Term + 88353 88430 78 2 0 100 49 69 0.879 1.96 2.05 PlyA + 88965 88970 6 1.05 3.08 PlyA - 89844 89839 6 1.05 3.07 Term - 96823 96724 100 0 1 100 40 34 0.040 -2.60 3.06 Intr - 105059 104879 181 0 1 51 106 180 0.408 15.03 3.05 Intr - 106294 106094 201 2 0 76 94 150 0.375 13.66 3.04 Intr - 120838 120757 82 1 1 38 98 85 0.023 3.71 3.03 Intr - 133219 133120 100 0 1 59 13 54 0.000 -4.89 3.02 Intr - 136461 136369 93 2 0 81 106 87 0.010 8.88 3.01 Init - 136739 136666 74 2 2 46 64 75 0.012 1.44 3.00 Prom - 140853 140814 40 -2.26 4.18 PlyA - 141021 141016 6 1.05 4.17 Term - 153307 153164 144 1 0 56 42 126 0.963 2.71 4.16 Intr - 157780 157580 201 1 0 94 103 184 0.997 19.98 4.15 Intr - 161558 161424 135 2 0 40 71 87 0.053 2.96 4.14 Intr - 165655 165420 236 2 2 40 99 292 0.041 22.91 4.13 Intr - 166255 165927 329 0 2 106 20 392 0.944 29.54 4.12 Intr - 166591 166320 272 1 2 118 -31 396 0.686 27.04 4.11 Intr - 168760 168686 75 1 0 89 92 23 0.851 2.51 4.10 Intr - 170520 170386 135 0 0 100 88 75 0.998 9.46 4.09 Intr - 174009 173914 96 0 0 121 115 70 0.999 13.21 4.08 Intr - 174931 174833 99 1 0 20 88 66 0.458 0.11 4.07 Intr - 178134 177858 277 2 1 120 78 91 0.991 8.82 4.06 Intr - 180912 180712 201 2 0 132 56 16 0.694 1.20 4.05 Intr - 181554 181381 174 2 0 106 51 115 0.916 8.65 4.04 Intr - 183277 183205 73 2 1 64 111 17 0.033 0.06 4.03 Intr - 192812 192696 117 0 0 115 93 96 0.462 13.24 4.02 Intr - 198075 197870 206 2 2 106 87 134 0.976 13.94 4.01 Intr - 201938 201884 55 0 1 127 116 21 0.983 6.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 21027 21216 190 2 1 39 98 179 0.957 13.22 S.002 Term + 71070 71168 99 2 0 109 41 55 0.811 1.03 S.003 Init + 133856 133983 128 1 2 84 89 74 0.877 6.73 S.004 Term + 136382 136757 376 2 1 74 54 208 0.971 10.21 S.005 Term - 165655 165375 281 2 2 40 47 375 0.923 24.21 S.006 Init - 183260 183205 56 2 2 57 111 11 0.928 1.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:38645136_38850196|GENSCAN_predicted_peptide_1|245_aa MPRNCLCYDPIQNKVNLVNECLPHLAVFGELPSGGGTVEKFQLQSDLLRVDIISWGCTIT ALEVKDRQGRASDVVLGFAELEGYLQKQPYFGAVIGRVANRIAKGTFKVDGKEYHLAINK EPNSLHGGVRGFDKVLWTPRVLSNGVQFSRISPDGEEGYPGELKVWVTYTLDGGELIVNY RAQASQATPVNLTNHSYFNLAGQASPNINDHEVTIEADTYLPVDETLIPTELAQIIHQDL VTDPK >gi568815596r:38645136_38850196|GENSCAN_predicted_CDS_1|738_bp atgcccaggaactgcctctgctacgaccccattcagaacaaagttaatctggtcaatgag tgtctcccccacctggccgtgtttggagagctgccctcgggaggagggacagtggagaag ttccagctgcagtcagacctcttgagagtggacatcatctcctggggctgcacgatcaca gccctagaggtcaaagacaggcaggggagagcctcggacgtggtgcttggcttcgccgag ttggaaggatacctccaaaagcagccatactttggagcagttattgggagggtggccaac cgaatcgccaaaggaaccttcaaggtggatgggaaggagtatcacctggccattaacaag gaacccaacagtctgcatggaggagtcagagggtttgataaagtgctctggacccctcgg gtgctgtcaaatggcgtccagttctcgcgcatcagtccagatggtgaagaaggctacccc ggagagttaaaagtctgggtgacatacaccctggatggcggagagctcatagtcaactac agagcacaagccagtcaggccacaccagtcaacctgaccaaccattcttacttcaacctg gcaggccaggcttccccaaatataaatgaccatgaagtcaccatagaagcggatacttat ttgcctgtggatgaaaccctgattcctacagagcttgctcagataatccatcaggattta gtcacagatccaaagtag >gi568815596r:38645136_38850196|GENSCAN_predicted_peptide_2|206_aa MISTVCYLSPSPSSSSITITINTIIINITITITTIITITVTTITITITIIIITITTIITI ITTTIIIIINIITILREVAPVQGTAFDLRKPVELGKHLQDFHLNGFDHNFCLKGSKEKHF CARVHHAASGRVLEVYTTQPGVQFYTGNFLDGTLKGKNGAVYPKHSGFCLETQNWPDAVN QPRFPPVLLRPGEEYDHTTWFKFSVA >gi568815596r:38645136_38850196|GENSCAN_predicted_CDS_2|621_bp atgattagcactgtgtgttatctgtcaccatcaccatcatcatcatccatcaccatcact atcaacaccatcatcattaacatcaccatcactatcaccaccattatcaccatcactgtc accaccatcaccatcaccatcaccatcatcatcatcactatcaccaccatcatcaccatc atcaccaccaccatcatcatcatcatcaacatcatcactatcttaagagaagttgcccca gtgcaaggcactgcattcgacctgagaaagccagtggagcttggaaaacacctgcaggac ttccatctcaatggttttgaccacaatttctgtctgaagggatctaaagaaaagcatttt tgtgcaagggtgcatcatgctgcaagcgggcgggtactagaagtatacaccacccagccc ggggtccagttttacacgggcaacttcctggatggcacattaaagggcaagaatggagct gtctatcccaagcactccggtttctgcctggagactcagaactggcctgatgcagtcaat cagccccgcttccctcctgtgctgctgaggcctggtgaggagtatgaccacaccacctgg ttcaagttttctgtggcttaa >gi568815596r:38645136_38850196|GENSCAN_predicted_peptide_3|276_aa MSFNKILNTRQNNLIRAAAIFWTICFNSLHSMSHNSGHRHAAIFKEVHKDNTLEENHTIP NTVLKTKWYLTFNPDKREKYQLKNSSQLWPQVNTEIVQNVTGPGQRTIMGCFGYKQEQRD DACAKRSGRIYKREPGLFLVVPPGLLAGEGVCQLLRHSSPGRCLLKSRARGSVIMSRYGR YGGETKVYVGNLGTGAGKGELERAFSYYGPLRTVWIARNPPGFAFVEFEDPRDAEDAVRG LDGKQYLPAPGLDMCLQLYNFQNNFISGDLGYQDYL >gi568815596r:38645136_38850196|GENSCAN_predicted_CDS_3|831_bp atgtccttcaataagatcctgaacacgcgacagaataatctcattagagctgctgcaatt ttctggaccatatgtttcaacagtctgcacagcatgtcccataattccggtcacagacat gctgccatcttcaaggaagttcacaaggacaatactttggaggaaaaccacacgatacct aacactgtactaaaaaccaaatggtacttaacgtttaaccctgacaaacgagaaaaatac cagttaaaaaactcctcgcaactctggccacaagtgaacacagaaatagtccagaatgta acaggtccagggcaaagaaccatcatgggctgttttggttacaagcaagaacaacgagat gacgcatgcgcaaagcgcagcggccgcatatataaacgcgaacccgggctcttcctcgta gtgccgccgggactcttggcgggtgaaggtgtgtgtcagcttttgcgtcactcgagccct gggcgctgcttgctaaagagccgagcacgcgggtctgtcatcatgtcgcgttacgggcgg tacggaggagaaaccaaggtgtatgttggtaacctgggaactggcgctggcaaaggagag ttagaaagggctttcagttattatggtcctttaagaactgtatggattgcgagaaatcct ccaggatttgcctttgtggaattcgaagatcctagagatgcagaagatgcagtacgagga ctggatggaaaacaatatttgccagcccctggtttagatatgtgtcttcaactttacaac tttcaaaacaattttatctctggagatcttgggtaccaagattatttgtag >gi568815596r:38645136_38850196|GENSCAN_predicted_peptide_4|941_aa XRTFPVDQFFLEDAIAVTRYVLQDGSPYMRSMKQISKEKLKARRNRTAFEEVEEDLRLSL HLQDQDSVKDAVPDQQLDFKQLLARYKGVSKSVIKTMSIMDFEKVNLELIEALLEWIVDG KHSYPPGLAEIKMLYEQLQSNSLFNNRRSNRCVIHPLHSSLSSEEQQAVFVKPPAGVTKI IISTNIAETSITIDDVVYVIDSGKMKEKRYDASKGMESLEDTFVSQANALQRKGRAGRVA SGVCFHLFTSHHYNHQLLKQQLPEIQRVPLEQLCLRIKILEMFSAHNLQSVFSRLIEPPH TDSLRASKIRLRDLGALTPDERLTPLGYHLASLPVDVRIGKLMLFGSIFRCLDPALTIAA SLAFKSPFGIGLDCHEKLLQLMLIPLEPEKVEKYYYWINESVSPWDKKEEANQKKLEFAF ANSDYLALLQAYKEMASLKRQFTELLSDIGFAREGLRAREIEKRAQGGDGVLDATGEEAN SNAENPKLISAMLCAALYPNVVQAPSGSLSHEDRFQAPGIAPPIPDAMFSKGSMVLAYSA GLDTSCILVWLKEQGYDIIAYLANVGQKEDFKEARKKALNLGSKKVFIEDVSKEPYITRK QVEIAQWEGAKYVSHSAMGKGNDQVWFELACYSLAPQIKVIAPGRIPEFYNQSKGRSDLM EYAEKHGIPIPVTLKHPWNMDENLMHISHEAGILENPKNQPPSDIEGFAMEQEVRKIKQG LGLKFAELVYTGFWHNPQCDFAHHCIAKSQDRVEGKVQVSIFKGQVYILCQEPPLSLYSG EQVKSPEGKFQKTSTGAVRMQPKSAELKFVTKNDGYVHIHPSSVNYQVRHFDSPYLLYHE KIKTSRVFIRDCSMVSVYPLVLFGGGQVNVQLQRGEFVVSLDDGWIRFVAASHQVAELVK ELRCELDQLLQDKIKNPSIDLCTCPRGSRIISTIVKLVTTQ >gi568815596r:38645136_38850196|GENSCAN_predicted_CDS_4|2826_bp ngtcgtacatttcctgttgatcaattttttttggaagatgcaattgctgtgacaaggtat gtattacaggatgggagcccatatatgcggtccatgaaacagatttcaaaggaaaagctt aaagcaaggcggaacagaactgcatttgaagaagtggaagaagacctaaggctctccctt cacctccaggatcaggattctgtcaaagatgcagtgccagatcaacagttagattttaag cagctcctggcccgctataaaggggttagcaagtcagtcatcaaaacaatgtccatcatg gattttgaaaaggtgaatcttgaattaatagaggccttattagagtggattgtggatgga aagcactcctaccctccaggactagcagaaatcaaaatgctttatgaacagctacagtct aattctcttttcaacaacagacgtagtaatcgatgtgttattcacccacttcattcatct ttatccagtgaagagcagcaggctgtgtttgtaaaacctcctgcaggagtaactaagatt ataatttccaccaacattgctgagacatccataaccatcgatgatgttgtctatgttatc gattctgggaaaatgaaagaaaagagatatgatgccagcaaagggatggaaagtctagag gacacctttgtatctcaagctaatgctctacaaaggaaaggccgagcaggccgtgttgca tctggggtctgcttccatttattcactagccatcactacaatcaccagcttttaaaacaa cagctaccagaaatacaaagagtgccattggaacagctgtgtctaagaattaaaatttta gagatgtttagtgctcataatctccagtctgtgttctctcggctcattgaacctccacac accgattctcttcgtgcctcaaaaatacgattacgagacttaggagcattaactccagat gaaagattgacccctcttgggtatcatttggcctctctgcccgtggatgtgagaattggc aaactaatgttgtttgggtctatcttccgctgtttggatcctgctctcaccattgctgcc agtttggcttttaagtctccgtttggcattggtttagattgccatgagaaactactgcag ctcatgctgatacctttagagcctgagaaggtagagaaatattattattggattaatgaa tcggtatctccctgggataaaaaagaagaagctaaccagaaaaagctggaatttgcattc gcaaacagtgattatctggcccttctacaagcgtataaggaaatggccagcctcaaacga caattcacggaactgttatcggatatagggtttgcaagggaagggctcagagcaagggaa attgagaaaagggcccaaggaggagatggtgtcttagatgccacaggagaagaggcaaac tcaaatgctgagaaccccaagctgatatcagcaatgctgtgtgctgctttgtatccaaat gtagtgcaggccccgagtggttcactgagccatgaagacagattccaggcaccaggaatt gcacctccaatcccagatgctatgttcagcaaaggctccatggttctggcctacagtgcc ggcctggacacctcctgcatcctcgtgtggctgaaggaacaaggctatgacattattgcc tacctggccaatgttggccagaaggaagacttcaaggaagccaggaagaaggcactgaac cttgggtccaaaaaggtgttcattgaggacgtcagcaaggagccttacatcacccgcaaa caagtggaaattgctcagtgggagggggcaaagtatgtgtcccacagtgccatgggaaag gggaacgatcaggtctggtttgagctcgcctgctactcgctggccccccagattaaggtc attgctcccgggaggatccctgagttctacaaccagtccaagggccgcagtgatctgatg gaatatgcagagaaacatgggattcccatcccagtcactctgaagcacccatggaacatg gacgagaacctcatgcacatcagccacgaggctggaatcttggagaaccccaagaaccaa ccaccttcagacattgagggctttgccatggaacaggaagtgcgcaaaatcaaacaaggc ctgggcttgaaatttgctgagttggtgtacaccggtttctggcacaaccctcagtgtgac tttgcccaccactgcattgccaagtcccaggaccgagtggaagggaaagtgcaggtatcc atcttcaagggccaggtgtacatcctctgccaggagcccccactgtctctctacagtgga gagcaggtgaaaagcccagaaggaaaatttcagaagaccagtactggagctgtcagaatg caaccaaaatcagctgagttgaagtttgtcaccaagaacgatggatatgtacacattcac ccttcatcagtgaactatcaggtgagacactttgacagcccctacctgttgtaccacgag aagataaaaactagtcgagtattcatccgagactgcagcatggtgtctgtgtacccgctg gtcttgtttggaggaggccaagtgaatgtgcagcttcaaagaggagagttcgttgtctcc ctggatgatggttggatccgttttgtagctgcttcccatcaggtggctgaactggtaaag gagcttcgttgcgaacttgatcagcttctccaggataaaattaaaaacccaagcattgat ctgtgtacgtgtcctcgaggatcccggatcatcagcacaattgtgaaacttgtcaccaca caataa