GENSCAN 1.0 Date run: 6-Nov-116 Time: 10:46:20 Sequence gi568815586f:19339700_19618717 : 279018 bp : 41.34% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3624 3735 112 2 1 32 67 169 0.770 8.23 1.02 Intr + 6143 6189 47 0 2 78 107 3 0.922 -1.59 1.03 Intr + 7295 7483 189 1 0 59 75 94 0.906 4.06 1.04 Intr + 8700 8820 121 2 1 56 82 170 0.863 12.35 1.05 Intr + 14185 14303 119 2 2 97 20 62 0.086 -0.44 1.06 Intr + 18529 18738 210 0 0 35 76 103 0.125 1.99 1.07 Intr + 19713 19847 135 2 0 14 59 171 0.533 6.64 1.08 Intr + 21883 22007 125 0 2 52 90 61 0.935 1.16 1.09 Intr + 26265 26410 146 0 2 30 96 184 0.997 12.41 1.10 Term + 30871 30956 86 2 2 99 47 46 0.728 -1.56 1.11 PlyA + 34031 34036 6 1.05 2.05 PlyA - 34237 34232 6 1.05 2.04 Term - 45676 45565 112 0 1 96 55 14 0.362 -4.05 2.03 Intr - 49022 48847 176 2 2 -59 94 358 0.681 19.52 2.02 Intr - 49442 49373 70 1 1 43 110 44 0.157 0.37 2.01 Init - 57596 57589 8 2 2 89 91 0 0.096 0.95 2.00 Prom - 70807 70768 40 -5.05 3.02 PlyA - 72569 72564 6 1.05 3.01 Sngl - 79462 79304 159 1 0 94 42 157 0.646 5.96 3.00 Prom - 89763 89724 40 -3.65 4.00 Prom + 98936 98975 40 -3.25 4.01 Init + 100001 100671 671 1 2 122 82 1261 0.977 123.22 4.02 Term + 100796 101006 211 2 1 100 41 185 0.699 10.78 4.03 PlyA + 110322 110327 6 1.05 5.03 PlyA - 110700 110695 6 1.05 5.02 Term - 117182 116546 637 1 1 89 42 733 0.834 61.53 5.01 Init - 117949 117786 164 1 2 86 32 153 0.945 8.75 5.00 Prom - 118919 118880 40 -4.05 6.00 Prom + 119107 119146 40 -5.65 6.01 Init + 119868 119902 35 2 2 23 96 29 0.154 -3.41 6.02 Intr + 122811 123018 208 0 1 94 -5 161 0.047 5.56 6.03 Intr + 133549 133656 108 0 0 119 116 -9 0.077 4.56 6.04 Intr + 154101 154287 187 2 1 110 102 151 0.996 17.04 6.05 Intr + 160398 160522 125 1 2 64 93 29 0.834 0.38 6.06 Intr + 172699 172766 68 0 2 94 103 47 0.838 3.58 6.07 Intr + 174972 175085 114 0 0 69 91 67 0.757 3.74 6.08 Intr + 178388 178414 27 2 0 100 83 31 0.073 0.11 6.09 Intr + 204251 204378 128 2 2 84 78 74 0.590 5.40 6.10 Intr + 209262 209324 63 1 0 118 95 58 0.626 7.27 6.11 Intr + 213604 213693 90 2 0 65 115 27 0.446 2.15 6.12 Term + 226735 227810 1076 2 2 61 40 347 0.060 18.48 6.13 PlyA + 228115 228120 6 1.05 7.00 Prom + 228683 228722 40 -10.15 7.01 Init + 228743 229146 404 1 2 60 39 198 0.841 8.25 7.02 Intr + 233544 233664 121 0 1 82 72 102 0.850 7.58 7.03 Intr + 234103 234213 111 0 0 74 45 68 0.536 0.76 7.04 Term + 234844 234882 39 0 0 102 48 44 0.672 -1.99 7.05 PlyA + 235373 235378 6 1.05 8.06 PlyA - 236888 236883 6 1.05 8.05 Term - 252343 252123 221 2 2 73 44 130 0.333 3.32 8.04 Intr - 254378 254157 222 0 0 82 -12 183 0.398 4.88 8.03 Intr - 265452 265315 138 1 0 33 33 130 0.364 1.51 8.02 Intr - 266355 266233 123 1 0 35 90 86 0.690 3.14 8.01 Init - 271055 270749 307 2 1 73 71 154 0.554 9.70 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 91062 90861 202 0 1 60 59 141 0.878 7.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:19339700_19618717|GENSCAN_predicted_peptide_1|429_aa TESAGIQRAQIQKELWRIQDVMEGLSKHKQQRGTTEIGMIGSKPFSTVKYKNEEEEVVPP RPPLPRSYDFTEQPPIIPPLPSDSSSLLCYSRGPVHLPEEKKMYQVQGYPRNGSHCGPDY RLYKSEPELTTVAEVDESNGEEKSEPVSEIETSVVKGSHFPVGVVPPRAKSPTPESSTIA SYVTLRKTKKMMDLRTERPRSAVEQLCLAESTRPRMTVEEQMERIRRHQQACLREKKKGL NVIGASDQSPLQSPSNLRDNPFRTTQTRRRDDKELDTAIRENDVKPDHETPATEIVQLKE TEPQNVDFSKELKKTENISYEMLFEPEPNGVNSVEMMDKERNKDKMPEDVTFSPQDETQT ANHKPEEHPEENTKNSVDEQEETVISYESTPEVSRGNQTMAASTISSKFGALGLPPFSAE QQQKHIETL >gi568815586f:19339700_19618717|GENSCAN_predicted_CDS_1|1290_bp acggaatcagcaggaattcagcgtgcacagattcagaaagaactttggcgaattcaggat gtcatggaagggctgagtaaacataagcagcaaagaggtactacagaaataggtatgata ggatcaaagcctttctcaacagttaagtacaaaaatgaggaagaggaagtagtcccacct cgtcctccacttcctcggtcctatgactttacagagcagcctcccataatcccccctctg cccagtgatagcagctccttgctctgttatagcaggggcccagttcatctgcctgaagaa aagaagatgtatcaagttcaaggatatccaagaaatggatctcactgtggtccagattat agactctacaagagtgaaccagagttaacaacagtggcagaagttgatgaatctaatgga gaagaaaaatcagaacctgtttcagagatagaaacttcagttgttaaaggttcccacttt cctgttggagtagtccctccaagagcaaaatcaccaacacccgaatcttcgacaatagct tcctatgtaaccttgaggaaaactaagaagatgatggatctaagaacggaaagaccaaga agtgcagtggaacagctctgtttggctgaaagtactcgaccaaggatgactgtggaagag caaatggaaagaataagaagacatcaacaagcgtgcctgagggagaagaaaaaagggtta aatgttatcggtgcttcagaccagtcacccttacaaagcccttcaaatttaagggataat ccatttaggactactcagactcgaaggagggatgataaggaactggacactgccattaga gaaaatgatgtaaagccagaccatgaaactcctgcaacagaaattgttcaactaaaagaa accgaaccccaaaatgtggacttcagcaaagagttaaaaaaaactgaaaacatttcatat gaaatgctttttgaacctgagccaaatggagtaaattctgtggaaatgatggataaagaa agaaacaaagacaaaatgcctgaggatgttacattcagccctcaagatgaaacacagacc gcaaatcataaaccagaagagcatcctgaagaaaatacaaagaacagtgttgacgaacag gaagaaactgttatttcttacgaatcaactcctgaggtttctagaggaaatcaaacaatg gcagcatctaccatatctagtaagtttggtgctttaggattaccaccattttctgctgaa cagcagcagaaacatatagagacattataa >gi568815586f:19339700_19618717|GENSCAN_predicted_peptide_2|121_aa MPRNLHTFNHLEAHQNRVFYGGFMAQKKEEEEEKKEEEEEEEEEEEEEEEEEEEKKKKKE EKRERSKKNKQQQQQNTKRNKNLTRSHIFQQCPVTGGQAVWTIWVVPKLLGSQDSHMLNN Y >gi568815586f:19339700_19618717|GENSCAN_predicted_CDS_2|366_bp atgcccaggaacctccatacattcaatcatctggaagctcatcagaaccgtgtcttttat ggaggcttcatggcacagaagaaggaggaggaggaggagaagaaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaaaagaagaaaaaaaaagaa gagaaaagagaaagaagtaaaaaaaataaacaacaacaacaacaaaacaccaaaagaaac aagaacctaaccagatctcacatttttcagcagtgccctgtcacaggaggccaagctgta tggactatatgggtggttcccaaacttttggggtctcaggattctcacatgcttaacaac tattga >gi568815586f:19339700_19618717|GENSCAN_predicted_peptide_3|52_aa MAGYGTLLQDLTNNITLKDLKQHKSTCKKDIPSEKSEEITTGDVWFSFLKSH >gi568815586f:19339700_19618717|GENSCAN_predicted_CDS_3|159_bp atggctgggtatgggaccctcctccaggatctgaccaacaatatcacccttaaagatctg aaacagcacaagtcaacctgtaagaaggacatccccagtgaaaagagtgaggagatcact actggcgatgtctggtttagcttcctgaagagccactag >gi568815586f:19339700_19618717|GENSCAN_predicted_peptide_4|293_aa MAAAITDMADLEELSRLSPLPPGSPGSAARGRAEPPEEEEEEEEEEEEAEAEAVAALLLN GGSGGGGGGGGGGVGGGEAETMSEPSPESASQAGEDEDEEEDDEEEEDESSSSGGGEEES SAESLVGSSGGSSSDETRSLSPGAASSSSGDGDGKEGLEEPKGPRGSQGGGGGGSSSSSV VSSGGDEGYGTGGGGSSATSGGRRGSLEMSSDGEPLSRMDSEDRLSTRKSLAVAAAPLSQ RIAASNSQTVPPQLSFPRPLSPRRFAPHGPQDGSNSETVPKRAQILWRSRRGP >gi568815586f:19339700_19618717|GENSCAN_predicted_CDS_4|882_bp atggccgccgctatcaccgacatggccgacctggaggagctctcccgcctgagccctctg ccccccggcagcccgggttcggcggcgcggggccgggctgagccccccgaggaggaggag gaagaggaggaggaggaagaggaggcggaggccgaggcggtggcggcgctgctgctgaac ggcggcagcggtgggggcggcggaggcggcggcggaggagtggggggcggcgaggcagag acgatgtcggagccgagccccgagagcgccagccaggccggggaggacgaagacgaggag gaggacgacgaggaggaggaagatgagagcagcagcagcggcgggggtgaggaggagagt agcgccgagagcctggtgggcagcagcggcgggagcagcagcgacgagacccgctcgttg agccccggcgccgccagcagcagcagcggggatggggacggcaaggagggcctggaggag cccaagggaccgcggggcagccagggcggcggcgggggcggcagcagtagcagcagcgta gtctccagcggcggcgacgagggctacgggactgggggaggcggaagcagcgcgacctcc gggggccggcggggcagcttggagatgtcgtcggatggggaacccctgagccgcatggac tcggaggacagactctcaactcggaaatctctcgccgtcgccgccgcgcccctctcgcag cgcatcgcggcttccaactctcaaaccgttcccccccaactctcctttccccgccctctt tcccctcggcgcttcgctcctcacggacctcaggacggctctaactcggaaacagtcccc aaacgggcccagatcctctggcggagcagaagagggccttga >gi568815586f:19339700_19618717|GENSCAN_predicted_peptide_5|266_aa MGKEKTHIHIVVIGHVDSYKSTTTGHLNYKCGSINKRTIEKIEKEAAEMGKGSFKMSTKI GGIGTVPVGRVETGVLKPGMVVTFAPVNIKTEVKSVEMHHEALSETLSGDNVGFNVKNVS VKDVRHGNDAGDSKDDPPMEAAGFTAQVIILNLPGQISAGYAPVLDCHTAHTACKFAELK EKMDRHVGKKLEDGPNFLNSGDAANVDMVPGKPVCVECFSDYPPLGRAAVHDMRQTVAVD VIKAVNKKAAGAGKVTKSAQKAQKAK >gi568815586f:19339700_19618717|GENSCAN_predicted_CDS_5|801_bp atgggaaaggaaaagactcatatccacattgtcgtcattggacacgtagattcgtacaag tccaccactactggccatctgaactacaaatgtggtagcatcaacaaaagaaccattgaa aaaattgagaaggaggctgctgagatgggaaagggctccttcaagatgtctacaaaaatt ggtggtattggtactgttcctgttggccgagtggagactggtgttctcaaacctggtatg gtggtcacctttgctccagtcaacattaaaactgaagtaaaatctgtcgaaatgcaccat gaagctttgagtgaaactctttctggggacaatgtgggcttcaatgtcaagaatgtgtct gtcaaggatgttcgtcatggcaatgatgctggtgacagcaaagatgacccaccaatggaa gcggctggcttcactgctcaggtgattatcctgaaccttccaggccaaataagtgctggc tatgcccctgtactggattgccacacggctcacactgcatgcaagtttgctgagctgaag gaaaagatggatcgccatgttggtaaaaagctggaagatggccctaacttcttgaactct ggtgatgctgccaacgttgatatggttcctggcaagcccgtgtgtgttgagtgcttctca gactacccacctctgggtcgcgctgctgttcatgatatgagacagacagttgcagtggat gtcatcaaagcagtgaacaagaaggctgctggagctggcaaggtcaccaaatctgcccag aaagctcaaaaggctaaatga >gi568815586f:19339700_19618717|GENSCAN_predicted_peptide_6|742_aa MYFGIEKPPYPSISSTIMDVDSTISSGRSTPAMMNGQGSTTSSSKNIAYNCCWDQCQACF NSSPDLADHIRSIHVDGQRGGVFVCLWKGCKVYNTPSTSQSWLQRHMLTHSGDKPFKCVV GGCNASFASQGGLARHVPTHFSQQNSSKVSSQPKAKEESPSKAGMNKRRKLKNKRRRSLP RPHDFFDAQTLDAIRHRAICFNLSAHIESLGKGHSVVFHSTVIAKRKEDSGKIKLLLHWM PEDILPDVWVNESERHQLKTKVVHLSKLPKDTALLLDPNIYRTMPQKRLKSFQHALLPLS TTFPSGNLPLIIHPSDLDLSVTSKRPFLVPFCLESPVTIVLIFMAMYAKDVAPASFFGFL TRATAEQKHFSLVWHLMNLINVCYGCLFTLMVVSFAVQKLFSLIRSHLSILASVAIAFGV LDMKSLPMPIVLSDLQRDLDSHTIIMGDFNTPLSTLDRSARQKVNKDTQELNSALHQADL IDIYRTLHPKSTEYIFFSAPHHTYSKIDHIVGSKALLSKCKRTEIITNCLSDHSAIKLEL RIKKLTQNRSTTWKLNNLLLSDYWVHNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRG KFIALNAHNRKQERSKIDTLTSQLKELEKQEQTHSKASRRQEITTIRAELKEIETQKTLQ KINESRSWFFEKINKIDRPLARLIKKKREKNQIDAIKNDQGDSTTDPTEIQTTIREYYKH LYANKLENLEEMDKFLDTSSQD >gi568815586f:19339700_19618717|GENSCAN_predicted_CDS_6|2229_bp atgtatttcggtatagaaaaaccaccatacccaagcataagcagtactataatggatgta gacagcacaatttccagtgggcgttcaactccagcaatgatgaatggacaaggaagcact acttcttcaagcaaaaatattgcctataattgttgttgggaccagtgccaggcttgcttc aactctagcccagatctggcagatcacatccgttccatacatgtagatggtcagcgagga ggggtatttgtttgcttatggaaaggttgtaaagtatataacactccatctaccagtcaa agttggttacaaaggcatatgctgacacacagtggagacaaacctttcaagtgtgttgtt ggtggctgcaatgccagctttgcttctcagggagggctagctcgtcatgtacccacacac ttcagtcagcagaactcctcaaaagtttctagccagccaaaggccaaagaagaatctcct tctaaagctggaatgaacaaaaggaggaaattaaagaacaaaagacgacgctcattacca cggccacatgatttcttcgatgcacaaacactggatgcgataagacatcgagccatatgc tttaacctctcagctcatatagaaagtttagggaagggacacagtgttgtttttcatagt actgtaatagctaagagaaaagaagattctgggaagatcaaacttttgcttcattggatg cctgaagacattctgcctgatgtgtgggtgaatgaaagtgaacgacatcagttaaaaact aaagtagttcatttatcaaagctacccaaagatactgccttgcttttggacccaaacata tacagaacaatgccgcagaagaggttgaagagctttcagcatgctcttctgccgctctcc acaacctttccttctggaaacctgcccttgatcattcatccttctgatcttgacttgagc gtcacttcaaaacggcctttcttggtgcccttttgtctagagtccccagtgactatagtt ctcatctttatggccatgtatgccaaagatgtagctcccgcctccttctttggctttctg acaagagcaactgctgagcagaagcacttcagcctcgtctggcatttgatgaatttgatt aatgtgtgctatggttgcctgttcactctgatggtagtttcttttgctgtgcagaagctc tttagtttaattagatcccatttgtcaattttggcttctgttgccattgcttttggtgtt ttagacatgaagtccttgcccatgcctatagtccttagtgacctacaaagagatttagac tcccacacaataataatgggagactttaacaccccactgtcaacattagacagatcagcg agacagaaagttaacaaggatacccaggaactgaactcagctctgcaccaagcagaccta atagacatctacagaactctccaccccaaatcaacagaatatatattcttttcagcacca caccacacctactccaaaattgaccacatagttggaagtaaagcactcctcagcaaatgt aaaagaacagaaattataacaaactgtctctcagaccacagtgcaatcaaactagaactc aggattaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctgctcctg agtgattactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaat gagaacaaagacacaacataccagaatctctgggacacattcaaagcagtgtgtagaggg aaatttatagcactaaatgcccacaacagaaagcaggaacgatctaaaattgacacccta acatcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaagg caagaaataactaccatcagagcagaactgaaggaaatagagacacaaaaaacccttcaa aaaattaatgaatccaggagctggttttttgaaaagatcaataaaattgatagaccacta gcaagactaataaagaagaaaagagagaagaatcaaatagacgcaataaaaaatgatcaa ggggatagcaccaccgatcccacagaaatacaaactaccatcagagaatactataaacac ctctatgcaaataaactagaaaatctagaagaaatggataagttccttgacacatcctcc caagactaa >gi568815586f:19339700_19618717|GENSCAN_predicted_peptide_7|224_aa MSELPFAIASKGIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFLWNQKRVHITKSILSQKNKAGGITLP DFKLYYKTTGTKTACLEEAGEGILAVAVEEVLSLVSWGSIPEKCRAAPNQRDWPRSKGSK GRTDAVAATEGLSVAFGTAPPLQGNSEPPAKWDFESQKKGTERI >gi568815586f:19339700_19618717|GENSCAN_predicted_CDS_7|675_bp atgagtgaactcccattcgcaattgcttcaaagggaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaag gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcctatggaaccaaaaaaga gtccacatcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaagactacaggaaccaaaacagcatgcttggaggaagcaggg gaagggatcttggcggtggctgtggaagaggtcctttcacttgtctcttggggatccatc ccagagaaatgccgagccgctcccaatcagcgggattggccgaggtccaagggcagcaag ggcagaaccgatgcagtggctgcgacagaggggctgtcggttgcctttgggaccgccccc cctctccagggaaactcagagcccccagcaaaatgggattttgagagccaaaagaaagga actgaaagaatctag >gi568815586f:19339700_19618717|GENSCAN_predicted_peptide_8|336_aa MKRHFSKDDLHVVNKQMKKWSTSLIVRSMQIKITMSYHLIPVRMAITKKAKNIRCQQGGR EKGMLIHSSQECELVQPLRNSVWRVLKELKTALPFDPAIPLLGWDVASAAAAQRSQVIPP KSAWTELAMRQTLTNRREMRRKEGFCFWGSLRELIIMAEGKGKVDTPSHDRQERESRGGV ATHFQTTRSSICCHYYSQPAGLTLPYRSPCSERRFSVDFFLSLASVMYLQVSQRSSFLIC SPLQSICSSRSALPADSPVKPHWPFSVPLSEAKKQKQKEPSRCLHATLWLAIGVLPSVLS PDAYWQRDLSTSLSVSPDFIRKRGLESPWLHEALRT >gi568815586f:19339700_19618717|GENSCAN_predicted_CDS_8|1011_bp atgaagagacacttttcaaaagatgacttacatgtggtcaacaagcagatgaaaaaatgg tcaacatcactgatcgtcagatcaatgcaaatcaaaatcacaatgagttaccatctcata ccagtcagaatggctattactaaaaaggcaaaaaatatcagatgccagcaaggtggcaga gaaaagggaatgcttatacacagctcacaggaatgtgaattagttcagccattgcggaat tcagtttggagagttctcaaagaacttaaaacagcactaccatttgatccagcaatccca ttgctggggtgggacgtggcctccgcagcagctgcccaaagaagccaggtgataccacct aaaagtgcatggactgaactcgccatgaggcaaacattgaccaataggagagaaatgaga cggaaggaaggcttctgtttctggggaagtctcagggaacttatcatcatggcagaaggc aaagggaaagtagatacaccttcacatgatcggcaggagagagagagcagaggaggagtt gctacacactttcaaacaaccagatcttccatctgctgccattattattcccagccagca ggcctaacacttccctatcgatctccttgctctgaacgcaggttttctgtggacttcttc ctgtctctggcatctgtcatgtacctacaggtcagccagcgttcctctttcctcatttgc tctcctctgcagtccatctgcagttccaggtctgcattacctgcagattctcccgtgaag ccgcactggcccttttctgtgcctctctcagaggccaaaaaacagaaacaaaaagaacct tccagatgcctccatgccactctgtggctagcaattggagtgctgccttccgtgctctcc ccagatgcatattggcaaagggacctttccacaagtctctctgtctcccctgacttcatc aggaagagaggcttggaatctccctggttgcatgaagcattaaggacatga