GENSCAN 1.0 Date run: 6-Nov-116 Time: 18:36:02 Sequence gi568815596f:28716015_28949938 : 233924 bp : 40.66% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2539 2691 153 1 0 71 27 119 0.022 2.37 1.02 Intr + 5734 5836 103 1 1 89 65 24 0.006 -0.54 1.03 Intr + 12734 12802 69 1 0 77 113 17 0.016 1.56 1.04 Intr + 36024 36426 403 2 1 32 2 367 0.003 15.58 1.05 Intr + 60837 60968 132 1 0 52 68 92 0.655 3.30 1.06 Intr + 62795 63025 231 0 0 109 101 129 0.997 13.12 1.07 Intr + 65724 65828 105 1 0 110 59 42 0.869 2.77 1.08 Intr + 67893 67964 72 1 0 77 111 106 0.999 10.26 1.09 Intr + 72644 72795 152 0 2 90 80 70 0.889 5.36 1.10 Intr + 77849 77983 135 1 0 104 80 149 0.999 15.54 1.11 Term + 83185 83289 105 0 0 34 49 183 0.821 6.43 1.12 PlyA + 85235 85240 6 1.05 2.02 PlyA - 86149 86144 6 1.05 2.01 Sngl - 95487 94813 675 0 0 56 45 308 0.999 19.23 2.00 Prom - 99939 99900 40 -4.35 3.00 Prom + 101559 101598 40 -2.45 3.01 Init + 104747 104757 11 1 2 37 77 10 0.086 -5.25 3.02 Intr + 113134 113305 172 1 1 53 94 104 0.378 6.52 3.03 Intr + 124158 124455 298 2 1 50 86 215 0.440 13.02 3.04 Term + 132296 132324 29 0 2 77 54 36 0.066 -3.64 3.05 PlyA + 132635 132640 6 1.05 4.07 PlyA - 133831 133826 6 1.05 4.06 Term - 135274 135011 264 1 0 14 44 205 0.234 3.22 4.05 Intr - 145294 145104 191 2 2 18 92 101 0.870 1.88 4.04 Intr - 149105 149003 103 2 1 74 110 72 0.860 6.83 4.03 Intr - 150378 150331 48 0 0 60 53 89 0.498 0.56 4.02 Intr - 154396 153565 832 0 1 73 90 427 0.724 31.87 4.01 Init - 159441 159296 146 0 2 35 80 184 0.895 11.94 4.00 Prom - 168718 168679 40 -7.15 5.00 Prom + 173813 173852 40 -9.25 5.01 Init + 176703 176705 3 2 0 113 22 0 0.184 -4.05 5.02 Intr + 178619 178909 291 1 0 113 98 347 0.880 34.61 5.03 Intr + 185973 186110 138 2 0 127 92 60 0.994 10.04 5.04 Intr + 190446 190567 122 2 2 106 89 65 0.765 6.77 5.05 Intr + 196576 196696 121 1 1 32 84 83 0.670 1.88 5.06 Intr + 198055 198194 140 0 2 78 84 156 0.989 12.54 5.07 Intr + 201879 201981 103 0 1 76 84 47 0.564 2.36 5.08 Intr + 206905 206969 65 0 2 71 106 14 0.571 -1.90 5.09 Intr + 208968 209139 172 0 1 117 76 82 0.991 8.92 5.10 Intr + 211555 211686 132 0 0 80 80 104 0.990 8.72 5.11 Intr + 213565 213696 132 0 0 100 101 101 0.998 12.52 5.12 Intr + 219507 219593 87 2 0 111 78 23 0.801 2.85 5.13 Intr + 225447 225560 114 2 0 69 89 39 0.728 1.82 5.14 Intr + 226298 226367 70 1 1 93 98 83 0.991 7.64 5.15 Intr + 230436 230485 50 1 2 103 81 111 0.890 9.38 5.16 Term + 230586 230765 180 2 0 60 49 477 0.999 37.63 5.17 PlyA + 230957 230962 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:28716015_28949938|GENSCAN_predicted_peptide_1|553_aa XGVKSGNWGPVTRRGKARFMEDWGDKWTDGNTQRRTLMQSHVHKATAGAFSKEYECNRFL HYQVSLARNAEDPFSSFSVFQLRVLEKKFADPWTRVSTSYPLGSFGFLKVRERRAVAAAS AAEKPLFPLLGRRVCADKMADGELNVDSLITRLLEGECAPGRGTEGGRAPPPTPASPSAA GTRGDPFPAPRRVSGSAARRTNRGGGEEALGAGERPLGARSGEIGGEGAVPADPRGPGPP AEGLRGCRPGKIVQMTEAEVRGLCIKSREIFLSQPILLELEAPLKICGDIHGQYTDLLRL FEYGGFPPEANYLFLGDYVDRGKQSLETICLLLAYKIKYPENFFLLRGNHECASINRIYG FYDECKRRFNIKLWKTFTDCFNCLPIAAIVDEKIFCCHGGLSPDLQSMEQIRRIMRPTDV PDTGLLCDLLWSDPDKDVQGWGENDRGVSFTFGADVVSKFLNRHDLDLICRAHQVVEDGY EFFAKRQLVTLFSAPNYCGEFDNAGGMMSVDETLMCSFQILKPSEKKAKYQYGGLNSGRP VTPPRTANPPKKR >gi568815596f:28716015_28949938|GENSCAN_predicted_CDS_1|1662_bp nnaggtgtgaagagtgggaactggggaccagtaactcggagaggcaaggccaggttcatg gaagactggggggacaaatggactgatggcaacacacaaagacgcacacttatgcagtca catgtgcacaaagcgacagcaggcgccttttcaaaagaatatgaatgcaatagattcttg cattaccaagtatcattggctaggaatgctgaggatcccttttccagcttctctgttttc caactccgagttctggagaaaaagtttgctgacccatggactagagtatccacatcttat ccactggggtcttttgggttcttgaaggtgagagaacgccgagccgtcgccgcagcctcc gccgccgagaagcccttgttcccgctgctgggaaggagagtctgtgccgacaagatggcg gacggggagctgaacgtggacagcctcatcacccggctgctggagggtgagtgcgcgcct ggccgcgggacagagggaggtcgggcaccgccgccgacccctgcgtccccgtctgccgcc ggaacgcgaggggacccctttcccgccccgagacgagtctctgggagcgcggcgcggcgg acgaaccgaggagggggcgaggaggctctgggcgcgggggagcggcctctgggagcgcgg tcaggggagatcgggggagagggggccgttcccgcggaccctcgggggccaggcccgccg gccgaaggcttacgaggatgtcgtccaggaaagattgtgcagatgactgaagcagaagtt cgaggcttatgtatcaagtctcgggagatctttctcagccagcctattcttttggaattg gaagcaccgctgaaaatttgtggagatattcatggacaatatacagatttactgagatta tttgaatatggaggtttcccaccagaagccaactatcttttcttaggagattatgtggac agaggaaagcagtctttggaaaccatttgtttgctattggcttataaaatcaaatatcca gagaacttctttctcttaagaggaaaccatgagtgtgctagcatcaatcgcatttatgga ttctatgatgaatgcaaacgaagatttaatattaaattgtggaagaccttcactgattgt tttaactgtctgcctatagcagccattgtggatgagaagatcttctgttgtcatggagga ttgtcaccagacctgcaatctatggagcagattcggagaattatgagacctactgatgtc cctgatacaggtttgctctgtgatttgctatggtctgatccagataaggatgtgcaaggc tggggagaaaatgatcgtggtgtttcctttacttttggagctgatgtagtcagtaaattt ctgaatcgtcatgatttagatttgatttgtcgagctcatcaggtggtggaagatggatat gaattttttgctaaacgacagttggtaaccttattttcagccccaaattactgtggcgag tttgataatgctggtggaatgatgagtgtggatgaaactttgatgtgttcatttcagata ttgaaaccatctgaaaagaaagctaaataccagtatggtggactgaattctggacgtcct gtcactccacctcgaacagctaatccgccgaagaaaaggtga >gi568815596f:28716015_28949938|GENSCAN_predicted_peptide_2|224_aa MSYGPGTETQQLRSQNSGADDLGDKKRCLMGHKEVGFIKKTPQISIPPTIKAAGTRGDGS ACPGPSLRADGRASGAASGIGPARPSPGTFTRYAAGRRAAKAPTCAATASLARSLSPEPA AGSACVVAAKAAEGAHGRRREDEAGALPSHLRRAVGSPASEPRDRAHSKLEIRLRGPFPG ASTGTCGSPGLRGRGPGNGGQGSVAALLASDGCSSVASQQRYPL >gi568815596f:28716015_28949938|GENSCAN_predicted_CDS_2|675_bp atgagctacgggcctggtactgagacacagcagctcagatctcaaaattctggggcagac gatcttggggacaaaaagaggtgtttaatgggacataaggaggtaggatttatcaaaaag accccccaaatctccattcctcccacaataaaggcggcaggcacacgtggagacgggagc gcctgcccagggccctccctccgagcagacggccgagcttcgggagcagcctccggtatc ggccctgcccgtccttcccctggaaccttcacccgctacgccgccgggcggagggcggcc aaagccccaacctgcgcggccactgcctccctcgccaggtccctcagcccagagcccgct gcggggagcgcgtgtgtcgtcgccgcgaaggcagctgagggcgcccacgggaggcggcgt gaggacgaggctggagcgctgccttctcatctaaggcgggcggtggggtcgccggcgagc gaacccagggaccgggcacactcgaaactggagattcgcctgcgaggccccttcccgggg gcgagcacaggtacctgcggaagcccggggctgcgcgggagagggccgggcaacggcggt caaggctccgtcgcagcgctcctggcctcagacggttgctcgtcggtcgctagccagcag cggtacccgctctaa >gi568815596f:28716015_28949938|GENSCAN_predicted_peptide_3|169_aa MSGRYLANTVEEDEEETKYEIFPWALGKNWRKLFPNFLKLRDQLWDRIDYRAIVSRRCCE EVMAIAPTHYIWQRERSVHHSGAVRNYNRDEVQLPRGPSATPVDCSLCGKKRRYVRLGLS SSSSLSSHTAGVTEKHSQDSYNSLSMDIIGDPSQAYTGSEGYTNSFTGI >gi568815596f:28716015_28949938|GENSCAN_predicted_CDS_3|510_bp atgagtggaaggtatctggctaatacagttgaagaagatgaagaagaaaccaagtacgaa atttttccatgggctttagggaaaaactggagaaaattgttccctaatttcttaaagtta agggaccagctctgggatagaattgactatagggctattgtaagcaggcgatgttgtgag gaggttatggccattgcaccaacccattatatctggcaaagagaacgttctgttcatcac agtggagctgtcagaaactacaacagagatgaagttcagctgccccggggacctagtgcc acaccagtagattgttcactctgtggtaaaaaaagaagatatgttagactgggattgtct tcatcatcatctttatccagtcatacagcaggggtgacagaaaaacattctcaggactca tacaactcactgtcaatggacataataggtgatccttctcaagcttatactggttctgaa ggatacaccaattccttcacgggcatatga >gi568815596f:28716015_28949938|GENSCAN_predicted_peptide_4|527_aa MFEKVLNFSEFGFAVSSNTAVNPRGEVLQNPDSSLAATGDKVKKQEKSRRSRGAVEPHAA AEPSGCCAMRATGKEGVALGLRHSSATAPSRNTMLMAWCRGPVLLCLRQGLGTNSFLHGL GQEPFEGARSLCCRSSPRDLRDGEREHEAAQRKAPGAESCPSLPLSISDIGTGCLSSLEN LRLPTLREESSPRELEDSSGDQGRCGPTHQGSEDPSMLSQAQSATEVEERHVSPSCSTSR ERPFQAGELILAETGEGETKFKKLFRLNNFGLLNSNWGAVPFGKIVGKFPGQILRSSFGK QYMLRRPALEDYVVLMKRGTAITFPKSVEKLSSTKRVPSAQKDINMILSMMDINPGDTVL EAGSGSGGMSLFLSKAVGSQGRVISFEVRKDHHDLAKKNYKHWRDSWKLSHVEEWPDNVD FIHKDISGATEDIKSLTFDAVIELLDGIRTCELALSCEKISEVIVRDWLVCLAKQKNGIL AQKVESKINTDVQLDSQEKIGVKGELFQEDDHGELQFYFMHAVMNGE >gi568815596f:28716015_28949938|GENSCAN_predicted_CDS_4|1584_bp atgtttgagaaggttttaaacttctctgaatttggatttgcagtttctagtaacacagca gtaaatcctagaggagaagttctacagaatccagattcaagtttagcagcaactggggat aaagtgaaaaagcaggagaaaagcaggcgcagtcgcggagctgtagagccccacgcagct gcagagccatcgggctgctgcgccatgcgcgcgactgggaaagaaggggtcgcgctaggc ttgcgtcactcgtctgcgacggcgccttcgcgaaacactatgctaatggcatggtgccgc ggtcctgtcttgctgtgcctgcggcaggggctcggaaccaattcattcctgcacggcctg gggcaggagcccttcgagggagctcggtcactgtgttgcaggtcctcgcctagagacctg cgagatggagaaagagagcacgaggcggcacaaaggaaagccccaggagcagagtcttgc ccatctctccctctgagcatctcggacattgggactggatgtctttcgtcactggaaaac ctcagactgccgacgctgcgggaagagtcatcccctcgagagctcgaggactcgagcgga gaccagggccggtgcggtcccacacaccagggatccgaggatccttcgatgctctcgcag gcccagtccgctaccgaggtcgaagagcgtcacgtctccccttcttgttcaacttccaga gagagaccctttcaggctggggaactgattttagctgagactggggagggagaaacaaaa tttaagaaattatttaggttgaacaacttcggactcttaaatagtaactggggggcagtc ccgttcggcaagatcgtggggaagttccccggccagatactgaggagttccttcggtaag cagtacatgctgaggaggccagccttggaagactatgtagtattgatgaaaagagggact gccataacattcccaaagtctgtggaaaaactgtcttccacgaaacgggtccctagtgcc caaaaggatattaatatgattctctcaatgatggatatcaacccaggtgatactgttttg gaagctggctcaggctctggtggaatgagcttatttttatccaaagcagttggatcacaa ggacgagtcataagttttgaggtacgaaaagaccaccatgatctggctaagaagaattac aaacactggcgtgattcatggaaattaagtcatgtagaagagtggccagacaatgtggat tttattcataaggacatttcaggagcaaccgaagacataaaatctttaacatttgacgca gttattgaacttttagatggaattcgcacctgtgaacttgctctttcatgtgaaaagata agcgaggtcattgtcagagattggttggtttgccttgcaaaacagaaaaatggaatttta gctcaaaaagtagaatctaaaatcaacacagatgtacaactagattctcaagagaaaatt ggagttaaaggtgagctgtttcaagaggatgaccatggtgagcttcagttttactttatg catgcagtaatgaatggagaatga >gi568815596f:28716015_28949938|GENSCAN_predicted_peptide_5|639_aa MAPPRAGAPAHGRTRGCSGARAAMAAGGGGSCDPLAPAGVPCAFSPHSQAYFALASTDGH LRVWETANNRLHQEYVPSAHLSGTCTCLAWAPARLQAKESPQRKKRKSEAVGMSNQTDLL ALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCIQWHQDSGCLYSCSDDKHIVEWNVQT CKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLMFT TIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYIDLTL SENKEEPVKLAVVCRDGQVHLFEHILNGYCKKPLTSNCTIQIATPGKGKKSTPKPIPILA AGFCSDKMSLLLVYGSWFQPTIERVVRTPVMNSEAKVLVPGIPGHHAAIKPAPPQTEQVE SKRKSGGNEVSIEERLGAMDIDTHKKGKEDLQTNSFPVLLTQGLESNDFEMLNKVLQTRN VNLIKKTVLRMPLHTIIPLLQELPDLVPQLGTLYQLMESRVKTFQKLSHLHGKLILLITQ VTASEKTKGATSPGQKAKLVYEEESSEEESDDEIADKDSEDNWDEDEEESESEKDEDVEE EDEDAEGKDEENGEDRDTASEKELNGDSDLDPENESEEE >gi568815596f:28716015_28949938|GENSCAN_predicted_CDS_5|1920_bp atggctccgccccgcgccggtgcgcctgcgcacggacgaacacgtggctgcagcggggcc agagcagcaatggcggcgggcggcggcggtagctgcgaccccctggcccctgctggggtc ccttgcgccttctccccgcacagccaggcctacttcgctttggcctctaccgacggtcac ttacgagtatgggagacggccaacaaccggctgcaccaggagtacgtgccttccgcgcac ctcagtggtacctgcacctgtctggcctgggcgccagcgcggctgcaggccaaggaaagt ccccagaggaaaaaaaggaaatcagaagctgtaggaatgagtaaccagactgacttattg gctcttggcacagcagttggtagcattttattatacagcacagtaaaaggagagttacac agtaaattaataagtggtggacatgacaacagagtcaactgcatacagtggcatcaagac agtggctgtttatatagttgttcagatgataaacatattgtggaatggaacgtacagaca tgcaaagtaaagtgcaaatggaaaggcgacaatagcagtgtcagttccctatgtatcagc ccagatggaaagatgttgctttcagctggtcgaacaatcaaactatgggttttggagacc aaagaagtctacaggcatttcacaggacatgcaacgccagtttcgtcactgatgttcact accatcagacctcctaatgagagccagccctttgatggaattacaggtctttatttctta tctggagcagtacatgaccggttacttaatgtctggcaggtccgatcagaaaacaaagaa aagagtgcagtgatgtcatttacagttaccgatgaacctgtctatattgacttaactttg tcagaaaacaaagaagagcctgtcaagttggctgttgtttgcagagatggtcaagtccat ctttttgaacacatattaaatgggtactgcaaaaagcctttgacttcaaactgcacaatt cagatagcaacacctgggaaaggcaagaagtcaacaccaaaacccatccctattctagct gctggtttttgctcagacaaaatgtcattgttgcttgtatatggcagttggtttcagcct actattgagcgagtggtgaggacaccagtgatgaattctgaagcaaaagttctggtgcct gggattcctggtcatcatgcagctatcaagcccgctcctccacaaaccgagcaagtagag agcaagaggaagtcagggggaaatgaggttagcattgaagaacgtctgggagcaatggat atagacacacacaaaaaaggaaaggaagacctccagacgaatagctttccagttcttctt acccagggcttagaaagtaacgattttgaaatgctaaataaagtacttcaaactaggaat gtaaaccttataaagaagactgtattaaggatgcccctgcatactattattccgttgtta caagagttgcctgacctggtaccccagctggggacactctaccagttaatggaaagcaga gtcaaaacttttcagaaactttcacaccttcatggaaagcttattcttctaattacacaa gtaacagcatcagagaagacaaagggagcaacttcccctggacagaaggcaaagttggtg tatgaagaagagtcttctgaagaggagtctgatgatgaaatagcagataaggattctgaa gataattgggatgaagatgaggaggagagtgaaagtgaaaaagatgaggacgttgaagag gaagatgaggatgccgaaggaaaagatgaagaaaatggcgaggacagagatacagcaagt gaaaaagaattaaatggagattctgatttagatcctgaaaatgaaagtgaagaagaatga