GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:05:52

Sequence gi568815577f:36286137_36516363 : 230227 bp : 45.88% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2620   2675   56  0  2   74   99    51 0.794   2.58
 1.02 Intr +   5968   6097  130  1  1  114   98    23 0.797   6.60
 1.03 Term +   7184   7435  252  1  0   95   41   168 0.998   8.44
 1.04 PlyA +   7964   7969    6                               1.05

 2.02 PlyA -   8681   8676    6                              -0.45
 2.01 Sngl -   9605   9174  432  2  0   43   55   613 0.970  47.78
 2.00 Prom -  27674  27635   40                              -0.56

 3.02 PlyA -  28427  28422    6                               1.05
 3.01 Sngl -  34310  33939  372  2  0   92   42   243 0.525  14.23
 3.00 Prom -  42061  42022   40                              -8.56

 4.00 Prom +  44385  44424   40                              -3.06
 4.01 Init +  50295  50343   49  2  1   86   58     7 0.220  -3.38
 4.02 Intr +  50738  50870  133  0  1   53  100    72 0.763   4.60
 4.03 Intr +  51596  51728  133  2  1   69   -1   112 0.149   1.05
 4.04 Intr +  52643  52785  143  1  2   25   92    64 0.263  -0.35
 4.05 Intr +  55263  55410  148  0  1   49   75   157 0.690  10.74
 4.06 Intr +  58776  58895  120  2  0    9   98    65 0.340   0.29
 4.07 Intr +  63175  63272   98  0  2   99   91    27 0.903   2.91
 4.08 Intr +  70484  70588  105  2  0   30   98    77 0.767   2.23
 4.09 Intr +  73819  73941  123  1  0   81   81    42 0.833   2.50
 4.10 Intr +  74048  74122   75  2  0   49   86    89 0.864   3.43
 4.11 Intr +  82852  83740  889  1  1   42  105   541 0.971  42.52
 4.12 Intr +  86238  86395  158  2  2   44   85   125 0.964   6.61
 4.13 Term +  89007  89160  154  0  1   71   39   136 0.990   4.39
 4.14 PlyA +  89825  89830    6                              -0.45

 5.00 Prom +  90175  90214   40                              -4.26
 5.01 Init +  99216  99315  100  2  1   41   96   196 0.719  14.12
 5.02 Intr + 100026 100126  101  1  2    1   59   150 0.676   3.23
 5.03 Intr + 101462 101594  133  1  1   38  101    82 0.995   4.72
 5.04 Intr + 105415 105532  118  2  1   59   35   157 0.923   7.02
 5.05 Intr + 108411 108514  104  0  2   76   94    69 0.887   6.12
 5.06 Intr + 111279 111375   97  1  1   86   99    -7 0.492  -0.73
 5.07 Intr + 113385 113469   85  0  1   49   60    76 0.451   0.72
 5.08 Intr + 116622 116715   94  2  1   48   89    68 0.369   2.44
 5.09 Intr + 125327 125468  142  0  1   89  101   170 0.999  17.81
 5.10 Intr + 126694 127179  486  1  0   35  111   214 0.307  10.53
 5.11 Intr + 129159 129253   95  0  2   39  111    34 0.730   0.41
 5.12 Term + 130387 130517  131  2  2   68   47    83 0.489   0.54
 5.13 PlyA + 132858 132863    6                               1.05

 6.02 PlyA - 135084 135079    6                              -0.45
 6.01 Sngl - 136631 136254  378  2  0   46   49   268 0.822  14.88
 6.00 Prom - 140659 140620   40                              -4.26

 7.04 PlyA - 141925 141920    6                               1.05
 7.03 Term - 147084 146999   86  1  2   62   35    95 0.253  -0.58
 7.02 Intr - 158305 158147  159  2  0   70   72   115 0.407   8.06
 7.01 Init - 159907 159886   22  1  1   97   87    12 0.641   1.86
 7.00 Prom - 172376 172337   40                              -3.56

 8.02 PlyA - 172383 172378    6                               1.05
 8.01 Sngl - 175559 174840  720  2  0  110   47  1170 0.998 109.33
 8.00 Prom - 191590 191551   40                               1.14

 9.05 PlyA - 191971 191966    6                               1.05
 9.04 Term - 200762 200183  580  1  1   68   49   297 0.108  17.76
 9.03 Intr - 207461 207356  106  0  1  122   18    42 0.001  -0.13
 9.02 Intr - 219300 219240   61  0  1   83   35   103 0.214   2.81
 9.01 Intr - 224364 224227  138  0  0   69   90    43 0.160   3.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815577f:36286137_36516363|GENSCAN_predicted_peptide_1|145_aa
IQTFTQLEEDLKDEDESLSYRWAFIPEVDTEGPAFLSDVEENHQECKPHTVRILELLKLK
FGEISSSDEITMKSEFPLLRQHSVSSIRQLMPFFMTLNGAFKTQRQLPADSPGTPFLDFP
VTDSPRILKQLEECIEYDFLEHPEC

>gi568815577f:36286137_36516363|GENSCAN_predicted_CDS_1|438_bp
attcagacattcacacagcttgaagaagatctaaaagatgaagatgagtcattgagttat
aggtgggcatttattccagaagtggacacagagggccctgccttcctgtcggatgtagag
gagaatcaccaagaatgcaaaccccacactgtcaggattctagaacttctaaaattaaag
tttggggaaatcagtagctctgatgagatcaccatgaagagtgaattcccgcttctgcgc
caacattctgtttccagcatcaggcagttgatgccattcttcatgactctaaatggtgca
tttaagacccagagacagctgcctgctgatagcccaggaactccattcttggactttcct
gtcacagatagcccaaggatcttaaaacaactggaagaatgcatcgaatatgattttctg
gaacatccagaatgttaa

>gi568815577f:36286137_36516363|GENSCAN_predicted_peptide_2|143_aa
MRAAAGQQRRAVRLSGWADTSAAGRATGASTWGNLPTHVREKEPQDLFYKYSRIREIELK
SRYGLVPFASVRFEDPRDAEDAIYGRNGLLPSGSWQDLKDHTREAGDACYTDVQKDGVGM
VGCLRKEDMEYALRQLDDQIPLS

>gi568815577f:36286137_36516363|GENSCAN_predicted_CDS_2|432_bp
atgcgggctgcggctggccagcagcgtcgggcggtgcggttgtccggctgggcggacacg
agcgcggctgggagggcgacgggtgcatctacgtggggaaaccttccgacccacgtgcgc
gagaaggagccgcaggacctgttctacaagtacagccgcatccgcgagatcgagctcaag
agccggtacggccttgtgcccttcgcctccgtgcgcttcgaggaccctcgagatgcagag
gatgctatttatggaagaaatggacttcttccatcaggcagctggcaggacctgaaggat
cacacgcgagaagctggggatgcctgttacacggatgtgcagaaggatggagtggggatg
gttgggtgtctcagaaaagaagacatggaatatgccctgcgtcaactggatgaccaaatt
ccactctcatga

>gi568815577f:36286137_36516363|GENSCAN_predicted_peptide_3|123_aa
MAAPSPGPAAAPAAASAPQAATAHCRPPFACPPALLGHGPLHPSRLLTALRRIPRGGCAA
ILSELQSQERPPPQPDWEVAERLWSPMGTAHFRGGAHVWPLPPTSASRQCRIGSLGKMNC
LTG

>gi568815577f:36286137_36516363|GENSCAN_predicted_CDS_3|372_bp
atggccgccccctccccgggccccgccgcagcccccgccgccgcgtcggccccacaagcc
gccaccgcccactgccggccccccttcgcttgcccgcccgccctcctgggacacggcccg
ctccacccctcgcggctgctcaccgcgctgaggcgtatcccgcggggtggctgcgccgcc
atcttgagcgagctacaaagccaggaacggcctccgccgcaacccgactgggaggtggcg
gaacgactgtggagccctatgggtaccgcccacttccggggaggggcgcacgtctggccc
ctcccacccacttccgcctccaggcaatgccgaattgggagcttggggaagatgaattgc
ctgactggttaa

>gi568815577f:36286137_36516363|GENSCAN_predicted_peptide_4|775_aa
MGFHHVGQADLKLLTSDNAYDPDVNAKQIWIDKTVINDHICLTFTDNGNGMTSDKLHKML
SFGFSDKVTMNGHVPVGLYGNGFKSGSMRLGKDAIVFTKNGESMSMINLAESKASLAAIL
EHSLFSTEQKLLAELDAIIGKKGTRIIIWNLRSYKNATEFDFEKDKYDIRIPEDLDEITG
KKGYKKQERMDQIAPESDYSLRSKTVRITFGFNCRNKDHYGIMMYHRNRLIKAYEKVGCQ
LRANNMGVGVVGIIECNFLKPTHNKQDFDYTNEYRLTITALGEKLNDYWNEMKVKKNTEY
PLNLPVEDIQKRPDQTWVQCDACLKWRKLPDGMDQLPEKWYCSNNPDPQFRNCEVPEEPE
DEDLVHPTYEKTYKKTLKRRLSTRSSILNAKNRRLSSQFENSVYKGDDDDEDVIILEENS
TPKPAVDHDIDMKSEQSHVEQGGVQVEFVGDSEPCGQTGSTSTSSSRCDQGNTAATQTEV
PSLVVKKEETVEDEIDVRNDAVILPSCVEAEAKIHETQETTDKSADDAGCQLQELRNQLL
LVTEEKENYKRQCHMFTDQIKVLQQRILEMNDKYVKKETCHQSTETDAVFLLESINGKSE
SPDHMVSQYQQALEEIERLKKQCSALQHVKAECSQCSNNESKSEMDEMAVQLDDVFRQLD
KCSIERDQYKSEVELLEMEKSQIRSQCEELKTEVEQLKSTNQQTATDVSTSSNIEESVNH
MDGESLKLRSLRVNVGQLLAMIVPDLDLQQVNYDVDVVDEILGQVVEQMSEISST

>gi568815577f:36286137_36516363|GENSCAN_predicted_CDS_4|2328_bp
atggggtttcaccatgttggccaggctgatctcaaactcctgacctcagataatgcttat
gatcctgatgtgaacgctaaacaaatatggattgacaaaacagtgataaatgaccatata
tgcttgacattcaccgacaatgggaatggtatgacttctgataaattacataaaatgcta
agctttggcttcagtgacaaagtcaccatgaatggtcatgtcccagttggattatatggg
aatggcttcaagtcgggttctatgcgtctgggtaaagacgcaatcgtttttaccaaaaat
ggagaaagcatgagcatgattaatttagcagaatcaaaagccagccttgctgcaattctg
gaacattctctgttttccacggaacagaagttactggcagaacttgatgctattataggc
aagaaggggacgaggatcatcatttggaatcttagaagctacaaaaatgcaacagagttc
gattttgaaaaggataaatatgatatcagaattcccgaggatttagatgagataacaggg
aagaaggggtacaagaagcaggaaaggatggaccagattgcccctgagagtgactattcc
ctgaggtctaaaacagtgagaattacctttggattcaactgcagaaataaagatcattat
gggataatgatgtatcacagaaatagactcatcaaagcttatgaaaaagttggatgtcag
ttaagggcaaacaacatgggtgttggagtggttggaattatagagtgtaatttccttaag
ccaactcataataaacaagatttcgactatactaatgagtacagacttacaataacagca
ctaggagaaaagctgaatgattactggaatgaaatgaaagtgaagaaaaatacagaatat
cctctaaatttgccagttgaagatatacagaagcgtcctgatcagacatgggttcagtgt
gatgcctgtctaaagtggcggaaattacctgatgggatggatcaacttcctgaaaaatgg
tattgctccaataaccctgacccacagttcagaaattgtgaggttccagaagaacctgaa
gatgaggatttggtacatcccacttatgaaaaaacctacaaaaagaccttgaaacggaga
ctttctactcgttcctcaattttgaatgcaaagaatcggagattgagtagtcagtttgaa
aattcagtttataaaggtgatgatgatgatgaagatgtcatcatcttagaagaaaacagt
acccccaaacctgcagtagatcatgatattgacatgaaatcagaacagagtcacgttgag
caaggtggtgttcaggttgagtttgtgggtgacagtgaaccttgtggccagactggttca
acaagcacctcatcatcccgatgcgaccagggaaatactgcagctacccagactgaagta
ccaagtttagttgttaaaaaagaagaaactgttgaagacgagatagacgtaagaaatgat
gcagtgattctgccctcctgtgtagaagctgaagcaaagatacatgaaacccaggaaacc
accgataaatctgcagatgatgcaggctgccaattacaagaactgagaaaccagctactc
cttgtcactgaggaaaaagagaattataaaagacagtgtcatatgtttactgatcaaatc
aaagtgttacaacagaggatactagaaatgaatgacaagtatgttaagaaagaaacttgc
catcagtccactgaaaccgatgctgtatttttacttgaaagtattaatggcaaatctgaa
agtccagaccatatggtatctcagtatcagcaagctttggaagaaatagaaaggctgaaa
aaacaatgtagtgctttgcaacatgtaaaggctgaatgcagccagtgttccaataatgag
agtaaaagtgaaatggatgagatggctgtgcagcttgacgatgtgtttagacaactggac
aaatgcagtattgagagggaccagtataaaagtgaggttgaattgctggaaatggaaaag
tcacaaatccgttcacagtgtgaagaactcaaaactgaagtagaacagttaaaatctaca
aatcaacagacggcaacagatgtttcaacatcaagtaacattgaggagtctgtaaatcat
atggatggagaaagcctcaaactccgatctcttcgagttaacgtaggacaactgctggct
atgattgtgcctgatcttgatcttcagcaagtgaattacgatgttgatgtagttgatgag
attttaggacaagttgttgaacaaatgagtgaaatcagtagtacttaa

>gi568815577f:36286137_36516363|GENSCAN_predicted_peptide_5|561_aa
MRSRPCGGGACGAREAARAAREVTVPLTVRVPPAWHNKEPVYSLDFQHGTAGRIHRLASA
GVDTNVRIWKVEKGPDGKAIVEFLSNLARHTKAVNVVRFSPTGEILASGGDDAVILLWKV
NDNKEPEQIAFQDEDEAQLNKENWTVVKTLRGHLEDVYDICWATDGNLMASASVDNTAII
WDVSKGQKISIFNEHKSYVQGVTWDPLGQYVATLSCDRVLRVYSIQKKRVAFNVSKMLSG
IGAEGEARSYRMFHDDSMKSFFRRLSFTPDGSLLLTPGVELMSLPYRLVFAVASEDSVLL
YDTQQSFPFGYVSNIHYHTLSDISCRCESVGKVCNFSLFWGRRSSDGAFLAISSTDGYCS
FVTFEKDELGIPLKEKPVLNMRTPDTAKKTKSQTHRGSSPGPRPVEGTPASRTQDPSSPG
TTPPQARQAPAPTVIRDPPSITPAVKSPLPGPSEEKTLQPSSQNTKAHPSRRVTLNTLQA
WSKTTPRRINLTPLKTDTPPSSVPTSVISTPSTEEIQSEIWMLLYSVSIFNLGHERFNVV
NPLFDEFLKLERFNVIQCENQ

>gi568815577f:36286137_36516363|GENSCAN_predicted_CDS_5|1686_bp
atgaggtcccgcccatgcgggggcggggcctgcggcgcgcgggaagcggcgcgcgctgcg
cgggaggtgacggtgcctctgactgtccgggtccctccagcctggcacaacaaggagccc
gtgtacagcctggacttccagcatgggacggctgggaggatccacagactggcgtctgcc
ggcgtggacaccaatgtcaggatctggaaggtagaaaagggaccagatggaaaagccatc
gtggaatttttgtccaatcttgctcgtcataccaaagccgtcaatgttgtgcgtttttct
ccaactggggaaattttagcatcgggaggagatgatgctgtcatcctattgtggaaggtg
aatgataacaaggagccggagcagatcgcttttcaggatgaggacgaggcccagctgaac
aaggagaactggacggttgtgaagactctgcggggccacttagaagatgtgtatgatatt
tgctgggcaactgatgggaatttaatggcttctgcctctgtggataacacagccatcata
tgggatgtcagcaaaggacaaaagatatcaatttttaatgaacataaaagttatgtccaa
ggagtaacctgggaccctttgggtcaatatgttgctactctgagctgtgacagggtgctg
cgagtatacagtatacagaagaagcgtgtggctttcaatgtttcgaagatgctgtctgga
ataggggctgaaggagaggcaagaagctaccggatgtttcacgacgacagcatgaagtct
ttcttccgtagactgagtttcactcccgacggatctttgcttctcacgccaggtgtggag
ctgatgagtctgccctaccgcctggtgtttgctgtggcctcggaggattccgtgcttctg
tatgacacccagcagtccttcccttttggttacgtgtctaatatacattaccacaccctc
agtgacatttcatgtagatgtgagagtgttggtaaggtctgtaacttttccctgttttgg
ggacgaaggtccagcgatggtgccttcctggccatttcttccacggacggttactgctca
tttgtgacatttgagaaagatgaacttggaattcctttgaaagagaagccagttttgaac
atgagaactcctgatacagcaaagaaaaccaagagtcagacacatcgagggtcttcgcca
ggacccagaccggtagagggaacccctgccagcagaacccaagaccccagcagccccggc
acgactccccctcaggccagacaggccccagccccaacagtcatcagggaccctccctcc
atcactcctgctgtcaaaagccccttgccggggccttcggaggagaagaccctgcagccc
agtagtcaaaacacaaaagcccacccatcccggagggtcactctgaacacactgcaagcc
tggagcaagacaacaccccggagaataaacttaacacccttaaagacggacactccacca
agttctgtaccaaccagtgtgatttccaccccttctacagaagaaattcagtcagaaata
tggatgctgttgtattcagtatccatttttaacttgggacatgaacgttttaacgtagta
aatcctctttttgatgagtttctgaaactggagcggttcaacgttatccagtgtgaaaat
cagtga

>gi568815577f:36286137_36516363|GENSCAN_predicted_peptide_6|125_aa
MWRGSGPVTSRIRRQATSKAALNIEDSVEIKRTNNSNASGVNDNQQTCQSWRHGRTLDGT
VTGTAAYLIRGRQGIHSSFQQQSPTESRDKRPGPGKETGTGLFGASRREGLAIPPPSGFR
KCPAC

>gi568815577f:36286137_36516363|GENSCAN_predicted_CDS_6|378_bp
atgtggcggggcagtggcccagtgacatcaaggattaggagacaggccacgtctaaggcg
gccttgaatatcgaggattctgtggaaatcaaaaggacaaacaactcaaatgcctctggt
gtcaacgacaaccagcagacatgtcagtcctggaggcatggtcggactctggatgggaca
gtcacgggaacagctgcctatctgataaggggccgccagggaattcacagcagtttccaa
cagcagagccccacagagagcagggacaagagacctgggcccgggaaggagacagggaca
ggactctttggggcaagtaggagagaaggattagcaataccgccccctagtggcttccgc
aagtgcccggcctgttag

>gi568815577f:36286137_36516363|GENSCAN_predicted_peptide_7|88_aa
MGNRSIERTWMKLEAIILSRQTQEQKTKRLMYSLISGSRTMRTRGHREGNITHRGLSGVE
ECGSSDKFDQFPAVICTATRQCLSYHYE

>gi568815577f:36286137_36516363|GENSCAN_predicted_CDS_7|267_bp
atggggaacagaagtattgaaaggacatggatgaagctggaagccatcatcctcagcaga
caaacacaggaacagaaaaccaaacgcctcatgtactcactcataagtggaagtcgaaca
atgagaacacgtggacacagggaggggaacatcacacaccggggcctgtcgggggtggag
gaatgtggatcttcagacaagtttgatcagttcccagctgtcatctgtacagccacacga
cagtgcctttcttaccattatgagtaa

>gi568815577f:36286137_36516363|GENSCAN_predicted_peptide_8|239_aa
MASTAVQLLGFLLSFLGMVGTLITTILPHWRRTAHVGTNILTAVSYLKGLWMECVWHSTG
IYQCQIYRSLLALPQDLQAARALMVISCLLSGIACACAVIGMKCTRCAKGTPAKTTFAIL
GGTLFILAGLLCMVAVSWTTNDVVQNFYNPLLPSGMKFEIGQALYLGFISSSLSLIGGTL
LCLSCQDEAPYRPYQAPPRATTTTANTAPAYQPPAAYKDNRAPSVTSATHSGYRLNDYV

>gi568815577f:36286137_36516363|GENSCAN_predicted_CDS_8|720_bp
atggccagcacggccgtgcagcttctgggcttcctgctcagcttcctgggcatggtgggc
acgttgatcaccaccatcctgccgcactggcggaggacagcgcacgtgggcaccaacatc
ctcacggccgtgtcctacctgaaagggctctggatggagtgtgtgtggcacagcacaggc
atctaccagtgccagatctaccgatccctgctggcgctgccccaagacctccaggctgcc
cgcgccctcatggtcatctcctgcctgctctcgggcatagcctgcgcctgcgccgtcatc
gggatgaagtgcacgcgctgcgccaagggcacacccgccaagaccacctttgccatcctc
ggcggcaccctcttcatcctggccggcctcctgtgcatggtggccgtctcctggaccacc
aacgacgtggtgcagaacttctacaacccgctgctgcccagcggcatgaagtttgagatt
ggccaggccctgtacctgggcttcatctcctcgtccctctcgctcattggtggcaccctg
ctttgcctgtcctgccaggacgaggcaccctacaggccctaccaggccccgcccagggcc
accacgaccactgcaaacaccgcacctgcctaccagccaccagctgcctacaaagacaat
cgggccccctcagtgacctcggccacgcacagcgggtacaggctgaacgactacgtgtga

>gi568815577f:36286137_36516363|GENSCAN_predicted_peptide_9|294_aa
GLQGPEIWTQAKKSRTVEAAVSTLLLTPETTQPSQTIWPLVFLMYENIIQEYDGFIVLEL
PPPAAWGLEELSFPCSLKLTLACSVLSFLDYKNLKDQTCRPRHRPHPVQATHRPTQGQDH
LLHWHPHGPSGSEALTGQESQDAYHCLRGKPKGNEKDLVKLAKFLKKEKVNVDMINLGEE
EVNREKLTAFVNTLNGKDGSGSHLVAVPPGPSLADALISSPILAGEGGAMLGLGASDFEF
GVDPSADPQLALALGFLWRSSSSGRMRSPDEQLQPLLLRPGLLQLGLKAQTRSC

>gi568815577f:36286137_36516363|GENSCAN_predicted_CDS_9|885_bp
gggctgcaaggcccagagatctggacacaagccaagaagtcaagaactgttgaagccgca
gtctccactctcctactgacaccagagaccacccagcccagccaaaccatatggcctctt
gttttcctgatgtatgagaacattattcaggagtatgatggcttcatcgtcctggaattg
cctcctccagcagcttggggactggaagaactgagcttcccttgctccttgaagctgaca
ctggcatgcagtgtcctgagttttctggattataagaacctcaaagatcagacctgcagg
cccagacaccggccgcatcctgtccaagctacacaccgtccaacccaagggcaagatcac
cttctacactggcatccacatggcccatctggctctgaagcgctgacggggcaagaatca
caagacgcatatcattgccttcgtgggaagcccaaaggcaatgagaaggatctggtgaaa
ctggctaaattcctcaagaaggagaaagtaaatgttgacatgataaatttgggggaagag
gaggtgaacagagaaaagctgacagcctttgtaaacacgttgaatggcaaagatggaagc
ggttctcatctggtggcagtgcctcctgggcccagcttggctgatgctctcatcagttct
ccgattttggctggggaagggggtgccatgctgggtctcggtgccagtgactttgaattt
ggagtagatcccagtgctgatcctcagctggccttggccctcgggtttctatggaggagc
agcagcagcggcaggatgaggagcccagacgagcagctgcagcctctgctgctgaggcca
ggattgctacaactgggactgaaggctcagacgaggtcctgctga