GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:45:41 Sequence gi568815588f:72592152_72985819 : 393668 bp : 40.20% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 27622 27675 54 0 0 66 87 30 0.408 2.04 1.02 Term + 33275 33373 99 1 0 66 38 161 0.662 6.05 1.03 PlyA + 37423 37428 6 1.05 2.03 PlyA - 38408 38403 6 1.05 2.02 Term - 44735 44480 256 1 1 -17 43 337 0.068 12.77 2.01 Init - 82188 82040 149 0 2 83 23 127 0.309 5.31 2.00 Prom - 82365 82326 40 -4.05 3.00 Prom + 98689 98728 40 -6.75 3.01 Init + 100001 100549 549 1 0 83 -13 463 0.054 28.67 3.02 Intr + 100593 100684 92 2 2 51 -17 91 0.041 -7.23 3.03 Intr + 100819 100915 97 1 1 59 115 102 0.050 9.09 3.04 Intr + 116068 116205 138 0 0 54 77 61 0.070 1.34 3.05 Intr + 122960 123075 116 1 2 125 108 42 0.563 8.23 3.06 Intr + 156676 156697 22 1 1 93 97 -4 0.062 -2.37 3.07 Term + 160603 160815 213 0 0 52 42 162 0.309 4.25 3.08 PlyA + 161561 161566 6 -0.45 4.03 PlyA - 164018 164013 6 1.05 4.02 Term - 166976 166806 171 2 0 59 48 107 0.452 0.64 4.01 Init - 169373 169326 48 2 0 75 100 36 0.853 4.50 4.00 Prom - 179377 179338 40 -2.85 5.00 Prom + 187497 187536 40 -2.25 5.01 Init + 213862 213941 80 0 2 67 33 90 0.783 1.98 5.02 Intr + 217233 217441 209 0 2 61 61 195 0.412 11.90 5.03 Intr + 232816 232841 26 2 2 106 84 15 0.347 -0.27 5.04 Intr + 242208 242277 70 2 1 106 101 20 0.119 2.94 5.05 Intr + 267026 267196 171 0 0 62 80 167 0.978 12.29 5.06 Intr + 268272 268376 105 1 0 49 83 116 0.814 6.47 5.07 Intr + 276552 276712 161 1 2 63 93 198 0.999 16.59 5.08 Intr + 279226 279429 204 0 0 76 110 108 0.994 10.27 5.09 Intr + 292115 292231 117 1 0 117 111 89 0.999 13.84 5.10 Term + 293594 293671 78 1 0 99 39 103 0.990 3.28 5.11 PlyA + 294081 294086 6 1.05 6.00 Prom + 297022 297061 40 -4.65 6.01 Init + 297920 298170 251 1 2 76 105 187 0.495 16.31 6.02 Intr + 303341 303549 209 2 2 -96 35 209 0.003 -5.00 6.03 Intr + 306513 306887 375 1 0 82 72 377 0.874 29.56 6.04 Intr + 308226 308351 126 1 0 85 70 178 0.549 15.43 6.05 Intr + 314445 314567 123 1 0 58 100 149 0.056 12.74 6.06 Intr + 319566 319688 123 1 0 77 62 112 0.152 7.14 6.07 Term + 321157 321350 194 2 2 99 47 107 0.191 4.30 6.08 PlyA + 323223 323228 6 1.05 7.04 PlyA - 324015 324010 6 1.05 7.03 Term - 326132 325485 648 2 0 -1 48 985 0.797 78.69 7.02 Intr - 326312 326166 147 2 0 15 -24 199 0.624 0.91 7.01 Init - 329309 329199 111 2 0 59 64 12 0.284 -3.84 7.00 Prom - 329701 329662 40 -9.95 8.00 Prom + 330183 330222 40 -10.65 8.01 Init + 331440 331481 42 2 0 95 44 10 0.900 -3.50 8.02 Intr + 332078 332493 416 1 2 66 30 638 0.649 47.78 8.03 Intr + 338387 338486 100 2 1 56 113 100 0.830 8.49 8.04 Intr + 339765 339887 123 2 0 49 32 106 0.631 0.96 8.05 Term + 340203 340373 171 2 0 79 32 200 0.996 10.34 8.06 PlyA + 340858 340863 6 1.05 9.04 PlyA - 341461 341456 6 1.05 9.03 Term - 343587 343466 122 1 2 63 55 112 0.711 3.06 9.02 Intr - 349183 349018 166 1 1 110 100 97 0.961 11.71 9.01 Init - 350578 350513 66 1 0 61 76 56 0.626 2.82 9.00 Prom - 359681 359642 40 -4.55 10.03 PlyA - 361250 361245 6 -0.45 10.02 Term - 361789 361628 162 1 0 74 49 181 0.623 9.75 10.01 Init - 362534 362193 342 2 0 69 14 377 0.870 23.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 44626 44480 147 1 0 109 43 221 0.817 13.50 S.002 Sngl + 100001 100564 564 1 0 83 44 429 0.935 31.89 S.003 Term + 314445 314767 323 1 2 58 42 317 0.937 18.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:72592152_72985819|GENSCAN_predicted_peptide_1|50_aa MNGSSPVGRWQNGGEAFQPMAGQNATVKTKQGERNENGKALEKIKTEQSE >gi568815588f:72592152_72985819|GENSCAN_predicted_CDS_1|153_bp atgaacggcagctcaccagttgggagatggcagaacggtggagaggcattccagcctatg gccggccagaatgcgacggtgaaaacaaaacagggagaacgtaatgaaaacgggaaggcg ctggaaaagataaaaactgaacagtccgagtag >gi568815588f:72592152_72985819|GENSCAN_predicted_peptide_2|134_aa MDEAGNRHSQQTIARTKNQTPHVLTHRWELNDENTWTQEGEHHTPEPVVGKRSSKWSSVK KRRERPPIDQSLCALHLCNQCLCSATMVTKPKRRVEGDVKGDKAMVKDKPQRRSERLSAK PAPPKPEPKPKSAL >gi568815588f:72592152_72985819|GENSCAN_predicted_CDS_2|405_bp atggatgaagctggaaaccgtcattctcagcaaactattgcaaggacaaaaaaccaaaca ccacatgttctcactcataggtgggaattgaacgatgagaacacttggacacaggaaggg gaacatcacacaccggagcctgtcgtgggcaaaagatcttcaaaatggagcagtgtgaag aagaggcgagaacgacccccaattgaccaaagcctatgcgccctgcatctctgcaatcag tgcctatgttccgccaccatggttaccaagcccaagagaagggttgaaggggatgttaaa ggagataaagccatggtgaaggacaaacctcagagaagatctgaaaggttgtctgctaaa cctgctcctccaaagccagagcccaagcctaaaagtgccctgtaa >gi568815588f:72592152_72985819|GENSCAN_predicted_peptide_3|408_aa MAAAAGRSLLLLLSSRGGGGGGAGGCGALTAGCFPGLGVSRHRQQQHHRTVSERARRLRG GRGGAGVWEPPAGRPPGQQASYLNSRPLRRLPGTEEPPCPSGACGGDQEGRAGGRKPRGG GGATAGMFPQDWGHQGGEGTGLEPLPDSPPIAFPPFFVLNSPAPRLPERSQMEVVPFPSS SGRESGSAMLLGAAPESSPGPRRGRPSSSLVPEGCFWGSFRGLLYKRVHACLKREGWSAG TGEANMFIMPLVSSFTFVFFSLIQIAEGSGGRGEDVESPESWRVERRKKAVKVFHYHSYS DWQDTVSTSLSMYHASDILAARVWSWPVGVNGITLRKQQSNGCCVTVSLEFMVIEALNIP NITVKVEGALLKEARSNIHFEVEMNQQAVSEECHFQVQRRCSYAIITS >gi568815588f:72592152_72985819|GENSCAN_predicted_CDS_3|1227_bp atggcggccgccgcaggtagatcgctcctgctgctcctctcctctcggggcggcggcggc gggggcgccggcggctgcggggcgctgactgccggctgcttccctgggctgggcgtcagc cgccaccggcagcagcagcaccaccggacggtgagcgagcgcgcccggaggctccgggga gggcggggcggcgctggcgtgtgggagccgccggcgggcaggccgccggggcagcaggcg agttacctcaactcccggccgctccggaggttgccgggcaccgaggagccgccgtgccct tcaggcgcctgcggcggcgaccaggaagggagggcgggcgggaggaagccgagaggagga ggaggggcgacggcggggatgttcccgcaggactggggacaccagggcggggaaggcacg ggacttgaacctctccccgactcgcccccaatcgcgttccctcccttcttcgttcttaac agccctgccccaaggctccctgaacggagccagatggaagttgtcccctttccctcctcc tccgggcgggagagtggcagtgccatgctgctgggagctgccccggagagcagcccagga ccccggcggggccgcccctcgtcctctctcgtccccgaggggtgcttttgggggtccttt cgagggctactgtataagcgtgttcacgcgtgcctgaagcgggaagggtggtcagcaggc acaggagaagcgaatatgttcattatgcctttagtctcatctttcacctttgttttcttt tctctaatccaaattgccgaagggagtggaggaagaggtgaagatgtggaaagcccagag agttggagagttgagagaaggaagaaagcagttaaggtctttcactaccattcctacagt gattggcaagataccgtttctacctcactctccatgtaccatgccagtgacatcttagct gctagagtgtggagctggcctgtgggagtcaatggtattactttaaggaaacagcagagc aatggctgctgtgtgactgtaagtttggaatttatggtaatagaagctttaaatatccca aacattacagtaaaggtggaaggagcccttcttaaagaagcaaggtcaaatattcatttt gaagttgaaatgaatcaacaggcagtctctgaagaatgccatttccaagttcagagaaga tgcagttatgccatcataacatcataa >gi568815588f:72592152_72985819|GENSCAN_predicted_peptide_4|72_aa MAVPIIKYQKVPSRDKLVFEEFCLRLVLLQMNRSTITLHADLRNRHKEVQQQTLIIQLNR YRATNRTRAFLS >gi568815588f:72592152_72985819|GENSCAN_predicted_CDS_4|219_bp atggcagtgccgatcataaaataccaaaaagtaccatcacgtgacaagttggtgtttgag gagttttgtctgcggctcgtcctgctacagatgaacagaagtactattaccctacatgca gacttgcggaacagacataaagaggttcagcaacaaaccttaataatacaactaaacagg tacagagctacaaacagaactagagcctttttatcttga >gi568815588f:72592152_72985819|GENSCAN_predicted_peptide_5|406_aa MGKGLEPTEENEEYREFQKANVVKYREKKKEQKEAKRVMVDLMRLGIGWEYKVKTLKWLW VLKPERLTKKFDVQLIEIGNIGVEQVWEHILLQGLRGLPEKKVKSVHQRIASWQNLGAVY CSTVVPSDDVTVVYQNGLPVISVRLPSRRERCQFTLKPISDSVGVFLRQLQEEDRGIDRV AIYSPDGVRVAASTGIDLLLLDDFKLVINDLTYHVRPPKRDLLSHENAATLNDVKTLVQQ LYTTLCIEQHQLNKERELIERLEDLKEQLAPLEKVRIEISRKAEKRTTLVLWGGLAYMAT QFGILARLTWWEYSWDIMEPVTYFITYGSAMAMYAYFVMTRQEYVYPEARDRQYLLFFHK GAKKSRFDLEKYNQLKDAIAQAEMDLKRLRDPLQVHLPLRQIGEKD >gi568815588f:72592152_72985819|GENSCAN_predicted_CDS_5|1221_bp atggggaaaggcctggagcccacagaggagaatgaagaatatagggaatttcaaaaagcc aatgtggtgaagtacagagaaaaaaagaaggaacaaaaagaagcaaaaagggtgatggta gacttgatgagacttggaattggctgggagtacaaagtgaagactctaaaatggctatgg gtcttaaagcctgagagactgaccaaaaagtttgacgtacaattaattgaaatcggaaac attggagtggagcaggtttgggaacatattttacttcagggacttagaggcttacctgag aaaaaagttaagtctgtacaccagaggatcgcttcctggcagaatttgggagctgtttat tgcagcactgttgtgccctctgatgatgttacagtggtttatcaaaatgggttacctgtg atatctgtgaggctaccatcccggcgtgaacgctgtcagttcacactcaagcctatctct gactctgttggtgtatttttacgacaactgcaagaagaggatcggggaattgacagagtt gctatctattcaccagatggtgttcgcgttgctgcttcaacaggaatagacctcctcctc cttgatgactttaagctggtcattaatgacttaacataccacgtacgaccaccaaaaaga gacctcttaagtcatgaaaatgcagcaacgctgaatgatgtaaagacattggtccagcaa ctatacaccacactgtgcattgagcagcaccagttaaacaaggaaagggagcttattgaa agactagaggatctcaaagagcagctggctcccctggaaaaggtacgaattgagattagc agaaaagctgagaagaggaccactttggtgctatggggtggccttgcctacatggccaca cagtttggcattttggcccggcttacctggtgggaatattcctgggacatcatggagcca gtaacatacttcatcacttatggaagtgccatggcaatgtatgcatattttgtaatgaca cgccaggaatatgtttatccagaagccagagacagacaatacttactatttttccataaa ggagccaaaaagtcacgttttgacctagagaaatacaatcaactcaaggatgcaattgct caggcagaaatggaccttaagagactgagagacccattacaagtacatctgcctctccga caaattggtgaaaaagattga >gi568815588f:72592152_72985819|GENSCAN_predicted_peptide_6|466_aa MVPRAAACHLLELLEMQVLRSPLDQKLGVEPSVQQALLVILMRTITLKRKELRLRDGAYI TTPQCILWSSTRLSGWRTAEHGDWGSERRGKEGMRKGEAKWGDEGGGWKRMEQERRGTSR PSSGTGSKRGCDSGAAVCRIGPGGLEDSTLGRKALDPCSAYISLNEPWRNTDHQLDESQG PPLCDNHVNGEWYHFTGMAGDAMPTFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACAS FNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHVYCGHFYDICDEDCHGSCSDTSECTC APGTVLGPDRQTCFGKKLIKDENECEQNNGGCSEICVNLKNSYRCECGVGRVLRSDGKTC EDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPREL VGGLELFLTNTSCRGVSNGTHVNILFSLKTCGTVVDVGSSWRALGE >gi568815588f:72592152_72985819|GENSCAN_predicted_CDS_6|1401_bp atggtccccagagcagcagcatgtcatctcctggagctgttagaaatgcaagttctccgg tcccctttggatcagaaactaggagtggagccaagtgttcaacaagccctcctggtgatt ctgatgagaaccatcactttaaagagaaaggaactgaggctcagagatggcgcctacatc acaactcctcagtgcatcctgtggtcaagtaccagactgagtggctggaggactgctgag catggtgactggggaagtgagagaagaggaaaagaagggatgagaaagggggaagcaaag tggggtgatgagggaggaggatggaagaggatggagcaggaaaggaggggcacttcccga ccaagctcaggcacagggtcgaaaaggggctgtgattcaggggcggctgtctgccggata ggacctggagggttggaggactcaaccctggggcgaaaggccctagatccttgttctgct tacatcagcctgaatgagccctggaggaacactgaccaccagttggatgagtctcaaggt cctcctctatgtgacaaccatgtgaatggggagtggtaccacttcacgggcatggcggga gatgccatgcctaccttctgcataccagaaaaccactgtggaacccacgcacctgtctgg ctcaatggcagccaccccctagaaggcgacggcattgtgcaacgccaggcttgtgccagc ttcaatgggaactgctgtctctggaacaccacggtggaagtcaaggcttgccctggaggc tactatgtgtatcgtctgaccaagcccagcgtctgcttccacgtctactgtggtcatttt tatgacatctgcgacgaggactgccatggcagctgctcagataccagcgagtgcacatgc gctccaggaactgtgctaggccctgacaggcagacatgctttggtaagaaactcatcaaa gatgaaaatgaatgtgagcaaaacaacggtggctgcagtgagatctgtgtgaacctcaaa aactcctaccgctgtgagtgtggggttggccgtgtgctaagaagtgatggcaagacttgt gaagacgttgaaggatgccacaataacaatggtggctgcagccactcttgccttggatct gagaaaggctaccagtgtgaatgtccccggggcctggtgctgtctgaggataaccacact tgccaagtccctgtgttgtgcaaatcaaatgccattgaagtgaacatccccagggagctg gttggtggcctggagctcttcctgaccaacacctcctgccgaggagtgtccaacggcacc catgtcaacatcctcttctctctcaagacatgtggtacagtggtcgatgtaggttcctcc tggagggcacttggggaatga >gi568815588f:72592152_72985819|GENSCAN_predicted_peptide_7|301_aa MTGSNPHVTILTLNVNGLNAPIKRHRVANWIKSPDPLADKDDHFKVDNDEDEHQLSLKTV SLRAGAKDELHIVEAEAMNYEGSPIKPTVSLGGFEITPPVILQLKCGLGTVHISGQHLVA VEEDAESEGEEEEDVKLLSITGKQSAPGGGSKVPQKKVKLAAEEDDDDDDDDDEEDEDDD EDNDDNDEDNEEAEEKAPVKKSIQDTPAKNVQKSNQNGKDSKPSSTPRSKGQESFKKQEK TPKTPKGPSSVEDIKAKMQASTEKGDSLPKVAAKFISYVKNCFRMTDQEAIQDLWQWRKS L >gi568815588f:72592152_72985819|GENSCAN_predicted_CDS_7|906_bp atgacaggatcaaatccacacgtaacaatactaaccttaaatgtaaatgggctaaatgcc ccaattaaaagacacagagtggcaaactggataaagagtccagacccattggctgacaaa gatgatcactttaaggtggataatgatgaagatgagcaccagttatctttaaaaacagtc agtttaagggctggtgcaaaggacgaattgcacattgttgaagcagaggcaatgaattac gaaggcagtccaattaaaccaacggtttctcttgggggctttgaaataacacccccagtg atcttacagttgaagtgtggtttagggacagtgcatattagtggacagcacttagtagct gtggaggaagatgcagagtcagaaggtgaagaggaggaggatgtgaaactcttaagtata actggaaagcagtctgcccctggaggtggtagcaaggttccacagaaaaaagtaaaactt gctgctgaggaagatgatgatgatgatgatgatgacgatgaagaagatgaagatgatgat gaagataatgatgataatgatgaagataatgaggaagctgaagaaaaagcaccagtgaag aaatctatacaagatactccagccaaaaatgtacagaagtcaaatcagaatggaaaagac tcaaaaccatcatcaacaccaagatcaaaaggacaagaatccttcaaaaaacaggaaaaa actcctaaaacaccaaaaggacctagttctgtagaagacattaaagcaaaaatgcaagca agtacagaaaaaggtgattctcttcccaaagtggcagccaagttcatcagttatgtgaag aattgcttccggatgactgaccaagaggctattcaagatctctggcagtggaggaagtct ctttaa >gi568815588f:72592152_72985819|GENSCAN_predicted_peptide_8|283_aa MVACPFLWELHPSGVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVTCEFPRLYT ISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPV VHVSGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKF VGKDHKAGSSGSYLQKGACRKHWLFDSTSAWEPMLSPDLGVKKEDAQEVFLHCRVLVCGV LDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED >gi568815588f:72592152_72985819|GENSCAN_predicted_CDS_8|852_bp atggtggcctgccccttcctctgggagctccatcccagtggggtggtgaatgacaagatt gtggccagcaacctcgtgacaggtctacccaagcagaccccggggagcagcggggacttc atcatccgaaccagcaagctgctgatcccggtgacctgcgagtttccacgcctgtacacc atttctgaaggatacgttcccaaccttcgaaactccccactggaaatcatgagccgaaat catgggatcttcccattcactctggagatcttcaaggacaatgagtttgaagagccttac cgggaagctctgcccaccctcaagcttcgtgactccctctactttggcattgagcccgtg gtgcacgtgagcggcttggaaagcttggtggagagctgctttgccacccccacctccaag atcgacgaggtcctgaaatactacctcatccgggatggctgtgtttcagatgactcggta aagcagtacacatcccgggatcacctagcaaagcacttccaggtccctgtcttcaagttt gtgggcaaagaccacaaggcaggaagctcaggcagttacctacagaaaggtgcttgcaga aaacactggttatttgacagcacaagtgcatgggagccaatgctatccccagaccttggt gtgaagaaggaagatgctcaagaagtgtttctgcactgccgggttcttgtctgtggagtg ttggacgagcgttcccgctgtgcccagggttgccaccggcgaatgcgtcgtggggcagga ggagaggactcagccggtctacagggccagacgctaacaggcggcccgatccgcatcgac tgggaggactag >gi568815588f:72592152_72985819|GENSCAN_predicted_peptide_9|117_aa MPRPGYKPQEPNGCGSYFLGLKMDLGIPAMTKCCNQLDVCYDTCGANKYRCDAKFRWCLH SICSDLKRSLGFVSKVEAACDSLVDTVFNTVWTLGCRPFMNSQRAACICAEEEKEEL >gi568815588f:72592152_72985819|GENSCAN_predicted_CDS_9|354_bp atgcccagacctggctacaagccccaagagcccaatggctgcggctcctatttcctgggt ctcaagatggacttgggcattccagcaatgacaaagtgctgcaaccagctggatgtctgt tatgacacttgcggtgccaacaaatatcgctgtgatgcaaaattccgatggtgtctccac tcgatctgctctgaccttaagcggagtctgggctttgtctccaaagtggaagcagcctgt gattccctggttgacactgtgttcaacaccgtgtggaccttgggctgccgcccctttatg aatagtcagcgggcagcttgcatctgtgcagaggaggagaaggaagagttatga >gi568815588f:72592152_72985819|GENSCAN_predicted_peptide_10|167_aa MKLASGFLVLWLSLGGGLAQSDTSPDTEESYSDWGLRHLRGSFESVNSYFDSFLELLGGK NGVCQYRCRYGECAVSLSVKTVAGGPMDSPREVTICLALFPVIHLELPDLADKNLSVPDW HQFAGLSQGSCPCLQPRQRQACCGEEGFVAHQGSRGCPLDLAPDADL >gi568815588f:72592152_72985819|GENSCAN_predicted_CDS_10|504_bp atgaagctggccagtggcttcttggttttgtggctcagccttgggggtggcctggctcag agcgacacgagccctgacacggaggagtcctattcagactggggccttcggcacctccgg ggaagctttgaatccgtcaatagctacttcgattcttttctggagctgctgggagggaag aatggagtctgtcagtacaggtgccgatatggtgagtgtgcggtttctctttctgtaaaa actgttgctggtggacccatggacagccccagagaggtcacgatctgtcttgcacttttt cctgttatccacttagagctgccagatttagcagataaaaatctctcagtccctgactgg catcaatttgctggactttctcaagggagctgtccctgcctgcagcctcggcaacgacag gcatgctgtggagaggaagggttcgtggctcaccagggatcccggggctgtcccctggac ttggcgccagatgcagatttgtga