GENSCAN 1.0	Date run:  3-Nov-116	Time: 13:20:29

Sequence gi568815575f:41234253_41447716 : 213464 bp : 46.20% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    307    302    6                               1.05
 1.05 Term -   6466   6448   19  0  1  120   45     2 0.074  -3.11
 1.04 Intr -  11824  11664  161  1  2   -5  106   116 0.120   2.89
 1.03 Intr -  19622  19426  197  0  2   61   55   253 0.216  18.43
 1.02 Intr -  22503  22451   53  2  2   83  115     6 0.748   1.35
 1.01 Init -  24435  24347   89  0  2   71  101    56 0.949   5.33
 1.00 Prom -  31622  31583   40                              -3.86

 2.03 PlyA -  36569  36564    6                               1.05
 2.02 Term -  50012  49837  176  0  2  112   39   109 0.804   6.32
 2.01 Init -  51809  51716   94  2  1   69   45   120 0.929   4.35
 2.00 Prom -  62469  62430   40                              -4.66

 3.00 Prom +  66969  67008   40                               2.14
 3.01 Init + 100341 100517  177  2  0   43   27   204 0.794   7.06
 3.02 Intr + 103156 103213   58  0  1   67   92    68 0.808   3.56
 3.03 Intr + 104784 104831   48  1  0   76  100    32 0.700   1.85
 3.04 Intr + 107232 107364  133  1  1   69  115    76 0.775   8.10
 3.05 Intr + 108243 108401  159  0  0   84   98   181 0.772  17.80
 3.06 Intr + 108969 109099  131  0  2   23  115    71 0.704   3.64
 3.07 Intr + 109485 109570   86  1  2   50   71     9 0.451  -5.06
 3.08 Intr + 109778 109876   99  1  0   35  106    88 0.470   5.61
 3.09 Intr + 109987 110147  161  0  2   74   55    88 0.816   2.89
 3.10 Intr + 110928 111072  145  0  1   32   84    78 0.859   2.08
 3.11 Intr + 111152 111296  145  1  1  101   92    40 0.972   5.56
 3.12 Intr + 111977 112158  182  0  2   47   39   140 0.862   4.59
 3.13 Intr + 112253 112370  118  1  1   17  111    92 0.881   4.44
 3.14 Intr + 112607 112760  154  0  1   44   78   109 0.883   4.63
 3.15 Intr + 113060 113199  140  2  2   64   57   117 0.731   6.31
 3.16 Intr + 128354 128497  144  0  0   88   59    31 0.079   0.45
 3.17 Term + 140750 140847   98  0  2   96   49    58 0.038   0.83
 3.18 PlyA + 143831 143836    6                               1.05

 4.06 PlyA - 144390 144385    6                               1.05
 4.05 Term - 169354 169172  183  1  0   92   48    51 0.871  -0.96
 4.04 Intr - 171597 171450  148  2  1   96   93    47 0.831   6.24
 4.03 Intr - 194413 193273 1141  2  1    9   11   410 0.030  13.73
 4.02 Intr - 198866 198779   88  2  1   88   99   -21 0.050  -1.46
 4.01 Intr - 211375 211233  143  2  2   95   99    52 0.643   7.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  77557  77774  218  2  2   29   38   146 0.844   1.01
S.002 Init - 174219 174164   56  0  2   59   84    84 0.917   5.86

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:41234253_41447716|GENSCAN_predicted_peptide_1|172_aa
MTLKKLGLLGGQGSSLPGSWTLESSMKEMRIVVKFGKTSIKFLSYKSEGSISSLRKTRLQ
SRRKGSGTGLVAKGASESRCRPRSGDCRRFAQSCSTREDAASKKYVNKPSLAQQTKSKAN
RHWSALKITPKHYEESFDELDRQMFQKILIANFSNKAKLTHKHTTTNSKLLL

>gi568815575f:41234253_41447716|GENSCAN_predicted_CDS_1|519_bp
atgacactgaagaagctaggacttcttggaggtcaaggttccagccttcctggttcttgg
acattagaatctagtatgaaggagatgcggattgttgtcaaatttgggaaaacctctatc
aagtttctttcatataaaagcgagggcagcatatcctcacttcggaagacccggctgcag
tcccggcgcaagggcagcggcaccggcttggttgctaagggggcttcggagtctcgctgc
cgtcctcgctcgggggactgtcggcgctttgcgcagtcgtgctccacgcgagaggacgcc
gccagcaagaaatatgtcaacaagcctagcctcgcgcagcaaacaaaaagtaaagctaac
cgccattggtcagccctgaaaatcacccccaagcattatgaagaatcatttgatgaacta
gataggcagatgttccagaagattcttattgctaatttcagcaacaaagccaagcttaca
cataaacacacaaccacaaactctaagctgcttttgtga

>gi568815575f:41234253_41447716|GENSCAN_predicted_peptide_2|89_aa
MEHLQASVSSGWAWARWAHCLSGRLALPAPGTSLALKVTCRDQDQQQHRDPDRNADSQVL
PQPYSETLGPEFPPGDSDACSSVSICIAL

>gi568815575f:41234253_41447716|GENSCAN_predicted_CDS_2|270_bp
atggagcacttgcaggccagcgtgagctctgggtgggcgtgggctcggtgggcccactgt
ctgagcggccggctggcgctgccagccccgggcacatccctggctctcaaggtgacctgc
agggaccaggaccagcagcagcatcgggatcctgatagaaatgcagattctcaggtccta
ccccagccttattcagaaactctgggcccagaattccctccaggggattctgatgcatgc
tccagtgtgagcatttgcatagctctctaa

>gi568815575f:41234253_41447716|GENSCAN_predicted_peptide_3|725_aa
MRESRLISDLGGFAGVRSAAGRDWRPFWLGGLRPSRLAQPGFPSGGPGGTRCLRACEQGF
AGLDLNSSDNQSGGSTASKGRYIPPHLRNREATKGFYDKDSSGWSSSKDKDAYSSFGSRS
DSRGKSSFFSDRGSGSRGRFDDRGRSDYDGIGSRGDRSGFGKFERGGNSRWCDKSDEDDW
SKPLPPSERLEHDVEMGEIIMGNIELTRYTRPTPVQKHAIPIIKEKRDLMACAQTGSGKT
AAFLLPILSQIYSDGPGEALRAMKENGRYGRRKQYPISLVLAPTRELAVQIYEEARKFSY
RSRVRPCVVYGGADIGQQIRDLERGCHLLVATPGRLVDMMERGKIGLDFCKYLVLDEADR
MLDMGFEPQIRRIVEQDTMPPKGVRHTMMFSATFPKEIQMLARDFLDEYIFLAVGRVGST
SENITQKVVWVEESDKRSFLLDLLNATGKDSLTLVFVETKKGADSLEDFLYHEGYACTSI
HGDRSQRDREEALHQFRSGKSPILVATAVAARGLDISNVKHVINFDLPSDIEEYVHRIGR
TGRVGNLGLATSFFNERNINITKDLLDLLVEAKQEVPSWLENMAYEHHYKGSSRGRSKSS
RFSGGFGARDYRQSSGASSSSFSSSRASSSRSGGGGHGSSRGFGGGLNSSVTSPERLSLT
SLTNYSNYKVPRKRYRKDIRPLTGLWASLFPMGGSAPLTSRLVTEKQGYVAQSILRVFKG
NTPSA

>gi568815575f:41234253_41447716|GENSCAN_predicted_CDS_3|2178_bp
atgcgcgaatcccgactgattagtgacctgggggggtttgcgggagtgcgcagcgcggcg
ggacgcgactggaggcccttttggcttggagggcttcggccttcacggctggcgcagcct
ggattcccgtccggaggacctggcggcacccggtgtttgcgtgcctgcgagcaagggttt
gctggcctagacctgaactcttcagataatcagagtggaggaagtacagccagcaaaggg
cgctatattcctcctcatttaaggaaccgagaagctactaaaggtttctacgataaagac
agttcagggtggagttctagcaaagataaggatgcgtatagcagttttggatctcgtagt
gattcaagagggaagtctagcttcttcagtgatcgtggaagtggatcaaggggaaggttt
gatgatcgtggacggagtgattacgatggcattggcagccgtggtgacagaagtggcttt
ggcaaatttgaacgtggtggaaacagtcgctggtgtgacaaatcagatgaagatgattgg
tcaaaaccactcccaccaagtgaacgcttggaacatgatgttgagatgggagaaattatc
atgggaaacattgagcttactcgttatactcgcccaactccagtgcaaaagcatgctatt
cctattatcaaagagaaaagagacttgatggcttgtgcccaaacagggtctggaaaaact
gcagcatttctgttgcccatcttgagtcagatttattcagatggtccaggcgaggctttg
agggccatgaaggaaaatggaaggtatgggcgccgcaaacaatacccaatctccttggta
ttagcaccaacgagagagttggcagtacagatctacgaggaagccagaaaattttcatac
cgatctagagttcgtccttgcgtggtttatggtggtgccgatattggtcagcagattcga
gacttggaacgtggatgccatttgttagtagccactccaggacgtctagtggatatgatg
gaaagaggaaagattggattagacttttgcaaatacttggtgttagatgaagctgatcgg
atgttggatatggggtttgagcctcagattcgtagaatagtcgaacaagatactatgcct
ccaaagggtgtccgccacactatgatgtttagtgctacttttcctaaggaaatacagatg
ctggctcgtgatttcttagatgaatatatcttcttggctgtaggaagagttggctctacc
tctgaaaacatcacacagaaagtagtttgggtggaagaatcagacaaacggtcatttctg
cttgacctcctaaatgcaacaggcaaggattcactgaccttagtgtttgtggagaccaaa
aagggtgcagattctctggaggatttcttataccatgaaggatacgcatgtaccagcatc
catggagaccgttctcagagggatagagaagaggcccttcaccagttccgctcaggaaaa
agcccaattttagtggctacagcagtagcagcaagaggactggacatttcaaatgtgaaa
catgttatcaattttgacttgccaagtgatattgaagaatatgtacatcgtattggtcgt
acgggacgtgtaggaaaccttggcctggcaacctcattctttaacgagaggaacataaat
attactaaggatttgttggatcttcttgttgaagctaaacaagaagtgccgtcttggtta
gaaaacatggcttatgaacaccactacaagggtagcagtcgtggacgttctaagagtagc
agatttagtggagggtttggtgccagagactaccgacaaagtagcggtgccagcagttcc
agcttcagcagcagccgcgcaagcagcagccgcagtggcggaggtggccacggtagcagc
agaggatttggtggaggactcaactcaagtgtcacttctcctgagaggctctccctgaca
tccttaactaactactctaattataaagttccaaggaagcgctacagaaaagacatccgt
cccctgacagggctctgggctagtctcttccctatgggaggttctgctcctttaacaagc
aggctggttaccgagaagcaaggatacgtggcacagagcatcctcagggtcttcaaaggc
aacacccccagtgcttag

>gi568815575f:41234253_41447716|GENSCAN_predicted_peptide_4|567_aa
XQWAWCSGTGIPLRKTFFSTCELKHFSQVENKEASETLGIKRKEPIHQLRVAIWTHQGQK
CSRRSPLDYFWEKISLPGKEEVKLSLFADDVIVYLENPIVSAQNLLKLISNFSKVSGYKI
NVQKSQAFLYTNNRQTESQIMSGHPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLSEIK
EDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNGIPIKLPMTFFTELEKTTLKFIWNQKR
ARITKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDVDQWNRTEPSEITPHIYN
YLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNIRPKT
IKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQP
TKWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKACPWHLCLSDKWPDNPVLGIPS
TAKPVQSTCKTQLEQSEPGARPSQLSLRGVTLLLFLNENSSQKEHLNVKPKTIKTLEENL
GTTIQGTGTGKDFMTKIANSNCNKSKN

>gi568815575f:41234253_41447716|GENSCAN_predicted_CDS_4|1704_bp
ngccagtgggcctggtgctctggcactgggatcccactaaggaagaccttcttctccacc
tgtgaactgaaacatttctcccaagtggaaaataaggaggcatcagagactcttggcata
aaaaggaaggagcccatccaccagttgagagtagccatatggacccatcagggccaaaaa
tgttcaaggagaagtccgctggattacttctgggaaaaaatttctcttcctggaaaagag
gaagtcaaattgtccctgtttgcagatgacgtgattgtatatctagaaaaccccatcgtc
tcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatc
aatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatc
atgagtggacacccattcacaattgcttcaaagagaataaaatacctaggaatccaactt
acaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcagtgaaataaaa
gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcatg
aaaatggccatactgcccaaggtaatttatagattcaatggcatccccatcaagctacca
atgaccttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga
gcccgcatcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacct
gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac
agagatgtagatcaatggaacagaacagagccctcagaaataacaccgcatatctacaac
tatctgatctttgacaaacctgagaaaaataagcaatggggaaaggattccctatttaat
aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt
acaccttatacaaaaattaattcaagatggattaaagacttaaacattagacctaaaacc
ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac
ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct
acaaaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatc
tacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaaggcatgt
ccctggcacctctgtctcagtgacaagtggccagacaatccagtgctggggattccgtct
acagccaagccagtgcagagcacctgcaagacgcagctggagcagagtgagccgggggca
aggccaagccagctgagcctgcgtggcgtcacccttctactattcctcaatgagaacagt
agtcaaaaagaacacttaaatgtaaaacccaaaactataaaaaccctagaagaaaatcta
ggcactaccattcagggcacaggcacaggcaaagatttcatgacaaaaattgccaacagc
aattgcaacaaaagcaaaaattga