GENSCAN 1.0    Date run: 3-Nov-116    Time: 23:26:40

Sequence gi568815596f:190559177_190790330 : 231154 bp : 39.61% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4184   4253   70  0  1  124   60    54 0.510   3.52
 1.02 Intr +  19178  19307  130  2  1   88   80    77 0.162   6.68
 1.03 Term +  28950  29201  252  2  0   72   43    97 0.021  -1.75
 1.04 PlyA +  31607  31612    6                               1.05

 2.00 Prom +  40037  40076   40                              -1.65
 2.01 Init +  49063  49341  279  0  0   51   75   125 0.089   4.56
 2.02 Intr +  55987  56108  122  0  2   53   36    88 0.139  -1.53
 2.03 Intr +  56836  57148  313  1  1  136   39    81 0.028   3.26
 2.04 Intr +  77721  77849  129  2  0   91   84    38 0.087   3.67
 2.05 Intr +  78800  78851   52  1  1   59  106    45 0.071   0.96
 2.06 Intr +  79655  79791  137  0  2   65   37    89 0.016   0.87
 2.07 Intr +  81837  81907   71  2  2  135   45    29 0.009   0.36
 2.08 Intr +  89931  90520  590  0  2   48   26   391 0.076  20.24
 2.09 Intr +  96482  96638  157  0  1   64   62    74 0.220   0.55
 2.10 Intr +  99982 100819  838  1  1  102   72   625 0.513  52.76
 2.11 Intr + 111150 111283  134  2  2   35  108   149 0.987  10.12
 2.12 Intr + 113925 113976   52  0  1  118   64    39 0.878   2.49
 2.13 Intr + 124562 124651   90  1  0   75   92   115 0.956   9.87
 2.14 Intr + 126300 126462  163  2  1   66   83   176 0.792  13.53
 2.15 Intr + 128025 128141  117  1  0   74   94    15 0.122   0.22
 2.16 Intr + 145906 146001   96  2  0  105   74    60 0.485   5.36
 2.17 Intr + 152074 152185  112  2  1   44   91    67 0.113   1.12
 2.18 Intr + 154644 154739   96  0  0   38  100    59 0.197   0.31
 2.19 Term + 158602 158740  139  1  1   55   47   137 0.286   2.75
 2.20 PlyA + 159411 159416    6                               1.05

 3.04 PlyA - 159984 159979    6                               1.05
 3.03 Term - 162978 162906   73  2  1   82   45    72 0.323  -1.30
 3.02 Intr - 165815 165640  176  2  2    9   95   115 0.315   2.12
 3.01 Init - 170586 170503   84  0  0   67   31   119 0.360   4.97
 3.00 Prom - 175189 175150   40                              -6.65

 4.00 Prom + 176140 176179   40                              -5.75
 4.01 Init + 183409 183462   54  0  0  105   70    52 0.790   6.43
 4.02 Term + 185106 185165   60  2  0  112   48    71 0.856   2.33
 4.03 PlyA + 185239 185244    6                               1.05

 5.00 Prom + 192419 192458   40                              -3.65
 5.01 Init + 200837 200981  145  1  1   93  101    88 0.995  10.93
 5.02 Intr + 201082 201206  125  2  2   40  117   110 0.962   8.48
 5.03 Intr + 201689 201945  257  1  2   49   97   215 0.886  13.82
 5.04 Intr + 203102 203230  129  2  0   68  115    17 0.682   1.29
 5.05 Term + 211015 211219  205  1  1   68   50   143 0.247   4.36
 5.06 PlyA + 212597 212602    6                               1.05

 6.04 PlyA - 212822 212817    6                               1.05
 6.03 Term - 216141 215374  768  0  0  -34   41   279 0.108   3.41
 6.02 Intr - 218462 218306  157  1  1   12   -4   150 0.057  -2.71
 6.01 Init - 224601 224525   77  0  2   89   84    53 0.491   5.61

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  82286  82200   87  2  0  130   93    14 0.843   5.35

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:190559177_190790330|GENSCAN_predicted_peptide_1|150_aa
XRRFHTPTAQQTKLKNVLWAVVAGLAAGEEQAGTCRLMTFLLVTAPDYKCAEQDYSASIP
RPREQRQILNPVGDGSRKSQTFLQWDLVAKRELWPILEDFPEKEKLLWGTRDEALTRIQA
FNGLDYKMLVDDNEKIISQSFCLFLFLPLT

>gi568815596f:190559177_190790330|GENSCAN_predicted_CDS_1|453_bp
ngtagacgttttcacactcccactgctcagcagacaaagctcaagaatgtgctttgggcg
gtagtggcgggcctagctgcaggagaagaacaggcaggcacatgcaggctgatgactttc
ctgcttgtgacagctccagattacaaatgtgcagaacaagattacagtgccagcattccc
aggcccagggagcaaaggcagatcttaaatccagttggagatggcagtagaaagtctcag
acatttttgcagtgggacctggtggctaaaagggaactttggcctattttggaagatttt
cctgaaaaagaaaaactcctgtggggaaccagagatgaggcattgactaggatccaggct
ttcaatggcctagattataaaatgttggtggatgataatgagaaaataatctctcaatct
ttctgtctctttctctttcttcccctgacctaa

>gi568815596f:190559177_190790330|GENSCAN_predicted_peptide_2|1228_aa
MVPMCESSDAGNSYMPKRSYKVLHLSEKVKVLDLTRKEKKIAEINKIYKNKSFTREFVKK
NIFVLVTAPQTVNVMAIEYDKCLVKMKKAFSLQVKGSVPQDFPQIQMPITSSGLAPVLLT
EKLQIRGSHDSFLSCSLPHLQLCHVTVALLLKVCSLISGPSSCHFKTSPPETMSSSKGRK
QLLKFSSMCGWSEEGCWDKILREKPQPRGMGPNTEGQNLNQKHLYDEKVKKKKDDSKMNI
QTDKSLFLASCLNLFDGSPKGLAVFSKCGLGVPSGFRSFTKETNADISNTLGAREGGERT
RAADGSQAALDQFGTWWASFLQLNKDRNKQTGHQMWPRCQFCHWICPNYSIPTLPSVPRQ
EAGLEEFILEEPPPPPRPSERRRGGWAGRSGGPEVGGSRESAPTTFPSPLDGSGGVPAPA
AARGGRAGGRRFGVGAEQSSGRARQDRCSRWLSTYRSGRSPLPVRQVRDTFAAERPRANL
GRLSRGATRALWGRCPGEAGSARPPPEGEGGPGLVWAIPGDILCARAAGSAALTRAFSLR
ERLGCRVRFANGAGGVTRRSGGVRLTIKALCPIKKSSHPSLETVREILFSIAKQELTGSF
VGQTSSLLHLAQLESGRLNPSRVMAAALPRTLGELQLYRILQKANLLSYFDAFIQQGGDD
VQQLCEAGEEEFLEIMALVGMASKPLHVRRLQKALRDWVTNPGLFNQPLTSLPVSSIPIY
KLPEGSPTWLGISCSSYERSSNAREPHLKIPKCAATTCVQSLGQGKSDVVGSLALQSVGE
SRLWQGHHATESEHSLSPADLGSPASPKESSEALDAAAALSVAECVERMAPTLPKSDLNE
VKELLKTNKKLAKMIGHIFEMNDDDPHKEEEIRKYSAIYGRFDSKRKDGKHLTLHELTVN
EAAAQLCVKDNALLTRRDELFALARQISREVTYKYTYRTTKSKCGERDELSPKRIKVEDG
FPDFQDSVQTLFQQARAKSEELAALSSQQPEKVMAKQMEFLCNQAGYERLQHAERRLSAG
LYRQSSEEHSPNGLTSDNSDGQGERPLNLRMPNLQNRQPHHFVVDGELSRLYPSEAKSHS
SGGRRGQVSGALGIKEQDLREAVWMEAELAPRDCCSLLTWHNQGLQRGHLYGSSCSLLLW
AMSLLITAITRSMSNLQISLKRIENAGERSLQVTPTQYPSPKRELAARKVDPGNNNNNKN
QWGESPRVGADGRDCEVGGICGASGKFT

>gi568815596f:190559177_190790330|GENSCAN_predicted_CDS_2|3687_bp
atggtcccaatgtgcgagagtagtgatgctggcaattcgtatatgccaaagagaagctat
aaagtgcttcacttaagtgaaaaggtgaaagttcttgacttaacaaggaaagaaaaaaaa
attgctgaaattaacaaaatttataagaacaaatcttttacccgtgaatttgtaaagaaa
aatatatttgtgctggttacagcacctcaaactgtaaacgtcatggccatagagtatgat
aagtgcttagttaagatgaaaaaggcattcagtttgcaggttaagggctcagtcccacaa
gacttcccccaaattcagatgccaatcacaagctccgggttggcacctgtgctcctgacc
gagaagctacagatcagaggttcacacgactccttcctcagctgctctcttcctcaccta
caactttgtcatgtaactgttgcccttctgttaaaagtctgcagtttaatttcaggcccg
agttcctgccacttcaagacttcaccacctgagacaatgagcagcagcaaagggagaaag
cagctgctcaaattctccagtatgtgtggatggagtgaggagggatgctgggataaaata
cttagagaaaaaccccagcccaggggcatgggcccaaatactgaggggcaaaacctgaac
cagaagcacctatatgatgagaaagtgaagaaaaaaaaggatgacagtaaaatgaacatt
cagacagacaaatctctgttcttagcatcttgcttaaatcttttcgatggctccccaaaa
gggctagcagtgttctccaagtgtggtctaggggtcccttctggcttcagatccttcact
aaggaaacaaatgcagacatcagtaacacactgggtgcccgagagggaggtgaaagaacc
agagcagcagatgggtctcaagcagcactggaccagtttggcacctggtgggccagtttt
ctgcagctaaacaaggaccggaacaaacaaacagggcatcagatgtggcccagatgccag
ttctgtcactggatctgtcccaactactccatcccaacactcccatctgtcccacgtcaa
gaggctggtttagaggagtttatcctggaggagccgccgccgccgccgcggccaagcgag
cgccgtcggggcgggtgggcgggaagaagcggcgggcccgaggtgggggggagcagagag
agcgcgcccaccaccttcccttcccccctcgatgggagcgggggcgtcccggctcctgca
gccgccagaggagggagagccgggggccgtcgcttcggagttggggctgagcagtcctcg
gggagagcgcgccaagaccgctgcagccgctggctgagtacgtaccggagcggacggtcg
ccactcccggtccgccaagtgcgggacactttcgcggctgagcggccacgggcgaacttg
gggcggctgagtcgcggggccacgcgggcactttgggggcggtgtccgggggaagcgggc
tccgcgcggccgccgccggaaggagagggcggccccgggctcgtgtgggcgattcccgga
gacattctgtgtgccagggcggcggggagcgcggccctgactcgtgcattttccctccga
gaaaggttggggtgccgagtccgttttgctaacggggctgggggcgtcacacggcgctct
gggggtgtgcgattgacaattaaagccctctgtcctatcaaaaaatcctcacatcccagc
ttggagactgtcagggaaattttgttttccatagcgaaacaggagttgaccggtagtttt
gtgggccagactagctccctgcttcacttagcacagttggaatctggcaggttaaaccca
tccagagtaatggctgcggccttacccaggaccctgggggagttgcagctgtatagaata
ttacaaaaagccaatctactttcttattttgatgcctttatccaacaaggtggtgatgat
gtccagcaactctgtgaagcaggagaagaggagtttttggaaatcatggcactcgtgggc
atggctagcaagccccttcatgttagaaggctgcagaaggctttgagagactgggtcaca
aaccctgggcttttcaatcagccactgacttcccttcctgtcagtagcatacccatctat
aaattaccagagggatcaccaacatggctgggaatatcctgcagtagttatgaaaggagt
agcaatgcccgggaacctcatttaaaaatccccaaatgtgctgccaccacctgtgtgcag
agcttgggacaggggaagtcagatgtggttgggagcctagcactgcagagtgttggtgag
tccagactctggcaaggccaccatgccactgagagcgagcacagcctctccccagcagac
ctgggctcccccgcgtccccaaaggagagcagtgaggcgctggatgctgctgctgcgctc
tctgtggctgagtgtgtggagcggatggcccccacactgccaaaaagtgacttgaatgaa
gtgaaagagctgctaaaaaccaacaagaagttggccaaaatgattggtcacatctttgag
atgaacgatgatgatccacacaaagaggaggaaattcggaaatacagtgcaatatatggc
agatttgactcaaagaggaaggatgggaaacatctcacacttcatgagctcactgttaat
gaagcggctgctcaactctgtgtgaaggataatgccctgctgacaagaagagatgagctt
tttgccttggctcgacagatttctcgagaagtcacctataaatatacttacagaaccacc
aagtcaaaatgtggagaaagagatgaattatccccaaagagaattaaagtggaggatggg
tttccagatttccaggattctgtgcaaacactcttccagcaggctagagctaagagtgaa
gaacttgcagctcttagttcacagcagcctgaaaaggtgatggcaaagcagatggagttc
ctttgcaaccaagctggctatgagagactgcagcatgccgagaggaggttgtctgcaggg
ctttacaggcagagctcagaagagcacagtcctaacggcttgacttccgataactcagat
ggacaaggagaaagacctttgaatctccgaatgcctaatttacagaacagacaaccccat
cattttgtggtggatggggagctgagcagactttaccccagtgaggcaaagtcccactca
tcagggggtaggagaggtcaggtcagtggagccctggggataaaggaacaggatcttagg
gaagccgtatggatggaggcagagttggcacccagagactgctgttctttgctgacatgg
cacaatcaaggactccagcgtggccatctctatggctcatcatgttccctgcttctgtgg
gccatgagtctcctcattactgccattaccagatcaatgagtaatttgcagattagtctg
aaaagaattgagaatgctggtgagcgttctttacaggtcacacctacacagtatccctcc
cctaaaagagaactggctgccaggaaagtggaccctggtaacaacaacaacaataaaaac
cagtggggtgaatcaccaagggtaggagcagatggtagagactgtgaagtgggaggcatc
tgtggggcttctggaaaattcacataa

>gi568815596f:190559177_190790330|GENSCAN_predicted_peptide_3|110_aa
MQQEDALLEAQTRPSPDTEPADTLISKIQKGFERNSKEIKRKSSTSFQEFFPTGVTQDMC
DNMYEMLSIGEAQPRLYAQGFYSGLPLWRAQYPRLSPTQKQVTSLSLRNF

>gi568815596f:190559177_190790330|GENSCAN_predicted_CDS_3|333_bp
atgcagcaggaagatgccctcttggaagcacagacaaggccctcaccagacactgaacct
gctgacacattgatctccaaaattcaaaagggttttgagagaaacagcaaagaaatcaag
aggaaatcaagcacaagcttccaagagttctttcccactggggtcacccaggacatgtgt
gacaacatgtatgaaatgctgtctatcggggaagctcagccgagactctatgcccagggt
ttttattcagggctgcccctgtggagagctcagtacccaagactgtctcccactcagaaa
caggtcacaagtctgagcctccgaaacttctga

>gi568815596f:190559177_190790330|GENSCAN_predicted_peptide_4|37_aa
MEQLLLHSTEVSQQISDKSSDTFVLPPSSRGDSVKDK

>gi568815596f:190559177_190790330|GENSCAN_predicted_CDS_4|114_bp
atggagcagttgctgcttcacagcactgaggtgagtcagcagatcagtgacaagagctct
gacacatttgttctacctccttcatctcggggagacagcgtgaaggataaatga

>gi568815596f:190559177_190790330|GENSCAN_predicted_peptide_5|286_aa
MKSDPGSKLFSNGVALISYNKTDPDSQCVRNWWVLGLTDFKNEATDPRGVKLQTFAVSVT
ALKAVRLELFVPPGGLVASLASGLKLQTFPTQEPRWLHLVDPAPGLQIDGAACQSRVVRP
HSSALGWSMGLGAMEQGVALVKEARAAQEPTEGVGGSGMAGCRSRALPRGKAAKARKYRM
AVCFNSNTYFWHIPGSCLDFSGSNLKPQSPLPRLIWGLCSIIKPLVKDLGYSGTFILNHC
LGLAAQRTWCFSMAENRCVGQVCCNQQKAERESSLLISPENDPTTV

>gi568815596f:190559177_190790330|GENSCAN_predicted_CDS_5|861_bp
atgaagtctgacccaggctctaaactctttagcaatggggttgctctaattagctataat
aagactgatcctgactcacagtgtgtccggaattggtgggttcttggtctcactgacttc
aagaatgaagccacggatcctcgcggagtgaagctgcagaccttcgcggtgagtgttaca
gctcttaaggcggtgcgtctggagttgttcgttcctcccggtgggctggtggcctcgctg
gcttcaggattgaagctgcagaccttcccgactcaggagcccagatggcttcacctcgtg
gatcccgcaccggggctgcagatagatggagctgcctgccagtcccgcgttgttcgcccg
cactcctcagcccttgggtggtcgatgggactgggcgccatggagcagggggtggcgctc
gtcaaggaggctcgggccgcacaggagcccacggagggggtcggaggctcaggcatggcg
ggctgcaggtcccgagccctgccccgcgggaaggcagctaaggcccgtaaatatagaatg
gctgtttgctttaattctaacacatacttttggcacattccaggttcttgcttagacttc
tcaggctctaatttgaaaccccagagtcctttacccagactgatctggggcctatgttcc
atcatcaaaccactggtaaaagaccttggttattctgggactttcatcctcaaccactgc
ctggggctggctgcccagaggacatggtgcttttccatggctgaaaatagatgtgtgggg
caggtttgctgcaaccagcagaaagcagagagagaaagcagcttgctgatctcacctgaa
aacgaccctacaaccgtgtag

>gi568815596f:190559177_190790330|GENSCAN_predicted_peptide_6|333_aa
MEMKMMTVNVERKLYVLIGQDFPLSLRVPSETKLPEERSGSNICCSPIFAVLQHPLLIPR
QTGSGVELWQTPTDLQLRRRKSCHLQKASRRQEITKIRAELKEIETQKTLQKVNESRSWF
FEKINKIDRPLAGLIKKKREKNQIDTIKNGKGHITTDPTETQTTIREYYKHLYANKLENL
EEMDKFLDTYTLPRLNQEEVESFTRPITGSEIEAIINSLPTKKSPGPEGFTAEFHQRSKE
ELVPFLLKLFQSIGKEGILPHSFYEASIILISKPGTDTTKKENFRPISLMNSDAKILSKI
LANQIQQHIKILSTMIKWASSLACKAGSTYANQ

>gi568815596f:190559177_190790330|GENSCAN_predicted_CDS_6|1002_bp
atggaaatgaaaatgatgacagtgaatgtggaaagaaagttgtatgtgttgattggacag
gactttccactgagtctccgggtaccctctgagacaaaacttccagaggaacgatcaggc
agcaacatttgctgttcaccaatattcgctgttctgcagcatccgctgctgatacccagg
caaacagggtctggagtggaactctggcaaactccaacagacctgcagctgaggagaagg
aagagctgtcaccttcaaaaagctagcagaaggcaagaaataactaagatcagagcagaa
ctgaaggagatagagacacaaaaaacccttcaaaaagtcaatgaatccaggagctggttt
tttgaaaagatcaacaaaattgatagaccactagcaggactaataaagaagaaaagagag
aagaatcaaatagacacaataaaaaatggtaaagggcatatcaccactgatcccacagaa
acacaaactaccatcagagaatactataaacacctctatgcaaataaacttgaaaatcta
gaagaaatggataaattcctcgacacatacacactcccaagactaaaccaggaagaagtt
gaatcttttactagaccaataacaggctctgaaattgaggcaataattaatagcttacca
accaaaaaaagtccaggaccagaaggattcacagccgaattccaccagaggtccaaggag
gagctggtaccattccttctgaaactattccaatcaataggaaaagagggaatcctccct
cactcattttatgaggccagcattatcctgatatcaaagcctggcacagacacaacaaaa
aaagagaattttagaccaatatccctgatgaacagcgatgcaaaaatcctcagtaaaata
ctggcaaaccaaatccagcagcacatcaaaatcttatccaccatgatcaagtgggcttca
tccctggcatgcaaggctggttcaacatatgcaaatcaataa