GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:07:03

Sequence gi568815585f:45020656_45383558 : 362903 bp : 42.37% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   7130   7348  219  1  0  104   42   147 0.977   7.76
 1.02 PlyA +   7585   7590    6                               1.05

 2.00 Prom +  10198  10237   40                              -5.75
 2.01 Init +  17665  17757   93  0  0  114   60    26 0.691   2.83
 2.02 Intr +  25927  26069  143  0  2   20   64   136 0.016   2.63
 2.03 Intr +  32333  32504  172  2  1   38   69   109 0.081   3.02
 2.04 Term +  33586  33837  252  0  0  120   38   187 0.566  11.55
 2.05 PlyA +  34165  34170    6                               1.05

 3.00 Prom +  40644  40683   40                              -6.75
 3.01 Init +  41082  41100   19  2  1   97   48    14 0.602  -1.39
 3.02 Intr +  44065  44257  193  2  1   85   78   126 0.549   8.93
 3.03 Intr +  57527  57783  257  2  2  130   70    66 0.323   5.16
 3.04 Intr +  63089  63167   79  0  1   68   94    76 0.199   3.89
 3.05 Intr +  85604  85774  171  2  0   82   54   118 0.490   5.94
 3.06 Intr +  99867 100066  200  0  2   92   90   190 0.832  17.67
 3.07 Intr + 102565 102696  132  2  0   63   28   148 0.418   6.00
 3.08 Intr + 109222 109388  167  2  2   -1  100   125 0.059   3.56
 3.09 Intr + 116078 116151   74  1  2   82  108    89 0.251   7.59
 3.10 Intr + 129115 129133   19  1  1  124   94     1 0.213  -0.00
 3.11 Intr + 131032 131176  145  0  1   67  111   100 0.874   9.13
 3.12 Intr + 135508 135569   62  2  2   53  119   -14 0.421  -4.27
 3.13 Term + 135962 136036   75  1  0   87   54    98 0.542   3.16
 3.14 PlyA + 136212 136217    6                               1.05

 4.05 PlyA - 137093 137088    6                               1.05
 4.04 Term - 137522 137372  151  1  1   82   48    81 0.061  -0.10
 4.03 Intr - 158465 158303  163  0  1   39   75   131 0.274   5.01
 4.02 Intr - 162552 162427  126  1  0   29   52   101 0.472   0.33
 4.01 Init - 168555 168480   76  0  1   83   99    46 0.967   6.50
 4.00 Prom - 168732 168693   40                              -3.65

 5.02 PlyA - 169082 169077    6                               1.05
 5.01 Sngl - 173912 173133  780  2  0   71   32   602 0.994  48.64
 5.00 Prom - 180939 180900   40                              -4.55

 6.00 Prom + 185636 185675   40                              -4.15
 6.01 Init + 185912 185957   46  1  1   68   41    13 0.004  -4.50
 6.02 Intr + 186769 186850   82  2  1   73   81   110 0.006   6.58
 6.03 Intr + 232216 232315  100  1  1   84  101    88 0.032   8.89
 6.04 Intr + 246578 246721  144  1  0   70   68   141 0.458   9.86
 6.05 Term + 262787 262906  120  1  0   88   48   136 0.624   7.09
 6.06 PlyA + 263326 263331    6                               1.05

 7.00 Prom + 269154 269193   40                              -6.15
 7.01 Init + 276379 276477   99  0  0   51  116   103 0.972   9.71
 7.02 Intr + 277517 277576   60  1  0   64  105    62 0.885   3.51
 7.03 Intr + 279860 279978  119  1  2   60   73    89 0.381   2.94
 7.04 Term + 280794 280863   70  0  1   56   49    64 0.196  -4.27
 7.05 PlyA + 280951 280956    6                               1.05

 8.00 Prom + 281290 281329   40                              -6.55
 8.01 Init + 295584 295841  258  2  0   77   75   132 0.979   7.98
 8.02 Term + 303097 303216  120  0  0  115   33   132 0.991   7.89
 8.03 PlyA + 303449 303454    6                               1.05

 9.15 PlyA - 308299 308294    6                               1.05
 9.14 Term - 315733 315671   63  1  0   90   44    78 0.517   0.51
 9.13 Intr - 318121 318005  117  1  0   34   75    89 0.740   1.94
 9.12 Intr - 318947 318842  106  1  1   44   99    75 0.992   3.50
 9.11 Intr - 319529 319339  191  2  2   83   97   299 0.722  27.76
 9.10 Intr - 320239 320057  183  1  0    7  113   185 0.932  11.96
 9.09 Intr - 320422 320334   89  2  2   68   39   121 0.085   3.97
 9.08 Intr - 336860 336781   80  1  2   40  116    70 0.043   3.28
 9.07 Intr - 350481 350360  122  0  2   40  105    60 0.117   1.27
 9.06 Intr - 350799 350601  199  2  1   70   43   158 0.184   7.93
 9.05 Intr - 356676 356389  288  2  0   73   94   152 0.144   9.74
 9.04 Intr - 357995 357858  138  1  0   19   94   130 0.087   5.36
 9.03 Intr - 359388 359185  204  2  0   69  110    89 0.291   6.59
 9.02 Intr - 360604 360401  204  0  0   74  110    87 0.592   6.89
 9.01 Intr - 362617 362419  199  2  1  103   94   137 0.966  13.29

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 358053 357858  196  1  1   95   94   108 0.859   9.65
S.002 Intr - 358717 358671   47  0  2   95   24    80 0.839  -0.57

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585f:45020656_45383558|GENSCAN_predicted_peptide_1|72_aa
ESKRSESLMDIHHKKLKSKAAEDKNKPQERIPFDRDKDLKVNRFDEAQKKALIKKSRELN
TRFSHGKGNMFL
>gi568815585f:45020656_45383558|GENSCAN_predicted_CDS_1|219_bp
gaatcaaaaagatcagaatctcttatggacatacatcataaaaagttaaagagtaaggct
gctgaagacaaaaataagcctcaagagagaataccatttgaccgtgataaagatctcaag
gttaatcggtttgatgaagctcagaaaaaagccctaataaaaaaatctagagaactaaac
accagattttcacacggcaaaggcaatatgtttttataa
>gi568815585f:45020656_45383558|GENSCAN_predicted_peptide_2|219_aa
MPSQSEVLIPSIGLGLGPIDGWAVYFGQAAQEVPTISALDACKCPEEAFVCGLILIAASQ
VFPPALSVYLLLPEELFGRSQAAAAEPPSRTPHENIGFMSPTGNVYRVWRMMAGAQGVGS
EVEKAGSPRLIHVIIQFLTPQKAFISFITQFLGARNRTDAAGLKEQRNLSEGYWMACRNL
RGPEPGTEALQPGTIAHKKALETLLCWTPQVPPLTTRDQ
>gi568815585f:45020656_45383558|GENSCAN_predicted_CDS_2|660_bp
atgcccagccaatcagaagttttgattccatctataggtttggggttgggcccaatagat
ggttgggccgtctattttgggcaagctgcccaggaggtgcctaccatctctgctctggat
gcttgcaaatgtcccgaagaggctttcgtatgtggtctaatactgatagctgcttctcag
gtttttccgccagcactgtctgtgtacctgcttctgcctgaagagctgtttggaaggtca
caagcagcagcagctgagccaccatcccgaactccacatgagaacatcggcttcatgagc
cccacaggcaacgtgtacagggtctggagaatgatggctggggcacaaggtgtgggttca
gaagtggagaaagcagggtcaccaaggttaatacatgtcattattcagttcctcacacct
caaaaggcttttatttcatttatcactcagtttttaggtgcaaggaacagaaccgacgct
gctggacttaaggagcaaaggaatttatcagaaggatattggatggcttgcaggaacctg
agaggcccagagccaggcactgaggctctgcagccaggaacaattgcccacaagaaagcc
ttagaaacacttctgtgctggaccccacaagttccaccactgacaacacgggatcaataa
>gi568815585f:45020656_45383558|GENSCAN_predicted_peptide_3|530_aa
MASNSLVLITVFHGVSEDCPCIAWPPSPCVSVEETGTLAPAVSAQQSAYGKAGAQDFLDE
GVNCPYLCFVKFTVRTTASHLLNVLSCGSNHVISEYDIILPRSQFKISRKKNSTECQAVA
PQTVDHMEQIQLPEVYSCNLWIQVATLERGNPDSCAGLILKRKLSKARSPSSLESSSHWN
IHSREHMSEQVRDLDGCFGCFQEQAACRPGSSAQVGVPATPEAPEGMLQCSFSSAVCGQC
VLAGKERRLFASQRGLSFVPDARSSALRLLGSLLHPARLHRLQTHGRARGTRLDRRQTEH
RSVASQAARLTKEDDHPDRGGLVFSQGYMYGCTPLLEPPKKLSRRAYGRPGRDLTDKMVF
KQGRKEFMGLPLQIPMGRVFRTEITANASLDGTSPQMFTGLCGLSKVPKYLSQQWAKASG
RGEVGKLRIAKTQGRTEVSFTLNEDLANIHDIGGKPASVSAPREHPFVLQSVGGQTLTVF
TESSSDIFGAKPCARIRLASCALRQLVSRPELEVILTAFGDESAVRKLED
>gi568815585f:45020656_45383558|GENSCAN_predicted_CDS_3|1593_bp
atggcgtccaatagcctagtgcttatcacagtttttcacggtgtgtctgaggactgcccg
tgcattgcctggccccccagcccatgtgtgagtgtggaggaaacaggaaccctggctcct
gctgtcagtgcccagcagagtgcttatggcaaagcaggggctcaggattttttggatgaa
ggagtgaattgtccatatctgtgtttcgtaaagttcacagttaggacaactgcttcacac
ttgttaaatgtgttaagttgtggttctaaccatgtgatatcagagtatgacataattctt
cctaggtcccagtttaaaatttccagaaaaaagaattctacagagtgtcaagcagttgcc
ccccaaactgtggatcatatggaacaaattcagctgccagaggtctactcctgtaaccta
tggatccaggtggcaaccctcgaaagagggaatcctgatagctgcgcagggcttatcctg
aaacgcaagctcagcaaggcaagaagcccatcctccttggagagctctagtcactggaac
attcacagcagggagcacatgagtgagcaagtgcgggatctggatggctgctttgggtgc
ttccaggagcaggctgcgtgcaggcctggcagcagtgcccaggttggggtgcctgcaacc
cctgaagctccagagggcatgttacagtgttcttttagctctgccgtctgtggacagtgt
gttctggcaggtaaggaacgccggctcttcgcctctcagcgcggcttgtcctttgttccg
gacgcccgctcctcagccctgcggctcctggggtcgctgctgcatcccgcacgcctccac
cggctgcagacccatggccgagcgcggggaactcgacttgaccggcgccaaacagaacac
aggagtgtggctagtcaagctgcacgcttgaccaaagaggatgaccatccggatagagga
ggactggtcttcagtcaagggtatatgtatggttgtactcccctgctagaacctccaaag
aagctttcaagaagagcttatggcaggccgggaagggacctcactgataagatggtcttt
aagcagggacgtaaagaatttatgggcttgcccttgcagatacccatgggaagagtgttc
cggacagaaataacagcaaatgcgtctttagatgggacgtcaccgcaaatgttcacaggt
ctgtgtggtttgagtaaggttcctaaatatttgtcacagcaatgggctaaagcctctgga
agaggtgaagttgggaaactgcggattgccaagactcaaggaaggactgaggtgtcattt
actttgaatgaggatcttgcaaatattcatgatattggtggaaaaccagcttcagtcagt
gctcctagagaacatccatttgtcttgcaaagtgttggaggacagacattaacagtattt
actgagagctcatcagatatttttggtgctaagccctgtgctagaataagattagcctcc
tgtgcactgaggcaactggtaagtcgaccagaattagaggttattttgacagcttttggt
gatgaatcggctgtgaggaaattggaagattga
>gi568815585f:45020656_45383558|GENSCAN_predicted_peptide_4|171_aa
MDEAGNHHSRQNITRTENQTPHVLTHEETGIPRVKQFACSHTATKQPSKCQALNHYVIMP
SDQILELNAAEVMLHQFQAQSLRDLAAFAFAFLENSHHVKKHGLDLLNDESTKEKPQRTE
QRSLSQHVGIMGAKIQDEIWVGTETNHIKWYPEMCICDSYYRNNKMTKYEK
>gi568815585f:45020656_45383558|GENSCAN_predicted_CDS_4|516_bp
atggatgaagctggaaaccatcattctcggcaaaatatcacaaggacagaaaaccaaaca
ccacatgttctcactcatgaagaaacagggatccccagagttaagcaatttgcctgcagt
cacacagcaaccaagcaaccttcaaagtgccaagctcttaaccactatgtgataatgcct
tcggaccagatcctggaactcaatgcagctgaagtgatgctgcaccagttccaagcccag
tccttaagagacttagcagcttttgcttttgctttcttggaaaacagccaccatgtaaag
aagcatgggctagacctactaaatgacgagagcactaaagagaaaccacaaaggacagag
caaaggtccctctcacaacatgtgggaattatgggagctaaaattcaagatgagatttgg
gtggggacagagacaaaccatatcaagtggtatccagaaatgtgtatatgtgattcctac
tacagaaacaacaaaatgaccaaatacgagaaataa
>gi568815585f:45020656_45383558|GENSCAN_predicted_peptide_5|259_aa
MERKINRREKEKEYEGKHNSLEDTDQGKNCKSTLMTLNVGGYLYITQKQTLTKYPDTFLE
GIVNGKILCPFDADGHYFIDRDGLLFRHVLNFLRNGELLLPEGFRENQLLAQEAEFFQLK
GLAEEVKSRWEKEQLTPRETTFLEITDNHDRSQGLRIFCNAPDFISKIKSRIVLVSKSRL
DGFPEEFSISSNIIQFKYFIKSENGTRLVLKEDNTFVCTLETLKFEAIMMALKCGFRLLT
SLDCSKGSIVHSDALHFIK
>gi568815585f:45020656_45383558|GENSCAN_predicted_CDS_5|780_bp
atggagcgtaaaataaacagaagagaaaaagaaaaggagtatgaagggaaacacaacagc
ctggaagatactgatcaaggaaagaactgcaaatccacactgatgaccctcaacgttggt
ggatatttatacattactcaaaaacaaacactgaccaagtacccagacactttccttgaa
ggtatagtaaatggaaaaatcctctgcccgtttgatgctgatggtcattatttcatagac
agggatggtctcctcttcaggcatgtcctaaacttcctacgaaatggagaacttctattg
cccgaagggtttcgagaaaatcaacttcttgcacaagaagcagaattctttcagctcaag
ggactggcagaggaagtgaaatccaggtgggagaaagaacagctaacacccagagagact
actttcttggaaataacagataaccacgatcgttcacaaggattaagaatcttctgtaat
gctcctgatttcatatcaaaaataaagtctcgcattgttctggtgtccaaaagcaggctg
gatggatttccagaggagttttcaatatcgtcaaatatcatccaatttaaatacttcata
aagtctgaaaatggcactcgacttgtactaaaggaagacaacacctttgtctgtaccttg
gaaactcttaagtttgaggctatcatgatggctttaaagtgtggctttagactgctgacc
agcctggattgttccaaagggtcaattgttcacagcgatgcacttcattttatcaagtaa
>gi568815585f:45020656_45383558|GENSCAN_predicted_peptide_6|163_aa
MTVSNHSHTFPGYKADKLSLEGIVVQRAECRPAASENYMRLKRLQIEESSKPVRLSQQLD
KVVTTNYKPVANHQYNIEYERKKKEDGKRARADKQHVLDMLFSAFEKHQYYNLKDLVDIT
KQPVVYLKEILKEIGVQNVKGIHKNTWELKPEYRHYQGEEKSD
>gi568815585f:45020656_45383558|GENSCAN_predicted_CDS_6|492_bp
atgactgtttctaaccacagtcatacatttccagggtataaagctgataagctgtcattg
gaaggaatagtggtacaaagagctgaatgccgaccagctgccagtgaaaactacatgcga
ttaaaaagattgcaaatagaagagtcttccaaaccagtgaggctatcacaacagctggac
aaagttgtaacaaccaattacaaacctgttgctaatcatcaatacaatatcgaatatgaa
aggaaaaagaaagaagacggaaagcgagctcgagctgataaacaacatgttttagacatg
ctattttcagcctttgagaaacatcaatactataatcttaaggacttggtggacatcaca
aagcaacctgtggtgtacctgaaggaaatcttaaaagaaattggtgttcagaatgtaaaa
gggatccacaaaaacacatgggagctgaagccagagtacagacactatcaaggagaagaa
aagagtgactaa
>gi568815585f:45020656_45383558|GENSCAN_predicted_peptide_7|115_aa
MSVTEEGEAISALMIHQMFEKAILSNFPTEGGKLLSSVTGDGETCPGLGKVVESYCHSIV
KITCISHISSYLTGNGHKAINGYEKPGTKEISYVCSDFLALQGVISAGDNSTGSN
>gi568815585f:45020656_45383558|GENSCAN_predicted_CDS_7|348_bp
atgagtgtcaccgaagaaggagaagccatttctgctttgatgattcaccaaatgtttgag
aaagcaattttgagtaactttcctactgaaggagggaagctgttgtcatcagtaactggg
gatggagaaacttgtcctggtttgggtaaagtggttgagagctactgccactcaatagtg
aagataacctgcatttcacacatttctagctacctcacaggaaatggccataaagccatt
aacgggtatgaaaagccggggaccaaggaaatctcatatgtctgctccgatttccttgcc
cttcagggtgtgatttctgcaggcgataacagcactgggagtaactga
>gi568815585f:45020656_45383558|GENSCAN_predicted_peptide_8|125_aa
MDSDRGRGGRGGGGERKRGRRGRRGRKKSGRKEEEEEEEEEALTKEAAFIWIGKHKQALP
GRGWQRGHCRRQNGTGSKGLGAEKWRALNGLDDAHSHWPSTPLSPPPEMLITSGNTLTDT
TRNNS
>gi568815585f:45020656_45383558|GENSCAN_predicted_CDS_8|378_bp
atggattctgacagaggaagaggaggaagagggggaggaggagagaggaagagggggagg
agggggaggagggggaggaagaaaagtgggagaaaggaggaggaggaggaagaggaggag
gaagctctcacaaaggaagcagcatttatctggattggtaaacataagcaggctctgcca
ggtcgaggatggcagaggggtcattgcaggaggcaaaatggcaccggcagcaaggggttg
ggtgcagagaagtggagggccctcaatggattagatgatgcccactcacactggccatct
actccactgagtccaccgccagaaatgctcatcacatctggaaacaccctcacagacaca
accagaaacaatagttaa
>gi568815585f:45020656_45383558|GENSCAN_predicted_peptide_9|727_aa
XPPSRILRSLGPTWRTVGIRSPRRKLQSADGRYLTWELLSDRGPNAAREQKESRSFAIYR
SLRAEGSLGPPSRILRTLGPTWRTVGIRCPRRKLKSADGRYLAWDLLSDRGPNAAREQKE
SRLFTIYRSLRAEGSLGPPSRILRTLGPTWRTVGIRCPRRKLKSADGRYLAWDLLSDRGP
NAAREQKESRLFTIYRSLRAEGRCPRRKLKSANGRYLTWDLLSDGSDAAREQKESRSFAI
YRSLRAEGRAPIAGGGGTPSKAGTESQPLIPHWLRTPMADPRNLKDPPGGLWVLGVQEES
SNLPTAGTLLGIYYPTAVQDRGPNAREQKESRSFAIYRSLRAEGSRLGEFPPMGPKPDHQ
NQQETSSRGSRARPIPMEPESLRVPPRNPCFSKALQIPVLSGNLETIHFTGRPLPDIRRN
RNCSNPLGGVCRGVKPRSSAPFAIQLPTAKWRPHCLPQEAFVNPVRSQPNHRLHDYREVA
IMIIYRDLISRESSLHYPYCRTRGSGVRLWCATEGQIPRAAAGAGNAGNGGARRTVMGGS
VYPADDEMFSDIYKIREIADGLCLEVEGKMVSRTEGNIDDSLIGGNASAEGPEGEGTEST
VITGVDIVMNHHLQETSFTKEAYKKYIKDYMKSIKGKLEEQRPERVKPFMTGAAEQIKHI
LANFKNYQFFIGENMNPDGMVALLDYREDGVTPYMIFFKDGLEMEKCMQEEHVLALSVCL
IVAAEIE
>gi568815585f:45020656_45383558|GENSCAN_predicted_CDS_9|2184_bp
ngacccccatcgcggatcctaagatccttaggacccacctggaggactgtgggtattagg
tctccaagaagaaagctccaatctgccgacggcaggtaccttacttgggagttactatcc
gacaggggtccgaacgcagcccgggaacagaaagaaagcaggtcatttgcaatctaccgg
agcctaagggcagaaggcagcttaggacccccatcaaggatcctaagaaccttaggaccc
acttggaggactgtgggtattaggtgtccaagaagaaagctcaaatctgctgacggcagg
taccttgcttgggatttactatccgacaggggtccgaacgcagcccgggaacagaaagaa
agcaggttatttacaatctaccggagcctaagggcagaaggcagcttaggacctccatca
aggatcctaagaaccttaggacccacctggaggactgtgggtattaggtgtccaagaaga
aagctcaaatctgctgacggcaggtaccttgcttgggatttactatccgacaggggtccg
aacgcagcccgggaacagaaagaaagcaggttatttacaatctaccggagcctaagggca
gaaggcaggtgtccaagaagaaagctcaaatctgccaacggcaggtaccttacttgggat
ttactctccgacgggtccgatgccgcccgggaacagaaagaaagcaggtcatttgcaatc
taccggagcctaagggcagaaggcagggcccccatcgcagggggggggggcacccccagc
aaggcggggactgagagccagcccctcatcccccactggcttaggacccccatggcggat
cccaggaaccttaaggacccacctggaggactctgggtattaggtgtccaagaagaaagc
tcaaatctgccgacggcaggtaccttacttgggatttactatccgacagcggtccaagac
aggggtccaaacgcccgggaacagaaagaaagcaggtcatttgcaatctaccggagccta
agggcagaaggcagtcgccttggggagttcccaccaatggggcccaaacctgatcatcaa
aatcaacaggaaacatcttcaagagggtccagggcccgcccaattcccatggaaccagaa
tctttgagagtgccgcccagaaatccgtgtttttcaaaagctctccagatcccagtactc
agcgggaacttggaaaccattcactttaccgggcgccctctgccggacattcggcgaaac
cgtaactgcagcaacccccttggcggggtgtgtaggggcgtcaagccgcgctcatcagcc
ccctttgcaatacagctccctactgctaagtggagaccacactgtcttccccaagaggca
tttgttaaccctgtgcggagccagccaaaccacaggctccatgactaccgggaagtcgcc
atcatgattatctaccgggacctcatcagccgtgagtcctcactgcactatccttactgc
cgcacacgggggtctggggtgcggctctggtgtgctacggaggggcagatcccgcgtgcg
gccgccggcgcgggaaatgcgggaaatggcggcgccaggcgcacggtgatgggcggctct
gtgtatccggcagacgatgagatgttctccgacatctacaagatccgggagatcgcggac
gggttgtgcctggaggtggaggggaagatggtcagtaggacagaaggtaacattgatgac
tcgctcattggtggaaatgcctccgctgaaggccccgagggcgaaggtaccgaaagcaca
gtaatcactggtgtcgatattgtcatgaaccatcacctgcaggaaacaagtttcacaaaa
gaagcctacaagaagtacatcaaagattacatgaaatcaatcaaagggaaacttgaagaa
cagagaccagaaagagtaaaaccttttatgacaggggctgcagaacaaatcaagcacatc
cttgctaatttcaaaaactaccagttctttattggtgaaaacatgaatccagatggcatg
gttgctctattggactaccgtgaggatggtgtgaccccatatatgattttctttaaggat
ggtttagaaatggaaaaatgtatgcaagaagaacatgtccttgcgctttccgtctgtcta
attgtggcagctgagattgaatag