GENSCAN 1.0    Date run: 8-Nov-116    Time: 14:30:09

Sequence gi568815597r:150525641_150729470 : 203830 bp : 44.71% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  11726  11765   40                              -3.16
 1.01 Init +  26649  26668   20  2  2   97   94     5 0.871   1.55
 1.02 Intr +  26903  26960   58  2  1  119  105     8 0.863   4.49
 1.03 Intr +  28351  28482  132  0  0   22  101    69 0.787   2.44
 1.04 Intr +  28725  28827  103  2  1  105  100   110 0.985  13.65
 1.05 Intr +  29789  29925  137  0  2   85   81   159 0.964  15.19
 1.06 Intr +  30522  30726  205  2  1   98   80   139 0.997  12.87
 1.07 Intr +  30981  31153  173  1  2   70   47   176 0.999  11.36
 1.08 Intr +  31299  31410  112  2  1  119   94    20 0.954   5.65
 1.09 Intr +  31854  31983  130  1  1   92   73    24 0.748   1.05
 1.10 Intr +  32305  32509  205  1  1   74  109   207 0.989  20.60
 1.11 Intr +  32833  33009  177  0  0  101  105   138 0.999  16.92
 1.12 Intr +  33322  33525  204  0  0  118   99    31 0.985   6.60
 1.13 Intr +  33647  33826  180  1  0   97   81    77 0.975   7.96
 1.14 Intr +  34121  34265  145  1  1   31   63    93 0.978   0.96
 1.15 Term +  34420  34556  137  2  2  111   48   117 0.995   8.18
 1.16 PlyA +  40014  40019    6                               1.05

 2.07 PlyA -  43104  43099    6                               1.05
 2.06 Term -  51851  51735  117  2  0  118   38    94 0.504   5.94
 2.05 Intr -  52851  52604  248  1  2   73  106   230 0.846  20.48
 2.04 Intr -  53945  53203  743  1  2   87  116   723 0.225  66.09
 2.03 Intr - 100168 100002  167  1  2  106   43   206 0.306  16.66
 2.02 Intr - 101952 101827  126  0  0  103   39   105 0.870   7.98
 2.01 Init - 103830 103774   57  0  0   89   72   118 0.970  11.61
 2.00 Prom - 105750 105711   40                              -2.16

 3.13 PlyA - 106150 106145    6                               1.05
 3.12 Term - 123108 122681  428  1  2  117   42   402 0.961  33.97
 3.11 Intr - 136288 136174  115  1  1   82   99    55 0.995   6.02
 3.10 Intr - 138123 137992  132  0  0  121   79   124 0.999  15.64
 3.09 Intr - 169210 169016  195  1  0   67   80   160 0.454  12.71
 3.08 Intr - 172135 172025  111  1  0   50   92    44 0.525   1.58
 3.07 Intr - 177753 177670   84  0  0   94   54    45 0.701   1.72
 3.06 Intr - 178554 178478   77  1  2  114   75    -5 0.671   0.03
 3.05 Intr - 178703 178637   67  2  1   38  116    50 0.710   1.28
 3.04 Intr - 181169 180913  257  0  2   67   84   116 0.944   6.26
 3.03 Intr - 182767 182616  152  0  2   67   74    93 0.686   5.61
 3.02 Intr - 184368 184248  121  1  1   58   87    23 0.031  -1.25
 3.01 Init - 198133 197923  211  1  1   42   50   140 0.430   4.55
 3.00 Prom - 198210 198171   40                              -3.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 140169 140131   39  0  0   73  101     9 0.858   0.89

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:150525641_150729470|GENSCAN_predicted_peptide_1|705_aa
MENWTGRPWLYLLLLLSLPQLCLDQEHPGAWLPLLSNGPHASSLWSLFAPSSPIPRCSGE
SEQLRACSQAPCPPEQPDPRALQCAAFNSQEFMGQLYQWEPFTEVQGSQRCELNCRPRGF
RFYVRHTEKVQDGTLCQPGAPDICVAGRCLSPGCDGILGSGRRPDGCGVCGGDDSTCRLV
SGNLTDRGGPLGYQKILWIPAGALRLQIAQLRPSSNYLALRGPGGRSIINGNWAVDPPGS
YRAGGTVFRYNRPPREEGKGESLSAEGPTTQPVDVYMIFQEENPGVFYQYVISSPPPILE
NPTPEPPVPQLQPGVWRPIFLCISRESGEELDERSCAAGARPPASPEPCHGTPCPPYWEA
GEWTSCSRSCGPGTQHRQLQCRQEFGGGGSSVPPERCGHLPRPNITQSCQLRLCGHWEVG
SPWSQCSVRCGRGQRSRQVRCVGNNGDEVSEQECASGPPQPPSREACDMGPCTTAWFHSD
WSSKCSAECGTGIQRRSVVCLGSGAALGPGQGEAGAGTGQSCPTGSRPPDMRACSLGPCE
RTWRWYTGPWGECSSECGSGTQRRDIICVSKLGTEFNVTSPSNCSHLPRPPALQPCQGQA
CQDRWFSTPWSPCSRSCQGGTQTREVQCLSTNQTLSTRCPPQLRPSRKRPCNSQPCSQRP
DDQCKDSSPHCPLVVQARLCVYPYYTATCCRSCAHVLERSPQDPS

>gi568815597r:150525641_150729470|GENSCAN_predicted_CDS_1|2118_bp
atggagaactggactggcaggccctggctgtatctgctgctgcttctgtccctccctcag
ctctgcttggatcaggagcacccgggcgcctggctgcccctgctgagcaacggcccccat
gccagctccctctggagcctctttgctcccagtagccctattccaagatgttctggggag
agtgaacagctaagagcctgcagccaagcgccctgcccccctgagcagccagacccccgg
gccctgcagtgcgcagcctttaactcccaggaattcatgggccagctgtatcagtgggag
cccttcactgaagtccagggctcccagcgctgtgaactgaactgccggccccgtggcttc
cgcttctatgtccgtcacactgaaaaggtccaggatgggaccctgtgtcagcctggagcc
cctgacatctgtgtggctggacgctgtctgagccccggctgtgatgggatccttggctct
ggcaggcgtcctgatggctgtggagtctgtgggggtgatgattctacctgtcgccttgtt
tcggggaacctcactgaccgagggggccccctgggctatcagaagatcttgtggattcca
gcgggagccttgcggctccagattgcccagctccggcctagctccaactacctggcactt
cgtggccctgggggccggtccatcatcaatgggaactgggctgtggatccccctgggtcc
tacagggccggcgggaccgtctttcgatataaccgtcctcccagggaggagggcaaaggg
gagagtctgtcggctgaaggccccaccacccagcctgtggatgtctatatgatctttcag
gaggaaaacccaggcgttttttatcagtatgtcatctcttcacctcctccaatccttgag
aaccccaccccagagccccctgtcccccagcttcagccgggtgtctggcgccccattttc
ctctgcatctcccgtgagtcgggagaggaactggatgaacgcagctgtgccgcgggtgcc
aggcccccagcctcccctgaaccctgccacggcaccccatgccccccatactgggaggct
ggcgagtggacatcctgcagccgctcctgtggccccggcacccagcaccgccagctgcag
tgccggcaggaatttggggggggtggctcctcggtgcccccggagcgctgtggacatctc
ccccggcccaacatcacccagtcttgccagctgcgcctctgtggccattgggaagttggc
tctccttggagccagtgctccgtgcggtgcggccggggccagagaagccggcaggttcgc
tgtgttgggaacaatggtgatgaagtgagcgagcaggagtgtgcgtcaggccccccgcag
ccccccagcagagaggcctgtgacatggggccctgtactactgcctggttccacagcgac
tggagctccaagtgctcagccgagtgtgggacgggaatccagcggcgctctgtggtctgc
cttgggagtggggcagccctcgggccaggccagggggaagcaggagcaggaactgggcag
agctgtccaacaggaagccggccccctgacatgcgcgcctgcagcctggggccctgtgag
agaacttggcgctggtacacagggccctggggtgagtgctcctccgaatgtggctctggc
acacagcgtagagacatcatctgtgtatccaaactggggacggagttcaacgtgacttct
ccgagcaactgttctcacctccccaggccccctgccctgcagccctgtcaagggcaggcc
tgccaggaccgatggttttccacgccctggagcccatgttctcgctcctgccaaggggga
acgcagacacgggaggtccagtgcctgagcaccaaccagaccctcagcacccgatgccct
cctcaactgcggccctccaggaagcgcccctgtaacagccaaccctgcagccagcgccct
gatgatcaatgcaaggacagctctccacattgccccctggtggtacaggcccggctctgc
gtctacccctactacacagccacctgttgccgctcttgcgcacatgtcctggagcggtct
ccccaggatccctcctga

>gi568815597r:150525641_150729470|GENSCAN_predicted_peptide_2|485_aa
MSQKQEEENPAEETGEEKQDTQEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK
GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAGKES
GSSPVFSARRRRRLAMFGLKRNAVIGLNLYCGGAGLGAGSGGATRPGGRLLATEKEASAR
REIGGGEAGAVIGGSAGASPPSTLTPDSRRVARPPPIGAEVPDVTATPARLLFFAPTRRA
APLEEMEAPAADAIMSPEEELDGYEPEPLGKRPAVLPLLELVGESGNNTSTDGSLPSTPP
PAEEEEDELYRQSLEIISRYLREQATGAKDTKPMGRSGATSRKALETLRRVGDGVQRNHE
TAFQGMLRKLDIKNEDDVKSLSRVMIHVFSDGVTNWGRIVTLISFGAFVAKHLKTINQES
CIEPLAESITDVLVRTKRDWLVKQRGWDGFVEFFHVEDLEGGIRNVLLAFAGVAGVGAGL
AYLIR

>gi568815597r:150525641_150729470|GENSCAN_predicted_CDS_2|1458_bp
atgtcccagaaacaagaagaagagaaccctgcggaggagaccggcgaggagaagcaggac
acgcaggagaaagaaggtattctgcctgagagagctgaagaggcaaagctaaaggccaaa
tacccaagcctaggacaaaagcctggaggctccgacttcctcatgaagagactccagaaa
gggcaaaagtactttgactcaggagactacaacatggccaaagccaagatgaagaataag
cagctgccaagtgcaggaccagacaagaacctggtgactggtgatcacatccccacccca
caggatctgccccagagaaagtcctcgctcgtcaccagcaagcttgcgggtaaggagtcg
gggtcttccccagttttctcagccaggcggcggcggcgactggcaatgtttggcctcaaa
agaaacgcggtaatcggactcaacctctactgtgggggggccggcttgggggccggcagc
ggcggcgccacccgcccgggagggcgacttttggctacggagaaggaggcctcggcccgg
cgagagatagggggaggggaggccggcgcggtgattggcggaagcgccggcgcaagcccc
ccgtccaccctcacgccagactcccggagggtcgcgcggccgccgcccattggcgccgag
gtccccgacgtcaccgcgacccccgcgaggctgcttttcttcgcgcccacccgccgcgcg
gcgccgcttgaggagatggaagccccggccgctgacgccatcatgtcgcccgaagaggag
ctggacgggtacgagccggagcctctcgggaagcggccggctgtcctgccgctgctggag
ttggtcggggaatctggtaataacaccagtacggacgggtcactaccctcgacgccgccg
ccagcagaggaggaggaggacgagttgtaccggcagtcgctggagattatctctcggtac
cttcgggagcaggccaccggcgccaaggacacaaagccaatgggcaggtctggggccacc
agcaggaaggcgctggagaccttacgacgggttggggatggcgtgcagcgcaaccacgag
acggccttccaaggcatgcttcggaaactggacatcaaaaacgaagacgatgtgaaatcg
ttgtctcgagtgatgatccatgttttcagcgacggcgtaacaaactggggcaggattgtg
actctcatttcttttggtgcctttgtggctaaacacttgaagaccataaaccaagaaagc
tgcatcgaaccattagcagaaagtatcacagacgttctcgtaaggacaaaacgggactgg
ctagttaaacaaagaggctgggatgggtttgtggagttcttccatgtagaggacctagaa
ggtggcatcaggaatgtgctgctggcttttgcaggtgttgctggagtaggagctggtttg
gcatatctaataagatag

>gi568815597r:150525641_150729470|GENSCAN_predicted_peptide_3|649_aa
MVETYCCLEKKIPFNILLLIVNALGHSRALMEVNMEINVVFVSANKKTILQPMDQEVILT
FKPYKLRNTFLKRNVVGGQGDNKEKVSNKHVSNRIYVIIKFKGRYYDWTYTKNQSNESSM
LSTDTKKASILLIRKIYILMQNLGPLPNDVCLTMKLFYYDEVTPPDYQPPGFKDGDCEGV
IFEGEPMYLNVGEVSTPFHIFKVKVTTERERMENIDSTILSPKQIKTPFQKILRDKDVED
EQEHYTSDDLDIETKMEEQEKNPASSELEEPSLVCEEDEIMRSKESPDLSISHSQVEQLV
NKTSELDMSESKTRSGKVFQNKMFQQPYSKNIHIHGFVSNKRYRECWYICDGRNVFILFK
KQVRMTTLTHRARRTEISKNSEKKMESEEDSNWEKSPDNEDSGDSKDIRLTLMEEVLLLG
LKDKEGYTSFWNDCISSGLRGGILIELAMRGRIYLEPPTMRKKRLLDRKVLLKSDSPTGD
VLLDETLKHIKATEPTETVQTWIELLTGETWNPFKLQYQLRNVRERIAKNLVEKGILTTE
KQNFLLFDMTTHPVTNTTEKQRLVKKLQDSVLERWVNDPQRMDKRTLALLVLAHSSDVLE
NVFSSLTDDKYDVAMNRAKDLVELDPEVEGTKPSATEMIWAVLAAFNKS

>gi568815597r:150525641_150729470|GENSCAN_predicted_CDS_3|1950_bp
atggttgagacctactgctgcttagaaaaaaagattcctttcaacatattactgctcatt
gtcaatgcacttggtcactcaagagctctaatggaggtgaacatggagattaatgttgtt
ttcgtgtctgctaacaaaaagaccattctgcagcccatggatcaagaagtaattttgact
ttcaagccttataagttaagaaacacatttttaaaaaggaatgtagtaggagggcagggt
gataataaggagaaggtcagcaacaaacatgtgagcaatagaatctatgtcataattaag
ttcaagggaaggtactatgactggacgtacactaaaaaccaaagcaacgaatctagcatg
ttgtctactgacaccaagaaagcaagcattctcctcattcgcaagatttatatcctaatg
caaaatctggggcctttacctaatgatgtttgtttgaccatgaaacttttttactatgat
gaagttacacccccagattaccagcctcccggttttaaggatggtgattgtgaaggagtt
atatttgaaggggaacctatgtatttaaatgtgggagaagtctcaacaccttttcacatc
ttcaaagtaaaagtgaccactgagagagaacgaatggaaaatattgactcaactatacta
tcaccaaaacaaataaaaacaccatttcaaaaaatcctgagggacaaagatgtagaagat
gaacaggagcattatacaagtgatgatttggacattgaaactaaaatggaagaacaggaa
aaaaaccctgcatcttctgaacttgaagaaccaagtttagtttgtgaggaagatgaaatt
atgaggtctaaagaaagtccagatctttctatttctcattctcaggttgagcagttagtc
aataaaacatctgaacttgatatgtctgaaagcaaaacaagaagtggaaaagtctttcag
aataaaatgtttcagcagccatattccaaaaacatacacatccatgggtttgtcagtaat
aagcgctatagggaatgctggtacatatgcgatggtagaaatgtattcatcctgtttaag
aaacaggtgagaatgaccactttaactcaccgggcccgtcgcactgaaataagcaagaac
tctgaaaagaagatggaaagtgaggaagacagtaattgggagaaaagtccagacaatgaa
gattctggagactctaaggatatccgccttactcttatggaagaagtattgcttctggga
ctaaaagataaagaggggtacacatctttctggaatgactgcatatcatcaggcctgcga
gggggcatcctgatagagctggccatgcggggtcgaatctatctggaacccccgaccatg
cgtaagaagcgactactagacagaaaggtactgctaaagtcagacagcccaacaggtgat
gttttactggatgaaactctgaaacacatcaaagcaactgaacccacagaaactgtccaa
acatggatagagctactcactggtgagacctggaaccccttcaaattacagtaccagctg
agaaatgtacgagagcgcatcgcaaagaacctagtagagaaaggtattctaaccactgag
aagcagaatttcctgctatttgacatgactactcatccagtgaccaatacaacagagaaa
cagcgactagtgaaaaaacttcaagatagtgtactagagcggtgggtaaatgaccctcag
cgtatggacaagcgaacactagcactcctggtgctagcccactcctctgatgtgctagag
aatgtcttctcctctctgacagatgacaagtatgatgtggcaatgaatcgagccaaggac
ttagtagaactggaccctgaagtggaagggacaaagcctagtgccacagaaatgatctgg
gctgtgctggcagccttcaataaatcttaa