GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:45:43 Sequence gi568815595r:107957477_108190908 : 233432 bp : 40.34% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1021 1016 6 1.05 1.03 Term - 5079 4681 399 0 0 50 28 315 0.779 15.83 1.02 Intr - 7816 7734 83 2 2 57 62 107 0.747 3.44 1.01 Init - 12418 12364 55 1 1 82 97 41 0.577 5.91 1.00 Prom - 14850 14811 40 -3.95 2.00 Prom + 17177 17216 40 -6.75 2.01 Init + 18198 18290 93 2 0 77 15 112 0.254 3.23 2.02 Intr + 24139 24279 141 0 0 48 78 97 0.778 4.33 2.03 Intr + 24436 24741 306 0 0 66 71 195 0.670 11.32 2.04 Intr + 26261 26426 166 1 1 58 68 87 0.468 2.31 2.05 Intr + 30409 30482 74 2 2 17 98 66 0.226 -1.29 2.06 Term + 31523 31639 117 1 0 86 37 81 0.309 0.36 2.07 PlyA + 34357 34362 6 1.05 3.00 Prom + 46712 46751 40 -2.85 3.01 Init + 46895 47059 165 1 0 73 41 92 0.829 2.69 3.02 Intr + 48699 48953 255 2 0 41 33 153 0.599 1.82 3.03 Intr + 49017 49137 121 2 1 15 45 143 0.657 1.85 3.04 Term + 51241 51335 95 2 2 34 49 170 0.794 4.71 3.05 PlyA + 52244 52249 6 1.05 4.16 PlyA - 53188 53183 6 1.05 4.15 Term - 55845 55780 66 0 0 102 48 47 0.097 -0.94 4.14 Intr - 57544 57408 137 2 2 85 86 84 0.060 7.27 4.13 Intr - 76752 76457 296 2 2 75 116 169 0.718 14.13 4.12 Intr - 81924 81765 160 1 1 38 38 142 0.101 2.32 4.11 Intr - 92175 92143 33 1 0 60 115 31 0.117 0.28 4.10 Intr - 100953 100861 93 1 0 123 94 67 0.996 9.82 4.09 Intr - 103376 103269 108 0 0 81 110 96 0.873 10.44 4.08 Intr - 113706 113617 90 1 0 119 106 -14 0.740 2.55 4.07 Intr - 122868 122515 354 1 0 131 91 273 0.909 26.24 4.06 Intr - 133551 133387 165 1 0 -13 101 189 0.312 9.11 4.05 Intr - 134034 133941 94 0 1 21 14 119 0.313 -3.58 4.04 Intr - 134335 134069 267 1 0 49 27 231 0.300 9.91 4.03 Intr - 139714 139610 105 1 0 68 37 120 0.786 4.39 4.02 Intr - 141229 141083 147 1 0 13 36 144 0.456 1.21 4.01 Init - 142736 142659 78 2 0 73 36 137 0.684 8.11 4.00 Prom - 144018 143979 40 -6.25 5.03 PlyA - 145335 145330 6 1.05 5.02 Term - 146580 146369 212 1 2 77 43 156 0.930 6.47 5.01 Init - 146905 146836 70 1 1 88 84 84 0.258 9.26 5.00 Prom - 152376 152337 40 -2.45 6.00 Prom + 160244 160283 40 -7.15 6.01 Init + 161055 161073 19 2 1 96 86 8 0.041 1.88 6.02 Intr + 170073 170253 181 1 1 77 37 125 0.174 4.30 6.03 Intr + 177832 177970 139 1 1 47 109 55 0.126 3.05 6.04 Intr + 178838 178984 147 1 0 102 39 148 0.864 10.81 6.05 Term + 193870 194130 261 0 0 34 45 180 0.180 2.84 6.06 PlyA + 195429 195434 6 1.05 7.05 PlyA - 197391 197386 6 1.05 7.04 Term - 205179 205001 179 1 2 79 41 110 0.968 2.27 7.03 Intr - 206253 206187 67 0 1 56 103 110 0.853 6.86 7.02 Intr - 208017 207955 63 0 0 83 71 60 0.681 1.80 7.01 Intr - 209509 209378 132 1 0 35 76 142 0.992 7.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:107957477_108190908|GENSCAN_predicted_peptide_1|178_aa MGICLLLEISPCTDGAGPGQWISPQHPDATYDCGPNDDKSPNLVALVRIIMGKPKEASDT TPSTLDNTLNQKQYCILEEMTEIRVHLEDQKDAGLEATPYPYLIDQSYIYKSHVASWYEF LGNRQFYQVAAPIVAAVQIVVSLLEQVNTVSNTCCETDKKYTYWFLPAVLGTELLKPS >gi568815595r:107957477_108190908|GENSCAN_predicted_CDS_1|537_bp atgggaatctgccttcttctggaaatcagtccctgcactgatggagctggtccagggcag tggatttccccacaacaccctgatgccacctatgactgtggcccaaatgatgataaaagt ccaaaccttgttgctttggtaagaattataatgggaaagcccaaagaagcctctgatact actccatcaactctggacaacacattaaatcaaaaacaatattgcatcctggaggaaatg acagagattagagtccatcttgaagatcaaaaggatgcagggttagaagccacaccctat ccctatttaattgaccagtcttacatctataaaagccatgtggcttcttggtatgaattt ttgggcaaccgacaattctaccaagtagcagccccaattgtagctgctgtacaaattgtg gtgtctttgctagagcaagttaacacggtgtctaatacatgctgtgaaactgataagaaa tacacatattggtttctgccagcagtccttggcacagagctcctcaagccttcataa >gi568815595r:107957477_108190908|GENSCAN_predicted_peptide_2|298_aa MPQGLQMPKKGPLLRHHHIMATSQQLKDVAAPHPEALESVTSVVHKARDIGKGERKYPPQ HTAKEQLCTAAVFPGTLEAFGQLPRTLSQQLLAHCSLTRPPLLLPRKSAGLCFPSPPPSH SVPPQATGMPEWQMAAKRKVKKEGRTGGREPERKHSEEGVYVHYPNGCQVPVDKDFNCVW PFLLITYGTGPNEADLVANLASAIEPAYLRKLLLNSTLLLLPGGNTAHPVHCRRLEKVSR DLCLAKTLEDDGVALRLVIQAQLREGEAARKLCFPLHLGAALFASQTEKNRALYDIFT >gi568815595r:107957477_108190908|GENSCAN_predicted_CDS_2|897_bp atgcctcagggcctgcagatgccaaagaaaggaccactcctcaggcaccaccacatcatg gcaacatctcagcagctcaaagacgtggcagctcctcatcctgaggcacttgaatctgtg acctctgttgtacacaaagccagggatatagggaaaggggagaggaaatatcctcctcaa cacactgcaaaagaacaactctgcactgctgctgtgtttcctggcacactggaggccttt ggccaacttccccgaaccctgtctcagcagcttctggctcattgctcgctgacacggcca cctctgctcctgccaaggaaatctgctggcctttgcttcccctccccacctccatctcac tcagttcctcctcaggccacaggcatgcctgagtggcaaatggcagcaaagaggaaggtg aaaaaggaaggaagaacgggaggaagggagcccgagagaaaacattcagaggagggagtt tatgtacattatccaaatggatgtcaggtccctgtcgacaaagattttaattgtgtgtgg ccctttctccttataacttatggaactggtccaaatgaggctgacttggtagccaatctg gcatccgctattgaacctgcatacctaaggaagttactgcttaactcaactctcctgctt ctgcctggagggaacactgcccatcctgtacactgccggaggctggaaaaggtgtcaaga gatttgtgcctggccaagaccttggaggatgatggtgttgcacttcgtctagttattcag gctcaactcagagaaggagaagctgcaagaaagctctgctttcccctgcatcttggggca gccctgtttgcctcacagacagagaagaaccgagccctatatgatattttcacctaa >gi568815595r:107957477_108190908|GENSCAN_predicted_peptide_3|211_aa MSQNLDGAVREDPRPLSSVTPGEKPPFYSISFLAPASAESYFHSVKPCSHSPSPRRSLID VVGVINSNLSPEIKEWRVRRIALTPLLILCCYMRLFGQAEAPHPQHRIWLQALDFLGHQD KILSVEKKMIWGSGIYPEQGCKLEKAQERRVGAGFTPVPAMPLSGALPVRNSSVGQDAQE CEKFSGAFQEDSGRGGSEGGFRRDRDPPIRY >gi568815595r:107957477_108190908|GENSCAN_predicted_CDS_3|636_bp atgagccagaatttggacggtgcggtcagagaagaccccaggccgctgagcagcgtgact ccaggggaaaaaccacctttttactccatctcctttctggctcccgcatctgctgagagc tacttccactcagtaaaaccttgcagtcattctccaagcccacgtagaagcctgatagat gtggtcggagtgattaatagcaacctgtctccagagataaaagagtggcgagtgagacgc attgccctgactccacttctaattctgtgttgttatatgcggctgtttggccaagcggag gcaccacacccacagcaccgtatctggctgcaagccctggatttcctgggtcaccaagat aagattctttcagtggaaaagaaaatgatctggggcagtggtatttacccagagcagggt tgtaagctggagaaggcacaggaaaggcgtgttggtgcaggtttcacccctgttcctgca atgccattatctggtgctctcccagtgagaaattcatcagtaggacaggatgcccaggaa tgtgagaaattctctggagccttccaggaagattcgggaagaggtggcagtgaaggggga tttcgcagggacagagacccccctattcgctattag >gi568815595r:107957477_108190908|GENSCAN_predicted_peptide_4|730_aa MREVAAPAAAVSVWERSNPGAVQDGLRRAHRNREDPVKTAAEPEVMQLKLRNAKGCQARK EGFLGAFGESVALRISLRKGKQLSECLDFSLVRLETEKPAAPTGLLTYRTVTMFTTVNGT CLINPDQRMKTLKNQVRKREENRIGKSRASRRAERRAREGRVEAPCGQVPDHRPALGVAA SGSGTASALDGRVRRKRALPDVGSRERRGLGRDGTRPLKREGGSESKEEKTWAVGPACDA RRRSVLPVTAAAAAAPDTCGGGGDPAAGAEMWPLVAALLLGSACCGSAQLLFNKTKSVEF TFCNDTVVIPCFVTNMEAQNTTEVYVKWKFKGRDIYTFDGALNKSTVPTDFSSAKIEVSQ LLKGDASLKMDKSDAVSHTGNYTCEVTELTREGETIIELKYRVVSWFSPNENILIVIFPI FAILLFWGQFGIKTLKYRSGGMDEKTIALLVAGLVITVIVIVGAILFVPAIGLTSFVIAI LVIQVIAYILAVVGLSLCIAAFKESKGMMNDGVFTLRKAAFHGHSQVSALAHRHGHPRRA GLWWVKCALRNLASNCLAQSNLPQRQRPEKRPSEGASTPAPASGRYEDPRKEIMLRKGDH TSVQPAPAPWHALKTQGLNQGQRHSPPGCEDSKGSQLHEEGYPECQIWSNSARTLQNSVW YVEALEPGRGLTPIFFTGEETPTIAPALPGTSIFSGAVPVWPRVDTLVKWPRTQAGSDAH IGFSASPFNK >gi568815595r:107957477_108190908|GENSCAN_predicted_CDS_4|2193_bp atgagggaagttgcagctcctgcagcagcagttagcgtgtgggagcggtcaaacccaggg gctgttcaagatggattaaggagagcacacagaaacagagaagaccctgtgaagacagcg gcagagcctgaagtgatgcagctcaagttaaggaatgccaaaggctgccaggctaggaag gaaggattcttaggagccttcggagagagcgtggccctcagaataagcctccggaaggga aagcagctcagcgaatgccttgatttcagtcttgtgagacttgaaacagagaaaccagct gcgcctactggacttctgacctacagaactgtaacaatgtttaccaccgtgaatggaact tgtttgattaaccctgatcagaggatgaaaacactaaagaaccaagtgagaaagagggaa gagaaccgcatagggaagagcagagcgagtagacgagccgaacgcagagcccgcgagggg cgagtggaagctccctgcgggcaggtacccgaccaccgccctgccctgggcgtggcggcc tcgggctcagggaccgcttcggcgctagacggccgcgtccggaggaaacgggcgctgccg gacgtcgggtccagggagagacgcgggctggggcgggacgggacccggcccctgaagcgc gagggtgggagtgaaagcaaagaggagaaaacctgggcagtgggtcctgcctgtgacgcg cggcggcggtcggtcctgcctgtaacggcggcggcggctgctgctccggacacctgcggc ggcggcggcgaccccgcggcgggcgcggagatgtggcccctggtagcggcgctgttgctg ggctcggcgtgctgcggatcagctcagctactatttaataaaacaaaatctgtagaattc acgttttgtaatgacactgtcgtcattccatgctttgttactaatatggaggcacaaaac actactgaagtatacgtaaagtggaaatttaaaggaagagatatttacacctttgatgga gctctaaacaagtccactgtccccactgactttagtagtgcaaaaattgaagtctcacaa ttactaaaaggagatgcctctttgaagatggataagagtgatgctgtctcacacacagga aactacacttgtgaagtaacagaattaaccagagaaggtgaaacgatcatcgagctaaaa tatcgtgttgtttcatggttttctccaaatgaaaatattcttattgttattttcccaatt tttgctatactcctgttctggggacagtttggtattaaaacacttaaatatagatccggt ggtatggatgagaaaacaattgctttacttgttgctggactagtgatcactgtcattgtc attgttggagccattcttttcgtcccagcgattggattaacctccttcgtcattgccata ttggttattcaggtgatagcctatatcctcgctgtggttggactgagtctctgtattgcg gcattcaaagaatcaaaaggaatgatgaatgatggtgtcttcacactgagaaaagcggcg ttccacggtcacagccaggtgagtgcactcgcccatcggcatgggcatccgaggagagca gggctgtggtgggtaaaatgcgctctgcgaaacctcgcctctaactgccttgctcaaagc aacctgccacagaggcaaaggccagaaaagcgaccttcagaaggagccagcaccccagct cctgcctcagggcgatatgaagaccccaggaaggaaattatgctgagaaaaggagatcac acctctgtccagcctgcaccggccccttggcatgccctgaagacacaaggcctaaatcaa ggtcaaagacacagccctcctggttgtgaagattccaaaggctcccagctgcatgaggag ggttacccagaatgtcagatttggagcaattcagcgagaacccttcaaaattcagtgtgg tatgtggaagctctagagcctggaagaggactcactcctatatttttcactggagaagag actcctacaattgcgcctgccctacctgggacctccatcttctcaggtgccgtgccagtc tggccaagagtggacaccttagtcaagtggccaaggacacaggctggatcggatgcccac attggattctcagcctcaccatttaataagtaa >gi568815595r:107957477_108190908|GENSCAN_predicted_peptide_5|93_aa MAVSCRHIEAIGIHVGLQGISTEAFLPSSNLSLANIITNCIWEGWSNAVWKVSLKNMITA EATGVDVILSQLQSHRILELEGELGNTESNSFT >gi568815595r:107957477_108190908|GENSCAN_predicted_CDS_5|282_bp atggctgtgtcctgcagacatatagaggccatcggaatccatgtggggctccagggcatc tctacagaggcatttcttccttccagcaacttatctctagccaacataatcaccaattgc atttgggaaggatggagtaacgcagtgtggaaggtctccctgaagaacatgatcacagca gaagccactggggtagacgtcattctctcacagttgcagagtcatagaatattagagcta gaaggggagttagggaacactgaatccaattcctttacctaa >gi568815595r:107957477_108190908|GENSCAN_predicted_peptide_6|248_aa MGLSGGRPPEETEIGSPTEWGSKRSKAKVKKREIGGSKCSGKGGSDRREKQREKSKCIKQ VAGLAELGHSGHVPGMEMSFMVGVLELEPKAPGLLWVPPESYILIHSHGVPAQVSQTQVL PVKGVQVLGALNKELDKTHKQSKKRKKQQKQRCIENENTLHRSVHIELASAGTGFLCPGR VRRCQTQSLASMHCILQGSPVAFGWSWEPSTEFYYESSRDSMRGKPPVDLLFFPLKVDGY MEAGGAGS >gi568815595r:107957477_108190908|GENSCAN_predicted_CDS_6|747_bp atgggcttgtcagggggacggccacctgaagaaacagagatagggtctcccacggaatgg ggcagcaaaagaagtaaagcaaaagtgaagaaaagagaaatagggggcagcaagtgttca ggaaaaggggggtcagataggagagagaagcaaagagaaaaatcaaagtgtataaagcaa gtggcgggactcgctgagcttggacacagtggtcacgttcctggcatggagatgtcattc atggtaggtgttcttgaattggagcctaaggctccaggacttttgtgggttcctccagaa tcctacatccttatacactcccacggggtgcctgcccaggtctctcagacccaagtgtta ccagtgaagggtgtccaggttcttggcgctttgaacaaagaattggacaaaactcacaaa caaagcaagaaaagaaagaagcaacaaaagcagagatgtattgaaaatgaaaacacactc cacaggtcagtgcacattgaactggcctctgcggggactggctttctgtgccctggcaga gttaggagatgtcaaacccagagccttgccagcatgcattgcatccttcagggtagtcct gtggcctttggttggtcttgggagcctagtacagagttttattatgagagctctagggac tcaatgagaggaaaaccacctgttgatcttttatttttccccctgaaagtggatggctac atggaagctggtggtgctggtagctaa >gi568815595r:107957477_108190908|GENSCAN_predicted_peptide_7|146_aa GFLDKLHNEITRTLEKISSREKYINNQLENLVQEYRAAQAQLSEAKERYQQGNGGVTERT RLLSEVMEELEKVKQEMEEKGSSMTDGAPLVKIKQSLTKLKQETVEMDIRIGIVEHTLLQ SKLKEKSNMTRNMHATVIPEPATGFY >gi568815595r:107957477_108190908|GENSCAN_predicted_CDS_7|441_bp ggatttttggacaaactccataatgaaattactaggactttggaaaagatcagcagccga gaaaagtacatcaacaatcagcttgagaatttggttcaagaatatcgtgcagctcaagcc cagctgagtgaggcaaaggagcgataccagcagggaaatggaggagtgacggaaagaacc agactcctctctgaggttatggaagaattagaaaaggtaaaacaagaaatggaagaaaag ggcagcagcatgactgatggtgctcctttggtgaagattaaacagagcttaacaaaactg aagcaagaaactgtagagatggacattagaattggcattgtggaacacacactactccaa tcaaagctgaaggagaagtccaacatgactaggaacatgcatgccacagttattccagaa ccagcaacaggcttttattaa