GENSCAN 1.0    Date run: 5-Nov-116    Time: 20:02:23

Sequence gi568815596r:25061084_25264772 : 203689 bp : 49.23% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -    412    407    6                               1.05
 1.01 Sngl -  20227  19715  513  1  0   66   45   353 0.993  24.84
 1.00 Prom -  23576  23537   40                              -3.46

 2.00 Prom +  25904  25943   40                              -5.36
 2.01 Init +  27903  27950   48  2  0  107   46    70 0.591   3.73
 2.02 Intr +  30163  30318  156  0  0   98   90    19 0.327   3.31
 2.03 Intr +  31920  32047  128  2  2   91  106   204 0.157  21.98
 2.04 Intr +  42554  42704  151  2  1  102   99   301 0.250  32.76
 2.05 Intr +  60590  60711  122  1  2   86   89   194 0.993  18.59
 2.06 Intr +  65201  65248   48  2  0   99   85    29 0.703   1.50
 2.07 Intr +  67100  67249  150  2  0  100   99   119 0.955  13.48
 2.08 Intr +  68892  69026  135  0  0   97   84   140 0.999  14.18
 2.09 Intr +  69469  69547   79  1  1   86   80    58 0.999   4.35
 2.10 Intr +  70285  70420  136  0  1   96   82   261 0.806  26.44
 2.11 Intr +  70667  70828  162  0  0   73   41   425 0.997  36.25
 2.12 Intr +  71820  71931  112  1  1  109   72   145 0.985  14.44
 2.13 Intr +  72300  72351   52  0  1   76   99     8 0.969  -0.39
 2.14 Intr +  74384  74556  173  1  2   89   66   224 0.997  19.04
 2.15 Intr +  75440  75515   76  2  1   26   92   137 0.729   7.42
 2.16 Intr +  76258  76419  162  0  0   97   75   394 0.989  39.17
 2.17 Intr +  77976  78107  132  2  0   64   73   181 0.991  15.04
 2.18 Intr +  80283  80350   68  2  2   68   94    62 0.994   2.50
 2.19 Intr +  82652  82779  128  2  2  114   80   152 0.999  17.32
 2.20 Intr +  83877  83968   92  1  2   94  105    91 0.983  11.11
 2.21 Intr +  88611  88659   49  2  1  121  111    80 0.998  11.85
 2.22 Intr +  90816  90937  122  1  2   18  114   227 0.921  18.51
 2.23 Intr +  92629  92678   50  0  2  103  131    48 0.977   8.08
 2.24 Term +  93152  93257  106  2  1  120   38    47 0.905   0.78
 2.25 PlyA +  95091  95096    6                               1.05

 3.05 PlyA -  95213  95208    6                              -0.45
 3.04 Term -  95986  95834  153  1  0  109   48   114 0.966   7.42
 3.03 Intr -  96289  96166  124  0  1   99   84    21 0.628   3.39
 3.02 Intr - 100669 100002  668  1  2  107   44  1411 0.102 129.13
 3.01 Init - 103689 103558  132  0  0   74   92   210 0.990  18.14
 3.00 Prom - 105829 105790   40                              -5.76

 4.04 PlyA - 106225 106220    6                               1.05
 4.03 Term - 107346 107239  108  0  0   52   37    94 0.099  -0.79
 4.02 Intr - 107986 107844  143  2  2   98   93    44 0.033   5.97
 4.01 Init - 143511 143322  190  0  1   95   58   144 0.382   9.31
 4.00 Prom - 162790 162751   40                              -4.56

 5.21 PlyA - 163100 163095    6                               1.05
 5.20 Term - 169084 168976  109  0  1  115   44    81 0.732   4.38
 5.19 Intr - 173337 173200  138  2  0  146   47    94 0.810  10.68
 5.18 Intr - 174742 174624  119  1  2   73   75    67 0.900   3.16
 5.17 Intr - 175922 175853   70  1  1   96   82    48 0.791   4.18
 5.16 Intr - 178132 178047   86  1  2   91   77    72 0.821   5.02
 5.15 Intr - 179405 179219  187  1  1   68   65   213 0.600  16.69
 5.14 Intr - 179647 179553   95  1  2   94   49   139 0.582   9.36
 5.13 Intr - 180624 180479  146  1  2   52   50   273 0.960  19.90
 5.12 Intr - 182899 182815   85  1  1   63   94    60 0.960   3.49
 5.11 Intr - 183255 183072  184  2  1  104   59   202 0.996  18.69
 5.10 Intr - 183569 183457  113  2  2   83   94   254 0.999  24.68
 5.09 Intr - 184249 184170   80  2  2  114   97    60 0.999   8.77
 5.08 Intr - 184981 184937   45  2  0  133  110   129 0.999  17.98
 5.07 Intr - 185226 185077  150  1  0  110   53   265 0.984  25.33
 5.06 Intr - 185693 185537  157  2  1  100   94   255 0.999  26.88
 5.05 Intr - 186075 185968  108  0  0   82  109   222 0.782  24.18
 5.04 Intr - 186666 186508  159  0  0   78  103   125 0.999  13.18
 5.03 Intr - 187169 186954  216  2  0  105   83   280 0.974  27.90
 5.02 Intr - 193770 193630  141  0  0   40  100    85 0.914   5.45
 5.01 Init - 199773 199675   99  0  0  114  101    77 0.930  11.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 100669  99998  672  1  0  107   55  1437 0.898 136.06
S.002 Init + 104495 104639  145  1  1   54   70   114 0.867   6.48
S.003 Term + 104696 104826  131  0  2   92   43   107 0.917   4.94

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:25061084_25264772|GENSCAN_predicted_peptide_1|170_aa
MKAQVSADGRGKGTFESGLKGGVKIVFSPEEAKAVSSEMIGKKLFTKQMGEKGGICNQAL
KRKYPRGEDYFAIGMERSFQGPILIGSSHGGVNIADVAAETPDAIKEPVDIVEGIKKEQA
LRLEQKMGFPPNIVDSAAENMVKLYSLFLKYDATMIRNKSNGEDSDGAVL

>gi568815596r:25061084_25264772|GENSCAN_predicted_CDS_1|513_bp
atgaaggcgcaggtttcagctgatggtcgaggaaaaggaacatttgaaagtggcctcaaa
ggaggagtgaagatagttttctctccagaagaagcaaaagctgtttcctcagaaatgatt
gggaaaaagttgtttaccaagcaaatgggagaaaagggtggaatatgcaatcaagcactg
aagcgaaaataccctaggggagaagactactttgcaatagggatggagaggtcatttcaa
ggtcctatattaataggaagttcacatggtggtgtcaacattgcagatgttgctgctgag
actcctgatgcaattaaagaacctgttgatattgtagaaggcatcaaaaaggaacaagct
ctccggcttgaacagaagatgggatttccacctaatattgtggattccgcagcagaaaac
atggtcaagctttacagcctttttctgaaatatgatgcaaccatgataagaaataaatcc
aatggtgaagattcagatggagctgtgctgtga

>gi568815596r:25061084_25264772|GENSCAN_predicted_peptide_2|878_aa
MGPVWWRLLEASVLGKAALALAPTFPGPDQEALLTVFPILIPGVCGCCGALRPRYKRLVD
NIFPEDPEDGLVKTNMEKLTFYALSAPEKLDRIGAYLSERLIRDVGRHRYGYVCIAMEAL
DQLLMACHCQSINLFVESFLKMVAKLLESEKPNLQILGTNSFVKFANIEEDTPSYHRSYD
FFVSRFSEMCHSSHDDLEIKTKLNQYFHPSLLSICDYRIRMSGIKGLQGVVRKTVNDELQ
ANIWDPQHMDKIVPSLLFNLQHVEEAESRSPSPLQAPEKEKESPAELAERCLRELLGRAA
FGNIKNAIKPVLIHLDNHSLWEPKVFAIRCFKIIMYSIQPQHSHLVIQQLLGHLDANSRS
AATVRAGIVEVLSEAAVIAATGSVGPTVLEMFNTLLRQLRLSIDYALTGSYDGAVSLGTK
IIKEHEERMFQEAVIKTVGSFASTLPTYQRSEVILFIMSKVPRPSLHQAVDTGRTGENRN
RLTQIMLLKSLLQVSTGFQCNNMMSALPSNFLDRLLSTALMEDAEIRLFVLEILISFIDR
HGNRHKFSTISTLSDISVLKLKVDKCSRQDTVFMKKHSQQLYRHIYLSCKEETNVQKHYE
ALYGLLALISIELANEEVVVDLIRLVLAVQDVAQVNEENLPVYNRCALYALGAAYLNLIS
QLTTVPAFCQHIHEVIETRKKEAPYMLPEDVFVERPRLSQNLDGVVIELLFRQSKISEVL
GGSGYNSDRLCLPYIPQLTDEDRLSKRRSIGETISLQVEVESRNSPEKEERVPAEEITYE
TLKKAIALCVEVDSVAVEEQERERRRQVVEKFQKAPFEEIAAHCGARASLLQSKLNQIFE
ITIRPPPSPSGTITAAYGQPQNHSIPVYEMKFPDLCVY

>gi568815596r:25061084_25264772|GENSCAN_predicted_CDS_2|2637_bp
atgggccctgtgtggtggcggctcctggaggcctctgtgctggggaaggctgccctggcc
ctggctccaacctttcctgggcccgaccaggaggctttgctcacagtttttcccattctt
attccaggtgtgtgtggctgctgtggtgccctacgccccaggtacaaaaggctggttgac
aacatcttccctgaggatcccgaggatggtctggtgaagaccaacatggagaagctgacc
ttctatgccctctcagctccagaaaaacttgatcgtattggcgcctacctctctgagagg
ctcatccgtgacgtgggtcgccatcgatatgggtacgtgtgcattgctatggaggctttg
gaccagctgctcatggcctgccactgccagagcatcaacctcttcgtggagagcttcctc
aagatggtggccaagctgctggagtcagagaaacccaacctgcagatcctcggcaccaac
tcgtttgtgaagtttgccaacatcgaggaggacaccccgtcctatcaccggagctatgac
ttctttgtgtcccgattcagtgaaatgtgccactcgagccatgatgacttagaaatcaag
accaagctcaatcagtacttccaccccagcctcctgagtatctgtgactacagaattcga
atgtcaggcatcaaaggcctgcaaggggtggtgaggaagacggtgaatgatgaactgcag
gccaatatctgggacccacagcacatggataagatcgttccatcactgcttttcaatcta
cagcatgtagaggaggcagagagccggtctccctcacccctccaagcacctgagaaggag
aaagagagccccgcggagctggctgagaggtgtcttcgggagctgctgggccgggctgcc
tttggcaacatcaaaaacgccatcaagcctgttctcatccatctggataaccattctctt
tgggaacccaaggtgtttgccatccgttgctttaaaatcatcatgtactcaattcagccg
cagcactcacacctggtcatccagcagctcctgggccacctggacgccaacagccgcagc
gctgcgacggtgcgcgcgggcatcgtggaagtcttgtcggaagccgcggtcatcgctgcc
accggctctgtggggcccacagtactggagatgttcaacacgctgctgaggcagctgcgg
ctcagcatcgactacgcgctgaccgggagctacgacggggcggtcagcctcggcaccaag
atcatcaaggagcacgaggagcgcatgttccaggaggccgtcatcaagaccgtgggctcc
tttgccagcacgctgcccacctaccagcgctccgaggtgatcctcttcatcatgagcaag
gtcccgcggccatccctgcaccaggcggtggacacaggcaggacgggggagaataggaac
cgtctgacccagattatgctgctaaaatccctcctgcaggtatccacaggtttccagtgc
aacaacatgatgtcagccctgcctagcaacttcctggaccgccttctctccaccgccctc
atggaggatgcagaaattcgactctttgttctagagattctcatcagtttcattgatcgt
catggcaaccgccacaagttctctaccatcagtaccctcagtgacatctctgtcctgaag
ctgaaagtggacaagtgctctcgacaggacaccgtcttcatgaagaagcactcccagcag
ctctacagacacatctacctgagctgcaaggaggaaacaaacgtgcagaaacactacgag
gcgctctatggcttgctggccctcatcagcatcgagctggctaacgaggaggtggtggtg
gacctcatccgtctggtgctggctgttcaggacgtggcccaagtcaatgaggagaacttg
cctgtctacaaccgctgtgccctctatgctctgggcgcagcctacctgaacctcatcagt
cagctcacaacagtgcctgccttctgccagcacatccatgaggtgatagagaccaggaag
aaagaggctccatacatgctccccgaggatgtgtttgtggagaggcccaggctgtctcag
aatcttgatggggtggtcattgagctcctcttccgccagagcaagatcagtgaagtcctg
ggaggcagtggctacaactcggaccggctctgcctgccctacattcctcagctgacagat
gaggatcgtttatccaagaggaggagcattggagagaccatctccctgcaggtggaggta
gaatcgaggaacagtccggagaaggaggagcgagtgcctgccgaggagatcacctatgag
acactgaagaaagccattgctctctgtgtcgaagtggacagcgtagcagtggaggagcag
gagcgtgagcggcggcggcaggtggtggagaagttccagaaggcacccttcgaggagatt
gctgcacactgcggggcccgggcatcgctgctccagagcaaactcaatcagatctttgaa
atcaccatccggcccccaccaagcccatcaggaaccatcactgcagcctacggtcagccg
cagaaccactccatccccgtctatgaaatgaagtttcccgatctgtgtgtatactga

>gi568815596r:25061084_25264772|GENSCAN_predicted_peptide_3|358_aa
MPRSCCSRSGALLLALLLQASMEVRGWCLESSQCQDLTTESNLLECIRACKPDLSAETPM
FPGNGDEQPLTENPRKYVMGHFRWDRFGRRNSSSSGSSGAGQKREDVSAGEDCGPLPEGG
PEPRSDGAKPGPREGKRSYSMEHFRWGKPVGKKRRPVKVYPNGAEDESAEAFPLEFKREL
TGQRLREGDGPDGPADDGAGAQADLEHSLLVAAEKKDEGPYRMEHFRWGSPPKDKRYGGF
MTSEKSQTPLVTLFKNAIIKNAYKKGEACAGVRRLPTITLTFTFPILAYLARIQVLLHVS
RQPNTLLSDGDEGQKHLGISGSNAGSGKMLQGGDCQVPEPGLTGSAQDEAATGNPNVK

>gi568815596r:25061084_25264772|GENSCAN_predicted_CDS_3|1077_bp
atgccgagatcgtgctgcagccgctcgggggccctgttgctggccttgctgcttcaggcc
tccatggaagtgcgtggctggtgcctggagagcagccagtgtcaggacctcaccacggaa
agcaacctgctggagtgcatccgggcctgcaagcccgacctctcggccgagactcccatg
ttcccgggaaatggcgacgagcagcctctgaccgagaacccccggaagtacgtcatgggc
cacttccgctgggaccgattcggccgccgcaacagcagcagcagcggcagcagcggcgca
gggcagaagcgcgaggacgtctcagcgggcgaagactgcggcccgctgcctgagggcggc
cccgagccccgcagcgatggtgccaagccgggcccgcgcgagggcaagcgctcctactcc
atggagcacttccgctggggcaagccggtgggcaagaagcggcgcccagtgaaggtgtac
cctaacggcgccgaggacgagtcggccgaggccttccccctggagttcaagagggagctg
actggccagcgactccgggagggagatggccccgacggccctgccgatgacggcgcaggg
gcccaggccgacctggagcacagcctgctggtggcggccgagaagaaggacgagggcccc
tacaggatggagcacttccgctggggcagcccgcccaaggacaagcgctacggcggtttc
atgacctccgagaagagccagacgcccctggtgacgctgttcaaaaacgccatcatcaag
aacgcctacaagaagggcgaggcctgtgctggggtgaggagattacctactatcacactc
accttcacctttccgattctggcctatttggccagaatccaggttctactccatgtcagc
agacaacccaacactctactgagtgatggggatgaggggcagaaacatctggggatctca
ggttctaatgctggctctggaaagatgctccagggaggcgactgccaagtcccagaacca
gggctcacaggcagtgctcaggatgaagctgccacggggaaccctaatgttaagtag

>gi568815596r:25061084_25264772|GENSCAN_predicted_peptide_4|146_aa
MRARGFLENRAASAHPAGALELQEAISRRHRIRPNAVARPLRPSEPGGEALGAAPQGSRA
IAALEQREYDSPRLPHHSFQTMGKSEASPCADGDIYRQMRTRQMPAPARTQSAGSGSEGG
PGRLWLSAVPSSRQTFCADCGMRSRQ

>gi568815596r:25061084_25264772|GENSCAN_predicted_CDS_4|441_bp
atgagggcgcgcggctttcttgaaaacagagcagcctctgctcaccctgcaggcgctctg
gagctgcaggaggccatttctagaaggcaccggattcggcctaatgcggtggcgaggccg
ctgcgtccctcggagcccggtggggaggccctgggtgccgcgccccaaggctcgcgggcg
attgccgccctggaacagagagaatatgattccccacgacttccacatcacagtttccaa
acaatggggaaatcggaggcctccccgtgtgcagacggtgatatttaccgccaaatgcga
accaggcagatgccagccccagcacgcacgcagtcggccggctccggctccgaaggcgga
cctgggcgcctctggctctccgcggtcccgagttctcgacaaactttctgcgccgactgc
ggcatgagaagccgccagtag

>gi568815596r:25061084_25264772|GENSCAN_predicted_peptide_5|828_aa
MPSVEPFLTSINSLRPPRILHLSVRTPSQANCMSYVPRPQNVESKVFSRIIMEERVSYMR
QCLQKATKPEPETSFERRPEAEKKAKVIAGMNAVEENQGPGESQKVEEASPPAVQQPTDP
ASPTVATTPEPVGSDAGDKNATKAGDDEPEYEDGRGFGIGELVWGKLRGFSWWPGRIVSW
WMTGRSRAAEGTRWVMWFGDGKFSVVCVEKLMPLSSFCSAFHQATYNKQPMYRKAIYEVL
QVASSRAGKLFPVCHDSDESDTAKAVEVQNKPMIEWALGGFQPSGPKGLEPPEEEKNPYK
EVYTDMWVEPEAAAYAPPPPAKKPRKSTAEKPKVKEIIDERTRERLVYEVRQKCRNIEDI
CISCGSLNVTLEHPLFVGGMCQNCKNCFLECAYQYDDDGYQSYCTICCGGREVLMCGNNN
CCRCFCVECVDLLVGPGAAQAAIKEDPWNCYMCGHKGTYGLLRRREDWPSRLQMFFANNH
DQEFDPPKVYPPVPAEKRKPIRVLSLFDGIATGLLVLKDLGIQVDRYIASEVCEDSITVG
MVRHQGKIMYVGDVRSVTQKHIQEWGPFDLVIGGSPCNDLSIVNPARKGLYGRQPQLMAF
SSDLSEGTGRLFFEFYRLLHDARPKEGDDRPFFWLFENVVAMGVSDKRDISRFLESNPVM
IDAKEVSAAHRARYFWGNLPGMNRPLASTVNDKLELQECLEHGRIAKFSKVRTITTRSNS
IKQGKDQHFPVFMNEKEDILWCTEMERVFGFPVHYTDVSNMSRLARQRLLGRSWSVPVIR
HLFAPLKEYFACVPQRLLAIEHPIGKQTFQESTRDLLAFETGLGVIDW

>gi568815596r:25061084_25264772|GENSCAN_predicted_CDS_5|2487_bp
atgccctctgtggagccctttctgacttccatcaactccctgcgaccaccccgcatcctt
cacctctcagtccgcactccctcccaggccaactgcatgtcttatgtcccaagaccacag
aatgtggagtctaaggtgtttagccgcattataatggaagagcgggtatcgtacatgaga
cagtgtctgcaaaaggccaccaagccagaaccagaaacttcttttgagaggaggcctgaa
gctgagaagaaagccaaggtcattgcaggaatgaatgctgtggaagaaaaccaggggccc
ggggagtctcagaaggtggaggaggccagccctcctgctgtgcagcagcccactgacccc
gcatcccccactgtggctaccacgcctgagcccgtggggtccgatgctggggacaagaat
gccaccaaagcaggcgatgacgagccagagtacgaggacggccggggctttggcattggg
gagctggtgtgggggaaactgcggggcttctcctggtggccaggccgcattgtgtcttgg
tggatgacgggccggagccgagcagctgaaggcacccgctgggtcatgtggttcggagac
ggcaaattctcagtggtgtgtgttgagaagctgatgccgctgagctcgttttgcagtgcg
ttccaccaggccacgtacaacaagcagcccatgtaccgcaaagccatctacgaggtcctg
caggtggccagcagccgcgcggggaagctgttcccggtgtgccacgacagcgatgagagt
gacactgccaaggccgtggaggtgcagaacaagcccatgattgaatgggccctggggggc
ttccagccttctggccctaagggcctggagccaccagaagaagagaagaatccctacaaa
gaagtgtacacggacatgtgggtggaacctgaggcagctgcctacgcaccacctccacca
gccaaaaagccccggaagagcacagcggagaagcccaaggtcaaggagattattgatgag
cgcacaagagagcggctggtgtacgaggtgcggcagaagtgccggaacattgaggacatc
tgcatctcctgtgggagcctcaatgttaccctggaacaccccctcttcgttggaggaatg
tgccaaaactgcaagaactgctttctggagtgtgcgtaccagtacgacgacgacggctac
cagtcctactgcaccatctgctgtgggggccgtgaggtgctcatgtgcggaaacaacaac
tgctgcaggtgcttttgcgtggagtgtgtggacctcttggtggggccgggggctgcccag
gcagccattaaggaagacccctggaactgctacatgtgcgggcacaagggtacctacggg
ctgctgcggcggcgagaggactggccctcccggctccagatgttcttcgctaataaccac
gaccaggaatttgaccctccaaaggtttacccacctgtcccagctgagaagaggaagccc
atccgggtgctgtctctctttgatggaatcgctacagggctcctggtgctgaaggacttg
ggcattcaggtggaccgctacattgcctcggaggtgtgtgaggactccatcacggtgggc
atggtgcggcaccaggggaagatcatgtacgtcggggacgtccgcagcgtcacacagaag
catatccaggagtggggcccattcgatctggtgattgggggcagtccctgcaatgacctc
tccatcgtcaaccctgctcgcaagggcctctacggtagacagccccagctgatggctttc
tcttccgacctctcagagggcactggccggctcttctttgagttctaccgcctcctgcat
gatgcgcggcccaaggagggagatgatcgccccttcttctggctctttgagaatgtggtg
gccatgggcgttagtgacaagagggacatctcgcgatttctcgagtccaaccctgtgatg
attgatgccaaagaagtgtcagctgcacacagggcccgctacttctggggtaaccttccc
ggtatgaacaggccgttggcatccactgtgaatgataagctggagctgcaggagtgtctg
gagcatggcaggatagccaagttcagcaaagtgaggaccattactacgaggtcaaactcc
ataaagcagggcaaagaccagcattttcctgtcttcatgaatgagaaagaggacatctta
tggtgcactgaaatggaaagggtatttggtttcccagtccactatactgacgtctccaac
atgagccgcttggcgaggcagagactgctgggccggtcatggagcgtgccagtcatccgc
cacctcttcgctccgctgaaggagtattttgcgtgtgtgccccagagacttcttgctatt
gaacatcctattggaaagcaaactttccaagagagcacccgggacctgttagcttttgaa
acaggcctgggtgtgattgactggtag