GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:13:12 Sequence gi568815593r:65418478_65654907 : 236430 bp : 37.08% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.16 PlyA - 70 65 6 1.05 1.15 Term - 10262 10077 186 2 0 89 49 74 0.355 0.11 1.14 Intr - 34350 34230 121 2 1 61 87 129 0.714 9.68 1.13 Intr - 41861 41689 173 2 2 89 61 154 0.211 10.62 1.12 Intr - 42346 42290 57 1 0 94 51 54 0.578 0.46 1.11 Intr - 52633 52301 333 1 0 14 111 116 0.007 1.34 1.10 Intr - 62018 61835 184 1 1 -17 78 189 0.033 6.27 1.09 Intr - 67034 66955 80 2 2 86 66 3 0.021 -4.67 1.08 Intr - 68603 68416 188 0 2 26 67 115 0.127 1.69 1.07 Intr - 70142 69317 826 2 1 23 87 373 0.048 20.75 1.06 Intr - 71103 71015 89 1 2 59 108 47 0.373 2.57 1.05 Intr - 71658 71503 156 1 0 32 70 108 0.210 2.46 1.04 Intr - 72645 72491 155 2 2 81 55 100 0.310 4.79 1.03 Intr - 73386 73295 92 0 2 76 80 50 0.396 0.87 1.02 Intr - 97667 97600 68 0 2 112 72 65 0.032 5.01 1.01 Init - 121323 121221 103 0 1 84 74 51 0.137 3.76 1.00 Prom - 125281 125242 40 -5.85 2.03 PlyA - 127554 127549 6 -0.45 2.02 Term - 131455 130934 522 1 0 67 39 238 0.950 10.09 2.01 Init - 136430 136320 111 2 0 26 100 161 0.996 11.36 2.00 Prom - 142145 142106 40 -7.35 3.00 Prom + 142640 142679 40 -8.05 3.01 Init + 144834 145029 196 2 1 100 65 306 0.992 28.64 3.02 Intr + 149036 149127 92 0 2 86 56 57 0.607 1.09 3.03 Intr + 151373 151521 149 1 2 63 92 102 0.635 6.21 3.04 Intr + 153362 153809 448 2 1 56 94 316 0.977 21.42 3.05 Intr + 158402 158592 191 1 2 54 119 190 0.975 16.16 3.06 Intr + 160947 161136 190 0 1 62 80 140 0.644 9.27 3.07 Intr + 164561 164742 182 1 2 27 27 265 0.261 12.04 3.08 Intr + 166537 166618 82 1 1 68 115 -17 0.205 -2.28 3.09 Intr + 167522 167704 183 1 0 39 127 109 0.200 8.96 3.10 Term + 168776 168919 144 1 0 55 38 146 0.978 3.23 3.11 PlyA + 168947 168952 6 1.05 4.11 PlyA - 169050 169045 6 1.05 4.10 Term - 173471 173292 180 2 0 70 34 201 0.999 9.53 4.09 Intr - 176168 176044 125 0 2 62 95 54 0.677 2.88 4.08 Intr - 178703 178574 130 2 1 17 111 131 0.621 7.65 4.07 Intr - 186568 186434 135 1 0 61 90 152 0.961 12.54 4.06 Intr - 190981 190766 216 1 0 88 96 198 0.967 18.38 4.05 Intr - 192566 192384 183 2 0 30 22 215 0.996 8.16 4.04 Intr - 193404 193126 279 0 0 60 86 126 0.913 6.35 4.03 Intr - 195742 195621 122 2 2 72 57 121 0.993 6.69 4.02 Intr - 199778 199616 163 2 1 35 63 154 0.966 6.23 4.01 Init - 203557 203555 3 1 0 108 81 0 0.552 1.35 4.00 Prom - 204026 203987 40 -8.15 5.02 PlyA - 204175 204170 6 -0.45 5.01 Sngl - 205797 205465 333 0 0 88 43 240 0.904 15.49 5.00 Prom - 207950 207911 40 -7.15 6.00 Prom + 209236 209275 40 -6.65 6.01 Init + 211111 211462 352 0 1 61 32 168 0.543 5.87 6.02 Intr + 216824 216892 69 0 0 89 60 52 0.508 0.74 6.03 Intr + 217467 217566 100 1 1 63 86 101 0.985 5.65 6.04 Intr + 219219 219303 85 0 1 98 106 33 0.996 5.00 6.05 Intr + 228578 228705 128 1 2 69 75 131 0.239 8.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 55151 55100 52 2 1 89 97 31 0.866 5.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:65418478_65654907|GENSCAN_predicted_peptide_1|936_aa MAGSTWVLKEGASWQSLELMTSVQESHGAQAQRYYLYVEALTPSAIVFGDRILEDIKNQC LAIAHSRFKFLASSRQFPLQLPKARDFRRPLAFYQSNCTGERGMIRLFRDLLDTGSELTF IPGDPKHHCGPPVKVGAYGVKFLVVQWCGASQDIPSKVKDKLLHLPPPTTKKEGQRLVGL FGFWRQYIPHSAIKWGMHSGIPSSNRSGIYVIRLEQVLRAQICVCLSCMECLIHHHGILH SIASDQGTYFTAKEVQQYVHALGIHWSSPYHFEAAGLIERWNGLLKSQLQHQLGDNTLQG WRRVLQKAVYALNQCPIYGTVSPIARIHGSRNQGVEVEVAPLTITPSGPLEKFLLPVPAT LRSAGLKVLGPEWRVLPPGHTTMIPLNWKLILPPGHFELLLPLSQQAKKGVTVLPGVIGL DYQDEISLLLILKSVYSSLPDGGKEEYAWNTGDPLGPLLVLPCPVIKVNGKLQQPNPSRT KNDPYPSGMKVWVTLPDSKFFSFWTLGPTPVVCQRPLDLWSKAEGCTVGFATFEVLGLQL ASLLLSLQMAYCGISPYDRGAQSTTQGPRSAGTRGETGRQLHLQPRCRCNAFGPDANCLG HTRFIQIDLALLSGNGLLNVNFRINSTRAAQTVTLCLGEVEWWGRGELTIPIRVDQNGAF LSFTVKNDKHSRRRRSMDPIDPQQAVSKLFFKLSAYGKHFHLNLTLNTDFVSKHFTVEYW GKDGPQWKHDFLDNCHYTGYLQDQRSTTKVALSNCVGLVFGVLFVQWGKRVEDSTAEHGV IATEDEEYFIEPLKNTTEDSKHFSYENGHPHVIYKKSALQQRHLYDHSHCGVSGEQKRSV SIERFVETLVVADKMMVGYHGRKDIEHYILSVMNIINALESHHHCQTARYFVVYQGPQEN SNMQNKFLNGLQRKVSNLSVACKLRSHFGAWQLSTS >gi568815593r:65418478_65654907|GENSCAN_predicted_CDS_1|2811_bp atggctggttccacctgggttttaaaggaaggagcctcctggcagagccttgagcttatg acctcagtccaggagagccatggtgcccaggcacagcgatactatttatatgttgaagcc ctaacccccagtgcgatagtgtttggagacaggatcttggaagatattaagaatcagtgt cttgctattgcccactctcgtttcaaattcctggcctcaagccggcaattcccacttcag cttcccaaagcaagagatttcaggagacctctggccttttaccagagtaactgcactggg gaaaggggaatgatcagacttttcagggacttactggatactggctctgagctgacattt attccaggggacccaaaacatcattgtggtcctccagttaaagtaggggcttatggagta aaatttctagtggtccagtggtgtggggcctctcaagatattccttctaaggtgaaggat aagttgctgcatttgccccctcctacaaccaagaaagagggacaacgcctagtgggtcta tttggattttggaggcaatacattcctcattctgccataaagtggggcatgcacagcggc attccatcatcaaatcgaagtggtatatatgtgatcaggcttgagcaagtcctgagggca caaatatgcgtttgcctatcctgcatggaatgccttatccaccatcatggtattctgcat agcattgcctctgaccaaggcacatactttacagctaaagaagtgcagcagtatgttcat gctcttggaattcactggtcttctccttatcattttgaagcagctggattgatagaacgg tggaatggccttttgaagtcacaattacaacatcaactaggtgacaatactttgcagggc tggcgcagagttctccagaaggctgtgtatgctctgaatcagtgtccaatatatggtaca gtttctcccatagccaggattcacgggtccaggaatcaaggggtggaagtggaagtggca ccactcaccatcacccctagtggcccattagaaaaatttttgcttcctgttcccgcaaca ttacgttctgctggcctaaaagtcttaggtccagagtggagagtgctgccaccaggacac acaacaatgattccattaaactggaagttaatattaccacctggacactttgagctcctc ctacctttaagtcagcaggctaagaagggagttacagtgttgcctggggtgattggcttg gactatcaagatgaaatcagtctactactcatcttgaaatcagtctactcttccttacct gatggaggtaaggaagagtatgcatggaatacaggagatcctttagggcctctcttagta ttaccatgcccagtgattaaggtcaatgggaaactacaacaacccaatccaagcaggact aaaaatgacccatacccctccggaatgaaggtttgggtcactctaccagactccaagttc ttcagcttttggactcttggacctacaccagtagtttgccagaggcccttggacctttgg tcaaaggctgaaggctgcactgttggcttcgctacttttgaagttttgggacttcaactg gcttccttgcttctcagcttgcagatggcctattgtggaatttcaccttatgatcgtggc gcccagtcaaccacccaagggccgaggagtgcgggcacaaggggcgagactggcaggcag ctccacctgcagccccgctgccggtgcaacgcgtttgggccagatgccaactgtctgggt cacacgcgttttatccaaatagatttagcgctgctgtctggaaatgggctgctgaatgtt aactttaggatcaactcaaccagggcagctcagactgttaccctctgtctgggcgaagtg gaatggtgggggcggggggagctaactattccaataagggttgatcaaaatggagcattt ctcagctttactgtgaaaaatgataaacactcaaggagaagacggagtatggaccctatt gatccacagcaggcagtatctaagttattttttaaactttcagcctatggcaagcacttt catctaaacttgactctcaacacagattttgtgtccaaacattttacagtagaatattgg gggaaagatggaccccagtggaaacatgattttttagacaactgtcattacacaggatat ttgcaagatcaacgtagtacaactaaagtggctttaagcaactgtgttgggttggtattt ggagttctcttcgttcagtggggaaagagagtagaagattccactgcagagcatggtgtt attgctacagaagatgaagagtattttatcgaacctttaaagaataccacagaggattcc aagcattttagttatgaaaatggccaccctcatgttatttacaaaaagtctgcccttcaa caacgacatctgtatgatcactctcattgtggggtttcgggtgaacagaagagatcagtg agcattgaacggtttgtggagacattggtagtggcagacaaaatgatggtgggctaccat ggccgcaaagacattgaacattacattttgagtgtgatgaatattataaacgctctagaa agtcatcatcattgccagacagcaaggtactttgttgtgtatcagggtccccaggaaaat tcaaacatgcaaaataaattcttaaatggtttgcagagaaaagtaagcaacttgagtgta gcatgtaaactgaggagccattttggtgcttggcagttaagtacttcttga >gi568815593r:65418478_65654907|GENSCAN_predicted_peptide_2|210_aa MNQEDLDPDSTTDVGDVTNTEEELIRECEEMWKDMEESPKVQEKFLKEIKDVALVNTQIM RKQNSLIAVIEKVLLVYLDDQTSHHSLSQSVIHNKALALFNSTKAERDEEAAEEKCEASR GWFMRFKERSHLHNIKMQGEAASTEAEASTSYPEDLAKILEGGYDKQQIFNVDRTALYWK KMPSRTLIVREKLMPGSKASKGQVDSLDKD >gi568815593r:65418478_65654907|GENSCAN_predicted_CDS_2|633_bp atgaatcaggaggatctagatccggatagtactacagatgtgggagatgttacaaatact gaagaagaacttattagagaatgtgaagaaatgtggaaagatatggaagaatctcccaaa gtgcaggaaaaatttttaaaggaaattaaagatgttgctctagtgaacacacaaattatg agaaaacagaatagccttattgctgttatagagaaagttttactagtctacttagatgat caaaccagccatcattccctaagccaaagcgtaatccacaacaaagccctagctcttttc aattctacaaaggctgagagagatgaggaagctgcagaagaaaagtgtgaagctagcaga ggttggttcatgaggtttaaggaaagaagccatctccataacataaaaatgcaaggggaa gcagcaagtactgaggcagaagcgtcaacaagttatccagaagatctagctaagatactt gaaggtggctatgataaacagcaaattttcaatgtagacagaacagccctctattggaag aagatgccatctaggactcttatagttagagagaagttgatgcctggctccaaagcttca aaaggacaggttgactctcttgataaggactag >gi568815593r:65418478_65654907|GENSCAN_predicted_peptide_3|618_aa MAAESGSDFQQRRRRRRDPEEPEKTELSERELAVAVAVSQENDEENEERWVGPLPVEATL AKKRKVLEFERVYLDNLPSASMYERSYMHRDVITHVNVLISFIEIGVIESIAVSSEGALF CSVGDDKAMKVFDVVNFDMINMLKLGYFPGQCEWIYCPGDAISSVAASEKSTGKIFIYDG RGDNQPLHIFDKLHTSPLTQIRLNPVYKAVVSSDKSGMIEYWTGPPHEYKFPKNVNWEYK TDTDLYEFAKCKAYPTSVCFSPDGKKIATIGSDRKVRIFRFVTGKLMRVFDESLSMFTEL QQMRQQLPDMEFGRRMAVERELEKVDAVRLINIVFDETGHFVLYGTMLGIKVINVETNRC VRILGKQENIRVMQLALFQGIAKKHRAATTIEMKASENPVLQNIQADPTIVCTSFKKNRF YMFTKREPEDTKSADSDRDVFNEKPSKEEVMAATQAEGPKRVSDSAIIHTSMGDIHTKLF PVECPKTVENFCVHSRNGYYNGHTFHRIIKGFMIQTGDPTGTGMGGESIWGGEFEDEFHS TLRHDRPYTLSMANAGSNTNGSQFFITVVPTPWLDNKHTVFGRVTKGMEVVQRISNVKVN PKTDKPYEDVSIINITVK >gi568815593r:65418478_65654907|GENSCAN_predicted_CDS_3|1857_bp atggcggcggaaagtggtagcgattttcagcagagacgtagaaggcgccgggacccggag gaaccggaaaaaacagaactcagcgaaagagagctggcagtagcagtggcggtgtcccag gagaacgatgaggagaacgaagagcgctgggttggacctttacctgtggaggcaacactg gccaagaagaggaaagtcttagagtttgaaagagtctatcttgataatctccccagtgca tccatgtatgagcgcagttacatgcatagagatgttatcacccatgtgaatgttttaatt tcatttattgaaataggagttattgagagtattgcagttagctctgagggagcattgttc tgttctgtgggtgatgataaagcaatgaaggtgtttgatgtagtgaactttgacatgatc aacatgctgaaacttggctattttcctggacagtgtgagtggatctattgcccaggggat gcaatttcttcagttgctgcttccgaaaagagtacaggaaaaattttcatttatgatggc cgaggagataaccagccacttcatatttttgacaaactccatacatcacctcttactcag atacggctaaacccagtttacaaagcagtagtgtcttctgacaaatctgggatgattgaa tactggactgggcctcctcatgaatataaattccccaaaaatgtgaactgggaatataaa actgacactgatttatatgaatttgccaagtgtaaggcttatccaaccagcgtatgtttt tcaccagatgggaagaaaatagctactattggttctgatagaaaagttagaattttcaga tttgtaactggaaaactcatgagagtctttgatgaatcactaagcatgtttactgaactg caacagatgaggcaacagttaccagacatggaatttggccgacgaatggctgtagaacgt gagttggagaaggttgatgctgtaagattaattaatatagtttttgatgaaactggacac ttcgtgctgtatggaacaatgctgggcattaaagttataaatgtagagacaaaccggtgt gtgcggattttaggcaaacaagaaaatattagagtgatgcaattggctttgttccagggg atagccaaaaagcatcgtgctgcaactactatagaaatgaaagcttctgaaaatcctgtt cttcagaatattcaagctgacccaacaatagtctgtacatcattcaaaaagaatagattt tatatgtttaccaaacgagaaccagaagatacgaaaagtgcagattctgatcgagatgtt tttaatgagaaaccttctaaagaagaagtcatggcagctactcaagctgaaggacctaaa cgagtttcggacagtgccattatccacaccagcatgggagacattcacaccaaacttttt cctgttgagtgccctaagacagtggaaaacttctgtgttcacagcagaaatggttattat aatgggcatacatttcaccgtataattaagggctttatgattcagactggagatccaaca ggtactggtatgggaggagaaagcatatggggaggagaatttgaagatgaatttcattca acattacgacatgacaggccatacacactcagcatggctaacgcgggatcaaatactaat ggatcccagtttttcataacggtagtaccaacgccttggcttgataataagcatacagta tttggacgagtgactaaaggaatggaagttgtacagaggatctccaacgtcaaagtcaat cccaaaacagataagccctatgaggatgtcagcatcataaatattactgtcaagtaa >gi568815593r:65418478_65654907|GENSCAN_predicted_peptide_4|511_aa MVLECGVCEDVFSLQGDKVPRLLLCGHTVCHDCLTRLPLHGRAIRCPFDRQVTDLGDSGV WGLKKNFALLELLERLQNGPIGQYGAAEESIGISGESIIRCDEDEAHLASVYCTVCATHL CSECSQVTHSTKTLAKHRRVPLADKPHEKTMCSQHQVHAIEFVCLEEGCQTSPLMCCVCK EYGKHQGHKHSVLEPEANQIRASILDMAHCIRTFTEEISDYSRKLVGIVQHIEGGEQIVE DGIGMAHTEHVPGTAENARSCIRAYFYDLHETLCRQEEMALSVVDAHVREKLIWLRQQQE DMTILLSEVSAACLHCEKTLQQDDCRVVLAKQEITRLLETLQKQQQQFTEVADHIQLDAS IPVTFTKDNRVHIGPKMEIRVVTLGLDGAGKTTILFKLKQDEFMQPIPTIAVVFVVDSSH RDRISEAHSELAKLLTEKELRDALLLIFANKQDVAGALSVEEITELLSLHKLCCGRSWYI QGCDARSGMGLYEGLDWLSRQLVAAGVLDVA >gi568815593r:65418478_65654907|GENSCAN_predicted_CDS_4|1536_bp atggtgctagagtgtggagtttgtgaagatgtcttttctttgcaaggagacaaagttccc cgtcttttgctttgtggccataccgtctgtcatgactgtctcactcgcctacctcttcat ggaagagcaatccgttgcccatttgatcgacaagtaacagacctaggtgattcaggtgtc tggggattgaaaaaaaattttgctttattggagcttttggaacgactgcagaatgggcct attggtcagtatggagctgcagaagaatccattgggatatctggagagagcatcattcgt tgtgatgaagatgaagctcaccttgcctctgtatattgcactgtgtgtgcaactcatttg tgctctgagtgttctcaagttactcattctacaaagacattagcaaagcacaggcgagtt cctctagctgataaacctcatgagaaaactatgtgctctcagcaccaggtgcatgccatt gagtttgtttgcttggaagaaggttgtcaaactagcccactcatgtgctgtgtctgcaaa gaatatggaaaacaccagggtcacaagcattcagtattggaaccagaagctaatcagatc cgagcatcaattttagatatggctcactgcatacggaccttcacagaggaaatctcagat tattccagaaaattagttggaattgtgcagcacattgaaggaggagaacaaatcgtggaa gatggaattggaatggctcacacagaacatgtaccagggactgcagagaatgcccggtca tgtattcgagcttatttttatgatctacatgaaactctgtgtcgtcaagaagaaatggct ctaagtgttgttgatgctcatgttcgtgaaaaattgatttggctcaggcagcaacaagaa gatatgactattttgttgtcagaggtttctgcagcctgcctccactgtgaaaagactttg cagcaggatgattgtagagttgtcttggcaaaacaggaaattacaaggttactggaaaca ttgcagaaacagcagcagcagtttacagaagttgcagatcacattcagttggatgccagc atccctgtcacttttacaaaggataatcgagttcacattggaccaaaaatggaaattcgg gtcgttacgttaggattggatggtgctggaaaaactactatcttgtttaagttaaaacag gatgaattcatgcagcccattccaacaattgctgttgtgtttgttgtagatagcagtcat agagacagaattagtgaagcacacagcgaacttgcaaagttgttaacggaaaaagaactc cgagatgctctgctcctgatttttgctaacaaacaggatgttgctggagcactgtcagta gaagaaatcactgaactactcagtctccataaattatgctgtggccgtagctggtatatt cagggctgtgatgctcgaagtggtatgggactgtatgaagggttggactggctctcacgg caacttgtagctgctggagtattggatgttgcttga >gi568815593r:65418478_65654907|GENSCAN_predicted_peptide_5|110_aa MATLVVNKLGAGVDSGRQGSRGTAVVKVRDLACSILHHRPPFGSSWRPGGIAAAQASIGG MRGLWAPFGITSLVLSSASFLSYGLKLPGLLLMTCHHRDICEKELAGQWQ >gi568815593r:65418478_65654907|GENSCAN_predicted_CDS_5|333_bp atggctaccctggttgtaaacaagctcggagcgggagtagacagtggccggcagggcagc cgggggacagctgtagtgaaggtgagggaccttgcgtgctctattctccaccatcggcct cccttcggttcatcctggagacctggcggcatcgccgcggcccaggcttcgataggtggg atgcgcggcctctgggccccttttggcattacaagtctcgtcctgtcctctgcatccttt ctctcctatggcctgaaattgcctggtcttctgctcatgacatgtcatcatcgggacatt tgtgaaaaggagcttgcgggccagtggcagtga >gi568815593r:65418478_65654907|GENSCAN_predicted_peptide_6|245_aa MTTEVILHYRPCESDPTQLPKIAEKAIQDFPTRPLSRFIPWFPYDGSKLPLRPKRSPPVI SEEAAEDVKQYLTISEHDAKSHSYDCTVDLLEFQPSLKKQHLTWSHTLKEQTNSGNLVMR LTKPTLFTNIPVTCEEKDLPGDLFNQLMRDDPSTVNGAEVLMLGEMLTLPQNFGNIFLGE TFSSYISVHNDSNQVVKDILVKADLQTSSQRLNLSASNAAVAELKPDCCIDDVIHHEVKE IGTHI >gi568815593r:65418478_65654907|GENSCAN_predicted_CDS_6|735_bp atgactacagaagtaatattacattatcgaccatgtgagagtgatcccacacaactgcca aaaattgcagaaaaagcaattcaagactttcctactcgtccgctatcaagatttatacct tggtttccatatgatgggtccaagcttccactcagacctaaaagatcaccacctgtgatt tctgaagaggcagctgaagatgtgaaacagtacttaaccatttcagaacatgatgctaag tcacacagttatgattgcacagtagatctattggagtttcaacctagcttgaaaaagcag catttaacctggtcacacacactgaaggaacagactaattctggaaatctggtgatgcgg ctgactaagcctactttattcaccaatatcccagtaacatgtgaagagaaagacttacct ggagatctctttaaccagctgatgagagatgatccttcaaccgttaatggtgcagaagtt ttaatgttgggagaaatgctgactttaccacagaattttgggaatatatttttgggagag accttttccagttatatcagcgttcataatgatagcaatcaagttgtaaaagacatatta gtaaaagctgatcttcagacaagttctcagcgtttaaatctttcagcctccaatgctgca gtggctgaacttaaaccggattgttgtattgatgatgtcatacatcatgaagtcaaagaa attggaacacacatn