GENSCAN 1.0    Date run: 2-Nov-116    Time: 18:22:19

Sequence gi568815590r:52523785_52785469 : 261685 bp : 39.25% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4216   4266   51  0  0   65   77    74 0.885   5.21
 1.02 Intr +   4503   4800  298  2  1   30  101   331 0.969  24.12
 1.03 Term +   5675   5925  251  0  2   26   44   148 0.417  -0.92
 1.04 PlyA +   7358   7363    6                               1.05

 2.05 PlyA -   8014   8009    6                               1.05
 2.04 Term -  10816  10806   11  2  2  110   48     5 0.071  -3.92
 2.03 Intr -  16127  16047   81  0  0  105   89    84 0.792   8.89
 2.02 Intr -  18661  18608   54  2  0  129  111     3 0.955   4.73
 2.01 Init -  19163  19109   55  2  1   64   81    63 0.569   4.70
 2.00 Prom -  29236  29197   40                              -4.25

 3.05 PlyA -  29631  29626    6                               1.05
 3.04 Term -  34911  34798  114  0  0   52   49    82 0.335  -1.71
 3.03 Intr -  37846  37757   90  1  0   97   93    44 0.599   5.07
 3.02 Intr -  41219  41128   92  0  2   57   65   103 0.277   3.69
 3.01 Init -  41472  41283  190  0  1   97   96   199 0.686  18.82
 3.00 Prom -  43622  43583   40                              -8.55

 4.03 PlyA -  43947  43942    6                               1.05
 4.02 Term -  45195  44582  614  1  2   76   53   347 0.463  23.65
 4.01 Init -  49475  49370  106  2  1   48   78   136 0.425   7.03
 4.00 Prom -  55475  55436   40                              -5.95

 5.15 PlyA -  60208  60203    6                               1.05
 5.14 Term -  76361  76228  134  0  2   58   42   136 0.569   3.27
 5.13 Intr -  80964  80898   67  0  1   91   58    60 0.066   0.86
 5.12 Intr -  82969  82918   52  0  1   76  116    24 0.089   1.99
 5.11 Intr - 106744 106686   59  1  2  110   89    23 0.306   1.36
 5.10 Intr - 111184 111137   48  1  0   91   99    48 0.763   4.16
 5.09 Intr - 112285 112231   55  0  1   73   91    55 0.691   2.36
 5.08 Intr - 118807 118567  241  2  1   69   98   222 0.655  16.99
 5.07 Intr - 119028 118920  109  0  1   87   76    63 0.996   3.94
 5.06 Intr - 122062 121918  145  0  1   43   62   193 0.610  11.56
 5.05 Intr - 134124 132224 1901  0  2  127  116  1257 0.994 118.77
 5.04 Intr - 137497 137311  187  0  1   63   69   119 0.535   6.27
 5.03 Intr - 137935 137751  185  1  2   54  100    92 0.996   4.66
 5.02 Intr - 144407 144237  171  2  0   68   82   279 0.998  24.52
 5.01 Init - 150468 150061  408  0  0   70  110   206 0.907  17.64
 5.00 Prom - 154068 154029   40                              -6.25

 6.05 PlyA - 154675 154670    6                               1.05
 6.04 Term - 158294 158070  225  2  0   88   48    93 0.767   1.10
 6.03 Intr - 159935 159765  171  2  0   28   91    96 0.851   3.12
 6.02 Intr - 160229 160103  127  1  1   45   90   136 0.736   9.26
 6.01 Init - 161685 161615   71  0  2   81   92    19 0.898   2.17
 6.00 Prom - 174303 174264   40                              -6.35

 7.00 Prom + 181824 181863   40                              -4.25
 7.01 Sngl + 190204 190407  204  0  0   65   46   435 0.994  29.94
 7.02 PlyA + 192443 192448    6                               1.05

 8.09 PlyA - 193638 193633    6                               1.05
 8.08 Term - 204859 204606  254  2  2   13   51   147 0.130  -1.68
 8.07 Intr - 207556 207470   87  2  0   52   99   117 0.306   8.22
 8.06 Intr - 223446 223404   43  0  1  131   75   -32 0.141  -3.31
 8.05 Intr - 226402 226269  134  2  2   84  110    79 0.576   9.14
 8.04 Intr - 236717 236591  127  2  1  107   80    37 0.005   4.13
 8.03 Intr - 243112 242978  135  1  0   45   75    92 0.526   3.44
 8.02 Intr - 246583 246474  110  2  2   57   82   123 0.900   7.68
 8.01 Init - 247371 247362   10  0  1   77   95     0 0.447   0.30
 8.00 Prom - 247761 247722   40                              -2.85

 9.02 PlyA - 248325 248320    6                               1.05
 9.01 Term - 254919 254754  166  2  1   84   46   189 0.984  10.71

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 240741 240966 226 0 1 38 43 226 0.864 8.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:52523785_52785469|GENSCAN_predicted_peptide_1|199_aa MLIGSNEQEVSNALDLLPINQVTRKAASFEWGPEQEKALQQVQAVEFWSNALPSSADNIS AFERQLLASYWTLVETACLALGHQVTMRPELPIMNWVLSDPSSHKVVQAQQHSIIKWQRF VLTVIDTYFGYGFSYPASNASAKTTIHGLTACLIHHHGIPHRISSDRGTHFLLKKCGSVL MLMEFTGLTMFPIILTQLD >gi568815590r:52523785_52785469|GENSCAN_predicted_CDS_1|600_bp atgctgattggatccaatgagcaagaagtatcaaatgcactggacttattgcccattaat caagtgacccgaaaggctgccagttttgagtggggtccagaacaggagaaggctctgcaa caggtccaggctgttgaattttggagcaatgccctgccatcttctgcagataacatctct gcttttgagagacagctcttggcgagttactggaccttggtggaaactgcatgtttggct ctgggtcatcaagtcaccatgcgacctgaactgcctataatgaactgggtgctttctgac ccatctagccataaagtggttcaggcacagcagcattccatcatcaaatggcagaggttt gtcctcactgtaatagacacttactttggatatgggttttcctatcctgcaagcaatgct tctgccaagactaccatccatggactcacagcatgccttatccaccatcatggtattcca cacagaatttcttctgaccgaggcactcactttctgctaaagaagtgtggcagtgtgctc atgctcatggaattcactggtcttaccatgttccccatcatcctaacgcagctggattga >gi568815590r:52523785_52785469|GENSCAN_predicted_peptide_2|66_aa MVTKGRWLGPDSLREGERKIFPRDSNLKDKFIKHFTGPVTFSPECSKHFHRLYYNTRECS TPAYIS >gi568815590r:52523785_52785469|GENSCAN_predicted_CDS_2|201_bp atggtcacaaaaggacgatggcttgggccagactcgttgagggaaggagaaagaaaaata ttcccaagagactctaacttaaaagacaaattcataaagcatttcacagggccggtcaca ttttcaccagaatgcagcaaacatttccaccgactctattacaataccagggagtgctca acgccagcttacatttcctaa >gi568815590r:52523785_52785469|GENSCAN_predicted_peptide_3|161_aa MRPLKPGAPLPALFLLALALSPHGAHGRPRGRRGARVTDKEPKPLLFLPAAGAGRTPSGS RSAGAGRGTRFGKPEISTAENRASLQIPSSRKEVEGPSFLGKKGRYSSPFVPPPLTFRQL SGATDMKEPTLETLFQPMPINEPDFSRWGTPGKKDWAAGKQ >gi568815590r:52523785_52785469|GENSCAN_predicted_CDS_3|486_bp atgcggccccttaagcccggcgcccctttgcccgcactcttcctgctggcgctggctttg tccccgcacggagcccacgggaggccccgggggcgcaggggagcgcgcgtcacggataag 
gagcccaagccgttgcttttcctccccgcggccggggccggccggactcccagcggctcc cggagcgcaggagctgggcgaggcactcgctttgggaagcctgagattagtacagcagaa aacagagcatctctgcagattcccagctctcggaaagaggtcgaaggtcctagcttcctt ggaaagaaaggcagatattcaagtccttttgtgccacctcctcttacctttcggcagctc tcaggagctactgatatgaaggagcccactttggaaacattgtttcagccaatgcctata aacgagcccgactttagccgctgggggacaccaggaaaaaaagactgggctgcaggaaag caatga >gi568815590r:52523785_52785469|GENSCAN_predicted_peptide_4|239_aa MPLPGRPIWEVGSASAQLPRLGGEEHLCQAALHLGVPECIIGIDILSTWQNTHIGSLTGR VRAIMVGKAKWKPLELPLPRKIVNQKQYCIPGGTEEISATIKELKDAGEVIPIPTTFPLN SPIWPVRKTDGCWRMTVDYCELNQVVTPTAATVPDVVSLLEQINTSPGTWYADIDLANAV FSIPVHKAHQKQFAFSWQGQQYTFTVLPQEYINSLGLCHNLILVMVNTEYQLDWIEVKR >gi568815590r:52523785_52785469|GENSCAN_predicted_CDS_4|720_bp atgcctctgcccggccgccccatctgggaggtgggcagcgcttctgcccagctgccccgt ctgggaggtgaggagcacctctgccaggccgcccttcatctgggagtgccagagtgcatc attggcatagacatacttagcacctggcagaacacccacattggctccctaactggtagg gtgagggctatcatggtgggaaaggccaaatggaagccattagagctgcctctacctaga aaaatagtaaatcaaaaacaatattgcatacctggaggaactgaggagattagtgccacc atcaaggaattgaaagatgcaggggaggtgattccgattcccaccacattcccattaaac tctcccatttggcctgtgcggaagacagatggatgttggagaatgacagtggattattgt gagcttaaccaagtggtgactccaactgcagctacagtaccagatgtggtttcattgctt gagcaaattaacacatctcctggtacctggtatgcagacattgacttggcaaatgccgtt ttctccattcctgtccataaggcccaccagaagcaatttgccttcagctggcaaggccag caatatacctttactgtcctacctcaggagtatatcaactctctaggtttgtgtcataat cttattcttgtgatggttaatactgagtatcaacttgattggattgaagtgaagaggtga >gi568815590r:52523785_52785469|GENSCAN_predicted_peptide_5|1253_aa MAKIPLLECLTRHSYRECLGRLDSLPEHEDSEKAEMKRSTELVLSPDMPRTTNESLLTSF PKSVEHVSPDTADAESGKEIRESCQSTVHQQDETTIDTKDGDLPFFNVSLLDWINVQDRP NDVESLVRKCFDSMSRLDPRIIRPFIAECRQTIAKLDNQNMKAIKGLEDRLYALDQMIAS CGRLVNEQKELAQGFLANQKRAENLKDASVLPDLCLSHANQLMIMLQNHRKLLDIKQKCT TAKQELANNLHVRLKWCCFVMLHADQDGEKLQALLRLVIELLERVKIVEALSTVPQMYCL AVVEVVRRKMFIKHYREASVSQTSPQSASSPRMESTAGITTTTSPRTPPPLTVQDPLCPA VCPLEELSPDSIDAHTFDFETIPHPNIEQTIHQVSLDLDSLAESPESDFMSAVNEFVIEE 
NLSSPNPISDPQSPEMMVESLYSSVINAIDSRRMQDTNVCGKEDFGDHTSLNVQLERCRV VAQDSHFSIQTIKEDLCHFRTFVQKEQCDFSNSLKCTAVEIRNIIEKVKCSLEITLKEKH QKELLSLKNEYEGKLDGLIKETEENENKIKKLKGELVCLEEVLQNKDNEFALVKHEKEAV ICLQNEKDQKLLEMENIMHSQNCEIKELKQSREIVLEDLKKLHVENDEKLQLLRAELQSL EQSHLKELEDTLQVRHIQEFEKVMTDHRVSLEELKKENQQIINQIQESHAEIIQEKEKQL QELKLKVSDLSDTRCKLEVELALKEAETDEIKILLEESRAQQKETLKSLLEQETENLRTE ISKLNQKIQDNNENYQVGLAELRTLMTIEKDQCISELISRHEEESNILKAELNKVTSLHN QAFEIEKNLKEQIIELQSKLDSELSALERQKDEKITQQEEKYEAIIQNLEKDRQKLVSSQ EQDREQLIQKLNCEKDEAIQTALKEFKLEREVVEKELLEKVKHLENQIAKRGDSSSLVAE LQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDL SDKLKSTMQQQERDKDLIESLSEDRARLLEEKKKLEEEVSKLRSSSFVPSPYVATAPELY GACAPELPGESDRSAVETADEGRVDSAMETSMMSVQENIHMLSEEKQRIMLLERTLQLKE EENKRLNQRLMSQSMSSVSSRHSEKIAIREIQHPVFMPVALSYESRKTGSRCFGKWAVRS SSEAYLRVPGLCDEVVANAIMVPAILSNGKNRNYFHTNPLTPDLVEKTTTTTA >gi568815590r:52523785_52785469|GENSCAN_predicted_CDS_5|3762_bp atggccaagattccactgttggagtgcctaaccagacatagttacagagaatgtttggga agactggattctttacctgaacatgaagactcagaaaaagctgagatgaaaagatccact gaactggtgctctctcctgatatgcctagaacaactaacgaatctttgttaacctcattt cccaagtcagtggaacatgtgtccccagataccgcagatgctgaaagtggcaaagaaatt agggaatcttgtcaaagtactgttcatcagcaagatgaaactacgattgacactaaagat ggtgatctgcccttttttaatgtctctttgttagactggataaatgttcaagatagacct aatgatgtggaatctttggtcaggaagtgctttgattctatgagcaggcttgatccaagg attattcgaccatttatagcagaatgccgtcaaactattgccaaacttgataatcagaat atgaaagccattaaaggacttgaagatcggctctacgccctggaccagatgattgctagc tgtggccgactggtgaatgaacagaaagagcttgctcagggatttttagctaatcagaag agagctgaaaacttaaaggatgcatctgtattacctgatttatgcctgagtcacgcaaat cagttgatgattatgttgcaaaatcatagaaaactgttagatattaagcagaagtgtacc actgccaaacaagaactagcaaataacctacatgtcagactgaagtggtgttgctttgta atgcttcatgctgatcaagatggagagaagttacaagctttgctccgcctcgtaatagag ctgttagaaagagtcaaaattgttgaagctcttagtacagttcctcagatgtactgctta gctgttgttgaggttgtaagaagaaaaatgttcataaaacactacagggaggcatctgtg agtcagacatccccacagtctgcttcttcaccaaggatggaaagtacagcaggaattaca 
actactacctcaccgagaactcctccaccactgactgttcaggatcccttatgtcctgca gtttgtcccttagaagaattatctccagatagtattgatgcacatacgtttgattttgaa actattccccatccaaacatagaacagactattcaccaagtttctttagacttggattca ttagcagaaagtcctgaatcagattttatgtctgctgtgaatgagtttgtaatagaagaa aatttgtcgtctcctaatcctataagtgatccacaaagcccagaaatgatggtggaatca ctttattcatcagttatcaatgcgatagacagtagacgaatgcaggatacaaatgtatgt ggtaaggaggattttggagatcatacttctctgaatgtccagttggaaagatgtagagtt gttgcccaagactctcacttcagtatacaaaccattaaggaagacctttgccactttaga acatttgtacaaaaagaacagtgtgacttctcaaattcattaaaatgtacagcagtagaa ataagaaacattattgaaaaagtaaaatgttctctggaaataacactaaaagaaaaacat caaaaagaactactgtctttaaaaaatgaatatgaaggtaaacttgacggactaataaag gaaactgaagagaatgaaaacaaaattaaaaaattgaagggagagttagtatgccttgag gaggttttacaaaataaagataatgaatttgctttggttaaacatgaaaaagaagctgta atctgcctgcagaatgaaaaggatcagaagttgttagagatggaaaatataatgcactct caaaattgtgaaattaaagaactgaagcagtcacgagaaatagtgttagaagacttaaaa aagctccatgttgaaaatgatgagaagttacagttattgagggcagaacttcagtccttg gagcaaagtcatctaaaggaattagaggacacacttcaggttaggcacatacaagagttt gagaaggttatgacagaccacagagtttctttggaggaattaaaaaaggaaaaccaacaa ataattaatcaaatacaagaatctcatgctgaaattatccaggaaaaagaaaaacagtta caggaattaaaactcaaggtttctgatttgtcagacacgagatgcaagttagaggttgaa cttgcgttgaaggaagcagaaactgatgaaataaaaattttgctggaagaaagcagagcc cagcagaaggagaccttgaaatctcttcttgaacaagagacagaaaatttgagaacagaa attagtaaactcaaccaaaagattcaggataataatgaaaattatcaggtgggcttagca gagctaagaactttaatgacaattgaaaaagatcagtgtatttccgagttaattagtaga catgaagaagaatctaatatacttaaagctgaattaaacaaagtaacatctttgcataac caagcatttgaaatagaaaaaaacctaaaagaacaaataattgaactgcagagtaaattg gattcagaattgagtgctcttgaaagacaaaaagatgaaaaaattacccaacaagaagag aaatacgaagctattatccagaaccttgagaaagacagacaaaaattggtcagcagccag gagcaagacagagaacagttaattcagaagcttaattgtgaaaaagatgaagctattcag actgccctaaaagaatttaaattggagagagaagttgttgagaaagagttattagaaaaa gttaaacatcttgagaatcaaatagcaaaaagaggagattcttcaagcttagttgctgaa cttcaagaaaagcttcaggaagaaaaagctaagtttctagaacaacttgaagagcaagaa 
aaaagaaagaatgaagaaatgcaaaatgttcgaacatctttgattgcggaacaacagacc aattttaacactgttttaacaagagagaaaatgagaaaagaaaacataataaatgatctt agtgataagttgaaaagtacaatgcagcaacaagaacgggataaagatttgatagagtca ctttctgaagatcgagctcgtttgcttgaggaaaagaaaaagcttgaagaagaagtcagt aagttgcgtagtagcagttttgttccttcaccatatgtagctacagccccagaactttat ggagcttgtgcacctgaactcccaggtgaatcagatagatccgctgtggaaacagcagat gaaggaagagtggattcagcaatggagacaagcatgatgtctgtacaagaaaatattcat atgttgtctgaagaaaaacagcggataatgctgttagaacgaacattgcaattgaaagaa gaagaaaataaacggttaaatcaaagactgatgtctcagagcatgtcttcagtatcttca aggcattctgaaaagatagctattagagaaattcagcatcctgtctttatgcctgtcgca ttaagctatgagtccaggaagacggggagtcgctgttttgggaaatgggcagtcagatct tcctcagaggcctatctaagggtccctgggctgtgtgatgaggttgttgcaaatgcaatc atggtcccagctattctaagtaatggcaaaaaccgcaactactttcacaccaacccacta actcctgatctggtggagaaaacaacaacaacaacagcttaa >gi568815590r:52523785_52785469|GENSCAN_predicted_peptide_6|197_aa MKLYVFLVNTGTTLTFDTELTVQTVADLKHAIQSKYKIAIQHQVLVVNGGECMAADRRVC TYSAGTDTNPIFLFNKEMILCDRPPAIPKTTFSTENDMEIKVEESLMMPAVFHTVASRTQ LALAGVVSVAFPGAQCKLLVGLPFWGLEDRDPLLRAPLGSAPSRDSVWGLRPHIFLLHCP NRFSMRALPLQQTYLGI >gi568815590r:52523785_52785469|GENSCAN_predicted_CDS_6|594_bp atgaagttatatgtatttctggttaacactggaactactctaacatttgacactgaactt acagtgcaaactgtggcagaccttaagcatgccattcaaagcaaatacaagattgctatt caacaccaggtgctggtggtcaatggaggagaatgcatggctgcagatcgaagagtgtgt acctacagtgctgggacggatacaaatccaatttttctttttaacaaagaaatgatctta tgtgatcgtccacctgctattcctaaaactaccttttcgacagaaaatgacatggaaata aaagttgaagaatctcttatgatgcctgcagtttttcatactgttgcttcaaggacacag cttgcattggctggcgttgtgtctgtggcttttccaggtgcacagtgcaagctgttggtg ggtcttccattctggggtctggaggaccgtgaccctcttctcagagctccactagggagt gcccccagtagggactctgtgtgggggctccgaccccacattttccttctacactgccct aacaggttctccatgagggccctgcccctgcagcaaacttacctgggcatctag >gi568815590r:52523785_52785469|GENSCAN_predicted_peptide_7|67_aa MRPRPTVPGPAARPLCQMGEGDAGGGDTYPATPPPSPAAAAEPATPGTIFRRRLVLGSGY QPPIRHR >gi568815590r:52523785_52785469|GENSCAN_predicted_CDS_7|204_bp 
atgcgaccccggcccaccgtgccagggcccgccgcgcgaccgctgtgccagatgggggaa ggggacgcgggcggcggcgacacttacccggcaacgcctcctccttcgccggcggcagca gcagagccagcgacccccggcaccatcttccgccgccgcctagtcctcggcagcggttac caaccgcccattcggcaccgctaa >gi568815590r:52523785_52785469|GENSCAN_predicted_peptide_8|299_aa MSHGWLYIVTSKEYNNEKGNKDSNFTVEKPDPYYISQWIKMTSVQFIPTKIATLQESPDQ EESWVKCADQGRQRRQHKKTREITEMLLPDSSFGLLDPRLVTPVVMRKICNVPSQTGKEP RDKKNDLMSVKKKTLSSDIQLKGPQGRTLIGLVEDILPLWPESEEMKWTQVQLLIFTNTG PCGRGQGKTDEVAKYPWKSNPTRKSNPSPSFQDLTDNHSIYLELKSMVLNLGCFAFRDIW QCLETLLAVTIVGSATEIKWVEARDAAKHPKMHRTASHNNYLAQNDNRMRNPGLDQPGF >gi568815590r:52523785_52785469|GENSCAN_predicted_CDS_8|900_bp atgtcacatggttggctgtacatagtgacttccaaagaatacaacaatgaaaaggggaat aaagacagcaactttactgtggaaaagcctgacccatactacatcagccaatggatcaag atgacctctgtccagtttattcctaccaagatagccactctccaggagagccctgaccag gaggagagttgggtcaagtgtgccgatcaggggagacagaggagacagcataagaaaaca cgtgaaataacagagatgctccttcctgattcatcatttggtctcttagatcctaggtta gtgacaccagttgttatgagaaaaatctgtaatgtcccttcccaaactgggaaggagcca agagacaaaaagaatgatttgatgtcagtgaagaaaaagaccctatcatctgacatccag ttaaaaggaccccagggaaggactttgataggcctggttgaggacatcttgcccttgtgg ccagaaagtgaggaaatgaaatggactcaggttcagcttttaattttcacaaacactggc ccttgtgggagggggcaagggaagacagacgaagttgctaaatacccatggaaatccaac cctacgcggaaatccaatcctagtccatcatttcaagaccttacagacaatcacagcatt tatttagaacttaaatcaatggttctcaaccttggctgttttgcctttcgggacatttgg caatgtctggagacacttttggctgttaccattgtggggagtgcaactgaaatcaagtgg gtggaggccagggatgcagctaaacatccaaagatgcacaggacagcctcccacaacaat tatttggcccaaaatgacaacagaatgagaaaccctggcttggaccaaccaggtttttga >gi568815590r:52523785_52785469|GENSCAN_predicted_peptide_9|55_aa XESRTRHSPVGRLFNQSASTPVKGLLPQHRHQAASYVCTKAMGGQRRSDQINGSS >gi568815590r:52523785_52785469|GENSCAN_predicted_CDS_9|168_bp nnggagagccggactcgacactcccctgtaggtagactcttcaaccagtctgccagtacc ccagtgaagggcctgcttccacagcaccgccatcaggctgccagctatgtctgcacgaaa gccatggggggacagcggagatcagatcagatcaacggctcctcttga