GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:14:00 Sequence gi568815596r:70197292_70402011 : 204720 bp : 43.61% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 3736 3775 40 -2.16 1.01 Init + 4682 4912 231 1 0 60 110 198 0.516 15.78 1.02 Intr + 5339 5479 141 1 0 54 76 56 0.308 1.55 1.03 Intr + 6056 6232 177 1 0 16 32 158 0.746 3.12 1.04 Term + 6920 7321 402 1 0 16 51 212 0.982 5.45 1.05 PlyA + 8416 8421 6 1.05 2.07 PlyA - 9520 9515 6 1.05 2.06 Term - 21710 21690 21 2 0 114 36 2 0.017 -4.09 2.05 Intr - 27338 27263 76 1 1 73 80 60 0.155 3.22 2.04 Intr - 30531 30444 88 1 1 62 97 60 0.311 3.33 2.03 Intr - 32027 31973 55 2 1 65 92 2 0.395 -3.15 2.02 Intr - 33563 33465 99 2 0 115 121 55 0.871 11.81 2.01 Init - 51096 50941 156 0 0 91 57 95 0.415 4.61 2.00 Prom - 55780 55741 40 -6.76 3.00 Prom + 59071 59110 40 -7.66 3.01 Init + 60874 60985 112 0 1 95 74 243 0.998 21.97 3.02 Intr + 62069 62275 207 0 0 94 71 189 0.981 16.85 3.03 Intr + 63921 64095 175 1 1 126 76 89 0.988 10.50 3.04 Intr + 77668 77879 212 1 2 85 94 323 0.999 31.16 3.05 Intr + 78223 78375 153 2 0 69 115 73 0.771 8.14 3.06 Term + 79443 80101 659 1 2 94 45 465 0.999 36.92 3.07 PlyA + 80214 80219 6 1.05 4.05 PlyA - 82025 82020 6 1.05 4.04 Term - 84393 84343 51 0 0 106 36 61 0.584 0.13 4.03 Intr - 90901 90777 125 2 2 83 103 61 0.770 7.40 4.02 Intr - 92081 92059 23 1 2 122 103 28 0.129 4.89 4.01 Init - 96358 96327 32 1 2 78 106 0 0.092 0.06 4.00 Prom - 97278 97239 40 -2.86 5.04 PlyA - 98711 98706 6 1.05 5.03 Term - 100186 99998 189 1 0 117 42 222 0.999 17.95 5.02 Intr - 103689 103549 141 0 0 131 94 181 0.996 23.55 5.01 Init - 104720 104634 87 2 0 108 74 146 0.735 15.87 5.00 Prom - 138212 138173 40 -3.26 6.06 PlyA - 138231 138226 6 1.05 6.05 Term - 160586 160491 96 2 0 85 41 92 0.382 2.17 6.04 Intr - 171052 170968 85 0 1 85 99 40 0.050 4.62 6.03 Intr - 183205 183150 56 1 2 47 71 57 0.002 -2.32 6.02 Intr - 188828 188728 101 0 2 79 54 60 0.015 1.53 6.01 Init - 200152 200083 70 1 1 61 55 75 0.056 2.71 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 23175 23342 168 2 0 81 54 148 0.879 5.46 S.002 Term + 95941 96231 291 0 0 96 48 224 0.842 14.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:70197292_70402011|GENSCAN_predicted_peptide_1|316_aa MWPQHGLRYLVAAVLSRWALVETWAEVDRSPMSMEKALKHLEAYNTEKEGAFASRVGWAF LTMLWKVHAQSLRDTAQVSRQLAQGKGDHILTEWLMAAMWTGWNDARELSKTVSKWQSYA ELVEDTRDQRPHVELAIHWYPTNVQQVLVLVGTDADYSLVYGKRDKFLGKAAYTDGYRGQ SVKMTVDYQELNKVTAPLHAAVLSITDLMDHLTMELGQYHYAVDLANASLSDDIAPESQE QFGFTWEGRQWTFTVLPQGYVHSPTICHGLVATDLATWKFPKGIRLFHYIDDIMLTSDSP ADLEAVVPLLQQHLAA >gi568815596r:70197292_70402011|GENSCAN_predicted_CDS_1|951_bp atgtggccccaacatgggttgcggtacttggtggcagctgtgctgtccagatgggccctg gtggaaacatgggcagaggtagacaggtcccccatgagcatggagaaggccctgaagcac ctggaagcatacaacaccgagaaggagggtgcctttgccagcagagttggatgggcattt ttgactatgctatggaaagtgcatgcccagtccctgcgggatacagcacaggtgagcagg cagttggcacaagggaaaggtgaccatatcctgactgaatggctgatggcagccatgtgg acagggtggaatgatgccagagaattatcaaaaactgtgagtaaatggcaatcatatgca gagctggtggaggacaccagagatcagaggccacatgtggaattggcaatccactggtac cccaccaatgtacagcaggtgttggtgctggtaggtactgatgcagattatagcctcgtc tatgggaaacgggataagtttttaggcaaggctgcatacacagatggttatagaggccag tcagtgaaaatgacagtggactatcaagaactaaataaagtaacagcccctttacatgca gcagtcctgtctatcacggatttgatggaccacctgacgatggaattgggacagtaccat tatgcagtggatttggctaatgcatccctttcagatgatatcgctccagagagccaggaa cagtttggcttcacatgggaagggcgacaatggactttcacagtgttgccacagggctac gtgcatagtcccaccatatgtcatggtctcgttgctacggatttagccacctggaaattt ccaaaggggatccgcctattccattacattgatgatattatgttaacctctgattctcct gcagatttagaagctgtggtgcccctcttgcaacaacatttggcagcatga >gi568815596r:70197292_70402011|GENSCAN_predicted_peptide_2|164_aa MREAGWWSSRREGLGPAPLPAYCSPWTPRYRGFCFLSMHLDAYNQDPICEDETAGNDPYC FVEFHEHRHAAAALAAMNGRKIMGKEVKVNWATTPSSQKKDTSNHFHVFVGDLSPEITTE DIKAAFAPFGRISDARVVKDMATGKSKGYGFVSFFNKWSFSAFQ >gi568815596r:70197292_70402011|GENSCAN_predicted_CDS_2|495_bp atgagggaggcgggatggtggtcgtcccggagggaaggcctcggccctgcgccgctccca gcctattgttctccgtggacaccgcgatatcgtggtttttgtttccttagcatgcacctg gatgcttacaaccaagacccaatctgtgaggacgagacagctggaaatgatccctattgt tttgtggagtttcatgagcatcgtcatgcagctgcagcattagctgctatgaatggacgg aagataatgggtaaggaagtcaaagtgaattgggcaacaacccctagcagtcaaaagaaa gatacaagcaatcatttccatgtctttgttggtgatctcagcccagaaattacaactgaa gatataaaagctgcttttgcaccatttggaagaatatcagatgcccgagtggtaaaagac atggcaacaggaaagtctaagggatatggctttgtctcctttttcaacaaatggtctttc tccgcatttcaataa >gi568815596r:70197292_70402011|GENSCAN_predicted_peptide_3|505_aa MGRVVAELVSSLLGLWLLLCSCGCPEGAELRAPPDKIAIIGAGIGGTSAAYYLRQKFGKD VKIDLFEREEVGGRLATMMVQGQEYEAGGSVIHPLNLHMKRFVKDLGLSAVQASGGLLGI YNGETLVFEESNWFIINVIKLVWRYGFQSLRMHMWVEDVLDKFMRIYRYQSHDYAFSSVE KLLHALGGDDFLGMLNRTLLETLQKAGFSEKFLNEMIAPVMRVNYGQSTDINAFVGAVSL SCSDSGLWAVEGGNKLVCSGLLQASKSNLISGSVMYIEEKTKTKYTGNPTKMYEVVYQIG TETRSDFYDIVLVATPLNRKMSNITFLNFDPPIEEFHQYYQHIVTTLVKGELNTSIFSSR PIDKFGLNTVLTTDNSDLFINSIGIVPSVREKEDPEPSTDGTYVWKIFSQETLTKAQILK LFLSYDYAVKKPWLAYPHYKPPEKCPSIILHDRLYYLNGIECAASAMEMSAIAAHNAALL AYHRWNGHTDMIDQDGLYEKLKTEL >gi568815596r:70197292_70402011|GENSCAN_predicted_CDS_3|1518_bp atggggcgcgtcgtcgcggagctcgtctcctcgctgctggggttgtggctgttgctgtgc agctgcggatgccccgagggcgccgagctgcgtgctccgccagataaaatcgcgattatt ggagccggaattggtggcacttcagcagcctattacctgcggcagaaatttgggaaagat gtgaagatagacctgtttgaaagagaagaggtcgggggccgcctggctaccatgatggtg caggggcaagaatacgaggcaggaggttctgtcatccatcctttaaatctgcacatgaaa cgttttgtcaaagacctgggtctctctgctgttcaggcctctggtggcctactggggata tataatggagagactctggtatttgaggagagcaactggttcataattaacgtgattaaa ttagtttggcgctatggatttcaatccctccgtatgcacatgtgggtagaggacgtgtta gacaagttcatgaggatctaccgctaccagtctcatgactatgccttcagtagtgtcgaa aaattacttcatgctctaggaggagatgacttccttggaatgcttaatcgaacacttctt gaaaccttgcaaaaggccggcttttctgagaagttcctcaatgaaatgattgctcctgtt atgagggtcaattatggccaaagcacggacatcaatgcctttgtgggggcggtgtcactg tcctgttctgattctggcctttgggcagtagaaggtggcaataaacttgtttgctcaggg cttctgcaggcatccaaaagcaatcttatatctggctcagtaatgtacatcgaggagaaa acaaagaccaagtacacaggaaatccaacaaagatgtatgaagtggtctaccaaattgga actgagactcgttcagacttctatgacatcgtcttggtggccactccgttgaatcgaaaa atgtcgaatattacttttctcaactttgatcctccaattgaggaattccatcaatattat caacatatagtgacaactttagttaagggggaattgaatacatctatctttagctctaga cccatagataaatttggccttaatacagttttaaccactgataattcagatttgttcatt aacagtattgggattgtgccctctgtgagagaaaaggaagatcctgagccatcaacagat ggaacatatgtttggaagatcttttcccaagaaactcttactaaagcacaaattttaaag ctctttctgtcctatgattatgctgtgaagaagccatggcttgcatatcctcactataag cccccggagaaatgcccctctatcattctccatgatcgactttattacctcaatggcata gagtgtgcagcaagtgccatggagatgagtgccattgcagcccacaacgctgcactcctt gcctatcaccgctggaacgggcacacagacatgattgatcaggatggcttatatgagaaa cttaaaactgaactatga >gi568815596r:70197292_70402011|GENSCAN_predicted_peptide_4|76_aa MSKAHPPELKKFMDKKLSLKLNGGRHVQGILRGFDPFMNLVIDECVEMATSGQQNNIGMV VIRGNSIIMLEALERV >gi568815596r:70197292_70402011|GENSCAN_predicted_CDS_4|231_bp atgagcaaagctcaccctcccgagttgaaaaaatttatggacaagaagttatcattgaaa ttaaatggtggcagacatgtccaaggaatattgcggggatttgatccctttatgaacctt gtgatagatgaatgtgtggagatggcgactagtggacaacagaacaatattggaatggtg gtaatacgaggaaatagtatcatcatgttagaagccttggaacgagtataa >gi568815596r:70197292_70402011|GENSCAN_predicted_peptide_5|138_aa MAELQQLRVQEAVESMVKSLERENIRKMQGLMFRCSASCCEDSQASMKQVHQCIERCHVP LAQAQALVTSELEKFQDRLARCTMHCNDKAKDSIDAGSKELQVKQQLDSCVTKCVDDHMH LIPTMTKKMKEALLSIGK >gi568815596r:70197292_70402011|GENSCAN_predicted_CDS_5|417_bp atggctgagctgcagcagctccgggtgcaggaggcggtggagtccatggtgaagagtctg gaaagagagaacatccggaagatgcagggtctcatgttccggtgcagcgccagctgttgt gaggacagccaggcctccatgaagcaggtgcaccagtgcatcgagcgctgccatgtgcct ctggctcaagcccaggctttggtcaccagtgagctggagaagttccaggaccgcctggcc cggtgcaccatgcattgcaacgacaaagccaaagattcaatagatgctgggagtaaggag cttcaggtgaagcagcagctggacagttgtgtgaccaagtgtgtggatgaccacatgcac ctcatcccaactatgaccaagaagatgaaggaggctctcttatcaattggaaaataa >gi568815596r:70197292_70402011|GENSCAN_predicted_peptide_6|135_aa MPLDFVANDDKGRDGEEDSAKVVNSSHICSSFLKIEIGWCLPVWKPTKAKSKTASKLKIS IDEDVEELELSCITGGEKRLSTGINCDPPVNNLSDSFLTFDAGQHLEQEGQKAALVTTNE LILLREAIRKRNFTP >gi568815596r:70197292_70402011|GENSCAN_predicted_CDS_6|408_bp atgccattagattttgttgccaatgatgacaaaggcagagatggagaggaagacagtgct aaagttgttaattcttcgcacatctgcagcagcttcctgaagatagaaatcggttggtgc ctacctgtttggaaacccaccaaagcaaagagtaaaacagcatccaaattgaagataagt attgatgaggatgtggaggaattagaactctcatgcattactggtggggagaagagactc agcactggaatcaactgtgatccacctgtgaacaatttaagtgactccttccttactttt gatgcagggcagcatttggagcaggaaggacagaaggctgctcttgtcaccaccaatgag ctcatccttctccgagaggcaataagaaagcgaaatttcaccccatga