GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:57:15 Sequence gi568815574r:2686992_2887603 : 200612 bp : 43.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 932 971 40 -2.86 1.01 Init + 4370 4436 67 1 1 110 88 248 0.995 26.23 1.02 Intr + 27431 27463 33 0 0 114 105 15 0.831 3.99 1.03 Intr + 32670 32714 45 1 0 108 101 57 0.986 7.38 1.04 Intr + 33365 33433 69 0 0 57 111 61 0.947 4.45 1.05 Intr + 35636 35683 48 0 0 116 87 37 0.915 5.05 1.06 Intr + 39269 39382 114 0 0 104 115 95 0.963 14.22 1.07 Intr + 51209 51265 57 0 0 107 99 86 0.851 10.46 1.08 Intr + 53692 53735 44 2 2 80 94 -4 0.825 -2.64 1.09 Intr + 54144 54226 83 2 2 81 121 3 0.839 1.34 1.10 Term + 54633 54744 112 0 1 61 47 135 0.444 4.73 1.11 PlyA + 54889 54894 6 1.05 2.00 Prom + 56099 56138 40 -1.56 2.01 Init + 65284 65344 61 0 1 91 94 44 0.897 4.71 2.02 Intr + 66915 67031 117 1 0 80 110 19 0.826 3.74 2.03 Term + 67636 67748 113 2 2 38 47 80 0.381 -2.38 2.04 PlyA + 68233 68238 6 1.05 3.00 Prom + 73821 73860 40 -3.76 3.01 Init + 75435 75498 64 2 1 52 105 100 0.931 9.41 3.02 Intr + 80561 80681 121 0 1 73 58 50 0.863 0.05 3.03 Intr + 81367 81500 134 1 2 61 80 83 0.898 5.09 3.04 Intr + 82750 82782 33 2 0 91 91 15 0.550 0.29 3.05 Intr + 83559 83600 42 1 0 99 119 69 0.995 9.31 3.06 Intr + 87725 87748 24 0 0 99 109 25 0.719 3.70 3.07 Term + 92575 92597 23 2 2 97 45 22 0.377 -2.73 3.08 PlyA + 94938 94943 6 1.05 4.02 PlyA - 95846 95841 6 1.05 4.01 Sngl - 100612 99998 615 1 0 46 48 392 0.736 27.20 4.00 Prom - 100940 100901 40 -7.56 5.00 Prom + 101106 101145 40 -8.36 5.01 Init + 102860 102948 89 1 2 84 54 60 0.777 0.81 5.02 Term + 103021 103216 196 1 1 23 49 285 0.946 14.98 5.03 PlyA + 103608 103613 6 1.05 6.03 PlyA - 103629 103624 6 1.05 6.02 Term - 120155 119454 702 2 0 15 48 236 0.143 5.83 6.01 Init - 121325 120921 405 2 0 78 80 248 0.150 19.59 6.00 Prom - 121399 121360 40 -8.56 7.03 PlyA - 121433 121428 6 1.05 7.02 Term - 123686 123414 273 2 0 69 37 217 0.865 10.17 7.01 Init - 126672 126667 6 0 0 78 90 0 0.526 0.07 7.00 Prom - 136522 136483 40 -2.36 8.00 Prom + 138923 138962 40 -6.06 8.01 Init + 143314 143459 146 0 2 78 98 127 0.630 12.29 8.02 Intr + 154582 154636 55 1 1 56 95 7 0.013 -2.82 8.03 Intr + 155174 155251 78 1 0 117 59 45 0.929 4.25 8.04 Intr + 157086 157266 181 2 1 87 82 131 0.981 11.84 8.05 Intr + 158655 158752 98 1 2 69 95 182 0.973 16.73 8.06 Intr + 167609 167780 172 1 1 67 111 178 0.986 17.52 8.07 Intr + 178097 178254 158 0 2 136 64 164 0.941 18.53 8.08 Term + 179802 179903 102 2 0 124 32 41 0.641 0.38 8.09 PlyA + 179939 179944 6 1.05 9.03 PlyA - 181143 181138 6 1.05 9.02 Term - 181379 181159 221 0 2 116 49 25 0.029 -1.40 9.01 Init - 190374 190245 130 0 1 26 82 154 0.656 8.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 121325 120780 546 2 0 78 41 263 0.834 16.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815574r:2686992_2887603|GENSCAN_predicted_peptide_1|223_aa MARGAALALLLFGLLGVLVAAPDGGFDLSDALPGDDFDLGDAVVDGENDDPRPPNPPKPM PNPNPNHPSSSGSFSDADLADGVSGGEADAPGVIPGIVGAVVVAVAGAISSFIAYQKKKL CFKENAEQGEVDMESHRNANAEPAEIKPLAPESCRNCVHNLGCLQITCPSSTSETPGSPD FFLIKIRYSSVSRYSSKSRYSLHPDTVLPKMYKMYRIFPTLKT >gi568815574r:2686992_2887603|GENSCAN_predicted_CDS_1|672_bp atggcccgcggggctgcgctggcgctgctgctcttcggcctgctgggtgttctggtcgcc gccccggatggtggtttcgatttatccgatgcccttcctggggatgactttgacttagga gatgctgttgttgatggagaaaatgacgacccacgaccaccgaacccacccaaaccgatg ccaaatccaaaccccaaccaccctagttcctccggtagcttttcagatgctgaccttgcg gatggcgtttcaggtggagaagccgacgccccaggcgtgatccccgggattgtgggggct gtcgtggtcgccgtggctggagccatctctagcttcattgcttaccagaaaaagaagcta tgcttcaaagaaaatgcagaacaaggggaggtggacatggagagccaccggaatgccaac gcagagccagctgaaataaaaccactggctcctgaaagttgtaggaactgtgtccacaat cttggctgtttacaaatcacgtgtccatcgagcacgtctgaaacccctggtagccccgac ttctttttaattaaaataagatactcctctgtatccagatactcctctaaatccaggtac tccctacatccagatactgtacttcctaagatgtacaagatgtaccgcattttcccaaca ctgaagacttga >gi568815574r:2686992_2887603|GENSCAN_predicted_peptide_2|96_aa MESWWGLPCLAFLCFLMHARVQCSINYVRYATPHYKVDCVLDDLSIRWLMKSVLSTLKAG RLQAEEQGEPVQVPKLKNLESDVPASTMGERCRPED >gi568815574r:2686992_2887603|GENSCAN_predicted_CDS_2|291_bp atggagagctggtggggacttccctgtcttgcgttcctgtgttttctaatgcacgcccga gtacaatgttcaataaattatgtgaggtatgcaacacctcattataaagttgactgtgtg ctagatgatttgtccatccgttggctgatgaaaagtgttctgagcactctgaaggcaggc cgtctgcaggctgaggagcaaggagagccagtccaggttccaaaactgaagaacttggag tctgatgttccagcatccaccatgggagaaagatgtaggccagaagactag >gi568815574r:2686992_2887603|GENSCAN_predicted_peptide_3|146_aa MGSEEQIMHQGGLLSSGVCSPVENKPRESNGVMCNPTLETETPELALSTWDISQPRGLTP DPCLDWNCWQPWRGDVAQIQEADTVPVPQPSLSSLDHDDQERWRRPDTFIRLTAFGSGQR DFDLADALDDPEPTKKPNSDTKIHSC >gi568815574r:2686992_2887603|GENSCAN_predicted_CDS_3|441_bp atgggttccgaggagcagatcatgcaccagggcggcctgctcagctctggtgtctgcagc cccgtggaaaataaaccccgtgaaagcaatggcgtcatgtgcaacccgaccttggagaca gagaccccagagttagcgttgtctacctgggacatctcccagcccaggggtcttactcca gacccgtgtctcgactggaactgctggcagccttggaggggagatgttgcacagatccaa gaggctgacacggttcctgtcccccagcccagtctgagcagcttggaccatgatgatcaa gagcgctggaggaggccagacacattcattagattaactgcttttggatcaggtcaaaga gactttgatttggcagatgcccttgatgaccctgaacccaccaagaagccaaactcagac accaaaatccacagctgctga >gi568815574r:2686992_2887603|GENSCAN_predicted_peptide_4|204_aa MQSYASAMLSVFNSDDYSPAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRV KRPMNAFIVWSRDQRRKMALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMH REKYPNYKYRPRRKAKMLPKNCSLLPADPASVLCSEVQLDNRLYRDDCTKATHSRMEHQL GHLPPINAASSPQQRDRYSHWTKL >gi568815574r:2686992_2887603|GENSCAN_predicted_CDS_4|615_bp atgcaatcatatgcttctgctatgttaagcgtattcaacagcgatgattacagtccagct gtgcaagagaatattcccgctctccggagaagctcttccttcctttgcactgaaagctgt aactctaagtatcagtgtgaaacgggagaaaacagtaaaggcaacgtccaggatagagtg aagcgacccatgaacgcattcatcgtgtggtctcgcgatcagaggcgcaagatggctcta gagaatcccagaatgcgaaactcagagatcagcaagcagctgggataccagtggaaaatg cttactgaagccgaaaaatggccattcttccaggaggcacagaaattacaggccatgcac agagagaaatacccgaattataagtatcgacctcgtcggaaggcgaagatgctgccgaag aattgcagtttgcttcccgcagatcccgcttcggtactctgcagcgaagtgcaactggac aacaggttgtacagggatgactgtacgaaagccacacactcaagaatggagcaccagcta ggccacttaccgcccatcaacgcagccagctcaccgcagcaacgggaccgctacagccac tggacaaagctgtag >gi568815574r:2686992_2887603|GENSCAN_predicted_peptide_5|94_aa MERYSVQLRPATLQDAAPATLHLIPCGPPRLRGDEVAVPSGLVGYVMVTEQEEVSMGKPD PWRGSGSDNQEEEPLERDFDRFLGTTTSFSRFTL >gi568815574r:2686992_2887603|GENSCAN_predicted_CDS_5|285_bp atggagaggtacagcgtccagttgcgcccagccacactgcaggacgccgcccccgccaca ctacacctgataccctgcggcccaccacgtctacggggcgacgaggtggcagtgccgtcc ggcctcgtgggatacgtgatggtgactgaacaggaggaggtgtcgatggggaagccagac ccctggcggggttccgggagtgacaaccaagaagaggaacctctggagcgggactttgac cgcttcctcggaaccactaccagcttcagccgcttcaccctgtag >gi568815574r:2686992_2887603|GENSCAN_predicted_peptide_6|368_aa MDKFLDTYTLPRLNQEEVESLNRPITGSEIEALINSLPTKTSPELDGFTAEFYQWYKEEL IPLLLKLFQSAEKEGILPNSFYKASIILIPKPGRVTIKKENFRPISLVNIDAGILKKILA NQIQQHIQKLIHHDQVIYRFNAIPMKLPMAFFTELEKTTLNSTWNQKRACIAKTILSKKN KAGGIMLPDFKLYYKATITKTAWYWYQNREIDQWNRTEASEIIPHIYNHLIFEKPEKNKK WGKDSLFNKWCWETWLAICRKLKLPPFLTPYTKINSRWIKDLNVRPKTIKTVEENKGNTT QDIGMGKDFMSKTRKAMATKAKIDKWDLIKLKSFCTAKETTIGVNKLATEWEKIFGIYPS DKGLISRI >gi568815574r:2686992_2887603|GENSCAN_predicted_CDS_6|1107_bp atggacaaattcctggacacctacactctcccaagactaaaccaggaagaagttgaatcc ctgaatagaccaataacaggctctgaaattgaggcattaattaatagcctaccaaccaaa acaagtccagaactagatggattcacagctgaattctaccagtggtacaaagaggagctg ataccattacttctgaaactattccaatcagcagaaaaagagggaatcctccctaattca ttttacaaggccagcatcatcctgataccaaagcctggcagagtcacaataaaaaaagag aattttagaccaatatccctggtgaacattgatgcaggaatcctcaagaaaatactggca aaccaaatccagcagcacatccaaaagcttatccatcatgatcaggtaatttatagattc aatgccatccccatgaagctaccaatggctttcttcacagaattggaaaaaactacttta aactccacatggaaccaaaaaagagcctgcattgccaagacaatcctaagcaagaagaac aaagctggaggcatcatgctgcctgacttcaaactatactacaaggctacaataaccaaa acagcatggtactggtaccaaaacagagagatagaccaatggaacagaacagaggcctca gaaataataccacacatctacaaccatctgatctttgagaaacctgagaaaaacaagaaa tggggaaaggattccctatttaataaatggtgctgggaaacctggctagccatatgtaga aagctgaaactgcctcccttccttacaccttatacaaaaattaattcaagatggattaaa gacttaaatgttagacctaaaaccataaaaactgtagaagaaaacaaaggcaataccact caagacataggcatgggcaaagacttcatgagtaaaacacgaaaagcaatggcaacaaaa gccaaaatagacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaact accatcggagtgaacaagctagctacagaatgggagaaaatttttggaatctacccatct gacaaagggctaatatccagaatctga >gi568815574r:2686992_2887603|GENSCAN_predicted_peptide_7|92_aa MEDCSSLPVMEQSCTENDFDELTEVGFRRLVITNFSELKEDIRTHRKEAKNLEKRLDKWL TRTNSVENSLNDLKELKTMARKICDTCTSFSS >gi568815574r:2686992_2887603|GENSCAN_predicted_CDS_7|279_bp atggaggattgcagctccttaccagtgatggaacaaagctgcacagagaatgactttgat gagttgacagaagtaggcttcagaaggttagtaataacaaacttctctgagctaaaggag gatattcgaacccatcgcaaggaagctaaaaaccttgaaaaaagattagacaaatggcta actagaacaaacagtgtagagaatagcttaaatgacctgaaggagctaaaaaccatggca cgaaaaatatgtgacacatgcacaagcttcagtagctga >gi568815574r:2686992_2887603|GENSCAN_predicted_peptide_8|329_aa MGRNQRKKAENSEKQNASSPPKEHNSSSSREEKWVENEFDELTEVGFRRLWLHRKRTDSL PSQSFAMARGPKKHLKRVAAPKHWMLDKLTGVFAPRPSTGPHKLRECLPLIVFLRNRLKY ALTGDEVKKICMQRFIKIDGKVRVDVTYPAGFMDVISIEKTGEHFRLVYDTKGRFAVHRI TVEEAKYKLCKVRKITVGVKGIPHLVTHDARTIRYPDPVIKVNDTVQIDLGTGKIINFIK FDTGNLCMVIGGANLGRVGVITNRERHPGSFDVVHVKDANGNSFATRLSNIFVIGNGNKP WISLPRGKGIRLTVAEERDKRLATKQSSG >gi568815574r:2686992_2887603|GENSCAN_predicted_CDS_8|990_bp atggggagaaaccagcgcaaaaaggctgaaaattccgaaaagcagaatgcctcttctcct ccaaaggaacacaactcctcatcctcaagggaagaaaaatgggtggagaatgagtttgat gaattgacagaagtaggtttcagaagactttggttgcaccggaaaagaacagattctctt ccgtcgcagagtttcgccatggcccggggccccaagaagcacttaaagcgtgttgcagcg ccgaagcattggatgcttgacaaactaacgggtgtatttgcacctcgtccatcgacaggt ccccacaagctgagggaatgtcttcctctgatcgtcttcctcaggaatagactcaagtat gcgttgactggagatgaggtaaagaagatatgtatgcaacgtttcatcaaaattgatggc aaggttcgagtggatgtcacataccctgctggattcatggatgtcatcagcatcgagaag acaggtgaacatttccgcctggtctatgacaccaagggccgttttgctgttcaccgcatc acagtggaagaggcaaagtacaagttgtgcaaagtgaggaagattactgtgggagtgaag ggaatccctcacctggtgactcatgatgctcgaaccatccgctacccagatcctgtcatc aaggtgaacgatactgtgcagattgatttagggactggcaagataatcaactttatcaaa tttgatacaggcaatttgtgtatggtgattggtggagccaacctcggtcgtgttggtgtg atcaccaacagggaaagacatcctggttcttttgatgtggtgcatgtgaaggatgccaat ggcaacagctttgccacgaggctttccaacatttttgtcattggcaatggcaataaacct tggatttccctgcccaggggaaagggcattcgacttactgttgctgaagagagagataag aggctggccaccaaacagagcagtggctaa >gi568815574r:2686992_2887603|GENSCAN_predicted_peptide_9|116_aa MTAGDHYHQDAETGSGPECQATLIFIGYKTKEQGKEYEPSPMIGSGTCQRRALGPQNNLP DVTPAPNTISALCNLGDLGRHPSSPRIRLSAPTALSWLPEPAPILAEEWSCPMGLL >gi568815574r:2686992_2887603|GENSCAN_predicted_CDS_9|351_bp atgacagcaggggaccactaccaccaagatgcggagactggtagtggccccgaatgccag gctacactgatatttattggatacaagacaaaggagcagggtaaggagtatgagccatct ccaatgatagggtcgggaacctgtcagaggcgggcactggggccacagaacaacttgcca gatgtcacccctgctcccaataccatctctgcactctgcaaccttggggacctgggaagg cacccctcgtcccccaggatcaggctgtctgctcccactgccctctcctggctcccagag cctgctccaatcctggctgaggagtggagctgccccatgggtctcctctga