GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:58:25 Sequence gi568815597r:52672197_52872648 : 200452 bp : 43.18% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2433 2490 58 2 1 114 115 -30 0.764 3.86 1.02 Intr + 4122 4190 69 1 0 83 106 -1 0.461 0.35 1.03 Term + 4265 4434 170 0 2 67 46 68 0.396 -1.46 1.04 PlyA + 4936 4941 6 1.05 2.04 PlyA - 5691 5686 6 1.05 2.03 Term - 15972 15524 449 1 2 122 35 369 0.840 30.38 2.02 Intr - 20671 20531 141 2 0 98 116 220 0.989 26.12 2.01 Init - 26130 26025 106 0 1 102 101 196 0.764 22.68 2.00 Prom - 41355 41316 40 -3.86 3.03 PlyA - 43213 43208 6 1.05 3.02 Term - 55092 54987 106 2 1 67 49 166 0.800 8.48 3.01 Init - 84416 84319 98 2 2 77 113 43 0.681 5.48 3.00 Prom - 88887 88848 40 -4.96 4.00 Prom + 92271 92310 40 -4.96 4.01 Init + 93494 93542 49 1 1 65 56 30 0.231 -3.38 4.02 Intr + 98824 99578 755 2 2 116 72 551 0.028 47.48 4.03 Intr + 107657 107797 141 1 0 62 116 73 0.218 8.05 4.04 Intr + 112681 112857 177 0 0 55 115 64 0.135 5.92 4.05 Intr + 124538 124588 51 1 0 91 110 32 0.961 4.80 4.06 Intr + 129623 129784 162 1 0 98 50 33 0.585 0.67 4.07 Intr + 129896 129943 48 1 0 98 91 5 0.593 0.68 4.08 Intr + 141340 141481 142 0 1 121 51 55 0.596 5.03 4.09 Term + 149243 149433 191 0 2 114 44 87 0.555 4.51 4.10 PlyA + 151170 151175 6 1.05 5.02 PlyA - 151293 151288 6 1.05 5.01 Sngl - 171177 170467 711 0 0 51 41 181 0.512 5.95 5.00 Prom - 183564 183525 40 -1.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 98824 99590 767 2 2 116 42 541 0.920 45.88 S.002 Sngl - 100452 100108 345 0 0 101 37 187 0.912 10.94 S.003 Intr + 124096 124195 100 1 1 131 77 31 0.879 6.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:52672197_52872648|GENSCAN_predicted_peptide_1|98_aa MPSLSSIFSMTCLAQLLAHEAGSLAEGAQAPTQLCRIFTVTLGCSPLQEAPCTQWAVNEC CCYYTTPGQLQGQSLMPLMSGVHWGLLEPDRTPSSWGE >gi568815597r:52672197_52872648|GENSCAN_predicted_CDS_1|297_bp atgcccagcctatcctccatattttcaatgacgtgtttagcacagctccttgcacacgaa gcagggtcactggcagagggagcccaggctccaacccagctctgccgcatattcactgtg actttgggatgttccccacttcaggaggcgccctgcacacagtgggcagttaatgaatgt tgctgttattacaccaccccagggcagcttcaaggtcaaagcttgatgccactgatgtct ggagtccactggggcctcctggaacctgatcggacaccgtcatcttggggagagtag >gi568815597r:52672197_52872648|GENSCAN_predicted_peptide_2|231_aa MAGMVDFQDEEQVKSFLENMEVECNYHCYHEKDPDGCYRLVDYLEGIRKNFDEAAKVLKF NCEENQHSDSCYKLGAYYVTGKGGLTQDLKAAARCFLMACEKPGKKSIAACHNVGLLAHD GQVNEDGQPDLGKARDYYTRACDGGYTSSCFNLSAMFLQGAPGFPKDMDLACKYSMKACD LGHIWACANASRMYKLGDGVDKDEAKAEVLKNRAQQLHKEQQKGVQPLTFG >gi568815597r:52672197_52872648|GENSCAN_predicted_CDS_2|696_bp atggccggcatggtggacttccaggatgaggagcaggtcaagtcctttttggagaacatg gaggtggagtgcaactaccactgctaccacgagaaggacccggacggttgctatcggctg gtggactatttggaagggatccggaagaattttgatgaggctgccaaggtgttgaagttt aactgtgaagagaaccagcacagtgatagctgctacaaactgggggcctactatgtgact ggaaaaggtggtctgacccaggacctgaaagctgccgccaggtgctttttgatggcgtgt gagaagcctggaaagaagtcaatagcagcatgtcacaacgttggcctcctggcacatgat ggacaggttaatgaggatggccagcctgacttgggaaaggccagggactactacacaagg gcctgtgatggtggctatacttccagttgcttcaacctcagtgccatgttcctgcagggt gccccaggctttcccaaggacatggacctggcatgtaaatactccatgaaagcctgtgac ctgggtcatatctgggcctgtgccaatgccagtcgcatgtacaagctgggggatggtgtt gataaggatgaggccaaggccgaggtgctaaaaaatcgagcccagcagctacacaaagaa cagcagaaaggtgtccaacccttaacatttgggtaa >gi568815597r:52672197_52872648|GENSCAN_predicted_peptide_3|67_aa MVRSSRSATSCGNTPGSCRHNVPSCLAEQNFSSSREATVLKLFLLRPLKPVEGNGDSCDG EPSEDIR >gi568815597r:52672197_52872648|GENSCAN_predicted_CDS_3|204_bp atggtccgaagcagtcgatcagccacctcctgtgggaatactccaggttcctgcagacac aatgttccatcttgtctggctgaacagaacttctcaagctcccgggaagcaacagtcctc aaattgttcctgcttagacccctgaagcccgtagaaggaaatggggacagctgcgacggc gaaccgagcgaggacatccgataa >gi568815597r:52672197_52872648|GENSCAN_predicted_peptide_4|571_aa MESCNVAQAGLKLLASGLLNDGTVGIFRGNQMRLKRACIRKAKISAVAFRKAFCHHKLVE LDATGVNADITITDIISGLGSNKWIQQNLQCLVLNSLTLSLEDPYERCFSRLSGLRALSI TNVLFYNEDLAEVASLPRLESLDISNTSITDITALLACKDRLKSLTMHHLKCLKMTTTQI LDVVRELKHLNHLDISDDKQFTSDIALRLLEQKDILPNLVSLDVSGRKHVTDKAVEAFIQ QRPSMQFVGLLATDAGYSEFLTGEGHLKVSGEANETQIAEALKRYSERAFFVREALFHLF SLTHVMEKTKPEILKLVVTGMRNHPMNLPVQLAASACVFNLTKQDLAAGMPVRLLADVTH LLLKAMEHFPNHQQLSTEQTAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNL TDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFI DHISSLLHSVEVEVSYFAAGIIAHLISRASRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQ IAVAILDSLEKHIVRHGRPPPCKKQPQARLN >gi568815597r:52672197_52872648|GENSCAN_predicted_CDS_4|1716_bp atggagtcttgcaatgttgcccaggctggtctcaaactcctggcctcgggtctattgaat gatggaactgtgggtatttttaggggcaaccagatgcgcttaaagcgagcctgcattcgc aaagcaaagatctctgctgttgctttccggaaagctttctgccaccacaagttagtggaa cttgatgccacaggtgtgaatgctgatatcacgattacagacattatcagtgggcttggc agtaacaaatggatccagcagaatctccagtgcctggtgctgaattcattaactctctcc ctcgaggatccttacgagcgctgcttcagccggctttctggccttcgagctttaagcatc acgaatgttctcttttacaatgaagacctggctgaagttgcctcattgccaagattagag agcttggatatttctaacacctcaatcacagacatcactgctctactggcctgcaaagac cgactcaagtctctaaccatgcaccacttgaaatgtttaaaaatgacaactacccagata ctggatgtagttcgggaactcaaacatctgaatcatcttgatatctcagatgataaacag tttacatcagacatagctcttcgcttactagaacaaaaagacatcctacctaaccttgtt tctctggatgtttctgggagaaagcacgtgacagataaagccgttgaagcctttatacaa caacgtccaagcatgcaatttgtaggtttgctggctactgatgctggttactctgaattc ctcacaggcgaaggacatttgaaggtgtctggggaagccaatgaaactcagattgcagaa gcactgaagcgttacagtgaacgggcattctttgttcgggaagctctatttcatcttttt agtctgactcatgtgatggaaaaaacaaagccagaaattttaaagcttgtggttactggg atgagaaaccaccctatgaatttgccagtgcaactggctgcaagcgcctgtgtatttaac ttaaccaagcaggatcttgctgcagggatgcctgtccgactcctggctgatgtgacccat ttgctgctcaaagccatggaacattttcccaatcaccagcagctttctacagaacaaact gcacagcttggtactgagctcttcattgtcaggcaacttcttcaaatagtgaagcagaaa accaatcaaaattcagtggacactacattgaaatttactttgagtgcactttggaacctc acagatgaatctccaaccacttgtagacactttattgaaaaccaagggttagaactcttc atgagggttctagagtctttcccaactgagtcatccattcagcagaaagttctaggactt ttgaacaatatagctgaagtacaagaattacattctgaattaatgtggaaagattttata gaccacatcagtagtctcctacacagtgtggaagtggaagtcagttactttgcagctgga attattgcccatttaatatccagagcttcaaggtattgcagcatgctgattgaagaagga ggattgcagcatttatacaacatcaaagatcatgaacatactgatccccatgtccaacag attgctgtggccattctggatagcttagaaaaacacattgtgcgccatgggaggccacct ccctgtaaaaaacagccccaagccagactaaattga >gi568815597r:52672197_52872648|GENSCAN_predicted_peptide_5|236_aa MTAKTPSTTGQAECHALHLRLSSQQPALGRGILRRLAPVSGTGLSPRCLRAQVPRASPGA EKHPPLRGAQAASPDNAPPQRRLRLEGARHLDRSPARATLPSKCLGSVPGDSPRSPELDV APRSASPPAPRCSPVPRSSPGHPSERQEGRCSGACGPGARNEPWQPRRRGVKKRAGVEPE PGSRENQAGGCQDAPTTPARHTELAPRPAGELLKCPARRRARGGRPAADPTPRAAS >gi568815597r:52672197_52872648|GENSCAN_predicted_CDS_5|711_bp atgactgcaaaaacaccgtccactacgggccaagccgagtgccatgccctgcacctacgt ctaagctcacagcagcccgcactggggcgaggaatcctgaggaggctcgccccggtctct gggactgggctctcgccgcgctgcctcagggcgcaagttccacgcgcttcgccgggcgca gaaaagcaccccccgctccgcggcgcccaggccgcctcccccgacaacgcacctccgcag cgacgcctgcgcctggaaggggctcggcatctagaccgcagccccgcgcgggccacactc cccagcaagtgcctcggcagcgtcccaggagactcgccgcgctcgccggagctggacgtg gcgccgaggtccgcgtcgccgcccgcaccccggtgctcacctgtaccacgcagcagccca gggcatccttctgagcgtcaggagggacgatgttccggggcgtgtggcccgggtgcaaga aatgaaccatggcaaccccggcggcggggcgtcaaaaagagagccggcgtcgagccggag cccggatcccgcgagaaccaagcgggcggctgccaggacgcgcccaccacgcctgcgagg cacacggagctagcgccgcgccctgccggcgagcttttgaaatgcccggcgaggcgacgc gcacgcgggggccggcccgccgcagaccccaccccccgggcggcctcctag